nb_aiopool - asyncio 协程并发池

nb_aiopool 是一个轻量级、高性能的 asyncio 协程并发池，专为异步编程场景设计。
nb_aiopool 附赠一套分布式asyncio异步任务队列框架 nb_aio_task ，用来演示如何使用 nb_aiopool 实现asyncio生态的分布式异步任务队列框架，

核心价值：

✅ 背压控制：防止瞬间创建海量 Task，避免内存和 CPU 失控
✅ 简化代码：无需在业务函数中侵入 asyncio.Semaphore
✅ 生产级稳定：经过压测验证，100万并发任务（大字符串入参）内存稳定在 43MB

为什么不用 asyncio.Semaphore？
asyncio.Semaphore 只能控制并发数量，但无法阻止你快速创建100万个 asyncio.Task！

当每个Task携带大字符串参数（如 f"{'task' * 100}_{i}"）和返回值时：

❌ asyncio.Semaphore：100万Task × 1.6KB = 10GB+内存 → 💥 电脑死机
✅ NbAioPool：背压保护，内存稳定在 43MB → ✨ 丝滑流畅

为什么强调背压机制？： 假设你使用 asyncio.create_task(process_message(message)) 消费 redis/rabbitmq/kafka 消息队列，如果消息队列有1亿消息，如果没有背压机制，会迅速掏空消息队列里面的1亿消息到程序内存中，严重的内存泄漏迅速宕机和负载不均衡。

1. 安装
2. 快速开始
3. NbAioPool 是伪需求吗？
4. 核心概念：pool.submit vs pool.run
5. 使用场景
- 5.1 局部变量用法（推荐）
- 5.2 全局变量用法
6. 稳定性对比：吊打 asyncio.Semaphore
7. 与其他方案对比
8. API 参考
9. 最佳实践
10. 常见问题
11. nb_aiopool 和 async-pool-executor 区别
12. 相关链接
13. nb_aio_task - 分布式异步任务队列 🎁
14. 许可证

1. 安装

pip install nb_aiopool

环境要求： Python 3.7+

2. 快速开始

import asyncio
from nb_aiopool import NbAioPool

async def my_task(x):
    """你的业务逻辑"""
    await asyncio.sleep(0.1)
    return x * 2

async def main():
    # 创建并发池：最大并发数 10，队列容量 1000
    async with NbAioPool(max_concurrency=10, max_queue_size=1000) as pool:
        # 方式1: 提交任务，返回 future（不阻塞）
        future = await pool.submit(my_task(5))
        result = await future  # 需要时再等待结果
        print(f"结果: {result}")
        
        # 方式2: 提交并立即等待结果（阻塞当前协程）
        result = await pool.run(my_task(10))
        print(f"结果: {result}")
        
        # 方式3: 批量提交
        futures = [await pool.submit(my_task(i)) for i in range(100)]
        results = await asyncio.gather(*futures)
        print(f"批量结果: {results}")

asyncio.run(main())

3. NbAioPool 是伪需求吗？

3.1🚨 问题：为什么 asyncio 也需要并发池？

很多人认为："协程这么轻量，为什么还需要并发池？直接用 asyncio.Semaphore 不就行了？"

错！大错特错！

3.2 ❌ 反面教材：只用 `asyncio.Semaphore`

import asyncio

async def task_with_semaphore(big_data, task_id, semaphore):
    async with semaphore:  # 只控制并发数量
        await asyncio.sleep(0.1)
        # 返回大字符串，加剧内存占用
        return f"result_{'x' * 200}_{task_id}"

async def bad_example():
    semaphore = asyncio.Semaphore(1000)  # 限制1000并发
    
    # 🔥 灾难：瞬间创建1000万个 Task！
    # 每个Task携带大字符串参数，内存瞬间爆炸
    tasks = [
        asyncio.create_task(
            task_with_semaphore(f"{'task' * 100}_{i}", i, semaphore)
        ) 
        for i in range(10000000)
    ]
    
    # 此时你的电脑：
    # - 内存暴涨到 10GB+（每个Task都有大字符串！）
    # - CPU 100%
    # - 鼠标键盘卡死
    # - 系统崩溃重启
    
    await asyncio.gather(*tasks)

问题分析：

问题	`asyncio.Semaphore`	`NbAioPool`
控制并发数量	✅ 支持	✅ 支持
背压机制	❌ 无法阻止快速创建Task	✅ 队列满时自动阻塞
内存稳定性	❌ 100万Task占用10GB+	✅ 100万任务仅43MB
CPU占用	❌ 100%持续飙升	✅ 稳定在1%
代码侵入性	❌ 需要改业务函数	✅ 无需改业务逻辑

3.3 ✅ 正确做法：使用 `NbAioPool`

import asyncio
from nb_aiopool import NbAioPool

async def clean_task(big_data, task_id):
    """干净的业务逻辑，无需关心并发控制"""
    await asyncio.sleep(0.1)
    # 同样处理大字符串，但内存稳定
    return f"result_{'x' * 200}_{task_id}"

async def good_example():
    async with NbAioPool(max_concurrency=1000, max_queue_size=10000) as pool:
        # ✅ 背压机制：队列满时自动阻塞，不会瞬间创建100万Task
        # 即使每个任务携带大字符串，内存依然稳定
        for i in range(1000000):
            await pool.submit(clean_task(f"{'task' * 100}_{i}", i))
        
        # 电脑状态：
        # - 内存稳定在 43MB（有背压保护！）
        # - CPU 1%
        # - 一切丝滑流畅

asyncio.run(good_example())

3.4 nb_aiopool 吊打分批处理并发协程 (预判了你的质疑)

肯定有人会质疑，没人那么愚蠢按照 bad_example 函数中快速创建 1000万tasks，聪明人都会分批并发

有人会说只有笨瓜才会这样写代码，快速创建1000万个tasks

async def bad_example():
    semaphore = asyncio.Semaphore(1000)  # 限制1000并发
    
    # 🔥 灾难：瞬间创建1000万个 Task！
    # 每个Task携带大字符串参数，内存瞬间爆炸
    tasks = [
        asyncio.create_task(
            task_with_semaphore(f"{'task' * 100}_{i}", i, semaphore)
        ) 
        for i in range(10000000)
    ]
    await asyncio.gather(*tasks)

你会说你会按下面分批

async def safe_batch_processing():
    semaphore = asyncio.Semaphore(1000)  # 限制并发数量
    batch_size = 1000  # 每批处理1000个任务
    total_tasks = 10000000  # 总共1000万个任务
    
    for batch_start in range(0, total_tasks, batch_size):
        batch_end = min(batch_start + batch_size, total_tasks)
        print(f"处理批次: {batch_start} 到 {batch_end-1}")
        
        # 仅创建当前批次的任务
        batch_tasks = [
            asyncio.create_task(
                task_with_semaphore(f"{'task' * 100}_{i}", i, semaphore)
            )
            for i in range(batch_start, batch_end)
        ]
        
        # 等待当前批次完成
        batch_results = await asyncio.gather(*batch_tasks, return_exceptions=True)

        # 可选：批次间短暂休眠，让系统资源回收
        await asyncio.sleep(0.01)

分批的缺点：

代码复杂度高：需要手动管理批次循环、边界计算和批次间协调，代码冗长且容易出错。
动态负载不均衡：每批固定数量的任务，无法根据系统实时负载动态调整，导致资源浪费或处理能力不足

举例例如1000个任务作为一批次，如果999个任务0.1秒完成，但有1个任务卡了300秒，在绝大部分99%的时间里，服务的asyncio协程并发降低到1了，严重浪费 asyncio 并发高的好处。

分批处理和nb_aiopool示意图 分批处理： [■■■■■■■■■■] → 等待300秒 → [■■■■■■■■■■] → ... ↑ 1个慢任务阻塞全部

NbAioPool： [■□□□□□□□□□] → [■■■■■□□□□□] → 持续高效处理快任务完成后立即释放槽位

**小结：**相比之下，NbAioPool 提供了自动化的背压控制和持续的任务流处理，无需手动管理批次，代码更简洁且性能更稳定。

3.5 如果你说不分批执行，使用生产者->asyncio.Queue->消费者模式来实现 (再次预判了你的质疑)

那你说的刚好就是 nb_aiopool 了， nb_aiopool 就是 生产者->asyncio.Queue->消费者 实现的封装。

nb_aiopool 就是减少了需要频繁临时手写 定义queue + produce函数 + consume函数

4. 核心概念：pool.submit vs pool.run

4.1 `pool.submit(coro)` - 提交任务，返回 Future

特点：

✅ 非阻塞：立即返回 asyncio.Future 对象
✅ 适合批量提交：可以快速提交大量任务
⚠️ 需要手动等待：稍后通过 await future 获取结果

使用场景： 需要并发执行多个任务，最后统一收集结果

async def example_submit():
    async with NbAioPool(max_concurrency=10) as pool:
        # 批量提交100个任务
        futures = [await pool.submit(my_task(i)) for i in range(100)]
        
        # 可以先做其他事情
        print("任务已提交，现在可以做别的事")
        
        # 需要结果时再等待
    results = await asyncio.gather(*futures)
        print(f"结果: {results}")

4.2 `pool.run(coro)` - 提交任务并等待结果

特点：

✅ 一步到位：直接返回任务执行结果
✅ 代码简洁：相当于 await (await pool.submit(coro))
⚠️ 阻塞当前协程：会等待任务完成

使用场景： 需要立即使用任务结果

async def example_run():
    async with NbAioPool(max_concurrency=10) as pool:
        # 逐个执行并获取结果
        for i in range(100):
            result = await pool.run(my_task(i))
            print(f"第 {i} 个任务结果: {result}")

4.3 `pool.batch_submit(coros)` - 批量提交任务 🆕

特点：

✅ 批量操作：一次性提交多个协程，无需循环
✅ 代码简洁：比列表推导式更清晰
✅ 返回 Future 列表：可以灵活控制等待时机

使用场景： 需要批量提交大量任务，后续统一收集结果

async def example_batch_submit():
    async with NbAioPool(max_concurrency=10) as pool:
        # 创建100个协程对象
        coros = [my_task(i) for i in range(100)]
        
        # 批量提交，返回 future 列表
        futures = await pool.batch_submit(coros)
        
        # 可以先做其他事情
        print(f"已批量提交 {len(futures)} 个任务")
        
        # 统一等待所有任务完成
        results = await asyncio.gather(*futures)
        print(f"结果: {results}")

对比传统方式：

# 传统方式：列表推导式 + submit
futures = [await pool.submit(my_task(i)) for i in range(100)]

# 批量方式：更简洁 ✅
coros = [my_task(i) for i in range(100)]
futures = await pool.batch_submit(coros)

4.4 `pool.batch_run(coros)` - 批量提交并等待结果 🆕

特点：

✅ 一步到位：批量提交并直接返回所有结果
✅ 极简代码：相当于 await pool.batch_submit(coros) + await asyncio.gather(*futures)
⚠️ 阻塞当前协程：会等待所有任务完成

使用场景： 批量执行任务并立即需要所有结果

async def example_batch_run():
    async with NbAioPool(max_concurrency=10) as pool:
        # 创建100个协程对象
        coros = [my_task(i) for i in range(100)]
        
        # 批量提交并等待所有结果（一步到位）
        results = await pool.batch_run(coros)
        print(f"结果: {results}")
        print(f"共完成 {len(results)} 个任务")

对比传统方式：

# 传统方式：3行代码
coros = [my_task(i) for i in range(100)]
futures = await pool.batch_submit(coros)
results = await asyncio.gather(*futures)

# 批量方式：2行代码 ✅
coros = [my_task(i) for i in range(100)]
results = await pool.batch_run(coros)

4.5 四种方法对比总结

方法	提交方式	返回值	是否等待	适用场景
`submit(coro)`	单个	Future	❌	逐个提交，灵活控制
`run(coro)`	单个	结果	✅	逐个执行，立即使用结果
`batch_submit(coros)`	批量	Future列表	❌	批量提交，统一收集
`batch_run(coros)`	批量	结果列表	✅	批量执行，立即获取结果

代码示例对比：

# 方式1: submit - 逐个提交，手动等待
future1 = await pool.submit(my_task(1))
future2 = await pool.submit(my_task(2))
result1 = await future1
result2 = await future2

# 方式2: run - 逐个执行，立即获取结果
result1 = await pool.run(my_task(1))
result2 = await pool.run(my_task(2))

# 方式3: batch_submit - 批量提交，手动等待
coros = [my_task(1), my_task(2)]
futures = await pool.batch_submit(coros)
results = await asyncio.gather(*futures)

# 方式4: batch_run - 批量执行，一步到位 ✅ 最简洁
coros = [my_task(1), my_task(2)]
results = await pool.batch_run(coros)

选择建议：

少量任务，逐个执行 → 用 run
少量任务，并发执行 → 用 submit + 手动 gather
大量任务，需要灵活控制 → 用 batch_submit （我说的大量也不能是那种几百万个coro一次batch_submit，几百个次batch_submit还行，因为coros列表和futures列表太大，内存太大；如果真的是几百万个coro需要运行，那就使用for循环逐个submit提交，使用发后不管的模式，不gather futures的模式）
大量任务，一步到位 → 用 batch_run ⭐ 最推荐

5. 使用场景

5.1 局部变量用法（推荐）

适用于 asyncio.run() 启动的应用。

5.1.1 使用 `async with`（最佳实践）

import asyncio
from nb_aiopool import NbAioPool

async def sample_task(x: int):
    await asyncio.sleep(0.1)
    print(x)
    return x * 2

async def main():
    # 推荐：使用 async with，自动处理资源释放
    async with NbAioPool(max_concurrency=10, max_queue_size=1000) as pool:
        # 方式1: submit 逐个提交
        futures = [await pool.submit(sample_task(i)) for i in range(100)]
        results = await asyncio.gather(*futures)
        print("方式1结果:", results)
        
        # 方式2: run 逐个执行
        for i in range(10):
            result = await pool.run(sample_task(i))
            print(f"任务 {i} 结果: {result}")
        
        # 方式3: batch_submit 批量提交 🆕
        coros = [sample_task(i) for i in range(100)]
        futures = await pool.batch_submit(coros)
        results = await asyncio.gather(*futures)
        print("方式3结果:", results)
        
        # 方式4: batch_run 批量执行（最简洁）🆕 ⭐
        coros = [sample_task(i) for i in range(100)]
        results = await pool.batch_run(coros)
        print("方式4结果:", results)
    
    # async with 退出时自动调用 pool.shutdown(wait=True)

asyncio.run(main())

5.1.2 手动管理生命周期（不推荐手动await pool.shutdown）

async def main():
    pool = NbAioPool(max_concurrency=10, max_queue_size=1000)
    
    # 提交任务
    futures = [await pool.submit(sample_task(i)) for i in range(100)]
    results = await asyncio.gather(*futures)
    print("结果:", results)
    
    # ⚠️ 如果你不写await asyncio.gather(*futures)，必须手动调用 shutdown，否则任务会丢失！
    await pool.shutdown(wait=True)

asyncio.run(main())

5.2 全局变量用法

适用于需要跨模块、跨函数共享 pool 的场景，这要求代码最最末尾的那一行必须是 loop.run_forever() 的应用。

完整示例： 参考 tests/t_global_nb_aiopool.py

import asyncio
from nb_aiopool import NbAioPool

# 全局 pool，可在多个模块、函数中共享
aiopool = NbAioPool(max_concurrency=3, max_queue_size=1000)

async def fun_level1(x):
    """第一层业务逻辑"""
    await asyncio.sleep(1)
    print(f"Level1: {x}")
    # 在任务内部可以继续提交子任务
    await aiopool.submit(fun_level2(x*2, x*3))

async def fun_level2(a, b):
    """第二层业务逻辑"""
    await asyncio.sleep(2)
    print(f"Level2: {a}, {b}")

async def main():
    # 批量提交任务
    for i in range(30):
        await aiopool.submit(fun_level1(i))

if __name__ == "__main__":
    loop = asyncio.get_event_loop()
    loop.create_task(main())
    
    # ⚠️ 关键：使用 全局变量pool 时必须加这行，否则任务会因为程序提前退出而丢失
    loop.run_forever()

注意事项：

全局 pool 初始化： 在模块顶层创建，确保所有函数可访问
程序需要长期运行： 使用 loop.run_forever() 时，任务会持续执行

6. 稳定性对比：吊打 asyncio.Semaphore

6.1 压测场景

任务： 执行 100 万个简单的 asyncio.sleep(5) 任务，并发数 1000

完整代码： 参考 tests/t_press_web/nopool_test_sleep.py

6.2 方案1：只用 `asyncio.Semaphore`（灾难版）

async def aio_task_use_semaphore(big_input_data, n, semaphore):
    async with semaphore:
        await asyncio.sleep(5)
        print(n)
        # 返回大字符串，进一步加剧内存占用
        return f"result_{'x' * 200}_{n}_{big_input_data[:50]}"

async def no_pool_main():
    print("正在创建100万个任务...")
    semaphore = asyncio.Semaphore(1000)
    
    # 🔥 灾难：瞬间创建100万个Task
    # 每个Task都有大字符串入参和返回值，内存爆炸式增长！
    tasks = [
        asyncio.create_task(
            aio_task_use_semaphore(f"{'task' * 100}_{i}", i, semaphore)
        ) 
        for i in range(1000000)
    ]
    
    print("开始执行任务...")
    await asyncio.gather(*tasks)
    print("执行完成")

asyncio.run(no_pool_main())

资源占用（实测）：

时间	内存	CPU	状态
0s	50MB	100%	创建Task中
10s	3GB	100%	内存持续上涨
30s	10GB+	100%	系统卡死
45s	💥	💥	电脑死机重启

6.3 方案2：使用 `NbAioPool`（稳如老狗版）

async def aio_task(big_input_data, n):
    """干净的业务逻辑，无需 semaphore"""
    await asyncio.sleep(5)
    print(n)
    # 同样返回大字符串，但有背压保护，内存依然稳定
    return f"result_{'x' * 200}_{n}_{big_input_data[:50]}"

async def pool_main():
    async with NbAioPool(max_concurrency=1000, max_queue_size=10000) as pool:
        for i in range(1000000):
            # ✅ 有背压：队列满时自动阻塞，不会瞬间创建100万Task
            # 即使每个任务都有大字符串入参和返回值，内存依然稳定！
            await pool.submit(aio_task(f"{'task' * 100}_{i}", i))

asyncio.run(pool_main())

资源占用（实测）：

时间	内存	CPU	状态
0s	43MB	1%	稳定运行
60s	43MB	1%	稳定运行
300s	43MB	1%	稳定运行
1小时+	43MB	1%	持续稳定 ✅

6.4 对比总结

┌─────────────────────────────────────────────────────────┐
│         asyncio.Semaphore          vs    NbAioPool      │
├─────────────────────────────────────────────────────────┤
│ 内存占用：    10GB+                vs       43MB        │
│ CPU占用：     100%持续             vs       1%          │
│ 稳定性：      30秒内死机           vs       持续稳定    │
│ 背压机制：    ❌ 无                vs       ✅ 有       │
│ 代码侵入：    ❌ 需改业务函数      vs       ✅ 无侵入  │
└─────────────────────────────────────────────────────────┘

结论：

为什么内存差距这么大？
因为 asyncio.Semaphore 瞬间创建100万个Task对象，每个Task都保存着：

大字符串入参：f"{'task' * 100}_{i}" ≈ 400 字节

大字符串返回值：f"result_{'x' * 200}_{task_id}" ≈ 200 字节

Task对象本身的开销：≈ 1KB

100万个Task × 1.6KB ≈ 1.6GB，再加上Python对象管理开销，轻松超过10GB！

而 NbAioPool 有背压机制，同时只保持 max_concurrency + max_queue_size 个任务在内存中，即使100万任务，内存也稳定在 43MB！

试想一下： 如果你的异步函数入参和返回值是更大的对象（如几KB的字典、图片数据），
并且需要创建 1000 万个 tasks，不使用 NbAioPool，
你需要购买阿里云 10TB 内存 的服务器才能顶得住！

7. 与其他方案对比

7.1 vs `asyncio.Semaphore`

特性	`asyncio.Semaphore`	`NbAioPool`
并发控制	✅	✅
背压机制	❌	✅
内存稳定	❌	✅
代码侵入	❌ 需改业务函数	✅ 无侵入
使用复杂度	中	低

7.2 vs `asyncio.TaskGroup` (Python 3.11+)

特性	`asyncio.TaskGroup`	`NbAioPool`
Python 版本要求	3.11+	3.7+
并发数控制	❌	✅
背压机制	❌	✅
全局共享	❌ 不适合	✅ 支持
异常处理	✅ 优秀	✅

使用建议：

TaskGroup：适合局部任务组的异常管理
NbAioPool：适合需要并发控制和背压的场景

7.3 vs `async_pool_executor`

完全不同的使用场景！

库	环境	用途
`async_pool_executor`	同步环境	在同步代码中调用异步函数
`NbAioPool`	异步环境	在异步代码中管理并发

举例说明：

# async_pool_executor: 同步代码调用异步函数
from async_pool_executor import AsyncPoolExecutor

executor = AsyncPoolExecutor()
# 在同步函数中调用异步函数
executor.submit(async_func, arg1, arg2)
# NbAioPool: 异步代码管理并发
from nb_aiopool import NbAioPool

async def main():
    async with NbAioPool(max_concurrency=100) as pool:
        # 在异步环境中控制并发
        await pool.submit(async_func(arg1, arg2))

8. API 参考

8.1 `NbAioPool`

class NbAioPool:
    def __init__(self, max_concurrency: int = 100, max_queue_size: int = 1000):
        """
        创建并发池
        
        参数:
            max_concurrency: 最大并发任务数（同时运行的worker数量）
            max_queue_size: 任务队列最大容量（背压控制）
        """

8.2 主要方法

async def submit(self, coro: Coroutine, block: bool = True) -> asyncio.Future:
    """
    提交任务，返回 Future 对象
    
    参数:
        coro: 协程对象（注意：是协程对象，不是函数！）
        block: 队列满时是否阻塞等待（True: 等待，False: 立即抛异常）
    
    返回:
        asyncio.Future 对象
    
    示例:
        future = await pool.submit(my_task(10))
        result = await future
    """

async def run(self, coro: Coroutine, block: bool = True) -> Any:
    """
    提交任务并等待结果（等价于 await pool.submit(coro)）
    
    参数:
        coro: 协程对象
        block: 队列满时是否阻塞等待
    
    返回:
        任务执行结果
    
    示例:
        result = await pool.run(my_task(10))
    """

async def batch_submit(self, coros: List[Coroutine], block: bool = True) -> List[asyncio.Future]:
    """
    批量提交任务，返回 Future 列表 🆕
    
    参数:
        coros: 协程对象列表
        block: 队列满时是否阻塞等待
    
    返回:
        asyncio.Future 对象列表
    
    示例:
        coros = [my_task(i) for i in range(100)]
        futures = await pool.batch_submit(coros)
        results = await asyncio.gather(*futures)
    """

async def batch_run(self, coros: List[Coroutine], block: bool = True) -> List[Any]:
    """
    批量提交任务并等待所有结果 🆕
    
    参数:
        coros: 协程对象列表
        block: 队列满时是否阻塞等待
    
    返回:
        所有任务的执行结果列表
    
    示例:
        coros = [my_task(i) for i in range(100)]
        results = await pool.batch_run(coros)  # 一步到位
    """

async def shutdown(self, wait: bool = True):
    """
    关闭池
    
    参数:
        wait: 是否等待所有任务完成
    """

8.3 上下文管理器

async with NbAioPool(max_concurrency=10) as pool:
    await pool.submit(my_task(1))
    # 退出时自动调用 shutdown(wait=True)

9. 最佳实践

✅ 推荐做法

# 1. 使用 async with 管理生命周期
async with NbAioPool(max_concurrency=100) as pool:
    await pool.submit(task())

# 2. 根据场景选择并发数
# - CPU密集型（少）: max_concurrency = CPU核心数 * 2
# - IO密集型（多）: max_concurrency = 100 ~ 1000
# - 网络爬虫（超多）: max_concurrency = 1000 ~ 10000

# 3. 队列大小设置
# max_queue_size 应该 >= max_concurrency * 10

# 4. 批量任务优先使用 batch_run（最简洁）🆕 ⭐
coros = [task(i) for i in range(1000)]
results = await pool.batch_run(coros)  # 一步到位

# 或使用 batch_submit（需要灵活控制时）
coros = [task(i) for i in range(1000)]
futures = await pool.batch_submit(coros)
results = await asyncio.gather(*futures)

# 传统方式（不推荐，代码冗长）
futures = [await pool.submit(task(i)) for i in range(1000)]
results = await asyncio.gather(*futures)

10. 常见问题

Q1: `batch_submit` 和 `batch_run` 有什么区别？

# batch_submit: 批量提交，返回 future 列表，需要手动等待
coros = [my_task(i) for i in range(100)]
futures = await pool.batch_submit(coros)
# 可以做其他事情...
results = await asyncio.gather(*futures)

# batch_run: 批量提交并自动等待，一步到位 ⭐ 推荐
coros = [my_task(i) for i in range(100)]
results = await pool.batch_run(coros)  # 直接得到结果

建议：

大多数场景用 batch_run，代码最简洁
需要在等待前做其他操作时用 batch_submit

Q2: `async with` 和手动 `shutdown` 有什么区别？

# 方式1: async with（推荐）
async with NbAioPool(max_concurrency=10) as pool:
    await pool.submit(task())
# 自动调用 shutdown(wait=True)

# 方式2: 手动管理
pool = NbAioPool(max_concurrency=10)
await pool.submit(task())
await pool.shutdown(wait=True)  # 必须手动调用！

建议： 优先使用 async with，避免忘记 shutdown 导致任务丢失.

11. nb_aiopool 和 async-pool-executor 区别

nb_aiopool 的定位与 async-pool-executor (例如这个库或 funboost 内置的实现) 完全不同，它们解决了不同场景下的问题，不存在竞争关系。

nb_aiopool
和以前的这两个已开发的 async_pool_executor 作用不同。

https://github.com/ydf0509/async_pool_executor https://github.com/ydf0509/funboost/blob/master/funboost/concurrent_pool/async_pool_executor.py

async_pool_executor 是在同步环境中去 pool.submit 任务给一个loop并发运行多个coro ，
当一个框架需要兼容调度同步和异步并发时候用这，
例如funboost总体生态语法是同步的，需要依靠使用async_pool_executor 实现 asyncio 模式并发。

nb_aiopool 是在异步环境中去 await pool.submit ，纯脆为了异步生态而生。

简单来说：

async_pool_executor：是一座桥梁**，连接了同步世界和异步世界。**
- 它的工作是在一个同步的代码环境中，能够方便地调用并执行异步的函数（协程），而不用把整个应用都变成 async/await。
nb_aiopool：是一个交通管制系统**，它完全生活在异步世界内部。**
- 它的工作是在一个已经存在的异步代码环境中，去管理和限制并发任务的流量，防止交通堵塞（资源耗尽）。

12. 相关链接

GitHub: https://github.com/ydf0509/nb_aiopool
PyPI: https://pypi.org/project/nb-aiopool/
作者: ydf0509

13. nb_aiopool 贡献赠送一套分布式异步函数执行框架 nb_aio_task

nb_aio_task 框架是一个基于 Redis + NbAioPool 的简易分布式asyncio生态的异步任务队列，类似 RQ/Celery/Funboost，但更简单，更纯粹(只为异步而生)。
nb_aio_task 基于 异步redis包(aioredis/redis[asyncio]) 作为 broker，使用 NbAioPool 作为并发控制和背压机制。
nb_aio_task 的教程见 nb_aiopool/contrib/README.md ,例子见 nb_aiopool/contrib/example.py
funboost + 支持所有并发模式(包括asyncio) 是更强力的万能函数调度框架，nb_aio_task 是用来演示如何使用 nb_aiopool 实现asyncio生态的分布式异步任务队列框架，证明nb_aiopool 是可以作为任何asyncio生态框架的基石存在。
nb_aio_task 的asyncio 比 funboost 的asyncio 更纯粹,天然更容易自动使用用户的 loop, 因为 nb_aio_task 是在主线程运行的，自动和用户自己其他asyncio的代码共享一个loop。 funboost 是子线程启动消费的，不是自动和用户的主线程共用一个loop,使用上要指定 specify_async_loop，对不懂loop是什么的pythoner来说略难。
nb_aio_task 使用例子

import asyncio
from nb_aiopool.contrib.nb_aio_task import aio_task, batch_consume

@aio_task(queue_name="my_queue1", max_concurrency=100)
async def my_fun1(x, y):
    await asyncio.sleep(1)
    print(f"my_fun1: {x}, {y}")
    for i in range(5): # 消费函数可以继续向其他队列中发消息
        await my_fun2.submit(a=x*3 + i)
    return x + y

@aio_task(queue_name="my_queue2", max_concurrency=50)
async def my_fun2(a):
    await asyncio.sleep(1)
    print(f"my_fun2: {a}")
    return a * 2


async def main():
    # 任然可以直接运行函数，但不会进入队列
    print(f"直接运行函数: {await my_fun1(1,2)}")

    # 提交任务
    for i in range(100):
        await my_fun1.submit(i, i+1)
    
    # 启动消费者（阻塞运行）
    await batch_consume([my_fun1, my_fun2])

if __name__ == "__main__":
    asyncio.run(main())

14. 许可证

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
nb_aiopool		nb_aiopool
tests		tests
.gitignore		.gitignore
README.md		README.md
git_nb_aiopool.py		git_nb_aiopool.py
nb_log_config.py		nb_log_config.py
pub_pypi_nb_aiopool.py		pub_pypi_nb_aiopool.py
pyproject.toml		pyproject.toml

ydf0509/nb_aiopool

Folders and files

Latest commit

History

Repository files navigation

nb_aiopool - asyncio 协程并发池

目录

1. 安装

2. 快速开始

3. NbAioPool 是伪需求吗？

3.1🚨 问题：为什么 asyncio 也需要并发池？

3.2 ❌ 反面教材：只用 asyncio.Semaphore

3.3 ✅ 正确做法：使用 NbAioPool

3.4 nb_aiopool 吊打 分批处理并发协程 (预判了你的质疑)

3.5 如果你说不分批执行，使用 生产者->asyncio.Queue->消费者 模式来实现 (再次预判了你的质疑)

4. 核心概念：pool.submit vs pool.run

4.1 pool.submit(coro) - 提交任务，返回 Future

4.2 pool.run(coro) - 提交任务并等待结果

4.3 pool.batch_submit(coros) - 批量提交任务 🆕

4.4 pool.batch_run(coros) - 批量提交并等待结果 🆕

4.5 四种方法对比总结

5. 使用场景

5.1 局部变量用法（推荐）

5.1.1 使用 async with（最佳实践）

5.1.2 手动管理生命周期（不推荐手动await pool.shutdown）

5.2 全局变量用法

6. 稳定性对比：吊打 asyncio.Semaphore

6.1 压测场景

6.2 方案1：只用 asyncio.Semaphore（灾难版）

6.3 方案2：使用 NbAioPool（稳如老狗版）

6.4 对比总结

7. 与其他方案对比

7.1 vs asyncio.Semaphore

7.2 vs asyncio.TaskGroup (Python 3.11+)

7.3 vs async_pool_executor

8. API 参考

8.1 NbAioPool

8.2 主要方法

8.3 上下文管理器

9. 最佳实践

✅ 推荐做法

10. 常见问题

Q1: batch_submit 和 batch_run 有什么区别？

Q2: async with 和手动 shutdown 有什么区别？

11. nb_aiopool 和 async-pool-executor 区别

12. 相关链接

13. nb_aiopool 贡献赠送一套分布式异步函数执行框架 nb_aio_task

14. 许可证

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Packages 0

Languages

3.2 ❌ 反面教材：只用 `asyncio.Semaphore`

3.3 ✅ 正确做法：使用 `NbAioPool`

3.4 nb_aiopool 吊打分批处理并发协程 (预判了你的质疑)

3.5 如果你说不分批执行，使用生产者->asyncio.Queue->消费者模式来实现 (再次预判了你的质疑)

4.1 `pool.submit(coro)` - 提交任务，返回 Future

4.2 `pool.run(coro)` - 提交任务并等待结果

4.3 `pool.batch_submit(coros)` - 批量提交任务 🆕

4.4 `pool.batch_run(coros)` - 批量提交并等待结果 🆕

5.1.1 使用 `async with`（最佳实践）

6.2 方案1：只用 `asyncio.Semaphore`（灾难版）

6.3 方案2：使用 `NbAioPool`（稳如老狗版）

7.1 vs `asyncio.Semaphore`

7.2 vs `asyncio.TaskGroup` (Python 3.11+)

7.3 vs `async_pool_executor`

8.1 `NbAioPool`

Q1: `batch_submit` 和 `batch_run` 有什么区别？

Q2: `async with` 和手动 `shutdown` 有什么区别？

Packages