
Maintain a respectful queue of jobs to be run on Quantum Engine #2821

Open
mpharrigan opened this issue Mar 6, 2020 · 10 comments
Labels: area/google, area/performance, kind/feature-request, priority/p2, status/needs-agreed-design, triage/accepted

Comments

@mpharrigan (Collaborator) commented Mar 6, 2020

Sometimes you have a whole host of jobs to run. Instead of submitting all of them at once and filling up the shared queue, you could run them one at a time, but you can save on latency and classical-processing overhead by keeping a respectful local queue that only ever has a few jobs in flight. I've been using this function:

import asyncio

async def execute_in_queue(func, params, num_workers: int):
    """Run func(param) for each param, with at most num_workers jobs in flight."""
    queue = asyncio.Queue()

    async def worker():
        while True:
            param = await queue.get()
            print(f"Processing {param}. Current queue size: {queue.qsize()}")
            await func(param)
            print(f"{param} completed")
            queue.task_done()

    # Start the workers, then fill the queue and wait for it to drain.
    tasks = [asyncio.create_task(worker()) for _ in range(num_workers)]
    for param in params:
        await queue.put(param)
    print(f"Added everything to the queue. Current queue size: {queue.qsize()}")
    await queue.join()
    # All jobs are done and the workers are idle; cancel and collect them.
    for task in tasks:
        task.cancel()
    await asyncio.gather(*tasks, return_exceptions=True)

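A hypothetical call site, for context (run_job is a placeholder coroutine that submits one parameterized job and awaits its result):

import asyncio

async def run_job(param):
    ...  # placeholder: submit one parameterized circuit and await its results

params = [{'theta': t / 10} for t in range(100)]  # placeholder parameter sweep
asyncio.run(execute_in_queue(run_job, params, num_workers=4))
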
Would something like this be welcome inside cirq.google?

@dabacon (Collaborator) commented May 8, 2020

Yes!

@kevinsung (Collaborator) commented May 14, 2020

Can't we achieve the same thing using concurrent.futures.ThreadPoolExecutor? I.e.,

from concurrent.futures import ThreadPoolExecutor

with ThreadPoolExecutor(max_workers=num_workers) as executor:
    for param in params:
        executor.submit(func, param)  # queue each job on the thread pool

@mpharrigan (Collaborator, Author) commented:
I swear I scoured the Python async docs looking for something like this!

@kevinsung (Collaborator) commented:
I didn't know about this until @mrwojtek showed me the other day 😛 .

@mrwojtek (Collaborator) commented:
concurrent.futures.ThreadPoolExecutor is conceptually orthogonal to the asyncio library; they aim to solve slightly different problems. By design, asyncio is a single-threaded library that allows for concurrent execution of Python code. ThreadPoolExecutor allows for actual parallel execution, where different functions run on different threads. This doesn't matter much for pure Python code, since it's protected by the global interpreter lock anyway, but it matters a lot in our case, where network I/O happens.

Matthew, your code looks nice, but it depends on how the func callback is implemented. If all func instances run on the same thread, they'll block on the same I/O operation. Could you give an example of what func looks like in your case?
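
One way to keep func from blocking the loop, as a minimal sketch: hand the blocking call to a worker thread. Here blocking_run is a hypothetical synchronous Engine call, and asyncio.to_thread requires Python 3.9+ (earlier versions can use loop.run_in_executor instead):

import asyncio

def blocking_run(param):
    ...  # hypothetical synchronous (blocking) call to Quantum Engine

async def func(param):
    # Hand the blocking call off to a worker thread; the event loop keeps
    # scheduling the other workers while this one waits on I/O.
    return await asyncio.to_thread(blocking_run, param)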

balopat added the kind/feature-request, triage/needs-more-evidence, area/google, area/performance, and triage/discuss labels and removed the triage/needs-more-evidence label on Sep 21, 2020
@balopat (Contributor) commented Sep 21, 2020

I'd be curious to see what you put in func as well, @mpharrigan: parallel execution will help only when we are network-bound, waiting on the remote service's response. Do you have stats on this?
Before introducing any parallel/concurrent processing primitive, we should figure out under what circumstances it helps and whether we can leverage existing features. It would be interesting to study the latency/throughput of our jobs depending on job size (circuit depth, parameters, measurements, etc.).

@mpharrigan (Collaborator, Author) commented:

func is a call to Quantum Engine. You need a variant of EngineSampler that has an async run method: instead of blocking while polling for a job to complete, you yield. When I had a bunch of 50k-shot jobs to run, this gave almost a 2x speedup (heuristically), through a combination of keeping the engine-side queue warm and pipelining the client-side classical processing. Batching might give you a bigger performance boost, but it still puts the onus on the developer to split circuits into appropriately sized chunks, and it doesn't pipeline the client-side processing.
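
A minimal sketch of that idea (AsyncEngineSampler, the submit call, and the job.done() status check are all assumptions here, not the real cirq.google API):

import asyncio

class AsyncEngineSampler:
    """Hypothetical async variant of EngineSampler; a sketch, not the real API."""

    def __init__(self, engine, poll_interval: float = 2.0):
        self.engine = engine
        self.poll_interval = poll_interval

    async def run_async(self, program, repetitions: int):
        # Submitting is assumed to return quickly with a job handle.
        job = self.engine.submit(program, repetitions=repetitions)
        # Yield to the event loop between status checks instead of
        # blocking the thread while the job waits in the engine queue.
        while not job.done():
            await asyncio.sleep(self.poll_interval)
        return job.results()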

Really, it would be sweet if we had an auto-batcher that uses async trickery to build up a local queue until it collects enough jobs to batch up and send.
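
A minimal sketch of such an auto-batcher, assuming a send_batch coroutine that submits a list of jobs as a single batched Engine call: collect items from a local queue until either the batch is full or max_wait seconds elapse, then flush:

import asyncio

async def auto_batcher(queue: asyncio.Queue, send_batch, batch_size: int, max_wait: float):
    loop = asyncio.get_running_loop()
    while True:
        # Block until at least one job arrives, then start a batch.
        batch = [await queue.get()]
        deadline = loop.time() + max_wait
        while len(batch) < batch_size:
            remaining = deadline - loop.time()
            if remaining <= 0:
                break
            try:
                batch.append(await asyncio.wait_for(queue.get(), remaining))
            except asyncio.TimeoutError:
                break
        # send_batch is an assumed coroutine that submits one batched job.
        await send_batch(batch)
        for _ in batch:
            queue.task_done()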

balopat added the priority/p2, status/needs-agreed-design, and triage/accepted labels and removed the triage/discuss label on Oct 1, 2020
@MichaelBroughton (Collaborator) commented:
Has this been superseded by the feature-testbed stuff, @mpharrigan?

@mpharrigan (Collaborator, Author) commented:
That would be a natural place for it, although it doesn't exist yet. Probably blocked by #5023.

@dstrain115 (Collaborator) commented:
@verult Do we think this feature is obsolete now that we have streaming?
