Replies: 1 comment
Hi. You can use the following:

```r
library(future.apply)
plan(multisession, workers = 8)

job_list <- sort(1:(8*2), decreasing = TRUE)
order <- matrix(job_list, ncol = 8, byrow = TRUE)
order <- match(order, job_list)
print(order)
#> [1] 1 9 2 10 3 11 4 12 5 13 6 14 7 15 8 16

results <- future_lapply(
  job_list,
  function(job) {
    T1 <- Sys.time()
    Sys.sleep(job)
    tibble::tibble(job = job, T1 = T1, jobID = Sys.getpid())
  },
  future.scheduling = structure(TRUE, ordering = order)
) |> dplyr::bind_rows()
```
`print(results)` then shows that the tasks were processed in eight chunks. Here, the last task completes at ~16 + 8 = 24 s. Now, if you instead use

```r
order <- order(job_list, decreasing = TRUE)  ## 1:16, since job_list is already sorted
```

together with

```r
future.chunk.size = structure(1L, ordering = order)
```

the tasks are processed one by one, following that order. Here, the last tasks complete at ~18 s. I think this is what you're after if you want to maximize worker utilization and finish as soon as possible, without having to worry too much about the perfect optimization.

FWIW, there are probably more optimal ways to schedule tasks when the processing time of each task can be estimated. For example, pairing up the tasks as (16, 1), (15, 2), (14, 3), ..., (9, 8) will make them finish at about the same time.
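The two schedules can be compared with a quick back-of-the-envelope computation. The snippet below is my own illustration: it runs no futures, and it assumes that with eight chunks each worker processes two consecutive elements of the reordered job list.

```r
# Illustration only: per-worker busy times for the two schedules discussed
# above, for 16 jobs of 16, 15, ..., 1 seconds on 8 workers.
job_list <- sort(1:16, decreasing = TRUE)

# Eight-chunk schedule: the interleaved ordering puts jobs (16, 8), (15, 7),
# ..., (9, 1) on the same worker, so the slowest worker needs 16 + 8 = 24 s.
ord <- match(matrix(job_list, ncol = 8, byrow = TRUE), job_list)
chunk_sums <- tapply(job_list[ord], rep(1:8, each = 2), sum)
print(max(chunk_sums))
#> [1] 24

# Pairing schedule: (16, 1), (15, 2), ..., (9, 8) -- every pair sums to
# 17 s, so all eight workers finish at about the same time.
pair_idx <- as.vector(rbind(1:8, 16:9))  # 1, 16, 2, 15, ..., 8, 9
pair_sums <- tapply(job_list[pair_idx], rep(1:8, each = 2), sum)
print(as.vector(pair_sums))
#> [1] 17 17 17 17 17 17 17 17
```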
Hello,
I want to use the `future_lapply` function to run many jobs (e.g. 1000), and I know that these jobs vary in complexity and time consumed. I sorted them in estimated descending order of complexity, and I need to force `future_lapply` to distribute the most complex jobs to the cores first, then the next most complex, and so on, to balance the processing time. I use something similar to this, which forces the most complex 8 jobs to be executed first, then the next 8, etc.

I expect process ID 26080 to process jobs 16 and 9, not 16 and 1. How can I ensure that the jobs are distributed in the order in which they should be processed? Using `future.scheduling = Inf` instead of `future.chunk.size = 1` leads to the same results.

A related question: I need the jobs to start in the desired order (i.e. most complex first), but I also want jobs to be handed to cores as soon as the cores finish their previous job. When I monitor the current implementation, I find that some cores become idle towards the end of the processing, while many remaining jobs wait on only a couple of cores, which increases the total time consumed by the job. Is this achievable without affecting the overall performance of the task?
Thanks
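For reference, the dispatching behaviour asked about here (hand the longest remaining job to whichever core frees up first) can be sketched with a tiny simulation. This is my own illustration, independent of future.apply, and it ignores per-task dispatch overhead:

```r
# Illustration only: greedy longest-job-first dispatch of 16 jobs of
# 16, 15, ..., 1 seconds across 8 workers, ignoring overhead.
jobs <- sort(1:16, decreasing = TRUE)
finish <- numeric(8)          # time at which each worker becomes free

for (d in jobs) {
  w <- which.min(finish)      # next worker to become free
  finish[w] <- finish[w] + d  # it runs this job next
}

print(max(finish))            # makespan of the greedy schedule
#> [1] 17
```

In this toy case the greedy schedule finishes in 17 s of pure compute, which is consistent with the ~18 s mentioned in the reply once per-task overhead is added.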