Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Reduced performance by serializing task scheduling #183

Merged
merged 6 commits into from
Mar 9, 2020

Conversation

tkf
Copy link
Member

@tkf tkf commented Jan 22, 2020

Commit Message

Improve Reduced performance by serializing task scheduling (#183)

This PR implements more control over the order of the tasks to be
scheduled. It is built on top a simpler threaded reduce I posted
elsewhere [1,2]. It guarantees that tasks are executed from left to
right while allowing load balancing with uneven workload per element.
It improves parallel findfirst performance [3] by avoiding to process
elements that appear after the one marked by Reduced.

[1] The initial implementation of threaded reduce with serialized
scheduling.
JuliaLang/julia#34470 (comment)

[2] The simplified version on which this PR depends.
JuliaLang/julia#34470 (comment)

[3] #183 (comment)

@codecov-io
Copy link

codecov-io commented Jan 22, 2020

Codecov Report

Merging #183 into master will decrease coverage by 0.01%.
The diff coverage is 93.75%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #183      +/-   ##
==========================================
- Coverage   92.33%   92.31%   -0.02%     
==========================================
  Files          20       20              
  Lines        1291     1301      +10     
==========================================
+ Hits         1192     1201       +9     
- Misses         99      100       +1
Impacted Files Coverage Δ
src/reduce.jl 93.02% <100%> (+0.61%) ⬆️
src/progress.jl 89.36% <100%> (ø) ⬆️
src/Transducers.jl 90.9% <66.66%> (-9.1%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ed09118...fb39be9. Read the comment docs.

@github-actions
Copy link
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 22 Jan 2020 - 07:06
    • Baseline: 22 Jan 2020 - 07:10
  • Package commits:
    • Target: 2a4513
    • Baseline: 19c510
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["cat", "base"] 1.06 (5%) ❌ 1.00 (1%)
["collect", "filter-missing"] 1.08 (5%) ❌ 1.00 (1%)
["collect", "identity-float"] 1.13 (5%) ❌ 1.00 (1%)
["collect", "identity-union"] 1.10 (5%) ❌ 1.00 (1%)
["filter_map_reduce", "man"] 1.06 (5%) ❌ 1.00 (1%)
["filter_map_reduce", "xf"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "mul", "linalg", "32"] 1.12 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "false", "8"] 0.83 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "32"] 1.12 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "32"] 0.84 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "true", "8"] 1.11 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "32"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "8"] 0.92 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "32"] 0.95 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "32"] 1.07 (5%) ❌ 1.00 (1%)
["missing_argmax", "man"] 0.94 (5%) ✅ 1.00 (1%)
["missing_argmax", "rf"] 0.90 (5%) ✅ 1.00 (1%)
["missing_argmax", "xf"] 0.94 (5%) ✅ 1.00 (1%)
["missing_dot", "equiv"] 1.21 (5%) ❌ 1.00 (1%)
["missing_dot", "xf"] 1.06 (5%) ❌ 1.00 (1%)
["missing_dot", "xf_nota"] 0.95 (5%) ✅ 1.00 (1%)
["partition_by", "man"] 0.89 (5%) ✅ 1.00 (1%)
["partition_by", "xf"] 0.93 (5%) ✅ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      12336 s          0 s       1038 s      41822 s          0 s
       #2  2294 MHz      41177 s          0 s       1374 s      13715 s          0 s
       
  Memory: 6.782741546630859 GB (3452.984375 MB free)
  Uptime: 572.0 sec
  Load Avg:  1.07666015625  0.99755859375  0.57080078125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      18812 s          0 s       1153 s      58774 s          0 s
       #2  2294 MHz      58410 s          0 s       1524 s      19920 s          0 s
       
  Memory: 6.782741546630859 GB (3486.36328125 MB free)
  Uptime: 808.0 sec
  Load Avg:  1.02978515625  1.01611328125  0.685546875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 22 Jan 2020 - 7:6
  • Package commit: 2a4513
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 236.598 μs (5%)
["cat", "xf"] 1.700 μs (5%)
["collect", "filter-missing"] 95.000 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 79.799 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 356.997 μs (5%) 284.83 KiB (1%) 6666
["dot", "blas"] 2.289 μs (5%)
["dot", "man"] 2.278 μs (5%)
["dot", "rf"] 2.688 μs (5%)
["dot", "xf"] 2.688 μs (5%)
["filter_map_map!", "man"] 67.799 μs (5%)
["filter_map_map!", "xf"] 70.299 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 206.498 μs (5%)
["filter_map_reduce", "xf"] 206.398 μs (5%)
["gemm", "fusedmul", "blas", "16"] 5.876 ms (5%)
["gemm", "fusedmul", "blas", "2"] 4.106 ms (5%)
["gemm", "fusedmul", "blas", "32"] 8.483 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.382 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.520 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 646.888 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 11.278 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.717 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.403 ms (5%)
["gemm", "mul", "linalg", "32"] 4.157 μs (5%)
["gemm", "mul", "linalg", "8"] 286.228 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.870 ms (5%)
["gemm", "mul", "man", "false", "32"] 7.100 μs (5%)
["gemm", "mul", "man", "false", "8"] 412.366 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.798 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 7.050 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 390.394 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.865 ms (5%)
["gemm", "mul", "man", "true", "32"] 7.333 μs (5%)
["gemm", "mul", "man", "true", "8"] 442.629 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.858 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 7.475 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 461.130 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.784 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 6.160 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.503 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 4.860 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 7.475 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 411.168 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 1.020 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.344 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.356 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.320 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.314 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.270 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.520 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 225.598 μs (5%) 74.08 KiB (1%) 3866
["missing_dot", "xf_nota"] 204.298 μs (5%) 74.11 KiB (1%) 3866
["partition_by", "man"] 1.941 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.901 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      12336 s          0 s       1038 s      41822 s          0 s
       #2  2294 MHz      41177 s          0 s       1374 s      13715 s          0 s
       
  Memory: 6.782741546630859 GB (3452.984375 MB free)
  Uptime: 572.0 sec
  Load Avg:  1.07666015625  0.99755859375  0.57080078125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 22 Jan 2020 - 7:10
  • Package commit: 19c510
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 222.697 μs (5%)
["cat", "xf"] 1.760 μs (5%)
["collect", "filter-missing"] 88.299 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 70.500 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 324.296 μs (5%) 285.78 KiB (1%) 6676
["dot", "blas"] 2.300 μs (5%)
["dot", "man"] 2.389 μs (5%)
["dot", "rf"] 2.687 μs (5%)
["dot", "xf"] 2.687 μs (5%)
["filter_map_map!", "man"] 69.499 μs (5%)
["filter_map_map!", "xf"] 70.899 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 194.997 μs (5%)
["filter_map_reduce", "xf"] 194.997 μs (5%)
["gemm", "fusedmul", "blas", "16"] 5.893 ms (5%)
["gemm", "fusedmul", "blas", "2"] 4.123 ms (5%)
["gemm", "fusedmul", "blas", "32"] 8.589 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.419 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.591 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 652.393 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 11.256 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.745 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.388 ms (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 299.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.967 ms (5%)
["gemm", "mul", "man", "false", "32"] 7.200 μs (5%)
["gemm", "mul", "man", "false", "8"] 499.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.969 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 6.300 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 399.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.981 ms (5%)
["gemm", "mul", "man", "true", "32"] 8.699 μs (5%)
["gemm", "mul", "man", "true", "8"] 399.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.993 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 6.999 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 499.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.931 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 6.499 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 399.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 5.005 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 6.999 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 399.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 1.080 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.600 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.511 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.340 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.320 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.314 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.210 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.500 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 212.997 μs (5%) 74.11 KiB (1%) 3866
["missing_dot", "xf_nota"] 215.297 μs (5%) 74.14 KiB (1%) 3867
["partition_by", "man"] 2.183 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.054 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      18812 s          0 s       1153 s      58774 s          0 s
       #2  2294 MHz      58410 s          0 s       1524 s      19920 s          0 s
       
  Memory: 6.782741546630859 GB (3486.36328125 MB free)
  Uptime: 808.0 sec
  Load Avg:  1.02978515625  1.01611328125  0.685546875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.682
BogoMIPS:            4589.36
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 22 Jan 2020 - 07:08
    • Baseline: 22 Jan 2020 - 07:12
  • Package commits:
    • Target: 2a4513
    • Baseline: 19c510
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["collect", "assoc", "basesize=1"] 1.22 (5%) ❌ 1.01 (1%)
["collect", "unordered", "basesize=1024"] 1.00 (5%) 0.91 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=128"] 0.48 (5%) ✅ 0.45 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=256"] 0.70 (5%) ✅ 0.57 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=512"] 0.45 (5%) ✅ 0.37 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=128"] 0.96 (5%) 0.93 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=256"] 0.92 (5%) ✅ 0.86 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=512"] 0.85 (5%) ✅ 0.70 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=128"] 0.58 (5%) ✅ 0.41 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=256"] 0.22 (5%) ✅ 0.17 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=512"] 0.32 (5%) ✅ 0.22 (1%) ✅
["parallel_histogram", "comm", "basesize=4096"] 1.00 (5%) 0.94 (1%) ✅
["parallel_histogram", "comm", "basesize=8192"] 0.81 (5%) ✅ 1.00 (1%)
["words", "nthreads=1"] 0.99 (5%) 0.99 (1%) ✅
["words", "nthreads=2"] 1.10 (5%) ❌ 1.00 (1%)
["words", "nthreads=4"] 1.12 (5%) ❌ 0.99 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      48167 s          0 s       1669 s      15317 s          0 s
       #2  2394 MHz      43964 s          0 s       1845 s      20281 s          0 s
       
  Memory: 6.782741546630859 GB (3556.22265625 MB free)
  Uptime: 673.0 sec
  Load Avg:  1.76611328125  1.4912109375  0.8564453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      70209 s          0 s       2054 s      20990 s          0 s
       #2  2394 MHz      65672 s          0 s       2218 s      26268 s          0 s
       
  Memory: 6.782741546630859 GB (3567.41015625 MB free)
  Uptime: 955.0 sec
  Load Avg:  1.70361328125  1.5810546875  1.0634765625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 22 Jan 2020 - 7:8
  • Package commit: 2a4513
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 522.574 ms (5%) 7.646 ms 92.14 MiB (1%) 1760614
["collect", "assoc", "basesize=1024"] 276.522 ms (5%) 1.84 MiB (1%) 1969
["collect", "assoc", "basesize=32"] 284.298 ms (5%) 5.79 MiB (1%) 59267
["collect", "seq"] 548.174 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 573.596 ms (5%) 16.564 ms 30.29 MiB (1%) 477142
["collect", "unordered", "basesize=1024"] 412.286 ms (5%) 1.01 MiB (1%) 19005
["collect", "unordered", "basesize=32"] 306.760 ms (5%) 1.56 MiB (1%) 22496
["findfirst", "n=1000", "foldl"] 752.709 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 381.451 ms (5%) 603.81 KiB (1%) 11844
["findfirst", "n=1000", "reduce", "basesize=256"] 381.185 ms (5%) 307.47 KiB (1%) 6045
["findfirst", "n=1000", "reduce", "basesize=512"] 387.528 ms (5%) 159.70 KiB (1%) 3143
["findfirst", "n=400", "foldl"] 565.109 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 287.266 ms (5%) 1.09 MiB (1%) 21978
["findfirst", "n=400", "reduce", "basesize=256"] 285.202 ms (5%) 563.33 KiB (1%) 11082
["findfirst", "n=400", "reduce", "basesize=512"] 286.313 ms (5%) 286.06 KiB (1%) 5642
["findfirst", "n=500", "foldl"] 96.875 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 49.729 ms (5%) 168.14 KiB (1%) 3283
["findfirst", "n=500", "reduce", "basesize=256"] 49.514 ms (5%) 90.14 KiB (1%) 1756
["findfirst", "n=500", "reduce", "basesize=512"] 53.028 ms (5%) 51.42 KiB (1%) 1003
["parallel_histogram", "assoc", "basesize=16384"] 5.572 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 6.580 ms (5%) 1.80 MiB (1%) 540
["parallel_histogram", "assoc", "basesize=8192"] 6.167 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 13.433 ms (5%) 1.22 MiB (1%) 275
["parallel_histogram", "comm", "basesize=4096"] 18.379 ms (5%) 1001.14 KiB (1%) 406
["parallel_histogram", "comm", "basesize=8192"] 14.487 ms (5%) 1.22 MiB (1%) 340
["parallel_histogram", "seq"] 10.164 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 19.068 ms (5%)
["sum", "random", "reduce", "basesize=128"] 10.311 ms (5%) 331.47 KiB (1%) 6840
["sum", "random", "reduce", "basesize=256"] 10.085 ms (5%) 164.17 KiB (1%) 3397
["sum", "random", "reduce", "basesize=512"] 9.762 ms (5%) 80.78 KiB (1%) 1676
["sum", "uniform", "foldl"] 18.881 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 10.302 ms (5%) 331.69 KiB (1%) 6854
["sum", "uniform", "reduce", "basesize=256"] 9.965 ms (5%) 164.22 KiB (1%) 3400
["sum", "uniform", "reduce", "basesize=512"] 9.837 ms (5%) 80.77 KiB (1%) 1675
["sum", "valley", "foldl"] 19.358 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 10.585 ms (5%) 331.41 KiB (1%) 6836
["sum", "valley", "reduce", "basesize=256"] 10.110 ms (5%) 164.16 KiB (1%) 3396
["sum", "valley", "reduce", "basesize=512"] 10.208 ms (5%) 80.77 KiB (1%) 1675
["words", "nthreads=1"] 44.445 ms (5%) 8.271 ms 64.70 MiB (1%) 2093806
["words", "nthreads=2"] 24.401 ms (5%) 65.42 MiB (1%) 2093968
["words", "nthreads=4"] 25.487 ms (5%) 65.87 MiB (1%) 2094123

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      48167 s          0 s       1669 s      15317 s          0 s
       #2  2394 MHz      43964 s          0 s       1845 s      20281 s          0 s
       
  Memory: 6.782741546630859 GB (3556.22265625 MB free)
  Uptime: 673.0 sec
  Load Avg:  1.76611328125  1.4912109375  0.8564453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 22 Jan 2020 - 7:12
  • Package commit: 19c510
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 427.070 ms (5%) 10.805 ms 91.55 MiB (1%) 1754655
["collect", "assoc", "basesize=1024"] 278.876 ms (5%) 1.84 MiB (1%) 1969
["collect", "assoc", "basesize=32"] 282.762 ms (5%) 5.77 MiB (1%) 59117
["collect", "seq"] 545.751 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 575.712 ms (5%) 30.25 MiB (1%) 475054
["collect", "unordered", "basesize=1024"] 412.707 ms (5%) 1.10 MiB (1%) 25421
["collect", "unordered", "basesize=32"] 308.976 ms (5%) 1.57 MiB (1%) 23175
["findfirst", "n=1000", "foldl"] 760.806 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 797.906 ms (5%) 1.32 MiB (1%) 26514
["findfirst", "n=1000", "reduce", "basesize=256"] 545.181 ms (5%) 535.02 KiB (1%) 10433
["findfirst", "n=1000", "reduce", "basesize=512"] 866.144 ms (5%) 434.17 KiB (1%) 8477
["findfirst", "n=400", "foldl"] 568.887 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 300.800 ms (5%) 1.18 MiB (1%) 23757
["findfirst", "n=400", "reduce", "basesize=256"] 310.945 ms (5%) 657.86 KiB (1%) 12949
["findfirst", "n=400", "reduce", "basesize=512"] 336.969 ms (5%) 409.63 KiB (1%) 8034
["findfirst", "n=500", "foldl"] 97.210 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 85.768 ms (5%) 414.70 KiB (1%) 8021
["findfirst", "n=500", "reduce", "basesize=256"] 224.165 ms (5%) 520.59 KiB (1%) 10079
["findfirst", "n=500", "reduce", "basesize=512"] 164.468 ms (5%) 232.13 KiB (1%) 4474
["parallel_histogram", "assoc", "basesize=16384"] 5.530 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 6.374 ms (5%) 1.80 MiB (1%) 540
["parallel_histogram", "assoc", "basesize=8192"] 5.921 ms (5%) 1.43 MiB (1%) 261
["parallel_histogram", "comm", "basesize=16384"] 13.526 ms (5%) 1.22 MiB (1%) 206
["parallel_histogram", "comm", "basesize=4096"] 18.360 ms (5%) 1.04 MiB (1%) 4212
["parallel_histogram", "comm", "basesize=8192"] 17.884 ms (5%) 1.22 MiB (1%) 336
["parallel_histogram", "seq"] 10.165 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 19.180 ms (5%)
["sum", "random", "reduce", "basesize=128"] 10.212 ms (5%) 331.34 KiB (1%) 6832
["sum", "random", "reduce", "basesize=256"] 10.014 ms (5%) 164.09 KiB (1%) 3392
["sum", "random", "reduce", "basesize=512"] 9.775 ms (5%) 80.73 KiB (1%) 1673
["sum", "uniform", "foldl"] 18.658 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 10.066 ms (5%) 331.34 KiB (1%) 6832
["sum", "uniform", "reduce", "basesize=256"] 9.873 ms (5%) 164.08 KiB (1%) 3391
["sum", "uniform", "reduce", "basesize=512"] 9.673 ms (5%) 80.72 KiB (1%) 1672
["sum", "valley", "foldl"] 19.422 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 10.288 ms (5%) 331.31 KiB (1%) 6830
["sum", "valley", "reduce", "basesize=256"] 10.092 ms (5%) 164.09 KiB (1%) 3392
["sum", "valley", "reduce", "basesize=512"] 9.918 ms (5%) 80.73 KiB (1%) 1673
["words", "nthreads=1"] 44.918 ms (5%) 7.568 ms 65.37 MiB (1%) 2114811
["words", "nthreads=2"] 22.083 ms (5%) 65.73 MiB (1%) 2114891
["words", "nthreads=4"] 22.743 ms (5%) 66.45 MiB (1%) 2115054

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      70209 s          0 s       2054 s      20990 s          0 s
       #2  2394 MHz      65672 s          0 s       2218 s      26268 s          0 s
       
  Memory: 6.782741546630859 GB (3567.41015625 MB free)
  Uptime: 955.0 sec
  Load Avg:  1.70361328125  1.5810546875  1.0634765625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, haswell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               63
Model name:          Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
Stepping:            2
CPU MHz:             2394.456
BogoMIPS:            4788.91
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            30720K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single pti fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
Vendor :Intel
Architecture :Haswell
Model Family: 0x06, Model: 0x3f, Stepping: 0x02, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 30720) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@tkf tkf changed the title Serialize task scheduling in reduce Improve Reduced performance by serializing task scheduling Jan 22, 2020
@tkf tkf changed the title Improve Reduced performance by serializing task scheduling Improve Reduced performance by serializing task scheduling Jan 22, 2020
@github-actions
Copy link
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 24 Jan 2020 - 10:12
    • Baseline: 24 Jan 2020 - 10:16
  • Package commits:
    • Target: bbc0a7
    • Baseline: 3d75be
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["filter_map_map!", "man"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "16"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "2"] 1.13 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "32"] 1.15 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "8"] 1.21 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "xf", "16"] 1.08 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "xf", "2"] 0.91 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "32"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "8"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "false", "256"] 1.08 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "256"] 1.08 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 0.92 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "false", "8"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "32"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 1.10 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 0.92 (5%) ✅ 1.00 (1%)
["missing_argmax", "man"] 1.24 (5%) ❌ 1.00 (1%)
["missing_argmax", "rf"] 1.22 (5%) ❌ 1.00 (1%)
["missing_argmax", "xf"] 1.23 (5%) ❌ 1.00 (1%)
["partition_by", "man"] 1.24 (5%) ❌ 1.00 (1%)
["partition_by", "xf"] 1.23 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      31909 s          0 s       1081 s      21858 s          0 s
       #2  2095 MHz      21394 s          0 s       1228 s      32693 s          0 s
       
  Memory: 6.782741546630859 GB (3486.05078125 MB free)
  Uptime: 567.0 sec
  Load Avg:  1.06689453125  0.99609375  0.587890625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      47320 s          0 s       1236 s      29894 s          0 s
       #2  2095 MHz      29616 s          0 s       1406 s      47868 s          0 s
       
  Memory: 6.782741546630859 GB (3500.90625 MB free)
  Uptime: 804.0 sec
  Load Avg:  1.0  1.0  0.689453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 24 Jan 2020 - 10:12
  • Package commit: bbc0a7
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 279.209 μs (5%)
["cat", "xf"] 2.044 μs (5%)
["collect", "filter-missing"] 112.704 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 86.003 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 395.613 μs (5%) 285.98 KiB (1%) 6676
["dot", "blas"] 1.430 μs (5%)
["dot", "man"] 1.430 μs (5%)
["dot", "rf"] 2.567 μs (5%)
["dot", "xf"] 2.567 μs (5%)
["filter_map_map!", "man"] 63.705 μs (5%)
["filter_map_map!", "xf"] 67.405 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 276.210 μs (5%)
["filter_map_reduce", "xf"] 276.410 μs (5%)
["findall", "base"] 985.138 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 789.930 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 949.035 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.456 ms (5%)
["gemm", "fusedmul", "blas", "2"] 4.007 ms (5%)
["gemm", "fusedmul", "blas", "32"] 7.974 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.307 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.485 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 594.253 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.152 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.530 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.067 ms (5%)
["gemm", "mul", "linalg", "32"] 3.600 μs (5%)
["gemm", "mul", "linalg", "8"] 291.155 ns (5%)
["gemm", "mul", "man", "false", "256"] 2.048 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.334 μs (5%)
["gemm", "mul", "man", "false", "8"] 524.136 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 2.031 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.615 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 459.944 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.915 ms (5%)
["gemm", "mul", "man", "true", "32"] 6.081 μs (5%)
["gemm", "mul", "man", "true", "8"] 489.526 ns (5%)
["gemm", "mul", "xf", "false", "256"] 2.043 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.117 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 529.537 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 2.026 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.429 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 438.394 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 2.043 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.434 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 460.929 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 3.213 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 3.038 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 3.050 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.730 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.667 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.630 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.750 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 261.809 μs (5%) 73.95 KiB (1%) 3862
["missing_dot", "xf_nota"] 262.910 μs (5%) 74.08 KiB (1%) 3864
["partition_by", "man"] 2.395 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.242 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      31909 s          0 s       1081 s      21858 s          0 s
       #2  2095 MHz      21394 s          0 s       1228 s      32693 s          0 s
       
  Memory: 6.782741546630859 GB (3486.05078125 MB free)
  Uptime: 567.0 sec
  Load Avg:  1.06689453125  0.99609375  0.587890625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 24 Jan 2020 - 10:16
  • Package commit: 3d75be
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 280.716 μs (5%)
["cat", "xf"] 2.045 μs (5%)
["collect", "filter-missing"] 113.407 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 82.605 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 397.923 μs (5%) 285.59 KiB (1%) 6674
["dot", "blas"] 1.430 μs (5%)
["dot", "man"] 1.410 μs (5%)
["dot", "rf"] 2.578 μs (5%)
["dot", "xf"] 2.567 μs (5%)
["filter_map_map!", "man"] 59.502 μs (5%)
["filter_map_map!", "xf"] 64.603 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 276.216 μs (5%)
["filter_map_reduce", "xf"] 276.517 μs (5%)
["findall", "base"] 989.366 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 775.751 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 946.462 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.090 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.552 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.924 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.554 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.056 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 652.432 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.816 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.678 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.066 ms (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.898 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.600 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.886 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.600 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.908 ms (5%)
["gemm", "mul", "man", "true", "32"] 6.200 μs (5%)
["gemm", "mul", "man", "true", "8"] 500.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 2.041 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.300 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 2.024 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.700 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 2.042 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.700 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.600 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.500 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.488 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.750 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.630 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.667 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.750 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 256.915 μs (5%) 74.11 KiB (1%) 3868
["missing_dot", "xf_nota"] 257.716 μs (5%) 73.95 KiB (1%) 3860
["partition_by", "man"] 1.939 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.816 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      47320 s          0 s       1236 s      29894 s          0 s
       #2  2095 MHz      29616 s          0 s       1406 s      47868 s          0 s
       
  Memory: 6.782741546630859 GB (3500.90625 MB free)
  Uptime: 804.0 sec
  Load Avg:  1.0  1.0  0.689453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.195
BogoMIPS:            4190.39
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 24 Jan 2020 - 10:14
    • Baseline: 24 Jan 2020 - 10:18
  • Package commits:
    • Target: bbc0a7
    • Baseline: 3d75be
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["collect", "assoc", "basesize=1"] 1.25 (5%) ❌ 1.01 (1%)
["collect", "unordered", "basesize=1024"] 1.22 (5%) ❌ 1.11 (1%) ❌
["findfirst", "n=1000", "reduce", "basesize=128"] 0.70 (5%) ✅ 0.62 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=256"] 0.74 (5%) ✅ 0.60 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=512"] 0.55 (5%) ✅ 0.44 (1%) ✅
["findfirst", "n=400", "foldl"] 1.08 (5%) ❌ 1.00 (1%)
["findfirst", "n=400", "reduce", "basesize=128"] 0.96 (5%) 0.94 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=256"] 0.94 (5%) ✅ 0.84 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=512"] 0.88 (5%) ✅ 0.70 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=128"] 0.25 (5%) ✅ 0.20 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=256"] 0.25 (5%) ✅ 0.21 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=512"] 0.14 (5%) ✅ 0.13 (1%) ✅
["overhead", "n=64"] 1.65 (5%) ❌ 1.00 (1%)
["parallel_histogram", "assoc", "basesize=16384"] 0.93 (5%) ✅ 1.00 (1%)
["parallel_histogram", "assoc", "basesize=4096"] 1.09 (5%) ❌ 1.15 (1%) ❌
["parallel_histogram", "assoc", "basesize=8192"] 1.17 (5%) ❌ 1.00 (1%)
["parallel_histogram", "comm", "basesize=4096"] 1.10 (5%) ❌ 1.03 (1%) ❌
["parallel_histogram", "comm", "basesize=8192"] 0.99 (5%) 0.98 (1%) ✅
["parallel_histogram", "seq"] 0.90 (5%) ✅ 1.00 (1%)
["sum", "random", "foldl"] 1.05 (5%) ❌ 1.00 (1%)
["sum", "random", "reduce", "basesize=256"] 1.12 (5%) ❌ 1.00 (1%)
["sum", "uniform", "reduce", "basesize=256"] 1.11 (5%) ❌ 1.00 (1%)
["sum", "valley", "reduce", "basesize=256"] 1.11 (5%) ❌ 1.00 (1%)
["sum", "valley", "reduce", "basesize=512"] 1.10 (5%) ❌ 1.00 (1%)
["words", "nthreads=4"] 1.07 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      46441 s          0 s       1769 s      13926 s          0 s
       #2  2095 MHz      45590 s          0 s       2101 s      14746 s          0 s
       
  Memory: 6.782741546630859 GB (3581.92578125 MB free)
  Uptime: 641.0 sec
  Load Avg:  1.72509765625  1.49951171875  0.857421875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      66824 s          0 s       2286 s      20764 s          0 s
       #2  2095 MHz      68612 s          0 s       2491 s      19127 s          0 s
       
  Memory: 6.782741546630859 GB (3608.890625 MB free)
  Uptime: 920.0 sec
  Load Avg:  1.66064453125  1.560546875  1.0517578125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 24 Jan 2020 - 10:14
  • Package commit: bbc0a7
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 533.511 ms (5%) 6.178 ms 92.16 MiB (1%) 1761390
["collect", "assoc", "basesize=1024"] 281.004 ms (5%) 1.84 MiB (1%) 1968
["collect", "assoc", "basesize=32"] 288.762 ms (5%) 5.79 MiB (1%) 59281
["collect", "seq"] 532.104 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 530.898 ms (5%) 7.864 ms 30.24 MiB (1%) 474150
["collect", "unordered", "basesize=1024"] 412.947 ms (5%) 928.91 KiB (1%) 12565
["collect", "unordered", "basesize=32"] 314.695 ms (5%) 1.53 MiB (1%) 20476
["findfirst", "n=1000", "foldl"] 754.846 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 387.113 ms (5%) 603.81 KiB (1%) 11844
["findfirst", "n=1000", "reduce", "basesize=256"] 383.087 ms (5%) 307.34 KiB (1%) 6037
["findfirst", "n=1000", "reduce", "basesize=512"] 393.951 ms (5%) 159.69 KiB (1%) 3142
["findfirst", "n=400", "foldl"] 583.769 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 294.171 ms (5%) 1.09 MiB (1%) 21980
["findfirst", "n=400", "reduce", "basesize=256"] 294.553 ms (5%) 563.42 KiB (1%) 11088
["findfirst", "n=400", "reduce", "basesize=512"] 297.062 ms (5%) 286.00 KiB (1%) 5638
["findfirst", "n=500", "foldl"] 99.249 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 50.344 ms (5%) 168.17 KiB (1%) 3285
["findfirst", "n=500", "reduce", "basesize=256"] 49.833 ms (5%) 90.22 KiB (1%) 1761
["findfirst", "n=500", "reduce", "basesize=512"] 54.072 ms (5%) 51.41 KiB (1%) 1002
["overhead", "n=64"] 344.300 μs (5%) 155.41 KiB (1%) 3026
["parallel_histogram", "assoc", "basesize=16384"] 4.889 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 6.425 ms (5%) 2.07 MiB (1%) 545
["parallel_histogram", "assoc", "basesize=8192"] 5.897 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 15.989 ms (5%) 1.22 MiB (1%) 373
["parallel_histogram", "comm", "basesize=4096"] 24.820 ms (5%) 1.06 MiB (1%) 4045
["parallel_histogram", "comm", "basesize=8192"] 19.569 ms (5%) 2.173 ms 1.23 MiB (1%) 885
["parallel_histogram", "seq"] 8.972 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 18.990 ms (5%)
["sum", "random", "reduce", "basesize=128"] 10.106 ms (5%) 331.53 KiB (1%) 6844
["sum", "random", "reduce", "basesize=256"] 10.287 ms (5%) 164.20 KiB (1%) 3399
["sum", "random", "reduce", "basesize=512"] 9.773 ms (5%) 80.77 KiB (1%) 1675
["sum", "uniform", "foldl"] 17.626 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 10.071 ms (5%) 331.73 KiB (1%) 6857
["sum", "uniform", "reduce", "basesize=256"] 9.856 ms (5%) 164.30 KiB (1%) 3405
["sum", "uniform", "reduce", "basesize=512"] 9.502 ms (5%) 80.84 KiB (1%) 1680
["sum", "valley", "foldl"] 19.174 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 9.953 ms (5%) 331.45 KiB (1%) 6839
["sum", "valley", "reduce", "basesize=256"] 10.052 ms (5%) 164.14 KiB (1%) 3395
["sum", "valley", "reduce", "basesize=512"] 10.049 ms (5%) 80.78 KiB (1%) 1676
["words", "nthreads=1"] 43.266 ms (5%) 7.839 ms 64.85 MiB (1%) 2098866
["words", "nthreads=2"] 23.671 ms (5%) 65.57 MiB (1%) 2099028
["words", "nthreads=4"] 26.037 ms (5%) 66.21 MiB (1%) 2099335

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      46441 s          0 s       1769 s      13926 s          0 s
       #2  2095 MHz      45590 s          0 s       2101 s      14746 s          0 s
       
  Memory: 6.782741546630859 GB (3581.92578125 MB free)
  Uptime: 641.0 sec
  Load Avg:  1.72509765625  1.49951171875  0.857421875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 24 Jan 2020 - 10:18
  • Package commit: 3d75be
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 426.105 ms (5%) 10.943 ms 91.55 MiB (1%) 1754654
["collect", "assoc", "basesize=1024"] 278.514 ms (5%) 1.84 MiB (1%) 1968
["collect", "assoc", "basesize=32"] 285.493 ms (5%) 5.77 MiB (1%) 59117
["collect", "seq"] 544.733 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 511.586 ms (5%) 7.698 ms 30.24 MiB (1%) 474108
["collect", "unordered", "basesize=1024"] 338.075 ms (5%) 834.09 KiB (1%) 6497
["collect", "unordered", "basesize=32"] 311.146 ms (5%) 1.52 MiB (1%) 19863
["findfirst", "n=1000", "foldl"] 756.138 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 552.817 ms (5%) 978.09 KiB (1%) 19084
["findfirst", "n=1000", "reduce", "basesize=256"] 519.808 ms (5%) 509.70 KiB (1%) 9949
["findfirst", "n=1000", "reduce", "basesize=512"] 712.817 ms (5%) 362.80 KiB (1%) 7074
["findfirst", "n=400", "foldl"] 538.711 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 306.324 ms (5%) 1.16 MiB (1%) 23416
["findfirst", "n=400", "reduce", "basesize=256"] 314.800 ms (5%) 668.33 KiB (1%) 13163
["findfirst", "n=400", "reduce", "basesize=512"] 339.335 ms (5%) 407.23 KiB (1%) 7993
["findfirst", "n=500", "foldl"] 97.807 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 205.326 ms (5%) 851.13 KiB (1%) 16472
["findfirst", "n=500", "reduce", "basesize=256"] 195.859 ms (5%) 431.55 KiB (1%) 8351
["findfirst", "n=500", "reduce", "basesize=512"] 383.678 ms (5%) 405.14 KiB (1%) 7863
["overhead", "n=64"] 208.100 μs (5%) 155.14 KiB (1%) 3009
["parallel_histogram", "assoc", "basesize=16384"] 5.250 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 5.888 ms (5%) 1.80 MiB (1%) 539
["parallel_histogram", "assoc", "basesize=8192"] 5.050 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 16.031 ms (5%) 1.22 MiB (1%) 290
["parallel_histogram", "comm", "basesize=4096"] 22.533 ms (5%) 1.03 MiB (1%) 3623
["parallel_histogram", "comm", "basesize=8192"] 19.852 ms (5%) 1.26 MiB (1%) 2471
["parallel_histogram", "seq"] 9.945 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 18.061 ms (5%)
["sum", "random", "reduce", "basesize=128"] 9.881 ms (5%) 331.38 KiB (1%) 6834
["sum", "random", "reduce", "basesize=256"] 9.199 ms (5%) 164.11 KiB (1%) 3393
["sum", "random", "reduce", "basesize=512"] 9.703 ms (5%) 80.72 KiB (1%) 1672
["sum", "uniform", "foldl"] 17.359 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 9.811 ms (5%) 331.38 KiB (1%) 6834
["sum", "uniform", "reduce", "basesize=256"] 8.909 ms (5%) 164.13 KiB (1%) 3394
["sum", "uniform", "reduce", "basesize=512"] 9.284 ms (5%) 80.70 KiB (1%) 1671
["sum", "valley", "foldl"] 18.367 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 10.153 ms (5%) 331.31 KiB (1%) 6830
["sum", "valley", "reduce", "basesize=256"] 9.040 ms (5%) 164.08 KiB (1%) 3391
["sum", "valley", "reduce", "basesize=512"] 9.099 ms (5%) 80.75 KiB (1%) 1674
["words", "nthreads=1"] 44.138 ms (5%) 7.452 ms 64.95 MiB (1%) 2102001
["words", "nthreads=2"] 23.416 ms (5%) 65.31 MiB (1%) 2102081
["words", "nthreads=4"] 24.445 ms (5%) 66.21 MiB (1%) 2102396

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.3 LTS
  uname: Linux 5.0.0-1028-azure #30~18.04.1-Ubuntu SMP Fri Dec 6 11:47:59 UTC 2019 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      66824 s          0 s       2286 s      20764 s          0 s
       #2  2095 MHz      68612 s          0 s       2491 s      19127 s          0 s
       
  Memory: 6.782741546630859 GB (3608.890625 MB free)
  Uptime: 920.0 sec
  Load Avg:  1.66064453125  1.560546875  1.0517578125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.075
BogoMIPS:            4190.15
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@tkf
Copy link
Member Author

tkf commented Jan 25, 2020

So ["overhead", "n=64"] benchmark takes 1.65x time #183 (comment). The overhead is larger (~ 2.6x) in my laptop.

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 02:44
    • Baseline: 9 Mar 2020 - 02:48
  • Package commits:
    • Target: 59222b
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["dot", "rf"] 0.89 (5%) ✅ 1.00 (1%)
["filter_map_map!", "man"] 0.93 (5%) ✅ 1.00 (1%)
["filter_map_map!", "xf"] 1.08 (5%) ❌ 1.00 (1%)
["findall", "xf-iter"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "2"] 0.89 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "32"] 0.88 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "8"] 0.93 (5%) ✅ 1.00 (1%)
["gemm", "mul", "linalg", "256"] 0.89 (5%) ✅ 1.00 (1%)
["gemm", "mul", "linalg", "32"] 0.90 (5%) ✅ 1.00 (1%)
["gemm", "mul", "linalg", "8"] 0.92 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "false", "256"] 1.11 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "256"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "256"] 1.12 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "256"] 1.10 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "32"] 0.93 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "256"] 1.05 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 0.82 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 0.86 (5%) ✅ 1.00 (1%)
["partition_by", "man"] 0.90 (5%) ✅ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      39731 s          0 s        970 s      15027 s          0 s
       #2  2095 MHz      13370 s          0 s       1371 s      41501 s          0 s
       
  Memory: 6.782737731933594 GB (3528.09375 MB free)
  Uptime: 574.0 sec
  Load Avg:  1.00830078125  0.935546875  0.54248046875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      48910 s          0 s       1150 s      29237 s          0 s
       #2  2095 MHz      27871 s          0 s       1437 s      50513 s          0 s
       
  Memory: 6.782737731933594 GB (3520.96484375 MB free)
  Uptime: 810.0 sec
  Load Avg:  1.04638671875  1.009765625  0.66796875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:44
  • Package commit: 59222b
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 229.907 μs (5%)
["cat", "xf"] 1.890 μs (5%)
["collect", "filter-missing"] 103.603 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 74.502 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 370.611 μs (5%) 285.19 KiB (1%) 6688
["dot", "blas"] 1.330 μs (5%)
["dot", "man"] 1.300 μs (5%)
["dot", "rf"] 2.133 μs (5%)
["dot", "xf"] 2.378 μs (5%)
["filter_map_map!", "man"] 58.704 μs (5%)
["filter_map_map!", "xf"] 71.105 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.507 μs (5%)
["filter_map_reduce", "xf"] 255.908 μs (5%)
["findall", "base"] 878.929 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 678.122 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 798.025 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 4.713 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.460 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.465 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.732 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.960 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 552.741 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.395 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.344 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 887.025 μs (5%)
["gemm", "mul", "linalg", "32"] 3.325 μs (5%)
["gemm", "mul", "linalg", "8"] 275.725 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.902 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.134 μs (5%)
["gemm", "mul", "man", "false", "8"] 487.733 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.870 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.272 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 429.181 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.916 ms (5%)
["gemm", "mul", "man", "true", "32"] 5.651 μs (5%)
["gemm", "mul", "man", "true", "8"] 492.354 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.898 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 4.843 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 491.841 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.860 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.114 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 408.540 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.808 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.167 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 429.161 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.856 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.856 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.630 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.510 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.250 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.520 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.660 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 238.807 μs (5%) 74.11 KiB (1%) 3867
["missing_dot", "xf_nota"] 239.108 μs (5%) 74.05 KiB (1%) 3864
["partition_by", "man"] 1.990 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.046 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      39731 s          0 s        970 s      15027 s          0 s
       #2  2095 MHz      13370 s          0 s       1371 s      41501 s          0 s
       
  Memory: 6.782737731933594 GB (3528.09375 MB free)
  Uptime: 574.0 sec
  Load Avg:  1.00830078125  0.935546875  0.54248046875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:48
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 224.311 μs (5%)
["cat", "xf"] 1.890 μs (5%)
["collect", "filter-missing"] 103.605 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 74.303 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 366.618 μs (5%) 285.78 KiB (1%) 6670
["dot", "blas"] 1.330 μs (5%)
["dot", "man"] 1.330 μs (5%)
["dot", "rf"] 2.389 μs (5%)
["dot", "xf"] 2.389 μs (5%)
["filter_map_map!", "man"] 63.003 μs (5%)
["filter_map_map!", "xf"] 65.603 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.713 μs (5%)
["filter_map_reduce", "xf"] 255.813 μs (5%)
["findall", "base"] 870.049 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 674.637 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 846.946 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 4.669 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.485 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.594 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.596 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.065 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 619.125 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 9.549 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.510 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 994.046 μs (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.721 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.400 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.747 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.300 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 400.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.712 ms (5%)
["gemm", "mul", "man", "true", "32"] 5.700 μs (5%)
["gemm", "mul", "man", "true", "8"] 500.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.733 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.200 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.763 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.300 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.730 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.100 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.833 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.867 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.640 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.510 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.250 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.520 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.640 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 239.011 μs (5%) 74.11 KiB (1%) 3867
["missing_dot", "xf_nota"] 229.312 μs (5%) 74.14 KiB (1%) 3870
["partition_by", "man"] 2.215 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.100 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      48910 s          0 s       1150 s      29237 s          0 s
       #2  2095 MHz      27871 s          0 s       1437 s      50513 s          0 s
       
  Memory: 6.782737731933594 GB (3520.96484375 MB free)
  Uptime: 810.0 sec
  Load Avg:  1.04638671875  1.009765625  0.66796875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.198
BogoMIPS:            4190.39
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 02:50
    • Baseline: 9 Mar 2020 - 02:54
  • Package commits:
    • Target: f639a4
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["cat", "base"] 1.08 (5%) ❌ 1.00 (1%)
["cat", "xf"] 1.10 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "false", "8"] 0.93 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 1.23 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "32"] 0.84 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "true", "8"] 1.37 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "8"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 1.14 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "true", "32"] 0.90 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 0.92 (5%) ✅ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      40916 s          0 s       1073 s      17597 s          0 s
       #2  2294 MHz      12881 s          0 s       1120 s      45548 s          0 s
       
  Memory: 6.782737731933594 GB (3470.1171875 MB free)
  Uptime: 611.0 sec
  Load Avg:  1.11376953125  0.94873046875  0.54150390625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      45569 s          0 s       1157 s      36853 s          0 s
       #2  2294 MHz      32250 s          0 s       1236 s      50019 s          0 s
       
  Memory: 6.782737731933594 GB (3457.984375 MB free)
  Uptime: 852.0 sec
  Load Avg:  1.0615234375  1.00634765625  0.6669921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:50
  • Package commit: f639a4
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 222.599 μs (5%)
["cat", "xf"] 1.600 μs (5%)
["collect", "filter-missing"] 82.797 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 68.599 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 301.898 μs (5%) 285.71 KiB (1%) 6677
["dot", "blas"] 2.289 μs (5%)
["dot", "man"] 2.289 μs (5%)
["dot", "rf"] 2.655 μs (5%)
["dot", "xf"] 2.667 μs (5%)
["filter_map_map!", "man"] 68.298 μs (5%)
["filter_map_map!", "xf"] 69.898 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 194.898 μs (5%)
["filter_map_reduce", "xf"] 194.898 μs (5%)
["findall", "base"] 744.093 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 551.795 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 764.593 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.211 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.494 ms (5%)
["gemm", "fusedmul", "blas", "32"] 7.338 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.768 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.997 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 603.387 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 9.866 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.420 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.214 ms (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 285.054 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.387 ms (5%)
["gemm", "mul", "man", "false", "32"] 6.960 μs (5%)
["gemm", "mul", "man", "false", "8"] 370.287 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.292 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 6.240 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 368.106 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.374 ms (5%)
["gemm", "mul", "man", "true", "32"] 7.150 μs (5%)
["gemm", "mul", "man", "true", "8"] 408.485 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.368 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 6.840 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 422.095 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.337 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 5.733 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 341.223 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 4.372 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 6.580 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 367.140 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 926.923 ns (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.167 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.189 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.330 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 874.576 ns (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.057 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 854.653 ns (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.390 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 189.499 μs (5%) 74.08 KiB (1%) 3864
["missing_dot", "xf_nota"] 188.198 μs (5%) 73.94 KiB (1%) 3863
["partition_by", "man"] 1.627 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.578 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      40916 s          0 s       1073 s      17597 s          0 s
       #2  2294 MHz      12881 s          0 s       1120 s      45548 s          0 s
       
  Memory: 6.782737731933594 GB (3470.1171875 MB free)
  Uptime: 611.0 sec
  Load Avg:  1.11376953125  0.94873046875  0.54150390625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:54
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 205.597 μs (5%)
["cat", "xf"] 1.460 μs (5%)
["collect", "filter-missing"] 85.699 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 66.299 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 300.996 μs (5%) 285.17 KiB (1%) 6679
["dot", "blas"] 2.278 μs (5%)
["dot", "man"] 2.278 μs (5%)
["dot", "rf"] 2.667 μs (5%)
["dot", "xf"] 2.667 μs (5%)
["filter_map_map!", "man"] 67.000 μs (5%)
["filter_map_map!", "xf"] 69.699 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 194.896 μs (5%)
["filter_map_reduce", "xf"] 194.896 μs (5%)
["findall", "base"] 770.986 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 569.290 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 750.088 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.083 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.587 ms (5%)
["gemm", "fusedmul", "blas", "32"] 7.559 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.815 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.859 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 602.392 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 9.810 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.466 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.212 ms (5%)
["gemm", "mul", "linalg", "32"] 3.799 μs (5%)
["gemm", "mul", "linalg", "8"] 299.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.417 ms (5%)
["gemm", "mul", "man", "false", "32"] 7.299 μs (5%)
["gemm", "mul", "man", "false", "8"] 399.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.376 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 6.300 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.506 ms (5%)
["gemm", "mul", "man", "true", "32"] 8.499 μs (5%)
["gemm", "mul", "man", "true", "8"] 299.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.473 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 6.699 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 399.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.267 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 5.799 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 300.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 4.496 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 7.299 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 399.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 911.500 ns (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.256 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.200 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.390 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 855.915 ns (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.043 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 875.987 ns (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.330 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 186.997 μs (5%) 74.02 KiB (1%) 3864
["missing_dot", "xf_nota"] 185.898 μs (5%) 74.05 KiB (1%) 3865
["partition_by", "man"] 1.651 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.597 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      45569 s          0 s       1157 s      36853 s          0 s
       #2  2294 MHz      32250 s          0 s       1236 s      50019 s          0 s
       
  Memory: 6.782737731933594 GB (3457.984375 MB free)
  Uptime: 852.0 sec
  Load Avg:  1.0615234375  1.00634765625  0.6669921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.683
BogoMIPS:            4589.36
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 02:51
    • Baseline: 9 Mar 2020 - 02:56
  • Package commits:
    • Target: f639a4
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["collect", "assoc", "basesize=1"] 1.27 (5%) ❌ 1.01 (1%)
["collect", "unordered", "basesize=1024"] 0.93 (5%) ✅ 0.96 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=128"] 0.45 (5%) ✅ 0.42 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=256"] 0.68 (5%) ✅ 0.57 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=512"] 0.58 (5%) ✅ 0.50 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=128"] 0.94 (5%) ✅ 0.91 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=256"] 0.86 (5%) ✅ 0.84 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=512"] 0.83 (5%) ✅ 0.71 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=128"] 0.19 (5%) ✅ 0.17 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=256"] 0.27 (5%) ✅ 0.24 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=512"] 0.29 (5%) ✅ 0.21 (1%) ✅
["overhead", "n=64"] 1.60 (5%) ❌ 1.00 (1%)
["parallel_histogram", "comm", "basesize=16384"] 1.06 (5%) ❌ 1.00 (1%)
["parallel_histogram", "comm", "basesize=4096"] 0.87 (5%) ✅ 1.00 (1%)
["words", "nthreads=2"] 0.91 (5%) ✅ 1.00 (1%)
["words", "nthreads=4"] 1.07 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      48308 s          0 s       2142 s      26585 s          0 s
       #2  2294 MHz      42680 s          0 s       2085 s      33110 s          0 s
       
  Memory: 6.782737731933594 GB (3550.0234375 MB free)
  Uptime: 792.0 sec
  Load Avg:  1.7099609375  1.51025390625  0.89990234375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      73214 s          0 s       2643 s      29071 s          0 s
       #2  2294 MHz      61478 s          0 s       2523 s      41791 s          0 s
       
  Memory: 6.782737731933594 GB (3544.33203125 MB free)
  Uptime: 1073.0 sec
  Load Avg:  1.70751953125  1.5830078125  1.0927734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:51
  • Package commit: f639a4
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 492.146 ms (5%) 5.867 ms 92.15 MiB (1%) 1761165
["collect", "assoc", "basesize=1024"] 207.171 ms (5%) 1.84 MiB (1%) 1969
["collect", "assoc", "basesize=32"] 214.297 ms (5%) 5.79 MiB (1%) 59278
["collect", "seq"] 415.443 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 517.328 ms (5%) 30.25 MiB (1%) 475054
["collect", "unordered", "basesize=1024"] 241.410 ms (5%) 807.34 KiB (1%) 4785
["collect", "unordered", "basesize=32"] 244.328 ms (5%) 1.56 MiB (1%) 22904
["findfirst", "n=1000", "foldl"] 642.396 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 315.238 ms (5%) 603.75 KiB (1%) 11840
["findfirst", "n=1000", "reduce", "basesize=256"] 313.659 ms (5%) 307.45 KiB (1%) 6044
["findfirst", "n=1000", "reduce", "basesize=512"] 328.167 ms (5%) 159.70 KiB (1%) 3143
["findfirst", "n=400", "foldl"] 468.061 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 247.978 ms (5%) 1.09 MiB (1%) 21984
["findfirst", "n=400", "reduce", "basesize=256"] 244.812 ms (5%) 563.23 KiB (1%) 11076
["findfirst", "n=400", "reduce", "basesize=512"] 243.220 ms (5%) 285.97 KiB (1%) 5636
["findfirst", "n=500", "foldl"] 79.807 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 39.876 ms (5%) 168.17 KiB (1%) 3285
["findfirst", "n=500", "reduce", "basesize=256"] 39.996 ms (5%) 90.22 KiB (1%) 1761
["findfirst", "n=500", "reduce", "basesize=512"] 42.441 ms (5%) 51.39 KiB (1%) 1001
["overhead", "n=64"] 278.297 μs (5%) 155.41 KiB (1%) 3026
["parallel_histogram", "assoc", "basesize=16384"] 4.506 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 5.498 ms (5%) 1.80 MiB (1%) 539
["parallel_histogram", "assoc", "basesize=8192"] 5.099 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 13.079 ms (5%) 1.22 MiB (1%) 329
["parallel_histogram", "comm", "basesize=4096"] 17.596 ms (5%) 1.03 MiB (1%) 4157
["parallel_histogram", "comm", "basesize=8192"] 14.810 ms (5%) 1.23 MiB (1%) 857
["parallel_histogram", "seq"] 8.188 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 15.153 ms (5%)
["sum", "random", "reduce", "basesize=128"] 8.234 ms (5%) 331.61 KiB (1%) 6849
["sum", "random", "reduce", "basesize=256"] 7.947 ms (5%) 164.17 KiB (1%) 3397
["sum", "random", "reduce", "basesize=512"] 7.763 ms (5%) 80.81 KiB (1%) 1678
["sum", "uniform", "foldl"] 15.149 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 8.125 ms (5%) 331.73 KiB (1%) 6857
["sum", "uniform", "reduce", "basesize=256"] 7.844 ms (5%) 164.28 KiB (1%) 3404
["sum", "uniform", "reduce", "basesize=512"] 7.595 ms (5%) 80.86 KiB (1%) 1681
["sum", "valley", "foldl"] 15.157 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 8.289 ms (5%) 331.61 KiB (1%) 6849
["sum", "valley", "reduce", "basesize=256"] 7.944 ms (5%) 164.14 KiB (1%) 3395
["sum", "valley", "reduce", "basesize=512"] 7.957 ms (5%) 80.81 KiB (1%) 1678
["words", "nthreads=1"] 34.653 ms (5%) 6.282 ms 64.50 MiB (1%) 2087475
["words", "nthreads=2"] 19.062 ms (5%) 64.86 MiB (1%) 2087557
["words", "nthreads=4"] 20.804 ms (5%) 65.58 MiB (1%) 2087724

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      48308 s          0 s       2142 s      26585 s          0 s
       #2  2294 MHz      42680 s          0 s       2085 s      33110 s          0 s
       
  Memory: 6.782737731933594 GB (3550.0234375 MB free)
  Uptime: 792.0 sec
  Load Avg:  1.7099609375  1.51025390625  0.89990234375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 2:56
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 388.748 ms (5%) 13.288 ms 91.55 MiB (1%) 1754587
["collect", "assoc", "basesize=1024"] 208.193 ms (5%) 1.84 MiB (1%) 1963
["collect", "assoc", "basesize=32"] 212.870 ms (5%) 5.77 MiB (1%) 59130
["collect", "seq"] 412.216 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 533.276 ms (5%) 30.25 MiB (1%) 474953
["collect", "unordered", "basesize=1024"] 260.399 ms (5%) 844.33 KiB (1%) 7152
["collect", "unordered", "basesize=32"] 247.732 ms (5%) 1.57 MiB (1%) 23273
["findfirst", "n=1000", "foldl"] 626.609 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 706.695 ms (5%) 1.39 MiB (1%) 27813
["findfirst", "n=1000", "reduce", "basesize=256"] 463.344 ms (5%) 535.08 KiB (1%) 10437
["findfirst", "n=1000", "reduce", "basesize=512"] 564.807 ms (5%) 319.95 KiB (1%) 6251
["findfirst", "n=400", "foldl"] 477.428 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 264.539 ms (5%) 1.20 MiB (1%) 24203
["findfirst", "n=400", "reduce", "basesize=256"] 283.186 ms (5%) 670.97 KiB (1%) 13218
["findfirst", "n=400", "reduce", "basesize=512"] 293.389 ms (5%) 404.47 KiB (1%) 7935
["findfirst", "n=500", "foldl"] 78.489 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 212.250 ms (5%) 1007.45 KiB (1%) 19521
["findfirst", "n=500", "reduce", "basesize=256"] 146.562 ms (5%) 375.88 KiB (1%) 7293
["findfirst", "n=500", "reduce", "basesize=512"] 146.242 ms (5%) 242.19 KiB (1%) 4666
["overhead", "n=64"] 174.097 μs (5%) 155.13 KiB (1%) 3008
["parallel_histogram", "assoc", "basesize=16384"] 4.529 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 5.254 ms (5%) 1.80 MiB (1%) 539
["parallel_histogram", "assoc", "basesize=8192"] 4.888 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 12.350 ms (5%) 1.22 MiB (1%) 320
["parallel_histogram", "comm", "basesize=4096"] 20.338 ms (5%) 1.04 MiB (1%) 4337
["parallel_histogram", "comm", "basesize=8192"] 15.447 ms (5%) 1.23 MiB (1%) 778
["parallel_histogram", "seq"] 8.271 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 15.003 ms (5%)
["sum", "random", "reduce", "basesize=128"] 7.939 ms (5%) 331.33 KiB (1%) 6831
["sum", "random", "reduce", "basesize=256"] 7.715 ms (5%) 164.08 KiB (1%) 3391
["sum", "random", "reduce", "basesize=512"] 7.597 ms (5%) 80.77 KiB (1%) 1675
["sum", "uniform", "foldl"] 14.681 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 7.828 ms (5%) 331.42 KiB (1%) 6837
["sum", "uniform", "reduce", "basesize=256"] 7.662 ms (5%) 164.13 KiB (1%) 3394
["sum", "uniform", "reduce", "basesize=512"] 7.431 ms (5%) 80.78 KiB (1%) 1676
["sum", "valley", "foldl"] 15.222 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 7.992 ms (5%) 331.30 KiB (1%) 6829
["sum", "valley", "reduce", "basesize=256"] 7.869 ms (5%) 164.11 KiB (1%) 3393
["sum", "valley", "reduce", "basesize=512"] 7.738 ms (5%) 80.78 KiB (1%) 1676
["words", "nthreads=1"] 33.914 ms (5%) 5.939 ms 64.47 MiB (1%) 2086031
["words", "nthreads=2"] 21.037 ms (5%) 64.83 MiB (1%) 2086114
["words", "nthreads=4"] 19.448 ms (5%) 65.73 MiB (1%) 2086436

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      73214 s          0 s       2643 s      29071 s          0 s
       #2  2294 MHz      61478 s          0 s       2523 s      41791 s          0 s
       
  Memory: 6.782737731933594 GB (3544.33203125 MB free)
  Uptime: 1073.0 sec
  Load Avg:  1.70751953125  1.5830078125  1.0927734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.684
BogoMIPS:            4589.36
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 06:51
    • Baseline: 9 Mar 2020 - 06:55
  • Package commits:
    • Target: c9acec
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["collect", "filter-missing"] 1.11 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "32"] 0.91 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "2"] 0.91 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "32"] 0.86 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "8"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "mul", "linalg", "8"] 0.90 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 0.85 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "true", "256"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "true", "32"] 0.95 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "false", "32"] 1.07 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "32"] 0.91 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 0.81 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 1.07 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       5342 s          0 s       1090 s      59023 s          0 s
       #2  2095 MHz      47594 s          0 s       1206 s      16904 s          0 s
       
  Memory: 6.782737731933594 GB (3422.93359375 MB free)
  Uptime: 669.0 sec
  Load Avg:  1.00146484375  0.91015625  0.52734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       6643 s          0 s       1202 s      81109 s          0 s
       #2  2095 MHz      69915 s          0 s       1332 s      18009 s          0 s
       
  Memory: 6.782737731933594 GB (3487.19921875 MB free)
  Uptime: 905.0 sec
  Load Avg:  1.0  0.966796875  0.64404296875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 6:51
  • Package commit: c9acec
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 257.315 μs (5%)
["cat", "xf"] 1.890 μs (5%)
["collect", "filter-missing"] 103.006 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 72.605 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 366.422 μs (5%) 285.56 KiB (1%) 6655
["dot", "blas"] 1.330 μs (5%)
["dot", "man"] 1.330 μs (5%)
["dot", "rf"] 2.389 μs (5%)
["dot", "xf"] 2.389 μs (5%)
["filter_map_map!", "man"] 61.602 μs (5%)
["filter_map_map!", "xf"] 61.702 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.617 μs (5%)
["filter_map_reduce", "xf"] 255.617 μs (5%)
["findall", "base"] 870.067 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 674.451 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 837.361 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 4.650 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.468 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.144 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.660 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.778 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 587.125 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 9.251 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.345 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 990.055 μs (5%)
["gemm", "mul", "linalg", "32"] 3.288 μs (5%)
["gemm", "mul", "linalg", "8"] 269.342 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.888 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.500 μs (5%)
["gemm", "mul", "man", "false", "8"] 498.995 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.856 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.400 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 423.136 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.905 ms (5%)
["gemm", "mul", "man", "true", "32"] 5.767 μs (5%)
["gemm", "mul", "man", "true", "8"] 498.487 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.887 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.117 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 487.139 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.844 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.286 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 405.520 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.886 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.284 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 427.156 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.856 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.822 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.610 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.500 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.267 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.540 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.660 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 240.415 μs (5%) 74.11 KiB (1%) 3869
["missing_dot", "xf_nota"] 238.616 μs (5%) 74.02 KiB (1%) 3864
["partition_by", "man"] 2.195 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.041 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       5342 s          0 s       1090 s      59023 s          0 s
       #2  2095 MHz      47594 s          0 s       1206 s      16904 s          0 s
       
  Memory: 6.782737731933594 GB (3422.93359375 MB free)
  Uptime: 669.0 sec
  Load Avg:  1.00146484375  0.91015625  0.52734375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 6:55
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 257.208 μs (5%)
["cat", "xf"] 1.890 μs (5%)
["collect", "filter-missing"] 92.603 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 73.402 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 356.411 μs (5%) 285.62 KiB (1%) 6674
["dot", "blas"] 1.350 μs (5%)
["dot", "man"] 1.370 μs (5%)
["dot", "rf"] 2.389 μs (5%)
["dot", "xf"] 2.389 μs (5%)
["filter_map_map!", "man"] 62.102 μs (5%)
["filter_map_map!", "xf"] 61.502 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.908 μs (5%)
["filter_map_reduce", "xf"] 255.709 μs (5%)
["findall", "base"] 886.631 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 680.324 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 862.030 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 4.697 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.543 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.750 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.735 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.553 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 642.219 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.767 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.495 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 992.931 μs (5%)
["gemm", "mul", "linalg", "32"] 3.400 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.878 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.400 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.868 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.600 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 2.032 ms (5%)
["gemm", "mul", "man", "true", "32"] 6.100 μs (5%)
["gemm", "mul", "man", "true", "8"] 500.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.876 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 4.800 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.861 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.700 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.876 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.100 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.889 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.867 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.500 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.267 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.500 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.640 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 239.808 μs (5%) 73.98 KiB (1%) 3861
["missing_dot", "xf_nota"] 229.107 μs (5%) 73.98 KiB (1%) 3863
["partition_by", "man"] 2.237 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.110 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       6643 s          0 s       1202 s      81109 s          0 s
       #2  2095 MHz      69915 s          0 s       1332 s      18009 s          0 s
       
  Memory: 6.782737731933594 GB (3487.19921875 MB free)
  Uptime: 905.0 sec
  Load Avg:  1.0  0.966796875  0.64404296875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.196
BogoMIPS:            4190.39
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 07:22
    • Baseline: 9 Mar 2020 - 07:26
  • Package commits:
    • Target: f2b017
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["filter_map_map!", "man"] 1.22 (5%) ❌ 1.00 (1%)
["filter_map_map!", "xf"] 1.08 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "blas", "32"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "fusedmul", "xf", "2"] 0.79 (5%) ✅ 1.00 (1%)
["gemm", "fusedmul", "xf", "32"] 0.87 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "false", "256"] 1.11 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "32"] 0.90 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 1.15 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "256"] 1.13 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "8"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "256"] 1.10 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "8"] 1.05 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "256"] 1.15 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 0.88 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "256"] 1.14 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 1.15 (5%) ❌ 1.00 (1%)
["missing_dot", "xf"] 1.06 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      43407 s          0 s       1121 s      12277 s          0 s
       #2  2095 MHz      10712 s          0 s       1195 s      45134 s          0 s
       
  Memory: 6.782737731933594 GB (3511.72265625 MB free)
  Uptime: 584.0 sec
  Load Avg:  1.01220703125  1.0  0.6044921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      57126 s          0 s       1200 s      22576 s          0 s
       #2  2095 MHz      21222 s          0 s       1304 s      58613 s          0 s
       
  Memory: 6.782737731933594 GB (3493.015625 MB free)
  Uptime: 826.0 sec
  Load Avg:  1.0  1.0  0.70703125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:22
  • Package commit: f2b017
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 277.839 μs (5%)
["cat", "xf"] 2.045 μs (5%)
["collect", "filter-missing"] 111.714 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 81.010 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 399.255 μs (5%) 285.33 KiB (1%) 6698
["dot", "blas"] 1.430 μs (5%)
["dot", "man"] 1.410 μs (5%)
["dot", "rf"] 2.567 μs (5%)
["dot", "xf"] 2.567 μs (5%)
["filter_map_map!", "man"] 72.905 μs (5%)
["filter_map_map!", "xf"] 72.206 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 276.142 μs (5%)
["filter_map_reduce", "xf"] 276.244 μs (5%)
["findall", "base"] 972.725 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 754.319 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 931.024 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.010 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.802 ms (5%)
["gemm", "fusedmul", "blas", "32"] 7.372 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.991 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.153 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 558.946 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.793 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.321 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.068 ms (5%)
["gemm", "mul", "linalg", "32"] 3.613 μs (5%)
["gemm", "mul", "linalg", "8"] 290.842 ns (5%)
["gemm", "mul", "man", "false", "256"] 2.066 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.817 μs (5%)
["gemm", "mul", "man", "false", "8"] 524.131 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 2.026 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.615 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 460.452 ns (5%)
["gemm", "mul", "man", "true", "256"] 2.086 ms (5%)
["gemm", "mul", "man", "true", "32"] 6.081 μs (5%)
["gemm", "mul", "man", "true", "8"] 529.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 2.056 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.551 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 525.847 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 2.195 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.858 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 438.429 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 2.118 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.901 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 459.949 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 3.213 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 3.063 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 3.063 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.740 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.684 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.660 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.780 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 279.640 μs (5%) 74.11 KiB (1%) 3864
["missing_dot", "xf_nota"] 255.638 μs (5%) 74.02 KiB (1%) 3864
["partition_by", "man"] 2.366 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.201 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      43407 s          0 s       1121 s      12277 s          0 s
       #2  2095 MHz      10712 s          0 s       1195 s      45134 s          0 s
       
  Memory: 6.782737731933594 GB (3511.72265625 MB free)
  Uptime: 584.0 sec
  Load Avg:  1.01220703125  1.0  0.6044921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:26
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 277.815 μs (5%)
["cat", "xf"] 2.044 μs (5%)
["collect", "filter-missing"] 111.706 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 82.105 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 393.221 μs (5%) 285.72 KiB (1%) 6653
["dot", "blas"] 1.430 μs (5%)
["dot", "man"] 1.440 μs (5%)
["dot", "rf"] 2.567 μs (5%)
["dot", "xf"] 2.567 μs (5%)
["filter_map_map!", "man"] 59.902 μs (5%)
["filter_map_map!", "xf"] 66.603 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 276.115 μs (5%)
["filter_map_reduce", "xf"] 276.316 μs (5%)
["findall", "base"] 972.961 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 762.746 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 941.857 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 5.076 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.759 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.945 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.162 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.092 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 705.332 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.074 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.399 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.066 ms (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.853 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.601 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.985 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 5.100 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 400.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.845 ms (5%)
["gemm", "mul", "man", "true", "32"] 6.200 μs (5%)
["gemm", "mul", "man", "true", "8"] 500.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.864 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.700 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.904 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 5.100 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.859 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.800 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 3.225 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 3.075 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 3.075 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.730 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.610 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.684 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.630 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.760 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 263.014 μs (5%) 73.95 KiB (1%) 3863
["missing_dot", "xf_nota"] 255.214 μs (5%) 73.95 KiB (1%) 3863
["partition_by", "man"] 2.370 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.263 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      57126 s          0 s       1200 s      22576 s          0 s
       #2  2095 MHz      21222 s          0 s       1304 s      58613 s          0 s
       
  Memory: 6.782737731933594 GB (3493.015625 MB free)
  Uptime: 826.0 sec
  Load Avg:  1.0  1.0  0.70703125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.248
BogoMIPS:            4190.49
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 07:30
    • Baseline: 9 Mar 2020 - 07:34
  • Package commits:
    • Target: b7701a
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["cat", "xf"] 0.86 (5%) ✅ 1.00 (1%)
["filter_map_map!", "man"] 0.94 (5%) ✅ 1.00 (1%)
["gemm", "mul", "man", "false", "8"] 1.23 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "32"] 1.12 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "ivdep", "8"] 1.06 (5%) ❌ 1.00 (1%)
["gemm", "mul", "man", "true", "8"] 1.22 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "false", "8"] 1.25 (5%) ❌ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "32"] 0.93 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "ivdep", "8"] 0.76 (5%) ✅ 1.00 (1%)
["gemm", "mul", "xf", "true", "8"] 1.08 (5%) ❌ 1.00 (1%)
["missing_argmax", "man"] 1.16 (5%) ❌ 1.00 (1%)
["missing_argmax", "rf"] 1.16 (5%) ❌ 1.00 (1%)
["missing_argmax", "xf"] 1.17 (5%) ❌ 1.00 (1%)
["partition_by", "man"] 0.88 (5%) ✅ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      37743 s          0 s       1108 s      14854 s          0 s
       #2  2095 MHz      12706 s          0 s       1156 s      40290 s          0 s
       
  Memory: 6.782737731933594 GB (3548.20703125 MB free)
  Uptime: 552.0 sec
  Load Avg:  1.01708984375  0.90625  0.50244140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      52176 s          0 s       1170 s      22742 s          0 s
       #2  2095 MHz      20694 s          0 s       1358 s      54465 s          0 s
       
  Memory: 6.782737731933594 GB (3470.67578125 MB free)
  Uptime: 776.0 sec
  Load Avg:  1.009765625  0.98095703125  0.62939453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:30
  • Package commit: b7701a
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 192.906 μs (5%)
["cat", "xf"] 1.420 μs (5%)
["collect", "filter-missing"] 78.503 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 57.402 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 277.508 μs (5%) 285.52 KiB (1%) 6685
["dot", "blas"] 1.030 μs (5%)
["dot", "man"] 1.060 μs (5%)
["dot", "rf"] 1.800 μs (5%)
["dot", "xf"] 1.790 μs (5%)
["filter_map_map!", "man"] 62.405 μs (5%)
["filter_map_map!", "xf"] 67.306 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 191.706 μs (5%)
["filter_map_reduce", "xf"] 191.707 μs (5%)
["findall", "base"] 685.424 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 516.417 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 673.523 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 3.783 ms (5%)
["gemm", "fusedmul", "blas", "2"] 2.894 ms (5%)
["gemm", "fusedmul", "blas", "32"] 5.211 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.049 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.599 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 621.952 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.034 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.402 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 757.023 μs (5%)
["gemm", "mul", "linalg", "32"] 2.511 μs (5%)
["gemm", "mul", "linalg", "8"] 206.040 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.365 ms (5%)
["gemm", "mul", "man", "false", "32"] 3.972 μs (5%)
["gemm", "mul", "man", "false", "8"] 368.303 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.378 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 3.700 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 317.505 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.367 ms (5%)
["gemm", "mul", "man", "true", "32"] 4.243 μs (5%)
["gemm", "mul", "man", "true", "8"] 367.347 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.365 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 3.729 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 375.377 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.380 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 3.163 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 304.250 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.366 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 4.013 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 322.658 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.589 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.467 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.489 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.210 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.130 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 3.925 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.150 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.250 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 180.906 μs (5%) 74.02 KiB (1%) 3866
["missing_dot", "xf_nota"] 176.406 μs (5%) 74.08 KiB (1%) 3867
["partition_by", "man"] 1.646 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.531 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      37743 s          0 s       1108 s      14854 s          0 s
       #2  2095 MHz      12706 s          0 s       1156 s      40290 s          0 s
       
  Memory: 6.782737731933594 GB (3548.20703125 MB free)
  Uptime: 552.0 sec
  Load Avg:  1.01708984375  0.90625  0.50244140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:34
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 192.910 μs (5%)
["cat", "xf"] 1.650 μs (5%)
["collect", "filter-missing"] 78.605 μs (5%) 33.05 KiB (1%) 20
["collect", "identity-float"] 55.903 μs (5%) 256.91 KiB (1%) 20
["collect", "identity-union"] 271.815 μs (5%) 285.41 KiB (1%) 6672
["dot", "blas"] 1.040 μs (5%)
["dot", "man"] 1.030 μs (5%)
["dot", "rf"] 1.790 μs (5%)
["dot", "xf"] 1.790 μs (5%)
["filter_map_map!", "man"] 66.503 μs (5%)
["filter_map_map!", "xf"] 70.803 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 191.911 μs (5%)
["filter_map_reduce", "xf"] 191.811 μs (5%)
["findall", "base"] 667.742 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 507.731 μs (5%) 3.05 MiB (1%) 100015
["findall", "xf-iter"] 651.540 μs (5%) 2.00 MiB (1%) 29
["gemm", "fusedmul", "blas", "16"] 3.805 ms (5%)
["gemm", "fusedmul", "blas", "2"] 2.906 ms (5%)
["gemm", "fusedmul", "blas", "32"] 5.247 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.045 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.743 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 621.028 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 9.698 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.526 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 756.939 μs (5%)
["gemm", "mul", "linalg", "32"] 2.500 μs (5%)
["gemm", "mul", "linalg", "8"] 200.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.335 ms (5%)
["gemm", "mul", "man", "false", "32"] 4.000 μs (5%)
["gemm", "mul", "man", "false", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.355 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 3.300 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.332 ms (5%)
["gemm", "mul", "man", "true", "32"] 4.300 μs (5%)
["gemm", "mul", "man", "true", "8"] 300.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.339 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 3.800 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 300.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.367 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 3.400 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.339 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 4.000 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 300.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.233 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.133 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.122 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.200 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.130 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 3.925 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.130 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.230 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 178.410 μs (5%) 74.17 KiB (1%) 3869
["missing_dot", "xf_nota"] 172.909 μs (5%) 74.05 KiB (1%) 3864
["partition_by", "man"] 1.869 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.573 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      52176 s          0 s       1170 s      22742 s          0 s
       #2  2095 MHz      20694 s          0 s       1358 s      54465 s          0 s
       
  Memory: 6.782737731933594 GB (3470.67578125 MB free)
  Uptime: 776.0 sec
  Load Avg:  1.009765625  0.98095703125  0.62939453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.194
BogoMIPS:            4190.38
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Copy link
Contributor

github-actions bot commented Mar 9, 2020

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 9 Mar 2020 - 07:31
    • Baseline: 9 Mar 2020 - 07:36
  • Package commits:
    • Target: b7701a
    • Baseline: ed0911
  • Julia commits:
    • Target: 2d5741
    • Baseline: 2d5741
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio
["collect", "assoc", "basesize=1"] 1.33 (5%) ❌ 1.01 (1%)
["collect", "assoc", "basesize=32"] 1.05 (5%) ❌ 1.00 (1%)
["collect", "unordered", "basesize=1024"] 1.21 (5%) ❌ 1.08 (1%) ❌
["collect", "unordered", "basesize=32"] 1.00 (5%) 1.01 (1%) ❌
["findfirst", "n=1000", "foldl"] 1.06 (5%) ❌ 1.00 (1%)
["findfirst", "n=1000", "reduce", "basesize=128"] 0.50 (5%) ✅ 0.45 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=256"] 0.64 (5%) ✅ 0.52 (1%) ✅
["findfirst", "n=1000", "reduce", "basesize=512"] 0.52 (5%) ✅ 0.41 (1%) ✅
["findfirst", "n=400", "foldl"] 1.07 (5%) ❌ 1.00 (1%)
["findfirst", "n=400", "reduce", "basesize=128"] 0.97 (5%) 0.91 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=256"] 0.96 (5%) 0.89 (1%) ✅
["findfirst", "n=400", "reduce", "basesize=512"] 0.93 (5%) ✅ 0.71 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=128"] 0.20 (5%) ✅ 0.18 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=256"] 0.28 (5%) ✅ 0.20 (1%) ✅
["findfirst", "n=500", "reduce", "basesize=512"] 0.16 (5%) ✅ 0.13 (1%) ✅
["parallel_histogram", "assoc", "basesize=16384"] 1.08 (5%) ❌ 1.00 (1%)
["parallel_histogram", "assoc", "basesize=4096"] 1.14 (5%) ❌ 0.87 (1%) ✅
["parallel_histogram", "assoc", "basesize=8192"] 1.11 (5%) ❌ 1.00 (1%)
["parallel_histogram", "comm", "basesize=4096"] 0.94 (5%) ✅ 0.97 (1%) ✅
["parallel_histogram", "comm", "basesize=8192"] 1.11 (5%) ❌ 1.01 (1%) ❌
["sum", "uniform", "reduce", "basesize=128"] 1.09 (5%) ❌ 1.01 (1%)
["words", "nthreads=1"] 1.07 (5%) ❌ 1.00 (1%)
["words", "nthreads=2"] 1.13 (5%) ❌ 1.00 (1%)
["words", "nthreads=4"] 1.19 (5%) ❌ 1.00 (1%)

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      44480 s          0 s       2017 s      15704 s          0 s
       #2  2095 MHz      46771 s          0 s       2169 s      13638 s          0 s
       
  Memory: 6.7827301025390625 GB (3563.32421875 MB free)
  Uptime: 643.0 sec
  Load Avg:  1.744140625  1.533203125  0.90869140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      68139 s          0 s       2440 s      18850 s          0 s
       #2  2095 MHz      65802 s          0 s       2604 s      21340 s          0 s
       
  Memory: 6.7827301025390625 GB (3610.7265625 MB free)
  Uptime: 917.0 sec
  Load Avg:  1.66845703125  1.5712890625  1.0849609375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:31
  • Package commit: b7701a
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 475.559 ms (5%) 5.294 ms 92.16 MiB (1%) 1761504
["collect", "assoc", "basesize=1024"] 246.464 ms (5%) 1.84 MiB (1%) 1970
["collect", "assoc", "basesize=32"] 259.723 ms (5%) 5.79 MiB (1%) 59282
["collect", "seq"] 478.211 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 453.574 ms (5%) 7.280 ms 30.26 MiB (1%) 475173
["collect", "unordered", "basesize=1024"] 352.874 ms (5%) 834.58 KiB (1%) 6528
["collect", "unordered", "basesize=32"] 268.737 ms (5%) 1.52 MiB (1%) 20200
["findfirst", "n=1000", "foldl"] 693.870 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 356.568 ms (5%) 603.53 KiB (1%) 11825
["findfirst", "n=1000", "reduce", "basesize=256"] 353.502 ms (5%) 307.44 KiB (1%) 6042
["findfirst", "n=1000", "reduce", "basesize=512"] 360.907 ms (5%) 159.67 KiB (1%) 3140
["findfirst", "n=400", "foldl"] 508.233 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 273.519 ms (5%) 1.09 MiB (1%) 21900
["findfirst", "n=400", "reduce", "basesize=256"] 268.138 ms (5%) 562.72 KiB (1%) 11042
["findfirst", "n=400", "reduce", "basesize=512"] 271.459 ms (5%) 285.94 KiB (1%) 5633
["findfirst", "n=500", "foldl"] 82.568 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 44.608 ms (5%) 168.06 KiB (1%) 3277
["findfirst", "n=500", "reduce", "basesize=256"] 45.416 ms (5%) 90.14 KiB (1%) 1755
["findfirst", "n=500", "reduce", "basesize=512"] 48.586 ms (5%) 51.39 KiB (1%) 1000
["overhead", "terminatable=false"] 153.599 μs (5%) 156.02 KiB (1%) 3008
["overhead", "terminatable=true"] 271.499 μs (5%) 156.30 KiB (1%) 3026
["parallel_histogram", "assoc", "basesize=16384"] 4.326 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 5.302 ms (5%) 1.80 MiB (1%) 540
["parallel_histogram", "assoc", "basesize=8192"] 4.694 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 15.054 ms (5%) 1.22 MiB (1%) 531
["parallel_histogram", "comm", "basesize=4096"] 23.083 ms (5%) 1.06 MiB (1%) 4326
["parallel_histogram", "comm", "basesize=8192"] 23.013 ms (5%) 1.28 MiB (1%) 3961
["parallel_histogram", "seq"] 7.830 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 15.484 ms (5%)
["sum", "random", "reduce", "basesize=128"] 8.119 ms (5%) 333.34 KiB (1%) 6840
["sum", "random", "reduce", "basesize=256"] 7.759 ms (5%) 164.98 KiB (1%) 3392
["sum", "random", "reduce", "basesize=512"] 8.167 ms (5%) 81.25 KiB (1%) 1680
["sum", "uniform", "foldl"] 14.750 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 8.075 ms (5%) 333.50 KiB (1%) 6850
["sum", "uniform", "reduce", "basesize=256"] 7.691 ms (5%) 165.20 KiB (1%) 3406
["sum", "uniform", "reduce", "basesize=512"] 7.545 ms (5%) 81.19 KiB (1%) 1676
["sum", "valley", "foldl"] 15.415 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 8.336 ms (5%) 333.39 KiB (1%) 6843
["sum", "valley", "reduce", "basesize=256"] 7.978 ms (5%) 165.05 KiB (1%) 3396
["sum", "valley", "reduce", "basesize=512"] 8.158 ms (5%) 81.23 KiB (1%) 1679
["words", "nthreads=1"] 36.364 ms (5%) 6.403 ms 64.81 MiB (1%) 2096902
["words", "nthreads=2"] 21.548 ms (5%) 65.52 MiB (1%) 2097068
["words", "nthreads=4"] 23.590 ms (5%) 66.16 MiB (1%) 2097382

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      44480 s          0 s       2017 s      15704 s          0 s
       #2  2095 MHz      46771 s          0 s       2169 s      13638 s          0 s
       
  Memory: 6.7827301025390625 GB (3563.32421875 MB free)
  Uptime: 643.0 sec
  Load Avg:  1.744140625  1.533203125  0.90869140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 9 Mar 2020 - 7:36
  • Package commit: ed0911
  • Julia commit: 2d5741
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 358.209 ms (5%) 9.459 ms 91.55 MiB (1%) 1754645
["collect", "assoc", "basesize=1024"] 240.286 ms (5%) 1.84 MiB (1%) 1968
["collect", "assoc", "basesize=32"] 246.764 ms (5%) 5.77 MiB (1%) 59133
["collect", "seq"] 461.289 ms (5%) 513.00 KiB (1%) 23
["collect", "unordered", "basesize=1"] 433.698 ms (5%) 7.025 ms 30.23 MiB (1%) 473310
["collect", "unordered", "basesize=1024"] 291.930 ms (5%) 772.91 KiB (1%) 2536
["collect", "unordered", "basesize=32"] 267.849 ms (5%) 1.51 MiB (1%) 19194
["findfirst", "n=1000", "foldl"] 653.449 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 707.772 ms (5%) 1.32 MiB (1%) 26379
["findfirst", "n=1000", "reduce", "basesize=256"] 553.054 ms (5%) 591.78 KiB (1%) 11562
["findfirst", "n=1000", "reduce", "basesize=512"] 690.838 ms (5%) 388.67 KiB (1%) 7595
["findfirst", "n=400", "foldl"] 475.780 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 281.609 ms (5%) 1.20 MiB (1%) 24252
["findfirst", "n=400", "reduce", "basesize=256"] 278.637 ms (5%) 629.97 KiB (1%) 12410
["findfirst", "n=400", "reduce", "basesize=512"] 292.119 ms (5%) 404.52 KiB (1%) 7939
["findfirst", "n=500", "foldl"] 80.603 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 227.458 ms (5%) 913.48 KiB (1%) 17690
["findfirst", "n=500", "reduce", "basesize=256"] 162.182 ms (5%) 446.78 KiB (1%) 8650
["findfirst", "n=500", "reduce", "basesize=512"] 303.974 ms (5%) 387.64 KiB (1%) 7538
["overhead", "n=64"] 164.700 μs (5%) 155.09 KiB (1%) 3006
["parallel_histogram", "assoc", "basesize=16384"] 3.994 ms (5%) 732.22 KiB (1%) 109
["parallel_histogram", "assoc", "basesize=4096"] 4.664 ms (5%) 2.07 MiB (1%) 546
["parallel_histogram", "assoc", "basesize=8192"] 4.225 ms (5%) 1.43 MiB (1%) 260
["parallel_histogram", "comm", "basesize=16384"] 14.955 ms (5%) 1.22 MiB (1%) 157
["parallel_histogram", "comm", "basesize=4096"] 24.578 ms (5%) 1.09 MiB (1%) 7680
["parallel_histogram", "comm", "basesize=8192"] 20.712 ms (5%) 2.469 ms 1.26 MiB (1%) 3011
["parallel_histogram", "seq"] 7.563 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 14.989 ms (5%)
["sum", "random", "reduce", "basesize=128"] 7.956 ms (5%) 331.34 KiB (1%) 6832
["sum", "random", "reduce", "basesize=256"] 7.810 ms (5%) 164.14 KiB (1%) 3395
["sum", "random", "reduce", "basesize=512"] 7.858 ms (5%) 80.75 KiB (1%) 1674
["sum", "uniform", "foldl"] 14.537 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 7.428 ms (5%) 331.39 KiB (1%) 6835
["sum", "uniform", "reduce", "basesize=256"] 7.466 ms (5%) 164.09 KiB (1%) 3392
["sum", "uniform", "reduce", "basesize=512"] 7.318 ms (5%) 80.75 KiB (1%) 1674
["sum", "valley", "foldl"] 15.300 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 7.993 ms (5%) 331.30 KiB (1%) 6829
["sum", "valley", "reduce", "basesize=256"] 7.883 ms (5%) 164.08 KiB (1%) 3391
["sum", "valley", "reduce", "basesize=512"] 7.834 ms (5%) 80.77 KiB (1%) 1675
["words", "nthreads=1"] 33.849 ms (5%) 6.009 ms 64.63 MiB (1%) 2091470
["words", "nthreads=2"] 19.126 ms (5%) 65.35 MiB (1%) 2091636
["words", "nthreads=4"] 19.902 ms (5%) 65.99 MiB (1%) 2091953

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.3.1
Commit 2d5741174c (2019-12-30 21:36 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1032-azure #34-Ubuntu SMP Mon Feb 10 19:37:25 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      68139 s          0 s       2440 s      18850 s          0 s
       #2  2095 MHz      65802 s          0 s       2604 s      21340 s          0 s
       
  Memory: 6.7827301025390625 GB (3610.7265625 MB free)
  Uptime: 917.0 sec
  Load Avg:  1.66845703125  1.5712890625  1.0849609375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-6.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.075
BogoMIPS:            4190.15
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@mergify mergify bot merged commit 9f45c4b into master Mar 9, 2020
@delete-merged-branch delete-merged-branch bot deleted the order-reduce-tasks branch March 9, 2020 07:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants