Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New communication strategy: AllToAll #260

Merged
merged 27 commits into from
Jul 9, 2024
Merged

Conversation

mtsch
Copy link
Collaborator

@mtsch mtsch commented Jun 7, 2024

changes

  • New communication strategy for MPI: AllToAll which is more efficient with large numbers of nodes. This is now the default.
  • AllToAll and PointToPoint now take an optional argument report which reports communication timings if set.

mtsch added 11 commits April 9, 2024 09:59
…to feature/alltoall
@coveralls
Copy link

coveralls commented Jun 7, 2024

Pull Request Test Coverage Report for Build 9411100570

Details

  • 20 of 95 (21.05%) changed or added relevant lines in 3 files are covered.
  • 1 unchanged line in 1 file lost coverage.
  • Overall coverage decreased (-1.0%) to 94.106%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 16 91 17.58%
Files with Coverage Reduction New Missed Lines %
src/BitStringAddresses/sortedparticlelist.jl 1 89.03%
Totals Coverage Status
Change from base Build 9164759738: -1.0%
Covered Lines: 6706
Relevant Lines: 7126

💛 - Coveralls

Copy link
Contributor

github-actions bot commented Jun 7, 2024

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 7 Jun 2024 - 03:32
  • Baseline: 7 Jun 2024 - 03:42
  • Package commits:
  • Target: 4ceba5
  • Baseline: cb0662
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1021-azure #22~22.04.1-Ubuntu SMP Tue Apr 30 16:08:18 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3243 MHz 2841 s 0 s 182 s 5536 s 0 s
 #2 3246 MHz 2720 s 0 s 175 s 5667 s 0 s
 #3 3241 MHz 3063 s 0 s 186 s 5320 s 0 s
 #4 3242 MHz 1540 s 0 s 201 s 6813 s 0 s
 Memory: 15.606498718261719 GB (13160.12109375 MB free)
 Uptime: 860.31 sec
 Load Avg: 1.04 1.08 0.8
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1021-azure #22~22.04.1-Ubuntu SMP Tue Apr 30 16:08:18 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2765 MHz 4570 s 0 s 229 s 9952 s 0 s
 #2 2680 MHz 4188 s 0 s 227 s 10338 s 0 s
 #3 3248 MHz 4713 s 0 s 239 s 9810 s 0 s
 #4 3243 MHz 2899 s 0 s 267 s 11581 s 0 s
 Memory: 15.606498718261719 GB (12945.5625 MB free)
 Uptime: 1480.64 sec
 Load Avg: 1.04 1.05 0.94
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 7 Jun 2024 - 3:32
  • Package commit: 4ceba5
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.868 s (5%) 2.152 ms 93.71 MiB (1%) 24005
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.345 s (5%) 17.743 ms 367.02 MiB (1%) 2038351
["Rimu", "Exact", "Multiplication", "Momentum space"] 849.220 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.807 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.652 s (5%) 10.564 ms 59.71 MiB (1%) 1900087
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.355 s (5%) 32.03 MiB (1%) 824125
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.898 s (5%) 29.24 MiB (1%) 132000

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1021-azure #22~22.04.1-Ubuntu SMP Tue Apr 30 16:08:18 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3243 MHz 2841 s 0 s 182 s 5536 s 0 s
 #2 3246 MHz 2720 s 0 s 175 s 5667 s 0 s
 #3 3241 MHz 3063 s 0 s 186 s 5320 s 0 s
 #4 3242 MHz 1540 s 0 s 201 s 6813 s 0 s
 Memory: 15.606498718261719 GB (13160.12109375 MB free)
 Uptime: 860.31 sec
 Load Avg: 1.04 1.08 0.8
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 7 Jun 2024 - 3:42
  • Package commit: cb0662
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.879 s (5%) 2.179 ms 93.71 MiB (1%) 24005
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.640 s (5%) 274.987 ms 367.02 MiB (1%) 2038351
["Rimu", "Exact", "Multiplication", "Momentum space"] 844.825 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.787 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.733 s (5%) 6.215 ms 59.71 MiB (1%) 1900028
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.363 s (5%) 32.03 MiB (1%) 824159
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.205 s (5%) 29.24 MiB (1%) 132005

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1021-azure #22~22.04.1-Ubuntu SMP Tue Apr 30 16:08:18 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2765 MHz 4570 s 0 s 229 s 9952 s 0 s
 #2 2680 MHz 4188 s 0 s 227 s 10338 s 0 s
 #3 3248 MHz 4713 s 0 s 239 s 9810 s 0 s
 #4 3243 MHz 2899 s 0 s 267 s 11581 s 0 s
 Memory: 15.606498718261719 GB (12945.5625 MB free)
 Uptime: 1480.64 sec
 Load Avg: 1.04 1.05 0.94
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.85
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB (2 instances)
L1i cache: 64 KiB (2 instances)
L2 cache: 1 MiB (2 instances)
L3 cache: 32 MiB (1 instance)
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Vulnerable: Safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Cpu Property Value
Brand AMD EPYC 7763 64-Core Processor
Vendor :AMD
Architecture :Unknown
Model Family: 0xaf, Model: 0x01, Stepping: 0x01, Type: 0x00
Cores 16 physical cores, 16 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 512, 32768) kbytes
64 byte cache line size
Address Size 48 bits virtual, 48 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC runs at constant rate (invariant from clock frequency)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft
Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 24 Jun 2024 - 03:30
  • Baseline: 24 Jun 2024 - 03:41
  • Package commits:
  • Target: dbc453
  • Baseline: bcaae8
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3269 MHz 2779 s 0 s 186 s 6389 s 0 s
 #2 3244 MHz 2669 s 0 s 176 s 6519 s 0 s
 #3 2445 MHz 2323 s 0 s 189 s 6830 s 0 s
 #4 2607 MHz 2193 s 0 s 185 s 6983 s 0 s
 Memory: 15.606491088867188 GB (13328.31640625 MB free)
 Uptime: 939.54 sec
 Load Avg: 1.07 1.08 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3243 MHz 4318 s 0 s 231 s 10945 s 0 s
 #2 3231 MHz 3451 s 0 s 238 s 11813 s 0 s
 #3 2702 MHz 4316 s 0 s 232 s 10935 s 0 s
 #4 2614 MHz 4031 s 0 s 241 s 11231 s 0 s
 Memory: 15.606491088867188 GB (13035.9453125 MB free)
 Uptime: 1554.53 sec
 Load Avg: 1.0 1.02 0.93
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 24 Jun 2024 - 3:30
  • Package commit: dbc453
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.863 s (5%) 2.289 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.514 s (5%) 17.006 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 829.495 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.808 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.654 s (5%) 6.188 ms 59.71 MiB (1%) 1900186
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.625 s (5%) 20.052 ms 32.03 MiB (1%) 824127
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.154 s (5%) 29.24 MiB (1%) 132010

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3269 MHz 2779 s 0 s 186 s 6389 s 0 s
 #2 3244 MHz 2669 s 0 s 176 s 6519 s 0 s
 #3 2445 MHz 2323 s 0 s 189 s 6830 s 0 s
 #4 2607 MHz 2193 s 0 s 185 s 6983 s 0 s
 Memory: 15.606491088867188 GB (13328.31640625 MB free)
 Uptime: 939.54 sec
 Load Avg: 1.07 1.08 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 24 Jun 2024 - 3:41
  • Package commit: bcaae8
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.861 s (5%) 2.752 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.749 s (5%) 246.575 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 819.023 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.812 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.582 s (5%) 9.825 ms 59.71 MiB (1%) 1900093
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.447 s (5%) 11.296 ms 32.03 MiB (1%) 824133
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.039 s (5%) 29.24 MiB (1%) 132025

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3243 MHz 4318 s 0 s 231 s 10945 s 0 s
 #2 3231 MHz 3451 s 0 s 238 s 11813 s 0 s
 #3 2702 MHz 4316 s 0 s 232 s 10935 s 0 s
 #4 2614 MHz 4031 s 0 s 241 s 11231 s 0 s
 Memory: 15.606491088867188 GB (13035.9453125 MB free)
 Uptime: 1554.53 sec
 Load Avg: 1.0 1.02 0.93
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.86
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB (2 instances)
L1i cache: 64 KiB (2 instances)
L2 cache: 1 MiB (2 instances)
L3 cache: 32 MiB (1 instance)
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Vulnerable: Safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Cpu Property Value
Brand AMD EPYC 7763 64-Core Processor
Vendor :AMD
Architecture :Unknown
Model Family: 0xaf, Model: 0x01, Stepping: 0x01, Type: 0x00
Cores 16 physical cores, 16 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 512, 32768) kbytes
64 byte cache line size
Address Size 48 bits virtual, 48 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC runs at constant rate (invariant from clock frequency)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft
Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 24 Jun 2024 - 03:35
  • Baseline: 24 Jun 2024 - 03:45
  • Package commits:
  • Target: 7447d9
  • Baseline: bcaae8
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3241 MHz 1589 s 0 s 155 s 8010 s 0 s
 #2 3256 MHz 1934 s 0 s 172 s 7666 s 0 s
 #3 2755 MHz 2783 s 0 s 166 s 6811 s 0 s
 #4 2566 MHz 3380 s 0 s 154 s 6217 s 0 s
 Memory: 15.606491088867188 GB (13298.41796875 MB free)
 Uptime: 980.37 sec
 Load Avg: 1.0 1.03 0.75
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3203 MHz 3583 s 0 s 188 s 12087 s 0 s
 #2 2954 MHz 3504 s 0 s 216 s 12156 s 0 s
 #3 2445 MHz 4051 s 0 s 211 s 11602 s 0 s
 #4 3243 MHz 4664 s 0 s 199 s 10992 s 0 s
 Memory: 15.606491088867188 GB (13016.94921875 MB free)
 Uptime: 1591.69 sec
 Load Avg: 1.0 1.0 0.9
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 24 Jun 2024 - 3:35
  • Package commit: 7447d9
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.845 s (5%) 2.057 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.387 s (5%) 17.456 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 811.114 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.799 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.650 s (5%) 59.71 MiB (1%) 1900264
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.976 s (5%) 32.03 MiB (1%) 824118
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.923 s (5%) 29.24 MiB (1%) 132000

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3241 MHz 1589 s 0 s 155 s 8010 s 0 s
 #2 3256 MHz 1934 s 0 s 172 s 7666 s 0 s
 #3 2755 MHz 2783 s 0 s 166 s 6811 s 0 s
 #4 2566 MHz 3380 s 0 s 154 s 6217 s 0 s
 Memory: 15.606491088867188 GB (13298.41796875 MB free)
 Uptime: 980.37 sec
 Load Avg: 1.0 1.03 0.75
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 24 Jun 2024 - 3:45
  • Package commit: bcaae8
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.855 s (5%) 1.074 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.529 s (5%) 242.953 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 813.919 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.795 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.787 s (5%) 59.71 MiB (1%) 1900080
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.193 s (5%) 14.136 ms 32.03 MiB (1%) 824147
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.013 s (5%) 6.559 ms 29.24 MiB (1%) 132010

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3203 MHz 3583 s 0 s 188 s 12087 s 0 s
 #2 2954 MHz 3504 s 0 s 216 s 12156 s 0 s
 #3 2445 MHz 4051 s 0 s 211 s 11602 s 0 s
 #4 3243 MHz 4664 s 0 s 199 s 10992 s 0 s
 Memory: 15.606491088867188 GB (13016.94921875 MB free)
 Uptime: 1591.69 sec
 Load Avg: 1.0 1.0 0.9
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.84
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB (2 instances)
L1i cache: 64 KiB (2 instances)
L2 cache: 1 MiB (2 instances)
L3 cache: 32 MiB (1 instance)
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Vulnerable: Safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Cpu Property Value
Brand AMD EPYC 7763 64-Core Processor
Vendor :AMD
Architecture :Unknown
Model Family: 0xaf, Model: 0x01, Stepping: 0x01, Type: 0x00
Cores 16 physical cores, 16 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 512, 32768) kbytes
64 byte cache line size
Address Size 48 bits virtual, 48 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC runs at constant rate (invariant from clock frequency)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft
Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 26 Jun 2024 - 22:46
  • Baseline: 26 Jun 2024 - 22:56
  • Package commits:
  • Target: 321acb
  • Baseline: bcaae8
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3208 MHz 1360 s 0 s 121 s 6336 s 0 s
 #2 3238 MHz 1745 s 0 s 113 s 5952 s 0 s
 #3 3243 MHz 2008 s 0 s 110 s 5695 s 0 s
 #4 3235 MHz 1764 s 0 s 108 s 5946 s 0 s
 Memory: 15.606491088867188 GB (13342.49609375 MB free)
 Uptime: 785.26 sec
 Load Avg: 1.0 0.94 0.59
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2445 MHz 2620 s 0 s 169 s 11176 s 0 s
 #2 3244 MHz 4148 s 0 s 155 s 9657 s 0 s
 #3 2558 MHz 3721 s 0 s 166 s 10074 s 0 s
 #4 2594 MHz 2547 s 0 s 166 s 11254 s 0 s
 Memory: 15.606491088867188 GB (13055.21875 MB free)
 Uptime: 1401.16 sec
 Load Avg: 1.02 1.02 0.83
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 26 Jun 2024 - 22:46
  • Package commit: 321acb
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.934 s (5%) 1.205 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.716 s (5%) 258.755 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 816.445 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.801 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.766 s (5%) 10.926 ms 59.71 MiB (1%) 1900296
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.489 s (5%) 19.110 ms 32.06 MiB (1%) 824146
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.981 s (5%) 29.24 MiB (1%) 132005

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 3208 MHz 1360 s 0 s 121 s 6336 s 0 s
 #2 3238 MHz 1745 s 0 s 113 s 5952 s 0 s
 #3 3243 MHz 2008 s 0 s 110 s 5695 s 0 s
 #4 3235 MHz 1764 s 0 s 108 s 5946 s 0 s
 Memory: 15.606491088867188 GB (13342.49609375 MB free)
 Uptime: 785.26 sec
 Load Avg: 1.0 0.94 0.59
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 26 Jun 2024 - 22:56
  • Package commit: bcaae8
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.870 s (5%) 1.130 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.748 s (5%) 261.050 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 818.541 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.793 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.483 s (5%) 19.429 ms 59.71 MiB (1%) 1900194
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.445 s (5%) 32.03 MiB (1%) 824150
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.150 s (5%) 29.24 MiB (1%) 132030

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2445 MHz 2620 s 0 s 169 s 11176 s 0 s
 #2 3244 MHz 4148 s 0 s 155 s 9657 s 0 s
 #3 2558 MHz 3721 s 0 s 166 s 10074 s 0 s
 #4 2594 MHz 2547 s 0 s 166 s 11254 s 0 s
 Memory: 15.606491088867188 GB (13055.21875 MB free)
 Uptime: 1401.16 sec
 Load Avg: 1.02 1.02 0.83
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.85
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB (2 instances)
L1i cache: 64 KiB (2 instances)
L2 cache: 1 MiB (2 instances)
L3 cache: 32 MiB (1 instance)
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Vulnerable: Safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Cpu Property Value
Brand AMD EPYC 7763 64-Core Processor
Vendor :AMD
Architecture :Unknown
Model Family: 0xaf, Model: 0x01, Stepping: 0x01, Type: 0x00
Cores 16 physical cores, 16 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 512, 32768) kbytes
64 byte cache line size
Address Size 48 bits virtual, 48 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC runs at constant rate (invariant from clock frequency)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft
Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 27 Jun 2024 - 00:27
  • Baseline: 27 Jun 2024 - 00:37
  • Package commits:
  • Target: 387072
  • Baseline: bcaae8
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2990 MHz 1060 s 0 s 89 s 6672 s 0 s
 #2 2866 MHz 390 s 0 s 123 s 7307 s 0 s
 #3 2445 MHz 2660 s 0 s 92 s 5066 s 0 s
 #4 3244 MHz 2464 s 0 s 102 s 5246 s 0 s
 Memory: 15.606491088867188 GB (13366.01953125 MB free)
 Uptime: 786.1 sec
 Load Avg: 1.0 0.93 0.56
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2887 MHz 2360 s 0 s 142 s 11338 s 0 s
 #2 2445 MHz 970 s 0 s 184 s 12684 s 0 s
 #3 3242 MHz 4466 s 0 s 141 s 9234 s 0 s
 #4 3245 MHz 4811 s 0 s 150 s 8874 s 0 s
 Memory: 15.606491088867188 GB (12319.51171875 MB free)
 Uptime: 1389.16 sec
 Load Avg: 1.03 1.03 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 27 Jun 2024 - 0:27
  • Package commit: 387072
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.850 s (5%) 993.995 μs 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.601 s (5%) 242.161 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 812.469 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.792 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.483 s (5%) 10.217 ms 59.71 MiB (1%) 1900017
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.410 s (5%) 32.03 MiB (1%) 824133
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.979 s (5%) 37.679 ms 29.24 MiB (1%) 132020

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2990 MHz 1060 s 0 s 89 s 6672 s 0 s
 #2 2866 MHz 390 s 0 s 123 s 7307 s 0 s
 #3 2445 MHz 2660 s 0 s 92 s 5066 s 0 s
 #4 3244 MHz 2464 s 0 s 102 s 5246 s 0 s
 Memory: 15.606491088867188 GB (13366.01953125 MB free)
 Uptime: 786.1 sec
 Load Avg: 1.0 0.93 0.56
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 27 Jun 2024 - 0:37
  • Package commit: bcaae8
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.852 s (5%) 983.756 μs 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.724 s (5%) 238.709 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 807.749 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.779 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.588 s (5%) 59.71 MiB (1%) 1900112
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 24.240 s (5%) 32.17 MiB (1%) 824141
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.934 s (5%) 29.24 MiB (1%) 132020

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2887 MHz 2360 s 0 s 142 s 11338 s 0 s
 #2 2445 MHz 970 s 0 s 184 s 12684 s 0 s
 #3 3242 MHz 4466 s 0 s 141 s 9234 s 0 s
 #4 3245 MHz 4811 s 0 s 150 s 8874 s 0 s
 Memory: 15.606491088867188 GB (12319.51171875 MB free)
 Uptime: 1389.16 sec
 Load Avg: 1.03 1.03 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.86
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB (2 instances)
L1i cache: 64 KiB (2 instances)
L2 cache: 1 MiB (2 instances)
L3 cache: 32 MiB (1 instance)
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Vulnerable: Safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Cpu Property Value
Brand AMD EPYC 7763 64-Core Processor
Vendor :AMD
Architecture :Unknown
Model Family: 0xaf, Model: 0x01, Stepping: 0x01, Type: 0x00
Cores 16 physical cores, 16 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 512, 32768) kbytes
64 byte cache line size
Address Size 48 bits virtual, 48 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC runs at constant rate (invariant from clock frequency)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft
Benchmark result

Judge result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmarks:
  • Target: 27 Jun 2024 - 00:48
  • Baseline: 27 Jun 2024 - 00:58
  • Package commits:
  • Target: 69fc6c
  • Baseline: bcaae8
  • Julia commits:
  • Target: 48d4fd
  • Baseline: 48d4fd
  • Julia command flags:
  • Target: None
  • Baseline: None
  • Environment variables:
  • Target: None
  • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Target

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2445 MHz 1670 s 0 s 109 s 6568 s 0 s
 #2 2570 MHz 1260 s 0 s 113 s 6961 s 0 s
 #3 3242 MHz 2016 s 0 s 102 s 6224 s 0 s
 #4 3247 MHz 1691 s 0 s 97 s 6570 s 0 s
 Memory: 15.606491088867188 GB (13143.3828125 MB free)
 Uptime: 838.53 sec
 Load Avg: 1.01 0.93 0.55
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2594 MHz 2885 s 0 s 154 s 11434 s 0 s
 #2 2445 MHz 1876 s 0 s 174 s 12411 s 0 s
 #3 3234 MHz 3938 s 0 s 150 s 10383 s 0 s
 #4 3242 MHz 4077 s 0 s 141 s 10270 s 0 s
 Memory: 15.606491088867188 GB (13071.0703125 MB free)
 Uptime: 1452.23 sec
 Load Avg: 1.09 1.05 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Target result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 27 Jun 2024 - 0:48
  • Package commit: 69fc6c
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.873 s (5%) 1.068 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.745 s (5%) 246.787 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 811.444 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.781 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.644 s (5%) 6.558 ms 59.71 MiB (1%) 1900096
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.346 s (5%) 20.532 ms 32.03 MiB (1%) 824159
["Rimu", "FCIQMC", "(50, 50) Real space"] 22.164 s (5%) 11.335 ms 29.24 MiB (1%) 132025

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2445 MHz 1670 s 0 s 109 s 6568 s 0 s
 #2 2570 MHz 1260 s 0 s 113 s 6961 s 0 s
 #3 3242 MHz 2016 s 0 s 102 s 6224 s 0 s
 #4 3247 MHz 1691 s 0 s 97 s 6570 s 0 s
 Memory: 15.606491088867188 GB (13143.3828125 MB free)
 Uptime: 838.53 sec
 Load Avg: 1.01 0.93 0.55
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Baseline result

Benchmark Report for /home/runner/work/Rimu.jl/Rimu.jl

Job Properties

  • Time of benchmark: 27 Jun 2024 - 0:58
  • Package commit: bcaae8
  • Julia commit: 48d4fd
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Rimu", "Exact", "Diagonalization", "2D Hubbard"] 1.868 s (5%) 1.060 ms 93.71 MiB (1%) 23742
["Rimu", "Exact", "Diagonalization", "Bose-Hubbard in momentum space"] 17.675 s (5%) 275.235 ms 367.02 MiB (1%) 2037976
["Rimu", "Exact", "Multiplication", "Momentum space"] 827.428 ms (5%) 10.92 MiB (1%) 253
["Rimu", "Exact", "Multiplication", "Transcorrelated"] 2.803 s (5%) 1.88 MiB (1%) 257
["Rimu", "FCIQMC", "(10, 20) Mom space with projected energy and initiator"] 26.547 s (5%) 6.447 ms 59.71 MiB (1%) 1900174
["Rimu", "FCIQMC", "(4+1, 11) 2C Mom space with G2Correlators"] 25.534 s (5%) 32.03 MiB (1%) 824127
["Rimu", "FCIQMC", "(50, 50) Real space"] 21.840 s (5%) 6.744 ms 29.24 MiB (1%) 132010

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Rimu", "Exact", "Diagonalization"]
  • ["Rimu", "Exact", "Multiplication"]
  • ["Rimu", "FCIQMC"]

Julia versioninfo

Julia Version 1.10.4
Commit 48d4fd48430 (2024-06-04 10:41 UTC)
Build Info:
 Official https://julialang.org/ release
Platform Info:
 OS: Linux (x86_64-linux-gnu)
 Ubuntu 22.04.4 LTS
 uname: Linux 6.5.0-1022-azure #23~22.04.1-Ubuntu SMP Thu May 9 17:59:24 UTC 2024 x86_64 x86_64
 CPU: AMD EPYC 7763 64-Core Processor: 
 speed user nice sys idle irq
 #1 2594 MHz 2885 s 0 s 154 s 11434 s 0 s
 #2 2445 MHz 1876 s 0 s 174 s 12411 s 0 s
 #3 3234 MHz 3938 s 0 s 150 s 10383 s 0 s
 #4 3242 MHz 4077 s 0 s 141 s 10270 s 0 s
 Memory: 15.606491088867188 GB (13071.0703125 MB free)
 Uptime: 1452.23 sec
 Load Avg: 1.09 1.05 0.82
 WORD_SIZE: 64
 LIBM: libopenlibm
 LLVM: libLLVM-15.0.7 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() lbt
Sys.CPU_THREADS 4

lscpu output:

Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7763 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
Stepping: 1
BogoMIPS: 4890.86
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerp...[Comment body truncated]

mtsch added 4 commits June 24, 2024 15:17
@coveralls
Copy link

coveralls commented Jun 27, 2024

Pull Request Test Coverage Report for Build 9688534414

Details

  • 99 of 127 (77.95%) changed or added relevant lines in 4 files are covered.
  • 58 unchanged lines in 4 files lost coverage.
  • Overall coverage decreased (-1.1%) to 93.99%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/pdworkingmemory.jl 3 11 27.27%
src/DictVectors/communicators.jl 94 114 82.46%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/projectors.jl 1 84.31%
src/DictVectors/initiators.jl 3 90.48%
src/DictVectors/pdvec.jl 17 86.14%
src/DictVectors/communicators.jl 37 68.66%
Totals Coverage Status
Change from base Build 9637517985: -1.1%
Covered Lines: 6725
Relevant Lines: 7155

💛 - Coveralls

@mtsch mtsch marked this pull request as ready for review June 27, 2024 00:50
@mtsch mtsch requested a review from joachimbrand June 27, 2024 00:51
mtsch added 4 commits June 27, 2024 12:51
@coveralls
Copy link

coveralls commented Jun 27, 2024

Pull Request Test Coverage Report for Build 9689023607

Details

  • 124 of 129 (96.12%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.007%) to 95.097%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 111 116 95.69%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: -0.007%
Covered Lines: 6808
Relevant Lines: 7159

💛 - Coveralls

@coveralls
Copy link

coveralls commented Jun 27, 2024

Pull Request Test Coverage Report for Build 9689659963

Details

  • 124 of 129 (96.12%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.007%) to 95.097%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 111 116 95.69%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: -0.007%
Covered Lines: 6808
Relevant Lines: 7159

💛 - Coveralls

@mtsch mtsch requested a review from christofbradly June 27, 2024 04:18
Copy link
Collaborator

@joachimbrand joachimbrand left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left a few comments. Looks all right although I cannot say that I have fully grokked how AllToAll works.

Comment on lines +202 to 220

```julia
julia> using Rimu.DictVectors: SegmentedBuffer

julia> buf = SegmentedBuffer{Int}()
0-element SegmentedBuffer{Int64}

julia> Rimu.DictVectors.replace_collections!(buf, [[1,2,3], [4,5]])
2-element SegmentedBuffer{Int64}:
[1, 2, 3]
[4, 5]

julia> Rimu.DictVectors.replace_collections!(buf, [[1], [2,3], [4]])
3-element SegmentedBuffer{Int64}:
[1]
[2, 3]
[4]
```
"""
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe add a link to SegmentedBuffer?

Comment on lines 167 to 181
"""
SegmentedBuffer
SegmentedBuffer{T} <: AbstractVector{AbstractVector{T}}

Multiple vectors stored in a single buffer with MPI communication support. Used in the
[`PointToPoint`](@ref) communication strategy.

Multiple vectors stored in a simple buffer with MPI communication.
# Supported operations

See [`replace_collections!`](@ref), [`mpi_send`](@ref), [`mpi_recv_any!`](@ref).
* [`replace_collections!`](@ref): insert data into the buffers
* [`mpi_send`](@ref): send the contents of a buffer to a given rank
* [`mpi_recv_any!`](@ref): receive a message sent by [`mpi_send`](@ref) from any rank,
storing the contents in this buffer

See also: [`NestedSegmentedBuffer`](@ref).
"""
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a bit abstract. Maybe adding a link to replace_collections!, where an example is shown, might be helpful.

@@ -269,18 +304,25 @@ mpi_size(ptp::PointToPoint) = ptp.mpi_size
mpi_comm(ptp::PointToPoint) = ptp.mpi_comm

function synchronize_remote!(ptp::PointToPoint, w)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find the docstrings of synchronize_remote!, collect_local!, local_segments, and remote_segments rather confusing. What does synchronize_remote! actually do?

It also seems that they have to be called in the correct order. Would it make sense to link to all in each of the docstrings? It should also be mentioned in local_segments, and remote_segments that collect_local! needs to be called first.

Comment on lines 359 to 360
Matrix of vectors stored in a single buffer with collective MPI communication support. The
number of rows in the matrix `nrows`. Used in the [`AllToAll`](@ref) communication strategy.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Matrix of vectors stored in a single buffer with collective MPI communication support. The
number of rows in the matrix `nrows`. Used in the [`AllToAll`](@ref) communication strategy.
Matrix of vectors stored in a single buffer with collective MPI communication support. `nrows` is the
number of rows in the matrix. Used in the [`AllToAll`](@ref) communication strategy.

Should it be explained what the rows and columns mean in the context of AllToAll?

"""
struct SegmentedBuffer{T} <: AbstractVector{SubArray{T,1,Vector{T},Tuple{UnitRange{Int64}},true}}
struct SegmentedBuffer{T} <: AbstractVector{SubVector{T}}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this an AbstractVector? It should probably support the interface (size, getindex).

Comment on lines 504 to 506
AllToAll{K,V} <: Communicator

[`Communicator`](@ref) that uses collective communication using `MPI.Alltoall!`.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be great to document the constructor with all accepted keyword arguments.

@coveralls
Copy link

coveralls commented Jul 2, 2024

Pull Request Test Coverage Report for Build 9754098047

Details

  • 134 of 139 (96.4%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage decreased (-0.004%) to 95.1%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 116 121 95.87%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: -0.004%
Covered Lines: 6812
Relevant Lines: 7163

💛 - Coveralls

@coveralls
Copy link

coveralls commented Jul 4, 2024

Pull Request Test Coverage Report for Build 9768912173

Details

  • 144 of 147 (97.96%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage increased (+0.03%) to 95.133%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 126 129 97.67%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: 0.03%
Covered Lines: 6822
Relevant Lines: 7171

💛 - Coveralls

@coveralls
Copy link

coveralls commented Jul 4, 2024

Pull Request Test Coverage Report for Build 9800167899

Details

  • 146 of 151 (96.69%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage increased (+0.004%) to 95.108%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 128 133 96.24%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: 0.004%
Covered Lines: 6824
Relevant Lines: 7175

💛 - Coveralls

mtsch added 2 commits July 8, 2024 11:18
@mtsch mtsch requested a review from joachimbrand July 7, 2024 23:20
@coveralls
Copy link

Pull Request Test Coverage Report for Build 9831054698

Details

  • 146 of 151 (96.69%) changed or added relevant lines in 4 files are covered.
  • 2 unchanged lines in 1 file lost coverage.
  • Overall coverage increased (+0.004%) to 95.108%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/DictVectors/communicators.jl 128 133 96.24%
Files with Coverage Reduction New Missed Lines %
src/DictVectors/initiators.jl 2 92.06%
Totals Coverage Status
Change from base Build 9637517985: 0.004%
Covered Lines: 6824
Relevant Lines: 7175

💛 - Coveralls

Copy link
Collaborator

@joachimbrand joachimbrand left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Much clearer, thanks!

@joachimbrand joachimbrand merged commit 8be042d into develop Jul 9, 2024
10 checks passed
@joachimbrand joachimbrand deleted the feature/alltoall branch July 9, 2024 22:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants