Skip to content

Conversation

@christiangnrd
Copy link
Member

@christiangnrd christiangnrd commented Nov 24, 2025

Metal.jl includes these in its versioninfo and it's good to have the version of these packages when helping people troubleshoot issues.

Metal.jl also includes the LLVM.jl version which isn't part of this PR. Maybe it should be?

@github-actions
Copy link
Contributor

github-actions bot commented Nov 24, 2025

Your PR requires formatting changes to meet the project's style guidelines.
Please consider running Runic (git runic master) to apply these changes.

Click here to view the suggested changes.
diff --git a/src/utilities.jl b/src/utilities.jl
index 72ddf09ba..16f4dd725 100644
--- a/src/utilities.jl
+++ b/src/utilities.jl
@@ -69,8 +69,10 @@ function versioninfo(io::IO=stdout)
 
     println(io, "Julia packages: ")
     println(io, "- CUDA: $(Base.pkgversion(CUDA))")
-    for name in [:GPUArrays, :GPUCompiler, :KernelAbstractions, :CUDA_Driver_jll,
-                 :CUDA_Compiler_jll, :CUDA_Runtime_jll, :CUDA_Runtime_Discovery]
+    for name in [
+            :GPUArrays, :GPUCompiler, :KernelAbstractions, :CUDA_Driver_jll,
+            :CUDA_Compiler_jll, :CUDA_Runtime_jll, :CUDA_Runtime_Discovery,
+        ]
         isdefined(CUDA, name) || continue
         mod = getfield(CUDA, name)
         println(io, "- $(name): $(Base.pkgversion(mod))")

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CUDA.jl Benchmarks

Benchmark suite Current: 86b05f1 Previous: 0048e5a Ratio
latency/precompile 56986848212 ns 57035764507 ns 1.00
latency/ttfp 8196632234 ns 8371031413.5 ns 0.98
latency/import 4370627993 ns 4496982792 ns 0.97
integration/volumerhs 9618365 ns 9624750 ns 1.00
integration/byval/slices=1 146990 ns 147060 ns 1.00
integration/byval/slices=3 426241.5 ns 426247.5 ns 1.00
integration/byval/reference 145228 ns 145079 ns 1.00
integration/byval/slices=2 286600 ns 286395 ns 1.00
integration/cudadevrt 104014 ns 103630 ns 1.00
kernel/indexing 14460 ns 14142 ns 1.02
kernel/indexing_checked 15064 ns 15085 ns 1.00
kernel/occupancy 691.6 ns 683.7784810126582 ns 1.01
kernel/launch 2138.777777777778 ns 2192.3333333333335 ns 0.98
kernel/rand 17094 ns 15556 ns 1.10
array/reverse/1d 19969 ns 19955 ns 1.00
array/reverse/2dL_inplace 67066 ns 67142 ns 1.00
array/reverse/1dL 70176 ns 70143 ns 1.00
array/reverse/2d 21753 ns 21759 ns 1.00
array/reverse/1d_inplace 9670 ns 9799 ns 0.99
array/reverse/2d_inplace 13563 ns 11305 ns 1.20
array/reverse/2dL 73835 ns 73707 ns 1.00
array/reverse/1dL_inplace 66883 ns 66873 ns 1.00
array/copy 20761 ns 20703 ns 1.00
array/iteration/findall/int 157645 ns 157587.5 ns 1.00
array/iteration/findall/bool 140172 ns 140083 ns 1.00
array/iteration/findfirst/int 161389 ns 161900 ns 1.00
array/iteration/findfirst/bool 161881.5 ns 162539 ns 1.00
array/iteration/scalar 75134 ns 73928 ns 1.02
array/iteration/logical 213840.5 ns 216280.5 ns 0.99
array/iteration/findmin/1d 50734.5 ns 53048.5 ns 0.96
array/iteration/findmin/2d 96386 ns 96623.5 ns 1.00
array/reductions/reduce/Int64/1d 43659 ns 43822 ns 1.00
array/reductions/reduce/Int64/dims=1 44750.5 ns 45024.5 ns 0.99
array/reductions/reduce/Int64/dims=2 61574 ns 61470 ns 1.00
array/reductions/reduce/Int64/dims=1L 88993.5 ns 88941 ns 1.00
array/reductions/reduce/Int64/dims=2L 88369 ns 87825.5 ns 1.01
array/reductions/reduce/Float32/1d 36908 ns 37058 ns 1.00
array/reductions/reduce/Float32/dims=1 46368.5 ns 50073 ns 0.93
array/reductions/reduce/Float32/dims=2 59994 ns 59949 ns 1.00
array/reductions/reduce/Float32/dims=1L 52581 ns 52439 ns 1.00
array/reductions/reduce/Float32/dims=2L 72286.5 ns 72064 ns 1.00
array/reductions/mapreduce/Int64/1d 43418 ns 43941 ns 0.99
array/reductions/mapreduce/Int64/dims=1 45439 ns 44815 ns 1.01
array/reductions/mapreduce/Int64/dims=2 61750 ns 61592 ns 1.00
array/reductions/mapreduce/Int64/dims=1L 89138.5 ns 88958 ns 1.00
array/reductions/mapreduce/Int64/dims=2L 88158 ns 88240 ns 1.00
array/reductions/mapreduce/Float32/1d 36768 ns 37401 ns 0.98
array/reductions/mapreduce/Float32/dims=1 41956.5 ns 43167 ns 0.97
array/reductions/mapreduce/Float32/dims=2 59893 ns 59977 ns 1.00
array/reductions/mapreduce/Float32/dims=1L 52873 ns 52531 ns 1.01
array/reductions/mapreduce/Float32/dims=2L 72020.5 ns 72397 ns 0.99
array/broadcast 20029 ns 20122 ns 1.00
array/copyto!/gpu_to_gpu 11262 ns 11368 ns 0.99
array/copyto!/cpu_to_gpu 213986 ns 215772 ns 0.99
array/copyto!/gpu_to_cpu 281623 ns 282327 ns 1.00
array/accumulate/Int64/1d 124549 ns 124672 ns 1.00
array/accumulate/Int64/dims=1 83642 ns 83443 ns 1.00
array/accumulate/Int64/dims=2 157922 ns 157725 ns 1.00
array/accumulate/Int64/dims=1L 1709836 ns 1710398.5 ns 1.00
array/accumulate/Int64/dims=2L 966274.5 ns 966565 ns 1.00
array/accumulate/Float32/1d 109573 ns 108966.5 ns 1.01
array/accumulate/Float32/dims=1 80774 ns 80321 ns 1.01
array/accumulate/Float32/dims=2 147594 ns 148101 ns 1.00
array/accumulate/Float32/dims=1L 1619192 ns 1619028 ns 1.00
array/accumulate/Float32/dims=2L 698511 ns 698667 ns 1.00
array/construct 1262.7 ns 1281.2 ns 0.99
array/random/randn/Float32 44275.5 ns 48183.5 ns 0.92
array/random/randn!/Float32 24922 ns 24910 ns 1.00
array/random/rand!/Int64 27375 ns 27271 ns 1.00
array/random/rand!/Float32 8803.5 ns 8866.333333333334 ns 0.99
array/random/rand/Int64 29917 ns 37929.5 ns 0.79
array/random/rand/Float32 13089 ns 13112 ns 1.00
array/permutedims/4d 55259 ns 55650 ns 0.99
array/permutedims/2d 53983 ns 54104.5 ns 1.00
array/permutedims/3d 54863 ns 54918 ns 1.00
array/sorting/1d 2758076 ns 2757756 ns 1.00
array/sorting/by 3345122 ns 3344340.5 ns 1.00
array/sorting/2d 1081349 ns 1081498 ns 1.00
cuda/synchronization/stream/auto 1085.8 ns 1053.2 ns 1.03
cuda/synchronization/stream/nonblocking 8263.8 ns 7607.6 ns 1.09
cuda/synchronization/stream/blocking 836.59375 ns 872.9166666666666 ns 0.96
cuda/synchronization/context/auto 1200.4 ns 1195.3 ns 1.00
cuda/synchronization/context/nonblocking 8032.200000000001 ns 7954 ns 1.01
cuda/synchronization/context/blocking 927.9655172413793 ns 921.375 ns 1.01

This comment was automatically generated by workflow using github-action-benchmark.

@codecov
Copy link

codecov bot commented Nov 26, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 89.49%. Comparing base (4db30fe) to head (11b226b).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2983      +/-   ##
==========================================
+ Coverage   89.30%   89.49%   +0.19%     
==========================================
  Files         150      150              
  Lines       13133    13133              
==========================================
+ Hits        11729    11754      +25     
+ Misses       1404     1379      -25     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant