llama-bench : accept ranges for integer parameters #13410

slaren · 2025-05-09T17:17:03Z

Accept ranges with the syntax start-end[+step] for all integer parameters.

Example:

$ llama-bench -n 0 -p 1-5,10-100+10,256
| model                          |       size |     params | backend    | ngl |            test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | --------------: | -------------------: |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |             pp1 |        171.12 ± 5.78 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |             pp2 |        229.50 ± 6.88 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |             pp3 |       343.37 ± 12.77 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |             pp4 |        433.23 ± 9.29 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |             pp5 |       494.69 ± 12.29 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp10 |        795.65 ± 5.96 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp20 |      1489.93 ± 19.02 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp30 |      2103.04 ± 61.36 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp40 |      2589.35 ± 37.22 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp50 |      2904.41 ± 48.25 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp60 |      3393.16 ± 19.55 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp70 |      3612.60 ± 35.55 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp80 |      4003.98 ± 16.02 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |            pp90 |      4101.49 ± 18.51 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |           pp100 |      4288.92 ± 19.39 |
| llama 7B Q4_0                  |   3.56 GiB |     6.74 B | CUDA       |  99 |           pp256 |      5728.99 ± 13.87 |
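
For readers who want to see how `-p 1-5,10-100+10,256` expands into the 16 prompt sizes in the table, here is a minimal sketch. It is not the code from this PR: `expand_int_ranges` is a hypothetical helper, and its error handling (throwing `std::invalid_argument` for malformed or descending ranges) is an assumption rather than the behavior of `parse_int_range`.

```cpp
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not the PR's parse_int_range): expands a comma-separated
// list of "first-last[+step]" items into the integers they denote.
// Both ends of a range are inclusive.
static std::vector<int> expand_int_ranges(const std::string & arg) {
    std::vector<int> out;
    size_t pos = 0;
    while (pos <= arg.size()) {
        // Isolate the next comma-separated item.
        size_t comma = arg.find(',', pos);
        if (comma == std::string::npos) {
            comma = arg.size();
        }
        const std::string item = arg.substr(pos, comma - pos);
        pos = comma + 1;

        // std::stoi throws on non-numeric input; n receives the number of characters consumed.
        size_t n = 0;
        const int first = std::stoi(item, &n);
        int last = first;
        int step = 1;
        size_t cur = n;
        if (cur < item.size() && item[cur] == '-') {
            last = std::stoi(item.substr(cur + 1), &n);
            cur += 1 + n;
            if (cur < item.size() && item[cur] == '+') {
                step = std::stoi(item.substr(cur + 1), &n);
                cur += 1 + n;
            }
        }
        // Reject trailing garbage, non-positive steps, and descending ranges.
        if (cur != item.size() || step <= 0 || last < first) {
            throw std::invalid_argument("invalid range: " + item);
        }
        // Iterate in long long so that v += step cannot overflow int.
        for (long long v = first; v <= last; v += step) {
            out.push_back((int) v);
        }
    }
    return out;
}
```

Under these assumptions, `expand_int_ranges("1-5,10-100+10,256")` yields 1 through 5, then 10 through 100 in steps of 10, then 256, matching the `pp` rows above.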

slaren · 2025-05-09T17:22:37Z

tools/llama-bench/llama-bench.cpp

-            for (const auto & t : p) {
-                ggml_type gt = ggml_type_from_name(t);
-                if (gt == GGML_TYPE_COUNT) {
+        try {


The only real change here is replacing string_split<int>(argv[i], split_delim) with parse_int_range(argv[i]); the rest of the diff is due to the change in indentation.

JohannesGaessler · 2025-05-09T17:56:10Z

tools/llama-bench/llama-bench.cpp

@@ -323,7 +347,7 @@ static void print_usage(int /* argc */, char ** argv) {
    printf("\n");
    printf(
        "Multiple values can be given for each parameter by separating them with ',' or by specifying the parameter "
-        "multiple times.\n");
+        "multiple times. Ranges can be specified with 'start-end' or 'start-end+step'.\n");


You should specify that the ranges are inclusive.

slaren · 2025-05-09

I changed it to first-last+step. Is that clear enough?

JohannesGaessler · 2025-05-09

I guess that also works, but I think it would make sense to also rename the variables in the code to match the help.

JohannesGaessler · 2025-05-10T07:29:54Z

Would it be possible to support not just linear step sizes but exponential ones as well? So something like -ub 16-128*2 would translate to -ub 16,32,64,128.
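
To make the suggestion concrete, here is a rough sketch of how a multiplicative step could expand. Nothing in this thread confirms that such a syntax was implemented: `expand_geometric` is a hypothetical function that takes already-parsed values, and the validation rules are assumptions.

```cpp
#include <stdexcept>
#include <vector>

// Hypothetical illustration of the suggested "first-last*mult" form.
// Takes already-parsed integers; the end of the range is inclusive.
static std::vector<int> expand_geometric(int first, int last, int mult) {
    // A non-positive start or a multiplier below 2 would never reach `last`.
    if (first <= 0 || last < first || mult <= 1) {
        throw std::invalid_argument("invalid geometric range");
    }
    std::vector<int> out;
    // long long keeps v * mult from overflowing int before the bound check.
    for (long long v = first; v <= last; v *= mult) {
        out.push_back((int) v);
    }
    return out;
}
```

With these assumptions, `expand_geometric(16, 128, 2)` produces 16, 32, 64, 128, which is the expansion described for `-ub 16-128*2`.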

llama-bench : accept ranges for integer parameters

6a96720

slaren mentioned this pull request May 9, 2025

Differential mode for llama-bench + plotting code #13408

Open

github-actions bot added the examples label May 9, 2025

JohannesGaessler approved these changes May 9, 2025

slaren added 2 commits May 9, 2025 19:58

start-end => first-last

9e18afb

update variable names

4ae188d
