Skip to content

Commit 9a8dc1f

Browse files
authored
Fix SpinQuant -> Quip link (#2011)
The link in the Quip README is pointing to SpinQuant. Note: We might want to link directly to the relevant lines but: - we need to say "as of llmcompressor vXX.X, the parameters are --> link-with-git-tag, possibly sending people to outdated configuration a year from now. - or we don't git tag and the line might change with refactoring. Signed-off-by: Mamy Ratsimbazafy <mamy_github@numforge.co>
1 parent 1c85a66 commit 9a8dc1f

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

examples/transform/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@ recipe = [
4141
]
4242
```
4343

44-
Note that `QuIPModifier` can be customized. For a full list of the available arguments, see the [docstring](/src/llmcompressor/modifiers/transform/spinquant/base.py) or documentation.
44+
Note that `QuIPModifier` can be customized. For a full list of the available arguments, see the [docstring](/src/llmcompressor/modifiers/transform/quip/base.py) or documentation.
4545

4646
* `rotations` determines which of the input rotation (v) or output rotations (u) should be used.
4747
* `transform_block_size` determines the size of the hadamard. Smaller hadamards require less cost at runtime.
@@ -79,4 +79,4 @@ python3 benchmarks/benchmark_latency.py --model path/to/Llama-3.2-1B-Instruct-qu
7979
#### Quantized Model Latency (sec) ####
8080
| Base W4A16 | Hadacore | GEMM |
8181
| - | - | - |
82-
| 0.4402 | 0.4489 | 1.2917 |
82+
| 0.4402 | 0.4489 | 1.2917 |

0 commit comments

Comments
 (0)