Skip to content

Commit 032bb4f

Browse files
authored
Update kvstar.md
1 parent 70ffb31 commit 032bb4f

File tree

1 file changed

+9
-8
lines changed
  • docs/source/user-guide/sparse-attention

1 file changed

+9
-8
lines changed

docs/source/user-guide/sparse-attention/kvstar.md

Lines changed: 9 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -67,16 +67,17 @@ ktc = KVTransferConfig(
6767

6868

6969

70-
 
71-
7270
## 🔥 Results
7371
The following results were obtained using `Qwen2.5-14B-Instruct` under the hyperparameters in `examples/offline_inference_kvstar.py`.
7472

73+
### 📈 Accuracy
74+
We use [LongBench](https://huggingface.co/datasets/zai-org/LongBench) to evaluate the accuracy (F1-score) of the KVstar algorithm on H20 GPU. The model is `Qwen2.5-14B-Instruct`
75+
76+
| | Dataset | full Attention | KVstar (25% KVcache on GPU) |
77+
|-------|-----------|-----------|-------|
78+
| H20 GPU | dureader | 32.20 | 29.93 |
79+
| Ascend 910B NPU | dureader | 32.46 | 31.08 |
80+
7581
### 🏆 Performance
7682

77-
### 📈 Accuracy
78-
We use [LongBench](https://huggingface.co/datasets/zai-org/LongBench) to evaluate the accuracy of the ESA algorithm.
79-
| Dataset | F1-Score for full Attention | F1-Score for KVstar |
80-
|-------|-----------|-----------|
81-
| multifieldqa_zh | | 62.63 |
82-
| dureader | | 30.96 |
83+
(coming soon)

0 commit comments

Comments
 (0)