Skip to content

Commit 7526a90

Browse files
authored
Update summary.md (#125)
1 parent 8bffb5d commit 7526a90

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

benchmarks/summary.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,8 @@ Date | Device | dtype | batch size | cache length |max input length |max output
2222
----| ------- | ------ |---------- | -------------|-----------------|------------------|----------------------
2323
2024-05-14 | TPU v5e-8 | bfloat16 | 512 | 2048 | 1024 | 1024 | 8700
2424
2024-05-14 | TPU v5e-8 | int8 | 1024 | 2048 | 1024 | 1024 | 8746
25+
2024-06-13 | TPU v5e-1 | bfloat16 | 1024 | 2048 | 1024 | 1024 | 4249
26+
2527

2628
** NOTE: ** Gemma 2B uses `--shard_on_batch` flag so it's data parallel instead
2729
of model parallel.

0 commit comments

Comments
 (0)