Commit e5e9067

[Misc] fix typo and add detailed log (vllm-project#28178)
Signed-off-by: Andy Xie <andy.xning@gmail.com>
1 parent 3a7d580 commit e5e9067

2 files changed: +2 -2 lines changed

examples/offline_inference/load_sharded_state.py

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@
     --model /path/to/load \
     --quantization deepspeedfp \
     --tensor-parallel-size 8 \
-    --output /path/to/save/sharded/modele
+    --output /path/to/save/sharded/model
 
 python load_sharded_state.py \
     --model /path/to/saved/sharded/model \

vllm/v1/worker/gpu_model_runner.py

Lines changed: 1 addition & 1 deletion
@@ -3035,7 +3035,7 @@ def load_model(self, eep_scale_up: bool = False) -> None:
         time_after_load = time.perf_counter()
         self.model_memory_usage = m.consumed_memory
         logger.info_once(
-            "Model loading took %.4f GiB and %.6f seconds",
+            "Model loading took %.4f GiB memory and %.6f seconds",
             self.model_memory_usage / GiB_bytes,
             time_after_load - time_before_load,
             scope="local",
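
For reviewers who want to see how the reworded message renders, here is a minimal standalone sketch. It uses the standard-library `logging` module instead of vLLM's `logger.info_once`. The helper `format_load_message` is hypothetical, and `GiB_bytes = 1 << 30` is an assumption about the constant used in the diff.

```python
import logging
import time

# Assumption: GiB_bytes in the diff is 2**30 bytes.
GiB_bytes = 1 << 30

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("load_model_sketch")


def format_load_message(consumed_bytes: int, seconds: float) -> str:
    """Hypothetical helper: render the new log message from the diff."""
    return "Model loading took %.4f GiB memory and %.6f seconds" % (
        consumed_bytes / GiB_bytes,
        seconds,
    )


# Stand-in for the timed model load; no real model is loaded here.
time_before_load = time.perf_counter()
consumed_memory = 3 * GiB_bytes // 2  # pretend the load consumed 1.5 GiB
time_after_load = time.perf_counter()

logger.info(format_load_message(consumed_memory, time_after_load - time_before_load))
```

The added word "memory" makes clear that the first figure is a memory footprint in GiB, not a file size, and that the second figure is wall-clock time.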
