Commit 4c0837b

nv-guomingz authored and suyoggupta committed
[None][fix] Display the GPU memory information in GiB unit. (NVIDIA#9070)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
1 parent 5079a8a commit 4c0837b

1 file changed: +1 -1 lines changed

tensorrt_llm/_torch/pyexecutor/resource_manager.py

Lines changed: 1 addition & 1 deletion
@@ -718,7 +718,7 @@ def calculate_max_num_blocks(self,
     if kv_cache_config.free_gpu_memory_fraction is not None:
         max_tokens = min(kv_cache_config.max_tokens, max_tokens)
         logger.warning(
-            f'Both free_gpu_memory_fraction and max_tokens are set (to {free_mem_fraction} and {max_tokens} with free memory {free_mem / (1 << 32)} of total memory {total_mem / (1<<32)}, respectively). The smaller value will be used.'
+            f'Both free_gpu_memory_fraction and max_tokens are set (to {free_mem_fraction} and {max_tokens} with free memory {free_mem / (1 << 30)}GiB of total memory {total_mem / (1<<30)}GiB, respectively). The smaller value will be used.'
         )
     else:
         max_tokens = kv_cache_config.max_tokens
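
The divisor change is the whole fix: 1 GiB is 2**30 bytes, so dividing a byte count by `1 << 32` under-reports the value by a factor of 4. A minimal standalone sketch of the arithmetic follows. The helper `bytes_to_gib` and the 80 GiB figure are hypothetical, chosen only for illustration, and are not part of `resource_manager.py`.

```python
GIB = 1 << 30  # 1 GiB = 2**30 bytes


def bytes_to_gib(num_bytes: int) -> float:
    """Convert a byte count to GiB (hypothetical helper, not in the commit)."""
    return num_bytes / GIB


# Hypothetical value for illustration: a device with 80 GiB of total memory.
total_mem = 80 * GIB

old_value = total_mem / (1 << 32)    # old divisor: counts units of 4 GiB
new_value = bytes_to_gib(total_mem)  # new divisor: counts GiB

print(f"old: {old_value}, new: {new_value}GiB")  # old: 20.0, new: 80.0GiB
```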
