Commit 067b154
authored
Skip node output dump for MemcpyToHost (microsoft#25651)
Fix node output dump for MemcpyToHost. The statistics data is not
correct, since data might not be copied to CPU yet:
```
MemcpyToHost node: Memcpy_token_232
Input 0 Name: /model/layers.6/moe/router/Add/output_0_CUDAExecutionProvider
Shape: {1,1,32}
OrtMemoryInfo:[name:Cuda OrtMemType:0 OrtAllocatorType:1 Device:[DeviceType:1 MemoryType:0 VendorId:4318 DeviceId:0 Alignment:0]]
Min=-2.5136719,Max=1.6914062
-----------
Output 0 Name: /model/layers.6/moe/router/Add/output_0
Shape: {1,1,32}
Min=-4888,Max=6672,NaN=2
```
This fix will skip the output dump (or statistics) like
```
-----------
Output 0 Name: /model/layers.6/moe/router/Add/output_0
Shape: {1,1,32}
is same as input
```1 parent 6e2a287 commit 067b154
File tree
1 file changed
+7
-0
lines changed- onnxruntime/core/framework
1 file changed
+7
-0
lines changedLines changed: 7 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
667 | 667 | | |
668 | 668 | | |
669 | 669 | | |
| 670 | + | |
| 671 | + | |
| 672 | + | |
| 673 | + | |
| 674 | + | |
| 675 | + | |
| 676 | + | |
670 | 677 | | |
671 | 678 | | |
672 | 679 | | |
| |||
0 commit comments