[5336870] AutoCast: Unblock LSTM from conversion (#544)

galagam · web-flow · commit e3e399a9a574 · 2025-11-16T09:21:47.000+02:00
Blocking was added as a WAR for TensorRT bug 5336870, fixed in TensorRT 10.14 ## What does this PR do? **Type of change:** Bug fix  **Overview:** Unblock LSTM from conversion to 16-bit in AutoCast. Blocking was added as a WAR for TensorRT bug 5336870 (TRT only supported FP32 LSTM). Bug was fixed in TensorRT 10.14 which is now released. ## Usage  ```python # Add a code snippet demonstrating how to use this ``` ## Testing  ## Before your PR is "*Ready for review*"  - **Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/TensorRT-Model-Optimizer/blob/main/CONTRIBUTING.md)** and your commits are signed. - **Is this change backward compatible?**: Yes  - **Did you write any new necessary tests?**: No - **Did you add or update any necessary documentation?**: No - **Did you update [Changelog](https://github.com/NVIDIA/TensorRT-Model-Optimizer/blob/main/CHANGELOG.rst)?**: No  ## Additional Information  Signed-off-by: Gal Hubara Agam <96368689+galagam@users.noreply.github.com>
diff --git a/modelopt/onnx/autocast/precisionconverter.py b/modelopt/onnx/autocast/precisionconverter.py
@@ -68,7 +68,7 @@ class InitializerConsumerTracker:
 OP_TYPES_NOT_SUPPORTED_IN_LOW_PRECISION = ["Upsample", "NonMaxSuppression", "Celu"]
 
 # Temporarily block these ops in low precision, as they are not supported yet
-OP_TYPES_NOT_SUPPORTED_IN_LOW_PRECISION.extend(["Scan", "If", "Loop", "LSTM"])
+OP_TYPES_NOT_SUPPORTED_IN_LOW_PRECISION.extend(["Scan", "If", "Loop"])
 
 # Mapping of op types to indices of inputs that should not be converted to low precision.
 SKIP_LOW_PRECISION_MAPPING_FP16 = {"Resize": {2}}