Insights: pytorch/TensorRT
Overview
8 Pull requests merged by 3 people
- remove breakpoint() (#3750, merged Aug 6, 2025)
- Fixed SDPA slow down and linear slow down (#3700, merged Aug 4, 2025)
- enable back jetpack build (#3720, merged Aug 4, 2025)
- Upgrade perf_run script to support TRT 10 and fix some issues (#3650, merged Aug 4, 2025)
- add test cases for strong typing (#3739, merged Aug 4, 2025)
- fix conv1d/deconv1d bug with stride more than 1 (#3737, merged Aug 2, 2025)
- Cherrypick #3703 for release/2.8 (#3735, merged Aug 1, 2025)
- Cherrypick #3719 for release/2.8 (#3734, merged Aug 1, 2025)
9 Pull requests opened by 5 people
- feat: check the num of ILayer types and compare with ONNX-TRT in converter tests (#3732, opened Jul 31, 2025)
- feat: Add support for Groot N1.5 model (#3736, opened Jul 31, 2025)
- fp8 pre-quantized model support (#3740, opened Aug 1, 2025)
- Tentatively eliminate graph break overhead (#3741, opened Aug 1, 2025)
- add sliding window support for Gemma3 (#3742, opened Aug 2, 2025)
- add typing_extensions as test dependencies which is required by modelopt (#3743, opened Aug 4, 2025)
- fix: Inferred dimensions at build time in reshape (#3746, opened Aug 5, 2025)
- cherry pick 3700 to 2.8 release: Broadcast removal (#3747, opened Aug 5, 2025)
- add strong typing fix (#3749, opened Aug 5, 2025)
5 Issues closed by 2 people
- 🐛 [Bug] SDPA decomposition causing TorchTRT to be 2x slower than ONNX on SD3.5 (#3682, closed Aug 4, 2025)
- 🐛 [Bug] SDPA in Torch-TRT is slower than SDPA in ONNX as batch size (num_frames) grow for Wan2.1 (#3695, closed Aug 4, 2025)
- [Bug] MHA Kernels and Linear Kernels are slower in FLUX (#3707, closed Aug 4, 2025)
- 🐛 [Bug] Changing input size would affect the TRT engine size, testing on BERT (#3634, closed Aug 4, 2025)
- 🐛 [Bug] perf_run.py script doesn't support TRT10 and there are some known bugs (#3709, closed Aug 4, 2025)
5 Issues opened by 4 people
- 🐛 [Bug] Llama2_flashinfer_rmsnorm example is broken (#3748, opened Aug 5, 2025)
- 🐛 [Bug] Compilation failure with NVFP4 Quantization with dynamic shapes (#3745, opened Aug 5, 2025)
- ✨[Feature] Pre-compiled C++ Binaries for Windows (#3744, opened Aug 5, 2025)
- 📖 [Story] Converters optimization (#3733, opened Jul 31, 2025)
- 🐛 [Bug] perf gap reduce on RAFT (#3731, opened Jul 30, 2025)
7 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Removal of BAZEL build files from python package and changes to make cpp tests work (#3641, commented on Aug 5, 2025 • 3 new comments)
- Dynamic memory allocation (#3727, commented on Jul 30, 2025 • 2 new comments)
- TRT-LLM loading mechanism tool (#3398, commented on Jul 31, 2025 • 1 new comment)
- 🐛 [Bug] Large Accuracy Issue (#3626, commented on Aug 4, 2025 • 0 new comments)
- Remove Bazel files from wheel (#3615, commented on Aug 1, 2025 • 0 new comments)
- tensorrt_rtx try on (#3679, commented on Aug 6, 2025 • 0 new comments)
- Index converter dynamic cases fix (#3694, commented on Jul 31, 2025 • 0 new comments)