Conversation

@PerkzZheng
Collaborator

Fall back to cubins for now; we will revisit this later.

qsang-nv and others added 2 commits July 7, 2025 02:56
Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
@PerkzZheng PerkzZheng requested a review from a team as a code owner July 7, 2025 02:59
@PerkzZheng
Collaborator Author

/bot run

@PerkzZheng PerkzZheng requested review from QiJune and qsang-nv July 7, 2025 02:59
@tensorrt-cicd
Collaborator

PR_Github #11092 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #11092 [ run ] completed with state SUCCESS
/LLM/release-0.21/L0_MergeRequest_PR pipeline #176 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

@crazydemo
Collaborator

Verified the FP8 cases; all pass.

@crazydemo crazydemo self-requested a review July 8, 2025 02:19
@QiJune QiJune requested a review from litaotju July 8, 2025 02:26
@litaotju litaotju merged commit 5a50e2b into NVIDIA:release/0.21 Jul 8, 2025
4 checks passed
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 10, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: qsang-nv <200703406+qsang-nv@users.noreply.github.com>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 11, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: qsang-nv <200703406+qsang-nv@users.noreply.github.com>
dc3671 pushed a commit to dc3671/TensorRT-LLM that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (NVIDIA#5779)

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: qsang-nv <200703406+qsang-nv@users.noreply.github.com>
dc3671 pushed a commit that referenced this pull request Jul 14, 2025
… fmha kernels on Ada. (#5779)

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: qsang-nv <200703406+qsang-nv@users.noreply.github.com>
6 participants