
[PyTorch] Fix L3 FA tests#2709

Merged
cyanguwa merged 3 commits into NVIDIA:main from cyanguwa:fix_L3_FA
Feb 28, 2026

Conversation

@cyanguwa
Collaborator

@cyanguwa cyanguwa commented Feb 26, 2026

Description

This PR fixes the L3 tests for FP8 current scaling in L3_pytorch_FA_versions_test. The fix only affects the backend-selection logic in the tests, not the backend support itself.

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Changes

Please list the changes introduced in this PR:

  • L3 CI test fix.

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

cyanguwa and others added 2 commits February 26, 2026 10:58
Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
@greptile-apps
Contributor

greptile-apps bot commented Feb 26, 2026

Greptile Summary

Fixed backend availability checking logic in FP8 attention tests (test_mha_fp8_vs_f16 and test_dpa_fp8_vs_f16). Previously, F16 reference backend availability was only checked when fp8_dpa_bwd=False, which could cause test failures when fp8_dpa_bwd=True if the F16 backend wasn't available.

Key changes:

  • Both tests now unconditionally check availability of FP8 and F16 backends using two separate calls to get_available_attention_backends
  • Improved variable naming: fused_attn_supported split into fused_attn_supported_fp8 and fused_attn_supported_f16 for clarity
  • Test execution and comparisons properly gated by checking which backends are actually available
  • More descriptive skip message: "No reference backend available" when F16 backend is missing
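The corrected gating described above can be distilled into a small decision function. This is an illustrative sketch only; the actual logic lives in tests/pytorch/attention/test_attention.py and derives both flags from separate calls to get_available_attention_backends.

```python
# Hypothetical distillation of the fixed skip logic: the FP8 backend is
# required to run the test at all, and the F16 reference backend is now
# checked unconditionally (no longer only when fp8_dpa_bwd is False).
def decide_test_plan(fused_attn_supported_fp8: bool,
                     fused_attn_supported_f16: bool) -> str:
    if not fused_attn_supported_fp8:
        return "skip: FP8 fused attention backend unavailable"
    if not fused_attn_supported_f16:
        return "skip: No reference backend available"
    return "run: compare FP8 vs F16"

print(decide_test_plan(True, False))
print(decide_test_plan(True, True))
```

With both flags checked up front, the FP8-vs-F16 comparison only runs when a reference result actually exists, which matches the skip message quoted in the summary.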

Confidence Score: 5/5

  • This PR is safe to merge with no risk
  • Test-only changes that fix incorrect skip logic without modifying any production code or backend implementation. The changes ensure proper backend availability checking and correct test gating
  • No files require special attention

Important Files Changed

Filename Overview
tests/pytorch/attention/test_attention.py Fixed backend availability checks to always verify both FP8 and F16 backends regardless of fp8_dpa_bwd flag, ensuring proper test execution and skipping

Last reviewed commit: 229b1ef


@greptile-apps greptile-apps bot left a comment


1 file reviewed, no comments

Edit Code Review Agent Settings | Greptile

@cyanguwa
Collaborator Author

/te-ci pytorch L3

Quoted diff context (tests/pytorch/attention/test_attention.py):

    )
    _, fused_attn_supported, _ = available_backends
    if not fused_attn_supported:
    _, fused_attn_supported_f16, _ = available_backends
Collaborator


So if fused_attn_supported_fp8 is True and fp8_dpa_bwd is False, then fused_attn_supported_f16 can be either False or True, right?
If fused_attn_supported_f16 is False, there is no FP16 reference to compare the FP8 results against, so we skip;
but if fused_attn_supported_f16 is True, we compare FP8 and FP16 forward only.

Now, if fp8_dpa_bwd is True, then fused_attn_supported_f16 will always be False.
In that case, what does FP8 get compared to? Maybe I missed it, but I see no logic for this comparison; could you point me to it?

Collaborator Author


Makes sense. I removed the if not fp8_bwd logic. Could you please take a look at 229b1ef?
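The guard removal discussed in this thread can be illustrated with a before/after sketch. The availability helper is stubbed here; only the control flow mirrors the PR, and all names are taken from the discussion above, not from the actual test file.

```python
# Stub standing in for get_available_attention_backends(...); the real helper
# returns a (flash, fused, unfused) availability tuple for a given config.
def availability(config):
    return {"fp8": (False, True, False), "f16": (False, True, False)}[config]

def check_before(fp8_dpa_bwd):
    # Old logic: F16 reference availability was only probed when
    # fp8_dpa_bwd was False, so it read as unavailable otherwise.
    _, fused_attn_supported_fp8, _ = availability("fp8")
    fused_attn_supported_f16 = False
    if not fp8_dpa_bwd:
        _, fused_attn_supported_f16, _ = availability("f16")
    return fused_attn_supported_fp8, fused_attn_supported_f16

def check_after(fp8_dpa_bwd):
    # Fixed logic: both backends are probed unconditionally.
    _, fused_attn_supported_fp8, _ = availability("fp8")
    _, fused_attn_supported_f16, _ = availability("f16")
    return fused_attn_supported_fp8, fused_attn_supported_f16

print(check_before(True))  # reference flag wrongly False under old guard
print(check_after(True))   # both flags True after the fix
```

This is exactly the case the reviewer raised: with fp8_dpa_bwd=True, the old code never learned whether the F16 reference backend existed, so the comparison path could not be gated correctly.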

Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
@cyanguwa
Collaborator Author

/te-ci pytorch L3


@KshitijLakhani KshitijLakhani left a comment


LGTM! Thanks!

@cyanguwa
Collaborator Author

Pipeline 44991844.

@cyanguwa cyanguwa merged commit 3ecb5bf into NVIDIA:main Feb 28, 2026
21 of 26 checks passed
KshitijLakhani pushed a commit that referenced this pull request Feb 28, 2026
* fix L3 FA fp8 tests

Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix skip logic based on reference backend

Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>

---------

Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
