-
Notifications
You must be signed in to change notification settings - Fork 1.3k
fix(ci): Failures on Main: Compiler GPU Tests, Tokenizer, and JsonArgparse Issues #2149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
️✅ There are no secrets present in this pull request anymore.If these secrets were true positive and are still valid, we highly recommend you to revoke them. 🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request. |
for more information, see https://pre-commit.ci
…fic directory for each test run
…equirements for thunder
…inclusion of transformers
|
@lantiga needs to update the branch protection setting and replace Azure-gpu by lit-gpu |

What does this pr do ?
This PR addresses and stabilizes multiple CI failures currently affecting the main branch.
Key Fixes
Partial fix for
jsonargparse==4.42.0causingtest_config_helpfailure #2150 — temporary fix for errors caused byjsonargparse.Fix compiler GPU tests:
test_networks.pyby aligning with its dependency pintransformers==4.52.4(see: https://github.com/Lightning-AI/lightning-thunder/blob/main/requirements/test.txt).turn_off_tf32_and_set_seed, required by Thunder’s network tests.Fix tokenizer tests (
test_tokenizer_against_hf) failing due to429 Too Many Requestsby introducing reruns with delay for stability.Add
TORCH_URLto download cpu based packages for cpu tests which should resolve any space related issues (eg: https://github.com/Lightning-AI/litgpt/actions/runs/19007773138/job/54284274578?pr=2149)closes #2143
Notes
All GPU tests are now passing.
Only a minor issue remains to be addressed in the CPU test suite.