lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported2 Subscribers
Add a CodeTriage badge to lightning
Help out
- Issues
- Error when fast_dev_run=True or num_sanity_val_steps=0 and using torchmetrics MetricTracker
- Fabric: Incorrect `num_replicas` (ddp/fsdp) when number of GPUs on each node is different
- MLFlowLogger fails when logging hyperparameters as Trainer already does automatically
- Is "Prepare a config file for the CLI" out of date?
- MisconfigurationException: Do not set `gradient_accumulation_steps` in the DeepSpeed config
- Dataloader on multi-gpu jobs only surpport to manipulate on local_rank=0, is there a way tom manipulate every device?
- Lightning stalls with 2 GPUs on 1 node with SLURM (and apptainer)
- can't fit with ddp_notebook on a Vertex AI Workbench instance (CUDA initialized)
- WIP: Integrate Collective into strategies
- Using the MLflow logger produces Inconsistent metric plots
- Docs
- Python not yet supported