lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
2 Subscribers
Help out
- Issues
  - [WAITING FOR PL CORE MAINTAINER OPINION] Bugfix/17958 multi optimizer step count behaviour
  - Validation runs only for one iteration when restarting from checkpoint mid-epoch, wrongly reporting validation loss
  - NCCL when trying to train on 2 nodes
  - DDP training timeout
  - FSDP hybrid shard should checkpoint in a single node
  - Gather tensors of unequal shapes
  - "backward pass is invalid for module in evaluation mode" with deepspeed stage 3
  - FSDP checkpointing uses deprecated APIs with PyTorch 2.2
  - `batch_sampler.batch_size` is None with deepspeed and `DataLoader(batch_size=None)`
  - How to log artifacts on rank > 0?
- Docs
  - Python not yet supported