lightning
https://github.com/lightning-ai/lightning
Python
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported2 Subscribers
Add a CodeTriage badge to lightning
Help out
- Issues
- Checkpoint every_n_steps reruns epoch on restore
- Multi-node Training with DDP stuck at "Initialize distributed..." on SLURM cluster
- Support wandb_logger.watch() when using LightningCLI
- Enable batch size finder for distributed strategies
- Turn off hpc checkpoint saving in SLURM environment if trainer.fit(..., ckpt_path="last")
- Support GAN based model training with deepspeed which need to setup fabric twice
- SaveConfigCallback.save_config is conflict with DDP
- Support `ThunderModule` models
- When calling trainer.test() train_dataloader is also validated, which makes no sense
- Log `TensorBoard` histograms
- Docs
- Python not yet supported