pytorch-lightning
https://github.com/pytorchlightning/pytorch-lightning
Python
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
8 Subscribers
Help out
- Issues
  - Support get optimizer and lr_schedulers from deepspeed config
  - RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [68]] is at version 3; expected version 2 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).
  - When doing tuner.scale_batch_size, check full dataset length first
  - Log `TensorBoard` histograms
  - `ckpt_path` in `Trainer` accepts URIs to automatically load checkpoints from remote paths
  - Exception in RecordFunction callback: state_ptr INTERNAL ASSERT FAILED at "../torch/csrc/profiler/standalone/nvtx_observer.cpp":115
  - trainer.fit from checkpoint without performance improvement will break 'last' link to checkpoint on window11
  - Unable to extract confusion matrix as a metric from trainer
  - TensorBoardLogger has the wrong epoch numbers much more than the fact
  - How to incorporate vLLM in Lightning for LLM inference?
- Docs
  - Python not yet supported