deepspeed
https://github.com/microsoft/deepspeed
Python
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
9 Subscribers
Help out
- Issues
  - Error when running DeepSpeed integration test with Rocm 62
  - [BUG] Deepspeed MoE Multi-GPU Expert-Parallel Training/Inference freezes when a process finishes generation first.
  - [DO NOT MERGE] Log test results to file
  - modify_load_save_model
  - [Bug Fix] Support threads_per_head < 64 for wavefront size of 64
  - [REQUEST] Is there any demo or tutorial for Deepspeed pipeline inference?
  - [BUG] failed to find frozen {param} in named params
  - [BUG] Non-Deterministic Model Responses when the Input Prompt Order Changes
  - [REQUEST] Inquiry about code for Domino
  - [REQUEST] Extend `offload_states` to support models with cpu-based optimizer
- Docs
  - Python not yet supported