accelerate
https://github.com/huggingface/accelerate
Python
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported1 Subscribers
Add a CodeTriage badge to accelerate
Help out
- Issues
- PyPI published Accelerate==1.1.0 is missing Source Distributions
- ConnectionError: Tried to launch distributed communication on port `29401`, but another process is utilizing it. Please specify a different port (such as using the `--main_process_port` flag or specifying a different `main_process_port` in your config file) and rerun your script. To automatically use the next open port (on a single node), you can set this to `0`.
- create _preprare_fsdp to pre- prepare fsdp model training
- How could I convert ZeRO-0 deepspeed weights into fp32 model checkpoint?
- fix load_state_dict for npu
- Multiple node inference
- The optimizer is not receiving the FSDP model parameters.
- Multinode, multigpu example fails
- Problem with metrics calculation and dataloader
- Command line arguments related to deepspeed for `accelerate launch` do not override those of `default_config.yaml`
- Docs
- Python not yet supported