transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
39 Subscribers
Help out
- Issues
- Fix hardcoded `float` dtypes in DeBERTa model, which caused multiple RuntimeErrors in `bfloat16`
- Option to Disable Model Caching When Using "pipeline"
- Fix syntax in HfQuantizer docstring
- [DON'T MERGE] Commont bot CI for other jobs (`generation` / `quantization`)
- Deepseek v2
- MPI environment variables are not set.
- Create Zero-Delay_QKV_Compression.md
- [Question] Why doesn't `trainer.state.epoch` fall round after training?
- Add support for DeepSpeed sequence parallelism (Ulysses)
- Multi-GPU training crashes with IterableDataset and different length input (e.g. Next token prediction)
- Docs
- Python not yet supported