transformers
https://github.com/huggingface/transformers
Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported35 Subscribers
View all SubscribersAdd a CodeTriage badge to transformers
Help out
- Issues
- from_pretrained's torch_dtype "auto" mode doesn't handle nested models
- ValueError: Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=True' to have batched tensors with the same length
- force conversion for Llama when `legacy` is set to `False`
- ../aten/src/ATen/native/cuda/Indexing.cu:1289: indexSelectLargeIndex: block: [267,0,0], thread: [25,0,0] Assertion `srcIndex < srcSelectDimSize` failed.
- Add Molmo (7B-D, 7B-O, 70B)
- Jitter Noise added to input being passed to experts in Switch Transformers
- Add DBRX GGUF Support
- Request for Iterative Generation in Pipeline (e.g., LLaMA model)
- Add training support to Meta's EnCodec
- Flash attention 2 support for PaliGemma model
- Docs
- Python not yet supported