modin
https://github.com/modin-project/modin
Python
Modin: Scale your Pandas workflows by changing a single line of code
Help out
- Issues
  - Precompute dtypes and columns cache for groupby results
  - Precompute new dtypes for `.cat.codes` result
  - Precompute new dtypes for `.apply` result
  - Reduce the amount of remote calls when reshuffling partitions
  - BUG: `concat` op has different behavior in case of dict of Series and `axis == "index"`
  - UserWarning: Distributing <class 'list'> object. This may take some time. Traceback (most recent call last):
  - REFACTOR: get_dummies: validate parameters at API layer
  - BUG: NumPy specific QC methods should not be protected
  - Support partitioned parquet files
  - BUG: modin.numpy logic operators do not broadcast between 2D and 1D args properly