modin
https://github.com/modin-project/modin
Python
Modin: Scale your Pandas workflows by changing a single line of code
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to modin
Help out
- Issues
- Add examples to the documentation on readthedocs
- Create Docker builds for child projects (e.g. Modin Spreadsheet)
- Dictionary GroupBy renaming aggregation can't insert group names to the frame (`as_index=False`) in case of aggregation against 'by' columns
- GroupBy with `sort=False` parameter and categorical `by` produces incorrect aggregation results
- FEAT: Add support of s3 buckets for `read_excel`
- Balance partitions load during `read_csv` call with `skiprows` parameter
- Modin's `GroupBy.__getitem__` does not concord with pandas when selection intersects with 'by' columns
- Support `__partitioned__` protocol
- `read_csv` scalability issue with "wide" data sets
- BUG: KeyError when UDF in groupby.apply accesses data from another column partition
- Docs
- Python not yet supported