modin
https://github.com/modin-project/modin
Python
Modin: Scale your Pandas workflows by changing a single line of code
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
5 Subscribers
Help out
- Issues
  - BUG: passing in large data object to DataFrame apply() function resulting in SegFault
  - Modin on Ray cluster with object spilling: process was killed when loading parquet file
  - BUG: apply contains extra columns after goupby and selected columns
  - read_csv(): Add heuristic for the number of partitions calculation when loading from CSV
  - PERF: do window/rolling functions block-wise instead of column-wise
  - After upgrade to pandas 2.1.0 `skew` results are different
  - Feature Request: Integration of Delta Lake Kernel with Modin for Advanced DataFrame Operations
  - PERF: query() and eval() perform poorly for queries that are not embarrassingly parallel row-wise
  - Incorrect condition in test_io masks Modin bug
  - Consider making `modin.pandas` an effective alias for `pandas` in a worker process
- Docs
  - Python not yet supported