modin
https://github.com/modin-project/modin
Python
Modin: Scale your Pandas workflows by changing a single line of code
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to modin
Help out
- Issues
- to_parquet() needs option of how many files to create, or like rays implementation: num_rows_per_file
- BUG: outofmemory read from big file and dump to a new one
- [RAY] to_parquet() fails when spilled objects reach 64gig... Also my data is just 40gig
- Possible issue with `dropna(how="all")` not deleting data from partition on ray.
- modin with ray engine hang
- FIX-#7346: Handle execution on Dask workers to avoid creating conflic…
- BUG: Apply on axis=1 causes "daemonic processes are not allowed to have children" on some operations on Dask engine, or launches Ray instance
- BUG: groupby().apply() raise numpy ValueError when Series has multi index
- why so slow compare to dask
- merge not supported
- Docs
- Python not yet supported