kedro
https://github.com/kedro-org/kedro
Python
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported1 Subscribers
Add a CodeTriage badge to kedro
Help out
- Issues
- Docs: In CLI reference, add more detail about flags that can be passed to `kedro new` and point to detailed docs
- Update Databricks docs
- Refactor dask deployment documentation to use public methods
- Improve `catalog.list` or alternative for dataset factory?
- Documentation subproject cross-referencing needs to be more sophisticated
- Create how-to documentation for "pipeline reuse"
- Rich add newline to long log message and it cannot be copy and paste directly into terminal
- Ability to set a custom runner globally
- Improve error message when local filepath is not found
- Improve `kedro ipython` tests
- Docs
- Python not yet supported