dagster
https://github.com/dagster-io/dagster
Python
An orchestration platform for the development, production, and observation of data assets.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
4 Subscribers
Help out
- Issues
  - Configure retry strategy at instance level
  - [docs] - concepts overview revamp
  - Graphs, jobs ops guide says its 'out of date' but link to updated docs is broken
  - Mention spark installation in pyspark examples
  - Downstream asset OPs should run only after the upstream asset is fully materialised without passing the upstream asset as arguments to downstream OPs
  - Augment hooks for custom metrics creation
  - Upstream data missing warning during asset materialisation does not occur when partitioning time window is different
  - Allow jobs to be constructed from assets with differing partition definitions
  - Allow alteration of config when reexecuting
  - Specify Op Selection from the CLI
- Docs
  - Python not yet supported