beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported49 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- [Task]: Add new Python SchemaTransform multi-language documentation
- [docs] Add new Python multi-lang quickstart using the SchemaTransform framework
- [Bug]: FileIO.matchAll() injects a Reshuffle step which in some case is not useful and might break desirable fusion with more CPU intensive steps
- [Bug]: Beam example repo warning log while launching on Dataflow
- [Feature Request]: Hudi Table support / IO Connector for HoodieIO
- [Bug]: TypeScript xlang ValueError: numpy.dtype size changed
- [Bug]: GroupByKey() emits only key but not values for streaming job on a FixedWindows() with early trigger
- [Bug]: Python error just in importing apache-beam line within docker image
- [Feature Request]: requester-pays flag for GCS storage
- [Feature Request][RRIO]: Enable BackOffSupplier configuration
- Docs
- Java not yet supported