beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported35 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- Pipeline using GenerateSequence not working with SDF
- ExecutableStageDoFnOperator should fire processing time timers properly instead of draining it directly when close() happens
- Java SDF Unbounded Wrapper doesn't work for RabbitMq finalizeCheckpoint
- datastoreio assumes all entities to have a Key
- Investigate the UX for using private pypi repositories with Beam, document best practices, and identify UX improvements.
- Dataflow's UnboundedReaderIterator should support overriding default bundle size targets
- Support more granular splitting in BigQueryStorageStreamSource
- Issue in release script mass_comment.py
- Dataflow and KafkaIO or KinesisIO has erratic watermark progress due to ReaderCache timeouts
- SDF BoundedSource consumer is ont able to split data from the runner
- Docs
- Java not yet supported