beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported33 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- FnRunnerTest with non-trivial (order 1000 elements) numpy input flakes in non-cython environment
- --dataflowServiceOptions=use_runner_v2 is broken
- [Bug]: RedisIO#readKeyPatterns failing with OutOfMemory on version 2.39.0
- [Feature Request]: Use unique id for Python BigQueryIO
- [Task]: Update BigQueryIO.setTriggeringFrequency documentation
- [Task]: Eliminate deprecated finalize method (overrides Object.finalize) throughout Java SDK
- [Bug]: No parallelism using WriteToParquet in Apache Spark
- Tracking issue for gaps in Python Direct Runner streaming implementation
- [Feature Request]: Add support for BigQuery structs in BigQueryIO with BEAM_ROW output type
- [Feature request] Consider installing the same artifact in Beam Python RC Docker containers as is being published to PyPi
- Docs
- Java not yet supported