beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported35 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- BigQueryIO : control StorageWrite parallelism in batch, by reshuffling before write on the number of streams set for BigQueryIO.write() using .withNumStorageWriteApiStreams(numStorageWriteApiStreams)
- Update CHANGES.md to include streaming NPE issue
- Bump avro, fix CVE-2024-47561
- Fix Maybereshuffle
- Integrate direct path with StreamingDataflowWorker code path
- add shutdown and start mechanics to windmill streams
- Proposal: Propagate gcs-connector options to GcsUtil
- [Feature Request]: Provide the Python equivalent to Java's BigQueryIO `writeProtos`
- SVN removed from GitHub ubuntu-latest causing build_release_candidate workflow failure
- [Managed Iceberg] Use table file size property if exists
- Docs
- Java not yet supported