beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported33 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- [Feature Request]: FileIO should limit writes to GCS when throttling is detected.
- [Bug]: Beam SDK is not visible to the backend code in SCIO
- Replace bigquery_v2_client with Google Cloud BigQuery client (and associated libraries)
- Replace dataflow_v1b3_client with a Google Cloud Python client
- Replace cloudbuild_v1_client with Google Cloud Build client
- [Task]: Add UseDataStreamForBatch option to the Flink runner.
- [Bug]: BigQueryIO got JSON parsing error: No such field when there is no field, or field is empty string
- [Feature Request]: Document that file-based sinks return a PCollection of output filenames
- [Feature Request]: Add retry mechanism to Flink DoFnOperator on failing to process element
- [Feature Request]: Support BigQuery column level encryption with Cloud KMS
- Docs
- Java not yet supported