beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported33 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- Migrate GKE cluster off basic authentication
- All beam examples should get continuously exercised
- BigQueryIO failed to load data to temp table when withSchemaUpdateOptions is set.
- Get the latest GitHub Actions runners version and update the self-hosted runners
- Update to_dataframe API Docs to focus on schema use
- Python SDK BigQuery `schemaUpdateOptions`
- Call the stored procedure in dataflow pipeline
- Integrate TFRecord/tf.train.Example with Beam Schemas and the DataFrame API
- Python version of schema docs missing examples
- Introduce Union type coder
- Docs
- Java not yet supported