beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported
32 Subscribers
Help out
- Issues
- [Bug]: year 0 is out of range when connecting to MongoDB Atlas (SDK 2.53.0)
- [Bug]: Mixed data types in _id field causing pipeline failures (SDK 2.53.0)
- [Bug]: apache.calcite CannotPlanException: All the inputs have relevant nodes, however the cost is still infinite.
- [Bug]: apache_beam.dataframe.convert.to_pcollection() fails on deferred dataframe of csv with only header row
- [Bug]: Apache Beam SqlTransform does not process data distributed. It doesn't use multiple workers.
- [Bug]: Cannot transform multiple PCollections to Dataframe using the apache_beam.dataframe.transforms.DataframeTransform function
- [RRIO]: Create and update website examples using RequestResponseIO using the Python SDK
- [RRIO]: Implement RequestResponseIO using the Go SDK
- adding extra_header causing issue while reading GCS file in colab
- [Task]: Use GCP-BOM to manage google cloud dependencies in sdks/java/extensions/ml
- Docs
- Java not yet supported