beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported32 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- [Bug]: worker failing on Dataflow runner v2 (Failed to handle for url)
- [Bug]: Documentation update: Correct parameter name for running YAML pipeline
- [Bug]: Correct unsuscribe emails (dev and user mailing lists)
- [Bug]: Tour examples are not running
- [Bug]: year 0 is out of range when connecting to MongoDB Atlas (SDK 2.53.0)
- [Bug]: Mixed data types in _id field causing pipeline failures (SDK 2.53.0)
- [Bug]: apache.calcite CannotPlanException: All the inputs have relevant nodes, however the cost is still infinite.
- [Bug]: apache_beam.dataframe.convert.to_pcollection() fails on deferred dataframe of csv with only header row
- [Bug]: Apache Beam SqlTransform does not process data distributed. It doesn't use multiple workers."
- [Bug]: Cannot transform multiple PCollections to Dataframe using the apache_beam.dataframe.transforms.DataframeTransform function
- Docs
- Java not yet supported