beam
https://github.com/apache/beam
Java
Apache Beam is a unified programming model for Batch and Streaming data processing.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported35 Subscribers
View all SubscribersAdd a CodeTriage badge to beam
Help out
- Issues
- Make it easier to run seed job against open PRs
- ScopedState can not be re-entered (regression caused by Batched DoFn worker changes)
- Optimize precombining table.
- Create BigtableIO setting to report external system throttle time to Dataflow
- [Feature Request]: Allow project to be passed to expansion services.
- Add an expansion service to Go SDK
- Create MemcachedIO
- [Bug]: Go SDK registration panics on anonymous structs.
- Remove mypy ignore line from apache_beam/ml/inference/base.py once Dataclass is replaced with NamedTuple
- [Task]: grouping on categorical columns should not require Singleton partitioning
- Docs
- Java not yet supported