spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported
76 Subscribers
Help out
- Issues
- [SPARK-49479][CORE] Use daemon ScheduledThreadPoolExecutor for BarrierCoordinator
- [SPARK-49485][CORE] Fix speculative task hang bug due to remaining executor lay on same host
- [SPARK-49482][SQL] Refactor V2 parquet datasource
- [SPARK-46895][CORE][3.5] Replace Timer with single thread scheduled executor
- [SPARK-49469][CONNECT][TESTS] Add dummy protos for expression and datatype
- [SPARK-48486][SQL] To solve the issue of generating excessively large execution plans when encountering multiple levels of subqueries while enabling DynamicPartitionPruning.
- [WIP][SPARK-49401][BUILD] Upgrade `checkstyle` to 10.18.0 & `scalafmt` to 3.8.3
- [SPARK-45278][YARN] Support executor bind address in Yarn executors
- [SPARK-48788][CORE][UI] Expose task peak onheap/offheap execution memory to API and Spark UI
- [SPARK-49386][SPARK-27734][CORE][SQL] Add memory based thresholds for shuffle spill
- Docs
- Scala not yet supported