spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-49670][SQL] Enable trim collation for all passthrough expressions
- [SPARK-50206][SQL] Added separate collation id for UTF8_BINARY and non-collated strings
- [SPARK-49565][SQL] Add SQL pipe syntax for the FROM operator
- [SPARK-50188][CONNECT][PYTHON] When the connect client starts, print the server's webUrl
- [SPARK-50160][SQL][KAFKA] KafkaWriteTask: allow customizing record timestamp
- [WIP] Added Logistic Matrix Factorization(LMF) and Item2Vec models
- [SPARK-50157][SQL] Using SQLConf provided by SparkSession first.
- [SPARK-50137][HIVE] Avoid fallback to Hive-incompatible ways when table creation fails by thrift exception
- [TYPING] Add type overloads for inplace dataframe operations
- [SPARK-50005][SQL] Enhance method verifyNotReadPath to identify subqueries hidden in the filter conditions.
- Docs
- Scala not yet supported