spark
https://github.com/apache/spark
Scala
Apache Spark - A unified analytics engine for large-scale data processing
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Scala not yet supported76 Subscribers
View all SubscribersAdd a CodeTriage badge to spark
Help out
- Issues
- [SPARK-48309][YARN]Stop am retry, in situations where some errors and retries may not be successful
- [SPARK-43815][SQL] Wrap NPE with AnalysisException in CSV options
- [SPARK-48322][SPARK-42965][SQL][CONNECT][PYTHON] Drop internal metadata in `DataFrame.schema`
- [SPARK-48323][SQL] DB2: Map BooleanType to BOOLEAN instead of CHAR(1)
- [SPARK-48324][SQL] Codegen Support for `hll_sketch_estimate` and `hll_union`
- [WIP][SQL] Enable hash aggregation support for all collations (StringType)
- [SPARK-48325][CORE] Always specify messages in ExecutorRunner.killProcess
- [SPARK-48314][SQL] Don't double cache files for FileStreamSource using Trigger.AvailableNow
- [SPARK-48320][CORE][DOCS] Add external third-party ecosystem access guide to the doc
- [SPARK-47818][CONNECT][FOLLOW-UP] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests
- Docs
- Scala not yet supported