SQL with Apache Spark
These articles can help you to use SQL with Apache Spark.
- Broadcast join exceeds threshold, returns out of memory error
- Cannot grow
BufferHolder
; exceeds size limitation - Date functions only accept int values in Apache Spark 3.0
- Disable broadcast when query plan has
BroadcastNestedLoopJoin
- Duplicate columns in the metadata error
- Generate unique increasing numeric values
- Error in SQL statement:
AnalysisException: Table or view not found
- Error when downloading full results after join
- Error when running
MSCK REPAIR TABLE
in parallel - Find the size of a table
- Inner join drops records in result
- Data is incorrect when read from Snowflake
- JDBC write fails with a
PrimaryKeyViolation
error - Query does not skip header row on external table
SHOW DATABASES
command returns unexpected column name