Databricks Help Center

Job execution

These articles can help you tune and troubleshoot Apache Spark job execution.

15 Articles in this category

Increase the number of tasks per stage

Learn how to increase the number of tasks per stage when using the spark-xml package with Databricks....

Last updated: May 11th, 2022 by Adam Pavlacka

Maximum execution context or notebook attachment limit reached

Learn what to do when the maximum execution context or notebook attachment limit is reached in Databricks....

Last updated: May 15th, 2023 by rakesh.parija

Serialized task is too large

Learn what to do when a serialized task is too large in Databricks....

Last updated: March 15th, 2023 by Adam Pavlacka

Members of a Gmail group email not receiving notifications

Allow external entities to email the group inbox. ...

Last updated: September 18th, 2024 by walter.camacho

Apache Spark jobs failing due to stage failure when using spot instances in a cluster

Use on-demand nodes instead of spot instances. ...

Last updated: November 26th, 2024 by Vidhi Khaitan

Broadcast join hash not being used despite hints

Refresh table statistics or use supported joins that allow for broadcast join....

Last updated: January 25th, 2025 by swetha.nandajan

Recurring error “Unable to get field from serde” when trying to perform operations on table

Verify your metadata using the Glue API, then recreate or update the table with the correct metadata. ...

Last updated: January 25th, 2025 by swetha.nandajan

Error compressed buffer size exceeds 2 GB when saving data

Set the Apache Spark configs to increase the frequency of row group size checks....

Last updated: January 29th, 2025 by manikandan.ganesan

Error when trying to use RDD code in shared clusters

Use a single-user cluster, which supports RDD functionality....

Last updated: January 31st, 2025 by mounika.tarigopula

File corruption error on Apache Spark Streaming jobs during file processing in DBFS

Replace dbutils.fs operations with Hadoop filesystem methods. ...

Last updated: January 31st, 2025 by swetha.nandajan

Jobs failing at data shuffle stage with error org.apache.spark.shuffle.FetchFailedException

Analyze the shuffle data distribution across executors and join query strategies....

Last updated: January 31st, 2025 by swetha.nandajan

Apache Spark job output only giving the first JSON object instead of all records

Add a line break between consecutive JSON objects or use Photon....

Last updated: January 31st, 2025 by swetha.nandajan

DAB job parameters not passing in correctly on the task level

Correct the syntax....

Last updated: February 26th, 2025 by daniel.ruiz

Recurring Apache Spark jobs with same data set size and cluster configuration vary in duration

Build your cluster with sufficient SSD memory, monitor your cluster’s disk usage, and optimize data storage....

Last updated: March 12th, 2025 by John Benninghoff

Resolve invalid cast input error on serverless compute

Set spark.sql.ansi.enabled to false to resolve casting errors on serverless compute....

Last updated: April 16th, 2025 by anudeep.konaboina

© Databricks 2022-2025. All rights reserved. Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache Software Foundation.
