Korean characters breaking when downloading Genie results as CSV

Open the CSV file from within Excel instead to set the correct encoding.

Written by vidya.sagamreddy

Last published at: July 22nd, 2025

Problem

When using Genie in Databricks to extract SQL query results and download them as CSV files, you notice Korean characters appear broken or garbled when the file is opened in Microsoft Excel. 

 

Cause

The CSV files you download from Genie into Excel are encoded in UTF-8, but are missing the BOM (Byte Order Mark) encoding. Excel by default uses a different default encoding, leading to the misinterpretation of Korean characters.

 

Solution

Open the CSV file from within Excel instead to set the correct encoding. 

  1. Launch Excel.
  2. Navigate to the Data tab.
  3. Click on From Text/CSV
  4. Select the CSV file you downloaded from Genie.
  5. In the data preview window, under File Origin, select 65001: Unicode (UTF-8).
  6. Ensure the delimiter is correctly identified (usually a comma).
  7. Click Load to import the data.