Databricks SQL CLI
The Databricks SQL CLI is not in active development.
This article covers the Databricks SQL CLI, which is provided as-is and is not supported by Databricks through customer technical support channels. Questions and feature requests can be communicated through the Issues page of the databricks/databricks-sql-cli repo on GitHub.
The Databricks SQL command line interface (Databricks SQL CLI) enables you to run SQL queries on your existing Databricks SQL warehouses from your terminal or Windows Command Prompt instead of from locations such as the Databricks SQL editor or a Databricks notebook. From the command line, you get productivity features such as suggestions and syntax highlighting.
Requirements
- At least one Databricks SQL warehouse. Create a warehouse, if you do not already have one.
- Python 3.7 or higher. To check whether you have Python installed, run the command python --versionfrom your terminal or Command Prompt. (On some systems, you may need to enterpython3instead.) Install Python, if you do not have it already installed.
- pip, the package installer for Python. Newer versions of Python install pipby default. To check whether you havepipinstalled, run the commandpip --versionfrom your terminal or Command Prompt. (On some systems, you may need to enterpip3instead.) Install pip, if you do not have it already installed.
- (Optional) A utility for creating and managing Python virtual environments, such as venv. Virtual environments help to ensure that you are using the correct versions of Python and the Databricks SQL CLI together. Setting up and using virtual environments is outside of the scope of this article. For more information, see Creating Virtual Environments.
Install the Databricks SQL CLI
After you meet the requirements, install the Databricks SQL CLI package from the Python Packaging Index (PyPI). You can use pip to install the Databricks SQL CLI package from PyPI by running pip with one of the following commands.
pip install databricks-sql-cli
# Or...
python -m pip install databricks-sql-cli
To upgrade a previously installed version of the Databricks SQL CLI, run pip with one of the following commands.
pip install databricks-sql-cli --upgrade
# Or...
python -m pip install databricks-sql-cli --upgrade
To check your installed version of the Databricks SQL CLI run pip with one of the following commands.
pip show databricks-sql-cli
# Or...
python -m pip show databricks-sql-cli
Authentication
To authenticate, you must provide the Databricks SQL CLI with your warehouse's connection details. Specifically, you need the Server hostname and HTTP path values. You must also product the Databricks SQL CLI with the proper authentication credentials.
The Databricks SQL CLI supports Databricks personal access tokens (PATs). To use Databricks PAT authentication, you must create a personal access token. For details on this process, see Authenticate with Databricks personal access tokens (legacy).
You can provide this authentication information to the Databricks SQL CLI in several ways:
- In the dbsqlclircsettings file in its default location (or by specifying an alternate settings file through the--clircoption each time you run a command with the Databricks SQL CLI). See Settings file.
- By setting the DBSQLCLI_HOST_NAME,DBSQLCLI_HTTP_PATHandDBSQLCLI_ACCESS_TOKENenvironment variables. See Environment variables.
- By specifying the --hostname,--http-path, and--access-tokenoptions each time you run a command with the Databricks SQL CLI. See Command options.
The dbsqlclirc settings file must be present, even if you set the preceding environment variables or specify the preceding command options or both.
Whenever you run the Databricks SQL CLI, it looks for authentication details in the following order and stops when it finds the first set of details:
- The --hostname,--http-path, and--access-tokenoptions.
- The DBSQLCLI_HOST_NAME,DBSQLCLI_HTTP_PATHandDBSQLCLI_ACCESS_TOKENenvironment variables.
- The dbsqlclircsettings file in its default location (or an alternate settings file specified by the--clircoption).
Settings file
To use the dbsqlclirc settings file to provide the Databricks SQL CLI with authentication details for your Databricks SQL warehouse, run the Databricks SQL CLI for the first time, as follows:
dbsqlcli
The Databricks SQL CLI creates a settings file for you, at ~/.dbsqlcli/dbsqlclirc on Unix, Linux, and macOS, and at %HOMEDRIVE%%HOMEPATH%\.dbsqlcli\dbsqlclirc or %USERPROFILE%\.dbsqlcli\dbsqlclirc on Windows. To customize this file:
- 
Use a text editor to open and edit the dbsqlclircfile.
- 
Scroll to the following section: # [credentials]
 # host_name = ""
 # http_path = ""
 # access_token = ""
- 
Remove the four #characters, and:- 
Next to host_name, enter your warehouse's Server hostname value from the requirements between the""characters.
- 
Next to http_path, enter your warehouse's HTTP path value from the requirements between the""characters.
- 
Next to access_token, enter your personal access token value from the requirements between the""characters.
 For example: [credentials]
 host_name = "12345678901234567.8.gcp.databricks.com"
 http_path = "/sql/1.0/warehouses/1abc2d3456e7f890a"
 access_token = "dapi12345678901234567890123456789012"
- 
- 
Save the dbsqlclircfile.
Alternatively, instead of using the dbsqlclirc file in its default location, you can specify a file in a different location by adding the --clirc command option and the path to the alternate file. That alternate file's contents must conform to the preceding syntax.
Environment variables
To use the DBSQLCLI_HOST_NAME, DBSQLCLI_HTTP_PATH, and DBSQLCLI_ACCESS_TOKEN environment variables to provide the Databricks SQL CLI with authentication details for your Databricks SQL warehouse, do the following:
- Unix, Linux, and macOS
- Windows
To set the environment variables for only the current terminal session, run the following commands. To set the environment variables for all terminal sessions, enter the following commands into your shell's startup file and then restart your terminal. In the following commands, replace the value of:
- DBSQLCLI_HOST_NAMEwith your warehouse's Server hostname value from the requirements.
- DBSQLCLI_HTTP_PATHwith your warehouse's HTTP path value from the requirements.
- DBSQLCLI_ACCESS_TOKENwith your personal access token value from the requirements.
export DBSQLCLI_HOST_NAME="12345678901234567.8.gcp.databricks.com"
export DBSQLCLI_HTTP_PATH="/sql/1.0/warehouses/1abc2d3456e7f890a"
export DBSQLCLI_ACCESS_TOKEN="dapi12345678901234567890123456789012"
To set the environment variables for only the current Command Prompt session, run the following commands, replacing the value of:
- DBSQLCLI_HOST_NAMEwith your warehouse's Server hostname value from the requirements.
- DBSQLCLI_HTTP_PATHwith your warehouse's HTTP path value from the requirements.
- DBSQLCLI_ACCESS_TOKENwith your personal access token value from the requirements.:
set DBSQLCLI_HOST_NAME="12345678901234567.8.gcp.databricks.com"
set DBSQLCLI_HTTP_PATH="/sql/1.0/warehouses/1abc2d3456e7f890a"
set DBSQLCLI_ACCESS_TOKEN="dapi12345678901234567890123456789012"
To set the environment variables for all Command Prompt sessions, run the following commands and then restart your Command Prompt, replacing the value of:
- DBSQLCLI_HOST_NAMEwith your warehouse's Server hostname value from the requirements.
- DBSQLCLI_HTTP_PATHwith your warehouse's HTTP path value from the requirements.
- DBSQLCLI_ACCESS_TOKENwith your personal access token value from the requirements.
setx DBSQLCLI_HOST_NAME "12345678901234567.8.gcp.databricks.com"
setx DBSQLCLI_HTTP_PATH "/sql/1.0/warehouses/1abc2d3456e7f890a"
setx DBSQLCLI_ACCESS_TOKEN "dapi12345678901234567890123456789012"
Command options
To use the --hostname, --http-path, and --access-token options to provide the Databricks SQL CLI with authentication details for your Databricks SQL warehouse, do the following:
Do the following every time you run a command with the Databricks SQL CLI:
- Specify the --hostnameoption and your warehouse's Server hostname value from the requirements.
- Specify the --http-pathoption and your warehouse's HTTP path value from the requirements.
- Specify the --access-tokenoption and your personal access token value from the requirements.
For example:
dbsqlcli -e "SELECT * FROM default.diamonds LIMIT 2" \
--hostname "12345678901234567.8.gcp.databricks.com" \
--http-path "/sql/1.0/warehouses/1abc2d3456e7f890a" \
--access-token "dapi12345678901234567890123456789012"
Query sources
The Databricks SQL CLI enables you to run queries in the following ways:
- From a query string.
- From a file.
- In a read-evaluate-print loop (REPL) approach. This approach provides suggestions as you type.
Query string
To run a query as a string, use the -e option followed by the query, represented as a string. For example:
dbsqlcli -e "SELECT * FROM default.diamonds LIMIT 2"
Output:
_c0,carat,cut,color,clarity,depth,table,price,x,y,z
1,0.23,Ideal,E,SI2,61.5,55,326,3.95,3.98,2.43
2,0.21,Premium,E,SI1,59.8,61,326,3.89,3.84,2.31
To switch output formats, use the --table-format option along with a value such as ascii for ASCII table format, for example:
dbsqlcli -e "SELECT * FROM default.diamonds LIMIT 2" --table-format ascii
Output:
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| _c0 | carat | cut     | color | clarity | depth | table | price | x    | y    | z    |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| 1   | 0.23  | Ideal   | E     | SI2     | 61.5  | 55    | 326   | 3.95 | 3.98 | 2.43 |
| 2   | 0.21  | Premium | E     | SI1     | 59.8  | 61    | 326   | 3.89 | 3.84 | 2.31 |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
For a list of available output format values, see the comments for the table_format setting in the dbsqlclirc file.
File
To run a file that contains SQL, use the -e option followed by the path to a .sql file. For example:
dbsqlcli -e my-query.sql
Contents of the example my-query.sql file:
SELECT * FROM default.diamonds LIMIT 2;
Output:
_c0,carat,cut,color,clarity,depth,table,price,x,y,z
1,0.23,Ideal,E,SI2,61.5,55,326,3.95,3.98,2.43
2,0.21,Premium,E,SI1,59.8,61,326,3.89,3.84,2.31
To switch output formats, use the --table-format option along with a value such as ascii for ASCII table format, for example:
dbsqlcli -e my-query.sql --table-format ascii
Output:
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| _c0 | carat | cut     | color | clarity | depth | table | price | x    | y    | z    |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| 1   | 0.23  | Ideal   | E     | SI2     | 61.5  | 55    | 326   | 3.95 | 3.98 | 2.43 |
| 2   | 0.21  | Premium | E     | SI1     | 59.8  | 61    | 326   | 3.89 | 3.84 | 2.31 |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
For a list of available output format values, see the comments for the table_format setting in the dbsqlclirc file.
REPL
To enter read-evaluate-print loop (REPL) mode scoped to the default database, run the following command:
dbsqlcli
You can also enter REPL mode scoped to a specific database, by running the following command:
dbsqlcli <database-name>
For example:
dbsqlcli default
To exit REPL mode, run the following command:
exit
In REPL mode, you can use the following characters and keys:
- Use the semicolon (;) to end a line.
- Use F3 to toggle multiline mode.
- Use the spacebar to show suggestions at the insertion point, if suggestions are not already displayed.
- Use the up and down arrows to navigate suggestions.
- Use the right arrow to complete the highlighted suggestion.
For example:
dbsqlcli default
hostname:default> SELECT * FROM diamonds LIMIT 2;
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| _c0 | carat | cut     | color | clarity | depth | table | price | x    | y    | z    |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
| 1   | 0.23  | Ideal   | E     | SI2     | 61.5  | 55    | 326   | 3.95 | 3.98 | 2.43 |
| 2   | 0.21  | Premium | E     | SI1     | 59.8  | 61    | 326   | 3.89 | 3.84 | 2.31 |
+-----+-------+---------+-------+---------+-------+-------+-------+------+------+------+
2 rows in set
Time: 0.703s
hostname:default> exit
Logging
The Databricks SQL CLI logs its messages to the file ~/.dbsqlcli/app.log by default. To change this file name or location, change the value of the log_file setting in the dbsqlclirc settings file.
By default, messages are logged at the INFO log level and below. To change this log level, change the value of the log_level setting in the dbsqlclirc settings file. Available log level values include CRITICAL, ERROR, WARNING, INFO, and DEBUG and are evaluated in that order. NONE disables logging.