Skip to content

datalakehouse/dlh-sql-formatter

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3,097 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DLH SQL Formatter NPM version

DLH SQL Formatter is a JavaScript library for pretty-printing SQL queries, maintained by DLH.io. It is a fork of sql-formatter with DLH-specific enhancements.

Looking for the VS Code extension? Install DLH SQL Optimizer for formatting directly in your editor.

What's Different from Upstream?

DLH SQL Formatter builds on the excellent sql-formatter library with the following enhancements:

  • DLH-branded packaging — published as @dlh.io/dlh-sql-formatter on npm for use across DLH products
  • DuckDB support — first-class support for DuckDB dialect
  • Enhanced comma positioning — improved leadingWithSpace comma handling with full comment support
  • AI features — multi-provider (anthropic / openai / gemini / ollama / custom) SQL rewrite and suggestions via @dlh.io/dlh-sql-formatter/ai
  • DLH default rewrite prompt (v1.3.0+) — an opinionated, warehouse-aware system prompt ships as the default for AI rewrites, overridable per call
  • VS Code integration — paired with the DLH SQL Optimizer extension
  • Ongoing upstream sync — bug fixes and improvements from upstream are regularly merged

Supported SQL Dialects

GCP BigQuery, IBM DB2, DuckDB, Apache Hive, MariaDB, MySQL, TiDB, Couchbase N1QL, Oracle PL/SQL, PostgreSQL, Amazon Redshift, SingleStoreDB, Snowflake, Spark, SQL Server Transact-SQL, Trino (and Presto).

See language option docs for more details.

Limitations

  • Stored procedures are not supported.
  • Delimiter type cannot be changed from ;.

Install

npm install @dlh.io/dlh-sql-formatter

Or with yarn:

yarn add @dlh.io/dlh-sql-formatter

Quick Start

As a Library

import { format } from '@dlh.io/dlh-sql-formatter';

console.log(format('SELECT * FROM tbl', { language: 'mysql' }));

Output:

SELECT
  *
FROM
  tbl

With configuration options:

format('SELECT * FROM tbl', {
  language: 'spark',
  tabWidth: 2,
  keywordCase: 'upper',
  linesBetweenQueries: 2,
});

Disabling the Formatter

Wrap sections with disable/enable comments to skip formatting:

/* sql-formatter-disable */
SELECT * FROM tbl1;
/* sql-formatter-enable */
SELECT * FROM tbl2;

Output:

/* sql-formatter-disable */
SELECT * FROM tbl1;
/* sql-formatter-enable */
SELECT
  *
FROM
  tbl2;

Placeholder Replacement

format('SELECT * FROM tbl WHERE foo = ?', {
  params: ["'bar'"],
});

Output:

SELECT
  *
FROM
  tbl
WHERE
  foo = 'bar'

For more details see docs of params option.

Command Line Usage

The CLI tool is installed as dlh-sql-formatter:

npx @dlh.io/dlh-sql-formatter -h
usage: dlh-sql-formatter [-h] [-o OUTPUT] \
[-l {bigquery,db2,db2i,duckdb,hive,mariadb,mysql,n1ql,plsql,postgresql,redshift,singlestoredb,snowflake,spark,sql,sqlite,tidb,transactsql,trino,tsql}] [-c CONFIG] [--version] [FILE]

SQL Formatter

positional arguments:
  FILE            Input SQL file (defaults to stdin)

optional arguments:
  -h, --help      show this help message and exit
  -o, --output    OUTPUT
                    File to write SQL output (defaults to stdout)
  --fix           Update the file in-place
  -l, --language  SQL dialect (defaults to basic sql)
  -c, --config    CONFIG
                    Path to config JSON file or json string
  --version       show program's version number and exit

Example:

echo 'select * from tbl where id = 3' | npx @dlh.io/dlh-sql-formatter

Configuration File

The tool accepts a JSON config file named .sql-formatter.json in the current or any parent directory, or via the --config option:

{
  "$schema": "https://raw.githubusercontent.com/datalakehouse/dlh-sql-formatter/master/schema.json",
  "language": "spark",
  "tabWidth": 2,
  "keywordCase": "upper",
  "linesBetweenQueries": 2
}

Tip: Add the $schema field to get autocomplete and validation in VS Code and other editors that support JSON Schema.

All fields are optional and unspecified fields use their default values.

Configuration Options

Usage without NPM

If you don't use a module bundler, clone the repository, run npm install and grab a file from /dist directory to use inside a <script> tag. This makes SQL Formatter available as a global variable window.sqlFormatter.

AI-Powered Rewrites

The @dlh.io/dlh-sql-formatter/ai entry point lets you rewrite SQL with an LLM (Anthropic / OpenAI / Gemini / Ollama / custom). By default, the library uses DLH's opinionated system prompt, which encodes warehouse-aware optimization heuristics (predicate pushdown, CTEs over correlated subqueries, explicit JOINs, partition/cluster-key awareness, etc.).

import { withAI } from '@dlh.io/dlh-sql-formatter/ai';

const result = await withAI('SELECT * FROM users WHERE country = \'US\'', {
  provider: 'anthropic',
  apiKey: process.env.ANTHROPIC_API_KEY,
  features: ['rewrite'],
  dialect: 'snowflake',
});

Overriding the default prompt

Since v1.3.0, callers can replace or extend the DLH default prompt via AIConfig.rewritePrompt:

// Extend the DLH default with an additional rule
await withAI(sql, {
  provider: 'openai',
  apiKey: process.env.OPENAI_API_KEY,
  features: ['rewrite'],
  rewritePrompt: {
    mode: 'extend',
    text: 'Always prefer CTEs over correlated subqueries, even if the query is trivial.',
  },
});

// Replace the DLH default entirely (you own the JSON contract)
await withAI(sql, {
  provider: 'anthropic',
  apiKey: process.env.ANTHROPIC_API_KEY,
  features: ['rewrite'],
  rewritePrompt: {
    mode: 'replace',
    text: 'You are a terse SQL rewriter. Return only `{ "sql": "…", "explanation": "…", "optimizations": [] }`.',
  },
});
mode Behaviour
'default' (or omitted) Use the built-in DLH prompt
'extend' DLH default + "Additional guidance:\n" + text
'replace' Use text verbatim — you must still instruct the model to return the { sql, explanation, optimizations[] } JSON shape

Empty / whitespace-only text falls back to the default regardless of mode.

The DLH default prompt is also exported as a constant so you can display it, log it, or wrap it in your own template:

import { DEFAULT_REWRITE_SYSTEM_PROMPT, buildDefaultRewriteSystemPrompt } from '@dlh.io/dlh-sql-formatter/ai';

console.log(DEFAULT_REWRITE_SYSTEM_PROMPT);                 // verbatim constant
console.log(buildDefaultRewriteSystemPrompt('bigquery'));   // default + dialect annotation

Editor Integration

VS Code

Install the DLH SQL Optimizer extension for VS Code to format SQL files directly in your editor.

JSON Schema for Config

Add the $schema property to your .sql-formatter.json for editor autocomplete:

{
  "$schema": "https://raw.githubusercontent.com/datalakehouse/dlh-sql-formatter/master/schema.json"
}

Frequently Asked Questions

Parse error: Unexpected ... at line ...

The most common cause is that you haven't specified an SQL dialect. Instead of calling the library simply:

format('select [col] from tbl');
// Throws: Parse error: Unexpected "[col] from" at line 1 column 8

Pick the proper dialect:

format('select [col] from tbl', { language: 'transactsql' });

Or when using the VS Code extension: Settings → DLH SQL Optimizer → Dialect.

The error message includes line and column information to help you locate the issue. Common causes include unsupported syntax for the selected dialect, unclosed strings or brackets, and template syntax that needs paramTypes configuration.

My SQL contains templating syntax which SQL Formatter fails to parse

Use the paramTypes config option to treat templating constructs as parameter placeholders:

format('SELECT {col1}, {col2} FROM {tablename};', {
  paramTypes: { custom: [{ regex: String.raw`\{\w+\}` }] },
});

Contributing

Please see CONTRIBUTING.md

Upstream Sync

This project regularly syncs with sql-formatter-org/sql-formatter to incorporate upstream bug fixes. See CHANGELOG.md for details on what has been merged.

License

MIT

About

DLH SQL Formatter based our friends at SQL Formatter the whitespace formatter for different query languages

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • TypeScript 94.4%
  • JavaScript 3.1%
  • Nearley 1.3%
  • Other 1.2%