[WIP] Unofficial SQL client for the Hyper database system and Salesforce Data Cloud

DashQL is a library for creating and analyzing a compact version of the PostgreSQL AST. It is built around a Bison parser that materializes AST nodes into a single FlatBuffers vector. It can be compiled to WebAssembly and was originally built for lightweight SQL instrumentation, running on every user keystroke in DashQL.

Each AST node is packed into 24 bytes and references matched substrings in the original script text. This encoding is compact and efficient for simple passes, but is not directly suited for a full semantic analysis.
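
For illustration, here is a minimal sketch of what such a fixed-size node could look like. The field names and layout are assumptions for this example, not DashQL's actual FlatBuffers schema:

// Illustrative 24-byte node layout (assumed fields, not the real schema).
#include <cstdint>

struct PackedASTNode {
    std::uint32_t node_type;        // grammar node tag
    std::uint32_t parent;           // index of the parent node in the node vector
    std::uint32_t location_offset;  // begin of the matched substring in the script text
    std::uint32_t location_length;  // length of the matched substring
    std::uint32_t children_begin;   // index of the first child node
    std::uint32_t children_count;   // number of children
};
static_assert(sizeof(PackedASTNode) == 24, "nodes stay fixed-size and cache-friendly");

Keeping every node the same size and pointing back into the script text is what makes simple passes cheap: a pass is an array scan, and no substrings are copied.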


Building

make infra_macos          # Install dependencies to .infra, `infra_linux` for Linux
yarn install              # Install npm packages

make proto                # Generate flatbuffers

make core_native_o0       # Build unoptimized native core library
make core_native_o2       # Build optimized native core library with debug symbols
make core_native_tests    # Run native tests
make core_wasm_o3         # Build optimized wasm core library
make core_js_o3           # Build js bundle with wasm module and js api
make core_js_tests        # Run js tests using the wasm module
make snapshots            # Update snapshots
make benchmark_pipeline   # Benchmark the processing pipeline

make svg_symbols          # Update svg sprites used in the PWA
make pwa_tests            # Run js tests of the PWA

# PWA dev server, you will spend most time here during development
#
# Make sure to increase the number of file system watchers for hot module reloading:
# macOS: sudo sysctl -w kern.maxfiles=524288
# Linux: echo fs.inotify.max_user_watches=524288 | sudo tee -a /etc/sysctl.conf && sudo sysctl -p
#
make pwa_dev              # Run the dev server of the PWA

make native_tests         # Run rust tests of native app
make native_mac_o0        # Create a debug build packaged as .dmg, open devtools with (cmd + alt + i)
make native_mac_universal # Create a universal build packaged as .dmg
make native_mac_updates   # Create universal app updates with code signing (requires signing key)

Incremental parsing with Tree-sitter?

Tree-sitter is a great parser framework and I recommend everyone try it out. It gives you flexible incremental parsing without much hassle and is a perfect fit for many editors.

DashQL was built for specific database systems. Hyper, Umbra, NoisePage, AlloyDB and DuckDB all use Bison parsers derived from the PostgreSQL grammar. The PostgreSQL grammar has stood the test of time and established itself as the de-facto reference for SQL syntax. Staying close to PostgreSQL simplifies building frontends for these database systems without worrying too much about grammar differences. DashQL is built around a carefully optimized, very fast parser based on the PostgreSQL grammar and provides lightweight semantic analysis passes that run on every single keystroke.

DashQL still does O(text-length) work on every input event, as opposed to Tree-sitter's O(change-size). Yet DashQL analyzes most input in well under a millisecond in your browser, even when the entire text is replaced. After all, the parser is not the only component that has to be tuned for fast analysis passes; incremental parsing alone "only" gives you a head start for the AST update. DashQL maintains B+-tree ropes, dictionary-encodes and tags SQL object names in-flight, and performs efficient post-order DFS traversals through linear scans over a compact AST representation, as sketched below.
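
To make the last point concrete, here is a minimal sketch of a post-order pass expressed as a linear scan. It assumes the parser materializes children before their parents in the node vector; the types and names are illustrative, not DashQL's API:

// Post-order DFS as a single forward scan over a flattened node vector.
// Assumes children were materialized before their parents (illustrative only).
#include <cstddef>
#include <vector>

template <typename Node, typename Visitor>
void visitPostOrder(const std::vector<Node>& nodes, Visitor&& visit) {
    for (std::size_t i = 0; i < nodes.size(); ++i) {
        // Every child has a smaller index than its parent, so by the time we
        // reach a node, all of its children have already been visited.
        visit(i, nodes[i]);
    }
}

Passes like name tagging then become cache-friendly array scans instead of pointer-chasing tree walks, which is a big part of why the non-incremental approach stays fast.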


What does "fast" mean in numbers?

Here are timings for TPC-DS Q1 on my laptop. All steps run single-threaded on an M1 Max. DashQL spends roughly 5 µs on scanning, 10 µs on parsing, and 15 µs on analyzing, leaving plenty of time for JavaScript to reflect changes in the UI.

Run on (10 X 24.1324 MHz CPU s)
CPU Caches:
  L1 Data 64 KiB
  L1 Instruction 128 KiB
  L2 Unified 4096 KiB (x10)
Load Average: 10.72, 6.78, 5.17
----------------------------------------------------------
Benchmark                Time             CPU   Iterations
----------------------------------------------------------
scan_query            5367 ns         5349 ns       100176
parse_query          10602 ns        10542 ns        64493
analyze_query        15533 ns        15495 ns        45057
move_cursor            435 ns          434 ns      1623384
complete_cursor       6200 ns         6173 ns       111914
compute_layout       13414 ns        13377 ns        53352