# ditto-dql-cli

A sandbox CLI for running DQL queries against Ditto databases, with benchmarking and performance tracking.

```bash
npm install ditto-dql-cli
```

A comprehensive sandbox environment for testing DQL queries against a Ditto database with a preloaded movie dataset, featuring performance benchmarking and baseline tracking.
> Tested with Ditto SDK versions 4.8 to 4.12.
> This application is designed for local DQL query execution and intentionally does not enable sync.
> To reset the database, stop the application and delete the `./ditto` directory from the project root.
## Quick Start

### Global Install

```bash
npm install -g ditto-dql-cli
dql
```
### Run with npx

```bash
npx ditto-dql-cli
```
### Choosing an SDK Version

```bash
dql          # Run with installed version
dql 4.10.0   # Run with Ditto SDK 4.10.0
dql 4.11.5   # Run with Ditto SDK 4.11.5
dql --help   # Show usage information
```
### Local Development

```bash
git clone https://github.com/skylerjokiel/ditto-dql-cli.git
cd ditto-dql-cli
npm install
npm run dev
```

The terminal automatically imports the movie dataset on first run. If a `benchmark_baselines.ndjson` file exists in the root directory, it also imports baseline data.

## Basic Commands
- **DQL query** - Type any DQL query directly and press Enter
- `.help` - Show the help message with all available commands
- `.list` - Show all available scenarios with index numbers
- `.run <name|index>` - Run a predefined scenario by name or index number (e.g., `.run count_all` or `.run 1`)
- `.all` - Run all scenarios in sequence with a comprehensive summary
- `.bench <query>` - Benchmark a custom query (20 runs with statistics)
- `.benchmarks` - List all available predefined benchmarks
- `.benchmark <name|index> [runs]` - Run a specific predefined benchmark with an optional run count (default: 5)
- `.benchmark_all [runs]` - Run all predefined benchmarks with an optional run count
- `.benchmark_baseline [runs]` - Create baselines for all benchmarks (default: 50 runs)
- `.benchmark_baseline <name> [runs]` - Create a baseline for a specific benchmark
- `.benchmark_show` - Display the saved baseline comparison table
- `.system` - Display comprehensive system information, including Ditto version, hardware details, and database statistics
- `.export <query>` - Export query results to an exports/export_ file
- `.generate_movies <count>` - Generate and insert random movies into the collection
- `.log_dump` - Export the current log buffer to a logs/manual-logs_ file
- `.log_debug` - Show log buffer debug information (buffer size, latest logs)
- `.exit` - Exit the terminal

### Example Queries
```sql
-- Count all movies
SELECT count(*) FROM movies

-- Find movies by year
SELECT * FROM movies WHERE _id.year = '2001'

-- Search by title
SELECT * FROM movies WHERE CONTAINS(_id.title, 'Star')
```

## Movie Document Structure
Each movie in the database has the following structure:
```json
{
  "_id": {
    "id": "573a1390f29313caabcd4135",
    "title": "Blacksmith Scene",
    "year": "1893",
    "type": "movie"
  },
  "plot": "Three men hammer on an anvil...",
  "genres": ["Short"],
  "runtime": 1,
  "cast": ["Charles Kayser", "John Ott"],
  "fullplot": "A stationary camera looks at...",
  "countries": ["USA"],
  "released": "1893-05-09T00:00:00.000Z",
  "directors": ["William K.L. Dickson"],
  "rated": "UNRATED",
  "awards": {
    "wins": 1,
    "nominations": 0,
    "text": "1 win."
  },
  "imdb": {
    "rating": 6.2,
    "votes": 1189,
    "id": 5
  },
  "tomatoes": {
    "viewer": {
      "rating": 3,
      "numReviews": 184,
      "meter": 32
    }
  }
}
```

## Test Harness & Validation
This application functions as a comprehensive test harness for DQL queries with built-in validation:
### Validation Capabilities
Scenarios can include automated validation for:
- **Result Count**: Verify queries return the expected number of documents
- **Index Usage**: Automatically run EXPLAIN and validate which index is used
- **Execution Time**: Ensure queries complete within specified time limits
### Scenario Format
Scenarios support both simple strings and validation objects:
```json
{
  "my_scenario": [
    "DROP INDEX IF EXISTS my_index ON movies",
    {
      "query": "SELECT * FROM movies WHERE rated = 'PG'",
      "expectedCount": 1234,
      "expectedIndex": "full_scan",
      "maxExecutionTime": 500
    },
    "CREATE INDEX my_index ON movies (rated)",
    {
      "query": "SELECT * FROM movies WHERE rated = 'PG'",
      "expectedCount": 1234,
      "expectedIndex": "my_index",
      "maxExecutionTime": 50
    }
  ]
}
```
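The three validation fields (`expectedCount`, `expectedIndex`, `maxExecutionTime`) boil down to three comparisons per step. A minimal sketch in Python, assuming the harness has already run the query and EXPLAIN (`validate_step` is a hypothetical helper, not the CLI's actual code):

```python
# Hypothetical sketch of per-step validation; names are illustrative,
# not the CLI's internals.
def validate_step(step, result_count, index_used, elapsed_ms):
    """Compare one query's outcome against its validation object."""
    failures = []
    if "expectedCount" in step and result_count != step["expectedCount"]:
        failures.append(f"count {result_count} != {step['expectedCount']}")
    if "expectedIndex" in step and index_used != step["expectedIndex"]:
        failures.append(f"index {index_used!r} != {step['expectedIndex']!r}")
    if "maxExecutionTime" in step and elapsed_ms > step["maxExecutionTime"]:
        failures.append(f"{elapsed_ms}ms exceeds {step['maxExecutionTime']}ms limit")
    return failures

step = {
    "query": "SELECT * FROM movies WHERE rated = 'PG'",
    "expectedCount": 1234,
    "expectedIndex": "my_index",
    "maxExecutionTime": 50,
}
print(validate_step(step, 1234, "my_index", 42))   # passes: []
print(validate_step(step, 1200, "full_scan", 80))  # three failure messages
```

A step passes only when the failure list is empty; plain string steps (like the `DROP INDEX`/`CREATE INDEX` lines) skip validation entirely.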
### Running Scenarios

Run `.list` to see all scenarios with their index numbers. You can run scenarios either by name or index:
- `.run index_basic` or `.run 1` - Basic index performance validation
- `.run index_string_contains` or `.run 2` - Text search with CONTAINS
- `.run validation_test` or `.run 3` - Result count validation examples

Use `.all` to run all scenarios and get a comprehensive test report.

## Performance Benchmarking
### Custom Query Benchmarking
Use the `.bench` command to benchmark any custom query:

```
.bench SELECT * FROM movies WHERE rated = 'APPROVED'
```

This will run the query 20 times and provide detailed statistics:
- Mean, median, min, max execution times
- Standard deviation and percentiles (95th, 99th)
- Queries per second throughput
- Progress tracking during execution
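All of the statistics above can be derived from the raw per-run timings. A sketch of the arithmetic (the nearest-rank percentile method here is an assumption; the CLI may compute percentiles differently):

```python
import math
import statistics

def summarize(times_ms):
    """Compute benchmark statistics from raw run times in milliseconds."""
    s = sorted(times_ms)

    def percentile(p):
        # Nearest-rank method (an assumption about the CLI's approach)
        return s[max(0, math.ceil(p / 100 * len(s)) - 1)]

    return {
        "mean": statistics.mean(s),
        "median": statistics.median(s),
        "min": s[0],
        "max": s[-1],
        "stdev": statistics.stdev(s) if len(s) > 1 else 0.0,
        "p95": percentile(95),
        "p99": percentile(99),
        "qps": 1000 / statistics.mean(s),  # queries/second from mean run time
    }

stats = summarize(list(range(1, 21)))  # e.g. 20 runs taking 1..20 ms
print(stats["mean"], stats["p95"])     # 10.5 19
```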
### Predefined Benchmarks
The application includes predefined benchmark suites for common query patterns:
```
.benchmarks # List all available benchmarks
.benchmark count # Run the "count" benchmark
.benchmark 1 # Run benchmark by index
.benchmark count 10 # Run benchmark with custom run count
.benchmark_all # Run all predefined benchmarks
.benchmark_all 10 # Run all benchmarks with 10 runs each
```
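Predefined benchmarks can pair the timed query with setup and cleanup queries (the `preQueries`/`postQueries` fields in `benchmarks.json`). A plausible sketch of such a run loop, where `execute` stands in for the SDK's query call and all names are illustrative:

```python
import time

def run_benchmark(execute, query, pre_queries=(), post_queries=(), runs=5):
    """Run setup queries once, time the benchmarked query `runs` times,
    then always run cleanup queries, even if a timed run raises."""
    for q in pre_queries:
        execute(q)
    timings_ms = []
    try:
        for _ in range(runs):
            start = time.perf_counter()
            execute(query)
            timings_ms.append((time.perf_counter() - start) * 1000)
    finally:
        for q in post_queries:  # cleanup runs regardless of failures
            execute(q)
    return timings_ms

# Demo with a recording fake in place of a real query executor
calls = []
timings = run_benchmark(
    calls.append,
    "SELECT * FROM movies WHERE runtime > 150 LIMIT 100",
    pre_queries=["CREATE INDEX IF NOT EXISTS runtime_idx ON movies (runtime)"],
    post_queries=["DROP INDEX IF EXISTS runtime_idx ON movies"],
    runs=3,
)
print(len(timings), calls[0], calls[-1])
```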
### Baseline Tracking

Track performance changes across Ditto versions using the baseline system:
```
.benchmark_baseline # Create baseline for all benchmarks (50 runs)
.benchmark_baseline 100 # Create baseline for all with custom run count
.benchmark_baseline count # Create baseline for specific benchmark
.benchmark_baseline count 100 # Create baseline for specific benchmark with custom runs
.benchmark_show # Display saved baseline comparison table
```

When running benchmarks, the system automatically compares results against:
- The current version's baseline (if available)
- Last 3 patch versions (e.g., 4.12.0, 4.12.1, 4.12.2)
- Latest version from up to 2 previous minor versions (e.g., 4.11.5, 4.10.5)
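The selection rule above can be sketched as a small function (a guess at the logic from its description; `comparison_versions` is not a real CLI function, and ties to the current baseline are omitted):

```python
def comparison_versions(current, available):
    """Pick baseline versions to compare against: the last three patch
    releases of the current minor, plus the latest baseline from up to
    two previous minor versions."""
    major, minor, patch = (int(x) for x in current.split("."))
    # Last 3 patch releases of the current minor (e.g. 4.12.2, 4.12.1, 4.12.0)
    picks = [v for v in (f"{major}.{minor}.{p}" for p in range(patch - 1, -1, -1))
             if v in available][:3]
    # Latest baseline from up to 2 previous minors (e.g. 4.11.5, 4.10.5)
    for m in range(minor - 1, max(minor - 3, -1), -1):
        candidates = [v for v in available if v.startswith(f"{major}.{m}.")]
        if candidates:
            picks.append(max(candidates, key=lambda v: int(v.split(".")[2])))
    return picks

baselines = {"4.12.0", "4.12.1", "4.12.2", "4.11.2", "4.11.5", "4.10.5"}
print(comparison_versions("4.12.3", baselines))
# ['4.12.2', '4.12.1', '4.12.0', '4.11.5', '4.10.5']
```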
Both `.benchmark_all` and `.benchmark_show` display comprehensive summary tables showing performance across versions with color-coded differences:
- 🟢 Green = Performance improvement (>1ms or >5% faster)
- 🔵 Blue = Minimal change (≤1ms or ≤5%)
- 🟡 Yellow = Small regression (1-2ms or 5-15% slower)
- 🔴 Red = Significant regression (>2ms or >15% slower)

**Baseline Data Import:**
If you have a `benchmark_baselines.ndjson` file in the root directory, the application automatically imports it on startup when the baseline collection is empty. This is useful for:
- Sharing baseline data between team members
- Restoring baseline data after database resets
- Setting up consistent baseline data across environments

**Adding Custom Benchmarks:**
Edit `benchmarks.json` to add new benchmark queries:

```json
{
  "my_benchmark": {
    "query": "SELECT * FROM movies WHERE runtime > 150 LIMIT 100",
    "preQueries": ["CREATE INDEX IF NOT EXISTS runtime_idx ON movies (runtime)"],
    "postQueries": ["DROP INDEX IF EXISTS runtime_idx ON movies"]
  }
}
```

Perfect for comparing indexed vs. non-indexed query performance and maintaining consistent performance tests.
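The color thresholds used in the comparison tables can be expressed as a small classifier. This is one plausible reading of the ms/percentage rules (the CLI's exact and/or combination may differ):

```python
def classify(baseline_ms, current_ms):
    """Map a baseline-vs-current timing pair to the color categories
    described above; thresholds in ms and percent of the baseline."""
    diff = current_ms - baseline_ms
    pct = 100 * diff / baseline_ms
    if diff < -1 or pct < -5:
        return "green"   # improvement: >1ms or >5% faster
    if diff > 2 or pct > 15:
        return "red"     # significant regression
    if diff > 1 or pct > 5:
        return "yellow"  # small regression
    return "blue"        # minimal change

print(classify(100.0, 90.0))   # green
print(classify(100.0, 100.5))  # blue
print(classify(10.0, 11.2))    # yellow (12% slower)
print(classify(10.0, 13.0))    # red (3ms and 30% slower)
```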
## Benchmark Hardware Context
All benchmark results in this repository were collected on the following system:
```
System Information:
  Platform: darwin arm64
  OS Release: 24.6.0

CPU Information:
  Model: Apple M1 Max
  Cores: 10
```

**Important Notes:**
- Benchmark results are highly dependent on hardware specifications
- Your results will vary based on CPU, memory, storage type, and system load
- Use relative performance comparisons (between versions) rather than absolute times
- The `.system` command shows your current hardware specifications for reference

When sharing benchmark results or comparing performance:
- Always include your system specifications (use the `.system` command)
- Focus on percentage changes between versions rather than absolute timings
- Consider running multiple benchmark iterations to account for system variance
- Be aware that different CPU architectures (Intel vs. ARM) will show different baseline performance

## System Information
The `.system` command provides comprehensive information about your environment:

```
.system
```

This displays:
- **Ditto SDK Version**: Current version and license information
- **System Information**: OS, platform, CPU, memory details
- **Storage Information**: Database location and size
- **Database Statistics**: Document counts, index information, and collection details
Use this information to:
- Ensure consistent testing environments
- Debug performance differences
- Track database growth
- Verify index usage
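When recording results outside the CLI, comparable host details can be captured with Python's standard library (a convenience sketch for your own notes, not how `.system` gathers its data):

```python
import os
import platform

def system_summary():
    """Collect host details comparable to the hardware-context block."""
    return {
        "platform": f"{platform.system().lower()} {platform.machine()}",
        "os_release": platform.release(),
        # platform.processor() can be empty on some OSes
        "cpu_model": platform.processor() or "unknown",
        "cores": os.cpu_count(),
    }

print(system_summary())
```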
## Data Export
Export query results to NDJSON format for backup, analysis, or migration:
```
.export SELECT * FROM movies # Export all movies
.export SELECT * FROM movies WHERE rated = 'PG' # Export filtered movies
.export SELECT * FROM benchmark_baselines # Export baseline data
.export SELECT _id.title, runtime FROM movies LIMIT 100 # Export specific fields
```

The export command:
- Executes any valid DQL query
- Saves results in NDJSON (newline-delimited JSON) format
- Creates an `exports/` directory if it doesn't exist
- Generates timestamped filenames: export_2024-10-04T09-30-15.ndjson
- Places files in the exports/ directory (ignored by git)
- Shows export statistics (document count, file size, location, query)
- Handles query errors gracefully

NDJSON format is ideal for:
- Data backups and archiving
- Importing into other systems
- Analysis with tools like `jq` or custom scripts
- Version control of dataset snapshots

## Generate Random Movies
Create and insert randomly generated movies for testing at scale:
```
.generate_movies 1000 # Generate 1,000 random movies
.generate_movies 50000 # Generate 50,000 random movies
.generate_movies 1000000 # Generate 1 million movies (with confirmation prompt)
```

The `.generate_movies` command:
- Creates realistic movie documents with all required fields
- Uses "random-" prefix for all generated movie IDs
- Shows real-time progress during insertion
- Benchmarks performance (movies/second insertion rate)
- Displays total collection count after completion
- Warns and requires confirmation for counts over 100,000
Generated movies include:
- Random titles combining adjectives and nouns (e.g., "Epic Adventure", "Mysterious Journey")
- Years ranging from 1920-2024
- Multiple genres (Action, Comedy, Drama, etc.)
- Cast and director names
- MPAA ratings (G, PG, PG-13, R, etc.)
- IMDB ratings and votes
- Rotten Tomatoes scores (when applicable)
- Runtime between 60-180 minutes
- Awards and nominations
Perfect for:
- Performance testing with larger datasets
- Query optimization testing at scale
- Benchmarking different data volumes
- Stress testing Ditto operations
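The shape of the generated documents can be illustrated with a tiny generator. This is a sketch only: the word lists, ranges, and fields below are illustrative stand-ins for the CLI's own, trimmed to a few fields:

```python
import random

# Illustrative word lists, not the CLI's actual ones
ADJECTIVES = ["Epic", "Mysterious", "Silent", "Crimson"]
NOUNS = ["Adventure", "Journey", "Horizon", "Echo"]
GENRES = ["Action", "Comedy", "Drama", "Thriller"]
RATINGS = ["G", "PG", "PG-13", "R"]

def random_movie(n):
    """Build one document with a "random-" prefixed id and the described ranges."""
    return {
        "_id": {
            "id": f"random-{n}",
            "title": f"{random.choice(ADJECTIVES)} {random.choice(NOUNS)}",
            "year": str(random.randint(1920, 2024)),
            "type": "movie",
        },
        "genres": random.sample(GENRES, k=random.randint(1, 2)),
        "runtime": random.randint(60, 180),
        "rated": random.choice(RATINGS),
        "imdb": {
            "rating": round(random.uniform(1.0, 10.0), 1),
            "votes": random.randint(10, 100_000),
        },
    }

movie = random_movie(1)
print(movie["_id"]["id"], movie["runtime"])
```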
## Adding New Scenarios
To add a new scenario, edit `scenarios.json` and add your queries with optional validation:

```json
{
  "my_scenario": [
    "DROP INDEX IF EXISTS my_index ON movies",
    {
      "query": "SELECT * FROM movies WHERE runtime > 120",
      "expectedCount": 8500,
      "expectedIndex": "full_scan",
      "maxExecutionTime": 800
    },
    "CREATE INDEX my_index ON movies (runtime)",
    {
      "query": "SELECT * FROM movies WHERE runtime > 120",
      "expectedCount": 8500,
      "expectedIndex": "my_index",
      "maxExecutionTime": 100
    }
  ]
}
```

Stop and restart the app, then run it with `.run my_scenario`.

## Key Features
### Multi-Version Support
- Run with any Ditto SDK version using the `dql` command
- Automatically downloads and installs the requested version
- Perfect for testing queries across different Ditto versions
- No need to manually manage multiple installations

### Comprehensive Benchmark Suite
- 45+ benchmark scenarios covering various query patterns and optimizations
- Multi-field index benchmarks for compound query testing
- INSERT, UPDATE, DELETE operations with proper cleanup
- Text search, range queries, aggregations, and sorting benchmarks
- Progress tracking shows the current benchmark number (e.g., "Running benchmark (40/48)")

### Performance Baselines
- Create and save performance baselines across Ditto versions
- Individual baseline creation with `.benchmark_baseline <name>`
- Compare current performance against historical baselines
- Color-coded performance indicators for easy interpretation
- Table-based summary view for cross-version comparisons (up to 7 versions)
- `.benchmark_show` command to view saved baselines without running benchmarks

### Data Export
- Export any query results to NDJSON format with `.export`
- Timestamped filenames prevent overwrites
- Organized in the `exports/` directory (git-ignored)
- Perfect for data analysis, backup, or sharing

### Logging
- Automatic circular-buffer logging captures the last 100 Ditto log entries
- Auto-exports logs on warnings/errors to logs/error-logs_ files
- Manual log export with `.log_dump` for troubleshooting
- Debug information with `.log_debug` to check buffer status

## License

MIT