DUMBL

Dumb Universal Markup Brutally Lightweight

Token reduction algorithm for LLM inputs. Compress JSON/TOML prompts while maintaining LLM readability.

---

💡 Why DUMBL?

LLMs understand text with missing vowels and abbreviated words due to linguistic redundancy. DUMBL exploits this to reduce token count (and costs) by 15-30%.

```
Original: "Desenvolva uma aplicação completa utilizando programação orientada"
DUMBL:    "Desenvlva uma aplicç complta utiliznd programç orientda"
Savings:  ~20% fewer tokens
```

Save money on API calls while maintaining semantic clarity for GPT-4, Claude, Llama, Gemini, and other LLMs.

✨ Features

- 🚀 15-30% token reduction on typical LLM prompts
- 🔧 Three compression levels: light, medium, aggressive
- 📦 Zero dependencies (TOML support is optional)
- 🌍 Multilingual: optimized for English and Portuguese
- 🔒 Smart preservation: URLs, emails, paths, and tech terms stay intact
- 📝 TypeScript ready: full type definitions included
- ⚡ Fast: minimal overhead for real-time use

📦 Installation

```bash
npm install dumbl
```

For TOML support (optional):

```bash
npm install dumbl @iarna/toml
```

🚀 Quick Start

```javascript
const { dumbl, dumblDry } = require('dumbl');

// Create an instance (levels 1-3)
const d = dumbl.aggressive(); // level 3

// Compress an object
const data = { prompt: "Explique detalhadamente o processamento" };
const result = d.compress(data);
// → { prompt: "Explqe detlhadmt o procesmt" }

// Output as JSON
const json = d.toJSON(data);

// Output as DUMBL format (most compact)
const compact = d.toDUMBL(data);

// Quick debug: logs stats and the compressed result
dumblDry(data);
```

📖 API Reference

Creating an instance:

```javascript
const { dumbl } = require('dumbl');

// With options
const d = dumbl({
  level: 3,            // 1=light, 2=medium, 3=aggressive
  preserveKeys: true,  // don't compress object keys
  minWordLength: 3     // min chars to compress
});

// Presets
dumbl.light()       // level 1 - safe, minimal
dumbl.medium()      // level 2 - balanced
dumbl.aggressive()  // level 3 - maximum compression
```

Instance methods:

```javascript
const d = dumbl.aggressive();

// Compress anything
d.compress(object)      // → compressed object
d.compress(jsonString)  // → compressed object
d.compress(text)        // → compressed string

// Output formats
d.toJSON(input)   // → JSON string (compressed)
d.toDUMBL(input)  // → DUMBL format (most compact)

// With TOML (requires @iarna/toml)
const TOML = require('@iarna/toml');
d.compress(tomlString, TOML)
d.toTOML(input, TOML)

// Parse DUMBL back
d.parseDUMBL(dumblString)  // → object

// Statistics
d.stats(original, compressed)  // → { savedChars, ratio, estimatedTokensSaved, ... }
```

Standalone functions:

```javascript
const { compress, toJSON, toDUMBL, dumblDry } = require('dumbl');

// Quick compression (uses level 3)
compress({ prompt: "..." })
toJSON({ prompt: "..." })
toDUMBL({ prompt: "..." })

// Dry run - logs stats and result to console
dumblDry({ prompt: "..." })
```

Use `dumblDry` to preview compression results with statistics:

```javascript
const { dumblDry } = require('dumbl');

dumblDry({ prompt: "Explique detalhadamente o processamento de dados" });
```

Output:

```
┌─────────────────────────────────────────┐
│ DUMBL Dry Run                           │
├─────────────────────────────────────────┤
│ Original:    56 chars
│ Compressed:  44 chars
│ Saved:       12 chars (21.4%)
│ Est. tokens: ~3 saved
├─────────────────────────────────────────┤
│ Result:
└─────────────────────────────────────────┘
{
  "prmpt": "Explque dtalhdamt o prcesmento de ddos"
}
```
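The figures in the dry-run box can be reproduced by hand. The sketch below is not the library's `stats` implementation: `sketchStats` is a hypothetical helper, and the ratio of 4 characters per token is an assumed rule of thumb that happens to match the `~3 saved` estimate shown above.

```javascript
// Hypothetical helper, not part of the dumbl package.
// Assumes roughly 4 characters per token (a rule of thumb, not a tokenizer).
const CHARS_PER_TOKEN = 4;

function sketchStats(originalChars, compressedChars) {
  const savedChars = originalChars - compressedChars;
  const ratio = originalChars === 0 ? 0 : savedChars / originalChars;
  return {
    savedChars,
    percentSaved: (ratio * 100).toFixed(1) + '%',
    estimatedTokensSaved: Math.round(savedChars / CHARS_PER_TOKEN),
  };
}

console.log(sketchStats(56, 44));
// → { savedChars: 12, percentSaved: '21.4%', estimatedTokensSaved: 3 }
```

For real cost estimates, counting tokens with the target model's own tokenizer is more reliable than any character-based heuristic.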

📊 Compression Levels

| Level | Description | Use Case |
|-------|-------------|----------|
| 1 | Remove duplicates only | Conservative, max readability |
| 2 | + Suffix abbreviations | Balanced |
| 3 | + Vowel removal | Maximum savings |
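To make the cumulative levels concrete, here is a minimal sketch of how three such passes could stack. Everything in it is an assumption for illustration: the function names, the tiny suffix table, and the reading of "duplicates" as repeated letters are hypothetical, and DUMBL's actual rules produce different output (compare the Quick Start example).

```javascript
// Illustrative only: these rules are assumptions, not dumbl's actual algorithm.
const SUFFIXES = [
  // Hypothetical suffix table: [long form, abbreviation]
  ['mente', 'mt'],
  ['ation', 'atn'],
];

// Level 1 (assumed): collapse runs of a repeated letter.
function collapseDoubles(word) {
  return word.replace(/([a-z])\1+/gi, '$1');
}

// Level 2 (assumed): abbreviate a known suffix.
function abbreviateSuffix(word) {
  for (const [suffix, short] of SUFFIXES) {
    if (word.length > suffix.length && word.endsWith(suffix)) {
      return word.slice(0, -suffix.length) + short;
    }
  }
  return word;
}

// Level 3 (assumed): drop unaccented vowels, keeping first and last characters.
function dropInnerVowels(word) {
  if (word.length < 3) return word;
  const inner = word.slice(1, -1).replace(/[aeiou]/gi, '');
  return word[0] + inner + word[word.length - 1];
}

// Each level applies its own pass on top of the lower levels.
function sketchCompressWord(word, level) {
  let out = collapseDoubles(word);
  if (level >= 2) out = abbreviateSuffix(out);
  if (level >= 3) out = dropInnerVowels(out);
  return out;
}

console.log(sketchCompressWord('processamento', 1));  // → procesamento
console.log(sketchCompressWord('detalhadamente', 2)); // → detalhadamt
console.log(sketchCompressWord('detalhadamente', 3)); // → dtlhdmt
```

Keeping the first and last characters at level 3 is a design choice of this sketch: word boundaries carry much of the information a reader needs to recover the original.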

🔧 DUMBL Format

Custom ultra-compact format, JSON-compatible:

```javascript
// JSON
{"enabled":true,"count":null,"items":["a","b"]}

// DUMBL
{enabled:T,count:N,items:["a","b"]}
```

Features:

- `T`/`F` for booleans
- `N` for null
- Unquoted keys when possible
- No whitespace
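The four rules above are enough to sketch a serializer. This is an illustration rather than the library's `toDUMBL`: `sketchToDumbl` is a hypothetical name, and the identifier-style test used to decide when a key can go unquoted is an assumption, since the README does not define "when possible". Input is assumed to be JSON-compatible data.

```javascript
// Illustrative serializer; the real toDUMBL may differ in edge cases.
// Assumed rule: a key may be unquoted if it looks like a plain identifier.
const BARE_KEY = /^[A-Za-z_][A-Za-z0-9_]*$/;

function sketchToDumbl(value) {
  if (value === null) return 'N';
  if (value === true) return 'T';
  if (value === false) return 'F';
  if (Array.isArray(value)) {
    return '[' + value.map(sketchToDumbl).join(',') + ']';
  }
  if (typeof value === 'object') {
    const pairs = Object.entries(value).map(([key, v]) => {
      const k = BARE_KEY.test(key) ? key : JSON.stringify(key);
      return k + ':' + sketchToDumbl(v);
    });
    return '{' + pairs.join(',') + '}';
  }
  // Strings and numbers are emitted exactly as JSON would emit them.
  return JSON.stringify(value);
}

console.log(sketchToDumbl({ enabled: true, count: null, items: ['a', 'b'] }));
// → {enabled:T,count:N,items:["a","b"]}
```

Because string values stay quoted, a string such as `"T"` cannot be confused with the boolean marker `T` when the output is parsed back.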

📈 Benchmark

```bash
npm run benchmark
```

Sample results:

| Format | Size | vs JSON |
|--------|------|---------|
| JSON (pretty) | 1250 | +45% |
| JSON (compact) | 862 | baseline |
| TOML | 780 | -10% |
| JSON+DUMBL L3 | 680 | -21% |
| DUMBL format | 620 | -28% |
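The "vs JSON" column is each size relative to the compact-JSON baseline, rounded to a whole percent. A short check of that arithmetic, using only the numbers from the table (`vsJson` is a hypothetical helper, not something the benchmark script is known to export):

```javascript
const BASELINE = 862; // JSON (compact), from the table above

// Relative size difference against the baseline, as a signed percentage.
function vsJson(size) {
  const pct = Math.round(((size - BASELINE) / BASELINE) * 100);
  return (pct > 0 ? '+' : '') + pct + '%';
}

for (const [format, size] of [
  ['JSON (pretty)', 1250],
  ['TOML', 780],
  ['JSON+DUMBL L3', 680],
  ['DUMBL format', 620],
]) {
  console.log(format, vsJson(size));
}
// JSON (pretty) +45%
// TOML -10%
// JSON+DUMBL L3 -21%
// DUMBL format -28%
```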

🛡️ What's Preserved

DUMBL intelligently preserves:

- ✅ Short words (≤3 chars)
- ✅ Connectors (the, of, de, para, etc.)
- ✅ URLs, emails, file paths
- ✅ Tech terms (API, JSON, HTTP, etc.)
- ✅ Numbers and booleans
- ✅ Object structure
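One way to picture the token-level rules in this list is as a predicate that runs before any compression. The sketch below is an assumption throughout: `shouldPreserve`, the regular expressions, and the word sets (seeded only with the examples named above) are hypothetical and are not taken from DUMBL's source.

```javascript
// Illustrative only: the patterns and word lists are assumptions,
// seeded with the examples named in the list above.
const CONNECTORS = new Set(['the', 'of', 'de', 'para']);
const TECH_TERMS = new Set(['API', 'JSON', 'HTTP']);
const URL_RE = /^https?:\/\/\S+$/i;
const EMAIL_RE = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
const PATH_RE = /^(\.{0,2}\/|~\/|[A-Za-z]:\\)\S*$/;
const NUMBER_RE = /^-?\d+(\.\d+)?$/;

// Returns true when a whitespace-delimited token should be left untouched.
function shouldPreserve(token) {
  return (
    token.length <= 3 ||
    CONNECTORS.has(token.toLowerCase()) ||
    TECH_TERMS.has(token.toUpperCase()) ||
    URL_RE.test(token) ||
    EMAIL_RE.test(token) ||
    PATH_RE.test(token) ||
    NUMBER_RE.test(token)
  );
}

const prompt = 'Fetch https://example.com/users via the JSON API para processamento';
console.log(prompt.split(/\s+/).filter((t) => !shouldPreserve(t)));
// → [ 'Fetch', 'processamento' ]
```

Only the tokens that fail every check would be passed on to the compression passes; the sketch ignores punctuation attached to tokens, which a real tokenizer would need to handle.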

🤖 LLM Compatibility

Tested and confirmed readable by:

- ✅ GPT-4 / GPT-4o / GPT-4o-mini
- ✅ Claude 3.5 / Claude 4
- ✅ Llama 3 / Llama 3.1
- ✅ Gemini Pro / Gemini Ultra
- ✅ Mistral / Mixtral

The compression maintains semantic meaning while reducing tokens.

📘 TypeScript

Full type definitions included:

```typescript
import { dumbl, DumblOptions, DumblStats } from 'dumbl';

const d = dumbl.aggressive();
const stats: DumblStats = d.stats(original, compressed);
```

🧪 Testing

```bash
npm test
```

🤝 Contributing

Contributions are welcome! Please read our Contributing Guide for details.

📄 License

MIT © Frederico Bezerra

👤 Author

Frederico Bezerra

- Website: neosdev.io
- LinkedIn: @fredericobezerra
- GitHub: @fredericobezerra

---

If you find DUMBL useful, please ⭐ star the repo!
