PAMPAX – Protocol for Augmented Memory of Project Artifacts Extended

Enhanced fork with performance optimizations and expanded capabilities. 85% chunk reduction through intelligent file-level semantic grouping preserves context while dramatically reducing API costs. Native support for 21 languages (up from 6) including Kotlin, Rust, C++, and Ruby. Token-based chunking with automatic model-aware sizing and hybrid optimization (81% efficiency gain) ensures fast indexing without data loss. Intelligent rate limiting prevents API throttling. Advanced reranking options (local Transformers.js or remote APIs) achieve perfect scores with models like Qwen3-Reranker-8B. Full support for OpenAI-compatible APIs (OpenAI, Nebius, LM Studio, llama.cpp, Azure, etc.). Built on upgraded tree-sitter v0.25 core for better parsing stability. For detailed changes, see CHANGELOG.md.

Version 1.14.0 · Token-Based Chunking · Semantic Search · MCP Compatible · Node.js

Give your AI agents an always-updated, queryable memory of any codebase – with intelligent semantic search and automatic learning – in one npx command.

> 🇺🇸 English Version | 🇪🇸 Versión en Español | 🤖 Agent Version

🌟 What's New in v1.13 - Advanced Search & Multi-Project Support

🎯 Scoped Search Filters - Filter by path_glob, tags, lang for precise results

🔄 Hybrid Search - BM25 + Vector fusion with reciprocal rank blending (enabled by default)

🧠 Cross-Encoder Re-Ranker - Transformers.js reranker for precision boosts

👀 File Watcher - Real-time incremental indexing with Merkle-like hashing

📦 Context Packs - Reusable search scopes with CLI + MCP integration

🛠️ Multi-Project CLI - --project and --directory aliases for clarity

🏆 Performance Analysis - Architectural comparison with general-purpose IDE tools

Major improvements:

- 40% faster indexing with incremental updates
- 60% better precision with hybrid search + reranker
- 3x faster multi-project operations with explicit paths
- 90% reduction in duplicate function creation with symbol boost
- Specialized architecture for semantic code search

🌟 Why PAMPAX?

Large language model agents can read thousands of tokens, but projects easily reach millions of characters. Without an intelligent retrieval layer, agents:

- Recreate functions that already exist
- Misname APIs (newUser vs. createUser)
- Waste tokens loading repetitive code (vendor/, node_modules/...)
- Fail when the repository grows

PAMPAX solves this by turning your repository into a semantic code memory graph:

1. Chunking – Each function/class becomes an atomic chunk
2. Semantic Tagging – Automatic extraction of semantic tags from code context
3. Embedding – Enhanced chunks are vectorized with advanced embedding models
4. Learning – System learns from successful searches and caches intentions
5. Indexing – Vectors + semantic metadata live in local SQLite
6. Codemap – A lightweight pampax.codemap.json commits to git so context follows the repo
7. Serving – An MCP server exposes intelligent search and retrieval tools

Any MCP-compatible agent (Cursor, Claude, etc.) can now search with natural language, get instant responses for learned patterns, and stay synchronized – without scanning the entire tree.

🤖 For AI Agents & Humans

> 🤖 If you're an AI agent: Read the complete setup guide for agents →
> or
> 👤 If you're human: Share the agent setup guide with your AI assistant to automatically configure PAMPAX!

📚 Table of Contents

- 🚀 MCP Installation (Recommended)
- 🧠 Semantic Features
- 📝 Supported Languages
- 💻 Direct CLI Usage
- 🧠 Embedding Providers
- 🏆 Performance Benchmark
- 🏗️ Architecture
- 🔧 Available MCP Tools
- 📊 Available MCP Resources
- 🎯 Available MCP Prompts

🧠 Semantic Features

PAMPAX automatically extracts semantic tags from your code without any special comments:

```javascript
// File: app/Services/Payment/StripeService.php
function createCheckoutSession() { ... }
```

Automatic tags: ["stripe", "service", "payment", "checkout", "session", "create"]
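
As a rough illustration of how tags like these could be derived, the sketch below splits a file path and a symbol name into lowercase words. The `extractTags` helper and its noise-word list are hypothetical and are not taken from the PAMPAX source; the real extractor may use more context.

```javascript
// Hypothetical helper: derive lowercase tags from a file path and a symbol name.
// Illustration only, not PAMPAX's actual extraction logic.
function extractTags(filePath, symbolName) {
  const splitWords = (text) =>
    text
      .replace(/\.[a-z0-9]+$/i, "")           // strip a trailing file extension
      .replace(/([a-z0-9])([A-Z])/g, "$1 $2") // "StripeService" -> "Stripe Service"
      .split(/[^A-Za-z0-9]+/)                 // split on slashes, spaces, punctuation
      .filter(Boolean)
      .map((word) => word.toLowerCase());

  const words = [...splitWords(filePath), ...splitWords(symbolName)];
  // Assumed noise list: generic directory names that carry little meaning.
  const noise = new Set(["app", "src", "lib", "services"]);
  return [...new Set(words.filter((word) => !noise.has(word)))];
}

const tags = extractTags(
  "app/Services/Payment/StripeService.php",
  "createCheckoutSession"
);
console.log(tags);
```

For the example path and function above, this yields the same six tags as the documented output, although in a different order.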

The system learns from successful searches and provides instant responses:

```bash
# First search (vector search)
"stripe payment session" → 0.9148 similarity

# System automatically learns and caches this pattern

# Next similar searches are instant:
"create stripe session" → instant response (cached)
"stripe checkout session" → instant response (cached)
```

- Automatic Learning: Saves successful searches (>80% similarity) as intentions
- Query Normalization: Understands variations: "create" = "crear", "session" = "sesion"
- Pattern Recognition: Groups similar queries: "[PROVIDER] payment session"
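
The learning behavior described above can be pictured as a small cache keyed by a normalized query. This is a minimal sketch under stated assumptions: the `IntentionCache` class, the synonym map, and the exact-match lookup are hypothetical, and PAMPAX's real matching is likely fuzzier than this.

```javascript
// Hypothetical in-memory intention cache; names and structure are assumptions.
const SYNONYMS = new Map([["crear", "create"], ["sesion", "session"]]);
const LEARN_THRESHOLD = 0.8; // only "successful" searches are remembered

function normalizeQuery(query) {
  return query
    .toLowerCase()
    .normalize("NFD")
    .replace(/[\u0300-\u036f]/g, "") // strip accents: "sesión" -> "sesion"
    .split(/\s+/)
    .filter(Boolean)
    .map((word) => SYNONYMS.get(word) ?? word)
    .sort() // word order should not affect cache hits
    .join(" ");
}

class IntentionCache {
  constructor() {
    this.entries = new Map();
  }
  // Remember a result only when the similarity clears the threshold.
  learn(query, sha, similarity) {
    if (similarity > LEARN_THRESHOLD) {
      this.entries.set(normalizeQuery(query), sha);
    }
  }
  // Return the cached chunk SHA, or undefined to fall back to vector search.
  lookup(query) {
    return this.entries.get(normalizeQuery(query));
  }
}

const cache = new IntentionCache();
cache.learn("create stripe session", "abc123", 0.9148);
console.log(cache.lookup("crear sesión stripe")); // abc123
console.log(cache.lookup("refund payment"));      // undefined
```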

Enhance search precision with optional JSDoc-style comments:

```javascript
/**
 * @pampax-tags: stripe-checkout, payment-processing, e-commerce-integration
 * @pampax-intent: create secure stripe checkout session for payments
 * @pampax-description: Main function for handling checkout sessions with validation
 */
async function createStripeCheckoutSession(sessionData) {
  // Your code here...
}
```

Benefits:

- +21% better precision when present
- Perfect scores (1.0) when the query matches the intent exactly
- Fully optional: Code without comments works automatically
- Backward compatible: Existing codebases work without changes

| Search Type     | Without @pampax | With @pampax | Improvement |
| --------------- | --------------- | ------------ | ----------- |
| Domain-specific | 0.7331          | 0.8874       | +21%        |
| Intent matching | ~0.6            | 1.0000       | +67%        |
| General search  | 0.6-0.8         | 0.8-1.0      | +32-85%     |

📝 Supported Languages

PAMPAX can index and search code in 22 languages out of the box:

- JavaScript / TypeScript (`.js`, `.ts`, `.tsx`, `.jsx`)
- Python (`.py`)
- Java (`.java`)
- Kotlin (`.kt`) ⭐ _NEW_
- Go (`.go`)
- Rust (`.rs`) ⭐ _NEW_
- C++ (`.cpp`, `.hpp`, `.cc`) ⭐ _NEW_
- C (`.c`, `.h`) ⭐ _NEW_
- C# (`.cs`) ⭐ _NEW_
- PHP (`.php`)
- Ruby (`.rb`) ⭐ _NEW_
- Scala (`.scala`) ⭐ _NEW_
- Swift (`.swift`) ⭐ _NEW_
- Lua (`.lua`) ⭐ _NEW_
- OCaml (`.ml`, `.mli`) ⭐ _NEW_
- Haskell (`.hs`) ⭐ _NEW_
- Elixir (`.ex`, `.exs`) ⭐ _NEW_
- HTML (`.html`, `.htm`) ⭐ _NEW_
- CSS (`.css`) ⭐ _NEW_
- JSON (`.json`) ⭐ _NEW_
- Markdown (`.md`) ⭐ _NEW_
- Bash (`.sh`, `.bash`) ⭐ _NEW_

🆕 What's New in v1.14 - Token-Based Chunking

PAMPAX v1.14.0 introduces intelligent token-based chunking that automatically optimizes chunk sizes for your embedding model:

- 🎯 Model-Aware: Automatically detects your model and adjusts chunk sizes
- 🔢 Token Counting: Uses tiktoken for accurate token-based sizing
- ⚙️ Customizable: Override via `PAMPAX_MAX_TOKENS` and `PAMPAX_DIMENSIONS`
- 🔄 Backward Compatible: Existing indexes continue to work
- 📈 Better Quality: +20-30% chunking accuracy improvement

Quick Start:

```bash
npm install tiktoken  # For best results
pampax index          # Automatic token-based chunking!
```

Configuration Examples:

```bash
# Custom token limit and dimensions
export PAMPAX_MAX_TOKENS=2000
export PAMPAX_DIMENSIONS=1536
pampax index --provider openai

# Or set in MCP config (see MCP Installation section)
```

See TOKEN_CHUNKING_v1.14.md for full documentation and MIGRATION_GUIDE_v1.14.md for upgrade instructions.
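
To make the idea of a token budget concrete, here is a minimal sketch that packs consecutive lines into chunks without exceeding a limit. It is not the PAMPAX chunker: `estimateTokens` is a crude stand-in for a real tokenizer such as tiktoken (it assumes roughly four characters per token), and `chunkByTokens` is a hypothetical helper that ignores syntax boundaries.

```javascript
// Rough stand-in for a real tokenizer (assumption: ~4 characters per token).
const estimateTokens = (text) => Math.ceil(text.length / 4);

// Greedily pack consecutive lines into chunks that stay within maxTokens.
function chunkByTokens(source, maxTokens) {
  const chunks = [];
  let current = [];
  let used = 0;
  for (const line of source.split("\n")) {
    const cost = estimateTokens(line + "\n");
    if (current.length > 0 && used + cost > maxTokens) {
      chunks.push(current.join("\n"));
      current = [];
      used = 0;
    }
    current.push(line); // an oversized single line still becomes its own chunk
    used += cost;
  }
  if (current.length > 0) chunks.push(current.join("\n"));
  return chunks;
}

// PAMPAX_MAX_TOKENS is the documented override; 2000 is only a demo fallback.
const configuredBudget = Number(process.env.PAMPAX_MAX_TOKENS) || 2000;
const sample = Array.from({ length: 50 }, (_, i) => `const value${i} = ${i};`).join("\n");
const chunks = chunkByTokens(sample, 40);
console.log(chunks.length, "chunks at a budget of 40; configured budget:", configuredBudget);
```

Joining the chunks with newlines reproduces the input exactly, which is the "no data loss" property a chunker of this kind should keep.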
🚀 MCP Installation (Recommended)

#### Claude Desktop

Add to your Claude Desktop config (`~/Library/Application Support/Claude/claude_desktop_config.json` on macOS):

Example with Nebius (Recommended - High Quality and Very Cheap):

```json
{
  "mcpServers": {
    "pampax": {
      "command": "npx",
      "args": ["-y", "pampax", "mcp"],
      "env": {
        "OPENAI_API_KEY": "your-api-key",
        "OPENAI_BASE_URL": "https://api.studio.nebius.com/v1/",
        "PAMPAX_OPENAI_EMBEDDING_MODEL": "Qwen/Qwen3-Embedding-8B",
        "PAMPAX_MAX_TOKENS": "8192",
        "PAMPAX_DIMENSIONS": "4096",
        "PAMPAX_RATE_LIMIT": "500"
      }
    }
  }
}
```

Or use OpenAI directly:

```json
{
  "mcpServers": {
    "pampax": {
      "command": "npx",
      "args": ["-y", "pampax", "mcp"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}
```

Environment Variables (optional):

Embedding & Chunking:

- `PAMPAX_MAX_TOKENS` - Override maximum token limit for chunking (default: model-specific)
- `PAMPAX_DIMENSIONS` - Override embedding dimensions (default: model-specific)
- `OPENAI_API_KEY` - Your OpenAI API key (if using OpenAI provider)
- `PAMPAX_OPENAI_EMBEDDING_MODEL` - Model name (e.g., `text-embedding-3-small`)

Rate Limiting:

- `PAMPAX_RATE_LIMIT` - Maximum embedding API requests per minute (default: 50 for OpenAI, 100 for Cohere, unlimited for local models)

Reranker Configuration:

- `PAMPAX_RERANKER_DEFAULT` - Default reranker mode (default: `off`, options: `off|transformers|api`)
- `PAMPAX_RERANKER_MODEL` - Reranker model (default: `Xenova/ms-marco-MiniLM-L-6-v2`)
- `PAMPAX_RERANKER_MAX` - Max candidates to rerank (default: 50)
- `PAMPAX_RERANKER_MAX_TOKENS` - Max tokens per document for reranker (default: 512)
- `PAMPAX_RERANK_API_URL` - API URL for remote reranking (e.g., Cohere, Jina AI)
- `PAMPAX_RERANK_API_KEY` - API key for remote reranking service
- `PAMPAX_RERANK_MODEL` - Model name for API reranker (default: `rerank-v3.5`)
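
To show what a requests-per-minute cap such as `PAMPAX_RATE_LIMIT` implies in practice, the following sketch implements a sliding one-minute window. The `RateLimiter` class is hypothetical; PAMPAX's internal limiter may work differently (for example, with a token bucket or a queue).

```javascript
// Hypothetical sliding-window limiter; PAMPAX's internal limiter may differ.
class RateLimiter {
  constructor(requestsPerMinute, now = () => Date.now()) {
    this.limit = requestsPerMinute;
    this.now = now;       // injectable clock, handy for demos and tests
    this.timestamps = []; // send times within the last 60 seconds
  }
  // Milliseconds to wait before the next request may be sent (0 = send now).
  delayBeforeNext() {
    const cutoff = this.now() - 60000;
    this.timestamps = this.timestamps.filter((t) => t > cutoff);
    if (this.timestamps.length < this.limit) return 0;
    return this.timestamps[0] + 60000 - this.now();
  }
  // Record that a request was sent at the current time.
  record() {
    this.timestamps.push(this.now());
  }
}

// Demo with a fake clock so the behaviour is easy to follow.
let fakeTime = 0;
const limiter = new RateLimiter(2, () => fakeTime);
limiter.record();                       // request at t = 0 ms
fakeTime = 1000;
limiter.record();                       // request at t = 1000 ms
console.log(limiter.delayBeforeNext()); // 59000: wait until the first request ages out
fakeTime = 60001;
console.log(limiter.delayBeforeNext()); // 0: the window has room again
```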

Note: Using npx automatically downloads and runs the latest version from npm, no global installation needed.

Optional: Add "--debug" to args for detailed logging: ["-y", "pampax", "mcp", "--debug"]

With API Reranking (Absolute Maximum - Full 32K Context):

```json
{
  "mcpServers": {
    "pampax": {
      "command": "npx",
      "args": ["-y", "pampax", "mcp"],
      "env": {
        "OPENAI_API_KEY": "your-novita-api-key",
        "OPENAI_BASE_URL": "https://api.novita.ai/openai",
        "PAMPAX_OPENAI_EMBEDDING_MODEL": "qwen/qwen3-embedding-8b",
        "PAMPAX_RERANK_API_URL": "https://api.novita.ai/openai/v1/rerank",
        "PAMPAX_RERANK_API_KEY": "your-novita-api-key",
        "PAMPAX_RERANK_MODEL": "qwen/qwen3-reranker-8b",
        "PAMPAX_RERANKER_DEFAULT": "api",
        "PAMPAX_MAX_TOKENS": "8192",
        "PAMPAX_DIMENSIONS": "4096",
        "PAMPAX_RERANKER_MAX": "200",
        "PAMPAX_RERANKER_MAX_TOKENS": "8192",
        "PAMPAX_RATE_LIMIT": "500"
      }
    }
  }
}
```

#### Cursor

Configure Cursor by creating or editing the mcp.json file in your configuration directory:

Example with Novita.ai:

```json
{
  "mcpServers": {
    "pampax": {
      "command": "npx",
      "args": ["-y", "pampax", "mcp"],
      "env": {
        "OPENAI_API_KEY": "your-novita-api-key",
        "OPENAI_BASE_URL": "https://api.novita.ai/openai",
        "PAMPAX_OPENAI_EMBEDDING_MODEL": "qwen/qwen3-embedding-8b",
        "PAMPAX_MAX_TOKENS": "8192",
        "PAMPAX_DIMENSIONS": "4096",
        "PAMPAX_RERANKER_MAX": "200",
        "PAMPAX_RERANKER_MAX_TOKENS": "8192",
        "PAMPAX_RATE_LIMIT": "500",
        "PAMPAX_RERANKER_DEFAULT": "api"
      }
    }
  }
}
```

Tip: You can use any OpenAI-compatible API by setting OPENAI_BASE_URL. Popular options include Novita.ai (recommended), OpenAI, LM Studio, Azure OpenAI, and LocalAI. See the environment variables list above for all available options.

Your AI agent should automatically:

- Check if the project is indexed with `get_project_stats`
- Index the project with `index_project` if needed
- Keep it updated with `update_project` after changes

Need to index manually? See Direct CLI Usage section.

Additionally, install this rule in your application so it uses PAMPAX effectively:

Copy the content from RULE_FOR_PAMPAX_MCP.md into your agent or AI system instructions.

Once configured, your AI agent can:

```
🔍 Search: "authentication function"
📄 Get code: Use the SHA from search results
📊 Stats: Get project overview and statistics
🔄 Update: Keep memory synchronized
```

💻 Direct CLI Usage

For direct terminal usage or manual project indexing:

```bash
# Install globally from npm (requires Node.js 16+)
npm install -g pampax

# Verify installation
pampax --help
```

Alternative installations:

```bash
# Install from GitHub (latest development version)
npm install -g git+https://github.com/lemon07r/pampax.git

# Use npx (no global installation required)
npx pampax index
```

```bash
# Index current repository with the best available provider
pampax index

# Force the local CPU embedding model (no API keys required)
pampax index --provider transformers

# Re-embed after code changes
pampax update

# Inspect indexed stats at any time
pampax info
```

> Indexing writes `.pampax/` (SQLite database + chunk store) and `pampax.codemap.json`. Commit the codemap to git so teammates and CI re-use the same metadata.

| Command                               | Purpose                                                  |
| ------------------------------------- | -------------------------------------------------------- |
| `pampax index [path] [--provider X]`  | Create or refresh the full index at the provided path    |
| `pampax update [path] [--provider X]` | Force a full re-scan (helpful after large refactors)     |
| `pampax watch [path] [--provider X]`  | Incrementally update the index as files change           |
| `pampax search`                       | Hybrid BM25 + vector search with optional scoped filters |
| `pampax context`                      | Manage reusable context packs for search defaults        |
| `pampax mcp`                          | Start the MCP stdio server for editor/agent integrations |

`pampax search` supports the same filters used by MCP clients. Combine glob patterns, semantic tags, language filters, provider overrides, and ranking controls:

| Flag / option         | Effect                                                                      |
| --------------------- | --------------------------------------------------------------------------- |
| `--path_glob`         | Limit results to matching files (`"app/Services/**"`)                       |
| `--tags`              | Filter by codemap tags (`stripe`, `checkout`)                               |
| `--lang`              | Filter by language (`php`, `ts`, `py`)                                      |
| `--provider`          | Override embedding provider for the query (`openai`, `transformers`)        |
| `--reranker`          | Reorder top results with the Transformers cross-encoder (`off\|transformers`) |
| `--hybrid` / `--bm25` | Toggle reciprocal-rank fusion or the BM25 candidate stage (`on\|off`)        |
| `--symbol_boost`      | Toggle symbol-aware ranking boost that favors signature matches (`on\|off`)  |
| `-k`, `--limit`       | Cap returned results (defaults to 10)                                       |

```bash
# Narrow to service files tagged stripe in PHP
pampax search "create checkout session" --path_glob "app/Services/**" --tags stripe --lang php

# Use OpenAI embeddings but keep hybrid fusion enabled
pampax search "payment intent status" --provider openai --hybrid on --bm25 on

# Reorder top candidates locally
pampax search "oauth middleware" --reranker transformers --limit 5

# Disable signature boosts for literal keyword hunts
pampax search "token validation" --symbol_boost off
```

> PAMPAX extracts function signatures and lightweight call graphs with tree-sitter. When symbol boosts are enabled, queries that mention a specific method, class, or a directly connected helper will receive an extra scoring bump.
>
> When a context pack is active, the CLI prints the pack name before executing the search. Any explicit flag overrides the pack defaults.

Store JSON packs in `.pampax/contextpacks/*.json` to capture reusable defaults:

```jsonc
// .pampax/contextpacks/stripe-backend.json
{
  "name": "Stripe Backend",
  "description": "Scopes searches to the Stripe service layer",
  "path_glob": ["app/Services/**"],
  "tags": ["stripe"],
  "lang": ["php"],
  "reranker": "transformers",
  "hybrid": "off"
}
```

```bash
# List packs and highlight the active one
pampax context list

# Inspect the full JSON definition
pampax context show stripe-backend

# Activate scoped defaults (flags still win if provided explicitly)
pampax context use stripe-backend

# Clear the active pack (use "none" or "clear")
pampax context use clear
```

MCP tip: The MCP tool `use_context_pack` mirrors the CLI. Agents can switch packs mid-session and every subsequent `search_code` call inherits those defaults until cleared.
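
The precedence rule for context packs (explicit flags win over pack defaults, which win over built-in defaults) can be sketched as a simple object merge. The `resolveSearchOptions` function is hypothetical and only illustrates the ordering; the built-in values used here are the defaults stated in this README (hybrid on, reranker off, limit 10).

```javascript
// Built-in defaults as documented in this README.
const BUILT_IN_DEFAULTS = { hybrid: "on", reranker: "off", limit: 10 };

// Hypothetical merge: built-ins < context pack < explicit flags.
function resolveSearchOptions(pack = {}, flags = {}) {
  // Drop undefined flags so that "not provided" never overrides a pack value.
  const provided = Object.fromEntries(
    Object.entries(flags).filter(([, value]) => value !== undefined)
  );
  // Ignore descriptive pack fields that are not search options.
  const { name, description, ...packOptions } = pack;
  return { ...BUILT_IN_DEFAULTS, ...packOptions, ...provided };
}

const stripeBackend = {
  name: "Stripe Backend",
  description: "Scopes searches to the Stripe service layer",
  path_glob: ["app/Services/**"],
  tags: ["stripe"],
  lang: ["php"],
  reranker: "transformers",
  hybrid: "off",
};

// The user passes --hybrid on; --lang is not provided, so the pack value stays.
const options = resolveSearchOptions(stripeBackend, { hybrid: "on", lang: undefined });
console.log(options);
```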

```bash
# Watch the repository with a 750 ms debounce and local embeddings
pampax watch --provider transformers --debounce 750
```

The watcher batches filesystem events, reuses the Merkle hash store in `.pampax/merkle.json`, and only re-embeds touched files. Press Ctrl+C to stop.
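
The hash-based change detection behind the watcher can be illustrated with a small sketch. It assumes a flat `{ path: hash }` snapshot and a single root hash over the sorted leaves; the actual layout of `.pampax/merkle.json` is not documented here, so `snapshot`, `rootHash`, and `diff` are hypothetical helpers.

```javascript
const { createHash } = require("node:crypto");

const sha256 = (text) => createHash("sha256").update(text).digest("hex");

// Build a { path: hash } snapshot from in-memory file contents.
function snapshot(files) {
  return Object.fromEntries(
    Object.entries(files).map(([path, content]) => [path, sha256(content)])
  );
}

// Combine leaf hashes (sorted by path) into one root hash for a cheap
// "did anything change at all?" check.
function rootHash(snap) {
  const leaves = Object.keys(snap).sort().map((path) => `${path}:${snap[path]}`);
  return sha256(leaves.join("\n"));
}

// List files that need re-embedding and files whose chunks should be removed.
function diff(previous, next) {
  const changed = Object.keys(next).filter((path) => previous[path] !== next[path]);
  const removed = Object.keys(previous).filter((path) => !(path in next));
  return { changed, removed };
}

const before = snapshot({ "src/a.js": "let a = 1;", "src/b.js": "let b = 2;" });
const after = snapshot({ "src/a.js": "let a = 1;", "src/b.js": "let b = 3;", "src/c.js": "" });

console.log(rootHash(before) === rootHash(after)); // false
console.log(diff(before, after)); // { changed: [ 'src/b.js', 'src/c.js' ], removed: [] }
```

Only the paths in `changed` would need new embeddings, which is why an incremental update is cheaper than a full re-scan.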

```bash
npm run bench
```

The harness seeds a deterministic Laravel + TypeScript corpus and prints a summary table with Precision@1, MRR@5, and nDCG@10 for Base, Hybrid, and Hybrid+Cross-Encoder modes. Customise scenarios via flags or environment variables:

- `npm run bench -- --hybrid=off` – run vector-only evaluation
- `npm run bench -- --reranker=transformers` – force the cross-encoder
- `PAMPAX_BENCH_MODES=base,hybrid npm run bench` – limit to specific modes
- `PAMPAX_BENCH_BM25=off npm run bench` – disable BM25 candidate generation

Benchmark runs never download external models when `PAMPAX_MOCK_RERANKER_TESTS=1` (enabled by default inside the harness).

An end-to-end context pack example lives in `examples/contextpacks/stripe-backend.json`.

🧠 Embedding Providers

PAMPAX supports multiple providers for generating code embeddings:

| Provider          | Cost                     | Privacy   | Installation                                   |
| ----------------- | ------------------------ | --------- | ---------------------------------------------- |
| Transformers.js   | 🟢 Free                  | 🟢 Total  | `npm install @xenova/transformers`             |
| Ollama            | 🟢 Free                  | 🟢 Total  | Install Ollama + `npm install ollama`          |
| OpenAI            | 🔴 ~$0.10/1000 functions | 🔴 None   | Set `OPENAI_API_KEY`                           |
| OpenAI-Compatible | 🟡 Varies                | 🟡 Varies | Set `OPENAI_API_KEY` + `OPENAI_BASE_URL`       |
| Cohere            | 🟡 ~$0.05/1000 functions | 🔴 None   | Set `COHERE_API_KEY` + `npm install cohere-ai` |

Recommendation: Use Transformers.js for personal development (free and private) or OpenAI for maximum quality.

PAMPAX supports any OpenAI-compatible API endpoint through environment variables:

```bash
# LM Studio (local)
export OPENAI_BASE_URL="http://localhost:1234/v1"
export OPENAI_API_KEY="lm-studio"  # Can be any value for local servers

# Azure OpenAI
export OPENAI_BASE_URL="https://YOUR_RESOURCE.openai.azure.com/openai/deployments/YOUR_DEPLOYMENT"
export OPENAI_API_KEY="your-azure-api-key"

# LocalAI
export OPENAI_BASE_URL="http://localhost:8080/v1"
export OPENAI_API_KEY="not-needed"

# Ollama with OpenAI compatibility
export OPENAI_BASE_URL="http://localhost:11434/v1"
export OPENAI_API_KEY="ollama"
```

Then index with the OpenAI provider:

```bash
pampax index --provider openai
```

You can configure which embedding model to use via environment variables:

OpenAI Provider:

```bash
# Use a specific OpenAI model
export PAMPAX_OPENAI_EMBEDDING_MODEL="text-embedding-3-small"  # Cheaper, faster
# or
export OPENAI_MODEL="text-embedding-3-large"  # Alternative env var
# Default: text-embedding-3-large
```

Reranker Configuration:

```bash
# Local Transformers.js reranker model
export PAMPAX_RERANKER_MODEL="Xenova/ms-marco-MiniLM-L-6-v2"

# Max candidates to rerank and max tokens per document
export PAMPAX_RERANKER_MAX=50
export PAMPAX_RERANKER_MAX_TOKENS=512

# Or use remote API reranker (Cohere, Jina AI, etc.)
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-cohere-api-key"
export PAMPAX_RERANK_MODEL="rerank-v3.5"
```

Other Providers:

```bash
# Transformers.js (local)
export PAMPAX_TRANSFORMERS_MODEL="Xenova/all-mpnet-base-v2"
# Default: Xenova/all-MiniLM-L6-v2

# Ollama
export PAMPAX_OLLAMA_MODEL="llama2"
# Default: nomic-embed-text

# Cohere
export PAMPAX_COHERE_MODEL="embed-multilingual-v3.0"
# Default: embed-english-v3.0
```

Example with Novita.ai Qwen Models:

```bash
# Configure Novita.ai with Qwen embedding model
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-api-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"

# Index with this configuration
pampax index --provider openai

# Search will use the Qwen embedding model
pampax search "authentication logic" --provider openai
```


Supported Services:
- ✅ llama.cpp
- ✅ Kobold.cpp
- ✅ LM Studio
- ✅ Azure OpenAI
- ✅ Ollama (with OpenAI compatibility)
- ✅ Any OpenAI-compatible API gateway or proxy

PAMPAX supports API-based reranking as an alternative to the local Transformers.js cross-encoder. This allows you to use remote reranking services for improved search precision.

Supported Reranking APIs:
- ✅ Cohere Rerank API
- ✅ Jina AI Reranker
- ✅ Voyage AI Rerank
- ✅ Any compatible reranking API

Configuration:

```bash
# Set reranking API credentials
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-api-key"
export PAMPAX_RERANK_MODEL="rerank-v3.5"  # Optional, model to use

# Search with API reranker
pampax search "authentication logic" --reranker api

# Combine with other flags, such as a result limit
pampax search "payment processing" --reranker api --limit 5
```

Reranker Options:

- `--reranker off` - No reranking (fastest, lower precision)
- `--reranker transformers` - Local Transformers.js reranking (free, private)
- `--reranker api` - API-based reranking (requires API key, higher precision)

Example with Cohere:

```bash
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-cohere-api-key"
export PAMPAX_RERANK_MODEL="rerank-english-v3.0"
```

Example with Jina AI:

```bash
export PAMPAX_RERANK_API_URL="https://api.jina.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-jina-api-key"
export PAMPAX_RERANK_MODEL="jina-reranker-v2-base-multilingual"
```

Example with Novita.ai Qwen Reranker:

```bash
export PAMPAX_RERANK_API_URL="https://api.novita.ai/openai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-novita-api-key"
export PAMPAX_RERANK_MODEL="qwen/qwen3-reranker-8b"

# Use together with Qwen embedding for full pipeline
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-api-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"

# Index with Qwen embeddings
pampax index --provider openai

# Search with both Qwen embedding and reranking
pampax search "authentication logic" --provider openai --reranker api
```

MCP Integration: When API reranking is configured, the MCP `search_code` tool automatically uses it when `reranker: "api"` is specified.

🏆 Performance Analysis

PAMPAX v1.13 uses a specialized architecture for semantic code search with measurable results.

Synthetic Benchmark Results:

Default configuration:

| Setting   | P@1   | MRR@5 | nDCG@10 |
| --------- | ----- | ----- | ------- |
| Base      | 0.750 | 0.833 | 0.863   |
| Hybrid    | 0.875 | 0.917 | 0.934   |
| Hybrid+CE | 1.000 | 0.958 | 0.967   |

With Novita.ai Qwen models:

| Configuration                             | P@1   | MRR@5 | nDCG@10 |
| ----------------------------------------- | ----- | ----- | ------- |
| Qwen3-Embedding-8B + Transformers (local) | 0.750 | 0.875 | 0.908   |
| Qwen3-Embedding-8B + Qwen3-Reranker-8B    | 1.000 | 1.000 | 1.000   |

🏆 Qwen3-Reranker-8B achieves perfect scores (100%) across all metrics!
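
For readers unfamiliar with the three metrics in these tables, the sketch below computes them from binary relevance lists. The function names and the two sample queries are made up for illustration and are not part of the benchmark harness; nDCG is computed against the ideal ordering of the retrieved list, which assumes every relevant item was retrieved.

```javascript
// rankedRelevance: 0/1 relevance flags in ranked order for one query.
function precisionAt1(rankedRelevance) {
  return rankedRelevance[0] ? 1 : 0;
}

// Reciprocal rank of the first relevant hit within the top k (0 if none).
function reciprocalRankAt(rankedRelevance, k) {
  const index = rankedRelevance.slice(0, k).findIndex((rel) => rel > 0);
  return index === -1 ? 0 : 1 / (index + 1);
}

// Normalized discounted cumulative gain over the top k results.
function ndcgAt(rankedRelevance, k) {
  const dcg = (rels) =>
    rels.slice(0, k).reduce((sum, rel, i) => sum + rel / Math.log2(i + 2), 0);
  const ideal = dcg([...rankedRelevance].sort((a, b) => b - a));
  return ideal === 0 ? 0 : dcg(rankedRelevance) / ideal;
}

const mean = (values) => values.reduce((a, b) => a + b, 0) / values.length;

// Two made-up queries: relevant hit at rank 1 for one, at rank 2 for the other.
const runs = [
  [1, 0, 0],
  [0, 1, 0],
];
console.log("P@1    ", mean(runs.map(precisionAt1)));                  // 0.5
console.log("MRR@5  ", mean(runs.map((r) => reciprocalRankAt(r, 5)))); // 0.75
console.log("nDCG@10", mean(runs.map((r) => ndcgAt(r, 10))));          // about 0.815
```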

Run benchmarks yourself:

```bash
# Run default benchmarks (Base, Hybrid, Hybrid+CE)
npm run bench

# Test with Qwen models via Novita.ai
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"
export PAMPAX_RERANK_API_URL="https://api.novita.ai/openai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-novita-key"
export PAMPAX_RERANK_MODEL="qwen/qwen3-reranker-8b"
export PAMPA_BENCH_RERANKER="api"
npm run bench

# Test other configurations
export PAMPA_BENCH_RERANKER="transformers"
npm run bench
```

See BENCHMARK_v1.12.md for detailed configuration options and analysis.

```bash
# Search for authentication functions
pampax search "user authentication"
# → AuthController::login, UserService::authenticate, etc.

# Search for payment processing
pampax search "payment processing"
# → PaymentService::process, CheckoutController::create, etc.

# Search with specific filters
pampax search "database operations" --lang php --path_glob "app/Models/**"
# → UserModel::save, OrderModel::find, etc.
```

📈 Read Full Analysis →

1. Specialized Indexing - Persistent index with function-level granularity
2. Hybrid Search - BM25 + Vector + Cross-encoder reranking combination
3. Code Awareness - Symbol boosting, AST analysis, function signatures
4. Multi-Project - Native support for context across different codebases

Result: Optimized architecture for semantic code search with verifiable metrics.
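
The BM25 + vector fusion mentioned above is described elsewhere in this README as reciprocal rank blending. A minimal sketch of standard reciprocal rank fusion follows; the constant `k = 60` is a commonly used value assumed here, and the function is an illustration rather than PAMPAX's actual implementation.

```javascript
// Reciprocal rank fusion: score(d) = sum over rankings of 1 / (k + rank).
// k = 60 is a commonly used constant, assumed here rather than taken from PAMPAX.
function reciprocalRankFusion(rankings, k = 60) {
  const scores = new Map();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      const rank = index + 1; // ranks are 1-based
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank));
    });
  }
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}

// Made-up candidate lists from a keyword stage and a vector stage.
const bm25Ranking = ["PaymentService::process", "CheckoutController::create", "AuthController::login"];
const vectorRanking = ["CheckoutController::create", "UserService::authenticate", "PaymentService::process"];

console.log(reciprocalRankFusion([bm25Ranking, vectorRanking]));
```

Items that appear near the top of both lists rise above items that only one stage ranked highly, without needing the two stages' raw scores to be comparable.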

🏗️ Architecture

```
┌──────────── Repo (git) ────────────┐
│ app/… src/… package.json etc.      │
│ pampax.codemap.json                │
│ .pampax/chunks/*.gz(.enc)          │
│ .pampax/pampax.db (SQLite)         │
└────────────────────────────────────┘
          ▲       ▲
          │ write │ read
┌─────────┴─────────┐   │
│ indexer.js        │   │
│ (pampax index)    │   │
└─────────▲─────────┘   │
          │ store       │ vector query
┌─────────┴──────────┐  │ gz fetch
│ SQLite (local)     │  │
└─────────▲──────────┘  │
          │ read        │
┌─────────┴──────────┐  │
│ mcp-server.js      │◄─┘
│ (pampax mcp)       │
└────────────────────┘
```

| Layer      | Role                                                              | Technology                      |
| ---------- | ----------------------------------------------------------------- | ------------------------------- |
| Indexer    | Cuts code into semantic chunks, embeds, writes codemap and SQLite | tree-sitter, openai@v4, sqlite3 |
| Codemap    | Git-friendly JSON with `{file, symbol, sha, lang}` per chunk      | Plain JSON                      |
| Chunks dir | `.gz` code bodies (or `.gz.enc` when encrypted) (lazy loading)    | gzip → AES-256-GCM when enabled |
| SQLite     | Stores vectors and metadata                                       | sqlite3                         |
| MCP Server | Exposes tools and resources over standard MCP protocol            | @modelcontextprotocol/sdk       |
| Logging    | Debug and error logging in project directory                      | File-based logs                 |

🔧 Available MCP Tools

The MCP server exposes these tools that agents can use:

Search code semantically in the indexed project.

- Parameters:
  - `query` (string) - Semantic search query (e.g., "authentication function", "error handling")
  - `limit` (number, optional) - Maximum number of results to return (default: 10)
  - `provider` (string, optional) - Embedding provider (default: "auto")
  - `path` (string, optional) - PROJECT ROOT directory path where PAMPAX database is located
- Database Location: `{path}/.pampax/pampax.db`
- Returns: List of matching code chunks with similarity scores and SHAs

Get complete code of a specific chunk.

- Parameters:
  - `sha` (string) - SHA of the code chunk to retrieve (obtained from `search_code` results)
  - `path` (string, optional) - PROJECT ROOT directory path (same as used in `search_code`)
- Chunk Location: `{path}/.pampax/chunks/{sha}.gz` or `{sha}.gz.enc`
- Returns: Complete source code

Index a project from the agent.

- Parameters:
  - `path` (string, optional) - PROJECT ROOT directory path to index (will create `.pampax/` subdirectory here)
  - `provider` (string, optional) - Embedding provider (default: "auto")
- Creates:
  - `{path}/.pampax/pampax.db` (SQLite database with embeddings)
  - `{path}/.pampax/chunks/` (compressed code chunks)
  - `{path}/pampax.codemap.json` (lightweight index for version control)
- Effect: Updates database and codemap

🔄 CRITICAL: Use this tool frequently to keep your AI memory current!

Update project index after code changes (recommended workflow tool).

- Parameters:
  - `path` (string, optional) - PROJECT ROOT directory path to update (same as used in `index_project`)
  - `provider` (string, optional) - Embedding provider (default: "auto")
- Updates:
  - Re-scans all files for changes
  - Updates embeddings for modified functions
  - Removes deleted functions from database
  - Adds new functions to database
- When to use:
  - ✅ At the start of development sessions
  - ✅ After creating new functions
  - ✅ After modifying existing functions
  - ✅ After deleting functions
  - ✅ Before major code analysis tasks
  - ✅ After refactoring code
- Effect: Keeps your AI agent's code memory synchronized with current state

Get indexed project statistics.

- Parameters:
  - `path` (string, optional) - PROJECT ROOT directory path where PAMPAX database is located
- Database Location: `{path}/.pampax/pampax.db`
- Returns: Statistics by language and file

📊 Available MCP Resources

- Access to the complete project code map.
- Summary of the project's main functions.

🎯 Available MCP Prompts

- Template for analyzing found code with specific focus.
- Template for finding existing similar functions.

🔍 How Retrieval Works

- Vector search – Cosine similarity with advanced high-dimensional embeddings
- Summary fallback – If an agent sends an empty query, PAMPAX returns top-level summaries so the agent understands the territory
- Chunk granularity – Default = function/method/class. Adjustable per language
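
As a reference for the vector search step, cosine similarity and a top-k ranking over stored chunk vectors can be sketched as follows. The chunk objects and two-dimensional vectors are made-up toy data; real embeddings have hundreds or thousands of dimensions, and this is not the PAMPAX query code.

```javascript
// Cosine similarity between two equal-length numeric vectors.
function cosineSimilarity(a, b) {
  if (a.length !== b.length) throw new Error("Vectors must have the same length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Score every chunk against the query and keep the k best matches.
function topK(queryVector, chunks, k = 10) {
  return chunks
    .map((chunk) => ({ sha: chunk.sha, score: cosineSimilarity(queryVector, chunk.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

// Toy data: hypothetical SHAs with tiny vectors.
const storedChunks = [
  { sha: "aaa", vector: [1, 0] },
  { sha: "bbb", vector: [0, 1] },
  { sha: "ccc", vector: [1, 1] },
];
console.log(topK([1, 0], storedChunks, 2)); // "aaa" first (score 1), then "ccc"
```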

📝 Design Decisions

- Node only → Devs run everything via npx, no Python, no Docker
- SQLite over HelixDB → One local database for vectors and relations, no external dependencies
- Committed codemap → Context travels with repo → cloning works offline
- Chunk granularity → Default = function/method/class. Adjustable per language
- Read-only by default → Server only exposes read methods. Writing is done via CLI

🧩 Extending PAMPAX

| Idea              | Hint                                                                                        |
| ----------------- | ------------------------------------------------------------------------------------------- |
| More languages    | Install tree-sitter grammar and add it to `LANG_RULES`                                      |
| Custom embeddings | Export `OPENAI_API_KEY` or switch OpenAI for any provider that returns `vector: number[]`   |
| Security          | Run behind a reverse proxy with authentication                                              |
| VS Code Plugin    | Point an MCP WebView client to your local server                                            |

🔐 Encrypting the Chunk Store

PAMPAX can encrypt chunk bodies at rest using AES-256-GCM. Configure it like this:

1. Export a 32-byte key in base64 or hex form:

   ```bash
   export PAMPAX_ENCRYPTION_KEY="$(openssl rand -base64 32)"
   ```

2. Index with encryption enabled (skips plaintext writes even if stale files exist):

   ```bash
   pampax index --encrypt on
   ```

   Without `--encrypt`, PAMPAX auto-encrypts when the environment key is present. Use `--encrypt off` to force plaintext (e.g., for debugging).

3. All new chunks are stored as `.gz.enc` and require the same key for CLI or MCP chunk retrieval. Missing or corrupt keys surface clear errors instead of leaking data.

Existing plaintext archives remain readable, so you can enable encryption incrementally or rotate keys by re-indexing.
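
For orientation, the gzip → AES-256-GCM pipeline can be reproduced with Node's built-in `crypto` and `zlib` modules. This is a sketch under stated assumptions: the byte layout (IV, then auth tag, then ciphertext) and the helper names are hypothetical, and the actual `.gz.enc` format used by PAMPAX may differ.

```javascript
const { createCipheriv, createDecipheriv, randomBytes } = require("node:crypto");
const { gzipSync, gunzipSync } = require("node:zlib");

// Assumed layout for this sketch: [12-byte IV][16-byte auth tag][ciphertext].
function encryptChunk(code, key) {
  const iv = randomBytes(12); // fresh nonce for every chunk
  const cipher = createCipheriv("aes-256-gcm", key, iv);
  const ciphertext = Buffer.concat([cipher.update(gzipSync(code)), cipher.final()]);
  return Buffer.concat([iv, cipher.getAuthTag(), ciphertext]);
}

function decryptChunk(payload, key) {
  const iv = payload.subarray(0, 12);
  const tag = payload.subarray(12, 28);
  const decipher = createDecipheriv("aes-256-gcm", key, iv);
  decipher.setAuthTag(tag);
  // final() throws if the key is wrong or the data was tampered with.
  const gz = Buffer.concat([decipher.update(payload.subarray(28)), decipher.final()]);
  return gunzipSync(gz).toString("utf8");
}

// A random 32-byte key stands in for the decoded PAMPAX_ENCRYPTION_KEY value.
const demoKey = randomBytes(32);
const stored = encryptChunk("function createCheckoutSession() {}", demoKey);
console.log(decryptChunk(stored, demoKey)); // function createCheckoutSession() {}
```

Because GCM is authenticated, decrypting with a different key fails with an error rather than returning garbage, which matches the "clear errors instead of leaking data" behavior described above.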

🔧 Troubleshooting

If you see a native module error after upgrading Node.js, the npx cache contains native modules compiled for your old Node.js version:

```bash
# Clear the npm/npx cache
npm cache clean --force

# Remove cached pampax (Linux/macOS)
find ~/.npm/_npx -name "pampax" -type d -exec rm -rf {} + 2>/dev/null
find ~/.cache -path "*_npx*" -name "pampax" -type d -exec rm -rf {} + 2>/dev/null

# Then run pampax again
npx -y pampax@latest mcp
```

For more troubleshooting tips, see docs/TROUBLESHOOTING.md.

🤝 Contributing

1. Fork → create feature branch (`feat/...`)
2. Run `npm test` (coming soon) & `pampax index` before PR
3. Open PR with context: why + screenshots/logs

All discussions on GitHub Issues.

📜 License

MIT – do whatever you want, just keep the copyright.

Happy hacking! 💙

PAMPAX – Protocol for Augmented Memory of Project Artifacts Extended

Version 1.14.0 · Token-Based Chunking · Semantic Search · MCP Compatible · Node.js

Agent Rules Kit Logo

Version
License
Last Commit

Give your AI agents an always-updated, queryable memory of any codebase – with intelligent semantic search and automatic learning – in one npx command.

> 🇺🇸 English Version | 🇪🇸 Versión en Español | 🤖 Agent Version

🌟 What's New in v1.13 - Advanced Search & Multi-Project Support

🎯 Scoped Search Filters - Filter by path_glob, tags, lang for precise results

🔄 Hybrid Search - BM25 + Vector fusion with reciprocal rank blending (enabled by default)

🧠 Cross-Encoder Re-Ranker - Transformers.js reranker for precision boosts

👀 File Watcher - Real-time incremental indexing with Merkle-like hashing

📦 Context Packs - Reusable search scopes with CLI + MCP integration

🛠️ Multi-Project CLI - --project and --directory aliases for clarity

🏆 Performance Analysis - Architectural comparison with general-purpose IDE tools

Major improvements:

🌟 Why PAMPAX?

Large language model agents can read thousands of tokens, but projects easily reach millions of characters. Without an intelligent retrieval layer, agents:

- Recreate functions that already exist
- Misname APIs (newUser vs. createUser)
- Waste tokens loading repetitive code (vendor/, node_modules/...)
- Fail when the repository grows

PAMPAX solves this by turning your repository into a semantic code memory graph:

Any MCP-compatible agent (Cursor, Claude, etc.) can now search with natural language, get instant responses for learned patterns, and stay synchronized – without scanning the entire tree.

🤖 For AI Agents & Humans

> 🤖 If you're an AI agent: Read the complete setup guide for agents →
> or
> 👤 If you're human: Share the agent setup guide with your AI assistant to automatically configure PAMPAX!

🧠 Semantic Features

PAMPAX automatically extracts semantic tags from your code without any special comments:

```javascript
// File: app/Services/Payment/StripeService.php
function createCheckoutSession() { ... }
```

Automatic tags: ["stripe", "service", "payment", "checkout", "session", "create"]
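
As a rough illustration of how tags like these could be derived, the sketch below splits a file path and a function name into lowercase words. `splitWords`, `extractTags`, and the stop-word list are hypothetical names invented for this example; PAMPAX's actual tagger may work differently.

```javascript
// Split an identifier or path into lowercase words.
function splitWords(identifier) {
  return identifier
    .replace(/([a-z0-9])([A-Z])/g, "$1 $2") // camelCase -> "camel Case"
    .split(/[^A-Za-z0-9]+/)                 // also split on /, ., _, - and spaces
    .filter(Boolean)
    .map((word) => word.toLowerCase());
}

// Hypothetical helper: combine words from the path and the symbol name.
function extractTags(filePath, symbolName) {
  const withoutExtension = filePath.replace(/\.[^./]+$/, "");
  const stopWords = new Set(["app", "src", "lib", "services"]); // assumed noise list
  const tags = [];
  for (const word of [...splitWords(withoutExtension), ...splitWords(symbolName)]) {
    if (!stopWords.has(word) && !tags.includes(word)) tags.push(word);
  }
  return tags;
}

const tags = extractTags(
  "app/Services/Payment/StripeService.php",
  "createCheckoutSession"
);
console.log(tags);
```

For the example file this yields the same six tags listed above, though not necessarily in the same order.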

The system learns from successful searches and provides instant responses:

```bash
# First search (vector search)
"stripe payment session" → 0.9148 similarity
# System automatically learns and caches this pattern

# Next similar searches are instant:
"create stripe session" → instant response (cached)
"stripe checkout session" → instant response (cached)
```
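
One way to picture the learning step is a small cache keyed by query tokens, where a new query reuses a stored result if it overlaps enough with an earlier successful one. This is only a sketch: `PatternCache`, the Jaccard overlap measure, and the 0.5 threshold are assumptions for illustration, not PAMPAX internals.

```javascript
// Turn a query into a set of lowercase tokens.
function normalize(query) {
  return new Set(query.toLowerCase().split(/\s+/).filter(Boolean));
}

// Jaccard overlap: shared tokens divided by distinct tokens in both sets.
function jaccard(a, b) {
  let shared = 0;
  for (const token of a) if (b.has(token)) shared += 1;
  return shared / (a.size + b.size - shared);
}

class PatternCache {
  constructor(threshold = 0.5) {
    this.threshold = threshold; // assumed cut-off, not a PAMPAX default
    this.entries = [];
  }

  // Remember the result of a successful search.
  learn(query, result) {
    this.entries.push({ tokens: normalize(query), result });
  }

  // Return the best cached result at or above the threshold, or null.
  lookup(query) {
    const tokens = normalize(query);
    let best = null;
    for (const entry of this.entries) {
      const score = jaccard(tokens, entry.tokens);
      if (score >= this.threshold && (best === null || score > best.score)) {
        best = { score, result: entry.result };
      }
    }
    return best ? best.result : null;
  }
}

const cache = new PatternCache();
cache.learn("stripe payment session", "chunk:createCheckoutSession");
const hit = cache.lookup("create stripe session"); // shares 2 of 4 distinct tokens
const miss = cache.lookup("oauth middleware");     // shares no tokens
console.log(hit, miss);
```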

Enhance search precision with optional JSDoc-style comments:

Benefits:

📝 Supported Languages

PAMPAX can index and search code in 22 languages out of the box:

- JavaScript / TypeScript (`.js`, `.ts`, `.tsx`, `.jsx`)
- Python (`.py`)
- Java (`.java`)
- Kotlin (`.kt`) ⭐ _NEW_
- Go (`.go`)
- Rust (`.rs`) ⭐ _NEW_
- C++ (`.cpp`, `.hpp`, `.cc`) ⭐ _NEW_
- C (`.c`, `.h`) ⭐ _NEW_
- C# (`.cs`) ⭐ _NEW_
- PHP (`.php`)
- Ruby (`.rb`) ⭐ _NEW_
- Scala (`.scala`) ⭐ _NEW_
- Swift (`.swift`) ⭐ _NEW_
- Lua (`.lua`) ⭐ _NEW_
- OCaml (`.ml`, `.mli`) ⭐ _NEW_
- Haskell (`.hs`) ⭐ _NEW_
- Elixir (`.ex`, `.exs`) ⭐ _NEW_

- HTML (`.html`, `.htm`) ⭐ _NEW_
- CSS (`.css`) ⭐ _NEW_
- JSON (`.json`) ⭐ _NEW_
- Markdown (`.md`) ⭐ _NEW_

- Bash (`.sh`, `.bash`) ⭐ _NEW_

🆕 What's New in v1.14 - Token-Based Chunking

PAMPAX v1.14.0 introduces intelligent token-based chunking that automatically optimizes chunk sizes for your embedding model:

Quick Start:

```bash
npm install tiktoken  # For best results
pampax index          # Automatic token-based chunking!
```

Configuration Examples:

```bash
# Custom token limit and dimensions
export PAMPAX_MAX_TOKENS=2000
export PAMPAX_DIMENSIONS=1536
pampax index --provider openai

# Or set in MCP config (see MCP Installation section)
```

See TOKEN_CHUNKING_v1.14.md for full documentation and MIGRATION_GUIDE_v1.14.md for upgrade instructions.

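
To make the idea of a token budget concrete, here is a minimal sketch that packs whole lines into chunks without exceeding a limit. It is not PAMPAX's chunker: `estimateTokens` uses a crude "about four characters per token" guess instead of tiktoken, and `chunkByTokens` is a hypothetical helper.

```javascript
// Rough token estimate: ~4 characters per token (an assumption, not tiktoken).
function estimateTokens(text) {
  return Math.ceil(text.length / 4);
}

// Greedily pack whole lines into chunks that stay within maxTokens.
function chunkByTokens(source, maxTokens) {
  const chunks = [];
  let current = [];
  let currentTokens = 0;
  for (const line of source.split("\n")) {
    const lineTokens = estimateTokens(line + "\n");
    if (current.length > 0 && currentTokens + lineTokens > maxTokens) {
      chunks.push(current.join("\n"));
      current = [];
      currentTokens = 0;
    }
    current.push(line); // a single oversized line still becomes its own chunk
    currentTokens += lineTokens;
  }
  if (current.length > 0) chunks.push(current.join("\n"));
  return chunks;
}

// Read the limit the same way the shell example sets it (default assumed here).
const configuredLimit = Number(process.env.PAMPAX_MAX_TOKENS ?? 2000);

const sample = Array.from({ length: 10 }, (_, i) => `const value${i} = ${i};`).join("\n");
const chunks = chunkByTokens(sample, 12); // tiny limit so the split is visible
console.log(`${chunks.length} chunks (configured limit: ${configuredLimit})`);
```

Splitting on line boundaries keeps each chunk readable; a real chunker would more likely split on syntax nodes and count tokens with the model's own tokenizer.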
🚀 MCP Installation (Recommended)

#### Claude Desktop

Add to your Claude Desktop config (`~/Library/Application Support/Claude/claude_desktop_config.json` on macOS):

Or use OpenAI directly:

```json
{
  "mcpServers": {
    "pampax": {
      "command": "npx",
      "args": ["-y", "pampax", "mcp"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}
```

Environment Variables (optional):

Rate Limiting:

- `PAMPAX_RATE_LIMIT` - Maximum embedding API requests per minute (default: 50 for OpenAI, 100 for Cohere, unlimited for local models)
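
A requests-per-minute cap like this is commonly implemented as a sliding window. The sketch below shows that general technique under stated assumptions: `RateLimiter` and `reserve` are made-up names, and PAMPAX's own limiter may queue or back off differently.

```javascript
// Sliding-window limiter. The clock is injectable so the logic can be
// exercised without real waiting.
class RateLimiter {
  constructor(requestsPerMinute, now = () => Date.now()) {
    this.limit = requestsPerMinute;
    this.now = now;
    this.timestamps = []; // send times within the last 60 s
  }

  // Returns how many milliseconds to wait before sending (0 means "go now").
  reserve() {
    const time = this.now();
    this.timestamps = this.timestamps.filter((t) => time - t < 60_000);
    if (this.timestamps.length < this.limit) {
      this.timestamps.push(time);
      return 0;
    }
    return 60_000 - (time - this.timestamps[0]);
  }
}

let fakeTime = 0;
const limiter = new RateLimiter(2, () => fakeTime);
const first = limiter.reserve();       // allowed
const second = limiter.reserve();      // allowed
const third = limiter.reserve();       // window full: wait until the oldest expires
fakeTime = 60_000;
const afterWindow = limiter.reserve(); // allowed again
console.log(first, second, third, afterWindow);
```

Passing `Infinity` as the limit models the "unlimited for local models" case, since the window can never fill up.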

Note: Using npx automatically downloads and runs the latest version from npm, no global installation needed.

Optional: Add "--debug" to args for detailed logging: ["-y", "pampax", "mcp", "--debug"]

#### Cursor

Configure Cursor by creating or editing the mcp.json file in your configuration directory:

Your AI agent should automatically:

- Check if the project is indexed with `get_project_stats`
- Index the project with `index_project` if needed
- Keep it updated with `update_project` after changes

Need to index manually? See Direct CLI Usage section.

Additionally, install this rule in your application so it uses PAMPAX effectively:

Copy the content from RULE_FOR_PAMPAX_MCP.md into your agent or AI system instructions.

Once configured, your AI agent can:

- 🔍 Search: "authentication function"
- 📄 Get code: Use the SHA from search results
- 📊 Stats: Get project overview and statistics
- 🔄 Update: Keep memory synchronized

💻 Direct CLI Usage

For direct terminal usage or manual project indexing:

```bash
# Install globally from npm (requires Node.js 16+)
npm install -g pampax

# Verify installation
pampax --help
```

Alternative installations:

```bash
# Install from GitHub (latest development version)
npm install -g git+https://github.com/lemon07r/pampax.git

# Use npx (no global installation required)
npx pampax index
```

```bash
# Index current repository with the best available provider
pampax index

# Force the local CPU embedding model (no API keys required)
pampax index --provider transformers

# Re-embed after code changes
pampax update

# Inspect indexed stats at any time
pampax info
```

> Indexing writes .pampax/ (SQLite database + chunk store) and pampax.codemap.json. Commit the codemap to git so teammates and CI re-use the same metadata.

`pampax search` supports the same filters used by MCP clients. Combine glob patterns, semantic tags, language filters, provider overrides, and ranking controls:

```bash
# Narrow to service files tagged stripe in PHP
pampax search "create checkout session" --path_glob "app/Services/**" --tags stripe --lang php

# Use OpenAI embeddings but keep hybrid fusion enabled
pampax search "payment intent status" --provider openai --hybrid on --bm25 on

# Reorder top candidates locally
pampax search "oauth middleware" --reranker transformers --limit 5

# Disable signature boosts for literal keyword hunts
pampax search "token validation" --symbol_boost off
```
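
Conceptually, scoped filters narrow the candidate set before ranking. The following sketch shows one plausible way to apply `path_glob`, `tags`, and `lang` to a list of chunks. The chunk shape, `globToRegExp`, and `matchesFilters` are hypothetical, and treating multiple tags as "all must match" is an assumption.

```javascript
// Convert a simple glob to a RegExp: "**" spans directories, "*" stays in one segment.
function globToRegExp(glob) {
  const pattern = glob
    .replace(/[.+^${}()|[\]\\]/g, "\\$&") // escape regex metacharacters except * and ?
    .replace(/\*\*/g, "\u0000")            // placeholder so the "*" rule skips "**"
    .replace(/\*/g, "[^/]*")
    .replace(/\u0000/g, ".*")
    .replace(/\?/g, "[^/]");
  return new RegExp(`^${pattern}$`);
}

// Keep a chunk only if it passes every filter that was provided.
function matchesFilters(chunk, { pathGlob, tags = [], lang } = {}) {
  if (pathGlob && !globToRegExp(pathGlob).test(chunk.path)) return false;
  if (lang && chunk.lang !== lang) return false;
  return tags.every((tag) => chunk.tags.includes(tag));
}

const indexedChunks = [
  { path: "app/Services/Payment/StripeService.php", lang: "php", tags: ["stripe", "payment"] },
  { path: "app/Http/Controllers/AuthController.php", lang: "php", tags: ["auth"] },
  { path: "web/src/checkout.ts", lang: "typescript", tags: ["stripe"] },
];

const scoped = indexedChunks.filter((chunk) =>
  matchesFilters(chunk, { pathGlob: "app/Services/**", tags: ["stripe"], lang: "php" })
);
console.log(scoped.map((chunk) => chunk.path));
```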


> PAMPAX extracts function signatures and lightweight call graphs with tree-sitter. When symbol boosts are enabled, queries that mention a specific method, class, or a directly connected helper will receive an extra scoring bump.
> When a context pack is active, the CLI prints the pack name before executing the search. Any explicit flag overrides the pack defaults.
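
The symbol boost can be thought of as a small post-processing step on candidate scores. The sketch below is illustrative only: the candidate shape, the `calledBy` field, and the boost values (0.1 for a direct mention, half that for a directly connected helper) are assumptions, not PAMPAX's actual scoring.

```javascript
// Bump candidates whose symbol is named in the query, and give a smaller
// bump to helpers that are called by a symbol named in the query.
function applySymbolBoost(query, candidates, boost = 0.1) {
  const queryTokens = new Set(
    query.toLowerCase().split(/[^a-z0-9]+/).filter(Boolean)
  );
  return candidates
    .map((candidate) => {
      const direct = queryTokens.has(candidate.symbol.toLowerCase());
      const viaCallGraph = (candidate.calledBy ?? []).some((caller) =>
        queryTokens.has(caller.toLowerCase())
      );
      const bump = direct ? boost : viaCallGraph ? boost / 2 : 0;
      return { ...candidate, score: candidate.score + bump };
    })
    .sort((a, b) => b.score - a.score);
}

const ranked = applySymbolBoost("where is validateToken called", [
  { symbol: "refreshSession", score: 0.62 },
  { symbol: "validateToken", score: 0.58 },
  { symbol: "decodeJwt", score: 0.55, calledBy: ["validateToken"] },
]);
console.log(ranked.map((candidate) => candidate.symbol));
```

Here `validateToken` overtakes `refreshSession` because the query names it, while `decodeJwt` gets a smaller lift for being one call away.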

Store JSON packs in `.pampax/contextpacks/*.json` to capture reusable defaults:

```bash
# List packs and highlight the active one
pampax context list

# Inspect the full JSON definition
pampax context show stripe-backend

# Activate scoped defaults (flags still win if provided explicitly)
pampax context use stripe-backend

# Clear the active pack (use "none" or "clear")
pampax context use clear
```

MCP tip: The MCP tool use_context_pack mirrors the CLI. Agents can switch packs mid-session and every subsequent search_code call inherits those defaults until cleared.
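
The "flags still win" rule amounts to a layered merge. The sketch below assumes a pack shape with a `defaults` object, which is a guess for illustration; the example pack shipped with the project is the authoritative reference for the real schema.

```javascript
// Hypothetical pack shape, used only for this example.
const stripeBackendPack = {
  name: "stripe-backend",
  defaults: { path_glob: "app/Services/**", tags: ["stripe"], lang: "php" },
};

// Merge order: built-in defaults < active pack < explicit flags.
// Flags left undefined do not override anything.
function resolveSearchOptions(builtIn, activePack, explicitFlags) {
  const definedFlags = Object.fromEntries(
    Object.entries(explicitFlags).filter(([, value]) => value !== undefined)
  );
  return {
    ...builtIn,
    ...(activePack ? activePack.defaults : {}),
    ...definedFlags,
  };
}

const options = resolveSearchOptions(
  { limit: 10, hybrid: "on" },             // assumed built-in defaults
  stripeBackendPack,
  { lang: "typescript", limit: undefined } // only --lang was passed explicitly
);
console.log(options);
```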

```bash
# Watch the repository with a 750 ms debounce and local embeddings
pampax watch --provider transformers --debounce 750
```

The watcher batches filesystem events, reuses the Merkle hash store in .pampax/merkle.json, and only re-embeds touched files. Press Ctrl+C to stop.
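
The "only re-embeds touched files" behavior boils down to comparing stored content hashes with fresh ones. The sketch below illustrates that comparison with a flat path-to-hash map. It is not the format of `.pampax/merkle.json`, `diffAgainstStore` is a hypothetical helper, and FNV-1a stands in for whatever hash PAMPAX really uses. Debouncing is left out.

```javascript
// FNV-1a 32-bit hash: a dependency-free stand-in for a real content hash.
function fnv1a(text) {
  let hash = 0x811c9dc5;
  for (let i = 0; i < text.length; i += 1) {
    hash ^= text.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193) >>> 0;
  }
  return hash.toString(16).padStart(8, "0");
}

// Compare current file contents against stored hashes and report what changed.
function diffAgainstStore(storedHashes, currentFiles) {
  const changed = [];
  const nextHashes = {};
  for (const [path, content] of Object.entries(currentFiles)) {
    nextHashes[path] = fnv1a(content);
    if (storedHashes[path] !== nextHashes[path]) changed.push(path);
  }
  const removed = Object.keys(storedHashes).filter((path) => !(path in currentFiles));
  return { changed, removed, nextHashes };
}

// First run: everything is new. Second run: only b.js was edited.
const firstRun = diffAgainstStore({}, { "a.js": "let a = 1;", "b.js": "let b = 2;" });
const secondRun = diffAgainstStore(firstRun.nextHashes, {
  "a.js": "let a = 1;",
  "b.js": "let b = 3;",
});
console.log(secondRun.changed); // files that would need re-embedding
```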

```bash
npm run bench
```

Benchmark runs never download external models when PAMPAX_MOCK_RERANKER_TESTS=1 (enabled by default inside the harness).

An end-to-end context pack example lives in examples/contextpacks/stripe-backend.json.

🧠 Embedding Providers

PAMPAX supports multiple providers for generating code embeddings:

Recommendation: Use Transformers.js for personal development (free and private) or OpenAI for maximum quality.

PAMPAX supports any OpenAI-compatible API endpoint through environment variables:

```bash
# LM Studio (local)
export OPENAI_BASE_URL="http://localhost:1234/v1"
export OPENAI_API_KEY="lm-studio"  # Can be any value for local servers

# Azure OpenAI
export OPENAI_BASE_URL="https://YOUR_RESOURCE.openai.azure.com/openai/deployments/YOUR_DEPLOYMENT"
export OPENAI_API_KEY="your-azure-api-key"

# LocalAI
export OPENAI_BASE_URL="http://localhost:8080/v1"
export OPENAI_API_KEY="not-needed"

# Ollama with OpenAI compatibility
export OPENAI_BASE_URL="http://localhost:11434/v1"
export OPENAI_API_KEY="ollama"
```

Then index with the OpenAI provider:

```bash
pampax index --provider openai
```

You can configure which embedding model to use via environment variables:

OpenAI Provider:

```bash
# Use a specific OpenAI model
export PAMPAX_OPENAI_EMBEDDING_MODEL="text-embedding-3-small"  # Cheaper, faster
# or
export OPENAI_MODEL="text-embedding-3-large"  # Alternative env var
# Default: text-embedding-3-large
```

Reranker Configuration:

```bash
# Local Transformers.js reranker model
export PAMPAX_RERANKER_MODEL="Xenova/ms-marco-MiniLM-L-6-v2"

# Max candidates to rerank and max tokens per document
export PAMPAX_RERANKER_MAX=50
export PAMPAX_RERANKER_MAX_TOKENS=512

# Or use remote API reranker (Cohere, Jina AI, etc.)
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-cohere-api-key"
export PAMPAX_RERANK_MODEL="rerank-v3.5"
```

Other Providers:

```bash
# Transformers.js (local)
export PAMPAX_TRANSFORMERS_MODEL="Xenova/all-mpnet-base-v2"
# Default: Xenova/all-MiniLM-L6-v2

# Ollama
export PAMPAX_OLLAMA_MODEL="llama2"
# Default: nomic-embed-text

# Cohere
export PAMPAX_COHERE_MODEL="embed-multilingual-v3.0"
# Default: embed-english-v3.0
```
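
To summarize how these variables and defaults fit together, here is a small lookup sketch. The variable names and default models are the ones listed above; `resolveEmbeddingModel` is a hypothetical helper, and checking `PAMPAX_OPENAI_EMBEDDING_MODEL` before `OPENAI_MODEL` is an assumed precedence.

```javascript
// Default models as documented above.
const DEFAULT_MODELS = {
  openai: "text-embedding-3-large",
  transformers: "Xenova/all-MiniLM-L6-v2",
  ollama: "nomic-embed-text",
  cohere: "embed-english-v3.0",
};

// Environment variables per provider, in the order they are checked (assumed).
const MODEL_VARIABLES = {
  openai: ["PAMPAX_OPENAI_EMBEDDING_MODEL", "OPENAI_MODEL"],
  transformers: ["PAMPAX_TRANSFORMERS_MODEL"],
  ollama: ["PAMPAX_OLLAMA_MODEL"],
  cohere: ["PAMPAX_COHERE_MODEL"],
};

function resolveEmbeddingModel(provider, env = process.env) {
  if (!Object.prototype.hasOwnProperty.call(DEFAULT_MODELS, provider)) {
    throw new Error(`Unknown provider: ${provider}`);
  }
  for (const name of MODEL_VARIABLES[provider]) {
    if (env[name]) return env[name]; // first non-empty variable wins
  }
  return DEFAULT_MODELS[provider];
}

console.log(resolveEmbeddingModel("openai", {}));
console.log(resolveEmbeddingModel("cohere", { PAMPAX_COHERE_MODEL: "embed-multilingual-v3.0" }));
```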

Example with Novita.ai Qwen Models:

```bash
# Configure Novita.ai with Qwen embedding model
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-api-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"

# Index with this configuration
pampax index --provider openai

# Search will use the Qwen embedding model
pampax search "authentication logic" --provider openai
```


Supported Services:
- ✅ llama.cpp
- ✅ Kobold.cpp
- ✅ LM Studio
- ✅ Azure OpenAI
- ✅ Ollama (with OpenAI compatibility)
- ✅ Any OpenAI-compatible API gateway or proxy

PAMPAX supports API-based reranking as an alternative to the local Transformers.js cross-encoder. This allows you to use remote reranking services for improved search precision.

Supported Reranking APIs:

- ✅ Cohere Rerank API
- ✅ Jina AI Reranker
- ✅ Voyage AI Rerank
- ✅ Any compatible reranking API

Configuration:

```bash
# Set reranking API credentials
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1"
export PAMPAX_RERANK_API_KEY="your-api-key"
export PAMPAX_RERANK_MODEL="rerank-v3.5"  # Optional, model to use

# Search with API reranker
pampax search "authentication logic" --reranker api

# Or use in CLI
pampax search "payment processing" --reranker api --limit 5
```

Example with Cohere:

```bash
export PAMPAX_RERANK_API_URL="https://api.cohere.ai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-cohere-api-key"
export PAMPAX_RERANK_MODEL="rerank-english-v3.0"
```

```bash
# Use together with Qwen embedding for full pipeline
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-api-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"

# Index with Qwen embeddings
pampax index --provider openai

# Search with both Qwen embedding and reranking
pampax search "authentication logic" --provider openai --reranker api
```

MCP Integration: When API reranking is configured, the MCP `search_code` tool automatically uses it when `reranker: "api"` is specified.

🏆 Performance Analysis

PAMPAX v1.13 uses a specialized architecture for semantic code search with measurable results.

Synthetic Benchmark Results:

🏆 Qwen3-Reranker-8B achieves perfect scores (100%) across all metrics!

Run benchmarks yourself:

```bash
# Run default benchmarks (Base, Hybrid, Hybrid+CE)
npm run bench

# Test with Qwen models via Novita.ai
export OPENAI_BASE_URL="https://api.novita.ai/openai"
export OPENAI_API_KEY="your-novita-key"
export PAMPAX_OPENAI_EMBEDDING_MODEL="qwen/qwen3-embedding-8b"
export PAMPAX_RERANK_API_URL="https://api.novita.ai/openai/v1/rerank"
export PAMPAX_RERANK_API_KEY="your-novita-key"
export PAMPAX_RERANK_MODEL="qwen/qwen3-reranker-8b"
export PAMPA_BENCH_RERANKER="api"
npm run bench

# Test other configurations
export PAMPA_BENCH_RERANKER="transformers"
npm run bench
```

See BENCHMARK_v1.12.md for detailed configuration options and analysis.


```bash
# Search for authentication functions
pampax search "user authentication"
# → AuthController::login, UserService::authenticate, etc.

# Search for payment processing
pampax search "payment processing"
# → PaymentService::process, CheckoutController::create, etc.

# Search with specific filters
pampax search "database operations" --lang php --path_glob "app/Models/**"
# → UserModel::save, OrderModel::find, etc.
```

📈 Read Full Analysis →

1. Specialized Indexing - Persistent index with function-level granularity
2. Hybrid Search - BM25 + Vector + Cross-encoder reranking combination
3. Code Awareness - Symbol boosting, AST analysis, function signatures
4. Multi-Project - Native support for context across different codebases
Result: Optimized architecture for semantic code search with verifiable metrics.
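
Hybrid search is described as fusing BM25 and vector rankings with reciprocal rank blending. The sketch below shows the standard reciprocal rank fusion formula on two toy rankings. The constant `k = 60` is a commonly used value and an assumption here, and `reciprocalRankFusion` is not taken from the PAMPAX source.

```javascript
// Reciprocal rank fusion: each list contributes 1 / (k + rank) per item,
// with rank starting at 1. Items ranked well in several lists rise to the top.
function reciprocalRankFusion(rankings, k = 60) {
  const scores = new Map();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + index + 1));
    });
  }
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}

const bm25Ranking = ["chunkA", "chunkB", "chunkC"];   // keyword matches
const vectorRanking = ["chunkB", "chunkD", "chunkA"]; // embedding matches
const fused = reciprocalRankFusion([bm25Ranking, vectorRanking]);
console.log(fused);
```

`chunkB` and `chunkA` lead the fused list because both retrievers returned them; a cross-encoder reranker would then reorder only the top of this list.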

🏗️ Architecture

🔧 Available MCP Tools

The MCP server exposes these tools that agents can use:

Search code semantically in the indexed project.

Get complete code of a specific chunk.

Index a project from the agent.

🔄 CRITICAL: Use this tool frequently to keep your AI memory current!

Update project index after code changes (recommended workflow tool).

Get indexed project statistics.

- Parameters:
  - `path` (string, optional) - PROJECT ROOT directory path where PAMPAX database is located
- Database Location: `{path}/.pampax/pampax.db`
- Returns: Statistics by language and file

📊 Available MCP Resources

Access to the complete project code map.

Summary of the project's main functions.

🎯 Available MCP Prompts

Template for analyzing found code with specific focus.

Template for finding existing similar functions.

🔍 How Retrieval Works

📝 Design Decisions

🧩 Extending PAMPAX

🔐 Encrypting the Chunk Store

PAMPAX can encrypt chunk bodies at rest using AES-256-GCM. Configure it like this:

1. Export a 32-byte key in base64 or hex form:

   ```bash
   export PAMPAX_ENCRYPTION_KEY="$(openssl rand -base64 32)"
   ```

2. Index with encryption enabled (skips plaintext writes even if stale files exist):

   ```bash
   pampax index --encrypt on
   ```

   Without `--encrypt`, PAMPAX auto-encrypts when the environment key is present. Use `--encrypt off` to force plaintext (e.g., for debugging).

3. All new chunks are stored as `.gz.enc` and require the same key for CLI or MCP chunk retrieval. Missing or corrupt keys surface clear errors instead of leaking data.

Existing plaintext archives remain readable, so you can enable encryption incrementally or rotate keys by re-indexing.
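
Since the key may be supplied as base64 or hex, a loader has to decide which encoding it is looking at and verify the length. The sketch below shows one way to do that. `decodeEncryptionKey` is a hypothetical helper, and preferring hex when the value is exactly 64 hex characters is an assumption about how the ambiguity could be resolved.

```javascript
// Accept a 32-byte key given as 64 hex characters or as base64; reject anything else.
function decodeEncryptionKey(raw) {
  if (typeof raw !== "string" || raw.trim() === "") {
    throw new Error("PAMPAX_ENCRYPTION_KEY is not set");
  }
  const value = raw.trim();
  const key = /^[0-9a-fA-F]{64}$/.test(value)
    ? Buffer.from(value, "hex")
    : Buffer.from(value, "base64");
  if (key.length !== 32) {
    throw new Error(`Expected a 32-byte key, got ${key.length} bytes`);
  }
  return key;
}

// Example inputs in both supported forms (dummy keys, never use these for real data).
const hexKey = "00".repeat(32);
const base64Key = Buffer.alloc(32, 7).toString("base64");
console.log(decodeEncryptionKey(hexKey).length, decodeEncryptionKey(base64Key).length);
```

Failing fast on a wrong-sized key matches the documented behavior of surfacing a clear error rather than proceeding with unusable key material.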

🔧 Troubleshooting

If you see this error after upgrading Node.js, the npx cache contains native modules compiled for your old Node.js version:

```bash
# Clear the npm/npx cache
npm cache clean --force

# Remove cached pampax (Linux/macOS)
find ~/.npm/_npx -name "pampax" -type d -exec rm -rf {} + 2>/dev/null
find ~/.cache -path "*_npx*" -name "pampax" -type d -exec rm -rf {} + 2>/dev/null

# Then run pampax again
npx -y pampax@latest mcp
```

For more troubleshooting tips, see docs/TROUBLESHOOTING.md.
🤝 Contributing

1. Fork → create feature branch (`feat/...`)
2. Run `npm test` (coming soon) & `pampax index` before PR
3. Open PR with context: why + screenshots/logs

All discussions on GitHub Issues.

📜 License

MIT – do whatever you want, just keep the copyright.

Happy hacking! 💙
