# RagForge

RAG toolkit with Neo4j integration, code analysis, embeddings, and multi-language code parsing.

```bash
npm install @luciformresearch/ragforge
```

An MCP (Model Context Protocol) server that gives Claude persistent memory and powerful code analysis capabilities through a Neo4j knowledge graph.
RagForge extends Claude with:
- Persistent Memory - Everything Claude reads, searches, or analyzes is stored in a knowledge graph
- Semantic Code Search - Find code by meaning, not just text matching
- Multi-Project Context - Work across multiple codebases with full relationship tracking
- Web Research - Crawl, index, and search web documentation
- Visual Understanding - OCR, image analysis, 3D model generation
## Features

| Feature | Description |
|---------|-------------|
| MCP Server | Direct integration with Claude Code and Claude Desktop |
| Knowledge Graph | Neo4j-based persistent storage with relationship tracking |
| Code Parsing | TypeScript, JavaScript, Python, Vue, Svelte, HTML, CSS |
| Document Processing | PDF, DOCX, XLSX, Markdown, images (OCR) |
| Semantic Search | Vector embeddings for intelligent retrieval |
| Entity Extraction | Named entity recognition (GPU-accelerated) |
| Web Crawling | Index documentation and web pages |
---
## Requirements

| Requirement | Version | Notes |
|-------------|---------|-------|
| Node.js | ≥ 18.0.0 | Download |
| Docker | Latest | Install Docker |
| Docker Compose | v2+ | Included with Docker Desktop |
### Optional (GPU acceleration)

| Requirement | Notes |
|-------------|-------|
| NVIDIA GPU | CUDA-capable GPU (RTX 2000+ recommended) |
| NVIDIA Driver | ≥ 525.60.13 |
| nvidia-container-toolkit | Required for GPU in Docker |
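To confirm the driver meets the minimum before touching Docker, a quick check with standard `nvidia-smi` query flags:

```bash
# Prints the installed driver version and GPU name
nvidia-smi --query-gpu=driver_version,name --format=csv
```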
---
## Quick Start

```bash
# Clone or navigate to ragforge
cd packages/ragforge-core
```

### Setup
#### 1. Install the package
```bash
npm install @luciformresearch/ragforge
```

#### 2. Start services
```bash
cd node_modules/@luciformresearch/ragforge/services

# Copy and customize configuration
cp .env.example .env

# Start the required service (Neo4j); GLiNER + TEI are optional
docker compose up -d neo4j

# Or start all services, including GPU services, for faster processing
docker compose up -d
```
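As a sanity check before moving on, you can list what's running (plain Docker Compose, nothing RagForge-specific):

```bash
# All selected services should show "running" (neo4j at minimum)
docker compose ps
```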
#### 3. Configure API keys

Create `~/.ragforge/.env`:

```bash
# Required for embeddings (if not using local TEI/Ollama)
GEMINI_API_KEY=your-gemini-api-key

# Optional - for 3D model generation
REPLICATE_API_TOKEN=your-replicate-token
```
#### 4. Configure MCP for Claude Code

Create `.mcp.json` at your project root:

```json
{
  "mcpServers": {
    "ragforge": {
      "command": "node",
      "args": [
        "./node_modules/@luciformresearch/ragforge/dist/esm/cli/index.js",
        "mcp-server"
      ]
    }
  }
}
```

Create `.claude/settings.local.json` in your project:

```json
{
  "enableAllProjectMcpServers": true,
  "enabledMcpjsonServers": ["ragforge"]
}
```

#### 5. Configure environment

RagForge reads from `~/.ragforge/.env`:

```bash
# Neo4j connection
NEO4J_URI=bolt://localhost:7687
NEO4J_USER=neo4j
NEO4J_PASSWORD=ragforge

# API keys
GEMINI_API_KEY=your-gemini-api-key
```

#### 6. Restart Claude Code
After configuration, restart Claude Code to load the MCP server. You should see "ragforge" in the MCP servers list.
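If the server doesn't appear, you can run the same command Claude invokes directly; assuming the package is installed locally, it should start and then wait silently for an MCP client on stdio:

```bash
node ./node_modules/@luciformresearch/ragforge/dist/esm/cli/index.js mcp-server
```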
---
## Usage with Claude Code

### Ingest your project
When you open Claude Code in a project for the first time, ask Claude to ingest it:
```
"Ingest this project into your brain"
```

Claude will call `ingest_directory`, which:
1. Parses all code files (TypeScript, Python, Rust, Go, C, C++, C#, etc.)
2. Extracts scopes (functions, classes, methods, interfaces)
3. Resolves relationships (imports, inheritance, function calls)
4. Extracts entities (GLiNER - people, organizations, dates) if enabled
5. Generates embeddings for semantic search

Duration: 1-10 minutes depending on project size:
- Small project (100 files): ~1 minute
- Medium project (500 files): ~3-5 minutes
- Large project (1000+ files): ~5-10 minutes
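You can also trigger ingestion explicitly. A minimal call, using the same argument shape as the Core Operations table below (the path is a placeholder):

```
ingest_directory({ path: "/path/to/project" })
```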
### Semantic search
After ingestion, Claude can search semantically:
```
"Find code that handles user authentication"
"Where is the payment processing logic?"
"Show me all API endpoints"
```

This uses `brain_search` with vector embeddings - it finds code by meaning, not just keywords.
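The equivalent explicit call, mirroring the example in the tools table below (the query text is illustrative):

```
brain_search({ query: "user authentication", semantic: true })
```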
### Resetting the brain

If you need to reset the knowledge graph:
```
"Clean up the brain for this project and re-ingest"
```

Or manually:

```
cleanup_brain({ mode: "project", project_id: "your-project-id", confirm: true })
```

### Available MCP Tools
#### Core Operations
| Tool | Description | Example |
|------|-------------|---------|
| `ingest_directory` | Index a codebase | `{ path: "/path/to/project" }` |
| `brain_search` | Semantic search | `{ query: "auth logic", semantic: true }` |
| `cleanup_brain` | Delete project data | `{ mode: "project", project_id: "...", confirm: true }` |
| `list_brain_projects` | List all indexed projects | `{}` |

#### File Operations
| Tool | Description |
|------|-------------|
| `read_file` | Read file with line numbers, images, PDFs |
| `write_file` | Write/overwrite file content |
| `edit_file` | Search/replace or line-based editing |
| `grep_files` | Regex search in files |
| `glob_files` | Find files by pattern |
| `analyze_files` | Extract code structure on-the-fly |

#### Graph Navigation
| Tool | Description |
|------|-------------|
| `explore_node` | Navigate relationships from a node |
| `extract_dependency_hierarchy` | Get dependency trees |
| `run_cypher` | Execute raw Cypher queries |

#### Web & Research
| Tool | Description |
|------|-------------|
| `search_web` | Google search with AI synthesis |
| `fetch_web_page` | Render and extract web page content |
| `ingest_web_page` | Index web pages into the brain |
| `call_research_agent` | Autonomous research agent |

#### Images & 3D
| Tool | Description |
|------|-------------|
| `read_image` | OCR text extraction |
| `describe_image` | AI image description |
| `generate_image` | Create images from text |
| `generate_3d_from_text` | Create 3D models from description |

#### Configuration
| Tool | Description |
|------|-------------|
| `switch_embedding_provider` | Change embedding provider (gemini/tei/ollama) |
| `get_brain_status` | Check Neo4j connection and config |
| `set_api_key` | Configure API keys |
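For anything the higher-level tools don't cover, `run_cypher` executes raw Cypher. A sketch, assuming a `query` parameter and the `Scope` nodes mentioned in the GLiNER section (exact labels and properties may differ in your graph):

```
run_cypher({ query: "MATCH (s:Scope) RETURN s LIMIT 25" })
```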
---

## Installation Details

### Neo4j
Neo4j stores the knowledge graph. Start it with Docker:
```bash
docker compose up -d neo4j
```

Access Neo4j Browser: http://localhost:7474

- Username: `neo4j`
- Password: `ragforge` (or the value from `.env`)
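To poke at the graph directly, `cypher-shell` ships inside the Neo4j image. A sketch, assuming the compose service name `neo4j` and the default password:

```bash
# Count all nodes in the knowledge graph
docker compose exec neo4j cypher-shell -u neo4j -p ragforge \
  "MATCH (n) RETURN count(n) AS nodes;"
```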
### NVIDIA Container Toolkit (optional)

Required to run GLiNER and TEI with GPU acceleration.
#### Ubuntu/Debian
```bash
# Add NVIDIA package repository
curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | \
  sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg

curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
  sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
  sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list

# Install
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit

# Configure Docker
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker

# Verify
docker run --rm --gpus all nvidia/cuda:12.1.0-base-ubuntu22.04 nvidia-smi
```
#### Fedora/RHEL

```bash
curl -s -L https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo | \
  sudo tee /etc/yum.repos.d/nvidia-container-toolkit.repo

sudo dnf install -y nvidia-container-toolkit
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
```

#### Arch Linux

```bash
sudo pacman -S nvidia-container-toolkit
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
```

### GLiNER (Entity Extraction)

Extracts named entities (people, organizations, dates, etc.) from text.
```bash
docker compose up -d gliner
```

- Port: 6971
- GPU Memory: ~2-3 GB VRAM
- Model: `fastino/gliner2-large-v1` (default) or `urchade/gliner_multi_pii-v1`
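Once the container is up, the health endpoint from the Verify Installation section below is the quickest smoke test:

```bash
curl http://localhost:6971/health
# Expected: {"status":"healthy",...}
```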
#### GLiNER Domain Configuration

Entity extraction is configured per-domain in `services/gliner_service/entity-extraction.yaml`:

```yaml
# Domains with enabled=false skip entity extraction entirely
domains:
  # CODE - disabled (TypeScript parser already extracts functions/classes as Scope nodes)
  code:
    enabled: false
    description: "Text about programming, software, functions, APIs"

  # ECOMMERCE - enabled: extract products, brands, ingredients
  ecommerce:
    enabled: true
    description: "Text about products, shopping, prices, brands, beauty"
    entity_types:
      product: "A specific commercial product with a name"
      brand: "A company brand name"
      ingredient: "A specific ingredient in a product"
      price: "A monetary amount with currency"
    relation_types:
      made_by: "product is made by brand"
      contains: "product contains ingredient"

  # LEGAL - enabled: extract parties, contracts, dates
  legal:
    enabled: true
    description: "Legal documents, contracts, terms, obligations"
    entity_types:
      person: "A human individual mentioned by their full name"
      organization: "A company, institution, or legal entity"
      contract: "A named legal agreement or contract"
      date: "A specific date or deadline"
```

Why `enabled: false` instead of removing the domain?

- With `enabled: false`, the domain is still detected during classification
- When a file is classified as "code" or "documentation", entity extraction is skipped entirely
- This is faster than running extraction with empty entity types
- If you remove the domain, the file falls back to defaults and runs unnecessary extraction

In short: keep domains with `enabled: false` to skip extraction for those file types.

### TEI (Text Embeddings Inference)
HuggingFace's fast embedding server. Faster than Ollama for embeddings.
```bash
docker compose up -d tei
```

- Port: 8081
- GPU Memory: ~1 GB (bge-base) or ~0.5 GB (bge-small)
- Model: `BAAI/bge-base-en-v1.5`

For 8GB VRAM GPUs, use the smaller model:
```bash
TEI_MODEL=BAAI/bge-small-en-v1.5 docker compose up -d tei
```
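TEI speaks the standard text-embeddings-inference HTTP API, so once it's up you can request an embedding directly (the input text is arbitrary):

```bash
curl -X POST http://localhost:8081/embed \
  -H 'Content-Type: application/json' \
  -d '{"inputs": "What does the ingestion engine do?"}'
# Returns a JSON array containing one embedding vector
```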
---

## Embedding Providers
RagForge supports multiple embedding providers:
| Provider | Speed | Quality | GPU Required | Cost |
|----------|-------|---------|--------------|------|
| Gemini | Fast | Best | No | API credits |
| TEI | Very Fast | Good | Yes | Free (local) |
| Ollama | Slow | Good | Optional | Free (local) |
### Switching providers

```
# Via MCP tool
switch_embedding_provider({ provider: "gemini" })
switch_embedding_provider({ provider: "ollama", model: "nomic-embed-text" })
```

### Ollama setup
```bash
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull embedding model
ollama pull nomic-embed-text

# Start Ollama service
ollama serve
```
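To confirm Ollama is serving embeddings, its standard API (default port 11434) can be queried directly:

```bash
curl http://localhost:11434/api/embeddings \
  -d '{"model": "nomic-embed-text", "prompt": "hello world"}'
# Returns {"embedding": [ ... ]}
```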
---

## Services Architecture

```
┌─────────────────────────────────────────────────────────────┐
│ RagForge Core │
│ ┌─────────────┐ ┌─────────────┐ ┌─────────────────────┐ │
│ │ MCP Server │ │ CLI │ │ Ingestion Engine │ │
│ └──────┬──────┘ └──────┬──────┘ └──────────┬──────────┘ │
│ │ │ │ │
│ └────────────────┼─────────────────────┘ │
│ │ │
└──────────────────────────┼──────────────────────────────────┘
│
┌────────────────┼────────────────┐
│ │ │
▼ ▼ ▼
┌──────────┐ ┌──────────┐ ┌──────────┐
│ Neo4j │ │ GLiNER │ │ TEI │
│ :7687 │ │ :6971 │ │ :8081 │
│ │ │ (GPU) │ │ (GPU) │
└──────────┘ └──────────┘ └──────────┘
```

---

## GPU Memory Usage
Approximate VRAM requirements:
| Service | VRAM Usage | Notes |
|---------|------------|-------|
| GLiNER | ~2-3 GB | Can load/unload dynamically |
| TEI (bge-base) | ~1 GB | Persistent |
| TEI (bge-small) | ~0.5 GB | Recommended for 8GB GPUs |
| Ollama (nomic) | ~0.7 GB | Alternative to TEI |
Tip: GLiNER automatically unloads from GPU after entity extraction to free VRAM for embeddings.
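To watch this in practice (standard NVIDIA tooling, nothing RagForge-specific), keep a live view of VRAM while ingesting:

```bash
# Refreshes GPU memory usage every second
watch -n 1 nvidia-smi
```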
---
## Configuration Reference

### Environment Variables

#### Services (`services/.env`)

| Variable | Default | Description |
|----------|---------|-------------|
| `NEO4J_PASSWORD` | ragforge | Neo4j database password |
| `NEO4J_BOLT_PORT` | 7687 | Neo4j Bolt protocol port |
| `NEO4J_HTTP_PORT` | 7474 | Neo4j Browser port |
| `GLINER_PORT` | 6971 | GLiNER API port |
| `GLINER_MODEL` | urchade/gliner_multi_pii-v1 | GLiNER model |
| `GLINER_CONFIDENCE` | 0.5 | Entity confidence threshold |
| `TEI_PORT` | 8081 | TEI API port |
| `TEI_MODEL` | BAAI/bge-base-en-v1.5 | Embedding model |

#### API Keys (`~/.ragforge/.env`)

| Variable | Description |
|----------|-------------|
| `GEMINI_API_KEY` | Google Gemini API key (embeddings, vision) |
| `REPLICATE_API_TOKEN` | Replicate API token (3D generation) |
---

## Verify Installation

```bash
# Check Neo4j
curl http://localhost:7474
# Expected: Neo4j Browser HTML

# Check GLiNER
curl http://localhost:6971/health
# Expected: {"status":"healthy",...}

# Check TEI
curl http://localhost:8081/health
# Expected: {"status":"ok",...}

# Check GPU in Docker
docker run --rm --gpus all nvidia/cuda:12.1.0-base-ubuntu22.04 nvidia-smi
```

---
## Troubleshooting

### GPU not detected in Docker
```
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]]
```

Solution: install nvidia-container-toolkit (see the installation section above).
### Neo4j won't start
```bash
docker compose logs neo4j
```

Common issues:
- Port 7687 already in use
- Insufficient disk space
### GLiNER is slow on the first call

Normal - the model (~205M parameters) needs to load into GPU memory. Subsequent calls are fast.
### Out of GPU memory

- Use the smaller TEI model: `TEI_MODEL=BAAI/bge-small-en-v1.5`
- GLiNER and TEI share GPU memory dynamically

### Ollama not using the GPU
Verify with:
```bash
ollama ps     # Check GPU usage
nvidia-smi    # Check VRAM allocation
```

---
## Development

```bash
# Clone repository
git clone https://github.com/LuciformResearch/ragforge-core.git
cd ragforge-core

# Install dependencies
npm install

# Build
npm run build

# Run tests
npm run test

# Start services
cd services && docker compose up -d
```

---
## License

© 2025 Luciform Research. All rights reserved except as granted below.
Free to use for:
- Research, education, personal exploration
- Freelance or small-scale projects (gross monthly revenue up to 100,000 EUR)
- Internal tools (if your company revenue is up to 100,000 EUR/month)
Commercial use above this threshold requires a separate agreement.
Contact for commercial licensing: legal@luciformresearch.com
Grace period: 60 days after crossing the revenue threshold
Full text: LICENSE
---
Note: This is a custom "source-available" license, NOT an OSI-approved open source license.
---