# @echoes-io/rag

Semantic search and context retrieval system for the Echoes.io storytelling platform.
The RAG (Retrieval-Augmented Generation) system provides semantic search capabilities across all Echoes timeline content, enabling intelligent context retrieval for AI-powered storytelling assistance.
## Features

- Vector Embeddings: Generate embeddings for chapters using Gemini or local models (Qwen3, Nomic, BGE, E5)
- Semantic Search: Find similar content across timelines and characters
- Character Extraction: Automatic NER (Named Entity Recognition) to extract character names
- Context Retrieval: Intelligent context selection for AI interactions
- Timeline Awareness: Search within specific timelines or across all content
- Character Filtering: Find chapters by characters present
- LanceDB Storage: Lightweight, directory-based vector database with cosine-similarity (ANN) search
## Architecture

```
RAG System (LlamaIndexTS + LanceDB)
├── LlamaIndexTS - Document indexing and retrieval framework
├── LanceDB - High-performance vector database with ANN search
├── NER Extractor - Extract character names from content
└── Search API - Query interface with semantic + metadata filtering
```
## Installation

```bash
npm install @echoes-io/rag
```
## Usage

### Initialization

```typescript
import { RAGSystem } from '@echoes-io/rag';

const rag = new RAGSystem({
  provider: 'qwen3', // or 'nomic-embed', 'bge-base', 'e5-large', 'e5-small', 'gemini'
  geminiApiKey: process.env.GEMINI_API_KEY, // Only required for 'gemini'
  dbPath: './lancedb'
});
```
### Adding Chapters

```typescript
await rag.addChapter({
  id: 'anima-discovery-03-05',
  metadata: {
    pov: 'nic',
    title: 'Sotto le stelle',
    timeline: 'anima',
    arc: 'discovery',
    // ... other metadata
  },
  content: 'Nic guardò Alex negli occhi. Sara sorrise...'
});

// Characters are automatically extracted: ['Nic', 'Alex', 'Sara']
```
### Searching

```typescript
// Search across all timelines
const results = await rag.search('passionate encounter in London');

// Search within a specific timeline
const erosResults = await rag.search('work dynamics', {
  timeline: 'eros'
});

// Search by POV
const nicChapters = await rag.search('internal conflict', {
  pov: 'nic'
});
```
### Character Filtering

```typescript
// Find chapters where specific characters appear
const bothResults = await rag.search('romantic moment', {
  characters: ['Nic', 'Alex'],
  allCharacters: true // Both must be present (AND)
});

// Find chapters with any of the characters (OR)
const anyResults = await rag.search('conflict', {
  characters: ['Nic', 'Alex'],
  allCharacters: false // At least one present (default)
});

// Get all characters that interact with a specific character
const characters = await rag.getCharacterMentions('Nic');
// → ['Alex', 'Sara', 'Marco', ...]
```
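The AND/OR semantics above can be sketched as a plain metadata predicate. This is an illustrative simplification, not the library's internal implementation; `matchesCharacters` and the sample data are hypothetical:

```typescript
// Hypothetical sketch of the character-filter semantics
// (not the actual @echoes-io/rag internals).
function matchesCharacters(
  chapterCharacters: string[],
  wanted: string[],
  allCharacters = false, // false = OR (default), true = AND
): boolean {
  return allCharacters
    ? wanted.every((c) => chapterCharacters.includes(c))
    : wanted.some((c) => chapterCharacters.includes(c));
}

// AND: both must be present
matchesCharacters(['Nic', 'Alex', 'Sara'], ['Nic', 'Alex'], true); // → true
// OR: at least one must be present
matchesCharacters(['Sara', 'Marco'], ['Nic', 'Alex'], false); // → false
```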
### Context Retrieval

```typescript
// Get relevant context for AI interaction
const context = await rag.getContext({
  query: 'relationship development',
  timeline: 'anima',
  maxChapters: 5,
  characters: ['Nic', 'Alex']
});
```
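Conceptually, context retrieval amounts to ranking chapters by similarity and concatenating the top hits into a prompt block. A minimal sketch of that assembly step, using a trimmed-down `SearchResult` shape; `buildContext` and the sample data are hypothetical, not part of the package:

```typescript
// Trimmed-down result shape for illustration.
interface RankedChapter {
  id: string;
  content: string;
  similarity: number; // 0-1 cosine similarity
}

// Hypothetical helper: keep the top-N chapters by similarity and join them
// into a single context string for an AI prompt.
function buildContext(results: RankedChapter[], maxChapters: number): string {
  return [...results]
    .sort((a, b) => b.similarity - a.similarity)
    .slice(0, maxChapters)
    .map((r) => `[${r.id}]\n${r.content}`)
    .join('\n\n');
}

const hits: RankedChapter[] = [
  { id: 'anima-01', content: 'First meeting.', similarity: 0.71 },
  { id: 'anima-07', content: 'The turning point.', similarity: 0.89 },
];
buildContext(hits, 1); // → '[anima-07]\nThe turning point.'
```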
### Example Queries

```typescript
// "In which chapter do X and Y kiss?"
await rag.search("kiss", {
  characters: ['Nic', 'Alex'],
  allCharacters: true
});

// "What's the relationship between X and Y?"
await rag.search("relationship Nic Alex", {
  characters: ['Nic', 'Alex']
});

// "In which chapter does XYZ happen?"
await rag.search("xyz event description");

// "Who does character X interact with?"
await rag.getCharacterMentions('Nic');
```
## Configuration

```typescript
const config = {
  provider: 'qwen3', // 'qwen3', 'nomic-embed', 'bge-base', 'e5-large', 'e5-small', or 'gemini'
  geminiApiKey: process.env.GEMINI_API_KEY, // Required for 'gemini' provider
  dbPath: './lancedb', // LanceDB directory
  maxResults: 10, // Default max results
  storeFullContent: true // Store full chapter text alongside embeddings (default: true)
};
```
### Embedding Providers

- qwen3 - Qwen3-Embedding-0.6B-ONNX (1024 dimensions, SOTA multilingual, #1 on MTEB)
- nomic-embed - Nomic Embed v1 (768 dimensions, excellent accuracy, 86.2% benchmark)
- bge-base - BGE-Base-v1.5 (768 dimensions, balanced performance, 84.7% accuracy)
- e5-large - E5 multilingual large (1024 dimensions, robust multilingual)
- e5-small - E5 multilingual small (384 dimensions, fast, offline, good baseline)
- gemini - Google's gemini-embedding-001 (768 dimensions, API required)
All local embeddings run via HuggingFace Transformers.js and don't require API keys.
### Character Extraction (NER)

The system automatically extracts character names from chapter content using Named Entity Recognition:
- Model: Xenova/bert-base-multilingual-cased-ner-hrl
- Language Support: Multilingual (including Italian)
- Performance: ~100-200ms per chapter (cached after first extraction)
- Automatic: No configuration needed, always enabled
Characters are stored in metadata.characterNames and can be used for filtering.
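For illustration, collapsing raw per-token NER output into a deduplicated `characterNames` list might look like the sketch below. The token shape loosely follows HuggingFace-style NER results; the helper and sample tokens are hypothetical, not the package's actual extractor:

```typescript
// Hypothetical sketch: collapse per-token NER output into unique person names.
interface NerToken {
  entity: string; // e.g. 'B-PER', 'I-PER', 'B-LOC', 'O', ...
  word: string;
}

function extractCharacterNames(tokens: NerToken[]): string[] {
  const names = new Set<string>();
  let current = '';
  for (const t of tokens) {
    if (t.entity === 'B-PER') {
      if (current) names.add(current); // flush previous name
      current = t.word;
    } else if (t.entity === 'I-PER' && current) {
      // Sub-word continuations (e.g. '##ia') glue onto the current name.
      current += t.word.startsWith('##') ? t.word.slice(2) : ` ${t.word}`;
    } else {
      if (current) names.add(current);
      current = '';
    }
  }
  if (current) names.add(current);
  return [...names];
}

const tokens: NerToken[] = [
  { entity: 'B-PER', word: 'Nic' },
  { entity: 'O', word: 'guardò' },
  { entity: 'B-PER', word: 'Alex' },
];
extractCharacterNames(tokens); // → ['Nic', 'Alex']
```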
## API Reference

### RAGSystem

Main class for semantic search operations.

#### Constructor

```typescript
new RAGSystem(config: RAGConfig)
```
#### Methods
##### search(query, options?)
Semantic search across content.
```typescript
search(
  query: string,
  options?: {
    timeline?: string,
    arc?: string,
    pov?: string,
    maxResults?: number,
    characters?: string[], // Filter by characters present
    allCharacters?: boolean // true = AND, false = OR
  }
): Promise<SearchResult[]>
```
##### getContext(options)
Retrieve context for AI interactions.
```typescript
getContext(options: {
  query: string,
  timeline?: string,
  arc?: string,
  pov?: string,
  maxChapters?: number,
  characters?: string[]
}): Promise
```
##### addChapter(chapter)
Add single chapter to vector database (with automatic character extraction).
```typescript
addChapter(chapter: EmbeddingChapter): Promise<void>
```
##### addChapters(chapters)
Batch add chapters.
```typescript
addChapters(chapters: EmbeddingChapter[]): Promise<void>
```
##### deleteChapter(id)
Remove chapter from database.
```typescript
deleteChapter(id: string): Promise<void>
```
##### getCharacterMentions(characterName)
Get all characters that appear in chapters with the specified character.
```typescript
getCharacterMentions(characterName: string): Promise<string[]>
```
## Types

```typescript
interface SearchOptions {
  timeline?: string;
  arc?: string;
  pov?: string;
  maxResults?: number;
  characters?: string[];
  allCharacters?: boolean;
}

interface ContextOptions {
  query: string;
  timeline?: string;
  arc?: string;
  pov?: string;
  maxChapters?: number;
  characters?: string[];
}

interface SearchResult {
  id: string;
  metadata: ChapterWithCharacters;
  content: string;
  similarity: number; // 0-1 cosine similarity
}

interface ChapterWithCharacters extends Chapter {
  characterNames?: string[]; // Extracted by NER
}

interface RAGConfig {
  provider: 'qwen3' | 'nomic-embed' | 'bge-base' | 'e5-large' | 'e5-small' | 'gemini';
  geminiApiKey?: string;
  openaiApiKey?: string;
  dbPath?: string;
  maxResults?: number;
  storeFullContent?: boolean; // Store full chapter text (default: true)
}
```
## Storage

The system uses LanceDB for vector storage with optimized ANN (Approximate Nearest Neighbor) search. This approach is:
- High Performance: Native ANN indices (IVF, HNSW) for fast similarity search
- Scalable: Handles thousands to millions of vectors efficiently
- File-based: Directory-based storage, portable and easy to backup
- Zero-copy: Efficient memory usage with columnar storage
- Type-safe: Full TypeScript support via LlamaIndexTS
The LanceDB directory contains embeddings, metadata (including extracted characters), and content for all indexed chapters.
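The `similarity` score in search results is the cosine similarity between query and chapter embeddings. A minimal sketch of the metric, for illustration only (LanceDB computes this natively over its ANN index):

```typescript
// Cosine similarity between two embedding vectors (illustrative sketch;
// LanceDB computes this natively during ANN search).
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

cosineSimilarity([1, 0], [1, 0]); // → 1 (identical direction)
cosineSimilarity([1, 0], [0, 1]); // → 0 (orthogonal)
```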
- Test databases: Ignored in git (patterns: test-, -test*)
- Production databases: Should be committed in timeline repositories (e.g., lancedb/, rag-db/)
- Backup: Simply copy the LanceDB directory
## Development

```bash
# Install dependencies
npm install
```
## Integration

### MCP Server

The RAG system integrates with @echoes-io/mcp-server to provide context-aware AI tools:

- semantic-search - Find similar content
- get-context - Retrieve relevant chapters for AI prompts
- find-echoes - Discover narrative connections across timelines

### Timeline Repositories

Automatically processes content from timeline repositories:

- timeline-anima/content/
- timeline-eros/content/
- timeline-bloom/content/

## Model Comparison

| Model | Accuracy | Speed | Dimensions | Best For |
|-------|----------|-------|------------|----------|
| qwen3 | 70.58 MTEB | Medium | 1024 | Best overall, multilingual |
| nomic-embed | 86.2% | Slow | 768 | High accuracy tasks |
| bge-base | 84.7% | Medium | 768 | Balanced performance |
| e5-large | 83.5% | Fast | 1024 | Good multilingual baseline |
| e5-small | 78.1% | Very Fast | 384 | Speed-critical applications |
## Performance

- Indexing: ~100-200ms per chapter (including NER)
- Search: <50ms for typical queries
- Character Extraction: Cached after first run
- Database: Suitable for 100-1000+ chapters
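At ~100-200ms per chapter, large imports are best done in batches so progress can be reported between calls to `addChapters`. A simple chunking sketch; the `chunk` helper and batch size are hypothetical:

```typescript
// Hypothetical helper: split a large chapter list into fixed-size batches.
function chunk<T>(items: T[], size: number): T[][] {
  const out: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    out.push(items.slice(i, i + size));
  }
  return out;
}

chunk([1, 2, 3, 4, 5], 2); // → [[1, 2], [3, 4], [5]]

// Usage sketch (assumes a configured RAGSystem instance):
// for (const batch of chunk(allChapters, 50)) {
//   await rag.addChapters(batch);
// }
```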
## License

MIT
---
Echoes - Multi-POV storytelling platform ✨