RAG Module

A TypeScript implementation of a Retrieval-Augmented Generation (RAG) module that uses node-llama-cpp to generate embeddings and SQLite with the sqlite-vec extension for similarity search.

Features

Text embedding generation with GGUF embedding models loaded through node-llama-cpp
Vector storage and retrieval with SQLite
Efficient similarity search for semantic queries
Simple API for saving and querying documents
Built with TypeScript for type safety

Project Structure

rag-module/
├── src/
│   ├── rag.ts         # Core RAG functionality
│   ├── embed.ts       # Text embedding utilities
│   ├── db.ts          # Database connection and setup
│   ├── test-rag.ts    # Integration tests for RAG
│   ├── test-embed.ts  # Tests for embedding functionality
│   └── test-db.ts     # Database setup tests
├── model/             # Directory for model files (not included in repo)
├── dist/              # Compiled JavaScript output
└── rag.db            # SQLite database file (created at runtime)

Prerequisites

Node.js (v16 or later)
npm or yarn
SQLite3 development files

Installation

Clone the repository:

git clone <repository-url>
cd rag-module

Install dependencies:
```
npm install
```
Download the embedding model:
- Create a model directory in the project root
- Download the nomic-embed-text-v1.5.Q5_K_M.gguf model file into the model directory
- Or modify embed.ts to point to your preferred model
Build the project:
```
npm run build
```

Usage

Basic Example

import { initEmbedder } from './embed.js';
import { loadVectorExtension } from './db.js';
import { Embed, Save, Search } from './rag.js';

async function example() {
  // Initialize the embedding model and database
  await initEmbedder();
  await loadVectorExtension();

  // Generate an embedding
  const vector = await Embed('Hello world');
  console.log('Embedding:', vector.slice(0, 5), '...');

  // Save a few documents to the 'fruits' table
  await Save(1, 'Banana is yellow', 'fruits');
  await Save(2, 'Apple is red and crunchy', 'fruits');
  await Save(3, 'Orange is citrus and orange colored', 'fruits');

  // Search for similar documents
  const results = await Search('yellow fruit', 'fruits', 2);
  console.log('Search results:', results);
}

example().catch(console.error);

API Reference

`Embed(text: string): Promise<number[]>`

Generates an embedding vector for the given text.

`Save(id: number, text: string, tablename: string): Promise<boolean>`

Saves a text document with its embedding to the specified table.

`Search(query: string, tablename: string, limit: number): Promise<number[]>`

Searches for documents similar to the query and returns matching document IDs.
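
Because `Search` returns only document IDs, the caller has to map those IDs back to text. The following is a minimal sketch of one way to do that, assuming the caller keeps its own id-to-text lookup. The names `resolveTexts` and `SearchHit` are hypothetical and are not part of this module.

```typescript
// Hypothetical helper: turns the ID list returned by Search into
// { id, text } pairs using a caller-maintained lookup table.
interface SearchHit {
  id: number;
  text: string;
}

function resolveTexts(
  ids: readonly number[],
  lookup: ReadonlyMap<number, string>
): SearchHit[] {
  const hits: SearchHit[] = [];
  for (const id of ids) {
    const text = lookup.get(id);
    // Skip IDs the caller does not know about instead of throwing.
    if (text !== undefined) {
      hits.push({ id, text });
    }
  }
  return hits;
}

// Usage with the documents from the basic example. The ID list stands in
// for a value returned by Search; it is not real output.
const knownDocs = new Map<number, string>([
  [1, 'Banana is yellow'],
  [2, 'Apple is red and crunchy'],
  [3, 'Orange is citrus and orange colored'],
]);
console.log(resolveTexts([1, 3], knownDocs));
```

The order of the ID list is preserved, so results stay in whatever order `Search` produced them.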

Running Tests

Test the embedding functionality:
```
npm run test-embed
```
Test database operations:
```
npm run test-db
```
Test the complete RAG pipeline:
```
npm run test-rag
```

Implementation Details

Embedding

Uses node-llama-cpp for generating text embeddings
Supports any GGUF embedding model that node-llama-cpp can load (default: nomic-embed-text-v1.5)
Handles model loading and inference

Database

SQLite with vector extension for efficient similarity search
Stores each document's text alongside its vector embedding
Uses sqlite-vec for vector operations
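
sqlite-vec accepts vectors either as JSON text or as a compact BLOB of packed 32-bit floats. The sketch below shows how a `number[]` embedding could be converted to and from that binary form. Whether `db.ts` uses the binary or the JSON form is an assumption here, and the function names `toFloat32Blob` and `fromFloat32Blob` are hypothetical. Little-endian byte order is also an assumption that matches common platforms.

```typescript
// Pack a vector into 4 bytes per element (little-endian float32).
function toFloat32Blob(vector: readonly number[]): Uint8Array {
  const bytes = new Uint8Array(vector.length * 4);
  const view = new DataView(bytes.buffer);
  vector.forEach((value, i) => view.setFloat32(i * 4, value, true));
  return bytes;
}

// Unpack a float32 BLOB back into a plain number array.
function fromFloat32Blob(blob: Uint8Array): number[] {
  if (blob.byteLength % 4 !== 0) {
    throw new Error(`BLOB length ${blob.byteLength} is not a multiple of 4`);
  }
  const view = new DataView(blob.buffer, blob.byteOffset, blob.byteLength);
  const vector: number[] = [];
  for (let offset = 0; offset < blob.byteLength; offset += 4) {
    vector.push(view.getFloat32(offset, true));
  }
  return vector;
}

// Usage: values that are exactly representable in float32 round-trip unchanged.
const blob = toFloat32Blob([0.5, -1, 2]);
console.log(blob.byteLength, fromFloat32Blob(blob));
```

Values that are not exactly representable in 32 bits lose some precision on the round trip, which is usually acceptable for similarity search.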

Search

Implements cosine similarity for finding similar documents
Returns results ordered by relevance
Supports configurable result limits
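
The ranking described above can be illustrated in memory, without a database. This is a sketch of cosine-similarity ranking under stated assumptions, not the module's actual implementation, which delegates the search to sqlite-vec. The names `cosineSimilarity`, `topK`, and `StoredVector` are hypothetical.

```typescript
interface StoredVector {
  id: number;
  vector: readonly number[];
}

// Cosine similarity: dot(a, b) / (|a| * |b|). Returns 0 for a zero vector.
function cosineSimilarity(a: readonly number[], b: readonly number[]): number {
  if (a.length !== b.length) {
    throw new Error(`Dimension mismatch: ${a.length} vs ${b.length}`);
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Score every stored vector against the query, sort by descending
// similarity, and return the IDs of the best `limit` matches.
function topK(
  query: readonly number[],
  rows: readonly StoredVector[],
  limit: number
): number[] {
  return rows
    .map((row) => ({ id: row.id, score: cosineSimilarity(query, row.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit)
    .map((scored) => scored.id);
}

// Usage with toy 2-dimensional vectors.
const rows: StoredVector[] = [
  { id: 1, vector: [0, 1] },
  { id: 2, vector: [1, 0] },
  { id: 3, vector: [1, 1] },
];
console.log(topK([1, 0], rows, 2)); // prints [ 2, 3 ]
```

A linear scan like this costs one similarity computation per stored document, which is one reason to push the search into the database extension once the collection grows.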

License

ISC

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

Acknowledgements

node-llama-cpp - For efficient LLM inference
SQLite - For embedded database storage
sqlite-vec - For vector similarity search
