Testing Guide for Big RAG Plugin

This guide provides instructions for testing the Big RAG plugin with various scenarios and dataset sizes.

Prerequisites

LM Studio installed and running
At least one embedding model loaded (e.g., nomic-ai/nomic-embed-text-v1.5-GGUF)
At least one LLM loaded for chat
Node.js and npm installed

Setup for Testing

0. Run Parser Smoke Tests

Before larger end-to-end runs, ensure the core parsers succeed:

This builds the project and executes the HTML/Markdown/Text regression tests located in src/tests/parseDocument.test.ts.

1. Install Dependencies

2. Create Test Data

Create a test directory structure:

Add some test files:
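
As one possible way to do this from Node.js, the sketch below creates a directory and writes a few small sample files into it. The function name `createTestData`, the file names, and the file contents are hypothetical placeholders chosen for illustration; they are not taken from the plugin.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical sample files; names and contents are placeholders.
const SAMPLE_FILES: Record<string, string> = {
  "notes.txt": "Plain text sample about vector databases.\n",
  "guide.md": "# Sample Guide\n\nMarkdown sample about embeddings.\n",
  "page.html":
    "<html><body><h1>Sample Page</h1><p>HTML sample about retrieval.</p></body></html>\n",
};

// Creates rootDir (including missing parents) and writes the sample files.
// Returns the absolute paths of the files that were written.
function createTestData(rootDir: string): string[] {
  fs.mkdirSync(rootDir, { recursive: true });
  return Object.entries(SAMPLE_FILES).map(([name, content]) => {
    const filePath = path.resolve(rootDir, name);
    fs.writeFileSync(filePath, content, "utf8");
    return filePath;
  });
}
```

Calling `createTestData` with a directory of your choice yields one text, one Markdown, and one HTML file, which matches the file types covered by the parser smoke tests.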

3. Create Vector Store Directory

Test Scenarios

Test 1: Basic Functionality

Objective: Verify the plugin can index and retrieve from a small dataset.

Steps:

Success Criteria:

✅ Indexing completes without errors
✅ Retrieval finds relevant content
✅ Response includes citations from test files

Test 2: Large Directory Handling

Objective: Test with a larger dataset (100+ files).

Steps:

Generate test files:
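
A minimal sketch of a generator script, assuming plain text files are sufficient for this scenario. The function name `generateTestFiles`, the topic list, and the file naming scheme are hypothetical and only serve to produce distinguishable content to query against.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical topics, rotated so queries can target a subset of files.
const TOPICS = ["indexing", "retrieval", "embeddings", "chunking", "parsing"];

// Writes `count` text files into rootDir and returns their paths.
function generateTestFiles(rootDir: string, count: number): string[] {
  fs.mkdirSync(rootDir, { recursive: true });
  const paths: string[] = [];
  for (let i = 1; i <= count; i++) {
    const topic = TOPICS[i % TOPICS.length];
    // Zero-pad the index so files sort in creation order.
    const name = "doc-" + ("0000" + i).slice(-4) + ".txt";
    const filler = ("This is filler text about " + topic + ". ").repeat(50);
    const body = "Document " + i + "\nTopic: " + topic + "\n" + filler + "\n";
    const filePath = path.join(rootDir, name);
    fs.writeFileSync(filePath, body, "utf8");
    paths.push(filePath);
  }
  return paths;
}
```

For this test, a count of 100 or more matches the objective above; increasing the repeat factor makes each file larger if you also want to approach the sizes listed under Performance Benchmarks.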

Success Criteria:

✅ All files are processed
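✅ Memory usage stays within the expected range for the dataset size (see Performance Benchmarks)
✅ Indexing completes within the expected time for the dataset size (roughly 1-3 minutes for about 100 files; see Performance Benchmarks)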
✅ Retrieval returns relevant results

Test 3: Multiple File Types

Objective: Verify all supported file types are processed correctly.

Steps:

1. Add different file types to the test directory:
   - Copy a sample PDF
   - Copy a sample EPUB
   - Copy a sample image (if OCR enabled)
2. Clear the vector store and reindex
3. Query for content that should be in each file type

Success Criteria:

✅ PDF files are parsed correctly
✅ EPUB files are parsed correctly
✅ HTML files are parsed correctly
✅ Text files are parsed correctly
✅ Images are processed (if OCR enabled)

Test 4: Incremental Indexing

Objective: Verify that already-indexed files are skipped.

Steps:

1. Index the test directory for the first time
2. Note the indexing time
3. Restart the plugin
4. Send another query (this triggers the reindex check)
5. Note the indexing time again

Expected Result: The second indexing pass should be much faster because already-indexed files are skipped.

Success Criteria:

✅ Already-indexed files are skipped
✅ Only new/modified files are processed
✅ Retrieval still works correctly
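
To make the skip behavior concrete, here is a minimal sketch of one common way to decide which files need reindexing: compare each file's size and modification time with what was recorded at the previous run. This is an assumption for illustration only; the plugin's actual change-detection mechanism is not described in this guide, and the names `FileFingerprint` and `planIncrementalIndex` are hypothetical.

```typescript
// Hypothetical record of what was known about a file at indexing time.
interface FileFingerprint {
  path: string;
  size: number; // bytes
  mtimeMs: number; // last-modified time in milliseconds
}

interface IndexPlan {
  toIndex: string[];
  skipped: string[];
}

// Splits the current files into those that must be (re)indexed and those
// that can be skipped because size and mtime match the previous run.
function planIncrementalIndex(
  current: FileFingerprint[],
  previouslyIndexed: Map<string, FileFingerprint>,
): IndexPlan {
  const plan: IndexPlan = { toIndex: [], skipped: [] };
  for (const file of current) {
    const known = previouslyIndexed.get(file.path);
    const unchanged =
      known !== undefined &&
      known.size === file.size &&
      known.mtimeMs === file.mtimeMs;
    (unchanged ? plan.skipped : plan.toIndex).push(file.path);
  }
  return plan;
}
```

Under this scheme, a second run over an untouched directory produces an empty `toIndex` list, which is why the second pass in this test is expected to be much faster.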

Test 5: Concurrent Processing

Objective: Test different concurrency settings.

Steps:

1. Set maxConcurrentFiles to 1
2. Index 50 files and note the time
3. Clear the vector store
4. Set maxConcurrentFiles to 5
5. Index the same 50 files and note the time

Expected Result: Higher concurrency should index faster, at the cost of higher memory use.

Success Criteria:

✅ Both settings work correctly
✅ Higher concurrency is faster
✅ No race conditions or errors
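
The trade-off in this test comes from how many files are in flight at once. The sketch below shows a generic concurrency limiter of the kind such a setting typically controls. It is an illustration under assumptions: `runWithConcurrency` and its `worker` callback are hypothetical names, and this guide does not specify how the plugin implements maxConcurrentFiles.

```typescript
// Runs `worker` over all items with at most `limit` calls in flight.
// Results are returned in the same order as the input items.
async function runWithConcurrency<T, R>(
  items: T[],
  limit: number,
  worker: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0; // index of the next unclaimed item

  // Each runner claims items one at a time until none are left.
  async function runner(): Promise<void> {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index]);
    }
  }

  const runnerCount = Math.max(1, Math.min(limit, items.length));
  const runners: Promise<void>[] = [];
  for (let i = 0; i < runnerCount; i++) {
    runners.push(runner());
  }
  await Promise.all(runners);
  return results;
}
```

With a limit of 1 the files are processed strictly one after another; with a limit of 5 up to five are processed at the same time, so more parsed content is held in memory at once.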

Test 6: Retrieval Threshold Tuning

Objective: Test different threshold settings.

Steps:

1. Index test documents
2. Set retrievalAffinityThreshold to 0.9 (very strict)
3. Send a query
4. Set retrievalAffinityThreshold to 0.3 (very loose)
5. Send the same query

Expected Result:

High threshold: Fewer, more relevant results
Low threshold: More results, some less relevant

Success Criteria:

✅ Threshold affects number of results
✅ Results are properly filtered
✅ No errors with extreme values
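
The expected effect of the threshold can be illustrated with a small sketch. It assumes the threshold acts as a minimum similarity score between 0 and 1 below which results are dropped; that interpretation, the `ScoredChunk` type, and the `filterByThreshold` function are assumptions for illustration, not the plugin's documented behavior.

```typescript
// Hypothetical shape of a retrieval result with a similarity score in [0, 1].
interface ScoredChunk {
  text: string;
  score: number;
}

// Keeps only chunks scoring at or above the threshold, best match first.
function filterByThreshold(
  chunks: ScoredChunk[],
  threshold: number,
): ScoredChunk[] {
  return chunks
    .filter((chunk) => chunk.score >= threshold)
    .sort((a, b) => b.score - a.score);
}

// Made-up scores, used only to show the effect of the two settings.
const exampleChunks: ScoredChunk[] = [
  { text: "closely related passage", score: 0.93 },
  { text: "somewhat related passage", score: 0.55 },
  { text: "loosely related passage", score: 0.35 },
  { text: "unrelated passage", score: 0.12 },
];

const strictResults = filterByThreshold(exampleChunks, 0.9); // 1 result
const looseResults = filterByThreshold(exampleChunks, 0.3); // 3 results
```

With these sample scores, the strict setting keeps only the closest match, while the loose setting also admits the two weaker matches.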

Test 7: OCR Testing (Optional)

Objective: Verify OCR works for image files.

Steps:

1. Enable OCR in settings
2. Add an image containing text to the test directory
3. Clear the vector store and reindex
4. Query for content that is in the image

Expected Result: Text from the image should be extracted and searchable.

Success Criteria:

✅ Image is processed
✅ Text is extracted correctly
✅ Content is retrievable

Test 8: Error Handling

Objective: Verify graceful handling of errors.

Test Cases:

Success Criteria:

✅ Plugin doesn't crash
✅ Clear error messages
✅ Other files continue to process
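
The third criterion, that other files continue to process, usually comes down to isolating failures per file. The following is a minimal sketch of that pattern, assuming a per-file parse step that may throw. The names `processAllFiles`, `parseFile`, and `FileOutcome` are hypothetical, and the plugin's own error handling may be structured differently.

```typescript
// Hypothetical per-file result: either success or a recorded error message.
interface FileOutcome {
  file: string;
  ok: boolean;
  error?: string;
}

// Processes every file, recording failures instead of aborting the batch.
async function processAllFiles(
  files: string[],
  parseFile: (file: string) => Promise<void>,
): Promise<FileOutcome[]> {
  const outcomes: FileOutcome[] = [];
  for (const file of files) {
    try {
      await parseFile(file);
      outcomes.push({ file, ok: true });
    } catch (err) {
      // Keep a readable message so the failure can be reported clearly.
      const message = err instanceof Error ? err.message : String(err);
      outcomes.push({ file, ok: false, error: message });
    }
  }
  return outcomes;
}
```

When testing, a batch containing one deliberately broken file should yield one failed outcome with a clear message and successful outcomes for all remaining files.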

Performance Benchmarks

Small Dataset (10 files, ~1MB total)

Expected Indexing Time: 10-30 seconds
Expected Query Time: < 1 second
Memory Usage: < 200MB

Medium Dataset (100 files, ~10MB total)

Expected Indexing Time: 1-3 minutes
Expected Query Time: < 2 seconds
Memory Usage: < 500MB

Large Dataset (1000 files, ~100MB total)

Expected Indexing Time: 10-30 minutes
Expected Query Time: < 3 seconds
Memory Usage: < 1GB

Very Large Dataset (10000+ files, 1GB+ total)

Expected Indexing Time: 2-6 hours
Expected Query Time: < 5 seconds
Memory Usage: 1-3GB

Note: Times vary based on hardware, file types, and OCR usage.
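
To compare your own runs against these figures, you can wrap a task in a small timing helper. The sketch below is an assumption-laden illustration: `measure` is a hypothetical helper, and the memory value it reports is the resident set size of the Node.js process running the helper, which is only meaningful if the indexing work happens in that same process.

```typescript
// Hypothetical measurement record for one benchmark run.
interface Measurement {
  label: string;
  elapsedMs: number;
  rssMb: number;
}

// Runs the task once and reports wall-clock time and current process RSS.
async function measure(
  label: string,
  task: () => Promise<unknown>,
): Promise<Measurement> {
  const start = Date.now();
  await task();
  const elapsedMs = Date.now() - start;
  // RSS of this Node.js process, converted from bytes to megabytes.
  const rssMb = process.memoryUsage().rss / (1024 * 1024);
  return { label, elapsedMs, rssMb };
}
```

Recording one measurement per dataset size gives you numbers that can be compared directly with the expected ranges above.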

Debugging

Enable Debug Logging

The plugin uses LM Studio's logging system. To see debug output:

Check LM Studio's developer console
Look for messages prefixed with the plugin name
Use ctl.debug() calls in code for detailed logging

Common Issues

Cleanup

After testing, clean up test data:
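
A minimal sketch of a cleanup script in Node.js, assuming the test documents and the vector store each live in their own directory. The function name `removeTestData` is hypothetical, and you must supply the paths you actually used during setup.

```typescript
import * as fs from "node:fs";

// Recursively deletes each directory in the list.
function removeTestData(dirs: string[]): void {
  for (const dir of dirs) {
    // force: true makes this a no-op when the directory does not exist.
    fs.rmSync(dir, { recursive: true, force: true });
  }
}
```

Because the deletion is recursive and unconditional, double-check the paths before running it, especially if the vector store directory is shared with real data.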

Automated Testing (Future)

For automated testing, consider:

Unit tests for parsers
Integration tests for indexing pipeline
Performance benchmarks
Regression tests for bug fixes

Example test structure:
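
One possible layout is sketched below, assuming Node's built-in `node:test` runner and `node:assert`; the project may use a different test runner. The function `parseTextStub` is a hypothetical stand-in included only so the sketch runs on its own. A real test would import the plugin's actual parser instead.

```typescript
import { describe, it } from "node:test";
import { strict as assert } from "node:assert";

// Hypothetical stand-in for a text parser: normalizes line endings and trims.
function parseTextStub(raw: string): string {
  return raw.replace(/\r\n/g, "\n").trim();
}

describe("parsers (unit)", () => {
  it("normalizes line endings in plain text", () => {
    const parsed = parseTextStub("line one\r\nline two\r\n");
    assert.equal(parsed, "line one\nline two");
  });

  it("returns an empty string for whitespace-only input", () => {
    assert.equal(parseTextStub("  \n\t "), "");
  });
});
```

Integration, performance, and regression tests can follow the same pattern, with one `describe` block per area listed above.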

Reporting Issues

When reporting issues, include:

Plugin version
LM Studio version
Operating system
Dataset size and composition
Configuration settings
Error messages and logs
Steps to reproduce

Performance Tuning Checklist

Tested with small dataset
Tested with large dataset
Optimized chunk size for use case
Tuned retrieval threshold
