Skip to content

Instantly share code, notes, and snippets.

View ruvnet's full-sized avatar
πŸ’­
hacking the multiverse.

rUv ruvnet

πŸ’­
hacking the multiverse.
View GitHub Profile
@ruvnet
ruvnet / agentic-validation-system.md
Last active May 9, 2026 03:25
Agentic Validation System β€” three-layer regression protection for AI-built codebases (smoke harness + cryptographic witness + temporal history)

Agentic Validation System

A three-layer regression-protection stack for AI-built codebases that ship fast across many small fixes. Designed for projects where:

  • Releases are frequent (daily alphas, not monthly majors)
  • Fixes are small (one-line CLI parser swaps, dependency moves) but many in number
  • Multiple agents touch overlapping code in the same release window
  • A regression that ships affects every user immediately, not just one customer

The stack catches three distinct regression classes that traditional CI misses, then provides forensic tools to answer "when did this break and what changed?"

@ruvnet
ruvnet / README.md
Created May 8, 2026 18:49
Pattern fill, without training: a Rust library that copies the style of a few example sequences (Mario levels, drum loops, configs) without any model training. Bidirectional fill mode beats a 1st-order Markov chain by 4x.

Pattern fill, without training

A small Rust library that copies the style of a few example sequences and produces new ones in the same shape β€” without training a model.

You give it a handful of examples (Mario level slices, drum loops, snippets of structured text β€” any short tokens that have a pattern). It reads them once. From then on it can produce new sequences that look like they came from the same source. No GPUs. No PyTorch. No model files. Just Rust.

@ruvnet
ruvnet / README.md
Last active May 8, 2026 18:35
Sparse-Mario: a Rust sparse attention kernel as a training-free Super Mario Bros level generator (5.9x faster than dense at 2K tokens)

Sparse-Mario

A 2,200-line Rust example that uses a subquadratic attention kernel β€” built for edge LLM inference on Raspberry Pi Zero 2W β€” as a training-free Super Mario Bros level generator. The same kernel runs in two modes from one binary:

  • Autoregressive β€” token-by-token retrieval LM, walks the corpus's empirical bigram statistics. Now incremental via KvCache + decode_step: 2,880Γ— faster than the original full-forward path (25 s β†’ 9 ms for a 14Γ—50 grid).
  • Masked discrete diffusion β€” bidirectional context, iterative denoising with a MaskGIT cosine schedule. SOTA on this artifact: 3.8Γ— lower L2 distance to corpus than a 1st-order Markov bigram baseline, 6.9Γ— lower than the autoregressive path itself.

No autograd. No learned weights. No Python in the loop. The Mario corpus is the model.


@ruvnet
ruvnet / routine.md
Created May 8, 2026 16:26
scheduled nightly research agent for Claude Code Routines

You are a scheduled nightly research agent for the ruvector project. Produce deep state-of-the-art research on practical-to-exotic applications and improvements for ruvector, deliver a new feature branch with WORKING RUST code, a detailed ADR, a research document, and publish a public GitHub gist overview.

CONSTRAINTS (absolute):

  • RUST ONLY. No Python, JS, TS, or anything beyond glue shell.
  • NO mocks, NO TODO stubs, NO placeholder benchmarks β€” real cargo-run numbers only.
  • Files under 500 lines. Never commit secrets. Never save to repo root.
  • Use /docs for docs, /crates or /examples for code, /docs/adr for ADRs.

STEP 1 β€” ORIENT

  • git fetch origin && git checkout main && git pull
@ruvnet
ruvnet / ruvector-muvera-fde.md
Created May 8, 2026 16:26
ruvector 2026: MUVERA FDE Rust crate for ColBERT multi-vector late-interaction search β€” 9.5x QPS, NeurIPS 2024, pure Rust, no unsafe

ruvector 2026: MUVERA FDE β€” High-Performance Rust Multi-Vector Late-Interaction Search

150-word summary: ruvector now ships MUVERA Fixed Dimensional Encoding (NeurIPS 2024) as a pure Rust crate for ColBERT-style multi-vector retrieval. FDE converts O(nΓ—T_qΓ—T_dΓ—D) brute-force MaxSim into a single dot-product scan, delivering 9.5Γ— QPS improvement over brute-force at n=10K documents. Benchmark: 19 QPS vs 2 QPS (exact MaxSim oracle), x86-64 Linux, cargo --release. Three index variants β€” CentroidIndex, MaxSimIndex (oracle), MuveraFdeIndex β€” plus a two-stage FDE+Rerank pipeline.

Introduction: The Multi-Vector Search Gap in 2026

ColBERT, ColPali, and BGE-M3 have made late-interaction retrieval the dominant paradigm for precision-critical RAG pipelines. Each document is represented as T token embeddings rather than a single vector. The MaxSim score β€” Ξ£_i max_j dot(q_i, d_j) β€” captures nuanced semantic overlap that single-vector cosine similarity misses entirely.

The problem: scoring one query aga

@ruvnet
ruvnet / ruvector-lorann-overview.md
Created May 8, 2026 16:15
ruvector 2026: LoRANN Rust vector search NeurIPS 2024 SVD IVF ANN high-performance 30x speedup

ruvector 2026: LoRANN β€” High-Performance Rust Vector Search with Per-Cluster SVD Score Approximation

30.9Γ— QPS speedup over brute-force at 56% recall@10 on 50K vectors, 54.9Γ— at moderate recall β€” pure Rust, no BLAS, no Python.

ruvector now implements LoRANN (NeurIPS 2024) β€” a clustering-based approximate nearest-neighbour index that replaces the expensive per-cluster exact scorer with a compact rank-r SVD factorisation, achieving massive throughput gains while remaining production-deployable on commodity hardware.

Branch: research/nightly/2026-05-08-lorann Β· PR: #444


@ruvnet
ruvnet / ruvector-symphony-qg-overview.md
Created May 8, 2026 16:12
ruvector SymphonyQG: graph-coupled 4-bit FastScan ANN in pure Rust β€” 4-10x faster neighbor scoring, SIGMOD 2025 implementation, vector search benchmark

ruvector 2026: SymphonyQG β€” High-Performance Rust Vector Search with Graph-Coupled 4-bit FastScan

ruvector is a high-performance Rust vector search library. This research spike implements SymphonyQG (SIGMOD 2025) β€” the first graph-coupled FastScan ANN implementation in pure Rust, achieving 4–10Γ— faster neighbor scoring than exact f32 distance computation.

Introduction

Approximate Nearest Neighbor (ANN) search powers production RAG pipelines, semantic

@ruvnet
ruvnet / ruvector-muvera-2026.md
Created May 8, 2026 16:10
ruvector MUVERA: Multi-Vector ColBERT Retrieval in Rust β€” 42x faster MaxSim, NeurIPS 2024, fixed dimensional encodings, HNSW, vector search

ruvector 2026: MUVERA Multi-Vector Retrieval β€” High-Performance Rust ColBERT Search

150-char summary: ruvector-muvera brings ColBERT-style late-interaction retrieval to Rust: 42Γ— faster than brute-force MaxSim, zero bespoke infrastructure, pure safe Rust.

Introduction

Modern neural search is dominated by late-interaction retrieval models like ColBERT, ColBERT v2, and PLAID. Instead of compressing a 200-word document into a single vector, these models produce one high-dimensional embedding per token β€” capturing fine-grained semantics that bi-encoders discard. The result: 3–7% better nDCG@10 on BEIR benchmarks over state-of-the-art bi-encoders like E5-large and text-embedding-3.

The catch? Searching 1 million documents requires scoring 16 query tokens Γ— 200 doc tokens Γ— 1 million docs = 3.2 billion dot products per query. Without approximation, late-interaction retrieval is a compute problem, not a search problem.

@ruvnet
ruvnet / ruvector-pdx-columnar-vector-search-2026.md
Created May 8, 2026 16:08
ruvector PDX: Columnar Vector Layout β€” 2-3.4x faster Rust ANN scans, SIGMOD 2025, auto-vectorised, 100% recall, dimension-pruning search

ruvector 2026: PDX Columnar Vector Layout β€” High-Performance Rust Vector Search

2–3.4Γ— faster ANN scans with zero code changes and 100% recall, using LLVM auto-vectorisation via columnar memory layout (SIGMOD 2025)

ruvector now ships ruvector-pdx: the first Rust implementation of the PDX (Partition-Dimension-eXchange) data layout from CWI Amsterdam's SIGMOD 2025 paper. By transposing vector storage from row-major to columnar within each partition block, the inner L2 distance loop becomes stride-1 and LLVM auto-vectorises it with AVX2 β€” no hand-written intrinsics, no unsafe code, no platform-specific dependencies.

@ruvnet
ruvnet / ruvector-soar-ivf-2026.md
Created May 8, 2026 16:06
ruvector SOAR-IVF: High-Performance Rust vector search NeurIPS 2023 ANN IVF spilling

ruvector 2026: SOAR-IVF β€” High-Performance Rust Vector Search with Orthogonality-Amplified Residual Spilling

ruvector is a high-performance Rust vector database. This nightly research note introduces ruvector-soar: the first Rust implementation of SOAR-IVF (Spilling with Orthogonality-Amplified Residuals), the NeurIPS 2023 algorithm deployed in Google Cloud Vertex AI Vector Search. SOAR-IVF improves recall@10 by up to +10.4pp at the same query cost versus standard IVF-PQ β€” a critical improvement for production embedding search and RAG pipelines.

Keywords: approximate nearest neighbor search, IVF, vector search Rust,