Benchmark Report

WaveflowDB Benchmarking: Enterprise Retrieval Reinvented

Redefining accuracy, scalability, and cost-efficiency with full-corpus reranking.

Executive Summary

WaveflowDB delivers an enterprise-grade retrieval engine that eliminates traditional top-k constraints by applying full-corpus reranking across all documents.

MS MARCO Benchmark Results

MRR @ 161,536 documents: 0.55
Top-2 average result position
End-to-end API suite for retrieval, reranking, embeddings, and monitoring
Predictable, linear cost scaling versus the multi-vendor, multi-pipeline RAG stack

This positions WaveflowDB as a single-source, unified retrieval layer built for modern, agentic enterprise workflows.

Why Enterprises Choose WaveflowDB

Full-Corpus Reranking

Ensures relevant results surface—even deep in large corpora.

Unified Retrieval Stack

Storage, embeddings, vector search, reranking, and monitoring in one API suite.

Linear, Predictable Cost Model

No hidden inference or reranking overhead.

Enterprise-Grade Scalability

Maintains precision even at 150k+ document scale.

Waveflow — Suite of Platform Capabilities

Waveflow-UI

No-code agent builder UI

Waveflow-DB

Serverless vector database

Waveflow-Rerank

Enterprise-class reranker

Waveflow-Framework

Multi-agent workflows

Waveflow-Observability

Full-stack monitoring

MS MARCO Benchmark Overview

Dataset

MS MARCO Passage Ranking (Dev)

Corpus Size

161,536 documents

Queries Evaluated

19,697

MRR Result

0.55

Result: MRR = 0.55 → Relevant answer appears in top 2 positions on average

Why Full-Corpus Reranking Matters

Traditional retrieval stacks rely on top-k filtering, assuming relevant answers lie in the first 100 documents.

This is rarely true in enterprise data.

WaveflowDB performs global scoring across the full corpus, eliminating retrieval bias and significantly enhancing precision.

Impact: No more "missed documents" due to aggressive filtering.

MRR Comparison

Method	MRR@100	Notes
BM25	0.18	Lexical limits
DPR Dense	0.38	Bi-encoder
ColBERT	0.349	Late interaction
MonoT5/DuoT5	0.39	Cross-encoder
WaveflowDB	0.55 @161,536 docs	Full-corpus

Architecture & API Suite

WaveflowDB APIs

waveflow-doc-search

waveflow-upload-batch

waveflow-update-batch

waveflow-reranking

waveflow-meta

waveflow-conversational-AI

End-to-end simplicity—no multi-service orchestration required.

TCO & Cost Advantages

Traditional RAG Stack

Pinecone + Cohere + LLM + OpenSearch

✗High infra costs, reranking fees
✗Complex orchestration
✗Unpredictable scaling

WaveflowDB

Flat, predictable pricing
No per-query rerank fees
Single unified stack
Lower operational load

Business Leader Viewpoint

For CEOs

Reduced TCO
Faster strategic insights
Higher accuracy → better customer experiences

For CTOs

Scalable, storage-native architecture
Model-agnostic embeddings
Unified monitoring and metering

Key Takeaways

Retrieval precision: Top-2 accuracy across full corpus
Unified RAG stack: Storage, embeddings, reranking, search, observability
Scalable architecture with linear pricing
Purpose-built for enterprise and agentic workflows