Petabase-Scale Search for Genomics & Beyond

Scalable DNA, RNA, and Protein search for biological discovery

Making Biology Searchable

MetaGraph indexes and compresses vast collections of DNA, RNA, and protein sequences—representing petabases of data—and lets you query them instantly to turn big data into biological insight.

From Archives to Insight

MetaGraph unlocks the world’s sequencing archives, transforming data from sources like SRA and ENA into a unified, searchable landscape. Identify whether a sequence has been observed before and trace its biological context.

start a search

Powerful and Flexible

MetaGraph supports both exact and inexact sequence searches, pairing precision with flexibility. Each result is annotated with sample and metadata context, and the open-source framework is ready to run on your own data.

explore on GitHub

Database Statistics

Aggregated view of our comprehensive biological database collection

Start Search View Databases View Examples

> Paste your FASTA sequence and search across millions of genomes

Powerful Search Capabilities

Everything you need to perform comprehensive sequence analysis across multiple biological databases.

Multi-Database Search

Search across assembled sequences (RefSeq, UHGG) and raw sequences databases (SRA/ENA/DRA) simultaneously for comprehensive results.

Data Enrichment

Automatic enrichment with geographic, taxonomic, and metadata integrated directly from the MetaGraph service.

Interactive Visualization

Rich data visualization with interactive maps, charts, and AI summary for detailed analysis.

Real-time Processing

Live job progress updates with Server-Sent Events for immediate feedback on search status.

Robust Validation

Comprehensive FASTA sequence validation with detailed error reporting and format checking.

Global Coverage

Access to worldwide genomic data with geographic mapping and continental distribution analysis.

MetaGraph under the hood

Understand the details of the MetaGraph service.