Petabase-Scale Search for Genomics & Beyond

Scalable DNA, RNA, and protein search for biological discovery

Making Biology Searchable

MetaGraph indexes and compresses vast collections of DNA, RNA, and protein sequences—representing petabases of data—and lets you query them instantly to turn big data into biological insight.

read more in the paper

From Archives to Insight

MetaGraph unlocks the world’s sequencing archives, transforming data from sources like SRA and ENA into a unified, searchable landscape. Identify whether a sequence has been observed before and trace its biological context.

start a search

Powerful and Flexible

MetaGraph supports both exact and inexact sequence searches, pairing precision with flexibility. Each result is annotated with sample and metadata context, and the open-source framework is ready to run on your own data.

explore on GitHub

Database Statistics

Aggregated view of our comprehensive biological database collection

> Paste your FASTA sequence and search across millions of genomes

Powerful Search Capabilities

Everything you need to perform comprehensive sequence analysis across multiple biological databases.

Multi-Database Search
Search assembled-sequence databases (RefSeq, UHGG) and raw-sequence databases (SRA/ENA/DRA) simultaneously for comprehensive results.
Data Enrichment
Automatic enrichment with geographic, taxonomic, and other sample metadata integrated directly from the MetaGraph service.
Interactive Visualization
Rich data visualization with interactive maps, charts, and an AI-generated summary for detailed analysis.
Real-time Processing
Live job progress updates via Server-Sent Events for immediate feedback on search status.
Robust Validation
Comprehensive FASTA sequence validation with detailed error reporting and format checking.
Global Coverage
Access to worldwide genomic data with geographic mapping and continental distribution analysis.
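
To give a feel for what FASTA validation with line-level error reporting can look like, here is a minimal Python sketch. It is not the service's actual validator: the `validate_fasta` function, the `FastaError` type, the error messages, and the choice of the IUPAC nucleotide alphabet are all assumptions made for illustration (a protein query would need a different alphabet).

```python
from dataclasses import dataclass

# Assumption: nucleotide input using IUPAC codes; not taken from the MetaGraph service.
NUCLEOTIDES = set("ACGTUNRYKMSWBDHV")


@dataclass
class FastaError:
    line: int      # 1-based line number in the submitted text
    message: str   # human-readable description of the problem


def validate_fasta(text: str) -> list[FastaError]:
    """Return every format problem found in `text` (an empty list means valid)."""
    errors: list[FastaError] = []
    seen_header = False
    has_sequence = True   # does the current record have at least one sequence line?
    header_line = 0       # line number of the current record's header

    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue  # blank lines are tolerated in this sketch
        if line.startswith(">"):
            # A new header closes the previous record; flag it if it was empty.
            if seen_header and not has_sequence:
                errors.append(FastaError(header_line, "record has no sequence"))
            if len(line) == 1:
                errors.append(FastaError(number, "empty header"))
            seen_header, has_sequence, header_line = True, False, number
        elif not seen_header:
            errors.append(FastaError(number, "sequence data before first '>' header"))
        else:
            has_sequence = True
            bad = sorted(set(line.upper()) - NUCLEOTIDES)
            if bad:
                errors.append(FastaError(number, f"invalid characters: {''.join(bad)}"))

    if not seen_header and not errors:
        errors.append(FastaError(1, "no FASTA records found"))
    elif seen_header and not has_sequence:
        errors.append(FastaError(header_line, "record has no sequence"))
    return errors
```

Calling `validate_fasta(">seq1\nACGT\n")` returns an empty list, while a sequence line containing a character outside the assumed alphabet yields an error that points at the offending line.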

Ready to Start Your Search?

Upload your FASTA sequences and discover matches across our comprehensive biological databases.