Scalable DNA, RNA, and protein search for biological discovery
MetaGraph indexes and compresses vast collections of DNA, RNA, and protein sequences—representing petabases of data—and lets you query them instantly to turn big data into biological insight.
read more in the paper
MetaGraph unlocks the world’s sequencing archives, transforming data from sources like SRA and ENA into a unified, searchable landscape. Identify whether a sequence has been observed before and trace its biological context.
start a search
MetaGraph supports both exact and inexact sequence searches, pairing precision with flexibility. Each result is annotated with sample and metadata context, and the open-source framework is ready to run on your own data.
explore on GitHub
Aggregated view of our comprehensive biological database collection
> Paste your FASTA sequence and search across millions of genomes
Everything you need to perform comprehensive sequence analysis across multiple biological databases.
Understand the details of the MetaGraph service.
Upload your FASTA sequences and discover matches across our comprehensive biological databases.