MetaGraph: Ultra Scalable Framework for DNA Search


The MetaGraph framework indexes and analyzes very large collections of biological sequences, producing compressed indexes that can represent several petabases of input data. These indexes can be queried efficiently with any sequence of interest.

Drawing on raw sequencing data available in public archives such as SRA or ENA, the MetaGraph framework makes this treasure trove of information directly accessible to full-text search, helping to discover whether a given sequence has ever been observed before and, if so, in which context.

The API supports both exact mapping and inexact matching (alignment). Mapping results are associated with the annotations available for the matches in the index, providing information such as the sample source or other associated metadata.
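
To make the idea of exact mapping with annotations concrete, here is a minimal toy sketch in Python. It is not MetaGraph's API or its index structure: the function names `build_index` and `query`, the sample labels, and the sequences are all hypothetical, and a plain dictionary stands in for the compressed index. The sketch only illustrates the concept of looking up each k-mer of a query and reporting which annotated samples contain it.

```python
from collections import defaultdict


def build_index(samples, k):
    """Map every k-mer to the set of sample labels whose sequence contains it.

    `samples` is a dict of {label: sequence}. This is a toy stand-in for an
    annotated sequence index, not the real MetaGraph data structure.
    """
    index = defaultdict(set)
    for label, seq in samples.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(label)
    return index


def query(index, seq, k):
    """Return {label: fraction of the query's k-mers found under that label}."""
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    if not kmers:
        return {}  # query shorter than k: nothing to look up
    hits = defaultdict(int)
    for kmer in kmers:
        # .get() avoids inserting missing k-mers into the defaultdict
        for label in index.get(kmer, ()):
            hits[label] += 1
    return {label: count / len(kmers) for label, count in hits.items()}


# Hypothetical example data: two tiny "samples" acting as annotations.
samples = {
    "sample_A": "ACGTACGTGA",
    "sample_B": "TTGACGTAAC",
}
index = build_index(samples, k=4)
result = query(index, "ACGTAC", k=4)
```

Here `result` maps each label to the share of query k-mers matched exactly in that sample, which mirrors the kind of annotated answer described above. Inexact matching (alignment) is deliberately left out of this sketch.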