Key publications describing the MetaGraph framework, compression algorithms, and alignment methods.
Why it matters: Demonstrates the practical feasibility of economical full‑text search across 67 petabase pairs of public sequence data, making biological archives searchable at scale.
OA: Open Access
Why it matters: First end‑to‑end description of MetaGraph for petabase‑scale annotated de Bruijn‑graph indexing and search.
Why it matters: Introduces column‑hierarchical (Multi‑BRWT) compression, a cornerstone of compact colored/annotated de Bruijn graphs (DBGs).
Why it matters: Exploits graph topology to sparsify labels, yielding large reductions in annotation size while keeping queries fast.
Why it matters: Early compact/dynamic color encoding that informed later MetaGraph annotation designs.
Why it matters: Extends annotations with k‑mer counts and coordinates, enabling lossless quantitative and positional queries in MetaGraph.
Why it matters: Label‑consistent SCA/MLA alignment used as MetaGraph's sensitive mode for experiment discovery.
Why it matters: Long inexact sketch‑based seeding (MG‑Sketch) boosts recall at high divergence; complements MetaGraph's alignment toolkit.