Indexes

All index files generated in the context of the MetaGraph project that can be publicly shared are available for download from the AWS S3 bucket s3://metagraph-data-public. The list of available index files is still being extended, so please check back regularly for updates.

Available indexes

Index Name   Path                                 Size
RefSeq       s3://metagraph-data-public/refseq/   771 GB
SRA-Fungi    s3://metagraph-data-public/fungi/    74 GB
SRA-Human    s3://metagraph-data-public/human/    3.17 TB
SRA-MetaGut  s3://metagraph-data-public/metagut/  1.04 TB
SRA-Metazoa  s3://metagraph-data-public/metazoa/  5.00 TB
SRA-Microbe  s3://metagraph-data-public/microbe/  53 GB
SRA-Plants   s3://metagraph-data-public/plants/   1.72 TB

How to download

To access the MetaGraph indexes via AWS S3, you can use the AWS Command Line Interface (CLI). To install the AWS CLI, follow the step-by-step guide in the official AWS documentation. The installation instructions cover Windows, macOS, and Linux.

Once the command line interface is installed successfully, you can access the data available in the MetaGraph S3 bucket:


# Example of using AWS CLI to list available objects at metagraph-data-public
aws s3 ls s3://metagraph-data-public/ --no-sign-request --region eu-central-2
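
Downloading an index follows the same pattern as the listing command. The sketch below is one possible approach, not an official procedure: it assumes `aws s3 sync` as the transfer command and reuses the `--no-sign-request` and `--region` options shown above. The helper name `metagraph_sync_cmd` and the destination directory `./metagraph-microbe` are hypothetical, chosen only for illustration. The example uses the SRA-Microbe prefix because it is the smallest index in the table (53 GB).

```shell
# Bucket and region are taken from the listing command above.
BUCKET="s3://metagraph-data-public"
REGION="eu-central-2"

# Hypothetical helper: build the sync command for one index prefix
# (for example "microbe") and a local destination directory.
metagraph_sync_cmd() {
    printf 'aws s3 sync %s/%s/ %s --no-sign-request --region %s' \
        "$BUCKET" "$1" "$2" "$REGION"
}

cmd="$(metagraph_sync_cmd microbe ./metagraph-microbe)"

# Print the command with --dryrun appended; when run, --dryrun lists the
# planned transfers without copying any data.
echo "$cmd --dryrun"

# To start the actual download, run the command without --dryrun:
#   eval "$cmd"
```

Running the printed `--dryrun` command first is a cheap way to confirm the prefix and the destination before committing to a transfer of tens of gigabytes or more. Make sure the destination has enough free space for the size listed in the table.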