About

Annotation Cache is an Open Data on AWS resource providing pre-built Ensembl VEP and SnpEff annotation caches, as well as SvAnna databases, for the nf-core community. Created with Nextflow and Seqera Platform, it eliminates the need for cloud users to download large annotation databases individually by using a distributed cloud file system like Fusion.

Since all caches are hosted on the cloud, an extra layer of organization was added to the data to facilitate access and navigation.

Credits

This project was initiated by Maxime U. Garcia while at Seqera, and maintenance is now continued by him at NGI.

Thanks to all contributors for their extensive assistance in the development and maintenance of this resource.

Contributors

Nextflow Summit 2023

Maxime U. Garcia presented “Annotation cache: using nf-core/modules and Seqera Platform to build an AWS open data resource” at the Nextflow Summit 2023 in Barcelona.

The talk described the journey of building this public resource using nf-core/modules, orchestrating workflows with Nextflow, deploying via Seqera Platform, and publishing the data on AWS Open Data.

Watch: Summit talk on YouTube