AWS re:Invent 2015 | (DAT311) Large-Scale Genomic Analysis with Amazon Redshift

Published on Oct 10, 2015

Genomic analysis is one of the biggest data problems out there. Now that DNA sequencing has finally become affordable, the bottleneck is shifting from sequencing genomes to deriving meaning from them at large scale. Learn how Human Longevity, Inc., uses Amazon Redshift to analyze thousands of whole genomes every month. Dive into their detailed architecture, including how they ingest terabytes of genomic information each day. Learn how they optimize their schema and rapidly analyze thousands of genomes in a single query using a "select, aggregate, annotate" paradigm. Finally, learn best practices for using Amazon Redshift to accelerate research.
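
The abstract names the "select, aggregate, annotate" paradigm without spelling it out. The sketch below is one plausible reading of it, not Human Longevity's actual schema or query: select variant calls in a genomic region, aggregate them across samples, then annotate each variant by joining a reference table. Everything in it is an assumption made for illustration: the table names (variant_calls, annotations), the column names, the toy rows, and the use of Python's built-in sqlite3 as a local stand-in for Amazon Redshift.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with hypothetical tables and toy rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE variant_calls (
            sample_id TEXT, chrom TEXT, pos INTEGER, ref TEXT, alt TEXT
        );
        CREATE TABLE annotations (
            chrom TEXT, pos INTEGER, ref TEXT, alt TEXT, gene TEXT
        );
        """
    )
    # Made-up variant calls for three made-up samples.
    conn.executemany(
        "INSERT INTO variant_calls VALUES (?, ?, ?, ?, ?)",
        [
            ("S1", "chr1", 100, "A", "G"),
            ("S2", "chr1", 100, "A", "G"),
            ("S3", "chr1", 200, "C", "T"),
            ("S1", "chr2", 50, "G", "A"),
        ],
    )
    # Made-up annotation: only one variant has a gene label.
    conn.execute(
        "INSERT INTO annotations VALUES (?, ?, ?, ?, ?)",
        ("chr1", 100, "A", "G", "GENE_A"),
    )
    return conn


def select_aggregate_annotate(conn, chrom, start, end):
    """Run the three steps as a single query over all samples."""
    query = """
        WITH selected AS (          -- 1. select: restrict to a region
            SELECT sample_id, chrom, pos, ref, alt
            FROM variant_calls
            WHERE chrom = ? AND pos BETWEEN ? AND ?
        ),
        aggregated AS (             -- 2. aggregate: count carriers per variant
            SELECT chrom, pos, ref, alt,
                   COUNT(DISTINCT sample_id) AS n_samples
            FROM selected
            GROUP BY chrom, pos, ref, alt
        )
        SELECT a.chrom, a.pos, a.ref, a.alt, a.n_samples, ann.gene
        FROM aggregated AS a        -- 3. annotate: join reference data
        LEFT JOIN annotations AS ann
          ON ann.chrom = a.chrom AND ann.pos = a.pos
         AND ann.ref = a.ref AND ann.alt = a.alt
        ORDER BY a.pos
    """
    return conn.execute(query, (chrom, start, end)).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    for row in select_aggregate_annotate(conn, "chr1", 1, 300):
        print(row)
```

Aggregating before annotating keeps the join small: the annotation table is matched against one row per distinct variant rather than one row per sample call. Whether the presenters order the steps for that reason is not stated in the abstract.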