Interactive Genomic Data Analysis Using Amazon Athena

Published on Jul 10, 2018

Learn more about AWS at https://amzn.to/2ukXIRd London Innovation Series 2017 The session kicks off with an introduction to Big Data in the healthcare realm, along with some Big Data use cases leveraging AWS Big Data platform. This is followed by an overview of Amazon Athena – an interactive query service to analyse data on Amazon S3 – that can query different file types straight from S3 including Parquet files. Pratim then goes on to introduce ADAM – a Spark Genomics Library and analysis platform with specialized file formats. The latter part of the presentation demonstrates preparation and analysis of genomic data in ADAM parquet files using Spark and conducting quality control of genomic sequencing by analysing ‘variations’.   Speaker: Pratim Das, Specialist Solutions Architect, AWS