Process Web Logs with AWS Data Pipeline, Amazon EMR, and Hive

Published on Jan 25, 2013

In this video, you will learn how to use AWS Data Pipeline and a console template to create a functional pipeline. The pipeline uses an Amazon EMR cluster and a Hive script to read Apache web access logs, select certain columns, and write the reformatted output to an Amazon S3 bucket. Learn more about AWS Data Pipeline at http://aws.amazon.com/datapipeline