Process Web Logs with AWS Data Pipeline, Amazon EMR, and Hive
Published on Jan 25, 2013
In this video, you will learn how to use AWS Data Pipeline and a console template to create a functional pipeline. The pipeline uses an Amazon EMR cluster and a Hive script to read Apache web access logs, select certain columns, and write the reformatted output to an Amazon S3 bucket. Learn more about AWS Data Pipeline at http://aws.amazon.com/datapipeline
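To get a feel for the transformation before watching, the sketch below approximates it locally in Python. It is not the Hive script from the template. It assumes the logs follow the standard Apache common or combined log format, and the names `LOG_PATTERN`, `select_columns`, and `reformat_log`, the choice of columns, and the tab-separated output are all illustrative assumptions.

```python
import re

# Assumed layout (Apache common/combined log format):
# host ident user [time] "request" status size ["referer" "user-agent"]
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?\s*$'
)

# Hypothetical column selection; the template's Hive script may pick others.
DEFAULT_COLUMNS = ("host", "time", "request", "status")


def select_columns(line, columns=DEFAULT_COLUMNS):
    """Return the chosen fields of one log line joined by tabs, or None if it does not parse."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    # Fields missing from the line (e.g. referer in common format) become "-".
    return "\t".join(match.group(name) or "-" for name in columns)


def reformat_log(lines, columns=DEFAULT_COLUMNS):
    """Reformat every parseable line, skipping lines that do not match the pattern."""
    rows = (select_columns(line, columns) for line in lines)
    return [row for row in rows if row is not None]


if __name__ == "__main__":
    sample = [
        '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
        '"GET /apache_pb.gif HTTP/1.0" 200 2326',
        "not a log line",
    ]
    for row in reformat_log(sample):
        print(row)
```

In the pipeline itself, this step runs as a Hive query on the Amazon EMR cluster, with Amazon S3 as both the input and the output location rather than in-memory lists.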