Hadoop

Posted by admin, June 7, 2012. 4 comments.

Scott Leberknight (www.nearinfinity.com) presents on Hadoop. Hadoop is an open source framework, maintained by the Apache Software Foundation, for creating fault-tolerant distributed applications that process vast amounts of data in parallel across a cluster of commodity servers. Hadoop consists of two primary components: the Hadoop Distributed Filesystem (HDFS) and a MapReduce framework. HDFS is a distributed filesystem that efficiently stores very large files across a cluster in a fault-tolerant manner. MapReduce is a framework that divides data processing into two distinct phases, mapping and reducing, so that a problem can be broken down and run in parallel across many machines, which speeds up data transformation and aggregation. In this talk we’ll look at both HDFS and the MapReduce framework. We’ll also look at one specific Hadoop subproject, Hive, which provides a data warehousing capability on top of Hadoop and lets developers and analysts query their data stored in HDFS using SQL-like queries (HiveQL).
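To make the two phases concrete, here is a minimal word-count sketch in plain Python. It is not Hadoop's API (Hadoop's native MapReduce API is Java) and it is not taken from the talk; it runs in a single process and only illustrates how mapping, grouping by key, and reducing fit together. The function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are made up for this illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse all values collected for one key into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input; on a real cluster each line could be
    # handled by a different mapper working on a different block of a file.
    sample = [
        "Hadoop stores data in HDFS",
        "MapReduce processes data in parallel",
    ]
    print(word_count(sample))
```

Because each call to `map_phase` depends only on its own line, and each call to `reduce_phase` depends only on its own key, both phases can be spread across many machines. That independence is what the framework exploits.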

Tags : Hadoop, Big Data Analytics

4 Responses to “Hadoop”

srinivassr1985 Says:
June 8th, 2012 at 12:05 am
Thx.. Best map Reduce Tutorial I have ever watched..
stholy32 Says:
June 8th, 2012 at 12:47 am
super good vid !!! many thx !!!
mailvkjain Says:
June 8th, 2012 at 1:37 am
Awesome Video loved the presentation and ease with which its presented
Gavrillo Says:
June 8th, 2012 at 1:56 am
Tough crowd. Great presentation.


YottaByte Me
