The browser version you are using is not recommended for this site.Please consider upgrading to the latest version of your browser by clicking one of the following links.
We are sorry, This PDF is available in download format only
Deveraj Das describes Apache MapReduce*, a powerful model for parallel processing large data sets—and the heart of the Apache Hadoop* system. This overview by an expert from the Apache Hadoop open-source community explains MapReduce master-slave architecture, how MapReduce works, running jobs in isolation in multitenant environments, MapReduce limitations, the significance of Apache YARN for overcoming scalability issues, and where the software is headed. Part of the Intel IT Center’s Hadoop* Community Spotlight series. Also listen to the podcast of the interview.
Apache MapReduce overview.
Apache HDFS* overview.
Apache Pig* overview.
Linda Feldt highlights big data research—video
Shows how Hadoop* clusters analyze big data more effectively over Intel® 10Gb Ethernet.