How-to Install Apache Hadoop on Mac OS X 10.11 El Capitan Step-by-Step Guide

Hadoop 2.X QuickStart on Mac OS X 10.11 El Capitan




The Mac Tutorial Shows You Step-by-Step How-to Install and Getting-Started with Apache Hadoop/Map-Reduce vanilla in Pseudo-Distributed mode on Mac OS X 10.11 El Capitan 32/64bit Desktop.

Hadoop is a distributed master-slave that consists of the Hadoop Distributed File System (HDFS) for storage and Map-Reduce for computational capabilities.

The Guide Describe a System-Wide Installation with Root Privileges but You Can Easily Convert the Procedure to a Local One.

Apache Hadoop Require the Java JDK 7+ Installed so if Needed Follow the guide on How-to Install Oracle JDK on Mac.

Hadoop Getting-Started on Mac OS X 10.11 El Capitan - Featured

Apache Hadoop 2.x Includes the following Modules:

  • Hadoop Common: The common utilities that support the other Hadoop modules.
  • Hadoop Distributed File System (HDFS): A distributed file system that provides high-throughput access to application data.
  • Hadoop YARN: A framework for job scheduling and cluster resource management.
  • Hadoop MapReduce: A YARN-based system for parallel processing of large data sets.
  1. Download Latest Apache Hadoop Stable Release:

    Apache Hadoop Stable tar.gz

  2. Double-Click on Archive to Extract

  3. Open Terminal Window
    (Press “Enter” to Execute Commands)

    Install Hadoop for Mac OS X 10.11 El Capitan - Open Terminal
  4. Relocate Apache Hadoop Directory

  5. Check if Java JDK 7+ is Installed

    How-to Install Required Oracle JDK 7+ on MacOS X:

    Install Oracle JDK 7+ for Mac
  6. Set JAVA_HOME in Hadoop Env File

    If Got “User is Not in Sudoers file” then Look: Solution

    Append:

    Ctrl+x to Save & Exit 🙂

  7. Configuration for Pseudo-Distributed mode

    The Content Should Look Like:

    Next:

    The Content Should Look Like:

    Last:

    The Content Should Look Like:

  8. SetUp Local Path & Environment

    Inserts:

    The JAVA_HOME is Set Following Oracle Java JDK6+ Installation Version…

    Then Load New Setup:

  9. SetUp Needed Local SSH Connection
    Enable SSH Connection:

    Mac El Capitan 10.11 Hadoop Quick-Start - Enabling Remote Login
    To Enable SSH Login without Pass:

    Press enter for each line…

    Testing Connection:

  10. Formatting HDFS

    Install Hadoop for Mac OS X 10.12 Sierra - Terminal Apache Hadoop HDFS Formatting Succcess

  11. Starting Up Hadoop Database

  12. Apache Hadoop Database Quick-Start Guide:

    Hadoop MapReduce Quick-Start
  13. Eclipse Hadoop 2.X Integration with Free Plugin:

    Hadoop 2.X Eclipse Plugin SetUp