Course code: OSHA

Duration: 4 days

Location: Virtual, Instructor-Led Training
Category: EMC

Course Overview

This open source course provides participants with a comprehensive understanding of the steps necessary to install, configure, operate, and maintain Hadoop. The course begins with an overview of the Big Data landscape, and then dives into running Hadoop from a system administrator's working perspective.

Upon successful completion of this course, participants should be able to:

  • Describe the fundamental concepts of using Big Data
  • Identify where Hadoop fits into a Big Data strategy
  • Plan a Hadoop cluster
  • Describe HDFS features
  • Get data into HDFS
  • Work with MapReduce
  • Install and configure Hadoop
  • Perform cluster maintenance

The content of this course is designed to support the course objectives.

Hadoop Introduction

  • A Brief History of Hadoop
  • Core Hadoop Components
  • Fundamental Concepts

Planning Your Hadoop Cluster

  • General Planning Considerations
  • Choosing Hardware
  • Network Considerations
  • Configuring Nodes
  • Planning for Cluster Management

HDFS

  • HDFS Features
  • Writing and Reading Files
  • NameNode Considerations
  • HDFS Security
  • NameNode Web UI
  • Hadoop File Shell

Getting Data into HDFS

  • Pulling Data from External Sources with Flume
  • Importing Data from Relational Databases with Sqoop
  • REST Interfaces
  • Best Practices

MapReduce

  • MapReduce Overview
  • Features of MapReduce
  • Architectural Overview
  • YARN MapReduce Version 2
  • Failure Recovery
  • The JobTracker Web UI

Hadoop Installation & Initial Configuration

  • Configuration & Deployment Types
  • Installing Hadoop
  • Specifying the Hadoop Configuration
  • Initial HDFS & MapReduce Configuration
  • Log Files

Installing/Configuring Hive, Impala, and Pig

  • Hive
  • Impala
  • Pig

Hadoop Clients

  • What Is a Hadoop Client?
  • Installing and Configuring Hadoop Clients
  • Installing and Configuring Hue
  • Hue Authentication and Configuration

Advanced Cluster Configuration

  • Advanced Configuration Parameters
  • Configuring Hadoop Ports
  • Explicitly Including and Excluding Hosts
  • Configuring HDFS for Rack Awareness & HDFS High Availability

Hadoop Security

  • Why Hadoop Security Is Important
  • Hadoop's Security System Concepts
  • What Kerberos Is and How It Works
  • Securing a Hadoop Cluster with Kerberos

Managing and Scheduling Jobs

  • Managing Running Jobs
  • Scheduling Hadoop Jobs
  • Configuring the FairScheduler

Cluster Maintenance

  • Checking HDFS Status
  • Copying Data Between Clusters
  • Adding/Removing Cluster Nodes
  • Rebalancing the Cluster
  • NameNode Metadata Backup
  • Cluster Upgrades

Cluster Monitoring and Troubleshooting

  • General System Monitoring
  • Managing Hadoop's Log Files
  • Monitoring the Clusters
  • Common Troubleshooting Issues

Not available. Please contact us.

This course is intended for system administrators, DevOps engineers, and software developers responsible for managing and maintaining Hadoop clusters.

Contact us: Kurs@sgpartner.no
