• Course
  • Vendor

Introduction to Hadoop Administration is an introductory-level, hands-on lab-intensive course geared for the administrator (new to Hadoop) who is charged with maintaining a Hadoop cluster and its related components.

  • Course Start Date: 2021-07-21
  • Time: 10:00:00 - 18:00:00
  • Duration: 3 Day(s)
  • Location: Virtual
  • Delivery Method(s): Virtual Instructor Led
$1,916.00
REGULAR PRICE $2,395.00 Save $479.00
2 discount seats left!
or make an offer

Course Outline

Pre-Requisites

This is an introductory-level course designed to teach experienced systems administrators how to install, maintain, monitor, troubleshoot, optimize, and secure Hadoop. Previous Hadoop experience is not required.

Lessons

Course Overview

Apache Hadoop is an open source framework for creating reliable and distributable compute clusters. Hadoop provides an excellent platform (with other related frameworks) to process large unstructured or semi-structured data sets from multiple sources to dissect, classify, learn from and make suggestions for business analytics, decision support, and other advanced forms of machine intelligence.

Introduction to Hadoop Administration is an introductory-level, hands-on lab-intensive course geared for the administrator (new to Hadoop) who is charged with maintaining a Hadoop cluster and its related components. Attending students will learn how to install, maintain, monitor, troubleshoot, optimize, and secure Hadoop.

Course Objectives

Working within in an engaging, hands-on learning environment, guided by our expert team, attendees will learn to:

  • Understand the benefits of distributed computing
  • Understand the Hadoop architecture (including HDFS and MapReduce)
  • Define administrator participation in Big Data projects
  • Plan, implement, and maintain Hadoop clusters
  • Deploy and maintain additional Big Data tools (Pig, Hive, Flume, etc.)
  • Plan, deploy and maintain HBase on a Hadoop cluster
  • Monitor and maintain hundreds of servers
  • Pinpoint performance bottlenecks and fix them

Course Agenda

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We will work with you to tune this course and level of coverage to target the skills you need most.  

Introduction

  • Hadoop history and concepts
  • Ecosystem
  • Distributions
  • High level architecture
  • Hadoop myths
  • Hadoop challenges (hardware / software)

Planning and installation

  • Selecting software and Hadoop distributions
  • Sizing the cluster and planning for growth
  • Selecting hardware and network
  • Rack topology
  • Installation
  • Multi-tenancy
  • Directory structure and logs
  • Benchmarking

HDFS operations

  • Concepts (horizontal scaling, replication, data locality, rack awareness)
  • Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
  • Health monitoring
  • Command-line and browser-based administration
  • Adding storage and replacing defective drives

MapReduce operations

  • Parallel computing before MapReduce: compare HPC versus Hadoop administration
  • MapReduce cluster loads
  • Nodes and Daemons (JobTracker, TaskTracker)
  • MapReduce UI walk through
  • MapReduce configuration
  • Job config
  • Job schedulers
  • Administrator view of MapReduce best practices
  • Optimizing MapReduce
  • Fool proofing MR: what to tell your programmers
  • YARN: architecture and use

Advanced topics

  • Hardware monitoring
  • System software monitoring
  • Hadoop cluster monitoring
  • Adding and removing servers and upgrading Hadoop
  • Backup, recovery, and business continuity planning
  • Cluster configuration tweaks
  • Hardware maintenance schedule
  • Oozie scheduling for administrators
  • Securing your cluster with Kerberos
  • The future of Hadoop

Course Materials

Student Materials: Each participant will receive a Student Guide with course notes, code samples, software tutorials, step-by-step written lab instructions, diagrams and related reference materials and resource links. Students will also receive the project files (or code, if applicable) and solutions required for the hands-on work

Hands-On Setup Made Simple! Our dedicated tech team will work with you to ensure our ‘easy-access’ cloud-based course environment is accessible, fully-tested and verified as ready to go well in advance of the course start date, ensuring a smooth start to class and effective learning experience for all participants. Please inquire for details and options.

Related Courses

TAKE AFTER

We offer a variety of courses that serve as an excellent follow on to this course, with a few possible options listed below. Our team can work with you to help you select the best next steps based on your role or learning goals.

Cancellation Policy

TBD

Training Location

Virtual Instructor Led Online Training
your home or offce

your city, your province
your country   

About Trivera Technologies LLC

x

Trivera Technologies is a woman-owned IT training education firm that has provides engaging, comprehensive technical training, consulting, mentoring and courseware development and licensing services to hundreds of organizations globally, on an annual basis. Our collaborative, skills-focused, consultative approach to developing and delivering learning helps organizations bring technical teams of all skills-levels up to speed with the latest technologies, tools, skills and best practices surrounding all aspects of application development, from concept through completion, all targeted to their specific needs and goals. 

We offer skills-focused training events onsite, online, or in blended solutions for distributed teams, from small groups to large-scale, worldwide enterprise organizations.  Services include assessment, development and delivery of targeted learning solutions for new-hire cohort programs; skills immersion boot camps and code camps; skills assessment and skills-gap training; enterprise-wide reskilling, upskilling and new-skilling programs; extensive public schedule offerings; mentoring and coaching and much more. 

Areas of specialty include: application development & programming; modern web development and design; CyberSecurity & secure coding; Data Science / AI / Machine Learning / Deep Learning; Python; DevOps; Cloud; Software architecture, design, testing and development; Agile development & Scrum; Networking & Sys Admin; O/S and Tools; project management; business information and data; IT professional skills; ITIL; COMPTIA and much more. 

Training Provider Rating

No Reviews Yet

Course Reviews

No Reviews Yet

More Courses from Trivera Technologies LLC

TRIVERA TECHNOLOGIES LLC
2021-07-08
Virtual
TRIVERA TECHNOLOGIES LLC
2021-10-20
Virtual
TRIVERA TECHNOLOGIES LLC
2021-07-12
Virtual

More Courses in 'Category to be Determined' Category