
Learn the basic principles, concepts, and techniques/tools used for big data and business analytics, including data mining, Hadoop, HDFS & MapReduce, Apache HBase and Apache Hive. GK# 9099

  • Course Start Date: 2019-01-14
  • Time: 19:30:00 - 16:30:00
  • Duration: 5 days 07:30 PM - 04:30 PM
  • Location: Virtual
  • Delivery Method(s): Virtual Instructor Led
$4,909.50
REGULAR PRICE $5,455.00 Save $545.50
2 discount seats left!

Course Outline

Pre-Requisites

Participants should preferably have at least 2 years of software development experience in a Java/Unix/Linux environment and a good understanding of data and business analytics.

Lessons

Big Data Analytics delivers competitive advantage in two ways compared to the traditional analytical model. First, Big Data Analytics describes the efficient use of a simple model applied to volumes of data that would be too large for the traditional analytical environment. Second, Big Data Analytics refers to the sophistication of the model itself. Increasingly, analysis algorithms are provided directly by database management system (DBMS) vendors. To pull away from the pack, companies must go well beyond what is provided and innovate by using newer, more sophisticated statistical analysis.

This specialized course covers business analytics and big data technologies and their strategic importance to any organization. Participants will be introduced to business analytics with the big data technologies Hadoop, Hive and HBase. The course deals with the basic principles, concepts, and techniques/tools used for big data and business analytics, including data mining, Hadoop, HDFS & MapReduce, Apache HBase and Apache Hive. It also covers different types of business analytics with real-life use cases, including association rule mining and regression models. Participants will get a good picture of all these concepts and how they are interconnected in an organizational context.

WHAT YOU'LL LEARN
  • Understand business analytics and big data technologies and their impact on enterprises
  • Learn data mining concepts and techniques through an open source data mining tool
  • Understand the role of big data technologies (Hadoop, HBase, Hive) in business analytics
  • Acquire the knowledge and learn to use Hadoop (HDFS and MapReduce), HBase and Hive

OUTLINE
Unit 1: Introduction to Business Analytics
  • The concept of Business Analytics
  • Data, Information, Knowledge and Wisdom
  • Data as Unique Enterprise Asset
  • Data, Information and Analytics Lifecycle
  • Business Analytics – Current Context
  • Types of Analytics 
    • Descriptive Analytics
    • Predictive Analytics
    • Prescriptive Analytics

Unit 2: Data/Information Architecture for Business Analytics
  • Data/Information Architecture
  • Concept of Data Warehouse/Enterprise Data Warehouse (EDW)
  • ETL – Key Process
  • Concept of Data Mart
  • Business Intelligence
  • Data Mining

Unit 3: Data Mining Tool
  • Understand the open source data mining tool RapidMiner
  • Explore the various features of RapidMiner
  • Walk through a RapidMiner demo with different scenarios

Unit 4: Data Mining Techniques
  • Understand the various data mining techniques
  • Understand how a correlation matrix works
  • Understand how association rule mining works
  • Understand the predictive analytics technique
  • Understand the forecasting technique
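
As a rough illustration of the association rule mining topic in this unit, the sketch below computes the two standard measures of a rule, support and confidence, over a handful of made-up shopping baskets. It is plain Python rather than RapidMiner, it is not taken from the course materials, and the function names and sample data are assumptions chosen for this example.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    if not transactions:
        return 0.0
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate how often the consequent appears when the antecedent does.

    Computed as support(antecedent and consequent) / support(antecedent).
    Returns 0.0 when the antecedent never occurs.
    """
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0
    both = support(transactions, set(antecedent) | set(consequent))
    return both / base


# Hypothetical baskets, invented for illustration only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

print(support(baskets, {"bread", "milk"}))       # share of baskets with both items
print(confidence(baskets, {"bread"}, {"milk"}))  # rule: bread -> milk
```

A rule is usually kept only when both measures exceed thresholds chosen by the analyst; those thresholds are a modelling decision and are not specified by this outline.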

Unit 5: Introduction to Big Data
  • What is Big Data? Why Big Data?
  • 3V's of Big Data
  • The Rapid Growth of Unstructured Data
  • Big Data Market Forecast
  • Big Data Analytics
  • Big Data in Business
  • Big Data Types & Architecture

Unit 6: Introduction to Hadoop
  • Big Data – Current Industry Trends
  • Why Process Big Data?
  • Challenges in Data Processing
  • Why Hadoop?
  • What is Hadoop offering?
  • Hadoop Network Structure
  • Hadoop Eco-System
  • Hadoop Core Components
  • Hadoop – Features
  • Hadoop – Relevance
  • Hadoop in Action

Unit 7: Hadoop HDFS & MapReduce
  • Hadoop HDFS
    • What does HDFS Facilitate?
    • HDFS Architecture
    • Hadoop Network and Server Infrastructure
    • NameNode, Secondary NameNode and DataNode
    • Ensuring Data Correctness
    • Data Pipelining while Loading Data
    • fs Operations
  • Hadoop MapReduce
    • MapReduce Conceptualization
    • MapReduce – Overview
    • MapReduce – Programming Model
    • MapReduce – Execution Overview
    • Hadoop – Application Examples
    • Word Count – Example
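
As a rough illustration of the Word Count example listed under Hadoop MapReduce, the sketch below mimics the map, shuffle and reduce steps in plain Python on a single machine. It is not Hadoop code and is not taken from the course materials; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for this illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce step: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over an iterable of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(w, c) for w, c in shuffle(pairs).items())


print(word_count(["big data", "Big Data analytics"]))
```

In a real Hadoop job the map and reduce functions would run in parallel on different nodes and the framework would handle the shuffle; here everything runs in one process so the data flow is easy to follow.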

Unit 8: Apache HBase
  • What is HBase?
  • HBase Architecture
  • ZooKeeper
  • HBase Data model
  • HBase Deployment
  • HBase Cluster Architecture
  • Indexes in HBase
  • Scaling HBase
  • Data Locality, Coherence and Concurrency, Fault Tolerance
  • Hadoop Integration
  • High-Level Architecture
  • Replication of Data Across Data Centres
  • HBase Applications
  • Advantages and Disadvantages

Unit 9: Apache Hive
  • What is Hive?
  • Why Hive?
  • Where to use Hive?
  • Hive Architecture
  • Hive: Benefits
  • Hive: Tradeoffs
  • Hive: Real world Examples

WHO SHOULD ATTEND
  • Data Analyst - Statistics and Mining
  • Big Data Analyst
  • Operations Research Analyst
  • Senior Data Analyst - Statistics and Mining
  • Data Scientist

Cancellation Policy

We require 16 calendar days' notice to reschedule or cancel any registration. Failure to provide the required notification will result in a 100% charge for the course. If a student does not attend a scheduled course without prior notification, it will result in full forfeiture of the funds and no reschedule will be allowed. Within the required notification period, only student substitutions will be permitted. Reschedules are permitted at any time with 16 or more calendar days' notice. Enrollments must be rescheduled within six months of the cancel date or funds on account will be forfeited.

Training Location

Online Classroom

About Global Knowledge

Global Knowledge is the world's leading learning services and professional development solutions provider. We deliver learning solutions to support customers as they adapt to key business transformations and technological advancements that drive the way that organizations around the world differentiate themselves and thrive. Our learning programs, whether designed for a global organization or an individual professional, help businesses close skills gaps and foster an environment of continuous talent development.

Training Provider Rating

This vendor has an overall average rating of 4.38 out of 5 based on 419 reviews.

Wasn’t as advanced as I thought it would be. There was an issue when the day my course was the first time they used a new platform.. from adobe to something called zoom; I had to call support line cause it stated our instructor wasn’t present. Thankfully I called cause everyone online was in the adobe virtual classroom waiting for what looked like a teacher who didn’t show up for class (IT didn’t get anything resolved until 10mins after start time). I felt like he was really getting hung up on very basic knowledge for the first half of the course (talking about how to create tabs and drag formulas as an example). I completed files a few times before he was done explaining. There was a scheduled fire drill for them (roughly 30mins)that also cut into our time, which wasn’t deducted from the hour lunch break or the two, fifteen min breaks. I also really wish he touched base more on the automating workbook functions portion which we barely did. I'm happy there were/are those study guides (learning videos) and exams to take on my own time that I hope after I've had the class are still available for me to learn from.

It was difficult to practice on my PC while trying to watch the presentation online.
David was excellent!! I am very for having this course!!
Everything was great, but the instructor wasted a lot of time talking about unrelated subjects (like demo-ing different programs, talking about other classes, and talks about how Excel/technology has changed) took up way too much time. The course could have been condensed or better focus would have been great

Did not actually receive the course materials yet (and the course has concluded). Ratings assume that I will receive the course materials as soon as possible.

Facilitator was excellent

Course Reviews

No Reviews Yet
