• Course
  • Vendor

Apache Hive is the de-facto standard for data warehousing Hadoop. This course starts with standard Hive setup and operations, continues into Advanced Hive use, discusses performance and execution engines, and ends with a practical workshop.

  • Course Start Date: 2021-05-24
  • Time: 10:00:00 - 18:00:00
  • Duration: 2 Day(s)
  • Location: Virtual
  • Delivery Method(s): Virtual Instructor Led

Course Outline

Pre-Requisites

This is an Introductory-level course is geared for experienced data scientists and engineers seeking a quick start to working with Hive. Attendees should have some familiarity with basic SQL, and should also be able to navigate Linux command line and have basic knowledge of Linux editors (such as VI / nano) for editing code. This course is a core component of our Big Data, AI & Machine Learning Skills Path, designed to train participants of all skill levels in modern data science across the enterprise. We offer courses in next level Hadoop, Hive, Analytics, Kafka and more. Please contact us for details and next step recommendations based on your specific roles and. goals.

Lessons

Course Overview

Apache Hive is the de-facto standard for data warehousing Hadoop. This course starts with standard Hive setup and operations, continues into Advanced Hive use, discusses performance and execution engines, and ends with a practical workshop.

Course Objectives

This course is intended for data scientists and software engineers. It gives them practical level of experience, achieved through a combination of about 50% lecture, 50% lab work.

Course Agenda

Please note that this list of topics is based on our standard course offering, evolved from typical industry uses and trends. We’ll work with you to tune this course and level of coverage to target the skills you need most.  

Hive Basics

  • Defining Hive Tables
  • SQL Queries over Structured Data
  • Filtering / Search
  • Aggregations / Ordering
  • Partitions
  • Joins
  • Text Analytics (Semi-Structured Data)

Hive Advanced

  • Transformation, Aggregation
  • Working with Dates, Timestamps, and Arrays
  • Converting Strings to Date, Time, and Numbers
  • Create new Attributes, Mathematical Calculations, Windowing Functions
  • Use Character and String Functions
  • Binning and Smoothing
  • Processing JSON Data
  • Execution Engines (Tez, MR, Spark)

Impala (for Cloudera track)

  • Architecture
  • Impala joins and other SQL specifics

Bonus Project

  • Students will work in teams to do this end-to-end workshop
  • Setup a data warehouse with Hive
  • Query and analyze data with Hive and Spark

Course Materials

Student Materials: Each participant will receive a Student Guide with course notes, code samples, software tutorials, step-by-step written lab instructions, diagrams and related reference materials and resource links. Students will also receive the project files (or code, if applicable) and solutions required for the hands-on work.

Hands-On Setup Made Simple! Our dedicated tech team will work with you to ensure our ‘easy-access’ cloud-based course environment is accessible, fully-tested and verified as ready to go well in advance of the course start date, ensuring a smooth start to class and effective learning experience for all participants. Please inquire for details and options.

Related Courses

OTHER RELATED COURSES

Below are a few of the popular Related Courses we offer in this space. Please see the complete Course Catalog for additional options and titles.

Cancellation Policy

TBD

Training Location

Virtual Instructor Led Online Training
your home or offce

your city, your province
your country   

About Trivera Technologies LLC

x

Trivera Technologies is a woman-owned IT training education firm that has provides engaging, comprehensive technical training, consulting, mentoring and courseware development and licensing services to hundreds of organizations globally, on an annual basis. Our collaborative, skills-focused, consultative approach to developing and delivering learning helps organizations bring technical teams of all skills-levels up to speed with the latest technologies, tools, skills and best practices surrounding all aspects of application development, from concept through completion, all targeted to their specific needs and goals. 

We offer skills-focused training events onsite, online, or in blended solutions for distributed teams, from small groups to large-scale, worldwide enterprise organizations.  Services include assessment, development and delivery of targeted learning solutions for new-hire cohort programs; skills immersion boot camps and code camps; skills assessment and skills-gap training; enterprise-wide reskilling, upskilling and new-skilling programs; extensive public schedule offerings; mentoring and coaching and much more. 

Areas of specialty include: application development & programming; modern web development and design; CyberSecurity & secure coding; Data Science / AI / Machine Learning / Deep Learning; Python; DevOps; Cloud; Software architecture, design, testing and development; Agile development & Scrum; Networking & Sys Admin; O/S and Tools; project management; business information and data; IT professional skills; ITIL; COMPTIA and much more. 

Training Provider Rating

No Reviews Yet

Course Reviews

No Reviews Yet