Mindteck Academy
  • Home
  • Hadoop
  • MLPython
  • PythonMongoDB
  • DevOps
  • AWS
  • Blog
  • Registration
  • More Info
  • RegistrationOld
  • Home Save
  • Home
  • Hadoop
  • MLPython
  • PythonMongoDB
  • DevOps
  • AWS
  • Blog
  • Registration
  • More Info
  • RegistrationOld
  • Home Save
Search by typing & pressing enter

YOUR CART

Picture

Hadoop,  Spark and Scala

Course Duration:  66 hours

Option 1 - Weekday Classes
Date:  February 8th 
– April 5th, 2019
Time:  8PM – 10PM ET, Mon, Tues, Thurs, Fri
Cost:  $749.00 

Option 2 - Weekend Classes
Date:  TBD

Time:  9AM – 2PM ET, Saturday – Sunday
Cost:  $749.00 


Click HERE to book your seat

| About

Mindteck Academy’s live, instructor-led Hadoop, Spark and Scala online course prepares experienced and rookie professionals for in-demand roles at data-driven enterprises around the globe.

In this structured program, curated and taught by an industry expert, you will learn about Big Data and the Hadoop Ecosystem.

The first half of the course begins with an introduction to Big Data concepts and its application in the real world. Then the lectures move on to advanced topics, such as the Hadoop Distributed File System, batch/parallel processing using MapReduce, and various frameworks defined in the Hadoop Ecosystem  (i.e. YARN, HBase, Pig, Hive, Oozie and Zookeeper).

In the second half of the course, you will learn Apache Spark and Scala in detail. Apache Spark is a fast  general processing engine compatible with Hadoop data. It is designed to perform both batch processing  and new workloads like streaming, interactive queries, and machine learning. Scala combines object-oriented and functional programming in one concise, high-level language. it is used to work with Spark.

The course ends with a Capstone Project which will help you apply skills that you've learned in solving a real-world problem.

What you will learn:
  • The usage of the complex business solution
  • Data ingestion techniques with using MapReduce Sqoop and Flume
  • ETL operations and data analytics
  • Implementing Partitioning, Bucketing, and Hive Five
  • HBase, i.e. a NoSQL Database
  • Integrating HBase with Hive in Hadoop, HBase Architecture and Mechanisms
  • Oozie for scheduling
  • Tools for Hadoop development
  • Big Data Analytics use cases
For the Apache Spark module:
  • Spark and its Ecosystem an Introduction
  • RDD in Spark
  • The concepts of HDFS Architecture
  • Understanding Hadoop 2.X
  • Data loading techniques through Sqoop
  • Implementing Spark
  • Implementing Spark operations applications on YARN
  • Implementing machine learning algorithms                
  • Understanding Spark SQL
  • Understanding Kafka                      
  • Integrating Kafka with real time streaming
  • Using Kafka to produce and consume messages from various sources
  • Spark Streaming                          
  • Process Multiple Batches in Spark
Who should take this course:
The market for Big Data analytics is growing across the world, and this strong growth pattern translates into a great opportunity for all IT professionals – seasoned and new – to accelerate their careers.
The course is best suited for:
  • Software Developers, Project Managers, and Software Architects
  • ETL and Data Warehousing Professionals
  • Analytics and Business Intelligence Professionals
  • DBAs and DB professionals
  • Senior IT Technical Managers
  • Testing Professionals 
  • Mainframe Professionals
  • Rookies seeking to build a career in Big Data
Click HERE to view FAQs, sample use cases and more!

| Hadoop, Spark and Scala Curriculum - 66 Hours

Picture
Picture
Schedule
​Option 1 - Weekday Schedule
Starts Monday, February 8th promptly at 8PM ET with Orientation, continuing on with the initial lecture until 10PM.  The remainder of the live, instructor-led online course occurs 4 days each week - Monday, Tuesday, Thursday and Friday.

Option 2 - Weekend Schedule – Saturdays and Sundays
Starts Saturday, TBD promptly at 9AM ET with Orientation, continuing on with the initial lecture until 2PM.  The remainder of the live, instructor-led online course occurs every Saturday and Sunday.
​

Prerequisites
There are no prerequisites for this course.  Prior knowledge of Core Java and SQL will be helpful, though it is not mandatory.  There will be case study driven sessions and extensive hands-on coding.  To brush up your skills, our training partner provides a complimentary, self-paced Java Essentials for Hadoop course when you enroll in this course.

System Requirements
You will use your own computer and be expected to have a Windows 7/10, Linux or Mac system.  You will be utilizing CloudLab, a Hadoop environment accessible via a browser with minimal configuration.  An up-to-date browser on your system is necessary. 

Cost
$749.00 


Click here to book your seat.
Check out our Blog
If at any time you’d prefer to speak with us, please call 1-844-323-CODE. Or, email info@mindteckacademy.com and we’ll be in touch shortly thereafter.  Thank you!
Picture
© 2018 Mindteck.  All Rights Reserved.