Big Data and Hadoop Course Syllabus

This Big Data and Hadoop course syllabus outlines the topics students learn in the Big Data Hadoop Course. The syllabus has been carefully curated by our experts from the IT industry, keeping in mind current trends in the IT sector. It is therefore modern and reliable, ranging from concepts like Hadoop clusters and master nodes to Kafka and Scala.

Download our Big Data and Hadoop course syllabus PDF, offered by the best Big Data and Hadoop Training Institute in Chennai.

Duration: 2 Months
Job-Ready Syllabus
Certified Courses

Let's take the first step to becoming an expert in Big Data and Hadoop

100% Placement Assurance

What Learning at SLA gives you

  • Technology Training
  • Aptitude Training
  • Learn to Code (Codeathon)
  • Real Time Projects
  • Learn to Crack Interviews
  • Panel Mock Interview
  • Unlimited Interviews
  • Life Long Placement Support

Breakdown of Big Data and Hadoop Course Fee and Batches

Hands-On Training

  • 3-5 Real Time Projects
  • 60-100 Practical Assignments
  • 3+ Assessments / Mock Interviews

May 2024, Weekdays (Mon-Fri), Online/Offline

  • 2 Hours Real Time Interactive Technical Training
  • 1 Hour Aptitude
  • 1 Hour Communication & Soft Skills

(Suitable for Fresh Jobseekers / Non IT to IT transition)

Course Fee
May 2024, Weekends (Sat-Sun), Online/Offline

  • 4 Hours Real Time Interactive Technical Training

(Suitable for working IT Professionals)

Course Fee

Save up to 20% on your course fee with our Job Seeker Course Series

Syllabus for The Big Data and Hadoop Course

1. Big Data: Introduction

❖ What is Big Data
❖ Evolution of Big Data
❖ Benefits of Big Data
❖ Operational vs Analytical Big Data
❖ Need for Big Data Analytics
❖ Big Data Challenges

2. Hadoop Cluster

❖ Master Nodes
❖ Name Node
❖ Secondary Name Node
❖ Job Tracker
❖ Client Nodes
❖ Slaves
❖ Hadoop configuration
❖ Setting up a Hadoop cluster

3. HDFS

❖ Introduction to HDFS
❖ HDFS Features
❖ HDFS Architecture
❖ Blocks
❖ Goals of HDFS
❖ The Name Node & Data Node
❖ Secondary Name Node
❖ The Job Tracker
❖ The Process of a File Read
❖ How does a File Write work
❖ Data Replication
❖ Rack Awareness
❖ HDFS Federation
❖ Configuring HDFS
❖ HDFS Web Interface
❖ Fault tolerance
❖ Name Node failure management
❖ Access HDFS from Java

4. YARN

❖ Introduction to YARN
❖ Why YARN
❖ Classic MapReduce v/s YARN
❖ Advantages of YARN
❖ YARN Architecture
❖ Resource Manager
❖ Node Manager
❖ Application Master
❖ Application submission in YARN
❖ Node Manager containers
❖ Resource Manager components
❖ YARN applications
❖ Scheduling in YARN
❖ Fair Scheduler
❖ Capacity Scheduler
❖ Fault tolerance

5. MapReduce

❖ What is MapReduce
❖ Why MapReduce
❖ How MapReduce works
❖ Difference between Hadoop 1 & Hadoop 2
❖ Identity mapper & reducer
❖ Data flow in MapReduce
❖ Input Splits
❖ Relation Between Input Splits and HDFS Blocks
❖ Flow of Job Submission in MapReduce
❖ Job submission & Monitoring
❖ MapReduce algorithms
❖ Sorting
❖ Searching
❖ Indexing
❖ TF-IDF

6. Hadoop Fundamentals

❖ What is Hadoop
❖ History of Hadoop
❖ Hadoop Architecture
❖ Hadoop Ecosystem Components
❖ How does Hadoop work
❖ Why Hadoop & Big Data
❖ Hadoop Cluster introduction
❖ Cluster Modes
❖ Standalone
❖ Pseudo-distributed
❖ Fully-distributed
❖ HDFS Overview
❖ Introduction to MapReduce
❖ Hadoop in demand

7. HDFS Operations

❖ Starting HDFS
❖ Listing files in HDFS
❖ Writing a file into HDFS
❖ Reading data from HDFS
❖ Shutting down HDFS

8. HDFS Command Reference

❖ Listing contents of directory
❖ Displaying and printing disk usage
❖ Moving files & directories
❖ Copying files and directories
❖ Displaying file contents

9. Java Overview for Hadoop

❖ Object oriented concepts
❖ Variables and Data types
❖ Static data type
❖ Primitive data types
❖ Objects & Classes
❖ Java Operators
❖ Method and its types
❖ Constructors
❖ Conditional statements
❖ Looping in Java
❖ Access Modifiers
❖ Inheritance
❖ Polymorphism
❖ Method overloading & overriding
❖ Interfaces

10. MapReduce Programming

❖ Hadoop data types
❖ The Mapper Class
❖ Map method
❖ The Reducer Class
❖ Shuffle Phase
❖ Sort Phase
❖ Secondary Sort
❖ Reduce Phase
❖ The Job class
❖ Job class constructor
❖ Job Context interface
❖ Combiner Class
❖ How Combiner works
❖ Record Reader
❖ Map Phase
❖ Combiner Phase
❖ Reducer Phase
❖ Record Writer
❖ Partitioners
❖ Input Data
❖ Map Tasks
❖ Partitioner Task
❖ Reduce Task
❖ Compilation & Execution
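
To give a feel for how the phases listed in this module fit together, here is a minimal single-process sketch in Python. It is not Hadoop code and does not use the Hadoop Java API taught in the course; the function names (`map_phase`, `combine`, `partition`, `run_job`) and the toy hash used by the partitioner are assumptions made purely for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in line.split()]


def combine(pairs):
    """Combiner: pre-sum counts per record so less data is shuffled."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())


def partition(key, num_reducers):
    """Partitioner: pick a reducer for a key (toy hash, not Hadoop's)."""
    return sum(key.encode("utf-8")) % num_reducers


def run_job(lines, num_reducers=2):
    # One bucket per reducer; each bucket groups values by key (shuffle).
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for line in lines:  # each line plays the role of one input record
        for key, value in combine(map_phase(line)):
            buckets[partition(key, num_reducers)][key].append(value)

    result = {}
    for bucket in buckets:
        for key in sorted(bucket):          # sort: keys are visited in order
            result[key] = sum(bucket[key])  # reduce: sum the grouped values
    return result


if __name__ == "__main__":
    print(run_job(["big data big ideas", "data flows"]))
```

In a real cluster each phase runs as separate tasks across many machines; the sketch only mirrors the order in which data moves through map, combine, partition, shuffle, sort, and reduce.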

11. Hadoop Ecosystem: Pig

❖ What is Apache Pig?
❖ Why Apache Pig?
❖ Pig features
❖ Where should Pig be used
❖ Where not to use Pig
❖ The Pig Architecture
❖ Pig components
❖ Pig v/s MapReduce
❖ Pig v/s SQL
❖ Pig v/s Hive
❖ Pig Installation
❖ Pig Execution Modes & Mechanisms
❖ Grunt Shell Commands
❖ Pig Latin – Data Model
❖ Pig Latin Statements
❖ Pig data types
❖ Pig Latin operators
❖ Case Sensitivity
❖ Grouping & Co Grouping in Pig Latin
❖ Sorting & Filtering
❖ Joins in Pig Latin
❖ Built-in Functions
❖ Writing UDFs
❖ Macros in Pig

12. HBase

❖ What is HBase
❖ History of HBase
❖ The NoSQL Scenario
❖ HBase & HDFS
❖ Physical Storage
❖ HBase v/s RDBMS
❖ Features of HBase
❖ HBase Data model
❖ Master server
❖ Region servers & Regions
❖ HBase Shell
❖ Create table and column family
❖ The HBase Client API

13. Spark

❖ Introduction to Apache Spark
❖ Features of Spark
❖ Spark built on Hadoop
❖ Components of Spark
❖ Resilient Distributed Datasets
❖ Data Sharing using Spark RDD
❖ Iterative Operations on Spark RDD
❖ Interactive Operations on Spark RDD
❖ Spark shell
❖ RDD transformations
❖ Actions
❖ Programming with RDD
❖ Start Shell
❖ Create RDD
❖ Execute Transformations
❖ Caching Transformations
❖ Applying Action
❖ Checking output
❖ GraphX overview

14. Impala

❖ Introducing Cloudera Impala
❖ Impala Benefits
❖ Features of Impala
❖ Relational databases vs Impala
❖ How Impala works
❖ Architecture of Impala
❖ Components of Impala
❖ The Impala Daemon
❖ The Impala Statestore
❖ The Impala Catalog Service
❖ Query Processing Interfaces
❖ Impala Shell Command Reference
❖ Impala Data Types
❖ Creating & deleting databases and tables
❖ Inserting & overwriting table data
❖ Fetching & ordering records
❖ Grouping records
❖ Using the Union clause
❖ Working of Impala with Hive
❖ Impala v/s Hive v/s HBase

15. MongoDB Overview

❖ Introduction to MongoDB
❖ MongoDB v/s RDBMS
❖ Why & Where to use MongoDB
❖ Databases & Collections
❖ Inserting & querying documents
❖ Schema Design
❖ CRUD Operations

16. Oozie & Hue Overview

❖ Introduction to Apache Oozie
❖ Oozie Workflow
❖ Oozie Coordinators
❖ Property File
❖ Oozie Bundle system
❖ CLI and extensions
❖ Overview of Hue

17. Hive

❖ What is Hive?
❖ Features of Hive
❖ The Hive Architecture
❖ Components of Hive
❖ Installation & configuration
❖ Primitive types
❖ Complex types
❖ Built-in functions
❖ Hive UDFs
❖ Views & Indexes
❖ Hive Data Models
❖ Hive vs Pig
❖ Co-groups
❖ Importing data
❖ Hive DDL statements
❖ Hive Query Language
❖ Data types & Operators
❖ Type conversions
❖ Joins
❖ Sorting & controlling data flow
❖ Local vs MapReduce mode
❖ Partitions
❖ Buckets

18. Sqoop

❖ Introducing Sqoop
❖ Sqoop installation
❖ Working of Sqoop
❖ Understanding connectors
❖ Importing data from MySQL to Hadoop HDFS
❖ Selective imports
❖ Importing data to Hive
❖ Importing data to HBase
❖ Exporting data to MySQL from Hadoop
❖ Controlling import process

19. Flume

❖ What is Flume?
❖ Applications of Flume
❖ Advantages of Flume
❖ Flume architecture
❖ Data flow in Flume
❖ Flume features
❖ Flume Event
❖ Flume Agent
❖ Sources
❖ Channels
❖ Sinks
❖ Log Data in Flume

20. ZooKeeper Overview

❖ ZooKeeper Introduction
❖ Distributed Application
❖ Benefits of Distributed Applications
❖ Why use ZooKeeper
❖ ZooKeeper Architecture
❖ Hierarchical Namespace
❖ Znodes
❖ Stat structure of a Znode
❖ Electing a leader

Want more details about the Big Data and Hadoop Syllabus?

Fill out the form, and our counsellors will get in touch with you at your preferred time to answer all your queries. Once you decide that SLA is the perfect fit for your training needs, our counsellors will guide you through the process every step of the way.

Course Schedules

PDF Course Syllabus

Course Fees

or any other questions...

The SLA Way to Become a Big Data and Hadoop Expert

  • Enrollment
  • Technology Training: Coding Practices, Realtime Projects
  • Placement Training: Aptitude Training, Interview Skills
  • Panel Mock Interview
  • Unlimited Interviews
  • Interview Feedback
  • 100% IT Career

Google Reviews

Rating: 4.8 (1,053 Google reviews)

Aswin Pandiyan

It's my genuine review i searched many institutes to do software testing course and finally landed in SLA. My counselor Bala backed me and my…

Hari Krishnan

I am from Mechanical background. I have no command over programming before joining SLA Such a wonderful place to learn and achieve your goals.Having wonderful…

Venkatesh

I joined SLA for python course one year back, and i got selected as software developer with decent salary, with after one year of experience,…

Ganesan Vasu

I'm from non-IT background, but always wanted to be in IT, SLA make my dream true and I don't know how many will see this…

Discover What Our Students Have To Say

Our counsellors will share the syllabus PDF with you via email or WhatsApp.

If you have any questions that you did not find answers for, our counsellors are here to answer them. You can get all your queries answered before deciding to join SLA and move your career forward.

We are excited to get started with you

Give us your information and we will arrange a free call (at your convenience) with one of our counsellors.