Introduction to Big Data
GLIMPSE of Big Data Analytics
The Hadoop Distributed Files System (HDFS)
MapReduce
Glimpse of MapReduce
Implement Word Count Application using MapReduce on Single Node Cluster
NoSQL Databases
GLIMPSE of NoSQL databases
Review of Traditional Databases
NoSQL – Introduction
Need for NoSQL Databases
Columnar Databases
Failover and Reliability Principles
CAP Theorem
Differences between SQL and NoSQL databases
Types of NoSQL Databases
Advantages & Disadvantages of NoSQL
Impact of NoSQL on Data Science
Commonly used NoSQL Databases
Working mechanisms of MongoDB
Glimpse of MongoDB
MongoDB- Overview
MongoDB – Advantages | Environment
MongoDB – Installation on Windows
MongoDB – Create & Drop Database
MongoDB- Create & Drop Collection
MongoDB – Data Types
APACHE PIG
Glimpse of Apache Pig
Apache Pig – Introduction by Sri K. Gangadhar Rao
Apache Pig – Installing and Running Pig by Sri K. Gangadhar Rao
Working with Pig Latin for implementing the Word Count application
APACHE HIVE
GLIMPSE of Apache HIVE
GLIMPSE of Hive Query Language (HiveQL)
Introduction to Spark
GLIMPSE of Apache Spark
Spark’s Toolset
Spark SQL
Datasets
Resilient Distributed Datasets (RDD)