Big Data Analytics (BDA)

Introduction to Big Data

GLIMPSE of Big Data Analytics

The Hadoop Distributed Files System (HDFS)

MapReduce

Glimpse of MapReduce
Implement Word Count Application using MapReduce on Single Node Cluster 

NoSQL Databases

GLIMPSE of NoSQL databases
Review of Traditional Databases
NoSQL – Introduction
Need for NoSQL Databases
Columnar Databases
Failover and Reliability Principles
CAP Theorem
Differences between SQL and NoSQL databases
Types of NoSQL Databases
Advantages & Disadvantages of NoSQL
Impact of NoSQL on Data Science
Commonly used NoSQL Databases

Working mechanisms of MongoDB

Glimpse of MongoDB
MongoDB- Overview
MongoDB – Advantages | Environment
MongoDB – Installation on Windows
MongoDB – Create & Drop Database
MongoDB- Create & Drop Collection
MongoDB – Data Types

    APACHE PIG

    Glimpse of Apache Pig
    Apache Pig – Introduction by Sri K. Gangadhar Rao
    Apache Pig – Installing and Running Pig by Sri K. Gangadhar Rao
    Working with Pig Latin for implementing the Word Count application

    APACHE HIVE

    GLIMPSE of Apache HIVE
    GLIMPSE of Hive Query Language (HiveQL)

    Introduction to Spark

    GLIMPSE of Apache Spark

    Spark’s Toolset

    Spark SQL

    Datasets

    Resilient Distributed Datasets (RDD)