Introduction to Big Data
- Introduction to Big Data
- Big Data Enabling Technologies
- Hadoop Stack for Big Data-Introduction
- Hadoop Stack for Big Data – Hadoop Ecosystem
The Hadoop Distributed Files system
MapReduce
- MapReduce-Introduction & Architecture
- How MapReduce Works-MapReduce Job Run
- How MapReduce Works-Failures in MapReduce
- MapReduce Types – The Default MapReduce Job
- MapReduce Types and Formats-Input & Output Formats
- Developing a MapReduce Application-Video Links
- Developing a MapReduce Application-Lab Demo Example
Apache Pig
- Apache Pig – Introduction, Installing and Running Pig
- Apache Pig – An Example, Generating Examples
- Apache Pig – Comparison with Databases
- Pig Latin
- User-Defined Functions
- Data Processing Operators
- Pig in Practice
Apache Hive
- Hive -Introduction & Comparison with Databases
- HiveQL & Tables
- Hive – Querying Data & User Defined Functions
Parallel programming with Spark
- Overview of Spark
- Fundamentals of Scala and Functional Programming
- Spark Concepts
- Spark Operations-Job Execution-Application