Course Description
This course is a general overview of the Apache Pig Framework. It will provide an introduction to the structure and methodologies of Apache Pig and an overview of Pig Latin, the Language of Apache Pig. No prior knowledge of Pig or Pig Latin is assumed, but it may be helpful to be familiar with one other programming language, such as python.
This course will include interactive tutorials for processing and aggregating data with Apache Pig, it will cover many of the functionality that is built into the language as well as how to incorporate user defined functions into pig scripts to further increase their functionality.
In the end you should be able to read and understand pig code and write your own scripts that you can implement in the interactive grunt shell or directly from the command line.
What am I going to get from this course?
- Process and aggregate data with Apache Pig
Curriculum
Module 1: Introduction
02:28
Lecture 1
Intro and Overview
01:37
Lecture 2
About the Instructor
00:51
Module 2: What is Pig?
01:52
Lecture 3
So what is Pig anyway?
01:08
Lecture 4
Why It's Called Pig
00:44
Module 3: Data Types
03:20
Lecture 5
Basic Types 1
00:27
Quiz 3
Basic Data Types Q1
Lecture 6
Basic Types 2
00:27
Quiz 4
Basic Data Types Q2
Lecture 7
Non Basic Types
01:51
Quiz 5
Non Basic Types Quiz
Lecture 8
Nulls vs Empty
00:35
Module 4: Getting Started with Pig
12:49
Lecture 9
Introduction to the Data
01:29
Lecture 10
Getting Hadoop
01:16
If you don't have access to a hadoop environment, download and set up a sandbox now.
Lecture 11
Starting Hadoop and moving data
07:15
Quiz 7
Start Hadoop and Move Data
Lecture 12
Three Ways to Run Pig Commands
00:47
Lecture 13
Utility Commands: Help and Quit
00:52
Quiz 8
Try it out: Help and Quit
Lecture 14
Common Development Environments
01:10
Module 5: Basic Elements of a Pig Script
14:55
Lecture 15
Pig Latin Statements
01:39
Lecture 16
Load Data
01:17
Lecture 17
Store/dump Data
02:12
Lecture 18
Setting up Sublime Text
00:47
Quiz 11
Set up Sublime Text Exercise
Lecture 19
Load Data Example
03:29
Quiz 12
Load Data Exercise
Lecture 20
Store/dump Example
04:41
Lecture 21
Quick Note about pig Logs
00:50
Module 6: Relational Operators
58:49
Lecture 22
Describe
00:58
Quiz 13
Describe Exercise
Lecture 23
Limit and Sample
03:28
Lecture 28
Disambiguation
06:14
Quiz 16
Disambiguation Exercise
Lecture 31
Distinct
02:37
Lecture 35
Conditional Statements
02:33
Quiz 19
Order By Exercise
Lecture 38
Nested Foreach
03:31
Quiz 20
Nested ForEach Exercise
Module 7: Built In Functions
33:56
Lecture 40
Eval Functions
06:31
Lecture 41
Eval Functions 2
01:37
Quiz 21
Eval Functions Exercise
Lecture 42
Arithmetic Functions
04:04
Quiz 22
Arithmetic Functions Exercise
Lecture 43
Datetime Functions
10:18
Lecture 44
String Functions
06:01
Quiz 23
String Functions Exercise
Lecture 45
Tuple/map/bag
01:22
Lecture 46
User Defined Functions
02:46
Module 8: Configuring Pig
07:01
Lecture 47
Part 3 Intro
00:19
Lecture 48
Parametrization
04:31
Lecture 49
Utility Commands
02:11