Pig for Beginners

hadoop-logo  pig-in-overalls-medium

The following link is to a simple tutorial to get started with Pig.

Pig is a data flow platform for writing Hadoop operations in a language called Pig Latin. It adds a layer of abstraction on top of Hadoop to simplify its use by giving a SQL-like interface to process data on Hadoop and thus help the programmer focus on business logic and help increase productivity. It supports a variety of data types and the use of user-defined functions (UDFs) to write custom operations in Java, Python and JavaScript. Due its simple interface,  support for doing complex operations such as joins and filters, Pig is popular for performing query operations in hadoop.


The objective of this tutorial is to get you up and running Pig scripts on a real-world dataset stored in Hadoop.

Pig for Beginners

Via: Orzota