[已软注销]对《Programming Pig》的笔记(1)

[已软注销] (Hello world)

书名: Programming Pig
作者: Alan Gates
页数: 222
出版社: O'Reilly Media
出版年: 2011-10-20

Pigs eat anything
Pig can operate on data whether it has metadata or not. It can operate on data that is relational, nested, or unstructured. And it can easily be extended to operate on data beyond files, including key/value stores, databases, etc.

Pigs live anywhere
Pig is intended to be a language for parallel data processing. It is not tied to one particular parallel framework. It has been implemented first on Hadoop, but we do not intend that to be only on Hadoop.

Pigs are domestic animals
Pig is designed to be easily controlled and modified by its users. Pig allows integration of user code wherever possible, so it currently supports user defined field transformation functions, user defined aggregates, and user defined conditionals. These functions can be written in Java or in scripting languages that can compile down to Java (e.g., Jython). Pig supports user provided load and store functions. It supports external executables via its stream command and MapReduce JARs via its mapreduce command. It allows users to provide a custom partitioner for their jobs in some circumstances, and to set the level of reduce parallelism for their jobs.
Pig has an optimizer that rearranges some operations in Pig Latin scripts to give better performance, combines MapReduce jobs together, etc. However, users can easily turn this optimizer off to prevent it from making changes that do not make sense in their situation.

Pigs fly
Pig processes data quickly. We want to consistently improve performance, and not implement features in ways that weigh Pig down so it can’t fly.引自 Pig Philosophy

2013-04-16 14:22:13 回应

[已软注销]的其他笔记 · · · · · · ( 全部82条 )

论美国的民主: 1
Big Debt Crises: 1
论美国的民主: 1
The Defining Decade: 1
In The Plex: 1
Verbal Advantage: 2
Introduction to Algorithms (3/e): 1
Merriam-Webster's Vocabulary Builder: 1
Programming Erlang, Second Edition: 1
Capital in the Twenty First Century: 1
Programming Clojure: 1
编程珠玑: 1
我们都要性小康: 1
Haskell趣学指南: 1
The Joy of Clojure: 1
经济为什么会崩溃: 1
ZeroMQ: 2
通往奴役之路: 1
Linux Firewalls: 1
The Datacenter as a Computer: 1
国富论: 1
构建高性能Web站点: 1
HTTP权威指南: 1
flex & bison: 1
Understanding the Linux Virtual Memory Manager: 2
The Little Book of Semaphores, 2nd Edition: 2
依靠自我: 1
Operating Systems: 9
Structure and Interpretation of Computer Programs - 2nd Edition (MIT): 4
Linux内核完全剖析: 1
TCP/IP基础教程基于实验的方法: 1
MongoDB: 1
如彗星划过夜空: 3
Just for Fun: 8
编译原理及实践: 1
TCP/IP详解卷1：协议: 1
Coders at Work: 4
什么是数学: 1
那些忧伤的年轻人: 3
我也有一个梦想: 1
软件随想录: 1
Event Processing in Action: 1
FLEX 与 BISON(影印版): 1
ANSI Common Lisp: 1
黑客与画家: 7
九型人格: 1

[已软注销]对《Programming Pig》的笔记(1)

[已软注销] (Hello world)

第9页 Pig Philosophy

[已软注销]的其他笔记 · · · · · · ( 全部82条 )