出版社: O’Reilly Media, Inc
译者: 智普教育 / jeapedu.com
出版年: 2015-2-1
页数: 239
定价: US $39.99
装帧: 平装
ISBN: 9781449357627
内容简介 · · · · · ·
As parallel data analysis has grown common, practitioners in many fields have sought
easier tools for this task. Apache Spark has quickly emerged as one of the most popu‐
lar, extending and generalizing MapReduce. Spark offers three main benefits. First, it
is easy to use—you can develop applications on your laptop, using a high-level API
that lets you focus on the content of y...
As parallel data analysis has grown common, practitioners in many fields have sought
easier tools for this task. Apache Spark has quickly emerged as one of the most popu‐
lar, extending and generalizing MapReduce. Spark offers three main benefits. First, it
is easy to use—you can develop applications on your laptop, using a high-level API
that lets you focus on the content of your computation. Second, Spark is fast, ena‐
bling interactive use and complex algorithms. And third, Spark is a general engine,
letting you combine multiple types of computations (e.g., SQL queries, text process‐
ing, and machine learning) that might previously have required different engines.
These features make Spark an excellent starting point to learn about Big Data in
general.
This introductory book is meant to get you up and running with Spark quickly.
You’ll learn how to download and run Spark on your laptop and use it interactively
to learn the API. Once there, we’ll cover the details of available operations and dis‐
tributed execution. Finally, you’ll get a tour of the higher-level libraries built into
Spark, including libraries for machine learning, stream processing, and SQL. We
hope that this book gives you the tools to quickly tackle data analysis problems,
whether you do so on one machine or hundreds.
作者简介 · · · · · ·
The authors would like to thank the reviewers who offered feedback on this book:
Joseph Bradley, Dave Bridgeland, Chaz Chandler, Mick Davies, Sam DeHority, Vida
Ha, Andrew Gal, Michael Gregson, Jan Joeppen, Stephan Jou, Jeff Martinez, Josh
Mahonin, Andrew Or, Mike Patterson, Josh Rosen, Bruce Szalwinski, Xiangrui
Meng, and Reza Zadeh.
The authors would like to extend a special ...
The authors would like to thank the reviewers who offered feedback on this book:
Joseph Bradley, Dave Bridgeland, Chaz Chandler, Mick Davies, Sam DeHority, Vida
Ha, Andrew Gal, Michael Gregson, Jan Joeppen, Stephan Jou, Jeff Martinez, Josh
Mahonin, Andrew Or, Mike Patterson, Josh Rosen, Bruce Szalwinski, Xiangrui
Meng, and Reza Zadeh.
The authors would like to extend a special thanks to David Andrzejewski, David But‐
tler, Juliet Hougland, Marek Kolodziej, Taka Shinagawa, Deborah Siegel, Dr. Normen
Müller, Ali Ghodsi, and Sameer Farooqui. They provided detailed feedback on the
majority of the chapters and helped point out many significant improvements.
We would also like to thank the subject matter experts who took time to edit and
write parts of their own chapters. Tathagata Das worked with us on a very tight
schedule to finish Chapter 10. Tathagata went above and beyond with clarifying
Learning Spark的书评 · · · · · · ( 全部 7 条 )
对于小白来说还是有点晦涩
相对书还是有些老旧了
入门spark的好书
Spark快速大数据分析
基于Python Spark的大数据分析(第一期)
> 更多书评 7篇
论坛 · · · · · ·
在这本书的论坛里发言这本书的其他版本 · · · · · · ( 全部5 )
-
人民邮电出版社 (2015)7.9分 473人读过
-
人民邮电出版社 (2021)8.1分 30人读过
-
O'Reilly Media (2020)暂无评分 21人读过
-
东南大学出版社 (2015)暂无评分 3人读过
以下书单推荐 · · · · · · ( 全部 )
- 数字化表征----与环境利益相关者的数据互动 (小毛叔)
谁读这本书? · · · · · ·
二手市场
· · · · · ·
- 在豆瓣转让 手里有一本闲着?
订阅关于Learning Spark的评论:
feed: rss 2.0
0 有用 有一个这样的人 2021-11-26 21:02:08
入门书籍
0 有用 有一个这样的人 2021-11-26 21:02:08
入门书籍