Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and I/O building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
Chinese edition, p. 412: "So in theory, anything can be represented in binary form and then converted into a long-integer string, or a data structure can be serialized directly, to serve as the key value." Original edition, p. 460: "..., so theoretically anything can serve as row key, from strings to binary representations of long or even serialized ..."
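A minimal sketch of the point in that passage, assuming the older HTable/Put client API from the book's era and a hypothetical table name: an HBase row key is just a byte array, so a string, a long, or a serialized structure can all serve as the key.
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.util.Bytes;

public class RowKeySketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        HTable table = new HTable(conf, "observations");          // hypothetical table name

        byte[] stringKey = Bytes.toBytes("station-011990-99999"); // a string row key
        byte[] longKey = Bytes.toBytes(1329345600000L);           // an 8-byte long row key

        Put put = new Put(longKey);                                // any byte[] works as the row key
        put.add(Bytes.toBytes("info"), Bytes.toBytes("station"), stringKey);
        table.put(put);
        table.close();
    }
}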
HDFS is a filesystem designed for storing very large files with streaming data access patterns (write-once, read-many-times), running on clusters of commodity hardware.
HDFS blocks (64 MB by default) are large compared to disk blocks, and the reason is to minimize the cost of seeks. Map tasks in MapReduce normally operate on one block at a time, so if you have too few tasks (fewer than nodes in the cluster), your jobs will run slower than they could otherwise.
An HDFS cluster has two types of node operating in a master-worker pattern: a namenode (the master) and a number of datanodes (workers). The namenode manages the filesystem namespace. It maintains the filesystem tree and the metadata for all the files and directories in the tree. Datanodes are the workhorses of the filesystem. They store and retrieve blocks when they are told to (by clients or the namenode), and they report back to the namenode periodically with lists of blocks that they are storing.
After a successful return from sync(), HDFS guarantees that the data written up to that point in the file is persisted and visible to all new readers.
Hadoop Archives, or HAR files, are a file archiving facility that packs files into HDFS blocks more efficiently, thereby reducing namenode memory usage while still allowing transparent access to files. (quoted from: HDFS)
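A minimal sketch of the sync() guarantee in code (not from the book; the URI and path below are made up): the stream returned by FileSystem.create() exposes sync() in the API generation the book covers, renamed hflush() in later releases.
import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HflushExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode/"), conf);
        FSDataOutputStream out = fs.create(new Path("/tmp/visibility-demo.txt"));
        out.writeUTF("first record");
        out.hflush();   // data written so far is now visible to new readers (sync() in older APIs)
        out.writeUTF("second record");
        out.close();    // close() flushes the remaining data and finalizes the file
    }
}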
Data Flow - Read
[Figure: read data flow of HDFS]
One important aspect of this design is that the client contacts datanodes directly to retrieve data and is guided by the namenode to the best datanode for each block. This design allows HDFS to scale to a large number of concurrent clients, since the data traffic is spread across all the datanodes in the cluster.
Hadoop takes a simple approach in which the network is represented as a tree and the distance between two nodes is the sum of their distances to their closest common ancestor. (quoted from: HDFS)
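That rule is easy to see with a tiny, purely hypothetical helper (not Hadoop's own NetworkTopology class), assuming node locations are written as paths like /datacenter/rack/node:
public class NetworkDistance {
    // locations are paths such as "/d1/r1/n1" (data center / rack / node)
    static int distance(String a, String b) {
        String[] pa = a.substring(1).split("/");
        String[] pb = b.substring(1).split("/");
        int common = 0;
        while (common < pa.length && common < pb.length && pa[common].equals(pb[common])) {
            common++;
        }
        // steps from each node up to the closest common ancestor, summed
        return (pa.length - common) + (pb.length - common);
    }

    public static void main(String[] args) {
        System.out.println(distance("/d1/r1/n1", "/d1/r1/n1")); // 0: same node
        System.out.println(distance("/d1/r1/n1", "/d1/r1/n2")); // 2: same rack
        System.out.println(distance("/d1/r1/n1", "/d1/r2/n3")); // 4: same data center, different rack
        System.out.println(distance("/d1/r1/n1", "/d2/r3/n4")); // 6: different data centers
    }
}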
Data Flow - Write
[Figure: write data flow of HDFS]
As the client writes data (step 3), DFSOutputStream splits it into packets, which it writes to an internal queue, called the data queue. The data queue is consumed by the DataStreamer, whose responsibility it is to ask the namenode to allocate new blocks by picking a list of suitable datanodes to store the replicas. The list of datanodes forms a pipeline—we’ll assume the replication level is three, so there are three nodes in the pipeline. The DataStreamer streams the packets to the first datanode in the pipeline, which stores the packet and forwards it to the second datanode in the pipeline.
DFSOutputStream also maintains an internal queue of packets that are waiting to be acknowledged by datanodes, called the ack queue. A packet is removed from the ack queue only when it has been acknowledged by all the datanodes in the pipeline (step 5). If a datanode fails while data is being written to it, then the following actions are taken, which are transparent to the client writing the data. First the pipeline is closed, and any packets in the ack queue are added to the front of the data queue so that datanodes that are downstream from the failed node will not miss any packets. The current block on the good datanodes is given a new identity, which is communicated to the namenode, so that the partial block on the failed datanode will be deleted if the failed datanode recovers later on. The failed datanode is removed from the pipeline and the remainder of the block’s data is written to the two good datanodes in the pipeline. The namenode notices that the block is under-replicated, and it arranges for a further replica to be created on another node. Subsequent blocks are then treated as normal.
Hadoop’s default strategy is to place the first replica on the same node as the client (for clients running outside the cluster, a node is chosen at random, although the system tries not to pick nodes that are too full or too busy). The second replica is placed on a different rack from the first (off-rack), chosen at random. The third replica is placed on the same rack as the second, but on a different node chosen at random. Further replicas are placed on random nodes on the cluster, although the system tries to avoid placing too many replicas on the same rack. (quoted from: HDFS)
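The strategy reads fairly directly as code. The sketch below is only an illustration of the rules just quoted — it is not Hadoop's BlockPlacementPolicy, the Node type is invented, it assumes a cluster with at least two racks and at least two nodes per rack, and it ignores the "too full / too busy" checks.
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class ReplicaPlacementSketch {
    static class Node {
        String name; String rack;
        Node(String name, String rack) { this.name = name; this.rack = rack; }
    }

    static List<Node> chooseTargets(Node client, List<Node> cluster, Random rnd) {
        List<Node> targets = new ArrayList<Node>();
        // 1st replica: the client's own node, or a random node for external clients
        Node first = (client != null) ? client : cluster.get(rnd.nextInt(cluster.size()));
        targets.add(first);
        // 2nd replica: a random node on a different rack (off-rack)
        Node second;
        do { second = cluster.get(rnd.nextInt(cluster.size())); }
        while (second.rack.equals(first.rack));
        targets.add(second);
        // 3rd replica: a different node on the same rack as the second
        Node third;
        do { third = cluster.get(rnd.nextInt(cluster.size())); }
        while (!third.rack.equals(second.rack) || third.name.equals(second.name));
        targets.add(third);
        return targets;
    }
}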
Additions in the third edition:
HDFS Federation, introduced in the 0.23 release series, allows a cluster to scale by adding namenodes, each of which manages a portion of the filesystem namespace. For example, one namenode might manage all the files rooted under /user , say, and a second namenode might handle files under /share .
The 0.23 release series of Hadoop remedies this situation (the namenode as a single point of failure) by adding support for HDFS high availability (HA). In this implementation there is a pair of namenodes in an active-standby configuration. (quoted from: HDFS)
Why can’t we use databases with lots of disks to do large-scale batch analysis? Why is MapReduce needed?
The answer to these questions comes from another trend in disk drives: seek time is improving more slowly than transfer rate. Seeking is the process of moving the disk’s head to a particular place on the disk to read or write data. It characterizes the latency of a disk operation, whereas the transfer rate corresponds to a disk’s bandwidth.
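A back-of-the-envelope comparison makes the point concrete. The numbers below (10 ms per seek, 100 MB/s transfer, a 1 TB dataset of 100-byte records, updating 1% of them) are illustrative assumptions, not figures from the book.
public class SeekVsTransfer {
    public static void main(String[] args) {
        double seekMs = 10.0;                                    // assumed time per random seek
        double transferMBps = 100.0;                             // assumed sequential transfer rate
        double datasetMB = 1000000.0;                            // assumed 1 TB dataset
        long records = (long) (datasetMB * 1000000 / 100);       // assumed 100-byte records
        long updated = records / 100;                            // update 1% of the records

        double randomHours = updated * seekMs / 1000 / 3600;     // seek once per updated record
        double streamHours = datasetMB / transferMBps / 3600;    // read/rewrite the whole dataset

        System.out.printf("Seeking to 1%% of the records one at a time: %.1f hours%n", randomHours);
        System.out.printf("Streaming through the entire dataset: %.1f hours%n", streamHours);
    }
}
Under these assumptions the seek-driven update takes a couple of hundred hours while a full sequential pass takes a few hours, which is why batch analysis favors streaming the whole dataset over random access.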
MapReduce is a programming model for data processing. MapReduce works by breaking the processing into two phases: the map phase and the reduce phase. Each phase has key-value pairs as input and output, the types of which may be chosen by the programmer. The programmer also specifies two functions: the map function and the reduce function. (quoted from: Map Reduce)
The map phase processes each line of input and produces <key, value> pairs; the shuffle phase then groups the data by key into <key, Iterable<value>>; finally, the reduce phase processes each key together with its group of values.
MapReduce programs run on two kinds of nodes: a jobtracker and a number of tasktrackers. The jobtracker coordinates and schedules jobs, divides them into tasks that run on different machines, and keeps a record of the overall progress of each job. The tasktrackers run the tasks.
Hadoop divides the input to a MapReduce job into fixed-size pieces called input splits, or just splits. Hadoop creates one map task for each split, which runs the user-defined map function for each record in the split. Hadoop does its best to run the map task on a node where the input data resides in HDFS. This is called the data locality optimization.
When there are multiple reducers, the map tasks partition their output, each creating one partition for each reduce task. There can be many keys (and their associated values) in each partition, but the records for every key are all in a single partition. The partitioning can be controlled by a user-defined partitioning function, but normally the default partitioner—which buckets keys using a hash function—works very well. (quoted from: Map Reduce)
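For reference, the default hash partitioning described in that passage amounts to something like the sketch below; the class name is made up, and the body simply mirrors the behavior of the stock HashPartitioner.
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Partitioner;

public class YearHashPartitioner extends Partitioner<Text, IntWritable> {
    @Override
    public int getPartition(Text key, IntWritable value, int numPartitions) {
        // mask off the sign bit, then bucket by hash so that every record
        // with the same key lands in the same reduce partition
        return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }
}
A custom partitioner like this would be registered on the job with job.setPartitionerClass(...).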
The shuffle in MapReduce
The shuffle proceeds as follows: it covers both sorting the output on the mapper side and merging the data on the reducer side. When a mapper writes output, it actually writes into a circular memory buffer, 100 MB by default; once the buffer is more than 80% full, the mapper starts spilling the data to disk (in a round-robin fashion). Before writing, if there are multiple reducers, the data is divided into partitions, one per reducer. Before the mapper finishes, the data spilled to disk is merged and sorted within each partition. If a combiner is configured, it runs once a certain number of spill files have accumulated. After all the data has been spilled, merged, and sorted, the partitions become available over HTTP. The mapper then notifies the jobtracker that it has completed its task. For a given job, the jobtracker knows the mapping between map outputs and tasktrackers; each reducer periodically asks the jobtracker for map output locations until it has retrieved them all.
The reducer then copies all of the map outputs in parallel, merging and sorting the data as it is written to its local disk.
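The buffer size and spill threshold mentioned above correspond to configuration properties. The sketch below uses the pre-YARN (Hadoop 1.x era) property names, which is an assumption worth checking against the release you run; later releases renamed them (e.g. mapreduce.task.io.sort.mb).
import org.apache.hadoop.conf.Configuration;

public class ShuffleTuning {
    public static void main(String[] args) {
        Configuration conf = new Configuration();
        conf.setInt("io.sort.mb", 200);                  // map-side sort buffer size, MB (default 100)
        conf.setFloat("io.sort.spill.percent", 0.90f);   // start spilling at 90% full (default 0.80)
        conf.setInt("min.num.spills.for.combine", 3);    // run the combiner when at least 3 spill files exist
        // pass conf to the Job when constructing it so these settings take effect
    }
}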
A sample implementation of the minimum-temperature MapReduce job:
package org.apache.hadoop.book.ch02.min;

import java.io.IOException;

import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class NewMinTemperature {

    // Mapper: parse one NCDC record per line and emit (year, temperature)
    static class NewMinTemperatureMapper
            extends Mapper<LongWritable, Text, Text, IntWritable> {

        private static final int MISSING = 9999; // NCDC code for a missing reading

        @Override
        public void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            String line = value.toString();
            String year = line.substring(15, 19);
            int airTemperature;
            if (line.charAt(87) == '+') {
                // parseInt doesn't accept a leading plus sign, so skip it
                airTemperature = Integer.parseInt(line.substring(88, 92));
            } else {
                airTemperature = Integer.parseInt(line.substring(87, 92));
            }
            String quality = line.substring(92, 93);
            if (airTemperature != MISSING && quality.matches("[01459]")) {
                context.write(new Text(year), new IntWritable(airTemperature));
            }
        }
    }

    // Reducer: take the minimum of all temperatures seen for a year
    static class NewMinTemperatureReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {

        @Override
        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int minValue = Integer.MAX_VALUE;
            for (IntWritable value : values) {
                minValue = Math.min(minValue, value.get());
            }
            context.write(key, new IntWritable(minValue));
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("Usage: Min Temperature <input path> <output path>");
            System.exit(-1);
        }

        Job job = new Job(); // pre-2.x API; newer code would use Job.getInstance(conf)
        job.setJarByClass(NewMinTemperature.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        job.setMapperClass(NewMinTemperatureMapper.class);
        // Optional: the reducer can double as a combiner, since min is associative and commutative
        job.setCombinerClass(NewMinTemperatureReducer.class);
        job.setReducerClass(NewMinTemperatureReducer.class);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
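Assuming the class is packaged into a jar (the jar name below is hypothetical), the job can be launched with: hadoop jar min-temperature.jar org.apache.hadoop.book.ch02.min.NewMinTemperature <input path> <output path>. The output directory must not already exist, or FileOutputFormat will refuse to run the job.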
0 found useful  [account deleted]  2012-08-05
If Hadoop is lame, it must be Java's fault!
0 found useful  Luffy Lee  2011-11-18
Read this a year ago and forgot to update my status.
0 found useful  yang_bigarm  2011-08-08
An authoritative work.
0 found useful  akito  2012-07-26
Read the HDFS parts because I needed them for a presentation.
0 found useful  袜落  2012-03-20
The most salacious technical book I've ever read, even though the configuration covered in the third edition is already out of date.
0 found useful  Валия  2020-06-28
Third edition.
0 found useful  herihe  2020-06-11
Can the English and Chinese editions be rated separately? The end of an era.
0 found useful  memex  2020-03-29
Read it around 2013; what I learned ended up being useful in my later work.
0 found useful  ren  2019-11-29
Roughly skimmed it. The open-source implementation of Google's three foundational papers, explained in more detail than the papers themselves.
0 found useful  对我就是那个谁  2019-08-02
An encyclopedic reference. You will learn a lot more by reading it alongside the Hadoop source code.