Name: Designing Data-Intensive Applications
ISBN: 9781449373320

Author: Martin Kleppmann
Publisher: O'Reilly Media
Subtitle: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Publication date: 2017-4-2
Pages: 614
Price: USD 44.99
Binding: Paperback

Douban rating: 9.8 (1176 ratings)

5 stars: 87.9%
4 stars: 10.8%
3 stars: 1.3%
2 stars: 0.0%
1 star: 0.0%


Description · · · · · ·

Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your appl...


About the Author · · · · · ·

Martin is a researcher in distributed systems at the University of Cambridge. Previously he was a software engineer and entrepreneur at Internet companies including LinkedIn and Rapportive, where he worked on large-scale data infrastructure. In the process he learned a few things the hard way, and he hopes this book will save you from repeating the same mistakes.


Excerpts from the Book · · · · · ·

UDP is a good choice in situations where delayed data is worthless. For example, in a VoIP phone call, there probably isn’t enough time to retransmit a lost packet before its data is due to be played over the loudspeakers. In this case, there’s no point in retransmitting the packet—the application must instead fill the missing packet’s time slot with silence (causing a brief interruption in the sound) and move on in the stream. The retry happens at the human layer instead. (“Could you repeat that please? The sound just cut out for a moment.”)

Jake · 2 likes · 2021-04-30 00:53:53

Quoted from page 283
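
The excerpt's idea of filling a lost packet's time slot with silence can be sketched roughly as follows. This is only an illustrative sketch: the function `play_out`, the sequence-number scheme, and the tiny `FRAME_SAMPLES` frame size are assumptions made for the example, not anything taken from the book or from a real VoIP stack.

```python
from typing import Dict, List

FRAME_SAMPLES = 4                # hypothetical frame size, kept tiny for illustration
SILENCE = [0] * FRAME_SAMPLES    # one frame of silent samples


def play_out(received: Dict[int, List[int]], first_seq: int, last_seq: int) -> List[int]:
    """Build the playback stream slot by slot, never waiting for a retransmit."""
    stream: List[int] = []
    for seq in range(first_seq, last_seq + 1):
        # A packet that did not arrive in time is replaced with silence.
        stream.extend(received.get(seq, SILENCE))
    return stream


# Packet 2 was lost in transit, so its time slot is filled with zeros.
received = {1: [5, 6, 7, 8], 3: [9, 10, 11, 12]}
audio = play_out(received, first_seq=1, last_seq=3)
print(audio)
```

The receiver moves on through the stream without requesting the missing packet again, which is the trade-off the excerpt describes.
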
For data warehouse queries that need to scan over millions of rows, a big bottleneck is the bandwidth for getting data from disk into memory. However, that is not the only bottleneck. Developers of analytical databases also worry about efficiently using the bandwidth from main memory into the CPU cache, avoiding branch mispredictions and bubbles in the CPU instruction processing pipeline, and making use of single-instruction-multi-data (SIMD) instructions in modern CPUs. Besides reducing the volume of data that needs to be loaded from disk, column-oriented storage layouts are also good for making efficient use of CPU cycles. For example, the query engine can take a chunk of compressed column data that fits comfortably in the CPU’s L1 cache and iterate through it in a tight loop (that is, w...

无心 · 1 like · 2019-11-09 08:17:33

Quoted from page 99
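
The column-oriented layout mentioned in the excerpt can be pictured with a small sketch. The table contents and the names `rows`, `columns`, and `total_quantity` are invented for illustration. Python cannot demonstrate the cache or SIMD effects the excerpt talks about, so the sketch shows only the layout: an aggregate over one column touches a single contiguous array instead of every whole record.

```python
from array import array

# Row-oriented: each record keeps all of its fields together.
rows = [
    {"product_id": 1, "quantity": 3, "price": 250},
    {"product_id": 2, "quantity": 1, "price": 990},
    {"product_id": 1, "quantity": 2, "price": 250},
]

# Column-oriented: one contiguous array of fixed-width integers per column.
columns = {
    name: array("q", (row[name] for row in rows))
    for name in ("product_id", "quantity", "price")
}


def total_quantity(cols):
    """Scan only the 'quantity' column in a simple loop."""
    total = 0
    for q in cols["quantity"]:
        total += q
    return total


result = total_quantity(columns)
print(result)
```
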


E-books also liked by readers of "Designing Data-Intensive Applications" · · · · · ·

Readable on Web, iPhone, iPad, and Android readers

性能之巅, 29.80 yuan

SRE：Google运维解密, 27.00 yuan

Python源码剖析, 38.39 yuan

CoffeeScript小书, 1.99 yuan

MacTalk·人生元编程, 2.99 yuan

Books also liked by readers of "Designing Data-Intensive Applications" (with Douban ratings) · · · · · ·

Streaming Systems 9.0

Clean Code 8.8

Kubernetes in Action 9.4

Software Engineering at Google 8.7

A Philosophy of Software Design 9.2

The Go Programming Language 9.4

Microservice Patterns 9.1

Building Microservices 7.9

Effective Modern C++ 9.5

Fundamentals of Software Architec... 8.5


Short Comments · · · · · · (344 in total)

Reviews of Designing Data-Intensive Applications · · · · · · (47 in total)


思寇特牌搬砖工 2017-11-17 13:42:02

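Required reading for newcomers to the data processing industry

I started reading this book around the National Day holiday. In between came overtime, on-call duty for Double 11, my own sick leave, leave to look after my sick wife, and leave to look after my sick child. Reading on and off, I have now finally gone through it from cover to cover, which really was not easy. The author is one of the rare experts who moved from industry into academia. His breadth of knowledge is astonishing, and he is good at reasoning from one case to another and linking topics together, for example...

109

2 · 7 replies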

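本赖克 2019-07-19 20:10:21 Southeast University Press, 2017 edition

Also known as "We're Both CRUD Boys, So Why Does He Do It Better Than Me?"

This review may contain spoilers

First of all, this book does not introduce any new technology; much of the content is already familiar to us. Nor does it explain the details of any one specific technology, so you should not expect to become an expert in something after reading it. The value of the book lies, on the one hand, in its encyclopedic breadth, covering technical terms everyone has heard of: NoSQL, big data, eventual consistency, CAP, MapReduce, stream processing...

4 replies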

姚钢强 2019-01-20 14:15:38

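Notes 02 - Streaming architecture: applying database techniques to application architecture

This review may contain spoilers

The cover image comes from Event Sourcing pattern - Cloud Design Patterns. The content mainly comes from Turning the database inside-out, Materialized View pattern, and "The Future of Data Systems" in Designing Data-Intensive Applications. This article is only a translation and summary; if you are interested, be sure to go to the original,...

5 replies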

星野君 2019-03-18 13:21:00

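The book that launches an excellent programmer's career

Why this title? In Wu Jun's <硅谷来信> (Letters from Silicon Valley) I read about the several levels of engineers. Interested readers can look it up for themselves. After a year or two of work, most coders can call APIs, write RESTful APIs, and do CRUD fluently. But I often stop and ask myself: is that enough? Is that enough to support the qualities an excellent programmer should have? Until...

2 replies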

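瞬光 2019-08-08 16:11:30 China Electric Power Press, 2018 edition

The last chapter elevates the whole book

Martin Kleppmann is not only a formidable programmer but a formidable programmer with a strong sense of social responsibility and humanistic concern, which is rarer and more valuable still. Martial artists hold that one must cultivate virtue before learning to fight, and the same is true of Martin Kleppmann. He uses the first eleven chapters to teach us how to process massive amounts of data, and the last chapter to tell us how to use data properly. We must protect user privacy; toward our own...

1 reply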

陈原 2021-05-06 13:48:57

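Data systems have no consciousness, but people should have warmth

This book is one of the textbooks for my distributed systems course. I began reading it mid-semester and continued on and off until now, at the end of the semester, so I have read most of it. The depth of the author's knowledge is evident from the writing: he explains the essence of each technology, its use cases, and the trade-offs between different solutions in a way that is both thorough and accessible. In Part I, Chapter 1 introduces the criteria for evaluating distributed systems: Reliability, ...

1 reply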

姚钢强 2018-12-22 18:40:01

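Notes 01 - Serialization protocols (JSON, Thrift, Avro)

This review may contain spoilers

Disclaimer: everything in this post comes from Designing Data-Intensive Applications. These are just notes organizing the parts I found interesting, and I strongly recommend reading the original book. A good system is evolvable, that is, it can be changed easily and has good forward and backward compatibility. For system design, the choice of serialization protocol is especially important. How to understand compatibility: backward compatibility: new code can correctly...

1 reply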
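
The review excerpt above is cut off mid-definition, so the following sketch relies on the usual meaning of the terms rather than on the reviewer's wording: backward compatibility means newer code can read data written by older code, and forward compatibility means older code can read data written by newer code. The sketch uses plain JSON, and the field names `user_name` and `nickname` and the functions `decode_v1` and `decode_v2` are hypothetical.

```python
import json


def decode_v1(payload: str) -> dict:
    """Old reader: knows only user_name and ignores fields it does not recognise."""
    record = json.loads(payload)
    return {"user_name": record["user_name"]}


def decode_v2(payload: str) -> dict:
    """New reader: also knows nickname, and supplies a default when it is absent."""
    record = json.loads(payload)
    return {
        "user_name": record["user_name"],
        "nickname": record.get("nickname", ""),
    }


old_data = json.dumps({"user_name": "Martin"})                    # written by old code
new_data = json.dumps({"user_name": "Martin", "nickname": "mk"})  # written by new code

backward = decode_v2(old_data)  # new code reading old data
forward = decode_v1(new_data)   # old code reading new data
print(backward, forward)
```

Both directions work here only because the added field is optional with a default and the old reader tolerates unknown fields. Making the new field mandatory would break the backward-compatible read.
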