Machine learning systems are both complex and unique. They are complex because they consist of many different components and involve many different stakeholders. They are unique because they are data-dependent, and data varies wildly from one use case to the next.
This book takes a holistic approach to designing machine learning systems that are reliable, scalable, maintainable, and adaptive to changing environments and business requirements. It considers each design decision — e.g. how to create training data, what features to include, how to deploy, what to monitor, how often to retrain your model — in the context of how it can help the system as a whole achieve its objectives. The iterative framework laid out in this book is illustrated using actual case studies and backed by ample references.
Examples of the scenarios that this book will be able to help you tackle.
You have been given a business problem and a lot of raw data. You want to engineer this data and choose the right metrics to solve this problem.
Your initial models perform well in offline experiments and you want to deploy them.
You have little feedback on how your models are performing after your models are deployed, and you want to figure out a way to quickly detect, debug, and address any issue your models might run into in production.
The process of developing, evaluating, deploying, and updating models for your team has been mostly manual, slow, and error-prone. You want to automate and improve this process.
Each machine learning use case in your organization has been deployed using its own workflow, and you want to lay down the foundation (e.g. model store, feature store, monitoring tools) that can be shared and reused across use cases.
You're worried that there might be biases in your machine learning systems and you want to make your systems responsible!
Read less
0 有用 Lillian 2022-09-22 13:43:35 新加坡
不是讲机器学习算法的机器学习书。覆盖内容广,有很多作者自己的踩坑经历。
0 有用 1Feng 2022-12-10 20:43:39 北京
简单翻了几章,比较易懂,逻辑性很强。作者是真的了解这个领域实践中痛点的人。具体到一些细节上,文中经常提供了一些干系人/利益方的视角分析,对于一些问题的解释会有种豁然开朗的感觉。另外,援引资料也非常丰富,非常新,感觉会火
6 有用 massquantity 2022-11-06 23:44:17 上海
覆盖面很广,但大多泛泛而谈。每章的标题都很吸引人,但看完一章感觉像什么都没说一样。
0 有用 聪 2022-12-24 10:04:29 英国
覆盖面很广,不过有些泛泛而谈还有很多 open questions。拓宽了知识面,但用来指导实践还有一点距离,可能需要细读引用的论文和开源项目。读到 MLOps 一章感觉这并不是作者最擅长的领域,讲了很多 principle 但没有什么 practice。另外感慨天天觉得公司代码乱,居然样样业界前沿,几乎每个领域都有人在做了。
1 有用 流光 2023-02-22 01:33:14 美国
作者写得还是挺简单易懂的,但是整本书只是走马观花地把machine learning system介绍一遍。当然这其中也有这个field才刚刚开始发展的原因,各个技术细节虽有提及,但基本都是一笔掠过。面试前读读复习一下挺好的。