可靠的機器學習(影印版)
出版時間:2023年03月
頁數(shù):376
“在將基于機器學習的真實系統(tǒng)投入部署之前,你便能從閱讀本書中受益。放心吧,書中的內容來自數(shù)十年間來之不易的經驗。”
——Andrew Moore
Google Cloud AI副總裁兼總經理
無論你是小型創(chuàng)業(yè)公司還是跨國公司的一員,這本實踐用書都為你(數(shù)據科學家、軟件和網站可靠性工程師、產品經理或企業(yè)主)展示了如何在組織內可靠、有效和負責地運行和建立機器學習。你將深入了解其中涉及的方方面面,從如何在生產中進行模型監(jiān)控到如何在產品組織中運營一個完善的模型開發(fā)團隊。
通過將SRE思維應用于機器學習,作為本書作者和工程專業(yè)人士的Cathy Chen、Kranti Parisa、Niall Richard Murphy、D. Sculley、Todd Underwood以及特邀作者向你展示了如何運行高效可靠的機器學習系統(tǒng)。無論你是想增加收入、優(yōu)化決策、解決問題,還是想理解和影響客戶行為,你都將學到如何執(zhí)行日常的機器學習任務,同時保持更廣闊的視野。
本書內容包括:
● 什么是ML:運作方式以及依賴什么
● 用于理解機器學習“環(huán)路”如何工作的概念框架
● 有效的生產如何使機器學習系統(tǒng)易于監(jiān)控、部署和操作
● 為什么機器學習系統(tǒng)使生產故障排除更加困難,以及如何進行相應的補償
● 機器學習、產品和生產團隊如何有效溝通
- Foreword
- Preface
- 1. Introduction
- The ML Lifecycle
- Lessons from the Loop
- 2. Data Management Principles
- Data as Liability
- The Data Sensitivity of ML Pipelines
- Phases of Data
- Data Reliability
- Data Integrity
- Conclusion
- 3. Basic Introduction to Models
- What Is a Model?
- A Basic Model Creation Workflow
- Model Architecture Versus Model Definition Versus Trained Model
- Where Are the Vulnerabilities?
- Infrastructure and Pipelines
- A Set of Useful Questions to Ask About Any Model
- An Example ML System
- Conclusion
- 4. Feature and Training Data
- Features
- Labels
- Human-Generated Labels
- Metadata
- Data Privacy and Fairness
- Conclusion
- 5. Evaluating Model Validity and Quality
- Evaluating Model Validity
- Evaluating Model Quality
- Operationalizing Verification and Evaluation
- Conclusion
- 6. Fairness, Privacy, and Ethical ML Systems
- Fairness (a.k.a. Fighting Bias)
- Privacy
- Responsible AI
- Responsible AI Along the ML Pipeline
- Conclusion
- 7. Training Systems
- Requirements
- Basic Training System Implementation
- General Reliability Principles
- Common Training Reliability Problems
- Structural Reliability
- Conclusion
- 8. Serving
- Key Questions for Model Serving
- Model Serving Architectures
- Model API Design
- Serving for Accuracy or Resilience?
- Scaling
- Disaster Recovery
- Ethics and Fairness Considerations
- Conclusion
- 9. Monitoring and Observability for Models
- What Is Production Monitoring and Why Do It?
- Problems with ML Production Monitoring
- Best Practices for ML Model Monitoring
- Conclusion
- 10. Continuous ML
- Anatomy of a Continuous ML System
- Observations About Continuous ML Systems
- Continuous Organizations
- Rethinking Noncontinuous ML Systems
- Conclusion
- 11. Incident Response
- Incident Management Basics
- Anatomy of an ML-Centric Outage
- Terminology Reminder: Model
- Story Time
- ML Incident Management Principles
- Special Topics
- Conclusion
- 12. How Product and ML Interact
- Different Types of Products
- Agile ML?
- ML Product Development Phases
- Build Versus Buy
- Sample YarnIt Store Features Powered by ML
- Conclusion
- 13. Integrating ML into Your Organization
- Chapter Assumptions
- Significant Organizational Risks
- Implementation Models
- Organizational Design and Incentives
- Conclusion
- 14. Practical ML Org Implementation Examples
- Scenario 1: A New Centralized ML Team
- Scenario 2: Decentralized ML Infrastructure and Expertise
- Scenario 3: Hybrid with Centralized Infrastructure/Decentralized Modeling
- Conclusion
- 15. Case Studies: MLOps in Practice
- 1. Accommodating Privacy and Data Retention Policies in ML Pipelines
- 2. Continuous ML Model Impacting Traffic
- 3. Steel Inspection
- 4. NLP MLOps: Profiling and Staging Load Test
- 5. Ad Click Prediction: Databases Versus Reality
- 6. Testing and Measuring Dependencies in ML Workflow
- Index
書名:可靠的機器學習(影印版)
國內出版社:東南大學出版社
出版時間:2023年03月
頁數(shù):376
書號:978-7-5766-0552-5
原版書書名:Reliable Machine Learning
原版書出版商:O'Reilly Media
Cathy Chen
Cathy Chen曾在Google擔任技術項目經理、產品經理和工程經理。
Niall Richard Murphy
Niall Richard Murphy是Google網站可靠性工程組織里曾經和現(xiàn)任的成員,他們的職責是關注和維護Google的生產系統(tǒng)。
Kranti Parisa
Kranti Parisa是Dialpad的副總裁兼產品工程主管。
D. Sculley
D. Sculley是Kaggle的首席執(zhí)行官和Google第三方機器學習生態(tài)系統(tǒng)的總經理。
Todd Underwood
Todd Underwood是Google的高級主管以及機器學習SRE的創(chuàng)始人。
The insect on the cover of Reliable Machine Learning is the honeypot ant (Myrmecocystus mimicus). Honeypot ants are found in southwest North America and parts of Mexico.
Similar to other ants, honeypot ant colonies consist of a variety of worker ants who scavenge food from flowers, fruit, and other insects. What is most notable about honeypot ants is how they store food. The repletes—one type of worker ant in the colony—grow large abdomens that they use to store the liquid they scavenge. During times when food supply is low, the repletes regurgitate liquid for the rest of the colony to eat. Repletes have a hard time moving around because of the size of their abdomen, so they are often found hanging from the roof of their nest.