Learning Data Science (Reprint Edition)

By Sam Lau, Joseph Gonzalez, and Deborah Nolan

Publication date: March 2024

Pages: 594

“I truly wish this book had existed when we first used the term ‘data scientist’ to describe our work. If you want to work in data science or engineering, AI, or machine learning, this book is your starting point.”
DJ Patil, PhD
First U.S. Chief Data Scientist

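As an aspiring data scientist, you understand why organizations rely on data for their important decisions, whether they are companies designing websites, cities deciding how to improve services, or scientists working to stop the spread of disease. You need the skills to distill a messy pile of data into actionable insights. We call this the data science lifecycle: the process of collecting, wrangling, analyzing, and drawing conclusions from data.
This is the first book to cover the foundational skills of both programming and statistics across the entire data science lifecycle. It is aimed at readers who want to become data scientists or work with data scientists, and at data analysts who want to cross the “technical/nontechnical” divide. If you have basic Python programming knowledge, you will learn how to work with data using industry-standard tools such as pandas.
● Refine a question of interest into one that can be studied with data
● Carry out data collection, which may involve text processing, web scraping, and similar techniques
● Gain valuable insights through data cleaning, exploration, and visualization
● Learn how to use modeling to describe the data
● Generalize findings beyond the data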

Table of Contents

Preface
Part I. The Data Science Lifecycle
1. The Data Science Lifecycle
The Stages of the Lifecycle
Examples of the Lifecycle
Summary
2. Questions and Data Scope
Big Data and New Opportunities
Target Population, Access Frame, and Sample
Instruments and Protocols
Measuring Natural Phenomena
Accuracy
Summary
3. Simulation and Data Design
The Urn Model
Example: Simulating Election Poll Bias and Variance
Example: Simulating a Randomized Trial for a Vaccine
Example: Measuring Air Quality
Summary
4. Modeling with Summary Statistics
The Constant Model
Minimizing Loss
Summary
5. Case Study: Why Is My Bus Always Late?
Question and Scope
Data Wrangling
Exploring Bus Times
Modeling Wait Times
Summary
Part II. Rectangular Data
6. Working with Dataframes Using pandas
Subsetting
Aggregating
Joining
Transforming
How Are Dataframes Different from Other Data Representations?
Summary
7. Working with Relations Using SQL
Subsetting
Aggregating
Joining
Transforming and Common Table Expressions
Summary
Part III. Understanding the Data
8. Wrangling Files
Data Source Examples
File Formats
File Encoding
File Size
The Shell and Command-Line Tools
Table Shape and Granularity
Summary
9. Wrangling Dataframes
Example: Wrangling CO2 Measurements from the Mauna Loa Observatory
Quality Checks
Missing Values and Records
Transformations and Timestamps
Modifying Structure
Example: Wrangling Restaurant Safety Violations
Summary
10. Exploratory Data Analysis
Feature Types
What to Look For in a Distribution
What to Look For in a Relationship
Comparisons in Multivariate Settings
Guidelines for Exploration
Example: Sale Prices for Houses
Summary
11. Data Visualization
Choosing Scale to Reveal Structure
Smoothing and Aggregating Data
Facilitating Meaningful Comparisons
Incorporating the Data Design
Adding Context
Creating Plots Using plotly
Other Tools for Visualization
Summary
12. Case Study: How Accurate Are Air Quality Measurements?
Question, Design, and Scope
Finding Collocated Sensors
Wrangling and Cleaning AQS Sensor Data
Wrangling PurpleAir Sensor Data
Exploring PurpleAir and AQS Measurements
Creating a Model to Correct PurpleAir Measurements
Summary
Part IV. Other Data Sources
13. Working with Text
Examples of Text and Tasks
String Manipulation
Regular Expressions
Text Analysis
Summary
14. Data Exchange
NetCDF Data
JSON Data
HTTP
REST
XML, HTML, and XPath
Summary
Part V. Linear Modeling
15. Linear Models
Simple Linear Model
Example: A Simple Linear Model for Air Quality
Fitting the Simple Linear Model
Multiple Linear Model
Fitting the Multiple Linear Model
Example: Where Is the Land of Opportunity?
Feature Engineering for Numeric Measurements
Feature Engineering for Categorical Measurements
Summary
16. Model Selection
Overfitting
Train-Test Split
Cross-Validation
Regularization
Model Bias and Variance
Summary
17. Theory for Inference and Prediction
Distributions: Population, Empirical, Sampling
Basics of Hypothesis Testing
Bootstrapping for Inference
Basics of Confidence Intervals
Basics of Prediction Intervals
Probability for Inference and Prediction
Summary
18. Case Study: How to Weigh a Donkey
Donkey Study Question and Scope
Wrangling and Transforming
Exploring
Modeling a Donkey’s Weight
Summary
Part VI. Classification
19. Classification
Example: Wind-Damaged Trees
Modeling and Classification
Modeling Proportions (and Probabilities)
A Loss Function for the Logistic Model
From Probabilities to Classification
Summary
20. Numerical Optimization
Gradient Descent Basics
Minimizing Huber Loss
Convex and Differentiable Loss Functions
Variants of Gradient Descent
Summary
21. Case Study: Detecting Fake News
Question and Scope
Obtaining and Wrangling the Data
Exploring the Data
Modeling
Summary
Additional Material
Data Sources
Index

Title: Learning Data Science (Reprint Edition)

Authors: Sam Lau, Joseph Gonzalez, and Deborah Nolan

Domestic publisher: Southeast University Press

Publication date: March 2024

Pages: 594

ISBN: 978-1098113001

Original title: Learning Data Science

Original publisher: O'Reilly Media

Sam Lau

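Sam Lau is an assistant teaching professor at the Halicioglu Data Science Institute at the University of California, San Diego. Sam has ten years of teaching experience and has designed and taught top-tier data science courses at the University of California, Berkeley and the University of California, San Diego.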

Joseph Gonzalez

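Joey Gonzalez is an associate professor in the Department of Electrical Engineering and Computer Sciences at the University of California, Berkeley, a member of the Berkeley Artificial Intelligence Research group, and a founding member of the Berkeley RISE Lab. He also cofounded Turi Inc. and Aqueduct, which develop tools for data scientists.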

Deborah Nolan

Deborah Nolan is professor emerita of statistics and associate dean for student affairs in the College of Computing, Data Science, and Society at the University of California, Berkeley.

The animal on the cover of Learning Data Science is an edible dormouse (Glis glis). As you might suspect, these creatures have wound up in human cuisine. The edible dormouse was served grilled as a delicacy in ancient Rome and is still consumed today in Croatia and Slovenia. Edible dormice have squirrel-like bodies with small ears, short legs, large feet, and long, bushy tails. Their front feet have four digits and their hind feet have five. They are predominantly covered in gray to gray-brown fur with white underbellies. Their feet have naked soles that secrete a sticky substance that enables climbing.
These nocturnal creatures spend most of their time in trees. They can be found across Europe and in parts of western and central Asia. While the IUCN categorizes edible dormice as a species of Least Concern, they are threatened by illegal hunting and habitat loss. Many of the animals on O’Reilly covers are endangered; all of them are important to the world. The cover illustration is by Karen Montgomery, based on an antique line engraving from Lydekker’s Royal Natural History.

Purchase Options

List price: 169.00 yuan

Contact the publisher to order by mail
