BOOKPRICE.co.kr
Book price comparison site

[eBook Code] Applied Univariate, Bivariate, and Multivariate Statistics Using Python (eBook Code, 1st)

(A Beginner's Guide to Advanced Data Analysis)

Daniel J. Denis (author) | Wiley | 2021-04-21 | 179,240 won

New books

Bookstore | Discounted price | Discount rate | Shipping | Benefits/extras | Effective lowest price
Aladin | 143,390 won | -20% | 0 won | 0 won | 143,390 won

Note: the search results may include other books.

Book information

· Title : [eBook Code] Applied Univariate, Bivariate, and Multivariate Statistics Using Python (eBook Code, 1st) (A Beginner's Guide to Advanced Data Analysis)
· Category : Foreign books > Science/Math/Ecology > Mathematics > Probability and Statistics > Multivariate Analysis
· ISBN : 9781119578178
· Pages : 304

Table of contents

Preface xii

1 A Brief Introduction and Overview of Applied Statistics 1

1.1 How Statistical Inference Works 4

1.2 Statistics and Decision-Making 7

1.3 Quantifying Error Rates in Decision-Making: Type I and Type II Errors 8

1.4 Estimation of Parameters 9

1.5 Essential Philosophical Principles for Applied Statistics 11

1.6 Continuous vs. Discrete Variables 13

1.6.1 Continuity Is Not Always Clear-Cut 15

1.7 Using Abstract Systems to Describe Physical Phenomena: Understanding Numerical vs. Physical Differences 16

1.8 Data Analysis, Data Science, Machine Learning, Big Data 18

1.9 “Training” and “Testing” Models: What “Statistical Learning” Means in the Age of Machine Learning and Data Science 20

1.10 Where We Are Going From Here: How to Use This Book 22

Review Exercises 23

2 Introduction to Python and the Field of Computational Statistics 25

2.1 The Importance of Specializing in Statistics and Research, Not Python: Advice for Prioritizing Your Hierarchy 26

2.2 How to Obtain Python 28

2.3 Python Packages 29

2.4 Installing a New Package in Python 31

2.5 Computing z-Scores in Python 32

2.6 Building a Dataframe in Python: And Computing Some Statistical Functions 35

2.7 Importing a .txt or .csv File 38

2.8 Loading Data into Python 39

2.9 Creating Random Data in Python 40

2.10 Exploring Mathematics in Python 40

2.11 Linear and Matrix Algebra in Python: Mechanics of Statistical Analyses 41

2.11.1 Operations on Matrices 44

2.11.2 Eigenvalues and Eigenvectors 47

Review Exercises 48

3 Visualization in Python: Introduction to Graphs and Plots 50

3.1 Aim for Simplicity and Clarity in Tables and Graphs: Complexity is for Fools! 52

3.2 State Population Change Data 54

3.3 What Do the Numbers Tell Us? Clues to Substantive Theory 56

3.4 The Scatterplot 58

3.5 Correlograms 59

3.6 Histograms and Bar Graphs 61

3.7 Plotting Side-by-Side Histograms 62

3.8 Bubble Plots 63

3.9 Pie Plots 65

3.10 Heatmaps 66

3.11 Line Charts 68

3.12 Closing Thoughts 69

Review Exercises 70

4 Simple Statistical Techniques for Univariate and Bivariate Analyses 72

4.1 Pearson Product-Moment Correlation 73

4.2 A Pearson Correlation Does Not (Necessarily) Imply Zero Relationship 75

4.3 Spearman’s Rho 76

4.4 More General Comments on Correlation: Don’t Let a Correlation Impress You Too Much! 79

4.5 Computing Correlation in Python 80

4.6 T-Tests for Comparing Means 84

4.7 Paired-Samples t-Test in Python 88

4.8 Binomial Test 90

4.9 The Chi-Squared Distribution and Goodness-of-Fit Test 91

4.10 Contingency Tables 93

Review Exercises 94

5 Power, Effect Size, P-Values, and Estimating Required Sample Size Using Python 96

5.1 What Determines the Size of a P-Value? 96

5.2 How P-Values Are a Function of Sample Size 99

5.3 What is Effect Size? 100

5.4 Understanding Population Variability in the Context of Experimental Design 102

5.5 Where Does Power Fit into All of This? 103

5.6 Can You Have Too Much Power? Can a Sample Be Too Large? 104

5.7 Demonstrating Power Principles in Python: Estimating Power or Sample Size 106

5.8 Demonstrating the Influence of Effect Size 108

5.9 The Influence of Significance Levels on Statistical Power 108

5.10 What About Power and Hypothesis Testing in the Age of “Big Data”? 110

5.11 Concluding Comments on Power, Effect Size, and Significance Testing 111

Review Exercises 112

6 Analysis of Variance 113

6.1 T-Tests for Means as a “Special Case” of ANOVA 114

6.2 Why Not Do Several t-Tests? 116

6.3 Understanding ANOVA Through an Example 117

6.4 Evaluating Assumptions in ANOVA 121

6.5 ANOVA in Python 124

6.6 Effect Size for Teacher 125

6.7 Post-Hoc Tests Following the ANOVA F-Test 125

6.8 A Myriad of Post-Hoc Tests 127

6.9 Factorial ANOVA 129

6.10 Statistical Interactions 131

6.11 Interactions in the Sample Are a Virtual Guarantee: Interactions in the Population Are Not 133

6.12 Modeling the Interaction Term 133

6.13 Plotting Residuals 134

6.14 Randomized Block Designs and Repeated Measures 135

6.15 Nonparametric Alternatives 138

6.15.1 Revisiting What “Satisfying Assumptions” Means: A Brief Discussion and Suggestion of How to Approach the Decision Regarding Nonparametrics 140

6.15.2 Your Experience in the Area Counts 140

6.15.3 What If Assumptions Are Truly Violated? 141

6.15.4 Mann-Whitney U Test 144

6.15.5 Kruskal-Wallis Test as a Nonparametric Alternative to ANOVA 145

Review Exercises 147

7 Simple and Multiple Linear Regression 148

7.1 Why Use Regression? 150

7.2 The Least-Squares Principle 152

7.3 Regression as a “New” Least-Squares Line 153

7.4 The Population Least-Squares Regression Line 154

7.5 How to Estimate Parameters in Regression 155

7.6 How to Assess Goodness of Fit? 157

7.7 R² – Coefficient of Determination 158

7.8 Adjusted R² 159

7.9 Regression in Python 161

7.10 Multiple Linear Regression 164

7.11 Defining the Multiple Regression Model 164

7.12 Model Specification Error 166

7.13 Multiple Regression in Python 167

7.14 Model-Building Strategies: Forward, Backward, Stepwise 168

7.15 Computer-Intensive “Algorithmic” Approaches 171

7.16 Which Approach Should You Adopt? 171

7.17 Concluding Remarks and Further Directions: Polynomial Regression 172

Review Exercises 174

8 Logistic Regression and the Generalized Linear Model 176

8.1 How Are Variables Best Measured? Are There Ideal Scales on Which a Construct Should Be Targeted? 178

8.2 The Generalized Linear Model 180

8.3 Logistic Regression for Binary Responses: A Special Subclass of the Generalized Linear Model 181

8.4 Logistic Regression in Python 184

8.5 Multiple Logistic Regression 188

8.5.1 A Model with Only Lag1 191

8.6 Further Directions 192

Review Exercises 192

9 Multivariate Analysis of Variance (MANOVA) and Discriminant Analysis 194

9.1 Why Technically Most Univariate Models are Actually Multivariate 195

9.2 Should I Be Running a Multivariate Model? 196

9.3 The Discriminant Function 198

9.4 Multivariate Tests of Significance: Why They Are Different from the F-Ratio 199

9.4.1 Wilks’ Lambda 200

9.4.2 Pillai’s Trace 201

9.4.3 Roy’s Largest Root 201

9.4.4 Lawley-Hotelling’s Trace 202

9.5 Which Multivariate Test to Use? 202

9.6 Performing MANOVA in Python 203

9.7 Effect Size for MANOVA 205

9.8 Linear Discriminant Function Analysis 205

9.9 How Many Discriminant Functions Does One Require? 207

9.10 Discriminant Analysis in Python: Binary Response 208

9.11 Another Example of Discriminant Analysis: Polytomous Classification 211

9.12 Bird’s Eye View of MANOVA, ANOVA, Discriminant Analysis, and Regression: A Partial Conceptual Unification 212

9.13 Models “Subsumed” Under the Canonical Correlation Framework 214

Review Exercises 216

10 Principal Components Analysis 218

10.1 What Is Principal Components Analysis? 218

10.2 Principal Components as Eigen Decomposition 221

10.3 PCA on Correlation Matrix 223

10.4 Why Icebergs Are Not Good Analogies for PCA 224

10.5 PCA in Python 226

10.6 Loadings in PCA: Making Substantive Sense Out of an Abstract Mathematical Entity 229

10.7 Naming Components Using Loadings: A Few Issues 230

10.8 Principal Components Analysis on USA Arrests Data 232

10.9 Plotting the Components 237

Review Exercises 240

11 Exploratory Factor Analysis 241

11.1 The Common Factor Analysis Model 242

11.2 Factor Analysis as a Reproduction of the Covariance Matrix 243

11.3 Observed vs. Latent Variables: Philosophical Considerations 244

11.4 So, Why is Factor Analysis Controversial? The Philosophical Pitfalls of Factor Analysis 247

11.5 Exploratory Factor Analysis in Python 248

11.6 Exploratory Factor Analysis on USA Arrests Data 250

Review Exercises 254

12 Cluster Analysis 255

12.1 Cluster Analysis vs. ANOVA vs. Discriminant Analysis 258

12.2 How Cluster Analysis Defines “Proximity” 259

12.2.1 Euclidean Distance 260

12.3 K-Means Clustering Algorithm 261

12.4 To Standardize or Not? 262

12.5 Cluster Analysis in Python 263

12.6 Hierarchical Clustering 266

12.7 Hierarchical Clustering in Python 268

Review Exercises 272

References 273

Index 276

Book database provided by Aladin (www.aladin.co.kr)