책 이미지
책 정보
· 분류 : 외국도서 > 컴퓨터 > 데이터베이스 관리 > 데이터 마이닝
· ISBN : 9781118607558
· 쪽수 : 408쪽
· 출판일 : 2014-04-14
목차
Introduction 1
Part I: Getting Started with Hadoop 7
Chapter 1: Introducing Hadoop and Seeing What It’s Good For 9
Chapter 2: Common Use Cases for Big Data in Hadoop 23
Chapter 3: Setting Up Your Hadoop Environment 41
Part II: How Hadoop Works 51
Chapter 4: Storing Data in Hadoop: The Hadoop Distributed File System 53
Chapter 5: Reading and Writing Data 69
Chapter 6: MapReduce Programming 83
Chapter 7: Frameworks for Processing Data in Hadoop: YARN and MapReduce 103
Chapter 8: Pig: Hadoop Programming Made Easier 117
Chapter 9: Statistical Analysis in Hadoop 129
Chapter 10: Developing and Scheduling Application Workflows with Oozie 139
Part III: Hadoop and Structured Data 155
Chapter 11: Hadoop and the Data Warehouse: Friends or Foes? 157
Chapter 12: Extremely Big Tables: Storing Data in HBase 179
Chapter 13: Applying Structure to Hadoop Data with Hive 227
Chapter 14: Integrating Hadoop with Relational Databases Using Sqoop 269
Chapter 15: The Holy Grail: Native SQL Access to Hadoop Data 303
Part IV: Administering and Configuring Hadoop 313
Chapter 16: Deploying Hadoop 315
Chapter 17: Administering Your Hadoop Cluster 335
Part V: The Part of Tens 359
Chapter 18: Ten Hadoop Resources Worthy of a Bookmark 361
Chapter 19: Ten Reasons to Adopt Hadoop 371
Index 379















