책 이미지
책 정보
· 분류 : 외국도서 > 컴퓨터 > 데이터베이스 관리 > 일반
· ISBN : 9781484221983
· 쪽수 : 421쪽
· 출판일 : 2016-10-01
목차
Introduction 1. HDFS and MapReduce Hadoop Distributed FileSystem MapReduce Frameworks Setting the Environment Hadoop Cluster Modes Running a MapReduce Job with MR1 Framework Running MR1 in Standalone Mode Running MR1 in Psuedo-Distributed Mode Running MapReduce with Yarn Framework Running YARN in Psuedo-Distributed Mode Running Hadoop Streaming Section II Storing & Querying 2. Apache Hive Setting the Environment Configuring Hadoop Configuring Hive Starting HDFS Starting the Hive Server Starting the Hive CLI Creating a Database Using a Database Creating a Managed Table Loading Data into a Table Creating a table using LIKE Adding Data with INSERT INTO TABLE Adding Data with INSERT OVERWRITECreating Table using AS SELECT Altering a Table Truncating a Table Dropping a Table Creating an External Table 3. Apache HBase Setting the Environment Configuring Hadoop Configuring HBase Configuring Hive Starting HBase Starting HBase Shell Creating a HBase Table Adding Data To HBase Table Listing All Tables Getting a Row of Data Scanning a Table Counting Number of Rows in a Table Altering a Table Deleting a Row Deleting a Column Disabling and Enabling a Table Truncating a Table Dropping a Table Finding if a Table exists Creating a Hive External Table Section III Bulk Transferring & Streaming 4. Apache Sqoop Installing MySQL Database Creating MySQL Database Tables Setting the Environment Configuring Hadoop Starting HDFS Configuring Hive Configuring HBase Importing into HDFS Exporting from HDFS Importing into Hive Importing into HBase 5. Apache Flume Setting the Environment Configuring Hadoop Configuring HBase Starting HDFS Configuring Flume Running a Flume Agent Configuring Flume for HBase Sink Streaming MySQL Log to HBase Sink Section IV Serializing 6. Apache Avro Setting the Environment Creating an Avro Schema Creating a Hive Managed Table Creating a Hive (version prior to 0.14) External Table Stored as Avro< Creating a Hive (version 0.14 and later) External Table Stored as Avro Transferring MySQL Table Data as Avro Data File with Sqoop 7. Apache Parquet Setting the Environment Creating a Oracle Database Table Exporting Oracle Database to a CSV File Importing the CSV File in MongoDB Exporting MongoDB Document as CSV File Importing a CSV File to Oracle Database Section V Messaging & Indexing 8. Apache Kafka Setting the Environment Starting the Kafka Server Creating a Topic Starting a Kafka Producer Starting a Kafka Consumer Producing and Consuming Messages Streaming Log Data to Apache Kafka with Apache Flume Setting the Environment Creating Kafka Topics Configuring Flume< Running Flume Agent Consuming Log Data as Kafka Messages 9. Apache Solr Setting the Environment Configuring the Solr Schema Starting the Solr Server Indexing a Document in Solr Deleting a Document from Solr Indexing a Document in Solr with Java Client Searching a Document in Solr Creating a Hive Managed Table Creating a Hive External Table Loading Hive External Table Data Searching Hive Table Data Indexed in Solr Section VI Machine Learning 10.Apache Mahout Setting the Environment Starting HDFS Setting the Mahout Environment Running a Mahout Classification Sample Running a Mahout Clustering Sample Developing a User Based Recommender System The Sample Data Setting the Environment Creating a Maven Project in Eclipse Creating a User Based Recommender Creating a Recommender Evaluator Running the Recommender Choosing a Recommender Type Choosing a User Similarity Measure Choosing a Neighborhood Type Choosing a Neighborhood Size for NearestNUserNeighborhood Choosing a Threshold for ThresholdUserNeighborhood Running the Evaluator Choosing the Split between Training Percentage and Test Percentage














