logo
logo
x
바코드검색
BOOKPRICE.co.kr
책, 도서 가격비교 사이트
바코드검색

인기 검색어

일간
|
주간
|
월간

실시간 검색어

검색가능 서점

도서목록 제공

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud (Paperback)

Robert Ilijason (지은이)
Apress
82,150원

일반도서

검색중
서점 할인가 할인률 배송비 혜택/추가 실질최저가 구매하기
67,360원 -18% 0원
3,370원
63,990원 >
yes24 로딩중
교보문고 로딩중
notice_icon 검색 결과 내에 다른 책이 포함되어 있을 수 있습니다.

중고도서

검색중
서점 유형 등록개수 최저가 구매하기
로딩중

eBook

검색중
서점 정가 할인가 마일리지 실질최저가 구매하기
로딩중

책 이미지

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud
eBook 미리보기

책 정보

· 제목 : Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud (Paperback) 
· 분류 : 외국도서 > 경제경영 > 산업 > 컴퓨터
· ISBN : 9781484257807
· 쪽수 : 274쪽
· 출판일 : 2020-06-12

목차

Chapter 1, Introduction to large scale data analytics.

Chapter goal: Reader should understand the data analytics and the workflow.

·         So what is data analysis?

·         The process of running a data analysis project.

·         Real world example.

·         What about data scientists?

Chapter 3, Distributed processing, Spark and Databricks.

Chapter goal: Reader should understand Spark on a high level and what Databricks is.

·         Computational history.

·         Scale up vs scale out.

·         Traditional analytics platforms.

·         The power of Spark.

·         The simplicity of Databricks.

Chapter 4, Getting started with Databricks.

Chapter goal: Reader should understand how to get a Databricks installation up and running.

·         A short introduction to Spark architecture

·         Setting up a cloud account.

·         Getting Databricks running.

·         Finally ? time to start Databricks.

Chapter 5, Workspaces, Clusters and Notebooks.

Chapter goal: Reader should understand how to find his or her way around the UI.

·         Finding your way around the user interface.

·         Starting the engine ? cluster creation.

·         A short note about checkboxes and configurations.

·         Picking the right notebook.

·         Keeping track of the workspace

Chapter 6, Getting data into Databricks.

Chapter goal: Reader should understand the many ways they can get data into Databricks.

·         Filesystems and data formats.

·         Working with schemas.

·         Importing Excel data.

·         Picking up information from the web.

·         Mounting the cloud data lake.

Chapter 7, Querying data using SQL.

Chapter goal: Reader should understand how to use SQL for looking and manipulating data.

·         Databases and tables in the Hive Metastore.

·         Pulling some data.

·         Joining, grouping and summarizing.

·         Views and procedures.

·         Hey ? what’s up with updates?

Chapter 8, Python (and a little bit of Scala and R).

Chapter goal: Reader should understand how to use Python for playing around with data.

·         An introduction to Dataframes.

·         Python vs SQL.

·         Working with data

·         But what about Scala and R?

Chapter 9, ETL and more advanced data wrangling.

Chapter goal: Reader should understand even more around manipulating data.

·         Stars and snowflakes.

·         Cleaning the data.

·         Speeding things up.

·         Working with partitions.

·         Setting parameters.

Chapter 10, Connecting from afar.

Chapter goal: Reader should understand how they can connect to Databricks from other tools.

·         Setting up ODBC and JDBC.

·         Getting to know the API:s.

·         Example: Connecting Power BI.

Chapter 11, Running in Production

Chapter goal: Reader should understand how to run and monitor jobs in production.

·         How to set up jobs.

·         Working with schedules.

·         Monitoring the jobs.

Chapter 12, Removing the training wheels.

Chapter goal: Reader should get to know the more advanced options.

·         Security in Databricks.

·         Machine learning using MLlib.

·         Going full ACID with Delta lake.

·         High speed streaming.

·         A deep dive into Spark architecture.

저자소개

Robert Ilijason (지은이)    정보 더보기
펼치기
이 포스팅은 쿠팡 파트너스 활동의 일환으로,
이에 따른 일정액의 수수료를 제공받습니다.
이 포스팅은 제휴마케팅이 포함된 광고로 커미션을 지급 받습니다.
도서 DB 제공 : 알라딘 서점(www.aladin.co.kr)
최근 본 책