이야기박스


Spark in action, 2nd edition study

박스님 2020. 7. 16. 10:36

The second edition of Spark in Action, which covers Spark 3.0, has been published, so I have started a study group on it.

 

I bought the book from Manning and am reading it as an e-book.

 

https://www.manning.com/books/spark-in-action-second-edition

 

Spark in Action, Second Edition

The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredi

www.manning.com

 

 

# Spark 3.0 release

https://spark.apache.org/releases/spark-release-3-0-0.html

 

Spark Release 3.0.0 | Apache Spark

Spark Release 3.0.0 Apache Spark 3.0.0 is the first release of the 3.x line. The vote passed on the 10th of June, 2020. This release is based on git tag v3.0.0 which includes all commits up to June 10. Apache Spark 3.0 builds on many of the innovations fro

spark.apache.org

 

## Languages

  • Python 3.x support (Python 2.x is deprecated as of 3.0)
  • Java 11 support
  • Redesigned pandas UDF API based on Python type hints
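
To make the pandas UDF item concrete, here is a minimal sketch of the type-hint style. The function name `celsius_to_fahrenheit`, the helper `run_with_spark`, and the sample column are my own illustration, not taken from the book, and the Spark part assumes `pyspark` 3.x and `pyarrow` are installed.

```python
import pandas as pd


def celsius_to_fahrenheit(celsius: pd.Series) -> pd.Series:
    """Series-to-Series function; the type hints describe the pandas UDF variant."""
    return celsius * 9.0 / 5.0 + 32.0


def run_with_spark() -> None:
    """Hypothetical helper: wraps the function above as a pandas UDF and applies it."""
    # Imported here so the plain function can be tried without a Spark installation.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, pandas_udf

    spark = (
        SparkSession.builder.master("local[1]")
        .appName("pandas-udf-sketch")
        .getOrCreate()
    )
    # The return type is given explicitly; the UDF kind is inferred from the type hints.
    to_fahrenheit = pandas_udf(celsius_to_fahrenheit, returnType="double")
    df = spark.createDataFrame([(0.0,), (100.0,)], ["celsius"])
    df.select(to_fahrenheit(col("celsius")).alias("fahrenheit")).show()
    spark.stop()
```

Calling `run_with_spark()` on a machine with Spark available should apply the function batch by batch; the undecorated function can also be checked directly with a plain `pd.Series`.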

 

## Environment

  • Hadoop 3 support

 

## Performance

  • Dynamic Partition Pruning (DPP) introduced
  • Spark SQL engine: Adaptive Query Execution (AQE) framework
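
Both performance features are controlled through Spark SQL configuration keys. The sketch below collects them in a small dictionary and renders `spark-submit` flags. The helper names `spark3_performance_conf` and `to_submit_args` are hypothetical, and as far as I know AQE is off by default in 3.0 while DPP is on, so only the AQE switch normally needs to be set.

```python
from typing import Dict, List


def spark3_performance_conf(enable_aqe: bool = True, enable_dpp: bool = True) -> Dict[str, str]:
    """Return the Spark SQL settings that toggle AQE and DPP (hypothetical helper)."""
    return {
        # Adaptive Query Execution: re-optimizes the plan using runtime statistics.
        "spark.sql.adaptive.enabled": str(enable_aqe).lower(),
        # Dynamic Partition Pruning: skips partitions based on the other side of a join.
        "spark.sql.optimizer.dynamicPartitionPruning.enabled": str(enable_dpp).lower(),
    }


def to_submit_args(conf: Dict[str, str]) -> List[str]:
    """Render a config dictionary as `--conf key=value` arguments for spark-submit."""
    args: List[str] = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    print(" ".join(to_submit_args(spark3_performance_conf())))
```

The same keys can be passed to `SparkSession.builder.config(key, value)` when building a session in code instead of on the command line.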