Spark in Action, 2nd Edition Study

Spark in Action, Second Edition, which covers Spark 3.0, has been published, so I have started a study group around it.
I bought the book from Manning and am reading it as an e-book.
https://www.manning.com/books/spark-in-action-second-edition
# Spark 3.0 release
https://spark.apache.org/releases/spark-release-3-0-0.html
From the release notes: Apache Spark 3.0.0 is the first release of the 3.x line. The vote passed on the 10th of June, 2020. This release is based on git tag v3.0.0, which includes all commits up to June 10.
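## Languages
- Python 3.x supported (Python 2.x support is deprecated)
- Java 11 supported
- Pandas UDF API (redesigned around Python type hints)
## Environment
- Hadoop 3 supported
## Performance
- Dynamic Partition Pruning (DPP) introduced
- Spark SQL engine: Adaptive Query Execution (AQE) framework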
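
As a rough sketch of how the two performance features above (Dynamic Partition Pruning and Adaptive Query Execution) are switched on, the snippet below collects their configuration keys and renders them as `spark-submit` flags. The keys `spark.sql.adaptive.enabled` and `spark.sql.optimizer.dynamicPartitionPruning.enabled` are real Spark SQL settings; the names `SPARK3_SQL_CONF` and `to_submit_flags` are hypothetical helpers made up for this note. As far as I know, in 3.0 AQE is off by default while Dynamic Partition Pruning is already on, so only the first key should actually change behavior.

```python
# Hypothetical helper names; only the configuration keys come from Spark itself.
SPARK3_SQL_CONF = {
    # Adaptive Query Execution: re-optimizes the plan at runtime using shuffle statistics.
    "spark.sql.adaptive.enabled": "true",
    # Dynamic Partition Pruning: skips partitions of a fact table based on
    # the filtered dimension side of a join.
    "spark.sql.optimizer.dynamicPartitionPruning.enabled": "true",
}


def to_submit_flags(conf):
    """Render a dict of Spark settings as a flat list of spark-submit arguments."""
    flags = []
    for key, value in sorted(conf.items()):
        flags += ["--conf", f"{key}={value}"]
    return flags


if __name__ == "__main__":
    # Prints the flags on one line so they can be pasted after `spark-submit`.
    print(" ".join(to_submit_flags(SPARK3_SQL_CONF)))
```

The same key/value pairs could also be passed one by one to `SparkSession.builder.config(key, value)` when building a session inside the application instead of on the command line.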