- Category: Computer
- Author: Srini Penchikala
- File type: PDF (104 pages)
Read and download free eBook intituled Big Data Processing with Apache Spark in format PDF (104 pages) created by Srini Penchikala.
Apache Spark is an open-source big-data processing framework built around speed, ease of use, and sophisticated analytics.
Spark has several advantages compared to other big-data and MapReduce technologies like Hadoop and Storm. It provides a comprehensive, unified framework with which to manage big-data processing requirements for datasets that are diverse in nature (text data, graph data, etc.) and that come from a variety of sources (batch versus real-time streaming data).
Spark enables applications in HDFS clusters to run up to a hundred times faster in memory and ten times faster even when running on disk.
Read and Download Links: