Learning spark lightning-fast big data analytics

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3 , this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through s...

Descripción completa

Detalles Bibliográficos
Otros Autores: Karau, Holden, author (author)
Formato: Libro electrónico
Idioma:Inglés
Publicado: Sebastopol, California : O'Reilly 2015.
Edición:First edition
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009629217906719
Tabla de Contenidos:
  • Introduction to data analysis with Spark
  • Downloading Spark and getting started
  • Programming with RDDs
  • Working with key/value pairs
  • Loading and saving your data
  • Advanced Spark programming
  • Running on a cluster
  • Tuning and debugging Spark
  • Spark SQL
  • Spark streaming
  • Machine learning with MLlib.