Data algorithms

Bibliographic Details
Other Authors: Parsian, Mahmoud, author
Format: E-book
Language: English
Published: Beijing, China : O'Reilly 2015.
Edition: 1st edition
Subjects:
View at Biblioteca Universitat Ramon Llull: https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009629802006719
Description
Summary: If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You’ll learn how to implement the appropriate MapReduce solution with code that you can use in your projects. Dr. Mahmoud Parsian covers basic design patterns, optimization techniques, and data mining and machine learning solutions for problems in bioinformatics, genomics, statistics, and social network analysis. This book also includes an overview of MapReduce, Hadoop, and Spark.
Notes: Description based upon print version of record.
Physical Description: 1 online resource (778 p.)
Bibliography: Includes bibliographical references and index.
ISBN: 9781491906132
9781491906170
9781491906156
9781491906187