Apache Spark 2.0

In March 2016 at Strata in San Jose, CA, a standing room only audience of excited developers heard the first public overview of the dramatic changes coming to Apache Spark. Listen and watch as Databrick presenters Michael Armbrust and Tathagata Das run through the breakthrough concepts and technolo...

Descripción completa

Detalles Bibliográficos
Autor Corporativo: Strata Conference and Hadoop World (-)
Otros Autores: O'Reilly Media, Inc., author (author)
Formato: Video
Idioma:Inglés
Publicado: O'Reilly Media, Inc 2016.
Edición:1st edition
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009631194606719
Descripción
Sumario:In March 2016 at Strata in San Jose, CA, a standing room only audience of excited developers heard the first public overview of the dramatic changes coming to Apache Spark. Listen and watch as Databrick presenters Michael Armbrust and Tathagata Das run through the breakthrough concepts and technologies driving the Structured Streaming capabilities in Spark 2.0. Get a first-look preview of the break-through changes coming to Structured Streaming in 2.0 Understand the unified input/out API that works with virtually any format (JSON, Parquet, etc.) Learn about Datasets – a new abstraction that eliminates large swaths of unnecessary code Get how 2.0 simplifies exploration of large data stores and ensures error-free production pipelines Learn about the Catalyst optimizer and Tungsten – new tools for efficient pipeline analysis Explore an end-to-end execution pipeline that allows difficult ad-hoc interactive queries and more Learn about streaming DataFrames and how they unify interactive analysis See a demo of Structured Streaming and learn why it was developed Michael Arbrust and Tathagata Das work at Databricks, the company founded by the creators of Apache Spark. Das is the lead developer behind Spark Streaming, which he started while a PhD student in the UC Berkeley AMPLab. Armbrust is the lead developer for Spark’s SQL project. He holds a PhD from Berkeley.
Notas:Title and publication information from resource description page (Safari, viewed May 24, 2016).
Descripción Física:1 online resource (1 video file, approximately 1 hr., 12 min.)