Productive and efficient data science with Python: with modularizing, memory profiles, and parallel/GPU processing

Bibliographic Details
Other Authors: Sarkar, Tirthajyoti (author)
Format: E-book
Language: English
Published: New York, NY : Apress [2022]
Edition: [First edition]
Subjects:
View at Biblioteca Universitat Ramon Llull: https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009669535106719
Description
Summary: This book focuses on Python-based tools and techniques that help you become highly productive in all aspects of a typical data science stack, such as statistical analysis, visualization, model selection, and feature engineering. You'll review the inefficiencies and bottlenecks lurking in daily business processes and solve them with practical solutions. Automating repetitive data science tasks is a key mindset promoted throughout the book. You'll learn how to extend existing coding practices to handle larger datasets efficiently with the help of advanced libraries and packages that already exist in the Python ecosystem. Topics include how to measure the memory footprint and execution speed of machine learning models, quality-test a data science pipeline, and modularize a data science pipeline for app development. You'll review Python libraries that come in handy for automating and speeding up day-to-day tasks. In the end, you'll understand and perform data science and machine learning tasks beyond the traditional methods and utilize the full spectrum of the Python data science ecosystem to increase productivity.
Notes: Includes index.
Physical Description: 1 online resource (395 pages)
ISBN: 9781484281215