Advanced R Programming

Overview Alternative Backends for R LiveLessons teaches R programmers techniques for dealing with large data, both in memory and in databases. Description In this video training Jared starts with some common data manipulation operations using various base R functions and packages like plyr, c...

Descripción completa

Detalles Bibliográficos
Otros Autores: Lander, Jared, author (author)
Formato: Video
Idioma:Inglés
Publicado: Addison-Wesley Professional 2015.
Edición:1st edition
Colección:LiveLessons
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009629600306719
Descripción
Sumario:Overview Alternative Backends for R LiveLessons teaches R programmers techniques for dealing with large data, both in memory and in databases. Description In this video training Jared starts with some common data manipulation operations using various base R functions and packages like plyr, comparing the speed of in memory calculations. He then demonstrates more advanced techniques for accomplishing the same task such as data.table, dplyr, Rcpp and parallel computation for increased speed. Finally, for when data size is an even bigger factor than speed he introduces external memory and database techniques using bibmemory, ff, SciDB, dplyr and Hadoop. About the Instructor Jared P. Lander is the Founder and CEO of Lander Analytics, the Organizer of the New York Open Statistical Programming Meetup and an Adjunct Professor of Statistics at Columbia University. With a masters from Columbia University in statistics and a bachelors from Muhlenberg College in mathematics, he has experience in both academic research and industry. Jared oversees the long-term direction of the company and acts as Lead Data Scientist, researching the best strategy, models and algorithms for modern data needs. This is in addition to his client-facing consulting and training. He specializes in data management, multilevel models, machine learning, generalized linear models, data management, visualization and statistical computing. He is the author of R for Everyone , a book about R Programming geared toward Data Scientists and Non-Statisticians alike. The book is available from Amazon, Barnes & Noble, and InformIT. The material is drawn from the classes he teaches at Columbia and is incorporated into his corporate training. Very active in the data community, Jared is a frequent speaker at conferences, universities and meetups around the world. He is a member of the 2014 Strata New York selection committee. Skill Level Intermediate Advanced What You Will Learn Basic Aggregation plyr dplyr data.table Rcpp Parallel Processing Code Benchmarking Who Should Take This Course R programmers who already have an intermediate level of knowledge such as that gained from Reading R for Everyone . Course Requirements Basic Programming Skills Proficiency in R, including working with packages Table of Contents Lesson 1: Reading XML Data 1.1.  Read HTML Table 1.2.  Use xpath for complex searches in HTML 1.3.  xmlToList for easier parsing Lesson 2: Faster...
Notas:Title from resource description page (viewed December 3, 2014)
Descripción Física:1 online resource (1 video file, approximately 3 hr., 18 min.)