Learning HBase: learn the fundamentals of HBase administration and development with the help of real-time scenarios

If you are an administrator or developer who wants to enter the world of Big Data and BigTables and would like to learn about HBase, this is the book for you.

Bibliographic Details
Other Authors: Shriparv, Shashwat (author)
Format: E-book
Language: English
Published: Birmingham, England : Packt Publishing 2014.
Edition: 1st edition
Series: Community experience distilled.
Subjects:
View at Biblioteca Universitat Ramon Llull: https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009628613906719
Table of Contents:
  • Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding the HBase Ecosystem; HBase layout on top of Hadoop; Comparing architectural differences between RDBMs and HBase; HBase features; HBase in the Hadoop ecosystem; Data representation in HBase; Hadoop; Core daemons of Hadoop; Comparing HBase with Hadoop; Comparing functional differences between RDBMs and HBase; Logical view of row-oriented databases; Logical view of column-oriented databases; Pros and cons of column-oriented databases
  • About the internal storage architecture of HBase; Getting started with HBase; When it started; HBase components and functionalities; ZooKeeper; Why an odd number of ZooKeepers?; HMaster; RegionServer; Client; Catalog tables; Who is using HBase and why?; When should we think of using HBase?; When not to use HBase; Understanding some open source HBase tools; The Hadoop-HBase version compatibility table; Applications of HBase; HBase pros and cons; Summary; Chapter 2: Let's Begin with HBase; Understanding HBase components in detail; HFile; Region
  • Scalability - understanding the scale up and scale out processes; Scale in; Scale out; Reading and writing cycle; Write-Ahead Logs; MemStore; HBase housekeeping; Compaction; Minor compaction; Major compaction; Region split; Region assignment; Region merge; RegionServer failovers; The HBase delete request; The reading and writing cycle; List of available HBase distributions; Prerequisites and capacity planning for HBase; The forward DNS resolution; The reverse DNS resolution; Java; SSH; Domain Name Server; Using Network Time Protocol to keep your node on time
  • OS-level changes and tuning up OS for HBase; Summary; Chapter 3: Let's Start Building It; Downloading Java on Ubuntu; Considering host configurations; Host file based; Command based; File based; DNS based; Installing and configuring SSH; Installing SSH on Ubuntu/Red Hat/CentOS; Configuring SSH; Installing and configuring NTP; Performing capacity planning; Installing and configuring Hadoop; core-site.xml; hdfs-site.xml; yarn-site.xml; mapred-site.xml; hadoop-env.sh; yarn-env.sh; Slaves file; Hadoop start up steps; Configuring Apache HBase; Configuring HBase in the standalone mode
  • Configuring HBase in the distributed mode; hbase-site.xml; HBase-env.sh; regionservers; Installing and configuring ZooKeeper; Installing Cloudera Hadoop and HBase; Downloading the required RPM packages; Installing Cloudera in an easier way; Installing the Hadoop and MapReduce packages; Installing Hadoop on Windows; Summary; Chapter 4: Optimizing the HBase/Hadoop Cluster; Setup types for Hadoop and HBase clusters; Recommendations for CDH cluster configuration; Capacity planning; Hadoop optimization; General optimization tips; Optimizing Java GC; Optimizing Linux OS
  • Optimizing the Hadoop parameter