HBase: The Definitive Guide

HBase: The Definitive Guide

Author: Lars George

Publisher: "O'Reilly Media, Inc."

Published: 2011-08-29

Total Pages: 555

ISBN-13: 1449315224

DOWNLOAD EBOOK

If you're looking for a scalable storage solution to accommodate a virtually endless amount of data, this book shows you how Apache HBase can fulfill your needs. As the open source implementation of Google's BigTable architecture, HBase scales to billions of rows and millions of columns, while ensuring that write and read performance remain constant. Many IT executives are asking pointed questions about HBase. This book provides meaningful answers, whether you’re evaluating this non-relational database or planning to put it into practice right away. Discover how tight integration with Hadoop makes scalability with HBase easier Distribute large datasets across an inexpensive cluster of commodity servers Access HBase with native Java clients, or with gateway servers providing REST, Avro, or Thrift APIs Get details on HBase’s architecture, including the storage format, write-ahead log, background processes, and more Integrate HBase with Hadoop's MapReduce framework for massively parallelized data processing jobs Learn how to tune clusters, design schemas, copy tables, import bulk data, decommission nodes, and many other tasks


HBase in Action

HBase in Action

Author: Amandeep Khurana

Publisher: Simon and Schuster

Published: 2012-11-01

Total Pages: 507

ISBN-13: 1638355355

DOWNLOAD EBOOK

Summary HBase in Action has all the knowledge you need to design, build, and run applications using HBase. First, it introduces you to the fundamentals of distributed systems and large scale data handling. Then, you'll explore real-world applications and code samples with just enough theory to understand the practical techniques. You'll see how to build applications with HBase and take advantage of the MapReduce processing framework. And along the way you'll learn patterns and best practices. About the Technology HBase is a NoSQL storage system designed for fast, random access to large volumes of data. It runs on commodity hardware and scales smoothly from modest datasets to billions of rows and millions of columns. About this Book HBase in Action is an experience-driven guide that shows you how to design, build, and run applications using HBase. First, it introduces you to the fundamentals of handling big data. Then, you'll explore HBase with the help of real applications and code samples and with just enough theory to back up the practical techniques. You'll take advantage of the MapReduce processing framework and benefit from seeing HBase best practices in action. Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book. What's Inside When and how to use HBase Practical examples Design patterns for scalable data systems Deployment, integration, and design Written for developers and architects familiar with data storage and processing. No prior knowledge of HBase, Hadoop, or MapReduce is required. Table of Contents PART 1 HBASE FUNDAMENTALS Introducing HBase Getting started Distributed HBase, HDFS, and MapReduce PART 2 ADVANCED CONCEPTS HBase table design Extending HBase with coprocessors Alternative HBase clients PART 3 EXAMPLE APPLICATIONS HBase by example: OpenTSDB Scaling GIS on HBase PART 4 OPERATIONALIZING HBASE Deploying HBase Operations


Learning HBase

Learning HBase

Author: Shashwat Shriparv

Publisher: Packt Publishing Ltd

Published: 2014-11-25

Total Pages: 516

ISBN-13: 178398595X

DOWNLOAD EBOOK

If you are an administrator or developer who wants to enter the world of Big Data and BigTables and would like to learn about HBase, this is the book for you.


Architecting HBase Applications

Architecting HBase Applications

Author: Jean-Marc Spaggiari

Publisher: "O'Reilly Media, Inc."

Published: 2016-07-18

Total Pages: 251

ISBN-13: 1491916117

DOWNLOAD EBOOK

Lots of HBase books, online HBase guides, and HBase mailing lists/forums are available if you need to know how HBase works. But if you want to take a deep dive into use cases, features, and troubleshooting, Architecting HBase Applications is the right source for you. With this book, you'll learn a controlled set of APIs that coincide with use-case examples and easily deployed use-case models, as well as sizing/best practices to help jump start your enterprise application development and deployment.


Learn Hbase in 24 Hours

Learn Hbase in 24 Hours

Author: Alex Nordeen

Publisher: Guru99

Published: 2020-09-15

Total Pages: 70

ISBN-13:

DOWNLOAD EBOOK

HBase data storage technology is rapidly adopted by traditional RDMS users. Unlike RDMS, where scaling the server vertically for a huge data is a big challenge. With HBase, you can do this easily. It allows you to integrate with Hadoop's MapReduce framework for massively parallelized data processing jobs. Many expert and beginners are asking for a point-to-point guide that helps them to get a complete insight on HBase working. This book will answer all their queries and give them a complete tour of HBase technology. In this edition, you will begin with some very basic concept like HBase’s architecture, including the storage format, write-ahead log, background processes, and some of the advance topics. You will also learn about accessing HBase with native Java clients, how to tune clusters, design schemas, copy tables, etc. So far if tracking other resources for HBase have disappointed you, you must try this e-book. It is cheap, easy to comprehend and concise in its content. The examples and images are an additional benefit of this book. While to enhance your knowledge pool for related topics, more referrals and links are provided. Table Of Contents Chapter 1: Introduction Chapter 2: Architecture, Data Flow, and Use cases Storage Mechanism in Hbase HBase Architecture and its Important Components Data flow in HBase HBASE vs. HDFS Chapter 3: Installation Guide How to Download Hbase tar file stable version Hbase - Standalone mode installation Hbase - Pseudo Distributed mode of installation Hbase - Fully Distributed mode installation Chapter 4: Shell and General Commands General commands Tables Managements commands Data manipulation commands Cluster Replication Commands Chapter 5: Handling Tables Creation of Table with Rows and Column names Placing values into tables and retrieving values from table Retrieving Inserted Values in HBase shell mode Chapter 6: Limitations, Advantage & Problems Chapter 7: Troubleshooting


Hbase Administration Cookbook

Hbase Administration Cookbook

Author: Yifeng Jiang

Publisher: Packt Publishing Ltd

Published: 2012-08-16

Total Pages: 507

ISBN-13: 1849517150

DOWNLOAD EBOOK

As part of Packt's cookbook series, each recipe offers a practical, step-by-step solution to common problems found in HBase administration. This book is for HBase administrators, developers, and will even help Hadoop administrators. You are not required to have HBase experience, but are expected to have a basic understanding of Hadoop and MapReduce.


Pro Apache Phoenix

Pro Apache Phoenix

Author: Shakil Akhtar

Publisher: Apress

Published: 2016-12-29

Total Pages: 148

ISBN-13: 1484223705

DOWNLOAD EBOOK

Leverage Phoenix as an ANSI SQL engine built on top of the highly distributed and scalable NoSQL framework HBase. Learn the basics and best practices that are being adopted in Phoenix to enable a high write and read throughput in a big data space. This book includes real-world cases such as Internet of Things devices that send continuous streams to Phoenix, and the book explains how key features such as joins, indexes, transactions, and functions help you understand the simple, flexible, and powerful API that Phoenix provides. Examples are provided using real-time data and data-driven businesses that show you how to collect, analyze, and act in seconds. Pro Apache Phoenix covers the nuances of setting up a distributed HBase cluster with Phoenix libraries, running performance benchmarks, configuring parameters for production scenarios, and viewing the results. The book also shows how Phoenix plays well with other key frameworks in the Hadoop ecosystem such as Apache Spark, Pig, Flume, and Sqoop. You will learn how to: Handle a petabyte data store by applying familiar SQL techniques Store, analyze, and manipulate data in a NoSQL Hadoop echo system with HBase Apply best practices while working with a scalable data store on Hadoop and HBase Integrate popular frameworks (Apache Spark, Pig, Flume) to simplify big data analysis Demonstrate real-time use cases and big data modeling techniques Who This Book Is For Data engineers, Big Data administrators, and architects.


Using Flume

Using Flume

Author: Hari Shreedharan

Publisher: "O'Reilly Media, Inc."

Published: 2014-09-16

Total Pages: 238

ISBN-13: 1491905344

DOWNLOAD EBOOK

How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, you’ll learn Flume’s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems. Using Flume shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components for their specific use-cases. You’ll learn about Flume’s design and implementation, as well as various features that make it highly scalable, flexible, and reliable. Code examples and exercises are available on GitHub. Learn how Flume provides a steady rate of flow by acting as a buffer between data producers and consumers Dive into key Flume components, including sources that accept data and sinks that write and deliver it Write custom plugins to customize the way Flume receives, modifies, formats, and writes data Explore APIs for sending data to Flume agents from your own applications Plan and deploy Flume in a scalable and flexible way—and monitor your cluster once it’s running


Analytics for the Internet of Things (IoT)

Analytics for the Internet of Things (IoT)

Author: Andrew Minteer

Publisher: Packt Publishing Ltd

Published: 2017-07-24

Total Pages: 369

ISBN-13: 1787127575

DOWNLOAD EBOOK

Break through the hype and learn how to extract actionable intelligence from the flood of IoT data About This Book Make better business decisions and acquire greater control of your IoT infrastructure Learn techniques to solve unique problems associated with IoT and examine and analyze data from your IoT devices Uncover the business potential generated by data from IoT devices and bring down business costs Who This Book Is For This book targets developers, IoT professionals, and those in the field of data science who are trying to solve business problems through IoT devices and would like to analyze IoT data. IoT enthusiasts, managers, and entrepreneurs who would like to make the most of IoT will find this equally useful. A prior knowledge of IoT would be helpful but is not necessary. Some prior programming experience would be useful What You Will Learn Overcome the challenges IoT data brings to analytics Understand the variety of transmission protocols for IoT along with their strengths and weaknesses Learn how data flows from the IoT device to the final data set Develop techniques to wring value from IoT data Apply geospatial analytics to IoT data Use machine learning as a predictive method on IoT data Implement best strategies to get the most from IoT analytics Master the economics of IoT analytics in order to optimize business value In Detail We start with the perplexing task of extracting value from huge amounts of barely intelligible data. The data takes a convoluted route just to be on the servers for analysis, but insights can emerge through visualization and statistical modeling techniques. You will learn to extract value from IoT big data using multiple analytic techniques. Next we review how IoT devices generate data and how the information travels over networks. You'll get to know strategies to collect and store the data to optimize the potential for analytics, and strategies to handle data quality concerns. Cloud resources are a great match for IoT analytics, so Amazon Web Services, Microsoft Azure, and PTC ThingWorx are reviewed in detail next. Geospatial analytics is then introduced as a way to leverage location information. Combining IoT data with environmental data is also discussed as a way to enhance predictive capability. We'll also review the economics of IoT analytics and you'll discover ways to optimize business value. By the end of the book, you'll know how to handle scale for both data storage and analytics, how Apache Spark can be leveraged to handle scalability, and how R and Python can be used for analytic modeling. Style and approach This book follows a step-by-step, practical approach to combine the power of analytics and IoT and help you get results quickly


Apache HBase Primer

Apache HBase Primer

Author: Deepak Vohra

Publisher: Apress

Published: 2016-11-17

Total Pages: 147

ISBN-13: 1484224248

DOWNLOAD EBOOK

Learn the fundamental foundations and concepts of the Apache HBase (NoSQL) open source database. It covers the HBase data model, architecture, schema design, API, and administration. Apache HBase is the database for the Apache Hadoop framework. HBase is a column family based NoSQL database that provides a flexible schema model. What You'll Learn Work with the core concepts of HBase Discover the HBase data model, schema design, and architecture Use the HBase API and administration Who This Book Is For Apache HBase (NoSQL) database users, designers, developers, and admins.