HBase权威指南查看源代码讨论查看历史
HBase权威指南 |
《HBase权威指南(影印版)(英文版)》探讨了与Hadoop的高度集成如何使HBase的可伸缩性变得简单;把大型数据集分布到相对廉价的商业服务器集群中;使用本地Java客户端,或者通过提供了REST、Avro和Thrift应用编程接口的网关服务器来访问HBase;让你了解HBase架构的细节,包括存储格式、预写日志、后台进程等等;在HBase中集成用于海量并行数据处理任务的Hadoop的MapReduce框架;助你了解如何调节集群、设计模式、拷贝表、导入批量数据、删除节点以及其他更多的任务等。
基本内容
作者:乔治
出版时间:2012-4
定价:72.00元
ISBN:9787564133924
外文名:HBase:The Definitive Guide
语种:英语
副标题:HBase权威指南
页数:522
装帧:平装
书名:HBase权威指南
开本:16
类型:计算机与互联网
内容介绍
《HBase权威指南(影印版)(英文版)》探讨了与Hadoop的高度集成如何使HBase的可伸缩性变得简单;把大型数据集分布到相对廉价的商业服务器集群中;使用本地Java客户端,或者通过提供了REST、Avro和Thrift应用编程接口的网关服务器来访问HBase;让你了解HBase架构的细节,包括存储格式、预写日志、后台进程等等;在HBase中集成用于海量并行数据处理任务的Hadoop的MapReduce框架;助你了解如何调节集群、设计模式、拷贝表、导入批量数据、删除节点以及其他更多的任务等。
推荐
《HBase权威指南(影印版)(英文版)》提供了很多有意义的答案,无论你是否正在评估这种非关系型数据库或者正试图立刻把它付诸实践。
目录
Foreword
Preface
1.Introduction
The Dawn of Big Data
The Problem with Relational Database Systems
Nonrelational Database Systems,Not-Only SQL or NoSQL?
Dimensions
Scalability
Database(De-)Normalization
Building Blocks
Backdrop
Tables,Rows,Columns,and Cells
Auto-Sharding
Storage API
Implementation
Summary
HBase: The Hadoop Database
History
Nomenclature
Summary
2.Installation
Quick-Start Guide
Requirements
Hardware
Software
Filesystems for HBase
Local
HDFS
S3
Other Filesystems
Installation Choices
Apache Binary Release
Building from Source
Run Modes
Standalone Mode
Distributed Mode
Configuration
hbase-site.xml and hbase'-default.xml
hbase-env.sh
regionserver
log4j.properties
Example Configuration
Client Configuration
Deployment
Script-Based
Apache Whirr
Puppet and Chef
Operating a Cluster
Running and Confirming Your Installation
Web-based UI Introduction
Shell Introduction
Stopping the Cluster
3.Client APh The Basics
General Notes
CRUD Operations
Put Method
Get Method
Delete Method
Batch Operations
Row Locks
Scans
Introduction
The ResultScanner Class
Caching Versus Batching
Miscellaneous Features
The HTable Utility Methods
The Bytes Class
4.Client API:Advanced Features
Filters
Introduction to Filters
Comparison Filters
Dedicated Filters
Decorating Filters
FilterList
Custom Filters
Filters Summary
Counters
Introduction to Counters
Single Counters
Multiple Counters
Coprocessors
Introduction to Coprocessors
The Coprocessor Class
Coprocessor Loading
The RegionObserver Class
The MasterObserver Class
Endpoints
HTablePool
Connection Handling
5.Client APh Administrative Features
Schema Definition
Tables
Table Properties
Column Families
HBaseAdmin
Basic Operations
Table Operations
Schema Operations
Cluster Operations
Cluster Status Information
6.Available Clients
Introduction to REST,Thrift,and Avro
Interactive Clients
Native Java
REST
Thrift
Avro
Other Clients
Batch Clients
MapReduce
Hive
Pig
Cascading
Shell
Basics
Commands
Scripting
Web-based UI
Master UI
Region Server UI
Shared Pages
7.MapReduce Integration
Framework
MapReduce Introduction
Classes
Supporting Classes
MapReduce Locality
Table Splits
MapReduce over HBase
Preparation
Data Sink
Data Source
Data Source and Sink
Custom Processing
8.Architecture
Seek Versus Transfer
B+ Trees
Log-Structured Merge-Trees
Storage
Overview
Write Path
Files
HFile Format
KeyValue Format
Write-Ahead Log
Overview
HLog Class
HLogKey Class
WALEdit Class
LogSyncer Class
LogRoller Class
……
9.Advanced Usage
10.Cluster Monitoring
11.Performance Tuning
12.Cluster Administration
A.HBase Configuration Properties
B.Road Map
C Upgrade from Previous Releases
D.Distributions
E.Hush SQL Schema
F.HBaseVersusBigtable
Index
作者简介
作者:(比利时)乔治(Lars George)
Lars George,HBase项目组成员,他是cloudera的解决方案架构师,提供了关于Hadoop和HBase的技术支持、咨询服务和培训工作。他曾经在多个不同的Hadoop用户组会议发言,并且在如布鲁塞尔的自由及开源软件开发者欧洲会议(FOSDEM)这样的大型会议中发言。
媒体推荐
“分布式系统充其量只能说是经常‘多云’的。Lars George从那些主题的‘水蒸气’中提炼出了实际的项目,在此过程中,他为我们呈现了一本卓越的HBase指南。”
——Josh Patterson Cloudera公司[1]
参考文献
- ↑ HBase权威指南—读书笔记(一)简书网,2019-12-13