How to design an HBase schema?

Suppose I have this RDBMS table (entity-attribute-value model):

col1: entityID
col2: attributeName
col3: value

I want to move it to HBase because of scaling issues.

I know that the only way to access an HBase table is by its primary key (cursor): you can get a cursor for a specific key and iterate over the rows one by one.
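To make that access model concrete, here is a minimal in-memory sketch of a table kept sorted by row key, with a point lookup and a range scan. It is only a toy model written for illustration: `SortedTable`, `put`, `get`, and `scan` are hypothetical names, not the HBase client API, and the sample rows are made up.

```python
import bisect


class SortedTable:
    """Toy stand-in for a table sorted by row key (not the HBase API)."""

    def __init__(self):
        self._keys = []   # row keys, kept in sorted order
        self._rows = {}   # row key -> {column: value}

    def put(self, key, column, value):
        """Insert or update one cell, keeping the key list sorted."""
        if key not in self._rows:
            bisect.insort(self._keys, key)
            self._rows[key] = {}
        self._rows[key][column] = value

    def get(self, key):
        """Point lookup by row key; returns None if the key is absent."""
        return self._rows.get(key)

    def scan(self, start, stop):
        """Yield (key, row) for start <= key < stop, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        for key in self._keys[lo:hi]:
            yield key, self._rows[key]


# Made-up sample data, inserted out of order on purpose.
table = SortedTable()
table.put("entity2", "color", "blue")
table.put("entity1", "color", "red")
table.put("entity1", "size", "large")
table.put("entity3", "color", "red")

row = table.get("entity1")
scanned = [key for key, _ in table.scan("entity1", "entity3")]
```

Both operations are driven by the row key alone, which is why a lookup by attributeName or value needs some other structure.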

The issue is that, in my case, I want to be able to iterate on all 3 columns. For example:

For a given entityID, I want to get all its attributes and values.
For a given attributeName and value, I want to get all the entityIDs ...

So one idea I had is to build one HBase table that holds the data (table DATA, with entityID as the primary index) and 2 "index" tables, one with attributeName as the primary key and the other with value.

Each index table will hold a list of pointers (entityIDs) into the DATA table.

Is this a reasonable approach, or is it an 'abuse' of HBase concepts?
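A rough in-memory sketch of this kind of layout, to make the two lookups concrete. It deliberately varies the idea in one way, which is an assumption and not part of the question: instead of two separate index tables, it keeps a single index whose key is the composite attributeName + value + entityID, so the "attributeName and value" lookup becomes a prefix scan over sorted keys. All names here (`SEP`, `put`, `attributes_of`, `entities_with`) are hypothetical, and nothing below is the HBase API.

```python
import bisect

SEP = "\x00"  # assumed separator; must not occur in attribute names or values

data = {}    # DATA table: entityID -> {attributeName: value}
index = []   # index table: sorted keys "attributeName SEP value SEP entityID"


def put(entity_id, attribute, value):
    """Write the data cell and keep the index entry in step with it."""
    old = data.setdefault(entity_id, {}).get(attribute)
    if old is not None:
        # Remove the stale index entry so the index matches the data.
        index.remove(SEP.join((attribute, old, entity_id)))
    data[entity_id][attribute] = value
    bisect.insort(index, SEP.join((attribute, value, entity_id)))


def attributes_of(entity_id):
    """Lookup by entityID: a plain get on the DATA table."""
    return data.get(entity_id, {})


def entities_with(attribute, value):
    """Lookup by (attributeName, value): a prefix scan on the index."""
    prefix = attribute + SEP + value + SEP
    start = bisect.bisect_left(index, prefix)
    result = []
    for key in index[start:]:
        if not key.startswith(prefix):
            break
        result.append(key[len(prefix):])
    return result


# Made-up sample data.
put("e1", "color", "red")
put("e2", "color", "red")
put("e3", "color", "blue")
put("e2", "color", "green")   # update: e2 must leave the "red" index range
```

The cost of this design is that every write touches both structures, and in a real store those two writes are not automatically atomic, so the application has to handle an index that is briefly out of date.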

In this blog, the author says:

HBase allows get operations by primary key and scans (think: cursor) over row ranges. (If you have both scale and need of secondary indexes, don’t worry - Lucene to the rescue! But that’s another post.)

Do you know how Lucene can help?

-- Yonatan

330

asked Dec 17 '08 16:12

Yonatan Maman

1 Answer

Secondary indexes would indeed be useful for many potential applications of HBase, and I believe the developers are in fact looking at them. Check out http://www.mail-archive.com/[email protected]/msg04801.html.

In the meantime, though, if your application's data storage can be modelled as a star schema (see http://en.wikipedia.org/wiki/Star_schema), you might like to check out the solution that Hypertable proposes for secondary-index-type needs: http://markmail.org/message/rphm4q6cbar2ycgp

100

answered Sep 21 '22 00:09

The D Williams

Tags: hadoop, rdbms, hive, hbase
