Is it possible to store graphs hbase? if so how do you model the database to support a graph structure?

Tags:

I have been playing around with using graphs to analyze big data. Its been working great and really fun but I'm wondering what to do as the data gets bigger and bigger?

Let me know if there's any other solution but I thought of trying Hbase because it scales horizontally and I can get hadoop to run analytics on the graph(most of my code is already written in java), but I'm unsure how to structure a graph on a nosql database? I know each node can be an entry in the database but I'm not sure how to model edges and add properties to them(like name of nodes, attributes, pagerank, weights on edges,etc..).

Seeing how hbase/hadoop is modeled after big tables and map reduce I suspect there is a way to do this but not sure how. Any suggestions?

Also, does this make sense what I'm trying to do? or is it there better solutions for big data graphs?

410

asked Mar 26 '12 01:03

Lostsoul

1 Answers

You can store an adjacency list in HBase/Accumulo in a column oriented fashion. I'm more familiar with Accumulo (HBase terminology might be slightly different) so you might use a schema similar to:

SrcNode(RowKey) EdgeType(CF):DestNode(CFQ) Edge/Node Properties(Value)

Where CF=ColumnFamily and CFQ=ColumnFamilyQualifier

You might also store node/vertex properties as separate rows using something like:

Node(RowKey) PropertyType(CF):PropertyValue(CFQ) PropertyValue(Value)

The PropertyValue could be either in the CFQ or the Value

From a graph processing perspective as mentioned by @Arnon Rotem-Gal-Oz you could look at Apache Giraph which is an implementation of Google Pregel. Pregel is the method Google use for large graph processing.

Using HBase/Accumulo as input to giraph has been submitted recently (7 Mar 2012) as a new feature request to Giraph: HBase/Accumulo Input and Output formats (GIRAPH-153)

108

answered Dec 08 '22 17:12

Binary Nerd

Related questions
                            
                                postgresql use bytea blob or file location to store serialized object?
                            
                                manipulating 15+ million records in mysql with php?
                            
                                PHP simple text database with SQL syntax [closed]
                            
                                Large Sample Database with Latitudes and Longitudes
                            
                                MySQL: how to do row-level security (like Oracle's Virtual Private Database)?
                            
                                Storing user profile data in the users table or separate profile table?
                            
                                Suggest a simple ORM on .NET - design for maintaining legacy apps
                            
                                Intermittent SQL Exception - network-related or instance-specific error
                            
                                .CSV to SQL CE Table?
                            
                                Problem with auto-incremented "id" column
                            
                                Using BinaryWriter on an Object
                            
                                Storing large amounts of data in a database
                            
                                Why Redis considered to be CP? [closed]
                            
                                When to mock database access
                            
                                How to find stored procedures by name?
                            
                                Custom Fields for a Form representing an object
                            
                                Examining SQLite database of my Android app?
                            
                                CURDATE() causes an syntax error
                            
                                mysql select count with another select count
                            
                                SQL Alter Trigger hanging

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is it possible to store graphs hbase? if so how do you model the database to support a graph structure?

Tags:

database

data-structures

graph

graph-theory

hbase

Lostsoul

People also ask

1 Answers

Binary Nerd

Recent Activity

Donate For Us