Does anybody know how to scan records based on a scan filter, e.g. <code>column:something = "somevalue"</code>? Something like this, but from the HBase shell?


Scan with filter using HBase shell

2 Answers

Try this. It's kind of ugly, but it works for me.

import org.apache.hadoop.hbase.filter.CompareFilter
import org.apache.hadoop.hbase.filter.SingleColumnValueFilter
import org.apache.hadoop.hbase.filter.SubstringComparator
import org.apache.hadoop.hbase.util.Bytes

# Keep only rows whose family:qualifier value contains 'somevalue'
scan 't1', { COLUMNS => 'family:qualifier', FILTER =>
    SingleColumnValueFilter.new(
        Bytes.toBytes('family'),
        Bytes.toBytes('qualifier'),
        CompareFilter::CompareOp.valueOf('EQUAL'),
        SubstringComparator.new('somevalue')) }

The HBase shell will include whatever you have in ~/.irbrc, so you can put something like this in there (I'm no Ruby expert, improvements are welcome):

# imports like above
def scan_substr(table, family, qualifier, substr, *cols)
    scan table, { COLUMNS => cols, FILTER =>
        SingleColumnValueFilter.new(
            Bytes.toBytes(family), Bytes.toBytes(qualifier),
            CompareFilter::CompareOp.valueOf('EQUAL'),
            SubstringComparator.new(substr)) }
end

and then you can just say in the shell:

scan_substr 't1', 'family', 'qualifier', 'somevalue', 'family:qualifier'
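If you want a helper like scan_substr to support comparisons other than EQUAL, one option is a small lookup from the operator symbols to the enum names that CompareFilter::CompareOp.valueOf accepts. This is only a sketch: COMPARE_OP_NAMES and compare_op_name are made-up names, not part of HBase or its shell.

```ruby
# Hypothetical lookup (not an HBase API): comparison operator symbol
# => name of the matching CompareFilter::CompareOp enum constant.
COMPARE_OP_NAMES = {
  '<'  => 'LESS',
  '<=' => 'LESS_OR_EQUAL',
  '='  => 'EQUAL',
  '!=' => 'NOT_EQUAL',
  '>=' => 'GREATER_OR_EQUAL',
  '>'  => 'GREATER'
}.freeze

# Returns the enum name for an operator, or raises on anything unknown
# so a typo fails early instead of producing a confusing Java error.
def compare_op_name(operator)
  COMPARE_OP_NAMES.fetch(operator) do
    raise ArgumentError, "unsupported operator: #{operator.inspect}"
  end
end
```

Inside the shell you could then write CompareFilter::CompareOp.valueOf(compare_op_name('>=')) in place of the hard-coded 'EQUAL'.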

113

answered Sep 26 '22 08:09

bhavanki

scan 'test', {COLUMNS => ['F'], FILTER => "(SingleColumnValueFilter('F','u',=,'regexstring:http:.*pdf',true,true)) AND (SingleColumnValueFilter('F','s',=,'binary:2',true,true))"}

More information can be found here. Note that multiple examples reside in the attached Filter Language.docx file.
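Because the filter is passed as a plain string, it can also be assembled with ordinary Ruby before calling scan. The sketch below rebuilds the string from the scan above; scvf and all_of are hypothetical helper names (not HBase APIs), and the code assumes the values contain no single quotes, since it does no escaping.

```ruby
# Hypothetical helper (not an HBase API): renders one SingleColumnValueFilter
# clause as filter-language text. The two trailing booleans are the
# "filter if column missing" and "latest version only" flags.
def scvf(family, qualifier, operator, comparator,
         filter_if_missing = true, latest_version_only = true)
  "SingleColumnValueFilter('#{family}','#{qualifier}',#{operator}," \
    "'#{comparator}',#{filter_if_missing},#{latest_version_only})"
end

# Hypothetical helper: parenthesizes each clause and joins them with AND.
def all_of(*clauses)
  clauses.map { |clause| "(#{clause})" }.join(' AND ')
end

filter = all_of(
  scvf('F', 'u', '=', 'regexstring:http:.*pdf'),
  scvf('F', 's', '=', 'binary:2')
)
# In the HBase shell: scan 'test', {COLUMNS => ['F'], FILTER => filter}
```

Defined in ~/.irbrc, helpers like these would be available in every shell session.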

answered Sep 25 '22 08:09

dape


Tags:

nosql

hbase

Gandalf StormCrow
