Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> RowKey design with hashing


Copy link to this message
-
Re: RowKey design with hashing
Jean-Marc:
You can find almost all the details you need from this JIRA:
HBASE-4218 Data Block Encoding of KeyValues (aka delta encoding / prefix
compression)

Cheers

On Wed, Feb 13, 2013 at 6:09 PM, Jean-Marc Spaggiari <
[EMAIL PROTECTED]> wrote:

> Hi Lars,
>
> Can you please tell more about key prefix block encoding? Or refer to
> some blog/doc? How it works, what it is, etc.?
>
> Thanks,
>
> JM
>
> 2013/2/13, lars hofhansl <[EMAIL PROTECTED]>:
> > Depends on you search pattern.
> > If you never care about scans ordering i.e. you only do point gets to see
> > whether you've already seen an email address, do the hash part.
> >
> > I'd perfer #1 over #2, because it would let you do efficient key prefix
> > block encoding (FAST_DIFF).
> >
> > -- Lars
> >
> >
> >
> > ________________________________
> >  From: Nurettin Şimşek <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Sent: Wednesday, February 13, 2013 12:35 AM
> > Subject: RowKey design with hashing
> >
> > Hi All,
> >
> > In our project mail adresses are row key. Which rowkey design  we should
> > choose?
> >
> > 1) com.yahoo@xxxx (Reversed)
> > 2) [EMAIL PROTECTED]
> > 3) md5 hash([EMAIL PROTECTED])
> > 4) Any other solution.
> >
> > Many thanks.
> >
> > --
> > M. Nurettin ŞİMŞEK
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB