Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> RowKey design with hashing


Copy link to this message
-
Re: RowKey design with hashing
Hi Lars,

Can you please tell more about key prefix block encoding? Or refer to
some blog/doc? How it works, what it is, etc.?

Thanks,

JM

2013/2/13, lars hofhansl <[EMAIL PROTECTED]>:
> Depends on you search pattern.
> If you never care about scans ordering i.e. you only do point gets to see
> whether you've already seen an email address, do the hash part.
>
> I'd perfer #1 over #2, because it would let you do efficient key prefix
> block encoding (FAST_DIFF).
>
> -- Lars
>
>
>
> ________________________________
>  From: Nurettin Şimşek <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Sent: Wednesday, February 13, 2013 12:35 AM
> Subject: RowKey design with hashing
>
> Hi All,
>
> In our project mail adresses are row key. Which rowkey design  we should
> choose?
>
> 1) com.yahoo@xxxx (Reversed)
> 2) [EMAIL PROTECTED]
> 3) md5 hash([EMAIL PROTECTED])
> 4) Any other solution.
>
> Many thanks.
>
> --
> M. Nurettin ŞİMŞEK
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB