Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> RowKey design with hashing


Copy link to this message
-
Re: RowKey design with hashing
Depends on you search pattern.
If you never care about scans ordering i.e. you only do point gets to see whether you've already seen an email address, do the hash part.

I'd perfer #1 over #2, because it would let you do efficient key prefix block encoding (FAST_DIFF).

-- Lars

________________________________
 From: Nurettin Şimşek <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Wednesday, February 13, 2013 12:35 AM
Subject: RowKey design with hashing
 
Hi All,

In our project mail adresses are row key. Which rowkey design  we should
choose?

1) com.yahoo@xxxx (Reversed)
2) [EMAIL PROTECTED]
3) md5 hash([EMAIL PROTECTED])
4) Any other solution.

Many thanks.

--
M. Nurettin ŞİMŞEK
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB