Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - RowKey design with hashing


Copy link to this message
-
Re: RowKey design with hashing
lars hofhansl 2013-02-14, 00:50
Depends on you search pattern.
If you never care about scans ordering i.e. you only do point gets to see whether you've already seen an email address, do the hash part.

I'd perfer #1 over #2, because it would let you do efficient key prefix block encoding (FAST_DIFF).

-- Lars

________________________________
 From: Nurettin Şimşek <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Wednesday, February 13, 2013 12:35 AM
Subject: RowKey design with hashing
 
Hi All,

In our project mail adresses are row key. Which rowkey design  we should
choose?

1) com.yahoo@xxxx (Reversed)
2) [EMAIL PROTECTED]
3) md5 hash([EMAIL PROTECTED])
4) Any other solution.

Many thanks.

--
M. Nurettin ŞİMŞEK