Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - Secondary indexes suggestions


Copy link to this message
-
Re: Secondary indexes suggestions
Otis Gospodnetic 2012-08-13, 21:49
Lukáš, have a look at this recent post on this topic:
http://blog.sematext.com/2012/08/09/consider-using-fuzzyrowfilter-when-in-need-for-secondary-indexes-in-hbase/ 
Otis 
----
Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm 

>________________________________
> From: Luk� Drbal <[EMAIL PROTECTED]>
>To: [EMAIL PROTECTED]
>Sent: Sunday, August 12, 2012 8:15 AM
>Subject: Secondary indexes suggestions
>
>Hi all,
>
>iam new user of Hbase and i need help with secondary indexes.
>
>For example i have messages and users. Each user has many messages.
>Data structure will be like this:
>
>Message:
>- String id
>- Long sender_id
>- Long recipient_id
>- String text
>- Timestamp created_at
>[...]
>
>User:
>- Long id
>- String username
>[...]
>
>I need create secondary indexes for reading all messages:
>a) inbox (by recipient_id) in timerange.
>b) outbox (by sender_id) in timerange
>
>Can someone give me suggestions for this index(es) and attributes for
>columnFamily?
>I expect here 500M messages and 50M users.
>
>Thanks a lot for response.
>
>
>P.S. Sorry for my bad english, isn't my primary language
>
>
>Lukas Drbal
>
>
>