HBase, mail # user - how to model data based on "time bucket"


Oleg Ruchovets 2013-01-28, 13:06
Re: how to model data based on "time bucket"
Rodrigo Ribeiro 2013-01-28, 15:17
You can use another table as an index, with a rowkey like
'{time}:{event_id}', and then scan the range ["10:07", "10:15").
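
To make the range scan concrete, here is a minimal in-memory sketch in Python. It does not use the HBase client. A sorted list of strings stands in for the index table, since HBase keeps rowkeys in lexicographic order, and `bisect` emulates a scan with an inclusive start row and an exclusive stop row. The helper names (`index_rowkey`, `scan`, `add_minutes`, `events_in_window`) are hypothetical, and the stop row of start + T + 1 minutes is an assumption chosen to reproduce the ["10:07", "10:15") range for T=7.

```python
from bisect import bisect_left

def index_rowkey(time_hhmm, event_id):
    # Rowkey layout '{time}:{event_id}' for the index table.
    return f"{time_hhmm}:{event_id}"

def scan(sorted_keys, start_row, stop_row):
    # Emulates a range scan: start row inclusive, stop row exclusive.
    lo = bisect_left(sorted_keys, start_row)
    hi = bisect_left(sorted_keys, stop_row)
    return sorted_keys[lo:hi]

def add_minutes(time_hhmm, minutes):
    # Plain HH:MM arithmetic; no dates and no wrap-around at midnight.
    h, m = map(int, time_hhmm.split(":"))
    total = h * 60 + m + minutes
    return f"{total // 60:02d}:{total % 60:02d}"

def events_in_window(sorted_keys, start_time, t_minutes):
    # Assumption: the window includes minute start + T, so the exclusive
    # stop row is start + T + 1 (10:07 with T=7 gives stop row 10:15).
    stop_row = add_minutes(start_time, t_minutes + 1)
    rows = scan(sorted_keys, start_time, stop_row)
    # Drop the 'HH:MM:' prefix to recover the event id.
    return [row.split(":", 2)[2] for row in rows]

# Sample data from the question below.
events = {
    "event1": "10:07", "event2": "10:10", "event3": "10:12",
    "event4": "10:20", "event5": "10:23", "event6": "10:25",
}
keys = sorted(index_rowkey(t, e) for e, t in events.items())

print(events_in_window(keys, "10:07", 7))  # ['event1', 'event2', 'event3']
```

Starting from a given event id, the event's time would first have to be looked up (for example from the main table) and then passed in as the start of the window. A real rowkey would also need the date, or a full timestamp in a fixed-width format, so that keys from different days still sort correctly.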

On Mon, Jan 28, 2013 at 10:06 AM, Oleg Ruchovets <[EMAIL PROTECTED]> wrote:

> Hi ,
>
> I have such row data structure:
>
> event_id | time
> ================
> event1 | 10:07
> event2 | 10:10
> event3 | 10:12
>
> event4 | 10:20
> event5 | 10:23
> event6 | 10:25
>
>
> The number of records is 50-100 million.
>
>
> Question:
>
> I need to find the group of events that starts from eventX and falls
> within a time window (bucket) of T.
>
>
> For example, if T = 7 minutes:
> Starting from event1, {event1, event2, event3} were detected within
> 7 minutes.
>
> Starting from event2, {event2, event3} were detected within 7
> minutes.
>
> Starting from event4, {event4, event5, event6} were detected within
> 7 minutes.
> Is there a way to model the data in HBase to get this?
>
> Thanks
>

--

*Rodrigo Pereira Ribeiro*
Software Developer
www.jusbrasil.com.br
Oleg Ruchovets 2013-01-28, 15:49
Rodrigo Ribeiro 2013-01-28, 16:27
Oleg Ruchovets 2013-01-28, 17:07
Rodrigo Ribeiro 2013-01-28, 17:24
Oleg Ruchovets 2013-01-28, 17:45
Oleg Ruchovets 2013-01-30, 09:57
Rodrigo Ribeiro 2013-01-30, 18:34
Oleg Ruchovets 2013-01-31, 13:52
Rodrigo Ribeiro 2013-01-31, 14:34
Oleg Ruchovets 2013-01-31, 15:39
Rodrigo Ribeiro 2013-01-31, 15:51
Michel Segel 2013-01-28, 15:54
Oleg Ruchovets 2013-01-28, 16:24