Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - how to model data based on "time bucket"


Copy link to this message
-
how to model data based on "time bucket"
Oleg Ruchovets 2013-01-28, 13:06
Hi ,

I have such row data structure:

event_id | time
============event1 | 10:07
event2 | 10:10
event3 | 10:12

event4 | 10:20
event5 | 10:23
event6 | 10:25
Numbers of records is 50-100 million.
Question:

I need to find group of events starting form eventX and enters to the time
window bucket = T.
For example: if T=7 munutes.
Starting from event event1- {event1, event2 , event3} were detected durint
7 minutes.

Starting from event event2- {event2 , event3} were detected durint 7
minutes.

Starting from event event4 - {event4, event5 , event6} were detected during
7 minutes.
Is there a way to model the data in hbase to get?

Thanks