Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Hive >> mail # user >> Rolling MAU computation


Copy link to this message
-
Rolling MAU computation
I'm trying to compute the number of active users in the previous 30 days
for each day over a date range. I can't think of any way to do it directly
within Hive so I'm wondering if you guys have any ideas.

Basically the algorithm is something like:

For each day in date range:
   SELECT day, COUNT(DISTINCT(userid)) FROM logins WHERE day - logins.day <
30;

Thanks for your help!

Tom
+
Roberto Sanabria 2012-10-10, 22:59
+
Tom Hubina 2012-10-10, 23:04
+
MiaoMiao 2012-10-11, 03:50
+
Tom Hubina 2012-10-12, 19:59
+
Igor Tatarinov 2012-10-11, 06:05
+
Tom Hubina 2012-10-12, 20:02
+
Igor Tatarinov 2012-10-12, 20:08
+
Vijay 2012-10-12, 20:42