Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> HBase Schema Design for clickstream data


Copy link to this message
-
HBase Schema Design for clickstream data
I am starting out with a new application where I need to store users
clickstream data. I'll have Visitor Id, session id along with other page
related data. I am wondering if I should just key off randomly generated
session id and store all the page related data as columns inside that row
assuming that this would also give good distribution accross region
servers. In a session user could send 100s of HTML requests and get
responses. If someone is already doing this in HBase I would like to learn
more about it as to how they have designed the schema.
+
Dhaval Shah 2012-06-26, 17:52
+
Amandeep Khurana 2012-06-27, 18:01
+
Mohit Anchlia 2012-06-27, 18:13
+
Amandeep Khurana 2012-06-27, 18:20
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB