Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 6 from 6 (0.053s).
Loading phrases to help you
refine your search...
Re: Using HBase for Deduping - HBase - [mail # user]
...I could surround with a Try..Catch, but that would each time I insert a UUID for the first time (99% of the time), I would do a checkAndPut(), catch the resultant exception and perform a Put...
   Author: Rahul Ravindran, 2013-02-19, 17:32
Re: Using HBase for Deduping - HBase - [mail # user]
...Checkandput() does not work when the row does not exist, or am I missing something?  Sent from my phone.Excuse the terseness.  On Feb 14, 2013, at 5:33 PM, Michael Segel  wrot...
   Author: Rahul Ravindran, 2013-02-15, 01:35
Re: Using HBase for Deduping - HBase - [mail # user]
...We can't rely on the the assumption event dupes will not dupe outside an hour boundary. So, your take is that, doing a lookup per event within the MR job is going to be bad?   _________...
   Author: Rahul Ravindran, 2013-02-14, 21:07
Re: Using HBase for Deduping - HBase - [mail # user]
...Most will be in the same hour. Some will be across 3-6 hours.   Sent from my phone.Excuse the terseness.  On Feb 14, 2013, at 12:19 PM, Viral Bajaria  wrote:  ...
   Author: Rahul Ravindran, 2013-02-14, 20:29
Using HBase for Deduping - HBase - [mail # user]
...Hi,    We have events which are delivered into our HDFS cluster which may be duplicated. Each event has a UUID and we were hoping to leverage HBase to dedupe them. We run a MapReduce job whi...
   Author: Rahul Ravindran, 2013-02-14, 19:40
Using Hbase for Dedupping - HBase - [mail # user]
...Hi,    We have events which are delivered into our HDFS cluster which may be duplicated. Each event has a UUID and we were hoping to leverage HBase to dedupe them. We run a MapReduce job whi...
   Author: Rahul Ravindran, 2013-02-14, 19:23
Sort:
project
Flume (53)
HBase (6)
type
mail # user (6)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (6)
last 9 months (6)
author
Stack (3750)
Ted Yu (2467)
Jean-Daniel Cryans (2270)
Andrew Purtell (1454)
lars hofhansl (980)
Ryan Rawson (862)
Jean-Marc Spaggiari (676)
Todd Lipcon (670)
Doug Meil (571)
Jonathan Gray (516)
Jonathan Hsieh (510)
Michael Segel (477)
Lars George (438)
stack (414)
Gary Helmling (290)
Rahul Ravindran