Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> How to remove duplicate data in HBase?


Copy link to this message
-
Re: How to remove duplicate data in HBase?
First, you should define what you mean when you say duplicate data.

Depending on your definition… it may already be handled.

On Jan 17, 2014, at 7:39 AM, Ted Yu <[EMAIL PROTECTED]> wrote:

> Can you tell us where the duplicate data resides - between column families or between columns in a single column family ?
>
> Cheers
>
> On Jan 17, 2014, at 4:46 AM, oc tsdb <[EMAIL PROTECTED]> wrote:
>
>> Hi all,
>>
>> We want to know if there is any option to remove duplicate data in Hbase
>> based on column family dynamically?
>>
>> Thanks,
>> OC
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB