Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - How to remove duplicate data in HBase?


Copy link to this message
-
Re: How to remove duplicate data in HBase?
Michael Segel 2014-01-17, 14:54
First, you should define what you mean when you say duplicate data.

Depending on your definition… it may already be handled.

On Jan 17, 2014, at 7:39 AM, Ted Yu <[EMAIL PROTECTED]> wrote:

> Can you tell us where the duplicate data resides - between column families or between columns in a single column family ?
>
> Cheers
>
> On Jan 17, 2014, at 4:46 AM, oc tsdb <[EMAIL PROTECTED]> wrote:
>
>> Hi all,
>>
>> We want to know if there is any option to remove duplicate data in Hbase
>> based on column family dynamically?
>>
>> Thanks,
>> OC
>