First, you should define what you mean when you say duplicate data.
Depending on your definition… it may already be handled.
On Jan 17, 2014, at 7:39 AM, Ted Yu <[EMAIL PROTECTED]> wrote:
> Can you tell us where the duplicate data resides - between column families or between columns in a single column family ?
> On Jan 17, 2014, at 4:46 AM, oc tsdb <[EMAIL PROTECTED]> wrote:
>> Hi all,
>> We want to know if there is any option to remove duplicate data in Hbase
>> based on column family dynamically?