Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> Question regarding bulkload : overwriting duplicate records


+
Gaetan Deputier 2013-02-08, 00:23
Copy link to this message
-
Re: Question regarding bulkload : overwriting duplicate records
I logged HBASE-7793 to backport.

Cheers

On Thu, Feb 7, 2013 at 4:23 PM, Gaetan Deputier <[EMAIL PROTECTED]> wrote:

> Hi HBase users,
>
> I am using Hbase 0.92.1 from the cloudera distribution cdh4.1.1.
> I am loading bulk files using the ImportTsv job but i have an issue
> regarding records having a different cell value.
>
> I guessed that the underlying Map/Reducer sets the timestamp to the
> currentTime. Is there a way to inform the Tsv job to read the timestamp
> from a column ?
>
> I can still do my own hadoop mapper and then split the lines and treat them
> but i was wondering if the issue on the Hbase Jira (HBASE-5564) which is
> solving this problem would be released soon.
>
> Regards,
>
> G.
>
+
Gaetan Deputier 2013-02-08, 01:30
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB