Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Bulk import - key, value ambiguity


Copy link to this message
-
Re: Bulk import - key, value ambiguity
Probably because M/R requires a key and because you want M/R to sort on
that key which is required for writing hfiles.

On 8/4/12 8:22 AM, "Ioakim Perros" <[EMAIL PROTECTED]> wrote:

>Hi,
>
>Does anyone knows why at HFileOutputFormat the API (
>http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOut
>putFormat.html#configureIncrementalLoad(org.apache.hadoop.mapreduce.Job,
>org.apache.hadoop.hbase.client.HTable)
><http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOu
>tputFormat.html#configureIncrementalLoad%28org.apache.hadoop.mapreduce.Job
>,%20org.apache.hadoop.hbase.client.HTable%29>
>)
>
>suggests using as key an ImmutableBytesWritable object and as value a
>KeyValue object,
>
>when the KeyValue object has as its field the row that each KeyValue
>will lead to ? And as I experienced, this row field is being used as the
>table's key.
>
>Thanks in advance!
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB