Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase, mail # user - Bulk import - key, value ambiguity


+
Ioakim Perros 2012-08-04, 12:22
Copy link to this message
-
Re: Bulk import - key, value ambiguity
Paul Mackles 2012-08-04, 20:03
Probably because M/R requires a key and because you want M/R to sort on
that key which is required for writing hfiles.

On 8/4/12 8:22 AM, "Ioakim Perros" <[EMAIL PROTECTED]> wrote:

>Hi,
>
>Does anyone knows why at HFileOutputFormat the API (
>http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOut
>putFormat.html#configureIncrementalLoad(org.apache.hadoop.mapreduce.Job,
>org.apache.hadoop.hbase.client.HTable)
><http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOu
>tputFormat.html#configureIncrementalLoad%28org.apache.hadoop.mapreduce.Job
>,%20org.apache.hadoop.hbase.client.HTable%29>
>)
>
>suggests using as key an ImmutableBytesWritable object and as value a
>KeyValue object,
>
>when the KeyValue object has as its field the row that each KeyValue
>will lead to ? And as I experienced, this row field is being used as the
>table's key.
>
>Thanks in advance!
>