|
|
-
Re: Bulk import - key, value ambiguityPaul Mackles 2012-08-04, 20:03
Probably because M/R requires a key and because you want M/R to sort on
that key which is required for writing hfiles. On 8/4/12 8:22 AM, "Ioakim Perros" <[EMAIL PROTECTED]> wrote: >Hi, > >Does anyone knows why at HFileOutputFormat the API ( >http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOut >putFormat.html#configureIncrementalLoad(org.apache.hadoop.mapreduce.Job, >org.apache.hadoop.hbase.client.HTable) ><http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/HFileOu >tputFormat.html#configureIncrementalLoad%28org.apache.hadoop.mapreduce.Job >,%20org.apache.hadoop.hbase.client.HTable%29> >) > >suggests using as key an ImmutableBytesWritable object and as value a >KeyValue object, > >when the KeyValue object has as its field the row that each KeyValue >will lead to ? And as I experienced, this row field is being used as the >table's key. > >Thanks in advance! > |