-Re: Creating a Table using HFileOutputFormat
Renaud Delbru 2010-09-23, 16:50
On 23/09/10 17:13, Stack wrote:
> You've seen this documentation for bulk import in 0.20.x:
> (Make sure you are on 0.20.6).
No, I missed this one. Thanks for pointing me this one.
> In TRUNK bulk import was revamped. Its all fancy and robust now. See
Yes, I see this one, but we are using the 0.20.x version.
> In both versions a partitioner is required. In TRUNK the hadoop total
> order partitioner is brought local and should work for most key types.
> In 0.20.x you'd need to write your own.
Will the TotalOrderPartitioner found in the hadoop library not work for
> In 0.20.x, there is no support for incremental loading. It will only
> load a fresh table. Incremental is a feature of the TRUNK version.
> In 0.20.x, you use the loadtable.rb script. In TRUNK, you run a
> little java program.
All is more clear now.