Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # dev - reason to do major compaction after split


Copy link to this message
-
Re: reason to do major compaction after split
Matteo Bertozzi 2013-03-07, 22:28
This is seems to going in a super messy direction.
With HBASE-7806 the ideas was to cleanup all this crazy stuff (HFileLink,
References, ...)

unfortunately the initial decision of tight together the fs layout
and the tables/regions/families is bringing to all this workaround to have
something cool.

If you put the files in one place, and the association in another  you can
avoid all this complexity.

/hbase/data/[file1, file 2, file 3, file N]

table 1/region 1: [file 2]
table 1/region 2: [file 1 (from 0 to 50)]
table 1/region 3: [file 1 (from 50 to 100)]
table 2/region 1: [file 1, file 2]

On Thu, Mar 7, 2013 at 10:13 PM, Stack <[EMAIL PROTECTED]> wrote:

> Yes.  That is a few trips to the NN listing directory contents and then
> some edits/reading of .META.  We would have to introduce a QuarterHFile to
> go with our HalfHFile (or rename HalfHFile as PieceO'HFile).
>
>
> St.Ack
>