Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Accumulo, mail # dev - RFile configuration preferences


Copy link to this message
-
Re: RFile configuration preferences
Eric Newton 2012-11-28, 23:50
Sounds to me like an ancient holdover from the days of MapFile.

If we can change it easily, I'm all for that.

-Eric

On Wed, Nov 28, 2012 at 5:55 PM, Christopher Tubbs <[EMAIL PROTECTED]>wrote:

> It seems RFile has a preference for the Hadoop configuration object holding
> Accumulo configuration over Accumulo per-table configuration in ZooKeeper.
>
> See RFileOperations.openWriter(...).
> The affected configuration properties are:
>
> table.file.replication
> table.file.blocksize
> table.file.compress.blocksize
> table.file.compress.blocksize.index
> table.file.compress.type
>
> Furthermore, when they appear in Hadoop configuration, they cannot contain
> the Accumulo shortcuts for specifying byte sizes (like "1G").
>
> Is this a bug, or a feature? It seems like there's a potential for it to be
> a feature, particularly in AccumuloFileOutputFormat, so one can specify the
> property in Hadoop, but it could also be a bug if it shows up in the Hadoop
> configuration files... especially since we don't prefix these configuration
> properties with something unique, like "accumulo."
>
> Thoughts?
>
> --
> Christopher L Tubbs II
> http://gravatar.com/ctubbsii
>