Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Accumulo >> mail # dev >> RFile configuration preferences

Copy link to this message
Re: RFile configuration preferences
Sounds to me like an ancient holdover from the days of MapFile.

If we can change it easily, I'm all for that.


On Wed, Nov 28, 2012 at 5:55 PM, Christopher Tubbs <[EMAIL PROTECTED]>wrote:

> It seems RFile has a preference for the Hadoop configuration object holding
> Accumulo configuration over Accumulo per-table configuration in ZooKeeper.
> See RFileOperations.openWriter(...).
> The affected configuration properties are:
> table.file.replication
> table.file.blocksize
> table.file.compress.blocksize
> table.file.compress.blocksize.index
> table.file.compress.type
> Furthermore, when they appear in Hadoop configuration, they cannot contain
> the Accumulo shortcuts for specifying byte sizes (like "1G").
> Is this a bug, or a feature? It seems like there's a potential for it to be
> a feature, particularly in AccumuloFileOutputFormat, so one can specify the
> property in Hadoop, but it could also be a bug if it shows up in the Hadoop
> configuration files... especially since we don't prefix these configuration
> properties with something unique, like "accumulo."
> Thoughts?
> --
> Christopher L Tubbs II
> http://gravatar.com/ctubbsii