Accumulo >> mail # dev >> RFile configuration preferences

Christopher Tubbs 2012-11-28, 22:55
Re: RFile configuration preferences
Sounds to me like an ancient holdover from the days of MapFile.

If we can change it easily, I'm all for that.


On Wed, Nov 28, 2012 at 5:55 PM, Christopher Tubbs <[EMAIL PROTECTED]> wrote:

> It seems RFile has a preference for the Hadoop configuration object holding
> Accumulo configuration over Accumulo per-table configuration in ZooKeeper.
> See RFileOperations.openWriter(...).
>
> The affected configuration properties are:
>   table.file.replication
>   table.file.blocksize
>   table.file.compress.blocksize
>   table.file.compress.blocksize.index
>   table.file.compress.type
>
> Furthermore, when they appear in Hadoop configuration, they cannot contain
> the Accumulo shortcuts for specifying byte sizes (like "1G").
>
> Is this a bug, or a feature? It seems like there's a potential for it to be
> a feature, particularly in AccumuloFileOutputFormat, so one can specify the
> property in Hadoop, but it could also be a bug if it shows up in the Hadoop
> configuration files... especially since we don't prefix these configuration
> properties with something unique, like "accumulo."
>
> Thoughts?
> --
> Christopher L Tubbs II
> http://gravatar.com/ctubbsii
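
For illustration only, here is a minimal sketch of one way the lookup discussed above could behave if the per-table value took precedence and the byte-size shortcuts were accepted from either source. It is not the actual RFileOperations code: the class RFileConfigSketch, both method names, and the use of plain maps in place of the Hadoop and Accumulo configuration objects are hypothetical, and the K/M/G suffixes are assumed to be powers of 1024.

```java
import java.util.Map;

/** Hypothetical sketch; not Accumulo's actual RFileOperations logic. */
final class RFileConfigSketch {

    /**
     * Parses a byte size such as "1G", "64K" or "1048576".
     * Assumption: K, M and G are powers of 1024; no suffix means plain bytes.
     */
    static long parseBytes(String value) {
        String s = value.trim();
        if (s.isEmpty()) {
            throw new IllegalArgumentException("empty size value");
        }
        int shift;
        switch (Character.toUpperCase(s.charAt(s.length() - 1))) {
            case 'K': shift = 10; break;
            case 'M': shift = 20; break;
            case 'G': shift = 30; break;
            default:  return Long.parseLong(s); // no recognized suffix
        }
        long number = Long.parseLong(s.substring(0, s.length() - 1).trim());
        return number << shift;
    }

    /**
     * Resolves a size property. In this sketch the per-table value wins,
     * the Hadoop value is only a fallback, and both accept the shortcuts.
     * The maps are stand-ins for the real configuration objects.
     */
    static long resolveBytes(String key,
                             Map<String, String> tableConf,
                             Map<String, String> hadoopConf,
                             long defaultValue) {
        String raw = tableConf.get(key);
        if (raw == null) {
            raw = hadoopConf.get(key);
        }
        return raw == null ? defaultValue : parseBytes(raw);
    }
}
```

With this ordering, a table.file.blocksize set per table would override a value that happens to appear in the Hadoop configuration files, while a value supplied only through Hadoop (for example from a MapReduce job) would still be honored.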
Christopher Tubbs 2012-12-11, 22:19