Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
Accumulo >> mail # dev >> RFile configuration preferences


Copy link to this message
-
RFile configuration preferences
It seems RFile has a preference for the Hadoop configuration object holding
Accumulo configuration over Accumulo per-table configuration in ZooKeeper.

See RFileOperations.openWriter(...).
The affected configuration properties are:

table.file.replication
table.file.blocksize
table.file.compress.blocksize
table.file.compress.blocksize.index
table.file.compress.type

Furthermore, when they appear in Hadoop configuration, they cannot contain
the Accumulo shortcuts for specifying byte sizes (like "1G").

Is this a bug, or a feature? It seems like there's a potential for it to be
a feature, particularly in AccumuloFileOutputFormat, so one can specify the
property in Hadoop, but it could also be a bug if it shows up in the Hadoop
configuration files... especially since we don't prefix these configuration
properties with something unique, like "accumulo."

Thoughts?

--
Christopher L Tubbs II
http://gravatar.com/ctubbsii
+
Eric Newton 2012-11-28, 23:50
+
Christopher Tubbs 2012-12-11, 22:19
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB