David Koch 2013-01-27, 22:29
On Sun, Jan 27, 2013 at 2:29 PM, David Koch <[EMAIL PROTECTED]> wrote:
> I read about "short circuit reads" in the HBase documentation's performance
> section and was wondering what people's experiences were using this in a
> production setting.
> 1. Since only one dedicated user can take advantage of the feature do you
> launch all jobs as this user?
That's the big limitation right now. Running everything as the same
user can make managing jobs difficult; also, that user would need to be
the same as HBase's.
FWIW, HDFS-347 should fix those limitations, but it's not committed yet
(getting close though).
> 2. Can dfs.client.read.shortcircuit be set to false for jobs which are not
> launched by the short-circuit user in order to avoid exceptions? In other
> words - can this setting be overridden by the client configuration's
Yes, but those exceptions are really harmless.
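As a rough sketch of that override, the client configuration used by the non-short-circuit jobs could carry the property below. The property name is the one from the question; putting it in the client's hdfs-site.xml (rather than setting it per job) is an assumption about how those jobs are configured.

```xml
<!-- Client-side hdfs-site.xml for jobs NOT run as the short-circuit user. -->
<!-- Assumption: these jobs read their own client configuration. -->
<property>
  <name>dfs.client.read.shortcircuit</name>
  <value>false</value>
</property>
```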
> 3. In the same context, it is suggested to enable HBase internal
> checksums. Is this a feature which can be enabled in HBase 0.92.1 which
> is part of the Cloudera 4.1.x release?
Yes on the first question, no on the second one (what Ted said)
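For reference, on an HBase release that includes HBASE-5074, enabling the internal checksums in hbase-site.xml would look roughly like the fragment below. The property name hbase.regionserver.checksum.verify is not mentioned in this thread and is given here as an assumption to verify against the documentation for your release; as noted above, the feature is not available in 0.92.1.

```xml
<!-- hbase-site.xml on the region servers. -->
<!-- Assumption: an HBase release that includes HBASE-5074 (not 0.92.1). -->
<property>
  <name>hbase.regionserver.checksum.verify</name>
  <value>true</value>
</property>
```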
> Thank you,
>  http://hbase.apache.org/book/perf.hdfs.html#ftn.d2145e7370
>  https://issues.apache.org/jira/browse/HBASE-5074