Drill >> mail # user >> Distributed mode troubles: ZK/Curator connection time out


Re: Distributed mode troubles: ZK/Curator connection time out

> One thing to add to the diagram is that all of the drill java processes will look at what is in drill-override.conf.

Thanks, done.

> You must set zk.connect to the correct zk host:port.
Can you be a tad more explicit, please? In drill-override.conf I have

[[

zk: {
connect: "localhost:2181",

]]
What am I overlooking?
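
One way to narrow this down is to check whether anything is actually listening at the host:port given in zk.connect, since the log shows a plain connection timeout for localhost:2181. Below is a minimal sketch, assuming Python 3 is available on the machine running the Drillbit. The helper names `parse_connect` and `zk_reachable` are hypothetical and not part of Drill or ZooKeeper. The sketch only tests TCP reachability and assumes a plain `host:port[,host:port]` connect string without a chroot suffix.

```python
import socket


def parse_connect(connect):
    """Split a connect string such as 'h1:2181,h2:2181' into (host, port) pairs.

    Assumption: every entry has the form host:port, with no chroot path.
    """
    pairs = []
    for entry in connect.split(","):
        host, _, port = entry.strip().rpartition(":")
        pairs.append((host, int(port)))
    return pairs


def zk_reachable(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

Calling `zk_reachable(*parse_connect("localhost:2181")[0])` on the Drillbit host returns False when no ZooKeeper server is listening on that address, which would be consistent with the ConnectionLoss error in the log. A True result only shows that the port is open, not that the listener is a healthy ZooKeeper.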

Also, any directions re the rest of my questions (re bin/submit_plan etc.)?
With a little help from here, I’m happy to put together a description of how to set this up in the Wiki. That would also address a query from Steve McPherson that has now been lying around for more than three weeks; see http://mail-archives.apache.org/mod_mbox/incubator-drill-user/201310.mbox/%3CCE71A20F.14F5B%25stevemp%40amazon.com%3E. I find it slightly embarrassing that it attracted 0 responses, and if I were Steve, I’d probably not touch Drill anymore, but let’s hope for the best …
Cheers,
Michael

--
Michael Hausenblas
Ireland, Europe
http://mhausenblas.info/

On 27 Oct 2013, at 21:32, Steven Phillips <[EMAIL PROTECTED]> wrote:

> One thing to add to the diagram is that all of the drill java processes
> will look at what is in drill-override.conf. You must set zk.connect to the
> correct zk host:port.
>
>
> On Sun, Oct 27, 2013 at 2:00 PM, Michael Hausenblas <
> [EMAIL PROTECTED]> wrote:
>
>>
>> Folks,
>>
>> I’m trying to set up Drill in distributed mode. Here’s what I have so far:
>> when I launch the first Drillbit with bin/drillbit.sh I get the following
>> in log/drillbit.out:
>>
>> [[
>> 20:47:20.963 [main] ERROR com.netflix.curator.ConnectionState - Connection
>> timed out for connection string (localhost:2181) and timeout (5000) /
>> elapsed (5045)
>> org.apache.zookeeper.KeeperException$ConnectionLossException:
>> KeeperErrorCode = ConnectionLoss
>>        at
>> com.netflix.curator.ConnectionState.getZooKeeper(ConnectionState.java:94)
>> ~[curator-client-1.1.9.jar:na]
>>        at
>> com.netflix.curator.CuratorZookeeperClient.getZooKeeper(CuratorZookeeperClient.java:106)
>> [curator-client-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.CuratorFrameworkImpl.getZooKeeper(CuratorFrameworkImpl.java:393)
>> [curator-framework-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.GetChildrenBuilderImpl$3.call(GetChildrenBuilderImpl.java:184)
>> [curator-framework-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.GetChildrenBuilderImpl$3.call(GetChildrenBuilderImpl.java:173)
>> [curator-framework-1.1.9.jar:na]
>>        at com.netflix.curator.RetryLoop.callWithRetry(RetryLoop.java:85)
>> [curator-client-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.GetChildrenBuilderImpl.pathInForeground(GetChildrenBuilderImpl.java:169)
>> [curator-framework-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.GetChildrenBuilderImpl.forPath(GetChildrenBuilderImpl.java:161)
>> [curator-framework-1.1.9.jar:na]
>>        at
>> com.netflix.curator.framework.imps.GetChildrenBuilderImpl.forPath(GetChildrenBuilderImpl.java:36)
>> [curator-framework-1.1.9.jar:na]
>>        at
>> com.netflix.curator.x.discovery.details.ServiceDiscoveryImpl.getChildrenWatched(ServiceDiscoveryImpl.java:306)
>> [curator-x-discovery-1.1.9.jar:na]
>>        at
>> com.netflix.curator.x.discovery.details.ServiceDiscoveryImpl.queryForInstances(ServiceDiscoveryImpl.java:276)
>> [curator-x-discovery-1.1.9.jar:na]
>>        at
>> com.netflix.curator.x.discovery.details.ServiceCache.refresh(ServiceCache.java:193)
>> [curator-x-discovery-1.1.9.jar:na]
>>        at
>> com.netflix.curator.x.discovery.details.ServiceCache.start(ServiceCache.java:116)
>> [curator-x-discovery-1.1.9.jar:na]
>>        at
>> org.apache.drill.exec.coord.ZKClusterCoordinator.start(ZKClusterCoordinator.java:89)
>> [java-exec-1.0.0-m1-rebuffed.jar:1.0.0-m1]
>>        at org.apache.drill.exec.server.Drillbit.run(Drillbit.java:94)
>> [java-exec-1.0.0-m1-rebuffed.jar:1.0.0-m1]
>>        at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:56)