Re: There are no tablet servers
William Slacum 2012-07-18, 18:21
50030 is the http port for HDFS. You probably want 9000 or 8020. Check your hdfs-site.xml
On Wed, Jul 18, 2012 at 11:15 AM, Cardon, Tejay E <[EMAIL PROTECTED]> wrote:
> All,
>
> I’m running into a strange challenge in my latest Accumulo installation.
> I’ve developed some chef recipes for deploying Accumulo, and have tested
> them on three clusters now with no problems. Using the same scripts, I
> recently did another deployment, but I’m having trouble on this one.
>
> After installing Accumulo, updating the config files, and setting up
> passwordless ssh, I ran:
>
> ./accumulo init
>
> Everything went normally with me setting the instanceID and password
>
> Then I ran
>
> ./start-all.sh
>
> Again, everything went smoothly with the following output:
>
> bash-3.2$ ./start-all.sh
> Starting tablet servers and loggers ....... done
> Starting tablet server on de8-9a-8f-83-be-52
> Starting logger on de8-9a-8f-83-be-52
> Starting tablet server on d04-7d-7b-06-5e-48
> Starting logger on de8-9a-8f-d3-3e-f8
> Starting tablet server on d04-7d-7b-06-5d-f4
> Starting logger on d04-7d-7b-06-5e-48
> Starting logger on d04-7d-7b-06-5d-f4
> Starting tablet server on de8-9a-8f-d3-3e-f8
> 18 12:48:50,970 [server.Accumulo] INFO : Attempting to talk to zookeeper
> 18 12:48:51,182 [server.Accumulo] INFO : Zookeeper connected and initialized, attemping to talk to HDFS
> 18 12:48:51,568 [server.Accumulo] INFO : Connected to HDFS
> Starting master on d04-7d-7b-06-5d-80
> Starting garbage collector on d04-7d-7b-06-5e-ba
> Starting monitor on d04-7d-7b-06-5e-ba
> Starting tracer on d04-7d-7b-06-5d-80
> Starting tracer on de8-9a-8f-d3-3e-f8
> Starting tracer on d04-7d-7b-06-5e-48
>
> I can also run a stop-all.sh with no complaints from the script.
>
> However, if I try to start the Accumulo shell, I get
>
> bash-3.2$ ./accumulo shell
> Enter current password for 'hdfs'@'test4': ******
> 18 13:00:17,906 [impl.ServerClient] WARN : There are no tablet servers: check that zookeeper and accumulo are running.
>
> If I check the tablet server machines I find that they do not have any
> Accumulo processes running, and the master does not have any tablet server
> logs. (it does have the tracer logs, however).
>
> I’ve attached the log files here (without the empty ones). There is an
> error trying to “clean up old log sort” and a thrift error.
>
> I’m at a loss for where to begin on the debugging for this. Any thoughts
> would be greatly appreciated.
>
> 18 12:48:54,100 [master.CoordinateRecoveryTask] ERROR: Error cleaning up old Log Sort jobsjava.io.IOException: Call to /10.1.24.65:50030 failed on local exception: java.io.EOFException
>
> 18 12:48:57,016 [impl.ServerClient] DEBUG: ClientService request failed null, retrying ...
> org.apache.thrift.transport.TTransportException: Failed to connect to a server
>     at org.apache.accumulo.core.client.impl.ThriftTransportPool.getAnyTransport(ThriftTransportPool.java:437)
>     at org.apache.accumulo.core.client.impl.ServerClient.getConnection(ServerClient.java:145)
>     at org.apache.accumulo.core.client.impl.ServerClient.getConnection(ServerClient.java:123)
>     at org.apache.accumulo.core.client.impl.ServerClient.executeRaw(ServerClient.java:105)
>     at org.apache.accumulo.core.client.impl.ServerClient.execute(ServerClient.java:71)
>     at org.apache.accumulo.core.client.impl.ConnectorImpl.<init>(ConnectorImpl.java:75)
>     at org.apache.accumulo.server.client.HdfsZooInstance.getConnector(HdfsZooInstance.java:145)
>     at org.apache.accumulo.server.trace.TraceServer.<init>(TraceServer.java:152)
Re: There are no tablet servers
William Slacum 2012-07-18, 18:22
I'm sorry, 50030 is the http port for the JobTracker.
On Wed, Jul 18, 2012 at 11:21 AM, William Slacum <[EMAIL PROTECTED]> wrote:
> 50030 is the http port for HDFS. You probably want 9000 or 8020. Check
> your hdfs-site.xml
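
For anyone hitting the same "Call to ...:50030 failed" error, the port check suggested above can be sketched in Python. This is only an illustration, not something from the thread: it assumes Hadoop 1.x property names (`fs.default.name`, `mapred.job.tracker`) and the default web UI ports (50030 for the JobTracker, 50070 for the NameNode), and the host names in the sample XML are made up.

```python
import re
import xml.etree.ElementTree as ET

# Default web UI ports in Hadoop 1.x (assumption: defaults were not changed).
# RPC clients should not be pointed at these.
HTTP_UI_PORTS = {50030: "JobTracker web UI", 50070: "NameNode web UI"}

# Properties expected to hold RPC addresses (Hadoop 1.x names, assumed here).
RPC_PROPERTIES = ("fs.default.name", "mapred.job.tracker")


def read_properties(xml_text):
    """Return {name: value} parsed from the text of a Hadoop *-site.xml file."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None and value is not None:
            props[name.strip()] = value.strip()
    return props


def suspicious_ports(props):
    """Return a list of (property, port, description) for RPC properties
    whose value ends in a port that is normally a web UI port."""
    found = []
    for name in RPC_PROPERTIES:
        value = props.get(name, "")
        match = re.search(r":(\d+)/?$", value)
        if match and int(match.group(1)) in HTTP_UI_PORTS:
            port = int(match.group(1))
            found.append((name, port, HTTP_UI_PORTS[port]))
    return found


# Hypothetical configuration; the host names are placeholders.
SAMPLE = """<configuration>
  <property><name>fs.default.name</name><value>hdfs://namenode.example:9000</value></property>
  <property><name>mapred.job.tracker</name><value>jobtracker.example:50030</value></property>
</configuration>"""

if __name__ == "__main__":
    for name, port, what in suspicious_ports(read_properties(SAMPLE)):
        print(f"{name} uses port {port}, which is normally the {what}")
```

To run it against a real cluster, pass the contents of each site file in the Hadoop conf directory to `read_properties` instead of `SAMPLE`.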
Re: There are no tablet servers
Eric Newton 2012-07-18, 19:59
Don't start a tracer on every server. Just start one on a master server. You won't need more than 1 until you get several hundred servers.
Do you have anything in the .out or .err files on the tserver hosts? If the files don't exist, something is failing in the ssh to those hosts.
-Eric
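
The .out/.err check described above could be scripted roughly as follows and run on each tserver host. This is a sketch under assumptions: it guesses that the start scripts write files named `tserver_<hostname>.out` and `tserver_<hostname>.err` into the Accumulo log directory, and the directory and file names in the demo are invented.

```python
import tempfile
from pathlib import Path


def check_startup_files(log_dir, service="tserver"):
    """Summarize the <service>_*.out and <service>_*.err files in log_dir.

    Returns (found, nonempty). found is False when no such files exist,
    which suggests the ssh launch never reached this host. nonempty maps
    each non-empty file name to its last five lines.
    """
    log_path = Path(log_dir)
    if not log_path.is_dir():
        return False, {}
    files = sorted(
        p for ext in ("out", "err") for p in log_path.glob(f"{service}_*.{ext}")
    )
    nonempty = {}
    for path in files:
        text = path.read_text(errors="replace")
        if text.strip():
            nonempty[path.name] = text.splitlines()[-5:]
    return bool(files), nonempty


if __name__ == "__main__":
    # Demo on a throwaway directory; the file names are hypothetical.
    with tempfile.TemporaryDirectory() as demo_dir:
        Path(demo_dir, "tserver_host1.out").write_text("")
        Path(demo_dir, "tserver_host1.err").write_text("Exception in thread main\n")
        found, nonempty = check_startup_files(demo_dir)
        print("startup files present:", found)
        for name, tail in nonempty.items():
            print(name, "->", tail)
```

If `found` comes back False on a host, the ssh step is the first thing to look at; if an .err file has content, its tail usually names the actual startup failure.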