Forum: Managing VoltDB

Post: custer initialization error

custer initialization error
miguelbiscaia
Nov 21, 2012
Hi,

I need some assistance to tackle a startup error in a two cluster environment. I'm receiving the following error during cluster startup:

2012-11-21 12:50:23,144 FATAL [main] HOST: Unable to write topology to ZK, dying
2012-11-21 12:50:23,144 FATAL [main] HOST: Stack trace of thrown exception: java.lang.RuntimeException: Provided 1 host ids when host count is 2
2012-11-21 12:50:23,144 FATAL [main] HOST: org.voltdb.compiler.ClusterConfig.getTopology(ClusterConfig.java:458)
2012-11-21 12:50:23,145 FATAL [main] HOST: org.voltdb.RealVoltDB.registerClusterConfig(RealVoltDB.java:1061)
2012-11-21 12:50:23,145 FATAL [main] HOST: org.voltdb.RealVoltDB.getTopology(RealVoltDB.java:1027)
2012-11-21 12:50:23,145 FATAL [main] HOST: org.voltdb.RealVoltDB.initialize(RealVoltDB.java:413)
2012-11-21 12:50:23,145 FATAL [main] HOST: org.voltdb.VoltDB.initialize(VoltDB.java:729)
2012-11-21 12:50:23,145 FATAL [main] HOST: org.voltdb.VoltDB.main(VoltDB.java:713)

And the deployment.xml file is:

<?xml version="1.0"?>

I'm using voltdb 2.8.4.

Thanks in advance.
Regards.
cluster initialization error
awilson
Nov 21, 2012
Hi,

The deployment file didn't come through but it probably has a line like:
<cluster hostcount="2" ... >
You need to specify a leader host on the voltdb command line:
voltdb create \
host \
catalog mycatalog.jar \
deployment deployment.xml

Could you double check that you are specifying the same host name or ip address for both nodes when you start voltdb?

Thanks,
Andy
Hi, Thank you very much for
miguelbiscaia
Nov 21, 2012
Hi,

The deployment file didn't come through but it probably has a line like:
<cluster hostcount="2" ... >...


Hi,

Thank you very much for your reply.
Indeed, the deployment file contains the cluster configuration:
hostcount="2" sitesperhost="16" kfactor="1"

And I started up the cluster with the following command in both nodes:
voltdb create catalog webstore.jar deployment deployment.xml host blade15

The host blade15 is intended to be the primary node.

Best regards.
Additionally, I can see in
miguelbiscaia
Nov 21, 2012
Additionally, I can see in the log some zookeeper exceptions, like:

2012-11-21 12:50:01,647 DEBUG [ZooKeeperServer] ZK-SERVER: Got user-level KeeperException when processing sessionid:0x11f82f3232800001 type:create cxid:0x2 zxid:0xfffffffffffffffe txntype:unknown reqpath:n/a Error Path:/db Error:KeeperErrorCode = NodeExists for /db
2012-11-21 12:50:01,647 DEBUG [ZooKeeperServer] ZK-SERVER: Processing request:: sessionid:0x11f82f3232800001 type:create cxid:0x2 zxid:0x11f82f3251800001 txntype:-1 reqpath:n/a
2012-11-21 12:50:01,654 DEBUG [ZooKeeperServer] ZK-SERVER: Got user-level KeeperException when processing sessionid:0x11f82f3232800001 type:create cxid:0x3 zxid:0xfffffffffffffffe txntype:unknown reqpath:n/a Error Path:/db/mailboxes Error:KeeperErrorCode = NodeExists for /db/mailboxes
custer initialization error
miguelbiscaia
Nov 21, 2012
Log on the host node:
2012-11-21 17:16:49,527 INFO [Socket Joiner - 0] HOST: Received request type REQUEST_HOSTID
2012-11-21 17:16:49,547 INFO [Socket Joiner - 0] HOST: Heartbeat timeout to host: /10.112.76.47:49176 is 10000 milliseconds
2012-11-21 17:16:49,570 INFO [ZooKeeperServer] JOIN: Joining site 1:-1 known active sites 0:-1, 1:-1
2012-11-21 17:16:49,667 INFO [ZooKeeperServer] JOIN: Shipping ZK snapshot from 0:-1 to 1:-1
2012-11-21 17:17:11,533 INFO [ZooKeeperServer] JOIN: Agreement, Sending fault data 1:-1 to 0:-1 survivors
2012-11-21 17:17:11,534 INFO [ZooKeeperServer] JOIN: Agreement, Sent fault data. Expecting 1 responses.
2012-11-21 17:17:11,535 INFO [ZooKeeperServer] JOIN: Agreement, Received failure message from 0:-1 for failed sites 1:-1 safe txn id 1294971252825391105 failed site 1:-1
2012-11-21 17:17:11,536 INFO [ZooKeeperServer] JOIN: Agreement, handling site faults for newly failed sites 1:-1 initiatorSafeInitPoints {1:-11294971252825391105}
2012-11-21 17:17:11,536 INFO [ZooKeeperServer] ZK-SERVER: Initiating close of session 0x11f8a954ce000001
2012-11-21 17:17:11,542 INFO [ZooKeeperServer] ZK-SERVER: Processed session termination for sessionid: 0x11f8a954ce000001
2012-11-21 17:17:11,736 FATAL [main] HOST: Unable to write topology to ZK, dying
2012-11-21 17:17:11,736 FATAL [main] HOST: Stack trace of thrown exception: java.lang.RuntimeException: Provided 1 host ids when host count is 2
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.compiler.ClusterConfig.getTopology(ClusterConfig.java:458)
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.RealVoltDB.registerClusterConfig(RealVoltDB.java:1061)
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.RealVoltDB.getTopology(RealVoltDB.java:1027)
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.RealVoltDB.initialize(RealVoltDB.java:413)
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.VoltDB.initialize(VoltDB.java:729)
2012-11-21 17:17:11,736 FATAL [main] HOST: org.voltdb.VoltDB.main(VoltDB.java:713)
Log on the other node:
2012-11-21 17:16:22,558 INFO [main] HOST: TOPO: {
"hostcount": 2,
"kfactor": 0,
"partitions": [
{
"master": 0,
"partition_id": 0,
"replicas": [0]
},
{
"master": 1,
"partition_id": 1,
"replicas": [1]
},
{
"master": 0,
"partition_id": 2,
"replicas": [0]
},
{
"master": 1,
"partition_id": 3,
"replicas": [1]
},
{
"master": 0,
"partition_id": 4,
"replicas": [0]
....
...
...
"replicas": [0]
},
{
"master": 1,
"partition_id": 25,
"replicas": [1]
},
{
"master": 0,
"partition_id": 26,
"replicas": [0]
},
{
"master": 1,
"partition_id": 27,
"replicas": [1]
},
{
"master": 0,
"partition_id": 28,
"replicas": [0]
},
{
"master": 1,
"partition_id": 29,
"replicas": [1]
},
{
"master": 0,
"partition_id": 30,
"replicas": [0]
},
{
"master": 1,
"partition_id": 31,
"replicas": [1]
}
],
"sites_per_host": 16
}
2012-11-21 17:16:22,602 INFO [main] HOST: Registering stats mailbox id 1:-2
2012-11-21 17:16:43,048 FATAL [main] HOST: Failed to initialize site tracker with all hosts before timeout
2012-11-21 17:16:43,048 FATAL [main] HOST: Stack trace from crashLocalVoltDB() method:
2012-11-21 17:16:43,048 FATAL [main] HOST: java.lang.Thread.dumpThreads(Native Method)
2012-11-21 17:16:43,048 FATAL [main] HOST: java.lang.Thread.getAllStackTraces(Thread.java:1618)
2012-11-21 17:16:43,048 FATAL [main] HOST: org.voltdb.VoltDB.crashLocalVoltDB(VoltDB.java:564)
2012-11-21 17:16:43,048 FATAL [main] HOST: org.voltdb.RealVoltDB.initialize(RealVoltDB.java:550)
2012-11-21 17:16:43,048 FATAL [main] HOST: org.voltdb.VoltDB.initialize(VoltDB.java:729)
2012-11-21 17:16:43,049 FATAL [main] HOST: org.voltdb.VoltDB.main(VoltDB.java:713)