NodeA Cannot Establish HTTP Connection to NodeB
When HTTP request can't be established between nodes, jobs which are delegated between nodes or jobs running in parallel on more nodes will fail. The error is visible in the Execution History. Each node periodically executes a check-task which checks the HTTP connection to other nodes. If the problem is detected, one of the nodes is suspended, since they can't cooperate with each other.
Time-line describing the scenario:
0s network connection between NodeA and NodeB is down
0-40s a check-task running on NodeA can't establish HTTP connection to NodeB; check may last for 30s until it times-out; there is no re-try, if connection fails even just once, it is considered as unreliable, so the nodes can't cooperate.
status of NodeA or NodeB (the one with shorter uptime) is changed to “suspended”
The following configuration properties set the time intervals mentioned above:
cluster.node.check.checkMinInterval
Periodicity of Cluster node checks, in milliseconds.
Default: 20000
cluster.sync.connection.readTimeout
An HTTP connection response timeout, in milliseconds.
Default: 90000
cluster.sync.connection.connectTimeout
Establishing HTTP connection timeout, in milliseconds.
Default: 7000