This is a follow up ticket to ONOS-5347
We are able to reproduce the StorageException:$Timeout issue.
We are using Cord-Tester framework for Fault Reproduction.
(http://github.com/opencord/cord-tester)
The following test case cluster_exchange.test_cluster_controller_restarts in the framework is run.
The Test Case starts a 3 node cluster where each Instance is a docker Container.
The test kills a docker container one and then restart that Docker Container.
Waits for ONOS to starts in the restarted Docker Container.
Wait for 60 seconds and then run Leaders command in all the three nodes.
If leaders commands passed, then take snapshot of logs and zip it and continue with next Container.
If leaders Failed, take snapshot of the logs and then Halt the Test case.
We are still able to see the problem where leaders commands hangs.
The Test Instruction on how to reproduce the issue is attached in the enclosures.()
The logs are attached in the enclosures.
- relates to
-
ONOS-5992 ONOS HA cluster failure
-
- Closed
-