[jboss-jira] [JBoss JIRA] (JGRP-2253) FD_SOCK is not working in AWS environment

Sibin Karnavar (JIRA) issues at jboss.org
Thu Feb 22 12:15:00 EST 2018


    [ https://issues.jboss.org/browse/JGRP-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13537097#comment-13537097 ] 

Sibin Karnavar edited comment on JGRP-2253 at 2/22/18 12:14 PM:
----------------------------------------------------------------

All timestamps are in UTC

Members in the cluster:  ip-10-93-136-91, ip-10-93-133-149 and ip-10-93-135-215

Node 0: (Master Node / Leader)

ip-10-93-136-91

This node was the leader node and I have killed it at 2018-02-22 16:19:58.186 UTC time. If you see the FD_SOCK Trace timestamp, its not detecting the TCP socket connection break immediately.

2018-02-22 16:21:19.430 TRACE 23603 --- [jgroups-13,ABC_test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (559 secs old)
)



Node 1:

ip-10-93-133-149


2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|17] (2) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458]
2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - I am not the master!
2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - ip-10-93-135-215-41546 is the master for service ABC
2018-02-22 16:21:19.408  INFO 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|18] (3) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320]
2018-02-22 16:21:19.409  INFO 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - I am not the master!
2018-02-22 16:21:19.430 TRACE 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (559 secs old)
)
2018-02-22 16:21:19.434 TRACE 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.434 DEBUG 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: suspecting [ip-10-93-135-215-41546]
2018-02-22 16:21:19.435 DEBUG 23603 --- [jgroups-13,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: broadcasting unsuspect(ip-10-93-135-215-41546)
2018-02-22 16:21:19.435 TRACE 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.436 TRACE 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:19.437 TRACE 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.429 TRACE 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:49.429 TRACE 23603 --- [jgroups-16,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: who-has-sock ip-10-93-133-149-13458
2018-02-22 16:21:49.429 DEBUG 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: suspecting [ip-10-93-135-215-41546]
2018-02-22 16:21:49.430 DEBUG 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: broadcasting unsuspect(ip-10-93-135-215-41546)
2018-02-22 16:21:49.430 TRACE 23603 --- [jgroups-16,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:49.431 TRACE 23603 --- [jgroups-16,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.432 TRACE 23603 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.437 TRACE 23603 --- [jgroups-16,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-136-91-22320: 

mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.527  INFO 23603 --- [jgroups-16,ABC-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: MergeView::[ip-10-93-135-215-41546|20] (3) 

[ip-10-93-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320], 2 subgroups: [ip-10-93-136-91-22320|19] (1) [ip-10-93-136-91-22320], [ip-10-93-135-215-41546|18] (3) [ip-10-93-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-

91-22320]

Node-2

ip-10-93-135-215


2018-02-22 16:21:19.403  INFO 19074 --- [jgroups-24,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|18] (3) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320]
2018-02-22 16:21:19.426 TRACE 19074 --- [jgroups-27,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (1599 secs old)
)
2018-02-22 16:21:19.430 TRACE 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.430 DEBUG 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: suspecting []
2018-02-22 16:21:19.432 TRACE 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.432 TRACE 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:19.432 DEBUG 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: suspecting [ip-10-93-133-149-13458]
2018-02-22 16:21:19.433 DEBUG 19074 --- [jgroups-22,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: broadcasting unsuspect(ip-10-93-133-149-13458)
2018-02-22 16:21:19.433 TRACE 19074 --- [jgroups-27,ABC-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]




was (Author: sibin.karnavar):
All timestamps are in UTC

Members in the cluster:  ip-10-93-136-91, ip-10-93-133-149 and ip-10-93-135-215

Node 0: (Master Node / Leader)

ip-10-93-136-91

This node was the leader node and I have killed it at 2018-02-22 16:19:58.186 UTC time. If you see the FD_SOCK Trace timestamp, its not detecting the TCP socket connection break immediately.

2018-02-22 16:21:19.430 TRACE 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (559 secs old)
)



Node 1:

ip-10-93-133-149


2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|17] (2) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458]
2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - I am not the master!
2018-02-22 16:20:26.917  INFO 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - ip-10-93-135-215-41546 is the master for service SOM-SKS
2018-02-22 16:21:19.408  INFO 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|18] (3) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320]
2018-02-22 16:21:19.409  INFO 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - I am not the master!
2018-02-22 16:21:19.430 TRACE 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (559 secs old)
)
2018-02-22 16:21:19.434 TRACE 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.434 DEBUG 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: suspecting [ip-10-93-135-215-41546]
2018-02-22 16:21:19.435 DEBUG 23603 --- [jgroups-13,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: broadcasting unsuspect(ip-10-93-135-215-41546)
2018-02-22 16:21:19.435 TRACE 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.436 TRACE 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:19.437 TRACE 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.429 TRACE 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:49.429 TRACE 23603 --- [jgroups-16,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: who-has-sock ip-10-93-133-149-13458
2018-02-22 16:21:49.429 DEBUG 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: suspecting [ip-10-93-135-215-41546]
2018-02-22 16:21:49.430 DEBUG 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: broadcasting unsuspect(ip-10-93-135-215-41546)
2018-02-22 16:21:49.430 TRACE 23603 --- [jgroups-16,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:49.431 TRACE 23603 --- [jgroups-16,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.432 TRACE 23603 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.437 TRACE 23603 --- [jgroups-16,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-133-149-13458: received UNSUSPECT message from ip-10-93-136-91-22320: 

mbrs=[ip-10-93-133-149-13458]
2018-02-22 16:21:49.527  INFO 23603 --- [jgroups-16,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-133-149-13458] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: MergeView::[ip-10-93-135-215-41546|20] (3) 

[ip-10-93-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320], 2 subgroups: [ip-10-93-136-91-22320|19] (1) [ip-10-93-136-91-22320], [ip-10-93-135-215-41546|18] (3) [ip-10-93-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-

91-22320]

Node-2

ip-10-93-135-215


2018-02-22 16:21:19.403  INFO 19074 --- [jgroups-24,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] c.w.s.c.ServiceClusterCoordinator        :  -  - Detected change in view membership: [ip-10-93-135-215-41546|18] (3) [ip-10-93

-135-215-41546, ip-10-93-133-149-13458, ip-10-93-136-91-22320]
2018-02-22 16:21:19.426 TRACE 19074 --- [jgroups-27,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: i-have-sock: ip-10-93-136-91-22320 --> 

10.93.136.91:7804 (cache is ip-10-93-133-149-13458: 10.93.133.149:7804 (559 secs old)
ip-10-93-136-91-22320: 10.93.136.91:7804 (0 ms old)
ip-10-93-135-215-41546: 10.93.135.215:7804 (1599 secs old)
)
2018-02-22 16:21:19.430 TRACE 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.430 DEBUG 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: suspecting []
2018-02-22 16:21:19.432 TRACE 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received UNSUSPECT message from ip-10-93-133-149-

13458: mbrs=[ip-10-93-135-215-41546]
2018-02-22 16:21:19.432 TRACE 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received SUSPECT message from ip-10-93-136-91-22320: 

suspects=[ip-10-93-133-149-13458]
2018-02-22 16:21:19.432 DEBUG 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: suspecting [ip-10-93-133-149-13458]
2018-02-22 16:21:19.433 DEBUG 19074 --- [jgroups-22,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: broadcasting unsuspect(ip-10-93-133-149-13458)
2018-02-22 16:21:19.433 TRACE 19074 --- [jgroups-27,SOM-SKS-test_SJX_080220180246_XSJ,ip-10-93-135-215-41546] org.jgroups.protocols.FD_SOCK            :  -  - ip-10-93-135-215-41546: received UNSUSPECT message from ip-10-93-135-215-

41546: mbrs=[ip-10-93-133-149-13458]



> FD_SOCK is not working in AWS environment
> -----------------------------------------
>
>                 Key: JGRP-2253
>                 URL: https://issues.jboss.org/browse/JGRP-2253
>             Project: JGroups
>          Issue Type: Bug
>    Affects Versions: 4.0.10
>         Environment: AWS - EC2
>            Reporter: Sibin Karnavar
>            Assignee: Bela Ban
>
> We have our failure detection defined like below. 
>  <FD_SOCK  external_port="7804" />
>  <FD timeout="3000" max_tries="3" />
> <VERIFY_SUSPECT timeout="3000" />
> Please note that we have used FD instead of FD_ALL in AWS. We will be changing it to FD_ALL later after detailed testing.
> In my local, this is working perfect. As soon as I kill my node, I was able to see that view change was happening immediately with FD_SOCK.
> We were not mentioning the external_port in the FD_SOCK but later I thought it may be an issue with the port and defined it as 7804 and added the same port to the security group that allows to access this port among all the nodes.  So no issue with the port.
> Can you please let us know if we need any additional configurations to make FD_SOCK works well in AWS.
> Thanks,
> Sibin



--
This message was sent by Atlassian JIRA
(v7.5.0#75005)


More information about the jboss-jira mailing list