PadoGrid | Catalogs | Manual | FAQ | Releases | Templates | Pods | Kubernetes | Docker | Apps | Quick Start
This bundle provides scripts, configuration files, and apps for creating a network split-brain environment where you can test Hazelcast's split-brain capabilities.
```bash
install_bundle -download bundle-hazelcast-3-app-perf_test_sb-cluster-sb
```
To prepare you for cluster split-brain situations, this use case provides step-by-step instructions for creating and monitoring a Hazelcast cluster split-brain.
- Vagrant (1)
- VirtualBox (1)
- Hazelcast Desktop (2)
- Hazelcast OSS (3)
- Linux JDK (3)
- This bundle uses PadoGrid pods, which depend on Vagrant and VirtualBox. If you have not installed them, then please download and install them now by following the above links (a quick version check follows this list). For details on PadoGrid pods, see Understanding PadoGrid Pods.
- Hazelcast Desktop is integrated with PadoGrid. We will install it later using `install_padogrid`.
- We need Hazelcast OSS and the Linux JDK for the VirtualBox VMs. We will install them later.
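Before proceeding, you can verify that Vagrant and VirtualBox are installed and on your path. This is just a sanity check; note that the VirtualBox command-line tool is named `VBoxManage` on most platforms.

```bash
# Sanity check: both commands should print version numbers.
vagrant --version
VBoxManage --version
```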
```
apps
└── perf_test_sb
clusters
└── sb
```
This bundle includes the following components.
- Cluster `sb`. The `sb` cluster is configured with five (5) VM members running in the `hz_pod_sb` pod. It includes scripts that use `iptables` to drop TCP packets to split the `sb` cluster into two (2). It is configured with split-brain quorum rules for the following maps.
  - `nw/customers`
  - `nw/orders`
- App `perf_test_sb`. The `perf_test_sb` app is configured to run on a split cluster.
- App `desktop`. The `desktop` app is used to compare data in the split clusters and the merged cluster. It is not included in the bundle because the vanilla desktop works without any modifications. You will install the `desktop` app as one of the steps shown in the Creating Split-Brain section.
Note that the `sb` cluster is configured to run in the `hz_pod_sb` pod with its members running as VM hosts and not Vagrant pod hosts.
We will take the following steps as we set up and run the environment.
- Install Linux products
- Create pod
- Build pod
- Build `perf_test_sb`
- Create `desktop`
Follow the instructions in the subsequent sections.
We need the following products installed before we can set up the Vagrant VMs. Download their tarball distributions from the links below.
Assuming you have installed PadoGrid in the default directory, untar the downloaded tarballs in the `~/Padogrid/products/linux` directory as shown in the example below. If you have installed PadoGrid in a different directory, then make the appropriate changes.
```bash
mkdir ~/Padogrid/products/linux
tar -C ~/Padogrid/products/linux -xzf ~/Downloads/hazelcast-3.12.13.tar.gz
tar -C ~/Padogrid/products/linux -xzf ~/Downloads/jdk-8u401-linux-x64.tar.gz
```
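You can verify the extraction by listing the products directory. The exact directory names depend on the tarballs you downloaded; with the versions above, you would expect to see a Hazelcast 3.12.13 directory and a JDK 8 directory.

```bash
# List the extracted product directories.
ls ~/Padogrid/products/linux
```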
Create a pod named `hz_pod_sb` with five (5) data nodes. The pod name must be `hz_pod_sb` since the bundle's cluster, `sb`, has been paired with that pod name. Take the default values for all prompts except the memory sizes, which you can conserve by reducing to 1024 MiB as shown in the output example below. The included `sb` cluster has been preconfigured with a member max heap size of 512 MiB.
```bash
create_pod -pod hz_pod_sb
```
Input:

Take the default values except for the following prompts.

```
Primary node memory size in MiB [2048]: 1024
Data node memory size in MiB [2048]: 1024
Number of data nodes [2]: 5
Products installation directory path: /Users/dpark/Padogrid/products/linux
Install Avahi? true
```
Output:

```
Please answer the prompts that appear below. You can abort this command at any time
by entering 'Ctrl-C'.

Pod name [hz_pod_sb]:
Primary node name [pnode]:
Data node name prefix [node]:

This machine has the following IP addresses. Choose one from the list. The IP address
must be a private IP address.

192.168.56.1

Host private IP address [192.168.56.1]:
First node IP address' octect [10]:
Primary node memory size in MiB [2048]: 1024
Data node memory size in MiB [2048]: 1024
Number of data nodes [2]: 5
Products installation directory path.
[/Users/dpark/Padogrid/products]:
/Usres/dpark/Padogrid/products/linux
Directory does not exist or not a directory.
Products installation directory path.
[/Users/dpark/Padogrid/products]:
/Users/dpark/Padogrid/products/linux

Install Avahi? This allows VMs to enable local network discovery service via
the mDNS/DNS-SD protocol. All VM host names can be looked up with the suffix
'.local', i.e., pnode.local, node-01.local, etc.
Enter 'true' or 'false' [false]: true
Vagrant box image [ubuntu/jammy64]:

You have entered the following.
Pod name: hz_pod_sb
Primary node name: pnode
Data node name prefix: node
Host private IP address: 192.168.56.1
Node IP addres last octet: 10
Primary node memory size (MiB): 1024
Data node memory size (MiB): 1024
Data node count: 5
Products directory: /Users/dpark/Padogrid/products/linux
Avahi enabled: true
Vagrant box image: ubuntu/jammy64
Enter 'c' to continue, 'r' to re-enter, 'q' to quit: c
```
Build the pod you just created.
```bash
# Build and start the pod
build_pod -pod hz_pod_sb
```
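Building the pod can take several minutes while Vagrant provisions the VMs. Once it completes, you can optionally confirm that all six (6) VMs (pnode plus node-01 through node-05) are running:

```bash
# Optional: check the VM states from the pod directory.
cd_pod hz_pod_sb
vagrant status
```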
If you changed the default memory size of the primary and data nodes when you created the pod, then you can adjust the Hazelcast member min/max heap sizes in the `etc/cluster.properties` file as follows:

```bash
switch_cluster sb
vi etc/cluster.properties
```

Change the heap min/max sizes in the `etc/cluster.properties` file.
```properties
# Heap min and max values in etc/cluster.properties
heap.min=512m
heap.max=512m
```
Log in to `pnode.local` and start the `sb` cluster as follows:
```bash
cd_pod hz_pod_sb
vagrant ssh

# If prompted for a password
password: vagrant

# Once logged in to the Vagrant VM, pnode, execute the following:
switch_cluster sb
start_cluster
start_mc
```
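Optionally, you can check the cluster status from `pnode` before opening the Management Center (assuming your PadoGrid version provides `show_cluster`):

```bash
# Optional: display the member status of the sb cluster.
show_cluster
```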
Enter the following URL in your browser to monitor the Hazelcast cluster from the Management Center.

```
http://pnode.local:8080/hazelcast-mancenter
```
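If the page does not load, you can check from the host OS whether the Management Center is reachable. A minimal check, assuming `curl` is available and `pnode.local` resolves via mDNS:

```bash
# Optional: print the HTTP status code of the Management Center URL.
# 200 (or a 3xx redirect) indicates the web app is serving.
curl -s -o /dev/null -w "%{http_code}\n" http://pnode.local:8080/hazelcast-mancenter
```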
From your host OS, build `perf_test_sb` and run `test_group` as follows:
```bash
cd_app perf_test_sb; cd bin_sh
./build_app
./test_group -prop ../etc/group-factory.properties -run
```
If you haven't installed the Hazelcast Desktop, then install it on your host OS as shown below.
```bash
install_padogrid -product hazelcast-desktop
update_padogrid -product hazelcast-desktop
```
Once installed, create and run the `desktop` app as follows:
```bash
create_app -app desktop -name desktop_sb
cd_app desktop_sb/bin_sh
./desktop
```
You can enter any of the member addresses in the `Locators` text field when you log in from the desktop. For example, `node-01.local:5701` connects to the `node-01.local` member. `User Name` is required and can be any name. `Password` is optional and ignored.
```
Locators: node-01.local:5701
App ID: sys
User Name: foo
Password: <leave blank>
```
From `pnode.local`, run `split_cluster` as follows:
```bash
switch_cluster sb; cd bin_sh
./split_cluster
```
To see the `iptables` rules set by `split_cluster`, run `list_rules` as follows:

```bash
./list_rules
```
`split_cluster` splits the `sb` cluster into the following two (2) clusters:

| Cluster | Nodes |
| ------- | ----- |
| A | node-01.local, node-02.local |
| B | node-03.local, node-04.local, node-05.local |
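Under the hood, `split_cluster` installs `iptables` DROP rules so that the two groups of members can no longer exchange TCP packets. The sketch below illustrates the idea only; the peer IP address is hypothetical, and the bundle's `split_cluster` script is the authoritative version.

```bash
# Illustrative sketch only -- on a Cluster A node, drop traffic to and
# from a Cluster B member (hypothetical address; see the bundle's
# split_cluster script for the actual rules it applies).
sudo iptables -A INPUT  -s 192.168.56.13 -j DROP
sudo iptables -A OUTPUT -d 192.168.56.13 -j DROP
```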
```bash
# See Cluster A (view node-01.local or node-02.local log file)
show_log

# See Cluster B (view node-03.local, node-04.local, or node-05.local log file)
show_log -num 5
```
You should see something like the following in the log files:
Cluster A:

```
Members {size:2, ver:8} [
	Member [192.168.56.11]:5701 - 49a65895-221d-4b26-b562-7c53eec092e1 this
	Member [192.168.56.12]:5701 - 17fcfc4a-5593-47be-bde1-a4f5ed3e50b2
]
```

Cluster B:

```
Members {size:3, ver:6} [
	Member [192.168.56.15]:5701 - 9c41e05d-1acf-449b-a25f-1991c8756617 this
	Member [192.168.56.14]:5701 - 235691e0-bc36-407e-971a-f36189fc0801
	Member [192.168.56.13]:5701 - 3a3c8639-0b7b-4f4f-9daf-e262160126b9
]
```
Try refreshing the Management Center from your browser. You should see the list of members changing sporadically, indicating a network issue.
From your host OS, run `test_group`, which has been preconfigured to connect to Cluster B, i.e., node-03.local, node-04.local, node-05.local (see `etc/hazelcast-client.xml`). `test_group` updates the data that was inserted earlier. We'll compare the new data with the old data in the split clusters.
```bash
cd_app perf_test_sb; cd bin_sh
./test_group -prop ../etc/group-factory.properties -run
```
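To confirm which members the client targets, you can inspect the client configuration. This assumes the member addresses are listed in `<address>` elements, as is typical for a Hazelcast 3.x client XML file:

```bash
# List the member addresses configured for the perf_test_sb client.
cd_app perf_test_sb
grep '<address>' etc/hazelcast-client.xml
```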
- Launch the desktop and log in to Cluster A, e.g., `node-01.local:5701`
- Launch the desktop and log in to Cluster B, e.g., `node-05.local:5701`
```bash
cd_app desktop_sb
cd hazelcast-desktop_<version>/bin_sh

# Launch two (2) instances of desktop
./desktop
./desktop
```
Execute queries on both desktop instances so that we can compare the results later when we merge the clusters.
```sql
--- From each desktop instance, execute the following queries
--- (Note that the desktop supports SQL comments):
select * from nw/customers order by customerId;
select * from nw/orders order by orderId;
```
When you execute the above queries, you should see the following behavior:

| Map | Cluster A | Cluster B |
| --- | --------- | --------- |
| nw/customers | Query Success | Query Success |
| nw/orders | Query Failure (Exception) | Query Success |
Query Exception:

```
com.hazelcast.quorum.QuorumException: Split brain protection exception: quorumRuleWithThreeMembers has failed!
```
The exception occurs in Cluster A because the split-brain quorum size for the `nw/orders` map is configured with three (3) as follows (see `etc/hazelcast.xml`). Cluster A has only two (2) members, which satisfies `quorumRuleWithTwoMembers` for `nw/customers` but not `quorumRuleWithThreeMembers` for `nw/orders`. Cluster B's three (3) members satisfy both rules.
```xml
<quorum name="quorumRuleWithTwoMembers" enabled="true">
    <quorum-size>2</quorum-size>
</quorum>
<quorum name="quorumRuleWithThreeMembers" enabled="true">
    <quorum-size>3</quorum-size>
</quorum>

<map name="nw/customers">
    <merge-policy batch-size="100">LatestUpdateMergePolicy</merge-policy>
    <quorum-ref>quorumRuleWithTwoMembers</quorum-ref>
</map>
<map name="nw/orders">
    <merge-policy batch-size="100">LatestUpdateMergePolicy</merge-policy>
    <quorum-ref>quorumRuleWithThreeMembers</quorum-ref>
</map>
```
You can view the `hazelcast.xml` file as follows:

```bash
cd_cluster sb
less etc/hazelcast.xml
```
From `pnode.local`, run `merge_cluster` as follows:

```bash
switch_cluster sb; cd bin_sh
./merge_cluster
show_log
```
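Optionally, you can verify that the `iptables` rules are gone (assuming `merge_cluster` reverses the rules added by `split_cluster`):

```bash
# Optional: the DROP rules listed earlier should no longer appear.
./list_rules
```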
Upon a successful merge, which takes about a minute, you should see something like the following in all the log files:
```
Members {size:5, ver:11} [
	Member [192.168.56.15]:5701 - 9c41e05d-1acf-449b-a25f-1991c8756617
	Member [192.168.56.14]:5701 - 235691e0-bc36-407e-971a-f36189fc0801
	Member [192.168.56.13]:5701 - 3a3c8639-0b7b-4f4f-9daf-e262160126b9
	Member [192.168.56.12]:5701 - c10d17a9-fe0c-4c22-af7a-67f8dd702b9c
	Member [192.168.56.11]:5701 - 79d0fa6e-d7dd-46d4-8525-52250dee6e80 this
]
```
The management center should also show five (5) members.
The merged cluster should have the exact same data as Cluster B since both maps are configured with `LatestUpdateMergePolicy`.
From your host OS or any of the pods, execute the following:
```bash
stop_cluster -cluster sb
stop_mc -cluster sb
```
From your host OS, execute the following:
```bash
stop_pod -pod hz_pod_sb
```
From your host OS, execute the following:
```bash
remove_pod -pod hz_pod_sb
```
Close the `desktop` app instances by clicking on the close icon.