bidhankhatri.com.np

Unassigned shards in Elasticsearch 7 and 8

2024-06-27T02:16:41+00:00

There are multiple reasons why shards might get unassigned, ranging from misconfigured allocation settings to lack of disk space.

To reassign all unassigned shards in Elasticsearch, you can use the following steps:

STEPS:

Check the status of unassigned shards
Check cluster allocation explanation
Automatic reallocation
Force reallocation of shards

Check current cluster status

curl -XGET -u elastic:password http://172.16.0.1:9200/_cluster/health?pretty=true

{
  "cluster_name" : "devcluster",
  "status" : "red",
  "timed_out" : false,
  "number_of_nodes" : 3,
  "number_of_data_nodes" : 3,
  "active_primary_shards" : 24,
  "active_shards" : 24,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 26,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 48.0
}

Check the status of unassigned shards:
First, you need to identify which shards are unassigned. You can do this by using the _cat/shards API. This will give you a list of all shards in your cluster along with their status.

curl -XGET -u elastic:password http://172.16.0.1:9200/_cat/shards?h=index,shard,prirep,state,unassigned.reason

.monitoring-kibana-7-2024.06.27                               0 p STARTED
.monitoring-kibana-7-2024.06.27                               0 r UNASSIGNED CLUSTER_RECOVERED
.ds-.logs-deprecation.elasticsearch-default-2024.05.29-000001 0 p STARTED
.ds-.logs-deprecation.elasticsearch-default-2024.05.29-000001 0 r UNASSIGNED CLUSTER_RECOVERED
.apm-agent-configuration                                      0 p STARTED
.apm-agent-configuration                                      0 r UNASSIGNED CLUSTER_RECOVERED
.kibana_7.17.13_001                                           0 p STARTED
.kibana_7.17.13_001                                           0 r UNASSIGNED CLUSTER_RECOVERED
.apm-custom-link                                              0 p STARTED
.apm-custom-link                                              0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.25                                   0 p STARTED
.monitoring-es-7-2024.06.25                                   0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-kibana-7-2024.06.24                               0 p STARTED
.monitoring-kibana-7-2024.06.24                               0 r UNASSIGNED CLUSTER_RECOVERED
.ds-ilm-history-5-2024.05.29-000001                           0 p STARTED
.ds-ilm-history-5-2024.05.29-000001                           0 r UNASSIGNED CLUSTER_RECOVERED
.kibana_task_manager_7.17.13_001                              0 p STARTED
.kibana_task_manager_7.17.13_001                              0 r UNASSIGNED CLUSTER_RECOVERED
.tasks                                                        0 p STARTED
.tasks                                                        0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-kibana-7-2024.06.22                               0 p STARTED
.monitoring-kibana-7-2024.06.22                               0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-kibana-7-2024.06.26                               0 p STARTED
.monitoring-kibana-7-2024.06.26                               0 r UNASSIGNED CLUSTER_RECOVERED
.kibana_security_session_1                                    0 p UNASSIGNED INDEX_CREATED
.kibana_security_session_1                                    0 r UNASSIGNED REPLICA_ADDED
.monitoring-kibana-7-2024.06.25                               0 p STARTED
.monitoring-kibana-7-2024.06.25                               0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.27                                   0 p STARTED
.monitoring-es-7-2024.06.27                                   0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-kibana-7-2024.06.21                               0 p STARTED
.monitoring-kibana-7-2024.06.21                               0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.26                                   0 p STARTED
.monitoring-es-7-2024.06.26                                   0 r UNASSIGNED CLUSTER_RECOVERED
.geoip_databases                                              0 p STARTED
.geoip_databases                                              0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-kibana-7-2024.06.23                               0 p STARTED
.monitoring-kibana-7-2024.06.23                               0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.23                                   0 p STARTED
.monitoring-es-7-2024.06.23                                   0 r UNASSIGNED CLUSTER_RECOVERED
.kibana-event-log-7.17.13-000001                              0 p STARTED
.kibana-event-log-7.17.13-000001                              0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.22                                   0 p STARTED
.monitoring-es-7-2024.06.22                                   0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.21                                   0 p STARTED
.monitoring-es-7-2024.06.21                                   0 r UNASSIGNED CLUSTER_RECOVERED
.monitoring-es-7-2024.06.24                                   0 p STARTED
.monitoring-es-7-2024.06.24                                   0 r UNASSIGNED CLUSTER_RECOVERED
.security-7                                                   0 p STARTED
.security-7                                                   0 r UNASSIGNED CLUSTER_RECOVERED

Check cluster allocation explanation:
To understand why shards are unassigned, you can use the _cluster/allocation/explain API. This will provide a detailed explanation of the allocation decisions.

curl -XGET -u elastic:password http://172.16.0.1:9200/_cluster/allocation/explain?pretty"

    {
      "node_id" : "vO3fSByBTxOCFzNsql-95g",
      "node_name" : "dev01",
      "transport_address" : "172.16.0.2:9300",
      "node_attributes" : {
        "ml.machine_memory" : "6067675136",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "2147483648",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "weight_ranking" : 2,
      "deciders" : [
        {
          "decider" : "enable",
          "decision" : "NO",
          "explanation" : "no allocations are allowed due to cluster setting [cluster.routing.allocation.enable=none]"
        }
      ]
    },
    {
      "node_id" : "kXPPZQDzTBu4ClLeWO95qQ",
      "node_name" : "dev02",
      "transport_address" : "172.16.0.3:9300",
      "node_attributes" : {
        "ml.machine_memory" : "6067666944",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "2147483648",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "weight_ranking" : 3,
      "deciders" : [
        {
          "decider" : "enable",
          "decision" : "NO",
          "explanation" : "no allocations are allowed due to cluster setting [cluster.routing.allocation.enable=none]"
        }

Automatic reallocation:
Sometimes, cluster settings might prevent the allocation of shards. You can check your current cluster settings with:

curl -XGET -u elastic:password http://172.16.0.1:9200/_cluster/settings?pretty"

{
    "peristent" : {
        "cluster" : {
            "routing" : {
                "allocation" : {
                    "enable" : "none"
                }
            }
        }
    }
}

If shard allocation is disabled just like above, you can enable it by updating the cluster settings:

curl -XPUT -u elastic:password http://172.16.0.1:9200/_cluster/settings" -H 'Content-Type: application/json' -d '{
  "persistent": {
    "cluster.routing.allocation.enable": "all"
  }
}'

Force reallocation of shards:
If you want to manually allocate an unassigned shard to a specific node, you can use the _cluster/reroute API. Here’s how to force the allocation of a stale primary shard:

curl -XPOST -u elastic:password http://172.16.0.1:9200/_cluster/reroute" -H 'Content-Type: application/json' -d '{
  "commands": [
    {
      "allocate_stale_primary": {
        "index": "index_name",
        "shard": shard_number,
        "node": "node_name",
        "accept_data_loss": true
      }
    }
  ]
}'

Replace index_name, shard_number, and node_name with the appropriate values.
index: The name of the index to which the shard belongs.
shard: The shard number.
node: The name of the node to which you want to assign the shard.
accept_data_loss: Set to true if you want to allocate a stale primary shard, acknowledging that data loss might occur.

To allocate an unassigned replica shard, you can use:

curl -XPOST -u elastic:password http://172.16.0.1:9200/_cluster/reroute" -H 'Content-Type: application/json' -d '{
  "commands": [
    {
      "allocate_replica": {
        "index": "index_name",
        "shard": shard_number,
        "node": "node_name"
      }
    }
  ]
}'

To get the shard number and index name of unassigned shards in Elasticsearch, you can use the _cat/shards API. This API provides detailed information about all the shards in your cluster, including their index names, shard numbers, and current states.

Additional Steps and Considerations:
Check Node Availability: Ensure all nodes are up and running. Unassigned shards might be due to nodes being down or unreachable.

Disk Space: Make sure there is enough disk space on the nodes. Elasticsearch won’t allocate shards to nodes running low on disk space.

By following these steps, you should be able to diagnose and resolve issues related to unassigned shards in your Elasticsearch cluster.

Percona XtraDB Multi-Master Replication cluster setup between 3 nodes

2024-04-22T02:16:41+00:00

This guide describes the steps to establish a Percona XtraDB Cluster v8.0 among three Ubuntu 22.04 nodes.

Install Percona XtraDB Cluster on all hosts that you are planning to use as cluster nodes and ensure you have root access to the MySQL server on each node. In this setup, Multi-Master replication is implemented.

In Multi-Master replication, there are multiple nodes acting as master nodes. Data is replicated between nodes, allowing updates and insertions on a group of master nodes, resulting in multiple copies of the data.

Node	IP
node1	172.16.0.12
node2	172.16.0.13
node3	172.16.0.14

Execute installation commands in all 3 nodes.

apt update
apt install -y wget gnupg2 lsb-release curl
wget https://repo.percona.com/apt/percona-release_latest.generic_all.deb
dpkg -i percona-release_latest.generic_all.deb
apt update
percona-release setup pxc80
apt install percona-xtradb-cluster

Encrypt PXC Traffic

There are two kinds of traffic in Percona XtraDB Cluster: client-server traffic (the one between client applications and cluster nodes), and replication traffic, which includes SST, IST, write-set replication, and various service messages.

Percona XtraDB Cluster supports encryption for all types of traffic. Replication traffic encryption can be configured either automatically or manually. In this guide, we are configuring automatic version.

SST (State Snapshot Transfer) is the full copy of data from one node to another. It’s used when a new node joins the cluster and needs to transfer data from an existing node.

IST (Incremental State Transfer) is a functionality which, instead of transferring the whole state snapshot, catches up with the group by receiving the missing writesets, but only if the writeset is still in the donor’s writeset cache.

Encrypt Relication Traffic

Replication traffic refers to the inter-node traffic, which includes SST, IST, and regular replication traffic.
Percona XtraDB Cluster supports a single configuration option to secure the entire replication traffic, often referred to as SSL automatic configuration. Alternatively, you can configure the security of each channel by specifying independent parameters.
The automatic SSL encryption configuration requires key and certificate files. MySQL generates default key and certificate files and places them in the data directory.
Percona XtraDB Cluster includes the pxc-encrypt-cluster-traffic variable, enabling automatic SSL encryption for SST, IST, and replication traffic.
By default, pxc-encrypt-cluster-traffic is enabled, ensuring a secured channel for replication. This variable is not dynamic and cannot be changed at runtime.
If you wish to disable encryption for replication traffic, you must stop the cluster and update the [mysqld] section of the configuration file on each node with pxc-encrypt-cluster-traffic=OFF. Then, restart the cluster.

But in our case we are not disabling the encryption traffic between the nodes so follow the below steps. Update/Add these variables in /etc/mysql/mysql.conf.d/mysqld.cnf file.

wsrep_node_name=node1
wsrep_node_address=172.16.0.12
pxc_strict_mode=ENFORCING
wsrep_provider=/usr/lib/galera4/libgalera_smm.so
wsrep_cluster_name=pxc-cluster
wsrep_cluster_address=gcomm://172.16.0.12,172.16.0.13,172.16.0.14

Similary add/update above variables in other 2 nodes too. wsrep_node_name and wsrep_node_address variables only need to be updated as per node. Other parameters remain same.

Now Bootstrap the first node. After you configure all PXC nodes, initialize the cluster by bootstrapping the first node. The initial node must contain all the data that you want to be replicated to other nodes.

systemctl start mysql@bootstrap.service

When you start the node using the previous command, it runs in bootstrap mode with wsrep_cluster_address=gcomm://. This tells the node to initialize the cluster with wsrep_cluster_conf_id variable set to 1. After you add other nodes to the cluster, you can then restart this node as normal, and it will use standard configuration again.

To make sure that the cluster has been initialized, run the following:

show status like 'wsrep%';

The output shows that the cluster size is 1 node, it is the primary component, the node is in the Synced state, it is fully connected and ready for write-set replication.

Now copy the certs from node1 to other 2 remaining nodes. Certs will be in path /var/lib/mysql.
It is important that your cluster use the same SSL certificates and keys on all nodes.

bdn@node02:~$ ls /var/lib/mysql/*pem
/var/lib/mysql/ca-key.pem       /var/lib/mysql/client-key.pem   /var/lib/mysql/server-cert.pem
/var/lib/mysql/ca.pem           /var/lib/mysql/private_key.pem  /var/lib/mysql/server-key.pem
/var/lib/mysql/client-cert.pem  /var/lib/mysql/public_key.pem

To verify that the server and client certificates are correctly signed by the same CA certificate, run the following command:

bdn@node02:/var/lib/mysql$ openssl verify -CAfile ca.pem server-cert.pem client-cert.pem
server-cert.pem: OK
client-cert.pem: OK

By default, it generates certificates valid for 10 years.

bdn@node02:/var/lib/mysql$ openssl x509 -enddate -noout -in server-cert.pem
notAfter=Apr 17 13:36:50 2034 GMT

Now start mysql in second node.

systemctl start mysql

Similary, start mysql service in third node too.
Now you can stop the mysql@bootstrap.service service in node1 and start mysql service
Now all 3 nodes are connected to the cluster.

You will see similar logs in /var/log/mysql/error.log file.

2024-04-21T17:21:17.442573Z 1 [Note] [MY-000000] [Galera] ===========================================
=====
View:
  id: 0c233cf9-fe50-11ee-a3e9-dfbfab2fa1fe:31
  status: primary
  protocol_version: 4
  capabilities: MULTI-MASTER, CERTIFICATION, PARALLEL_APPLYING, REPLAY, ISOLATION, PAUSE, CAUSAL_READ
, INCREMENTAL_WS, UNORDERED, PREORDERED, STREAMING, NBO
  final: no
  own_index: 0
  members(3):
        0: 60c1bc1f-0003-11ef-bae7-7347836516bcb, node01
        1: 7b9841ae-0003-11ef-8586-4b8504592b099, node02
        2: 8f92b85a-0003-11ef-b411-5a2ad6404a18f, node03

Now, any changes made to any MySQL server will automatically be reflected on the other nodes.

Site-to-Site VPN between Mikrotik router and Ubuntu 22.04 through strongSwan using IPsec IKEv2

2024-04-13T02:16:41+00:00

We will configure a site-to-site IPsec IKEv2 tunnel between the Mikrotik Router and the StrongSwan server. This will enable secure communication between devices connected behind the Mikrotik router and the StrongSwan server.

flowchart LR; subgraph Mikrotik Site srv01(srv01\n172.16.1.14/24) <--172.16.1.0/24 --> mikrotik((Router)) end subgraph INTERNET VPN((IKEv2)) end subgraph strongSwan Site strongSwan(strongSwan\n2.2.2.2/32 - WAN IP - eth0\n17.16.2.10/24 - LAN IP - eth1\n) -- 172.16.2.0/24 <--> srv02(srv02\n 172.16.2.14/24) end mikrotik(Mikrotik Router\n 1.1.1.1/32 - WAN IP\n172.16.1.1/24 - LAN IP) == IPSec Tunnel <==> INTERNET ==IPSec Tunnel <==> strongSwan style mikrotik fill:#bbf,stroke:#f66,stroke-width:2px,color:#fff,stroke-dasharray: 5 5 style strongSwan fill:#bbf,stroke:#f66,stroke-width:2px,color:#fff,stroke-dasharray: 5 5

Install strongSwan in ubuntu

apt-get install strongswan libcharon-extra-plugins strongswan-pki

vim /etc/ipsec.conf

config setup
        charondebug="all"
        uniqueids=yes
        strictcrlpolicy=no
conn B1-TO-HO
        authby=secret
        left=%defaultroute
        leftid=2.2.2.2
        leftsubnet=172.16.2.0/24
        right=1.1.1.1
        rightsubnet=172.16.1.0/24  
        ike=aes256-sha2_256-modp1024!
        esp=aes256-sha256-modp1024!
        keyingtries=0
        ikelifetime=1h
        lifetime=8h
        dpddelay=30

config setup: This section contains global configuration options for StrongSwan.
charondebug=”all”: This option sets the debugging level for the IKE daemon (charon) to “all”, which means it will log all debugging messages. This can be useful for troubleshooting.
uniqueids=yes: This option ensures that each IKE_SA (Internet Key Exchange Security Association) has a unique ID. This helps avoid conflicts in case multiple connections are established.
strictcrlpolicy=no: This option specifies whether strict certificate revocation list (CRL) checking is enforced. Setting it to “no” means that CRL checking will not be strictly enforced.
conn B1-TO-HO: This section defines a specific connection between two peers.
authby=secret: This option specifies that authentication will be performed using a shared secret key.
left=%defaultroute: This option specifies that the local endpoint (left side) of the connection will be determined based on the default route.
leftid=2.2.2.2: This option specifies the identity (ID) of the local endpoint. In this case, it’s set to the IP address 2.2.2.2.
leftsubnet=172.16.2.0/24: This option specifies the local subnet that will be accessible through the VPN tunnel.
right=1.1.1.1: This option specifies the IP address of the remote endpoint (right side) of the connection.
rightsubnet=172.16.1.0/24: This option specifies the remote subnet that will be accessible through the VPN tunnel.
ike=aes256-sha2_256-modp1024!: This option specifies the IKE (Internet Key Exchange) encryption algorithm, integrity algorithm, and Diffie-Hellman group to be used for negotiating phase 1 of the IPsec tunnel. In this case, AES 256-bit encryption, SHA-256 hashing, and MODP 1024-bit Diffie-Hellman group are used.
esp=aes256-sha256-modp1024!: This option specifies the ESP (Encapsulating Security Payload) encryption algorithm, integrity algorithm, and Diffie-Hellman group to be used for negotiating phase 2 of the IPsec tunnel. In this case, AES 256-bit encryption, SHA-256 hashing, and MODP 1024-bit Diffie-Hellman group are used.
keyingtries=0: This option specifies the number of attempts for establishing the IKE_SA. Setting it to 0 means that no retries will be attempted.
ikelifetime=1h: This option specifies the lifetime of the IKE_SA (Phase 1) in hours.
lifetime=8h: This option specifies the lifetime of the CHILD_SA (Phase 2) in hours.
dpddelay=30: This option specifies the delay (in seconds) before Dead Peer Detection (DPD) is initiated. DPD is used to detect if the peer is still reachable. In this case, DPD will be initiated after 30 seconds of inactivity.

set preshared key:

vim /etc/ipsec.secrets

# This file holds shared secrets or RSA private keys for authentication.

# RSA private key for this host, authenticating it to any other host
# which knows the public part.

2.2.2.2 1.1.1.1 : PSK $I8WC#53D@$#%#

2.2.2.2: This is the local endpoint’s IP address.
1.1.1.1: This is the remote endpoint’s IP address.
PSK: This indicates that a pre-shared key is used for authentication.
$I8WC#53D@$#%#: This is the actual pre-shared key used for authentication.

So, in this case, the pre-shared key $I8WC#53D@$#%# is used for authentication between the local endpoint with IP address 2.2.2.2 and the remote endpoint with IP address 1.1.1.1

systemctl restart strongswan-starter

Now we need to enable IP forwarding. The configuration line "net.ipv4.ip_forward = 1" enables IP forwarding on a Linux system. IP forwarding allows the system to forward packets from one network interface to another, essentially acting as a router or gateway.

vim /etc/sysctl.conf
net.ipv4.ip_forward = 1

root@int-vpn-01:~# sysctl -p
net.ipv4.ip_forward = 1

Allow traffic through ufw firewall

ufw allow from 1.1.1.1/32 proto udp to any port 500,4500 comment 'Mikrotik' 
ufw route allow in on any out on any comment 'for VPN Traffic'

Mikrotik Configuration

 /ip ipsec profile
add name="DH_HO_Profile1" hash-algorithm=sha256 enc-algorithm=aes-256 dh-group=modp1024 lifetime=1d proposal-check=obey nat-traversal=yes dpd-interval=2m dpd-maximum-failures=5

/ip ipsec peer
add name="DH_HO_Peer1" address=2.2.2.2/32 profile=DH_HO_Profile1 exchange-mode=ike2 send-initial-contact=yes 

/ip ipsec identity
add peer=DH_HO_Peer1 auth-method=pre-shared-key secret="$I8WC#53D@$#%#" generate-policy=no

/ip ipsec proposal
add name="DH_HO_Proposal1" auth-algorithms=sha256 enc-algorithms=aes-256-cbc lifetime=30m pfs-group=modp1024

/ip ipsec policy
add dst-address=172.16.2.0/24 peer="DH_HO_Peer1" proposal="DH_HO_Proposal1" sa-dst-address=2.2.2.2 sa-src-address=1.1.1.1 src-address=172.16.1.0/24 tunnel=yes

Further NAT rule:

/ip firewall nat
add chain=srcnat action=accept src-address=172.16.1.0/24 dst-address=172.16.2.0/24 log=yes log-prefix=""

Allow port 500/4500/UDP in Mikrotik

/ip firewall filter
add chain=input action=accept protocol=udp src-port="" dst-port=500,4500 log=n> log-prefix="" 

check phase 1 status:

check phase 2 status:

open UDP port 500,4500 for remote server:

check connectivity:

check ipsec status in strongSwan VPN server:

root@int-vpn-01:~# ipsec status
Security Associations (1 up, 0 connecting):
 B1-TO-HO[13]: ESTABLISHED 5 minutes ago, 2.2.2.2[2.2.2.2]...1.1.1.1[1.1.1.1]
 B1-TO-HO{26}:  INSTALLED, TUNNEL, reqid 1, ESP in UDP SPIs: c006274a_i 01310acd_o
 B1-TO-HO{26}:   172.16.2.0/24 === 172.16.1.0/24
root@int-vpn-01:~#

Now our Mikrotik site can communicate with strongSwan site.

Additional, You have to configure the static route on every node behind the VPN gateway server. Otherwise, the 172.16.1.0/24 network will not be able to reach other 172.16.2.x IPs.

ip route add 172.16.1.0/24 via 172.16.2.10

Monitor HA Cluster running Pacemaker and Corosync using Prometheus and Grafana using Docker

2023-08-15T02:16:41+00:00

We will use Grafana with prometheus in container to monitor High availability cluster running by Pacemaker and Corosync.

Grafana dashboard which we will be using shows the details of a HA cluster running Pacemaker/Corosync. It is built on top of ha_cluster_exporter but it also requires Prometheus node_exporter to be configured on the target nodes, and it also assumes that the target nodes in each cluster are grouped via the job label.

INSTALL HA_CLUSTER_EXPORTER

mkdir /usr/local/ha_cluster_exporter
cd /usr/local/ha_cluster_exporter
wget https://github.com/ClusterLabs/ha_cluster_exporter/releases/download/1.3.3/ha_cluster_exporter-amd64.gz 
gunzip ha_cluster_exporter-amd64.gz
mv ha_cluster_exporter-amd64 ha_cluster_exporter
chmmod +x ha_cluster_exporter

Create systemd file.

vim /etc/systemd/system/ha_cluster_exporter.service

[Unit]
Description=HA Cluster Exporter

[Service]
User=root
WorkingDirectory=/usr/local/ha_cluster_exporter
ExecStart=/usr/local/ha_cluster_exporter/ha_cluster_exporter
Restart=always

[Install]
WantedBy=multi-user.target

Start and enable service.

systemctl enable --now ha_cluster_expoter

INSTALL NODE_EXPORTER

wget https://github.com/prometheus/node_exporter/releases/download/v1.6.1/node_exporter-1.6.1.linux-amd64.tar.gz 
tar xvf node_exporter-1.6.1.linux-amd64.tar.gz 
mv node_exporter-1.6.1.linux-amd64.tar.gz node_exporter
mv node_exporter /usr/local/
cd /usr/local
mv node_exporter-1.6.1.linux-amd64 node_exporter

vim /etc/systemd/system/node_exporter.service

[Unit]
Description=Prometheus Node Exporter

[Service]
User=root
WorkingDirectory=/usr/local/node_exporter
ExecStart=/usr/local/node_exporter/node_exporter --collector.systemd --collector.systemd.unit-include="pcsd|pacemaker|corosync)".service
Restart=always

[Install]
WantedBy=multi-user.target

systemctl enable --now node_exporter

Now Build Prometheus and Grafana Container
For the persistent container storage, execute below commands.

mkdir /var/Docker_home
cd /var/Docker_home
mkdir Prometheus Grafana
mkdir /Prometheus/PromDB
mkdir /Grafana/data
chmod 777 /prometheus/PromDB
chmod 777 /Grafana/data

Create Docker Compose file.

vim docker-compose.yml

version: '3'
services:
  prometheus:
    image: prom/prometheus
    container_name: prometheus
    ports:
      - 9090:9090
    volumes:
      - ./Prometheus/prometheus.yml:/etc/prometheus/prometheus.yml
      - ./Prometheus/PromDB:/prometheus

  grafana:
    image: grafana/grafana-oss
    container_name: grafana
    ports:
      - 3000:3000
    volumes:
      - ./Grafana/data:/var/lib/grafana

The code defines Docker services: Prometheus for monitoring (port 9090, config and data persistence), and Grafana for visualization (port 3000, data persistence). Docker volumes ensure host-container data sharing, allowing configuration and data to be stored persistently across restarts in respective directories.

Create prometheus config file.

vim /var/Docker_home/Prometheus/prometheus.yml

global:
  scrape_interval:     10s
  evaluation_interval: 10s

scrape_configs:
  - job_name: "ha-cluster"
    static_configs:
      - targets: ['ram-01.bidhankhatri.com.np:9664', 'ram-02.bidhankhatri.com.np:9664', 'ram-01.bidhankhatri.com.np:9100', 'ram-02.bidhankhatri.com.np:9100']

Port 9664 for ha_cluster_exporter and Port 9100 for node_exporter

docker-compose up -d

Pulling prometheus (prom/prometheus:)...
latest: Pulling from prom/prometheus
d5c4df21b127: Pull complete
2f5f7d8898a1: Pull complete
300c29bb5b04: Pull complete
be6ad5a51a35: Pull complete
ea6cf9f81dfe: Pull complete
b5ac85a4be54: Pull complete
d32980b63d51: Pull complete
502ed6d3bdc8: Pull complete
7bed70210741: Pull complete
3b19398e1689: Pull complete
d358eb0a0392: Pull complete
d6eaeaf54563: Pull complete
Digest: sha256:d6ead9daf2355b9923479e24d7e93f246253ee6a5eb18a61b0f607219f341a80
Status: Downloaded newer image for prom/prometheus:latest
Pulling grafana (grafana/grafana-oss:)...
latest: Pulling from grafana/grafana-oss
4db1b89c0bd1: Pull complete
312681f4cad0: Pull complete
8b7b65888846: Pull complete
dd9c3d04d541: Pull complete
959325519a8e: Pull complete
16cb2df7bffd: Pull complete
94d1f5f5bfea: Pull complete
e3281a1a7e8f: Pull complete
5c0c2b741753: Pull complete
Digest: sha256:423040d62678074111e4e72d7dcef23480a94eb4f21b9173204d1a5ee972ec59
Status: Downloaded newer image for grafana/grafana-oss:latest
Creating prometheus ... done
Creating grafana    ... done

docker ps

CONTAINER ID   IMAGE                 COMMAND                  CREATED         STATUS              PORTS                                       NAMES
1fb48738a930   prom/prometheus       "/bin/prometheus --c…"   2 minutes ago   Up About a minute   0.0.0.0:9090->9090/tcp, :::9090->9090/tcp   prometheus
f5362d262246   grafana/grafana-oss   "/run.sh"                2 minutes ago   Up About a minute   0.0.0.0:3000->3000/tcp, :::3000->3000/tcp   grafana

Both Grafana and Prometheus are up now.

CONFIGURE GRAFANA

Go to http://YOURIPADDRESS:3000 to access grafana UI. Default username/password is admin/admin. change it after login.

1st connect Grafana to Prometheus:
Home » Connections » Add new connection » Search for Prometheus » Create a Prometheus data source » update prometheus server URL: http://YOURIPADDRESS:9090
Click on Save & Test
Second, Import Grafana Dashboard for HA cluster. Ref: https://grafana.com/grafana/dashboards/12229-ha-cluster-details/
Home » Dashboard » New [ drop down] » Import » Put ID 12229 and click on Load » Choose prometheus which we configured earlier
Click on Import

Similar graph you will see.

GFS2 Filesystem setup in RHEL8 with Pacemaker and Corosync

2023-07-14T02:16:41+00:00

We will configure Pacemaker/Corosync to enable the sharing of a disk between two nodes through the GFS2 clustered filesystem.

flowchart TB subgraph Vmware ram-01.bidhankhatri.com.np\n10.12.6.10-->Shared-Disk-/dev/sdb ram-02.bidhankhatri.com.np\n10.12.6.11-->Shared-Disk-/dev/sdb end

GFS2 File system

GFS2 (Global File System 2) is a cluster file system designed for use in Linux-based environments. It is an enhanced version of the original GFS (Global File System), which was developed by Red Hat. GFS2 allows multiple servers to have concurrent read and write access to a shared file system, providing high availability and scalability for data storage.

GFS2 is commonly used in scenarios where multiple servers need simultaneous access to a shared file system, such as in high-performance computing (HPC) clusters, database clusters, or virtualization environments. It provides a reliable and scalable solution for managing data across a cluster of Linux servers.

Now lets start the setup.

Cluster setup with Pacemaker/Corosync
GFS2 Setup

Various components of the cluster stack (corosync, pacemaker, etc.) has to configured.

Pacemaker:

Pacemaker is a high-availability cluster resource manager — software that runs on a set of hosts (a cluster of nodes) in order to preserve integrity and minimize downtime of desired services (resources).

Corosync:

Corosync is an cluster messaging and membership service that is often used in conjunction with the Pacemaker software to build high availability (HA) clusters. It provides a reliable and scalable communication infrastructure for coordinating and synchronizing the activities of nodes in a cluster. Corosync operates as a cluster messaging layer, enabling the exchange of information and cluster-related events among the nodes.

Cluster setup with Pacemaker/Corosync

We will setup 2 node cluster and share the disk between them.
Nodes:

ram-01.bidhankhatri.com.np
ram-02.bidhankhatri.com.np

Prerequisites:

Update both node with its IP in each other /etc/hosts file. Its always recommended to update host details in /etc/hosts file when you are trying to setup any cluster instead depending upon the external DNS.

vim /etc/hosts

10.12.6.10 ram-01.bidhankhatri.com.np
10.12.6.11 ram-02.bidhankhatri.com.np

Ensure NTP is set up and synchronize the time between nodes to avoid significant time differences, which could adversely affect cluster performance. Verify this configuration on both nodes.

ntpstat

Now first enable the HighAvailability repo on both nodes and install packages. After that start and enable pcsd, pacemaker and corosync service.

yum install pcs pacemaker fence-agents-all
systemctl enable --now pcsd
systemctl enable --now pacemaker
systemctl enable --now corosync

Enable ports required for HighAvailability

firewall-cmd --permanent --add-service=high-availability
firewall-cmd --reload

The installed packages will create a hacluster user with a disabled password. While this is fine for running pcs commands locally, the account needs a login password in order to perform such tasks as syncing the corosync configuration, or starting and stopping the cluster on other nodes.

[root@ram-01 ~]# passwd hacluster
[root@ram-02 ~]# passwd hacluster

CONFIGURE COROSYNC

On either node, use pcs cluster auth to authenticate as the hacluster user:

[root@ram-01 ~]# pcs host auth ram-01.bidhankhatri.com.np ram-02.bidhankhatri.com.np
Username: hacluster
Password: ******
ram-01.bidhankhatri.com.np: Authorized
ram-02.bidhankhatri.com.np: Authorized
[root@ram-01 ~]#

NOTE: If you are using RHEL7/Centos7 or Fedora then command to authenticate hosts is:
pcs cluster auth ram-01.bidhankhatri.com.np ram-02.bidhankhatri.com.np

Next, use below command on the same node to generate and synchronize the corosync between the cluster nodes. We will be setting “gfs-cluster” as our cluster name.

[root@ram-01 ~]# pcs cluster setup gfs-cluster ram-01.bidhankhatri.com.np ram-02.bidhankhatri.com.np

NOTE: In Centos7, command is:
pcs cluster setup --name gfs-cluster ram-01.bidhankhatri.com.np ram-02.bidhankhatri.com.np

Check cluster status:

[root@ram-01 ~]# pcs cluster status
Cluster Status:
 Status of pacemakerd: 'Pacemaker is running' (last updated 2023-07-07 15:03:12 -04:00)
 Cluster Summary:
  * Stack: corosync
  * Current DC: ram-01.bidhankhatri.com.np (version 2.0.0-10.el8-b67d8d0de9) - partition with quorum
  * Last updated: Fri Jul 7 16:11:18 2023
  * Last change: Fri Jul 7 16:11:00 2023 by hacluster via crmd on ram-01.bidhankhatri.com.np
  * 2 nodes configured
  * 0 resources instances configured
Node List:
  * Online: [ram-01.bidhankhatri.com.np ram-02.bidhankhatri.com.np ]

PCSD Status:
  ram-01.bidhankhatri.com.np: Online
  ram-02.bidhankhatri.com.np: Online

A Red Hat High Availability cluster requires that you configure fencing for the cluster. So before doing the GFS2 setup, we have to configure fencing first.

STONITH SETUP

pcs stonith create vmfence fence_vmware_rest pcmk_host_map="ram-01.bidhankhatri.com.np:ram-01-vm;ram-02.bidhankhatri.com.np:ram-02-vm" ip=http://192.168.2.1 ssl_insecure=1 username="adminuser@bidhankhatri.com.np" password="****" delay=10 pcmk_monitor_timeout=120s

The above provided command is used to create a fencing resource in the Pacemaker cluster manager using the VMware REST API. Fencing, also known as STONITH (Shoot The Other Node In The Head)

pcs stonith create vmfence: This is the command to create a new fencing resource named “vmfence” in Pacemaker.
fence_vmware_rest: This specifies the fence agent to be used, which in this case is the VMware REST API fence agent. This agent allows Pacemaker to communicate with VMware vSphere to fence a misbehaving node.
pcmk_host_map=”ram-01.bidhankhatri.com.np:ram-01-vm;ram-02.bidhankhatri.com.np:ram-02-vm”: This option specifies the mapping between the hostnames of the nodes in the Pacemaker cluster and the corresponding virtual machine (VM) names in the VMware environment. It indicates which VM corresponds to which cluster node.
ip=http://192.168.2.1 This option specifies the IP address or hostname of the VMware vCenter server.
ssl_insecure=1: This option tells the fence agent to ignore SSL certificate validation errors. Use this option only if you have a self-signed or untrusted SSL certificate on your vCenter server.
username=”adminuser@bidhankhatri.com.np”: This option specifies the username to authenticate with the VMware vCenter server.

GFS2 SETUP

Enable repo: rhel-8-for-x86_64-resilientstorage-rpms

On both nodes of the cluster, install the lvm2-lockd, gfs2-utils, and dlm packages.

yum install lvm2-lockd gfs2-utils dlm

On both nodes of the cluster, set the use_lvmlockd configuration option in the /etc/lvm/lvm.conf file to use_lvmlockd=1.

use_lvmlockd=1

Set the global Pacemaker parameter no-quorum-policy to freeze.

[root@ram-01 ~]# pcs property set no-quorum-policy=freeze

By default, when quorum is lost, all resources on the remaining partition are immediately stopped. This is the safest and most optimal option, but GFS2 requires quorum to function. If quorum is lost, both GFS2 applications and the GFS2 mount cannot be stopped correctly. To solve this, set no-quorum-policy to freeze when using GFS2. This means the remaining partition will remain inactive until quorum is regained.

To configure a GFS2 file system in a cluster, it is necessary to establish a dlm resource, which is a mandatory dependency. In this case, the provided example creates the dlm resource within a resource group called "locking."

[root@ram-01 ~]# pcs resource create dlm --group locking ocf:pacemaker:controld op monitor interval=30s on-fail=fence

Clone the locking resource group so that the resource group can be active on both nodes of the cluster.

pcs resource clone locking interleave=true

Set up an lvmlockd resource as part of the locking resource group.

pcs resource create lvmlockd --group locking ocf:heartbeat:lvmlockd op monitor interval=30s on-fail=fence

Check the status of the cluster to ensure that the locking resource group has started on both nodes of the cluster.

# pcs status --full

Cluster name: gfs-cluster
Status of pacemakerd: 'Pacemaker is running' (last updated 2023-07-09 08:03:40 -04:00)
Cluster Summary:
  * Stack: corosync
  * Current DC: ram-01.bidhankhatri.com.np (version 2.1.4-5.e18_7.2-dc6eb4362e) - partition with quorum
  * Last updated: Sun Jul 09 08:03:41 2023
  * Last change: Sun Jul 9 08:03:37 2023 by root via cibadmin on ram-01.bidhankhatri.com.np
  * 2 ndoes configured
  * 5 resource instances configured

Node List:
  * Online: [ ram-01.bidhankhatri.com.np (1) ram-02.bidhankhatri.com.np (2) ]

Full List of Resources:
  * Clone Set: locking-clone [locking]:
    * Resource Group: locking:0:
      * dlm    (ocf::pacemaker:controld):       Started ram-01.bidhankhatri.com.np
      * lvmlockd    (ocf:pacemaker:lvmlockd):         Started ram-01.bidhankhatri.com.np
    * Resource Group: locking:1:
      * dlm    (ocf::pacemaker:controld):       Started ram-02.bidhankhatri.com.np
      * lvmlockd    (ocf:pacemaker:lvmlockd):         Started ram-02.bidhankhatri.com.np
  * vmfence     (stonith:fence_vmware_rest):    started ram-01.bidhankhatri.com.np

Migrration Summary:

Tickets:

PCSD Status:
  ram-01.bidhankhatri.com.np: Online
  ram-02.bidhankhatri.com.np: Online

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled

On one node, create a shared volume group. Create volume group "shared_vg"

vgcreate --shared shared_vg /dev/sdb
  Physical volume "/dev/vdb" successfully created.
  Volume group "shared_vg" successfully created
  VG shared_vg starting dlm lockspace
  Starting locking.  Waiting until locks are ready...

In 2nd node.

vgchange --lockstart shared_vg
  VG shared_vg starting dlm lockspace
  Starting locking.  Waiting until locks are ready...

lvcreate --activate sy -l 100%FREE -n shared_lv shared_vg
  Logical volume "shared_lv" created.

[root@ram-01 ~]# mkfs.gfs2 -j2 -p lock_dlm -t gfs-cluster:gfs2 /dev/shared_vg/shared_lv
/dev/shared-vg is a symbolic link to /dev/dm-7
this will destory any data on /dev/dm-7
Are you sure you want to proceed? [y/n] y
Discarding device contents (may take a while on large devices): Done
Adding journals: Done
Building resource groups: Done
Creating quota file: Done
Writing superblock and syncing: Done
Device:              /dev/shared_vg/shared_lv
Block size:          4096
Device size:         8.00 GB (2096128 blocks)
Filesystem size:     8.00 GB (2096128 blocks)
Journals:            2
Journal size:        32MB
Resource groups:     34
Locking protocol:    "lock_dlm"
Lock table:          "gfs-cluster:gfs2"
UUID:                6c5a011c-188a-48d1-adc8-774b256b2850

• -p lock_dlm specifies that we want to use the kernel’s DLM.
• -j 2 indicates that the filesystem should reserve enough space for two journals (one for each node that will access the filesystem).
• -t gfs-cluster:gfs2 specifies the lock table name. The format for this field is clustername:fsname.
For clustername, we need to use the same value we specified originally with pcs cluster setup (which is also the value of cluster_name in /etc/corosync/corosync.conf). If you are unsure what your cluster name is, you can look in /etc/corosync/corosync.conf or execute the command.
pcs cluster corosync ram-01.bidhankhatri.com.np | grep -i "Cluster name".

Create an LVM-activate resource for logical volume to automatically activate that logical volume on all nodes.
Create an LVM-activate resource named sharedlv for the logical volume shared_lv in volume group shared_vg.
Below command creates a Pacemaker cluster resource called “sharedlv” that manages a logical volume named “shared_lv” belonging to the volume group “shared_vg.” The resource agent used is “ocf:heartbeat:LVM-activate,” and the LV is configured to be activated in shared mode, allowing multiple nodes to access it concurrently. The VG access mode is set to “lvmlockd” to enable distributed locking for concurrent VG access.

pcs resource create sharedlv --group shared_vg ocf:heartbeat:LVM-activate lvname=shared_lv vgname=shared_vg activation_mode=shared vg_access_mode=lvmlockd

pcs: It stands for “Pacemaker Configuration System” and refers to the command-line tool used for managing Pacemaker clusters.
resource create: This command is used to create a new resource in the Pacemaker cluster.
sharedlv: It is the name given to the resource being created.
–group shared_vg: It specifies that the resource should be added to the resource group named “shared_vg.” A resource group is a logical grouping of resources that are managed together.
ocf:heartbeat:LVM-activate: It specifies the resource agent to be used for managing the shared logical volume. In this case, the Heartbeat OCF resource agent is used. The LVM-activate resource agent is responsible for activating an LVM logical volume.
lvname=shared_lv: It specifies the name of the logical volume to be managed by the resource. In this case, the LV is named “shared_lv.”
vgname=shared_vg: It specifies the name of the volume group (VG) to which the logical volume belongs. In this case, the VG is named “shared_vg.”
activation_mode=shared: It specifies the activation mode for the LV. “Shared” mode indicates that the LV can be activated by multiple nodes simultaneously.
vg_access_mode=lvmlockd: It specifies the access mode for the volume group. "lvmlockd" is a locking daemon that provides a distributed lock manager for LVM, allowing multiple nodes to access the VG concurrently.

Clone the new resource group.

pcs resource clone shared_vg interleave=true

Configure ordering constraints to ensure that the locking resource group that includes the dlm and lvmlockd resources starts first.

[root@ram-01 ~]# pcs constraint order start locking-clone then shared_vg-clone

Configure colocation constraints to ensure that the vg resource group start on the same node as the locking resource group.

[root@ram-01 ~]# pcs constraint colocation add shared_vg-clone with locking-clone

On both nodes in the cluster, verify that the logical volumes are active. There may be a delay of a few seconds.

[root@ram-01 ~]# lvs
  LV         VG          Attr       LSize
  shared_lv shared_vg  -wi-ao-----  <8.00g

[root@ram-02 ~]# lvs
  LV         VG          Attr       LSize
  shared_lv shared_vg  -wi-ao-----  <8.00g

Create a file system resource to automatically mount each GFS2 file system on all nodes. You should not add the file system to the /etc/fstab file because it will be managed as a Pacemaker cluster resource.

Create a file system resource to automatically mount each GFS2 file system on all nodes. You should not add the file system to the /etc/fstab file because it will be managed as a Pacemaker cluster resource.

pcs resource create sharedfs --group shared_vg ocf:heartbeat:Filesystem device="/dev/shared_vg/shared_lv" directory="/app" fstype="gfs2" options=noatime op monitor interval=10s on-fail=fence

resource create: This command is used to create a new resource in the Pacemaker cluster.
sharedfs: It is the name given to the resource being created.
–group shared_vg: It specifies that the resource should be added to the resource group named “shared_vg.”
ocf:heartbeat:Filesystem: It specifies the resource agent to be used for managing the filesystem resource. In this case, the Heartbeat OCF resource agent for Filesystem is used.
device=”/dev/shared_vg/shared_lv”: It specifies the device associated with the filesystem resource. In this case, the filesystem is located on the logical volume /dev/shared_vg/shared_lv.
directory=”/app”: It specifies the directory where the filesystem should be mounted. In this case, the filesystem will be mounted on the directory /app.
fstype=”gfs2”: It specifies the filesystem type. In this case, the filesystem type is “gfs2,” which stands for Global File System 2.
options=acl,noatime: Specifies the mount options for the filesystem. The acl option enables Access Control Lists (ACLs), which provide more granular access control on files and directories. The noatime option disables updating the access time for files when they are accessed, potentially improving performance.
op monitor interval=10s on-fail=fence: It defines the resource’s monitoring behavior. The monitor operation will be performed on the resource every 10 seconds (interval=10s). If the monitor operation fails, the node where the resource is running will be fenced (on-fail=fence), indicating a failure and allowing the resource to be recovered on another node.

Verification steps:

mount | grep gfs2
/dev/mapper/shared_vg-shared_lv on /app type gfs2 (rw,noatime,acl)

mount | grep gfs2
/dev/mapper/shared_vg-shared_lv on /app type gfs2 (rw,noatime,acl)

df -h /mnt/gfs/
Filesystem                       Size    Used   Avail   Use%  Mounted on
/dev/mapper/shared_vg-shared_lv  8.0G    67M    8.0G    1%    /app

check cluster status:

# pcs status --full

Cluster name: gfs-cluster
Status of pacemakerd: 'Pacemaker is running' (last updated 2023-07-09 08:03:40 -04:00)
Cluster Summary:
  * Stack: corosync
  * Current DC: ram-01.bidhankhatri.com.np (version 2.1.4-5.e18_7.2-dc6eb4362e) - partition with quorum
  * Last updated: Sun Jul 09 08:03:41 2023
  * Last change: Sun Jul 9 08:03:37 2023 by root via cibadmin on ram-01.bidhankhatri.com.np
  * 2 ndoes configured
  * 9 resource instances configured

Node List:
  * Online: [ ram-01.bidhankhatri.com.np (1) ram-02.bidhankhatri.com.np (2) ]

Full List of Resources:
  * Clone Set: locking-clone [locking]:
    * Resource Group: locking:0:
      * dlm    (ocf::pacemaker:controld):       Started ram-01.bidhankhatri.com.np
      * lvmlockd    (ocf:pacemaker:lvmlockd):         Started ram-01.bidhankhatri.com.np
    * Resource Group: locking:1:
      * dlm    (ocf::pacemaker:controld):       Started ram-02.bidhankhatri.com.np
      * lvmlockd    (ocf:pacemaker:lvmlockd):         Started ram-02.bidhankhatri.com.np
  * vmfence     (stonith:fence_vmware_rest):    started ram-01.bidhankhatri.com.np
  * Clone Set: shared_vg-clone [shared_vg]:
    * Resource Group: shared_vg:0:
      * sharedlv    (ocf::heartbeat:LVM-activate):       Started ram-01.bidhankhatri.com.np
      * sharedfs    (ocf:heartbeat:Filesystem):         Started ram-01.bidhankhatri.com.np
    * Resource Group: shared_vg:1:
      * sharedlv    (ocf::heartbeat:LVM-activate):       Started ram-02.bidhankhatri.com.np
      * sharedfs    (ocf:heartbeat:Filesystem):         Started ram-02.bidhankhatri.com.np

Migrration Summary:

Tickets:

PCSD Status:
  ram-01.bidhankhatri.com.np: Online
  ram-02.bidhankhatri.com.np: Online

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled

SAML Authentication for AWS OpenSearch with Okta and Role mapping

2023-05-21T02:16:41+00:00

We are going to set up IdP-initiated (Okta) SAML authentication for AWS OpenSearch. We will create two Okta groups: “opensearch-admin” and “opensearch-user,” and define different roles for OpenSearch.

Introduction

SAML authentication for OpenSearch Dashboards allows you to utilize your existing identity provider for offering single sign-on (SSO) on Amazon OpenSearch Service domains running OpenSearch or Elasticsearch 6.7 or later. To enable SAML authentication, fine-grained access control must be enabled.

Instead of authenticating through Amazon Cognito or the internal user database, SAML authentication for OpenSearch Dashboards allows you to utilize third-party identity providers to log in, manage fine-grained access control, search your data, and create visualizations. OpenSearch Service supports providers that adhere to the SAML 2.0 standard, including Okta, Keycloak, Active Directory Federation Services (ADFS), Auth0, and AWS IAM Identity Center (the successor to AWS Single Sign-On).

SAML authentication for Dashboards is specifically designed for accessing OpenSearch Dashboards through a web browser. It’s important to note that your SAML credentials cannot be used to directly make HTTP requests to the OpenSearch or Dashboards APIs.

SAML configuration overview

This documentation assumes that you have an existing identity provider and some familiarity with it.

The OpenSearch Dashboards login flow can take one of two forms:

Service provider (SP) initiated: You navigate to Dashboards (for example, https://my-domain.us-east-1.es.amazonaws.com/_dashboards), which redirects you to the login screen. After you log in, the identity provider redirects you to Dashboards.
Identity provider (IdP) initiated: You navigate to your identity provider ( E.g Okta), log in, and choose OpenSearch Dashboards from an application directory.

OpenSearch Service provides two single sign-on URLs, SP-initiated and IdP-initiated, but you only need the one that matches your desired OpenSearch Dashboards login flow. Here In our case, We are going to setup IdP initiated setup.

Regardless of the authentication type you choose, the objective is to log in through your identity provider and obtain a SAML assertion that includes your username (required) and any backend roles (optional, but recommended). This information enables fine-grained access control to assign permissions to SAML users. In external identity providers, backend roles are commonly referred to as ‘roles’ or ‘groups’.

STEPS:

Identity Provider (Okta) setup
Prepare OpenSearch Service for SAML configuration
SAML configuration in OpenSearch Service Domain and Okta
Validate Okta users access and role

Identity Provider (Okta) setup

( Create 2 groups “opensearch-admin” and “opensearch-user” and assign users to it. )

Signup for Okta Account
Create Users in Okta
Create Groups in Okta
Assign Users to Groups

Login to your Okta Admin page. If you don’t have Okta then you can singup for Okta free trial account which will be valid for 30 days. To get trail access, go to this page and fill the form. https://www.okta.com/free-trial/

Create Users in Okta: Directory » People » Add Person
Create multiple users according to your need. Here we created 2 users “bidhan.khatri” and “ram.thapa”

Create Groups in Okta: Directory » Groups » Add Group
We will create 2 groups: “opensearch-admin” and “opensearch-user.”
Assign user “bidhan.khatri” to “opensearch-admin” and user “ram.thapa” to “opensearch-user”.
“opensearch-admin” is a Okta group for admin access in OpenSearch Dashboard and “opensearch-user” for read access which we will configure later.

Assign Users to Groups:
Directory » Groups » opensearch-user » Assign people ( Click + sign of the user “ram.thapa” )
Similary, do for another group.
Directory » Groups » opensearch-admin » Assign people ( select user “bidhan.khatri”)

Create OpenSearch in AWS

If you haven’t created OpenSearch then follow the below steps, otherwise go to next section “SAML configuration in OpenSearch Service Domain and Okta”

Open AWS OpenSearch Service and Click on Create domain.
Domain Name: bdn
Domain Created Method: Standard create
Templates: Dev/test
Deployment Options(s): Domain without standby
Availability Zones(s): 1-AZ
Engine options: 2.5(latest)
Data nodes: t3.small.search
Number of Nodes: 1
Storage Type: EBS
EBS storage size per node: 10G
Network: Public Access
Fine-grained access control: Create master user // This credential will be used for logging opensearch dashboard.
Access Policy: Domain access Policy > Only use fine-grained access control
Other things as default
Now look at the summary and confirm for new domain and click “create”

SAML configuration in OpenSearch Service Domain and Okta

Go to Amazon OpenSearch Service and in your domain. Go to “bdn Domain”
Click on Actions » Edit security configuration

Tick on “Enable SAML Authentication”

We will be using the Service Provider entity ID and IdP-initiated SSO URL for Okta SAML configuration.

The OpenSearch Dashboards login flows below step:
You navigate to your identity provider (Okta), log in, and choose OpenSearch Dashboards from an application directory.

We will complete rest of the above OpenSearch Service SAML configuration after the Okta SAML configuration.
Above SAML information is required for Okta SAML setup.

Okta SAML Configuration

Go back to your OKTA ADMIN PAGE and choose Applications
Click on Create App Integration Choose SAML 2.0 and click NEXT

App name: OpenSearch
SAML Settings
Single sign-on URL: https://search-bdn-gu3ogldk6nakq776cama6dmjq4.us-east-2.es.amazonaws.com/_dashboards/_opendistro/_security/saml/acs/idpinitiated [ IdP - initiated SSO URL from bdn domain]
Put tick on Use this for Recipient URL and Destination URL
Audience URI (SP Entity ID): https://search-bdn-gu3ogldk6nakq776cama6dmjq4.us-east-2.es.amazonaws.com [ Service provider entity ID from bdn domain]
( Both URL’s are copied from SAML configuration in OpenSearch Service Domain )

Default RelayState: leave it blank
Name ID format: Select EmailAdress from drop down
Application Username: Okta username
Update application username on: Create and update

Attribute Statements (optional)
Name: http://schemas.xmlsoap.org/ws/2005/05/identity/claims/emailaddress
Name: Select URI Reference from dropdown
value: user.email

Group Attribute Statements (optional)
Name:http://schemas.xmlsoap.org/claims/Group
Normal Format: Unspecified
Filter: Starts with “opensearch”

Now click on Next

As our both Okta Groups “opensearch-admin” and “opensearch-user” naming starts with string opensearch. Here while configuring group section, we defined opensearch in Group attribute and used filter Starts with to catch both groups.

Are you a customer or partner? Tick on “I’m a software vendor. I’d like to integrate my app with Okta” » Finish

Choose Sign on menu. Under SAML Setup: Click on “View SAML setup instructions”
Go down and select and copy all the content of IDP metadata section. This is an Okta Identity provider metadata for SAML configutation. We have to use this XML content in OpenSearch service. Copy below XML information from the box to somewhere for now.

Assign Groups to Application “OpenSearch”:
Go to option Assignments menu and click on assign » Assign to Groups Choose both groups “opensearch-admin” and “opensearch-user”. Click on Assign

As you can see now both groups are assigned to application OpenSearch.

Back to AWS OpenSearch domain SAML authentication options

Under the Import IdP metadata section:
Metadata fromm IdP: Copy the Okta Identity Provider metadata which we copied earlier here in the IdP box.
IdP entity ID: It will auto populate once you provide IdP metadata.
SAML master username - optional: Leave it blank.
SAML master backend role - optional: opensearch-admin (Okta group which need full permission in OpenSearch Dashboard.)

Click on Additional settings:
Subject key - optional: Leave it blank.
Roles Key - optional: http://schemas.xmlsoap.org/claims/Group
Session time to live: 60 minutes
and save the changes.

Validate Okta users access and RBAC

Now login the Okta dashboard from your user “bidhan.khatri”.
Click on “My Apps” » under “work” » click on “OpenSearch” app to open the AWS OpenSearch Dashboard. User will get full access as user “bidhan.khatri” belongs to okta group “opensearch-admin”

All the users belongs to okta group “opensearch-admin” will receive full permission in OpenSearch Dashboard. We have defined to get full access for “opensearch-admin” while configuring SAML for AWS OpenSearch domain.
Similary, We can setup Role-based access control (RBAC) using Okta Group based on the OpenSearch Role. For okta group “opensearch-user” to give Read Only access in all index in OpenSearch dashboard, we have to login OpenSearch Dashboard with full permission user first and create a role in OpenSearch for “opensearch-user” group and map to it.

Give READ Access to Group “opensearch-user”

First, login OpenSearch from the user belonging to Okta group “opensearch-admin”. We need full permission user access to setup a new role. We will be giving only read permission to all the index to group user “opensearch-user”

Security » Roles » Create role » Role-Read » Index permissisions
Index: *
Index permmissions: read get
Tenant permissions: global_tenant

click Update

Go to Mapped users » Map users » In Backend roles » Add “opensearch-user” » Map

Backend Role is basically a Group in external authentication system like AD,LDAP with whom OpenSearch role is mapped so that instead of giving access to the 100 different users, we can simply provide access to a single AD group.

Now once user ram.thapa login OpenSearch Dashboard, he will see only read access given to all the indices.
To check your permission, Click on your user icon » click View roles and identities.”
He will get below roles defined.

Ansible Lab Environment setup in Docker container

2021-08-22T02:16:41+00:00

For ansible lab environment setup, first I will generate ssh key for non-root user and then create a Dockerfile to build a docker image. After that, we will start the container and you will be able to practice your Ansible playbook in the Docker container.

Generate ssh key

ssh-keygen -f keycontainer

Generating public/private rsa key pair.
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in keycontainer.
Your public key has been saved in keycontainer.pub.


The key's randomart image is:
+---[RSA 3072]----+
|     . .ooo=..   |
|      +...+ =    |
|     o ..+oo .   |
|    E o +==o     |
|   o + =S+o+.    |
|  . + =.o.o.o    |
|   . oo+....     |
|  ..o.+o.        |
|   o**..         |
+----[SHA256]-----+

The command above will create two files, private and public keys. We have to add a public key to the container.

ls                                                                       
keycontainer     keycontainer.pub

In Dockerfile, I am using latest version of ubuntu. Dockerfile will install ssh package and run it. It also copy keycontainer.pub key file which we generated earlier to container. And for privilege access I have added non-root user to sudoers file. Here user will be authenticated as key based.

FROM ubuntu:latest
MAINTAINER bdn@bidhankhatri.com.np

ARG USERNAME=thunder
EXPOSE 22

# Apt update & apt install required packages
RUN apt update && apt -y install openssh-server sudo net-tools less


# Add a non-root user
RUN useradd -ms /bin/bash $USERNAME

# Remove no-needed packages
RUN apt -y autoremove && apt -y autoclean && apt -y clean

# Create the ssh directory and authorized_keys file
USER $USERNAME
RUN mkdir /home/$USERNAME/.ssh
COPY keycontainer.pub /home/$USERNAME/.ssh/authorized_keys

USER root
RUN chown $USERNAME /home/$USERNAME/.ssh/authorized_keys && \
chmod 600 /home/$USERNAME/.ssh/authorized_keys
RUN echo "${USERNAME} ALL=(ALL) NOPASSWD: ALL " >> /etc/sudoers

# Run ssh service
RUN service ssh start

CMD ["/usr/sbin/sshd","-D"]

Dockerfile Instruction:
FROM To specify the parent image.
EXPOSE is for documentating. To define which port through which to access your container application.
ARG for defining variable that can be passed at build time.
RUN install your application and packages required to container. It executes any commands on top of the current image and creates a new layer by committing the results.
USER switch to the non root user first and later to root user to perform their task.
CMD Arguments passed to the entrypoint. If ENTRYPOINT is not set (defaults to /bin/sh -c), the CMD will be the commands the container executes.

Now build Image from a Dockerfile.

docker build -t ubuntu-lab .

[+] Building 4.6s (14/14) FINISHED
 => [internal] load build definition from Dockerfile                                                                                                                                                   0.0s
 => => transferring dockerfile: 752B                                                                                                                                                                   0.0s
 => [internal] load .dockerignore                                                                                                                                                                      0.0s
 => => transferring context: 2B                                                                                                                                                                        0.0s
 => [internal] load metadata for docker.io/library/ubuntu:latest                                                                                                                                       3.3s
 => [1/9] FROM docker.io/library/ubuntu:latest@sha256:82becede498899ec668628e7cb0ad87b6e1c371cb8a1e597d83a47fac21d6af3                                                                                 0.0s
 => [internal] load build context                                                                                                                                                                      0.0s
 => => transferring context: 625B                                                                                                                                                                      0.0s
 => CACHED [2/9] RUN apt update && apt -y install openssh-server sudo net-tools less                                                                                                                   0.0s
 => CACHED [3/9] RUN useradd -ms /bin/bash thunder                                                                                                                                                     0.0s
 => CACHED [4/9] RUN apt -y autoremove && apt -y autoclean && apt -y clean                                                                                                                             0.0s
 => CACHED [5/9] RUN mkdir /home/thunder/.ssh                                                                                                                                                          0.0s
 => [6/9] COPY keycontainer.pub /home/thunder/.ssh/authorized_keys                                                                                                                                     0.0s
 => [7/9] RUN chown thunder /home/thunder/.ssh/authorized_keys && chmod 600 /home/thunder/.ssh/authorized_keys                                                                                         0.3s
 => [8/9] RUN echo "thunder ALL=(ALL) NOPASSWD: ALL " >> /etc/sudoers                                                                                                                                  0.3s
 => [9/9] RUN service ssh start                                                                                                                                                                        0.3s
 => exporting to image                                                                                                                                                                                 0.1s
 => => exporting layers                                                                                                                                                                                0.1s
 => => writing image sha256:3aadca7b60cb4bc97df0956539fcf8116295f58234a096242b682deba5df3439                                                                                                           0.0s
 => => naming to docker.io/library/ubuntu-lab                                                                                                                                                          0.0s

Now view the newly created docker image by executing below command.

docker images

REPOSITORY                            TAG       IMAGE ID       CREATED         SIZE
ubuntu-lab                            latest    3aadca7b60cb   53 seconds ago   230MB

now run container by executing below command.

docker run -d -p 2022:22 ubuntu-lab

Container port to be accessible through docker host, we need to publish it. Above command will map port 2022 of the docker host to port 22 of the container.

It’s better to centralize all your ssh key so I have copied my newly generated ssh key to my home directory ssh folder. Now, to access container through ssh, execute below command.

cp containerkey ~/.ssh/
ssh thunder@localhost -p2022 -i ~/.ssh/containerkey

OR
simply add in your ssh config file like below so that from next time u can easily access container.

less ~/.ssh/config

Host localhost
        HostName localhost
        User thunder
        Port 2022
        IdentityFile ~/.ssh/keycontainer

Now try to ssh your container.

ssh localhost

Welcome to Ubuntu 20.04.2 LTS (GNU/Linux 5.10.25-linuxkit x86_64)

 * Documentation:  https://help.ubuntu.com
 * Management:     https://landscape.canonical.com
 * Support:        https://ubuntu.com/advantage

This system has been minimized by removing packages and content that are
not required on a system that users do not log into.

To restore this content, you can run the 'unminimize' command.
Last login: Sun Aug 22 11:00:31 2021 from 172.17.0.1

thunder@da64f58578e3:~$

Container SSH part is completed. Now follow the ansible config part.

Ansible

I hope you have already installed ansible on your control host. If yes then first create an inventory file like below.

cat inventory.ini 

[local]
localhost

Now check ansible connection through AD-Hoc Command

ansible local -i inventory.ini -m ping       

localhost | SUCCESS => {
    "ansible_facts": {
        "discovered_interpreter_python": "/usr/bin/python3"
    },
    "changed": false,
    "ping": "pong"
}

ansible local -i inventory.ini -a 'hostname -I' 

localhost | CHANGED | rc=0 >>
172.17.0.2

That’s it.

Setup is now completed. Now you can test your ansible playbooks in Docker container.

Add and remove nodes in your Elasticsearch Cluster

2021-08-01T02:16:41+00:00

I will be adding a new node elk-04 in the existing 3 nodes Elasticsearch cluster and again remove it later.

I suggest you test the below steps in your staging environment first. I am working on ES version 7.5.1 with 3 nodes cluster where Xpack security is enabled. I have used self-signed certificates on my ES cluster.
I hope while configuring TLS during cluster setup earlier you have used –keep-ca-key option while generating certificates. It means the output zip file will contain ca/ca.key file along with ca/ca.crt file. ca.crt and ca.key are two CA (Certificate Authority) files that needed here to sign a new certificate for the node elk-04.
Elasticsearch has two levels of communications, Transport communications and HTTP communications. The transport protocol is used for internal communications between Elasticsearch nodes, and the HTTP protocol is used for communications from clients to the Elasticsearch cluster. In my case, I am going to define the certificate in only the transport part here.

Add new node elk-04 on /etc/hosts file of all nodes.

16.0.110 elk-01
16.0.111 elk-02
16.0.112 elk-03
16.0.115 elk-04

Download the same ES version and install it on a new node.

[root@elk-04 ~]# wget https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-7.5.1-x86_64.rpm
[root@elk-04 ~]# yum localinstall elasticsearch

Add new node in existing ES cluster

Step 1: Generate a new server certificate signed by existing ca.

[root@elk-01 lab-certs]# pwd
/root/lab-certs

[root@elk-01 lab-certs]# ls
drwxr-xr-x 2 root root   34 Aug  2 20:13 ca

[root@elk-01 lab-certs]# ls ca/
ca.crt  ca.key

Define your new node in yml syntax like below. If you want to add more nodes then you can define accordingly. Follow the yml syntax.

vim newnode.yml

 instances:
  - name: 'elk-04'
    dns: [ 'elk-04' ]
    ip: [ '172.16.0.115' ]

[root@elk-01 lab-certs]# ls
ca  newnode.yml

Now generate a new certificate for new node elk-04
The elasticsearch-certutil command simplifies the creation of certificates for use with Transport Layer Security (TLS) in the Elastic Stack.

/usr/share/elasticsearch/bin/elasticsearch-certutil cert --ca-cert ~/lab-certs/ca/ca.crt --ca-key ~/lab-certs/ca/ca.key --days 3650 --pem --in ~/lab-certs/newnode.yml --out ~/lab-certs/new_node.zip

Parameters
cert: Specifies to generate new X.509 certificates and keys. This parameter cannot be used with the csr or ca parameters.
–ca-cert : Specifies the path to an existing CA certificate (in PEM format). You must also specify the –ca-key : parameter. The –ca-cert parameter cannot be used with the ca or csr parameters.
–ca-key : Specifies the path to an existing CA private key (in PEM format). You must also specify the –ca-cert parameter. The –ca-key parameter cannot be used with the ca or csr parameters.
–days: Specifies an integer value that represents the number of days the generated certificates are valid. The default value is 1095. This parameter cannot be used with the csr parameter.
–pem: Generates certificates and keys in PEM format instead of PKCS#12. This parameter cannot be used with the csr parameter.
–in: Specifies the file that is used to run in silent mode. The input file must be a YAML file. This parameter cannot be used with the ca parameter.
–out: Specifies a path for the output files.

[root@elk-01 lab-certs]# ll
total 8
drwxr-xr-x 2 root root   34 Aug  2 20:13 ca
-rw-r--r-- 1 root root   82 Aug  2 20:15 newnode.yml
-rw------- 1 root root 2574 Aug  2 20:20 new_node.zip

[root@elk-01 lab-certs]# unzip new_node.zip
Archive:  new_node.zip
   creating: elk-04/
  inflating: elk-04/elk-04.crt
  inflating: elk-04/elk-04.key

You can verify the node certificate signed by particular CA or not. It should show OK.

[root@elk-01 lab-certs]# openssl verify -verbose -CAfile ca/ca.crt elk-04/elk-04.crt
elk-04/elk-04.crt: OK

If you want to look at the certificate details then execute below command. It will show certs details.

[root@elk-01 lab-certs]# openssl x509 -in elk-04/elk-04.crt -text -noout

Step 2: Now copy the new certificate and CA certificate to new node elk-04 and place it in /etc/elasticsearch/certs folder.

In elk-04 node:

mkdir /etc/elasticsearch/certs

[root@elk-04 certs]# pwd
/etc/elasticsearch/certs

[root@elk-04 certs]# ll
total 12
-rw-r--r-- 1 root elasticsearch 1200 Aug  2 20:36 ca.crt
-rw-r--r-- 1 root elasticsearch 1180 Aug  2 20:26 elk-04.crt
-rw-r--r-- 1 root elasticsearch 1679 Aug  2 20:26 elk-04.key

Step 3: configure elasticsearch.yml

vim /etc/elasticsearch/elasticsearch.yml

cluster.name: lab-elk
node.name: elk-04
node.master: true
node.data: true
bootstrap.memory_lock: false
network.host: 0.0.0.0
discovery.zen.ping.unicast.hosts: ["elk-01", "elk-02", "elk-03"]

indices.query.bool.max_clause_count: 8192
search.max_buckets: 250000
path.data: /var/lib/elasticsearch
path.logs: /var/log/elasticsearch
xpack.security.enabled: true
xpack.security.transport.ssl.enabled: true
xpack.security.transport.ssl.key: certs/elk-04.key
xpack.security.transport.ssl.certificate: certs/elk-04.crt
xpack.security.transport.ssl.certificate_authorities: [ "certs/ca.crt"]

Step 5: start ES service

systemctl start elasticsearch
systemctl enable elasticsearch

Now your new node is added to the Elasticsearch cluster. you can verify this through dev tools also.

GET _cluster/health 

{
  "cluster_name" : "lab-elk",
  "status" : "green",
  "timed_out" : false,
  "number_of_nodes" : 4,
  "number_of_data_nodes" : 4,
  "active_primary_shards" : 36,
  "active_shards" : 72,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 0,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 100.0
}

After new nodes added in cluster, ES instantly starts distributing shards between them. You can verify it through Kibana Stack Monitoring.

Remove node from existing ES cluster

Step 1: check all indices in removing the host first.

curl -s -u elastic:y3pphNGwzEeJXeWr7y8zcj -XGET "http://elk-01:9200/_cat/shards" | grep -i "elk-04"

.monitoring-kibana-7-2021.08.01   0 r STARTED   8610   1.8mb 172.16.0.115 elk-04
.monitoring-kibana-7-2021.07.30   0 r STARTED   8639   1.6mb 172.16.0.115 elk-04
.monitoring-logstash-7-2021.07.30 0 p STARTED 155466   9.6mb 172.16.0.115 elk-04
securelog-2021-07-20              0 r STARTED      4  74.4kb 172.16.0.115 elk-04
.monitoring-es-7-2021.07.28       0 p STARTED 380580 208.9mb 172.16.0.115 elk-04
.monitoring-es-7-2021.07.30       0 p STARTED 374714 191.8mb 172.16.0.115 elk-04
messageslog-2021-07-20            0 p STARTED     46 175.7kb 172.16.0.115 elk-04
.monitoring-logstash-7-2021.07.29 0 r STARTED 155466   9.6mb 172.16.0.115 elk-04
.apm-agent-configuration          0 p STARTED      0    284b 172.16.0.115 elk-04
.monitoring-logstash-7-2021.08.03 0 r STARTED  12114 949.6kb 172.16.0.115 elk-04
.monitoring-kibana-7-2021.08.03   0 p STARTED    673 192.1kb 172.16.0.115 elk-04
.monitoring-kibana-7-2021.08.02   0 p STARTED   8640   1.7mb 172.16.0.115 elk-04
filebeat-7.5.0-2021.06.10-000001  0 p STARTED   2660   1.6mb 172.16.0.115 elk-04
.monitoring-logstash-7-2021.08.02 0 p STARTED 155466   9.9mb 172.16.0.115 elk-04
.monitoring-kibana-7-2021.07.29   0 r STARTED   8640   1.5mb 172.16.0.115 elk-04

Step 2: Identify the IP address of the Elasticsearch node that needs to be removed from the cluster. When the command is executed, Elasticsearch tries to move the existing shards out of the node that will be removed and moves it to other nodes in the cluster.

PUT _cluster/settings
{
  "transient" :{
     "cluster.routing.allocation.exclude._ip" : "172.16.0.115"
   }
}

now execute Step 1 command again to verify or you can also verify through cluster health. relocating_shards shard should show 0 if everything goes fine.

GET _cluster/health 

{
  "cluster_name" : "lab-elk",
  "status" : "green",
  "timed_out" : false,
  "number_of_nodes" : 4,
  "number_of_data_nodes" : 4,
  "active_primary_shards" : 37,
  "active_shards" : 74,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 0,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 100.0
}

Step 3: Now you can shutdown that node. or stop elasticsearch service.

systemctl stop elasticsearch

Step 4: Now remove above rule again.

PUT _cluster/settings
{
  "transient" :{
     "cluster.routing.allocation.exclude._ip" : ""
   }
}

If you check your cluster health again then you will find only 3 nodes in your cluster now.

GET _cluster/health

{
  "cluster_name" : "lab-elk",
  "status" : "green",
  "timed_out" : false,
  "number_of_nodes" : 3,
  "number_of_data_nodes" : 3,
  "active_primary_shards" : 36,
  "active_shards" : 72,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 0,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 100.0
}

HAProxy configuration for Windows Exchange Server 2016/2019

2021-07-24T02:16:41+00:00

HAProxy is a free, very fast and reliable solution offering high availability, load balancing, and proxying for TCP and HTTP-based applications. It is particularly suited for very high traffic web sites and powers quite a number of the world’s most visited ones.

graph TD A(mail.bidhankhatri.com.np) --> B[mail1.bidhankhatri.com.np] A --> C[mail2.bidhankhatri.com.np] A --> D[mail3.bidhankhatri.com.np] A --> E[mail4.bidhankhatri.com.np]

Nginx vs HAProxy: I prefer haproxy over open source nginx in this exchange application scenario. Haproxy provides better load balancing, better logging and provides nice stats interface than nginx. TCP load balancing is not available in open source nginx but nginx plus does provide it. Nginx plus is a commercial one so you have to pay to get that feature. We can also monitor and visualize haproxy metrics stats in real time as well as through Grafana as new haproxy is built with the Prometheus exporter.

The term SSL Termination means that you are performing all encryption and decryption at the edge of your network, such as at the load balancer (haproxy). The load balancer strips away the encryption and passes the messages in the clear to your backend servers. This is known as SSL offloading. We can SSL offload at haproxy end and ignored SSL verification to internal backend mail servers. You can also store the CA certificate on the load balancer and reference it. Here, I have used both.
SSL termination has few benefits. These include the following:

You can maintain certificates in fewer places, making your job easier.
You don’t need to expose your servers to the Internet for certificate renewal purposes.
Servers are unburdened from the task of processing encrypted messages, freeing up CPU time.

HAProxy can run in two different modes: TCP or HTTP. When operating in TCP mode, we say that it acts as a layer 4 proxy. In HTTP mode, we say that it acts as a layer 7 proxy.

Layer 7 – Application: application protocols like HTTP, SSH and SMTP
Layer 4 – Transport: data transfer protocols like TCP and UDP

We have used both HTTP & TCP mode here. If you need further details then you can click on this link. https://www.haproxy.com/blog/layer-4-and-layer-7-proxy-mode/

In Exchange 2016 and Exchange 2019, the Client Access services are stateless . In other words, because all processing for the mailbox happens in the backend services on the Mailbox server, it doesn’t matter which instance of the Client Access service in an array of Client Access services receives each individual client request. This means that session affinity is no longer required at the load balancer level. This allows inbound connections to Client Access services to be balanced by using simple load balancing techniques such as DNS round-robin. It also allows hardware load balancing devices to support significantly more concurrent connections.

Each web service has a virtual file to tell its state, called HealthCheck.htm. Let HAProxy use the contents of this file for the server health check. That way it’ll know to redirect clients if one of the services is down, even though the Exchange server in question may still be listening on port 443.

I am using haproxy version 2.0.13 on Ubuntu 20.04.1 LTS. We also tighten our TLS security by allowing only TLS 1.2 & 1.3. By default, Outlook clients 2010/2013 in windows 7 will try to connect exchange mail server through TLS 1.0 & TLS 1.1 so for that you have to do some changes in TLS part on your windows 7 pc. Windows 7 supports TLS 1.1 and TLS 1.2. But these protocol versions are not enabled on it by default. On Windows 8 and higher these protocols are enabled by default. Earlier I have written an article explaining enabling TLS 1.1 & TLS 1.2 in windows 7 pc. You can check this link if you are interested. https://bidhankhatri.com.np/windows/windows7-enable-tls12/

Configuration details:

Only allowed if HTTP host header contains mail.bidhankhatri.com.np www.mail.bidhankhatri.com.np autodiscover.bidhankhatri.com.np www.autodiscover.bidhankhatri.com.np
Exchange Admin Center is only allowed from specific networks.
TLS 1.2 & above are only allowed.
HAProxy metrics are monitored through Prometheus.

PEM file layout for HAProxy

cat certificate.crt intermediates.pem private.key > /etc/haproxy/proxy.pem

Mozilla SSL configuration generator tool.The goal of this tool is to help operational teams with the configuration of TLS. If you are interested then you can have a look. https://ssl-config.mozilla.org/#server=haproxy&server-version=2.0.13&config=intermediate

HAProxy config file

vim /etc/haproxy/haproxy.cfg

global
log /dev/log local0

maxconn 8000
log /dev/log local1 notice
chroot /var/lib/haproxy
stats socket /run/haproxy/admin.sock mode 660 level admin expose-fd listeners
stats timeout 30s
user haproxy
group haproxy
daemon

# Default SSL material locations
ca-base /etc/ssl/certs
crt-base /etc/ssl/private
ssl-default-bind-options ssl-min-ver TLSv1.2 no-tls-tickets

#If you want to enable TLS 1.0 & TLS 1.1 also then use below line.
#ssl-default-bind-ciphers ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256:ECDHE-ECDSA-AES256-GCM-SHA384:ECDHE-RSA-AES256-GCM-SHA384:ECDHE-ECDSA-CHACHA20-POLY1305:ECDHE-RSA-CHACHA20-POLY1305:DHE-RSA-AES128-GCM-SHA256:DHE-RSA-AES256-GCM-SHA384:DHE-RSA-CHACHA20-POLY1305:ECDHE-ECDSA-AES128-SHA256:ECDHE-RSA-AES128-SHA256:ECDHE-ECDSA-AES128-SHA:ECDHE-RSA-AES128-SHA:ECDHE-ECDSA-AES256-SHA384:ECDHE-RSA-AES256-SHA384:ECDHE-ECDSA-AES256-SHA:ECDHE-RSA-AES256-SHA:DHE-RSA-AES128-SHA256:DHE-RSA-AES256-SHA256:AES128-GCM-SHA256:AES256-GCM-SHA384:AES128-SHA256:AES256-SHA256:AES128-SHA:AES256-SHA:DES-CBC3-SHA:TLS_AES_128_GCM_SHA256:TLS_AES_256_GCM_SHA384:TLS_CHACHA20_POLY1305_SHA256:@SECLEVEL=1


# This ciphers should be on production: This should be used if u want to disable TLS1.0 & TLS1.1
ssl-default-bind-ciphers ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256:ECDHE-ECDSA-AES256-GCM-SHA384:ECDHE-RSA-AES256-GCM-SHA384:ECDHE-ECDSA-CHACHA20-POLY1305:ECDHE-RSA-CHACHA20-POLY1305:DHE-RSA-AES128-GCM-SHA256:DHE-RSA-AES256-GCM-SHA384:DHE-RSA-CHACHA20-POLY1305:ECDHE-ECDSA-AES128-SHA256:ECDHE-RSA-AES128-SHA256:ECDHE-ECDSA-AES128-SHA:ECDHE-RSA-AES128-SHA:ECDHE-ECDSA-AES256-SHA384:ECDHE-RSA-AES256-SHA384:ECDHE-ECDSA-AES256-SHA:ECDHE-RSA-AES256-SHA:DHE-RSA-AES128-SHA256:DHE-RSA-AES256-SHA256:AES128-GCM-SHA256:AES256-GCM-SHA384:AES128-SHA256:AES256-SHA256:AES128-SHA:AES256-SHA:DES-CBC3-SHA:TLS_AES_128_GCM_SHA256:TLS_AES_256_GCM_SHA384:TLS_CHACHA20_POLY1305_SHA256

tune.ssl.default-dh-param 2048

defaults
log global
mode http
option httplog
option dontlognull

retries 3 # Try to connect up to 3 times in case of failure
timeout connect 5s # 5 seconds max to connect or to stay in queue
timeout http-keep-alive 1s # 1 second max for the client to post next request
timeout http-request 15s # 15 seconds max for the client to send a request
timeout queue 30s # 30 seconds max queued on load balancer
timeout client 300s
timeout server 300s



errorfile 400 /etc/haproxy/errors/400.http
errorfile 403 /etc/haproxy/errors/403.http
errorfile 408 /etc/haproxy/errors/408.http
errorfile 500 /etc/haproxy/errors/500.http
errorfile 502 /etc/haproxy/errors/502.http
errorfile 503 /etc/haproxy/errors/503.http
errorfile 504 /etc/haproxy/errors/504.http
no option http-use-htx

#---------------------------------------------------------------------
#HAProxy Monitoring Config
#---------------------------------------------------------------------
listen stats
bind :1111
mode http
option forwardfor
option httpclose
stats enable
stats uri /
stats refresh 5s
stats show-legends
stats realm Haproxy\ Statistics
stats auth admin:1password2$


frontend fe_prom_exporter
bind :8004
option http-use-htx
http-request use-service prometheus-exporter if { path /metrics }
stats enable
stats uri /stats
stats refresh 10s

#-----------------------
# FrontEnd Begins
# -------------------
#
#
frontend fe_mail
# receives traffic from clients
bind :80
http-response set-header X-Frame-Options SAMEORIGIN
http-response set-header X-Content-Type-Options nosniff
http-response set-header Strict-Transport-Security max-age=63072000


mode http
maxconn 10000

# Allow Exchange Admin Center to certain private network only
acl private_network src XXX.XXX.XXX.XXX/24 XXX.XXX.XXX.XXX/24
acl ecp_req url_beg /ecp
http-request deny if ecp_req !private_network


redirect scheme https code 301 if !{ ssl_fc } # redirect 80 -> 443 (for owa)
bind *:443 ssl crt /etc/haproxy/proxy.pem alpn h2,http/1.1

acl xmail hdr(host) -i mail.bidhankhatri.com.np www.mail.bidhankhatri.com.np autodiscover.bidhankhatri.com.np www.autodiscover.bidhankhatri.com.np

acl autodiscover url_beg /Autodiscover
acl autodiscover url_beg /autodiscover
acl mapi url_beg /mapi
acl rpc url_beg /rpc
acl owa url_beg /owa
acl owa url_beg /OWA
acl eas url_beg /Microsoft-Server-ActiveSync
acl eas url_beg /Microsoft-Server-activeSync
acl ecp url_beg /ecp
acl ews url_beg /EWS
acl ews url_beg /ews
acl oab url_beg /OAB
acl default_for_mail url_beg /

use_backend be_ex2019_owa if xmail owa
use_backend be_ex2019_autodiscover if xmail autodiscover
use_backend be_ex2019_mapi if xmail mapi
use_backend be_ex2019_activesync if xmail eas
use_backend be_ex2019_ews if xmail ews
use_backend be_ex2019_rpc if xmail rpc
use_backend be_ex2019_default if xmail default_for_mail


frontend fe_exchange_imaps
mode tcp
option tcplog
bind :993 name imaps # ssl crt /etc/ssl/certs/exchange_certificate_and_key_nopassword.pem <-- No need, certificate is read straight from the Exchange servers.
default_backend be_exchange_imaps

frontend fe_exchange_smtp
mode tcp
option tcplog
bind :25 name smtp 
default_backend be_exchange_smtp

#------------------------------
# Back-end section
#------------------------------

backend be_ex2019_autodiscover
mode http
# balance source
option httpchk GET /autodiscover/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail1 mail1.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt
server mail2 mail2.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt

backend be_ex2019_mapi
mode http
balance source
option httpchk GET /mapi/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail3 mail3.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt
server mail4 mail4.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt

backend be_ex2019_rpc
mode http
balance source
option httpchk GET /rpc/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail3 mail3.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt

backend be_ex2019_owa
mode http
# balance source
option httpchk GET /owa/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail1 mail1.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt
server mail2 mail2.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt
server mail3 mail3.bidhankhatri.com.np:443 check maxconn 1000 ssl ca-file /etc/ssl/certs/ca-certificates.crt



backend be_ex2019_activesync
mode http
#balance source
option httpchk GET /microsoft-server-activesync/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail2 mail2.bidhankhatri.com.np:443 check ssl verify none
server mail1 mail1.bidhankhatri.com.np:443 check ssl verify none

backend be_exchange_imaps
mode tcp
#option tcplog
balance source
option log-health-checks
server mail1 mail1.bidhankhatri.com.np:993 weight 10 check
server mail2 mail2.bidhankhatri.com.np:993 weight 20 check
server mail3 mail3.bidhankhatri.com.np:993 weight 30 check

backend be_ex2019_ews
mode http
balance source
option httpchk GET /ews/healthcheck.htm
option log-health-checks
http-check expect status 200
server mail2 mail2.bidhankhatri.com.np:443 check ssl verify none
server mail1 mail1.bidhankhatri.com.np:443 check ssl verify none


backend be_ex2019_default
mode http
balance source
server mail2 mail2.bidhankhatri.com.np:443 check ssl verify none
server mail1 mail1.bidhankhatri.com.np:443 check ssl verify none


backend be_exchange_smtp
mode tcp
#balance source
balance source
option log-health-checks
server mail2 mail2.bidhankhatri.com.np:25 weight 20 check
server mail1 mail1.bidhankhatri.com.np:25 weight 10 check

haproxy -c -f /etc/haproxy/haproxy.cfg
    Configuration file is valid

systemctl start haproxy
systemctl enable haproxy

Now check TLS versions

openssl s_client -connect mail.bidhankhatri.com.np:443 -tls1_2    // to check tls1.2
openssl s_client -connect mail.bidhankhatri.com.np:443 -tls1    // to check tls 1.0

Monitoring

In prometheus server end.

vim /etc/prometheus/prometheus.yml

  - job_name: 'haproxy'
    static_configs:
    - targets: ['xxx.xxx.xxx.xxx:8004']

systemctl restart prometheus

HAProxy Real-Time Dashboard

HAProxy Prometheus Dashboard

Ansible: To use the ‘ssh’ connection type with passwords, you must install the sshpass program

2021-07-21T02:16:41+00:00

“To use the ‘ssh’ connection type with passwords, you must install the sshpass program” to resolve this error message, you have to install sshpass package in your ansible control machine.

What is sshpass?
The sshpass utility is designed to run SSH using the keyboard-interactive password authentication mode, but in a non-interactive way. SSH uses direct TTY access to ensure that the password is indeed issued by an interactive keyboard user. sshpass runs SSH in a dedicated TTY, fooling SSH into thinking it is getting the password from an interactive user.

In OS X:
If you are trying to install through brew then you might face an error.

brew install https://raw.githubusercontent.com/kadwanev/bigboybrew/master/Library/Formula/sshpass.rb

Error message:

Traceback (most recent call last):
`brew extract` or `brew create` and `brew tap-new` to create a formula file in a tap on GitHub instead.: Invalid usage: Non-checksummed download of sshpass formula file from an arbitrary URL is unsupported!  (UsageError)
`brew extract` or `brew create` and `brew tap-new` to create a formula file in a tap on GitHub instead.: Invalid usage: Non-checksummed download of sshpass formula file from an arbitrary URL is unsupported!  (UsageError)

Therefore, I prefered installing sshpass through source code.

wget https://sourceforge.net/projects/sshpass/files/latest/download/sshpass/1.08/sshpass-1.08.tar.gz
tar xvf sshpass-1.08.tar.gz
cd sshpass-1.08
./configure
make
make install

In CentOS 7:

yum install sshpass

In Ubuntu:

apt install sshpass