Evoluzione dell’alta affidabilità su Linux: creare un cluster Tomcat utilizzando la soluzione nativa, Heartbeat e Pacemaker

jre-6u23-linux-i586.bin

# chmod +x jre-6u23-linux-i586.bin 
# ./jre-6u23-linux-i586.bin

# mv jre1.6.0_23 /usr/local
# ln -s /usr/local/jre1.6.0_23 /usr/local/java

# wget http://it.apache.contactlab.it/tomcat/tomcat-7/v7.0.6/bin/apache-tomcat-7.0.6.tar.gz -o /tmp/apache-tomcat-7.0.6.tar.gz

# cd /usr/local
# tar -xzvf /tmp/apache-tomcat-7.0.6.tar.gz
# ln -s apache-tomcat-7.0.6 tomcat

# /etc/init.d/heartbeat start

# crm configure property stonith-enabled="false"
# crm configure property no-quorum-policy="ignore"
# crm configure primitive ping ocf:pacemaker:ping params host_list="192.168.0.254" name="ping" op monitor interval="10s" timeout="60s" op start timeout="60s" op stop timeout="60s"
# crm configure clone ping_clone ping meta globally-unique="false"

crm configure primitive cluster_tomcat ocf:heartbeat:tomcat params java_home="/usr/local/java/" catalina_home="/usr/local/tomcat/" op monitor interval="60s" timeout="30s" op start interval="0" timeout="60s" op stop interval="0" timeout="120s"
crm configure clone cluster_tomcat_clone cluster_tomcat meta globally-unique="false" target-role="Started"
crm configure location cluster_tomcat_on_connected_node cluster_tomcat_clone -inf: not_defined ping or ping lte 0

============
Last updated: Mon Jan 24 10:17:05 2011
Stack: Heartbeat
Current DC: tomcatcluster-nodo2 (b091eddc-aa64-4d7d-8407-65a7dddafbdf) - partition with quorum
Version: 1.0.10-da7075976b5ff0bee71074385f8fd02f296ec8a3
2 Nodes configured, unknown expected votes
2 Resources configured.
============

Online: [ tomcatcluster-nodo1 tomcatcluster-nodo2 ]

 Clone Set: ping_clone
     Started: [ tomcatcluster-nodo1 tomcatcluster-nodo2 ]
 Clone Set: cluster_tomcat_clone
     Started: [ tomcatcluster-nodo1 tomcatcluster-nodo2 ]

# cat /usr/local/tomcat/logs/catalina.out
...
INFO: Cluster is about to start
...
INFO: Setting cluster mcast soTimeout to 500
...
INFO: Server startup in 3153 ms
...
21-gen-2011 17.05.47 org.apache.catalina.ha.tcp.SimpleTcpCluster memberAdded
INFO: Replication member added:org.apache.catalina.tribes.membership.MemberImpl[tcp://{192, 168, 0, 1}:4000,{192, 168, 0, 2},4000, alive=1259, securePort=-1, UDP Port=-1, id={101 64 89 59 2 -44 67 123 -73 23 4 -50 -57 105 8 58 }, payload={}, command={}, domain={}, ]
...

192.168.0.1 tomcatcluster
#192.168.0.2 tomcatcluster

http://tomcatcluster:8080/TestSession/

http://tomcatcluster:8080/TestSession/session.jsp?aaa=111

Server info

    * Name: tomcatcluster-nodo1
    * Address: 192.168.0.1

Session Attributes

    * aaa = 111

http://tomcatcluster:8080/TestSession/session.jsp?bbb=222

Server info

    * Name: tomcatcluster-nodo1
    * Address: 192.168.0.1

Session Attributes

    * aaa = 111
    * bbb = 222

#192.168.0.1 tomcatcluster
192.168.0.2 tomcatcluster

http://tomcatcluster:8080/TestSession/session.jsp?ccc=333

Server info

    * Name: tomcatcluster-nodo2
    * Address: 192.168.0.2

Session Attributes

    * aaa = 111
    * bbb = 222
    * ccc = 333

configure clone SquidBalancedIP SquidIP meta globally-unique=”true” clone-max=”2” clone-node-max=”2”

primitive cluster-batch-auth-server-ip ocf:custom:anything \
	params user="jboss" binfile="/usr/java/latest/bin/java" cmdline_options="-Xms128m -Xmx521m com/myapp" workdir="/myapp/" \
	op monitor interval="10s" timeout="20s" depth="0" \
	op start interval="0" timeout="20s" \
	op stop interval="0" timeout="20s"

Last updated: Mon Mar 21 12:52:49 2011
Stack: Heartbeat
Current DC: proxy-1 - partition with quorum
Version: 1.0.9-unknown
2 Nodes configured, unknown expected votes
3 Resources configured.
============

Online: [ proxy-1 proxy-2 ]

 Clone Set: ping_clone
     Started: [ proxy-1 proxy-2 ]
 cluster-ip     (ocf::heartbeat:IPaddr2):       Started proxy-1

Failed actions:
    cluster-squid_start_0 (node=proxy-2, call=5, rc=-2, status=Timed Out): unknown exec error
    cluster-squid_start_0 (node=proxy-1, call=5, rc=-2, status=Timed Out): unknown exec error

---------------------------------------------
Mar 22 11:20:02 proxy-1 crmd: [7521]: info: do_lrm_rsc_op: Performing key=17:3:0:28b5e648-efaf-46e7-897b-ffdb29713675 op=cluster-squid_start_0 )
Mar 22 11:20:02 proxy-1 lrmd: [7518]: info: rsc:cluster-squid:9: start
Mar 22 11:20:02 proxy-1 squid[7754]: Squid Parent: child process 7756 started
Mar 22 11:20:02 proxy-1 Squid[7713]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:06 proxy-1 Squid[7713]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:07 proxy-1 Squid[7713]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:13 proxy-1 Squid[7713]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:22 proxy-1 lrmd: [7518]: WARN: cluster-squid:start process (PID 7713) timed out (try 1).  Killing with signal SIGTERM (15).
Mar 22 11:20:22 proxy-1 lrmd: [7518]: WARN: operation start[9] on ocf::Squid::cluster-squid for client 7521, its parameters: crm_feature_set=[3.0.1] squid_conf=[/etc/squid3/squid.conf] CRM_meta_timeout=[20000] squid_exe=[/usr/sbin/squid3] squid_pidfile=[/var/run/squid3.pid] squid_port=[3366] : pid [7713] timed out
Mar 22 11:20:22 proxy-1 crmd: [7521]: ERROR: process_lrm_event: LRM operation cluster-squid_start_0 (9) Timed Out (timeout=20000ms)
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: find_hash_entry: Creating hash entry for fail-count-cluster-squid
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-cluster-squid (INFINITY)
Mar 22 11:20:23 proxy-1 crmd: [7521]: info: do_lrm_rsc_op: Performing key=4:4:0:28b5e648-efaf-46e7-897b-ffdb29713675 op=cluster-squid_stop_0 )
Mar 22 11:20:23 proxy-1 lrmd: [7518]: info: rsc:cluster-squid:11: stop
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: attrd_perform_update: Sent update 22: fail-count-cluster-squid=INFINITY
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: find_hash_entry: Creating hash entry for last-failure-cluster-squid
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-cluster-squid (1300789224)
Mar 22 11:20:23 proxy-1 attrd: [7520]: info: attrd_perform_update: Sent update 25: last-failure-cluster-squid=1300789224
Mar 22 11:20:24 proxy-1 Squid[8118]: INFO: squid:stop_squid:311:  stop NORM 1/5
Mar 22 11:20:25 proxy-1 Squid[8118]: INFO: squid:stop_squid:311:  stop NORM 2/5
Mar 22 11:20:27 proxy-1 Squid[8118]: INFO: squid:stop_squid:311:  stop NORM 3/5
Mar 22 11:20:28 proxy-1 Squid[8118]: INFO: squid:stop_squid:311:  stop NORM 4/5
Mar 22 11:20:29 sspa-proxy-1 Squid[8118]: INFO: squid:stop_squid:311:  stop NORM 5/5
Mar 22 11:20:29 proxy-1 Squid[8118]: INFO: squid:stop_squid:318:  try to stop by SIGKILL:7754
Mar 22 11:20:30 proxy-1 Squid[8118]: INFO: squid:stop_squid:318:  try to stop by SIGKILL:
Mar 22 11:20:30 proxy-1 lrmd: [7518]: info: RA output: (cluster-squid:stop:stderr) kill: usage: kill [-s sigspec | -n signum | -sigspec] pid | jobspec ... or kill -l [sigspec]
Mar 22 11:20:31 proxy-1 crmd: [7521]: info: process_lrm_event: LRM operation cluster-squid_stop_0 (call=11, rc=0, cib-update=18, confirmed=true) ok
-----------------------------------------------

-----------------------------------------------
Mar 22 11:20:01 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Stopped
Mar 22 11:20:01 proxy-2 pengine: [3427]: notice: RecurringOp:  Start recurring monitor (10s) for cluster-squid on proxy-1
Mar 22 11:20:01 proxy-2 pengine: [3427]: notice: LogActions: Start cluster-squid#011(proxy-1)
Mar 22 11:20:02 proxy-2 crmd: [3422]: info: te_rsc_command: Initiating action 17: start cluster-squid_start_0 on proxy-1
Mar 22 11:20:23 proxy-2 crmd: [3422]: WARN: status_from_rc: Action 17 (cluster-squid_start_0) on proxy-1 failed (target: 0 vs. rc: -2): Error
Mar 22 11:20:24 proxy-2 crmd: [3422]: WARN: update_failcount: Updating failcount for cluster-squid on proxy-1 after failed start: rc=-2 (update=INFINITY, time=1300789224)
Mar 22 11:20:24 proxy-2 crmd: [3422]: info: abort_transition_graph: match_graph_event:272 - Triggered transition abort (complete=0, tag=lrm_rsc_op, id=cluster-squid_start_0, magic=2:-2;17:3:0:28b5e648-efaf-46e7-897b-ffdb29713675, cib=0.183.18) : Event failed
Mar 22 11:20:24 proxy-2 crmd: [3422]: info: match_graph_event: Action cluster-squid_start_0 (17) confirmed on proxy-1 (rc=4)
Mar 22 11:20:24 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-1: unknown exec error (-2)
Mar 22 11:20:24 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Started proxy-1 FAILED
Mar 22 11:20:24 proxy-2 pengine: [3427]: notice: RecurringOp:  Start recurring monitor (10s) for cluster-squid on proxy-1
Mar 22 11:20:24 proxy-2 pengine: [3427]: notice: LogActions: Recover resource cluster-squid#011(Started proxy-1)
Mar 22 11:20:24 proxy-2 crmd: [3422]: info: te_rsc_command: Initiating action 4: stop cluster-squid_stop_0 on proxy-1
Mar 22 11:20:24 proxy-2 attrd: [3421]: info: find_hash_entry: Creating hash entry for fail-count-cluster-squid
Mar 22 11:20:24 proxy-2 attrd: [3421]: info: find_hash_entry: Creating hash entry for last-failure-cluster-squid
Mar 22 11:20:31 proxy-2 crmd: [3422]: info: match_graph_event: Action cluster-squid_stop_0 (4) confirmed on proxy-1 (rc=0)
Mar 22 11:20:31 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-1: unknown exec error (-2)
Mar 22 11:20:31 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Stopped
Mar 22 11:20:31 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-1
Mar 22 11:20:31 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-1 after 1000000 failures (max=1000000)
Mar 22 11:20:31 proxy-2 pengine: [3427]: notice: RecurringOp:  Start recurring monitor (10s) for cluster-squid on proxy-2
Mar 22 11:20:31 proxy-2 pengine: [3427]: notice: LogActions: Start cluster-squid#011(proxy-2)
Mar 22 11:20:32 proxy-2 crmd: [3422]: info: te_rsc_command: Initiating action 19: start cluster-squid_start_0 on proxy-2 (local)
Mar 22 11:20:32 proxy-2 crmd: [3422]: info: do_lrm_rsc_op: Performing key=19:5:0:28b5e648-efaf-46e7-897b-ffdb29713675 op=cluster-squid_start_0 )
Mar 22 11:20:32 proxy-2 lrmd: [3419]: info: rsc:cluster-squid:9: start
Mar 22 11:20:32 proxy-2 squid[3712]: Squid Parent: child process 3714 started
Mar 22 11:20:32 proxy-2 Squid[3671]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:43 proxy-2 Squid[3671]: INFO: squid:Waiting for squid to be invoked
Mar 22 11:20:52 proxy-2 lrmd: [3419]: WARN: cluster-squid:start process (PID 3671) timed out (try 1).  Killing with signal SIGTERM (15).
Mar 22 11:20:52 proxy-2 lrmd: [3419]: WARN: operation start[9] on ocf::Squid::cluster-squid for client 3422, its parameters: crm_feature_set=[3.0.1] squid_conf=[/etc/squid3/squid.conf] CRM_meta_timeout=[20000] squid_exe=[/usr/sbin/squid3] squid_pidfile=[/var/run/squid3.pid] squid_port=[3366] : pid [3671] timed out
Mar 22 11:20:52 proxy-2 crmd: [3422]: ERROR: process_lrm_event: LRM operation cluster-squid_start_0 (9) Timed Out (timeout=20000ms)
Mar 22 11:20:52 proxy-2 crmd: [3422]: WARN: status_from_rc: Action 19 (cluster-squid_start_0) on proxy-2 failed (target: 0 vs. rc: -2): Error
Mar 22 11:20:53 proxy-2 crmd: [3422]: WARN: update_failcount: Updating failcount for cluster-squid on proxy-2 after failed start: rc=-2 (update=INFINITY, time=1300789253)
Mar 22 11:20:53 proxy-2 crmd: [3422]: info: abort_transition_graph: match_graph_event:272 - Triggered transition abort (complete=0, tag=lrm_rsc_op, id=cluster-squid_start_0, magic=2:-2;19:5:0:28b5e648-efaf-46e7-897b-ffdb29713675, cib=0.183.26) : Event failed
Mar 22 11:20:53 proxy-2 attrd: [3421]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-cluster-squid (INFINITY)
Mar 22 11:20:53 proxy-2 crmd: [3422]: info: match_graph_event: Action cluster-squid_start_0 (19) confirmed on proxy-2 (rc=4)
Mar 22 11:20:53 proxy-2 attrd: [3421]: info: attrd_perform_update: Sent update 24: fail-count-cluster-squid=INFINITY
Mar 22 11:20:53 proxy-2 attrd: [3421]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-cluster-squid (1300789253)
Mar 22 11:20:53 proxy-2 attrd: [3421]: info: attrd_perform_update: Sent update 27: last-failure-cluster-squid=1300789253
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-2: unknown exec error (-2)
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-1: unknown exec error (-2)
Mar 22 11:20:53 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Started proxy-2 FAILED
Mar 22 11:20:53 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-1
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-1 after 1000000 failures (max=1000000)
Mar 22 11:20:53 proxy-2 pengine: [3427]: notice: RecurringOp:  Start recurring monitor (10s) for cluster-squid on proxy-2
Mar 22 11:20:53 proxy-2 pengine: [3427]: notice: LogActions: Recover resource cluster-squid#011(Started proxy-2)
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-2: unknown exec error (-2)
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-1: unknown exec error (-2)
Mar 22 11:20:53 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Started proxy-2 FAILED
Mar 22 11:20:53 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-1
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-1 after 1000000 failures (max=1000000)
Mar 22 11:20:53 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-2
Mar 22 11:20:53 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-2 after 1000000 failures (max=1000000)
Mar 22 11:20:53 proxy-2 pengine: [3427]: info: native_merge_weights: cluster-ip: Rolling back scores from cluster-squid
Mar 22 11:20:53 proxy-2 pengine: [3427]: info: native_color: Resource cluster-squid cannot run anywhere
Mar 22 11:20:53 proxy-2 pengine: [3427]: notice: LogActions: Stop resource cluster-squid#011(proxy-2)
Mar 22 11:20:53 proxy-2 crmd: [3422]: info: te_rsc_command: Initiating action 3: stop cluster-squid_stop_0 on proxy-2 (local)
Mar 22 11:20:53 proxy-2 crmd: [3422]: info: do_lrm_rsc_op: Performing key=3:7:0:28b5e648-efaf-46e7-897b-ffdb29713675 op=cluster-squid_stop_0 )
Mar 22 11:20:53 proxy-2 lrmd: [3419]: info: rsc:cluster-squid:10: stop
Mar 22 11:20:54 proxy-2 Squid[4109]: INFO: squid:stop_squid:311:  stop NORM 1/5
Mar 22 11:20:55 proxy-2 Squid[4109]: INFO: squid:stop_squid:311:  stop NORM 2/5
Mar 22 11:20:56 proxy-2 Squid[4109]: INFO: squid:stop_squid:311:  stop NORM 3/5
Mar 22 11:20:57 proxy-2 Squid[4109]: INFO: squid:stop_squid:311:  stop NORM 4/5
Mar 22 11:20:58 proxy-2 Squid[4109]: INFO: squid:stop_squid:311:  stop NORM 5/5
Mar 22 11:20:58 proxy-2 Squid[4109]: INFO: squid:stop_squid:318:  try to stop by SIGKILL:3712
Mar 22 11:20:59 proxy-2 Squid[4109]: INFO: squid:stop_squid:318:  try to stop by SIGKILL:
Mar 22 11:20:59 proxy-2 lrmd: [3419]: info: RA output: (cluster-squid:stop:stderr) kill: usage: kill [-s sigspec | -n signum | -sigspec] pid | jobspec ... or kill -l [sigspec]
Mar 22 11:21:00 proxy-2 crmd: [3422]: info: process_lrm_event: LRM operation cluster-squid_stop_0 (call=10, rc=0, cib-update=43, confirmed=true) ok
Mar 22 11:21:00 proxy-2 crmd: [3422]: info: match_graph_event: Action cluster-squid_stop_0 (3) confirmed on proxy-2 (rc=0)
Mar 22 11:36:02 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-2: unknown exec error (-2)
Mar 22 11:36:02 proxy-2 pengine: [3427]: WARN: unpack_rsc_op: Processing failed op cluster-squid_start_0 on proxy-1: unknown exec error (-2)
Mar 22 11:36:02 proxy-2 pengine: [3427]: notice: native_print:      cluster-squid#011(ocf::heartbeat:Squid):#011Stopped
Mar 22 11:36:02 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-1
Mar 22 11:36:02 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-1 after 1000000 failures (max=1000000)
Mar 22 11:36:02 proxy-2 pengine: [3427]: info: get_failcount: cluster-squid has failed INFINITY times on proxy-2
Mar 22 11:36:02 proxy-2 pengine: [3427]: WARN: common_apply_stickiness: Forcing cluster-squid away from proxy-2 after 1000000 failures (max=1000000)
Mar 22 11:36:02 proxy-2 pengine: [3427]: info: native_merge_weights: cluster-ip: Rolling back scores from cluster-squid
Mar 22 11:36:02 proxy-2 pengine: [3427]: info: native_color: Resource cluster-squid cannot run anywhere
Mar 22 11:36:02 proxy-2 pengine: [3427]: notice: LogActions: Leave resource cluster-squid#011(Stopped)
-----------------------------------------------

-------------------------------------------
autojoin none
keepalive 1
deadtime 10
warntime 5
initdead 20
mcast eth0 239.0.0.43 694 1 0
bcast eth1
node    proxy-1
node    proxy-2
crm respawn
logfacility local0
-----------------------------------------------

-----------------------------------------------
#!/bin/bash
#
# Description:  Manages a Squid Server provided by NTT OSSC as an 
#               OCF High-Availability resource under Heartbeat/LinuxHA control
#
# This program is free software; you can redistribute it and/or
# modify it under the terms of the GNU General Public License
# as published by the Free Software Foundation; either version 2
# of the License, or (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program; if not, write to the Free Software
# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA
# 02110-1301, USA.
#
# Copyright (c) 2008 NIPPON TELEGRAPH AND TELEPHONE CORPORATION
#
#######################################################################
# OCF parameters:
#   OCF_RESKEY_squid_exe    : Executable file
#   OCF_RESKEY_squid_conf   : Configuration file
#   OCF_RESKEY_squid_pidfile: Process id file
#   OCF_RESKEY_squid_port   : Port number
#   OCF_RESKEY_debug_mode   : Debug mode
#   OCF_RESKEY_debug_log    : Debug log file
#   OCF_RESKEY_squid_stop_timeout:
#                             Number of seconds to await to confirm a
#                             normal stop method
#
#   OCF_RESKEY_squid_exe, OCF_RESKEY_squid_conf, OCF_RESKEY_squid_pidfile
#   and OCF_RESKEY_squid_port must be specified. Each of the rests
#   has its default value or refers OCF_RESKEY_squid_conf to make
#   its value when no explicit value is given.
###############################################################################

: ${OCF_FUNCTIONS_DIR=${OCF_ROOT}/usr/lib/heartbeat}
. ${OCF_FUNCTIONS_DIR}/ocf-shellfuncs

usage() 
{
	cat <<-!
usage: $0 action

action:
        start       : start a new squid instance

        stop        : stop the running squid instance

        status      : return the status of squid, run or down

        monitor     : return TRUE if the squid appears to be working.

        meta-data   : show meta data message

        validate-all: validate the instance parameters
!
	return $OCF_ERR_ARGS
}

metadata_squid()
{
    cat <<END

1.0

The resource agent of Squid.
This manages a Squid instance as an HA resource.

Manages a Squid proxy server instance

This is a required parameter. This parameter specifies squids
executable file.

Executable file

This is a required parameter. This parameter specifies a configuration file
for a squid instance managed by this RA.

Configuration file

This is a required parameter. This parameter specifies a process id file
for a squid instance managed by this RA.

Pidfile

This is a required parameter. This parameter specifies a port number
for a squid instance managed by this RA. If plural ports are used,
you must specifiy the only one of them.

Port number

This is an omittable parameter.
On a stop action, a normal stop method is firstly used.
and then the confirmation of its completion is awaited for
the specified seconds by this parameter.
The default value is 10.

Number of seconds to await to confirm a normal stop method

This is an optional parameter.
This RA runs in debug mode when this parameter includes 'x' or 'v'.
If 'x' is included, both of STDOUT and STDERR redirect to the logfile
specified by "debug_log", and then the builtin shell option 'x' is turned on.
It is similar about 'v'.

Debug mode

This is an optional and omittable parameter.
This parameter specifies a destination file for debug logs
and works only if this RA run in debug mode.  Refer to "debug_mode"
about debug mode. If no value is given but it's requied, it's made by the
following rules: "/var/log/" as a directory part, the basename of
the configuration file given by "syslog_ng_conf" as a basename part,
".log" as a suffix.

A destination of the debug log

END

	return $OCF_SUCCESS
}

get_pids()
{
	SQUID_PIDS=( )

	# Seek by pattern
	SQUID_PIDS[0]=$(pgrep -f "$PROCESS_PATTERN")

	# Seek by pidfile
	SQUID_PIDS[1]=$(awk '1{print $1}' $SQUID_PIDFILE 2>/dev/null)

	if [[ -n "${SQUID_PIDS[1]}" ]]; then
		typeset exe
		exe=$(ls -l "/proc/${SQUID_PIDS[1]}/exe")
		if [[ $? = 0 ]]; then
			exe=${exe##*-> }
			if ! [[ "$exe" = $SQUID_EXE ]]; then
				SQUID_PIDS[1]=""
			fi
		else
			SQUID_PIDS[1]=""
		fi
	fi

	# Seek by port
	SQUID_PIDS[2]=$(
		netstat -apn |
		awk '/tcp.*[0-9]+\.[0-9]+\.+[0-9]+\.[0-9]+:'$SQUID_PORT' /{
			sub("\\/.*", "", $7); print $7; exit}')
}

are_all_pids_found()
{
	if 
		[[ -n "${SQUID_PIDS[0]}" ]] &&
		[[ -n "${SQUID_PIDS[1]}" ]] &&
		[[ -n "${SQUID_PIDS[2]}" ]]
	then
		return 0
	else
		return 1
	fi
}

are_pids_sane()
{
	if [[ "${SQUID_PIDS[1]}" = "${SQUID_PIDS[2]}" ]]; then
		return $OCF_SUCCESS
	else
		ocf_log err "$SQUID_NAME:Pid unmatch"
		return $OCF_ERR_GENERIC
	fi
}

is_squid_dead()
{
	if 
		[[ -z "${SQUID_PIDS[0]}" ]] &&
		[[ -z "${SQUID_PIDS[2]}" ]]
	then
		return 0
	else
		return 1
	fi
}

monitor_squid()
{
	typeset trialcount=0

	while true; do
		get_pids

		if are_all_pids_found; then
			are_pids_sane
			return $OCF_SUCCESS
		fi

		if is_squid_dead; then
			return $OCF_NOT_RUNNING
		fi

		ocf_log info "$SQUID_NAME:Inconsistent processes:" \
			"${SQUID_PIDS[0]},${SQUID_PIDS[1]},${SQUID_PIDS[2]}"
		(( trialcount = trialcount + 1 ))
		if (( trialcount > SQUID_CONFIRM_TRIALCOUNT )); then
			ocf_log err "$SQUID_NAME:Inconsistency of processes remains unsolved"
			return $OCF_ERR_GENERIC
		fi
		sleep 1
	done
}

start_squid()
{
	typeset status

	monitor_squid
	status=$?

	if [[ $status != $OCF_NOT_RUNNING ]]; then
		return $status
	fi

	set -- "$SQUID_OPTS"
	ocf_run $SQUID_EXE -f "$SQUID_CONF" "$@"
	status=$?
	if [[ $status != $OCF_SUCCESS ]]; then
		return $OCF_ERR_GENERIC
	fi

	while true; do
		get_pids
		if are_all_pids_found && are_pids_sane; then
			return $OCF_SUCCESS
		fi
		ocf_log info "$SQUID_NAME:Waiting for squid to be invoked"
		sleep 1
	done

	return $OCF_ERR_GENERIC
}

stop_squid()
{
	typeset lapse_sec

	if ocf_run $SQUID_EXE -f $SQUID_CONF -k shutdown; then
		lapse_sec=0
		while true; do
			get_pids
			if is_squid_dead; then
				rm -f $SQUID_PIDFILE
				return $OCF_SUCCESS
			fi
			(( lapse_sec = lapse_sec + 1 ))
			if (( lapse_sec > SQUID_STOP_TIMEOUT )); then
				break
			fi
			sleep 1
			ocf_log info "$SQUID_NAME:$FUNCNAME:$LINENO: " \
				"stop NORM $lapse_sec/$SQUID_STOP_TIMEOUT"
		done
	fi

	while true; do
		get_pids
		ocf_log info "$SQUID_NAME:$FUNCNAME:$LINENO: " \
			"try to stop by SIGKILL:${SQUID_PIDS[0]} ${SQUID_PIDS[2]}"
		kill -KILL ${SQUID_PIDS[0]} ${SQUID_PIDS[2]}
		sleep 1
		if is_squid_dead; then
			rm -f $SQUID_PIDFILE
			return $OCF_SUCCESS
		fi
	done

	return $OCF_ERR_GENERIC
}

status_squid()
{
	return $OCF_SUCCESS
}

validate_all_squid()
{
	ocf_log info "validate_all_squid[$SQUID_NAME]"
	return $OCF_SUCCESS
}

: === Debug ${0##*/} $1 ===

if [[ "$1" = "meta-data" ]]; then
	metadata_squid
	exit $?
fi

SQUID_CONF="${OCF_RESKEY_squid_conf}"
if [[ -z "$SQUID_CONF" ]]; then
	ocf_log err "SQUID_CONF is not defined"
	exit $OCF_ERR_CONFIGURED
fi

SQUID_NAME="${SQUID_CONF##*/}"
SQUID_NAME="${SQUID_NAME%.*}"

DEBUG_LOG="${OCF_RESKEY_debug_log-/var/log/squid_${SQUID_NAME}_debug}.log"

DEBUG_MODE=""
case $OCF_RESKEY_debug_mode in
	*x*) DEBUG_MODE="${DEBUG_MODE}x";;
esac
case $OCF_RESKEY_debug_mode in
	*v*) DEBUG_MODE="${DEBUG_MODE}v";;
esac

if [ -n "$DEBUG_MODE" ]; then
	PS4='\d \t \h '"${1-unknown} "
	export PS4
	exec 1>>$DEBUG_LOG 2>&1
	set -$DEBUG_MODE
fi

SQUID_EXE="${OCF_RESKEY_squid_exe}"
if [[ -z "$SQUID_EXE" ]]; then
	ocf_log err "SQUID_EXE is not defined"
	exit $OCF_ERR_CONFIGURED
fi
if [[ ! -x "$SQUID_EXE" ]]; then
	ocf_log err "$SQUID_EXE is not found"
	exit $OCF_ERR_CONFIGURED
fi

SQUID_PIDFILE="${OCF_RESKEY_squid_pidfile}"
if [[ -z "$SQUID_PIDFILE" ]]; then
	ocf_log err "SQUID_PIDFILE is not defined"
	exit $OCF_ERR_CONFIGURED
fi

SQUID_PORT="${OCF_RESKEY_squid_port}"
if [[ -z "$SQUID_PORT" ]]; then
	ocf_log err "SQUID_PORT is not defined"
	exit $OCF_ERR_CONFIGURED
fi

SQUID_OPTS="${OCF_RESKEY_squid_opts}"

SQUID_PIDS=( )

SQUID_CONFIRM_TRIALCOUNT="${OCF_RESKEY_squid_confirm_trialcount-3}"

SQUID_STOP_TIMEOUT="${OCF_RESKEY_squid_stop_timeout-5}"
SQUID_SUSPEND_TRIALCOUNT="${OCF_RESKEY_squid_suspend_trialcount-10}"

PROCESS_PATTERN="$SQUID_EXE -f $SQUID_CONF"

COMMAND=$1

case "$COMMAND" in
	start)
		ocf_log debug  "[$SQUID_NAME] Enter squid start"
		start_squid
		func_status=$?
		ocf_log debug  "[$SQUID_NAME] Leave squid start $func_status"
		exit $func_status
		;;
	stop)
		ocf_log debug  "[$SQUID_NAME] Enter squid stop"
		stop_squid
		func_status=$?
		ocf_log debug  "[$SQUID_NAME] Leave squid stop $func_status"
		exit $func_status
		;;
	status)
		status_squid
		exit $?
		;;
	monitor)
		#ocf_log debug  "[$SQUID_NAME] Enter squid monitor"
		monitor_squid
		func_status=$?
		#ocf_log debug  "[$SQUID_NAME] Leave squid monitor $func_status"
		exit $func_status
		;;
	validate-all)
		validate_all_squid
		exit $?
		;;
	*)
		usage
		;;
esac

# vim: set sw=4 ts=4 :
-----------------------------------------------

node $id="578d50ed-5ed2-416e-87ff-352d9c30b773" proxy-1
node $id="6cb310c1-7f2f-43f2-8d91-1cdf2ed936ea" proxy-2
primitive cluster-ip ocf:heartbeat:IPaddr2 \
        params ip="**********" nic="eth0:0" \
        op monitor interval="10s" timeout="20s" \
        op start interval="0" timeout="20s" broadcast="***********" \
        op stop interval="0" timeout="20s"
primitive cluster-squid ocf:heartbeat:Squid \
        params squid_exe="/usr/sbin/squid3" squid_conf="/etc/squid3/squid.conf" squid_pidfile="/var/run/squid3.pid" squid_port="*****" \
        op monitor interval="10s" timeout="30s" depth="0"
primitive ping ocf:pacemaker:ping \
        params host_list="************" name="ping" \
        op monitor interval="10s" timeout="60s" \
        op start interval="0" timeout="60s" \
        op stop interval="0" timeout="60s"
group cluster-proxy cluster-ip cluster-squid
clone ping_clone ping \
        meta globally-unique="false"
location cluster_on_connected_node cluster-proxy \
        rule $id="cluster_on_connected_node-rule" -inf: not_defined ping or ping lte 0
property $id="cib-bootstrap-options" \
        dc-version="1.0.9-unknown" \
        cluster-infrastructure="Heartbeat" \
        no-quorum-policy="ignore" \
        stonith-enabled="false"

# cd /usr/lib/ocf/resource.d/heartbeat
# export OCF_RESKEY_squid_exe="/usr/sbin/squid3"
# export OCF_RESKEY_squid_conf="/etc/squid3/squid.conf"
# export OCF_RESKEY_squid_pidfile="/var/run/squid3.pid"
# export OCF_RESKEY_squid_port="*****"
# ./Squid start

Squid[30536]: WARN: Use of @HA_LIBHBDIR@/ocf-shellfuncs is deprecated.
Squid[30536]: WARN: Please use /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs instead.
Squid[30536]: WARN: Note that the $OCF_ROOT environment variable points to /usr/lib/ocf
Squid[30536]: WARN: We recommend using the $OCF_ROOT environment variable
Squid[30536]: WARN: Please fix /etc/ha.d/resource.d/Squid at your earliest convenience
Squid[30536]: DEBUG: [squid] Enter squid start
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked
Squid[30536]: INFO: squid:Waiting for squid to be invoked

Squid[31755]: WARN: Use of @HA_LIBHBDIR@/ocf-shellfuncs is deprecated.
Squid[31755]: WARN: Please use /usr/lib/ocf/resource.d//heartbeat/.ocf-shellfuncs instead.
Squid[31755]: WARN: Note that the $OCF_ROOT environment variable points to /usr/lib/ocf
Squid[31755]: WARN: We recommend using the $OCF_ROOT environment variable
Squid[31755]: WARN: Please fix /etc/ha.d/resource.d/Squid at your earliest convenience
Squid[31755]: DEBUG: [squid] Enter squid start
Squid[31755]: INFO: squid:Inconsistent processes: 30553,30555,
Squid[31755]: INFO: squid:Inconsistent processes: 30553,30555,
Squid[31755]: INFO: squid:Inconsistent processes: 30553,30555,
Squid[31755]: INFO: squid:Inconsistent processes: 30553,30555,
Squid[31755]: ERROR: squid:Inconsistency of processes remains unsolved
Squid[31755]: DEBUG: [squid] Leave squid start 1

# rm -rf /var/lib/heartbeat/crm/*

Evoluzione dell’alta affidabilità su Linux: creare un cluster Tomcat utilizzando la soluzione nativa, Heartbeat e Pacemaker

Raoul Scarazzini

43 risposte a “Evoluzione dell’alta affidabilità su Linux: creare un cluster Tomcat utilizzando la soluzione nativa, Heartbeat e Pacemaker”

Lascia un commento

Git & Tricks – Pillole di source code management | Parte 3: l’importanza del rebase per un mondo migliore

Kubelab, un ruolo Ansible per imparare ad installare e gestire Kubernetes

Kubernetes, CPU Limits e Requests per i Pod, spiegazione e confronto: massimo controllo o massima efficienza?

Git & Tricks – Pillole di source code management | Parte 2: gestire i commit con empatia

Installare Kubernetes in ambienti totalmente isolati si può, kubeadm supporta gli Air Gap Cluster!

Git & Tricks – Pillole di source code management | Parte 1: un ambiente confortevole

Errori di battitura nel terminale: quando il typo di un singolo carattere fa tutta la differenza del mondo

Platform Bloody Platform: ecco il primo nuovo meetup di Mia Mamma Usa Linux con tutti i video dei talk!

Una prova su strada di k0s, Kubernetes in un singolo eseguibile by Mirantis, che ha raggiunto la versione 1.27

Saturday’s Talks: di DevOps, di Solomon Hykes (creatore di Docker) e Dagger (la sua nuova creatura) e di complessità

Saturday’s Talks: se la Linux Foundation continuerà con i fork non risolverà mai il problema delle licenze open-source che diventano closed

Saturday’s Talks: non sai come creare un modello AI su Azure? Ci pensa l’operator Kubernetes AI di Microsoft… A confonderti le idee 🙂

Saturday’s Talks: full-remote vs presenza in ufficio, l’eterna dicotomia tra produttività percepita e produttività effettiva

Categories

Tag cloud

Collabora con noi!