Monit » History » Version 5
Anonymous, 08/20/2008 01:37 PM
Installation Guide: Update Monit
1 | 1 | Anonymous | |
---|---|---|---|
2 | 5 | Anonymous | h1. Monit: Monitoring of Services |
3 | 1 | Anonymous | |
4 | 5 | Anonymous | This installation guide will describe how to set up _independent_ instances of "Monit":http://www.tildeslash.com/monit/ on the master node and each slave.[[br]] |
5 | 5 | Anonymous | In the future, "M|Monit":http://www.tildeslash.com/mmonit/ should be considered, which allows easy single point administration and monitoring (from the master node). |
6 | 5 | Anonymous | |
7 | 5 | Anonymous | |
8 | 5 | Anonymous | |
9 | 5 | Anonymous | h2. Installation |
10 | 5 | Anonymous | |
11 | 5 | Anonymous | * Add the DAG repository on the _master node_ and _slave nodes_. Enter at the command line as _root_: |
12 | 5 | Anonymous | <pre> |
13 | 1 | Anonymous | wget http://apt.sw.be/redhat/el5/en/x86_64/rpmforge/RPMS/rpmforge-release-0.3.6-1.el5.rf.x86_64.rpm |
14 | 1 | Anonymous | rpm -Uvh rpmforge-release-0.3.6-1.el5.rf.x86_64.rpm |
15 | 5 | Anonymous | </pre> |
16 | 1 | Anonymous | |
17 | 5 | Anonymous | * Install Monit on the _master node_ and _slave nodes_. Enter at the command line as _root_: |
18 | 5 | Anonymous | <pre> |
19 | 1 | Anonymous | yum install monit |
20 | 5 | Anonymous | </pre> |
21 | 1 | Anonymous | |
22 | 5 | Anonymous | |
23 | 5 | Anonymous | h2. Configuration |
24 | 5 | Anonymous | |
25 | 1 | Anonymous | |
26 | 5 | Anonymous | |
27 | 5 | Anonymous | h3. Master node |
28 | 5 | Anonymous | |
29 | 1 | Anonymous | On the master node, the following services will be monitore:[[br]] |
30 | 5 | Anonymous | _apache_, _cron_, _devices_ (/ & /home), _mysql_, _nfs_ (/home_nfs), _ntp_, _pbs_mom_, _pbs_sched_, _pbs_server_, _postfix_, _ssh_, _system_, _ypbind_, _yppasswd_, _ypserv_, [[br]] |
31 | 5 | Anonymous | and if all _slaves_ are reachable (_ping_)[[br]] |
32 | 5 | Anonymous | Currently, the monitoring of _pbs_maui_ is switched off in favour of _pbs_sched_.[[br]] |
33 | 1 | Anonymous | |
34 | 1 | Anonymous | |
35 | 5 | Anonymous | * Download the [source:Externals/Cluster/procksi_monit.tgz configuration files] from the repository and extract the files. Enter at the command line: |
36 | 5 | Anonymous | <pre> |
37 | 1 | Anonymous | tar -xvzf procksi_monit.tgz |
38 | 5 | Anonymous | </pre> |
39 | 1 | Anonymous | |
40 | 5 | Anonymous | * Copy the files in _./monit/master_ to the appropriate directories (_/etc/_, _/etc/monit.d/_, _/home/procksi/monit/_). |
41 | 1 | Anonymous | |
42 | 5 | Anonymous | * Change permissions of the monit token file. Enter at the command line: |
43 | 5 | Anonymous | <pre> |
44 | 1 | Anonymous | chown -R procksi.procksi_dev /home/procksi/monit/token |
45 | 5 | Anonymous | </pre> |
46 | 1 | Anonymous | |
47 | 5 | Anonymous | * Edit the Apache configuration file _/etc/httpd/conf/httpd.conf_: |
48 | 5 | Anonymous | <pre> |
49 | 1 | Anonymous | #General Aliases for Monitoring and Testing |
50 | 1 | Anonymous | Alias /monit/ "/home/procksi/monit/" |
51 | 1 | Anonymous | Alias /ganglia/ "/usr/local/ganglia/html/" |
52 | 1 | Anonymous | Alias /trees/ "/home/procksi/trees/" |
53 | 1 | Anonymous | |
54 | 1 | Anonymous | #Conditional Logging: Don't log Ganglia and Monit requests |
55 | 5 | Anonymous | [[SetEnvIf]] Request_URI "ganglia" dontlog |
56 | 5 | Anonymous | [[SetEnvIf]] Request_URI "^\/monit\/token$" dontlog |
57 | 5 | Anonymous | </pre> |
58 | 1 | Anonymous | |
59 | 5 | Anonymous | * Restart the Apache server. Enter at the command line as _root_: |
60 | 5 | Anonymous | <pre> |
61 | 1 | Anonymous | /sbin/service httpd restart |
62 | 5 | Anonymous | </pre> |
63 | 1 | Anonymous | |
64 | 5 | Anonymous | * Make the Monit daemon start at bootup. Enter at the command line as _root_: |
65 | 5 | Anonymous | <pre> |
66 | 1 | Anonymous | /sbin/chkconfig monit on |
67 | 5 | Anonymous | </pre> |
68 | 4 | Anonymous | |
69 | 5 | Anonymous | * Start the Monit daemon. Enter at the command line as _root_: |
70 | 5 | Anonymous | <pre> |
71 | 1 | Anonymous | /sbin/service monit start |
72 | 5 | Anonymous | </pre> |
73 | 1 | Anonymous | |
74 | 1 | Anonymous | |
75 | 5 | Anonymous | |
76 | 5 | Anonymous | h3. Slave nodes |
77 | 5 | Anonymous | |
78 | 1 | Anonymous | On the master node, the following services will be monitore:[[br]] |
79 | 5 | Anonymous | _devices_ (/ and /scratch), _nfs_ (/home), _ntp_, _pbs_mom_, _ssh_, _system_, _ypbind_ |
80 | 1 | Anonymous | |
81 | 5 | Anonymous | * Download the [source:Externals/Cluster/procksi_monit.tgz configuration files] from the repository and extract the files. Enter at the command line: |
82 | 5 | Anonymous | <pre> |
83 | 1 | Anonymous | tar -xvzf procksi_monit.tgz |
84 | 5 | Anonymous | </pre> |
85 | 1 | Anonymous | |
86 | 5 | Anonymous | * Copy a the files in _./monit/slave_ to the appropriate directories (_/etc/_, _/etc/monit.d/_). |
87 | 1 | Anonymous | |
88 | 5 | Anonymous | * Edit _/etc/monit.d/system_ and set the correct host name for each slave node. |
89 | 1 | Anonymous | |
90 | 5 | Anonymous | * Make the Monit daemon start at bootup. Enter at the command line as _root_: |
91 | 5 | Anonymous | <pre> |
92 | 1 | Anonymous | /sbin/chkconfig monit on |
93 | 5 | Anonymous | </pre> |
94 | 1 | Anonymous | |
95 | 5 | Anonymous | * Start the Monit daemon. Enter at the command line as _root_: |
96 | 5 | Anonymous | <pre> |
97 | 1 | Anonymous | /sbin/service monit start |
98 | 5 | Anonymous | </pre> |
99 | 1 | Anonymous | |
100 | 1 | Anonymous | |
101 | 1 | Anonymous | |
102 | 5 | Anonymous | h2. Online Monitoring |
103 | 1 | Anonymous | |
104 | 5 | Anonymous | |
105 | 5 | Anonymous | The status of each monitored service, process, file, etc. is available with the Monit's integrated webserver at port 2812 from _localhost_ and selected machines. Username and password can be found at the secret [[[secretAuthentication|authentication]]] page. |
106 | 5 | Anonymous | |
107 | 5 | Anonymous | || master01 || "|| slave01 || [http://procksi1.cs.nott.ac.uk:2812":http://procksi0.cs.nott.ac.uk:2812] |
108 | 5 | Anonymous | || slave02 || "|| slave03 || [http://procksi3.cs.nott.ac.uk:2812":http://procksi2.cs.nott.ac.uk:2812] |
109 | 1 | Anonymous | || slave04 || [http://procksi4.cs.nott.ac.uk:2812] |
110 | 1 | Anonymous | |
111 | 1 | Anonymous | |
112 | 5 | Anonymous | |
113 | 5 | Anonymous | h2. Offline Monitoring |
114 | 5 | Anonymous | |
115 | 1 | Anonymous | |
116 | 1 | Anonymous | Monit sends alerts to "procksi@cs.nott.ac.uk" if services are unavailable, have been restarted, or similar events. |