InstallationGuide » History » Version 19

Anonymous, 09/06/2007 10:20 AM

1 1 Anonymous
= Installation Guide =
2 1 Anonymous
3 1 Anonymous
This guide describes the installation procedure for ''ProCKSI''. 
4 1 Anonymous
5 4 Anonymous
 ||'''Release'''     ||procksi_8-2
6 4 Anonymous
 ||'''Environment''' ||Cluster with one head node and two compute nodes 
7 1 Anonymous
         
8 1 Anonymous
All installations must be done with ''root'' access.
9 1 Anonymous
10 1 Anonymous
11 1 Anonymous
12 1 Anonymous
13 1 Anonymous
= System Design and Requirements =
14 1 Anonymous
15 1 Anonymous
== Cluster Design ==
16 1 Anonymous
The cluster is assumed to have a ''head node'' and several ''compute nodes''. 
17 1 Anonymous
18 1 Anonymous
 * The head node runs the webserver, the file server, the central database server, the email server, and the server for the queuing system. It can be used as a compute node itself. 
19 1 Anonymous
 * The compute nodes run the calculations.
20 1 Anonymous
 
21 1 Anonymous
22 3 Anonymous
[[Image(ClusterDesign.png)]]
23 2 Anonymous
24 1 Anonymous
== Software Requirements ==
25 4 Anonymous
We are assuming the following software components to be already installed the '''head node''':
26 1 Anonymous
27 4 Anonymous
 ||'''Operating system''' ||''Centos4'' (''RHEL4'')
28 4 Anonymous
 ||'''Webserver''' 	  ||Apache2 
29 4 Anonymous
 ||'''Database'''	  ||MySQL
30 4 Anonymous
 ||'''Email server'''	  ||Postfix (SMTP) 
31 4 Anonymous
 ||'''Queuing system'''	  ||PBS torque + maui
32 1 Anonymous
33 1 Anonymous
34 4 Anonymous
The '''compute nodes''' only requires the following components:
35 1 Anonymous
36 19 Anonymous
 ||'''Operating system'''||	''Centos5'' (''RHEL5'')
37 4 Anonymous
 ||'''Queuing system  '''||	PBS torque
38 1 Anonymous
39 1 Anonymous
40 1 Anonymous
The configuration for these components will be described later in this installation guide.
41 1 Anonymous
42 1 Anonymous
43 1 Anonymous
44 1 Anonymous
= URL and Email Forwarding =
45 1 Anonymous
ProCKSI uses URL and email forwarding in order to provide a stable internet address and corresponding email addresses.
46 1 Anonymous
47 1 Anonymous
48 1 Anonymous
49 1 Anonymous
== Provider ==
50 4 Anonymous
These are data for the domain and email domain provider.
51 1 Anonymous
52 4 Anonymous
 ||'''Provider'''  ||	[http://www.planetdomain.com/ukcheap/home.jsp www.planetdomain.com/ukcheap/home.jsp]
53 4 Anonymous
 ||'''Login'''     || 	nkrasnogor
54 4 Anonymous
 ||'''Password'''  || 	[BBSRC GRANT NUMBER]
55 1 Anonymous
56 4 Anonymous
57 1 Anonymous
== Domains ==
58 4 Anonymous
ProCKSI's main domain name is [http://www.procksi.net www.procksi.net], which is redirected to [http://procksi.cs.nott.ac.uk procksi.cs.nott.ac.uk], which is an alias for [http://procksi0.cs.nott.ac.uk procksi0.cs.nott.ac.uk]. All other domain names are redirected to its main domain name.
59 1 Anonymous
60 1 Anonymous
  	
61 1 Anonymous
62 4 Anonymous
 ||'''Domain Name''' ||'''Redirected to'''                                  ||'''Expires at'''
63 4 Anonymous
 ||www.procksi.net   ||[http://procksi.cs.nott.ac.uk procksi.cs.nott.ac.uk] ||11-01-2011 
64 4 Anonymous
 ||www.procksi.org   ||[http://www.procksi.net/ www.procksi.net]            ||11-01-2011 
65 4 Anonymous
 ||www.procksi.com   ||[http://www.procksi.net/ www.procksi.net]            ||11-01-2011 
66 4 Anonymous
 ||www.procksi.info  ||[http://www.procksi.net/ www.procksi.net]            ||11-01-2008 
67 1 Anonymous
68 1 Anonymous
 
69 1 Anonymous
70 1 Anonymous
71 1 Anonymous
== DNS Settings	 ==
72 1 Anonymous
The primary and secondary DNS servers must be set as follows:
73 1 Anonymous
74 4 Anonymous
 {{{
75 4 Anonymous
 Primary	ns1.iprimus.com.au
76 4 Anonymous
 Secondary	ns2.iprimus.com.au 
77 4 Anonymous
 }}}
78 1 Anonymous
79 1 Anonymous
80 1 Anonymous
The following changes must be made manually in ''Advanced DNS settings'':
81 1 Anonymous
82 4 Anonymous
 {{{
83 4 Anonymous
 CNAME    *.procksi.net	    procksi.cs.nott.ac.uk.
84 4 Anonymous
 CNAME    *.procksi.org	    www.procksi.net.
85 4 Anonymous
 CNAME    *.procksi.com	    www.procksi.net.
86 4 Anonymous
 CNAME    *.procksi.info    www.procksi.net.
87 4 Anonymous
 }}}
88 1 Anonymous
      
89 1 Anonymous
90 1 Anonymous
91 1 Anonymous
== Email Settings ==
92 1 Anonymous
The following email addresses must be created and redirected to ''procksi@cs.nott.ac.uk'', which must be available:
93 1 Anonymous
 	
94 4 Anonymous
 ||'''Email Address'''  ||'''Redirected to'''	
95 4 Anonymous
 ||admin@procksi.net	||procksi@cs.nott.ac.uk
96 4 Anonymous
 ||develop@procksi.net	||procksi@cs.nott.ac.uk   
97 4 Anonymous
 ||info@procksi.net	||procksi@cs.nott.ac.uk
98 4 Anonymous
 ||research@procksi.net	||procksi@cs.nott.ac.uk
99 4 Anonymous
 ||pbs@procksi.net	||procksi@cs.nott.ac.uk
100 4 Anonymous
 ||webmaster@procksi.net||procksi@cs.nott.ac.uk
101 1 Anonymous
102 1 Anonymous
The following changes must be made manually in ''Advanced DNS settings:''
103 1 Anonymous
104 4 Anonymous
 {{{
105 4 Anonymous
 MX    @.procksi.net    mailhost.planetdomain.com    10
106 4 Anonymous
 }}}
107 1 Anonymous
 
108 1 Anonymous
109 1 Anonymous
110 1 Anonymous
== Domain Usage Monitoring ==
111 1 Anonymous
The usage of ProCKSI's domains is monitored. 
112 1 Anonymous
113 8 Anonymous
 ||Provider	||[http://www.sitemeter.com www.sitemeter.com] 
114 4 Anonymous
 ||Login:	||s18procksi
115 4 Anonymous
 ||Password:	||FAKUIL
116 1 Anonymous
 
117 1 Anonymous
118 1 Anonymous
All HTML documents must contain the following code in order to be tracked correctly.
119 1 Anonymous
120 4 Anonymous
 {{{
121 1 Anonymous
<!-- Site Meter -->
122 1 Anonymous
	<script type="text/javascript" src="http://s18.sitemeter.com/js/counter.js?site=s18procksi">
123 1 Anonymous
	</script>
124 1 Anonymous
	<noscript>
125 1 Anonymous
		<a href="http://s18.sitemeter.com/stats.asp?site=s18procksi" target="_top">
126 1 Anonymous
			<img	src=[http://s18.sitemeter.com/meter.asp?site=s18procksi http://s18.sitemeter.com/meter.asp?site=s18procksi]
127 1 Anonymous
    				alt="Site Meter" border="0"/>
128 1 Anonymous
		</a>
129 1 Anonymous
	</noscript>
130 1 Anonymous
131 1 Anonymous
<!-- Copyright (c)2006 Site Meter -->
132 4 Anonymous
 }}}
133 1 Anonymous
 
134 1 Anonymous
135 1 Anonymous
136 1 Anonymous
= Data Management and Exchange  =
137 4 Anonymous
The head node and all compute nodes must be able to communicate with each other and exchange data. Therefore, a common user management and shared file system is necessary.
138 1 Anonymous
139 17 Anonymous
140 17 Anonymous
== Network Configuration == 
141 17 Anonymous
Make the following changes on the head node and each compute node:
142 17 Anonymous
143 17 Anonymous
 * Modify ''/etc/sysconfig/network'' in order to enable networking, set the hostname, and disable the Zero Configuration Newtworking:
144 17 Anonymous
  {{{
145 17 Anonymous
  NETWORKING=yes
146 17 Anonymous
  HOSTNAME=[Add Hostname]
147 17 Anonymous
  NOZEROCONF=yes
148 17 Anonymous
  }}}
149 17 Anonymous
150 17 Anonymous
 * Configure the internal network inferface (eth0) in ''/etc/sysconfig/networking/devices/ifcfg-eth0''(Example given for ''procksi0''):
151 17 Anonymous
  {{{
152 17 Anonymous
  DEVICE=eth0
153 17 Anonymous
  TYPE=Ethernet
154 17 Anonymous
  ONBOOT=yes
155 17 Anonymous
  BOOTPROTO=none
156 17 Anonymous
  HWADDR=[Add MAC Address]
157 17 Anonymous
  IPADDR=192.168.0.10
158 17 Anonymous
  BROADCAST=192.168.199.255
159 17 Anonymous
  GATEWAY=192.168.0.10
160 17 Anonymous
  NETWORK=192.168.199.0
161 17 Anonymous
  NETMASK=255.255.255.0
162 17 Anonymous
  }}}
163 17 Anonymous
164 17 Anonymous
 * Configure the external network inferface (eth1) in ''/etc/sysconfig/networking/devices/ifcfg-eth1'' (Example given for ''procksi0''):
165 17 Anonymous
  {{{
166 17 Anonymous
  DEVICE=eth1
167 17 Anonymous
  TYPE=Ethernet
168 17 Anonymous
  ONBOOT=yes
169 17 Anonymous
  BOOTPROTO=none
170 17 Anonymous
  HWADDR=[Add MAC Address]
171 17 Anonymous
  IPADDR=128.243.21.180
172 17 Anonymous
  BROADCAST=128.243.21.255
173 17 Anonymous
  GATEWAY=128.243.21.1
174 17 Anonymous
  NETWORK=128.243.21.0
175 17 Anonymous
  NETMASK=255.255.255.0
176 17 Anonymous
  }}}
177 17 Anonymous
178 17 Anonymous
 * Add a default gateway, and routes to the internal and external networks to the Routing Table:
179 17 Anonymous
  {{{
180 17 Anonymous
  /sbin/route add -net 192.168.199.0 netmask 255.255.255.0 dev eth0
181 17 Anonymous
  /sbin/route add -net 128.243.21.0  netmask 255.255.255.0 dev eth1
182 17 Anonymous
  /sbin/route add default gw 128.243.21.1 dev eth1
183 17 Anonymous
  }}}
184 1 Anonymous
185 1 Anonymous
186 1 Anonymous
== User Management ==
187 4 Anonymous
Make the following changes on the head node and each compute node:
188 1 Anonymous
189 4 Anonymous
 * Add a new user into ''/etc/passwd'': 
190 4 Anonymous
   {{{
191 4 Anonymous
   procksi:x:510:510:ProCKSI-Server:/home/procksi:/bin/bash
192 4 Anonymous
   }}}
193 4 Anonymous
 * Add an entry for the new user into ''/etc/shadow'' if desired: 
194 4 Anonymous
   {{{
195 4 Anonymous
   procksi:[ENCRYPTED_PASSWORD]:13483:0:99999:7:::
196 4 Anonymous
   }}}
197 4 Anonymous
 * Add a new group into ''/etc/group'', and add all users who should have access:
198 4 Anonymous
   {{{
199 4 Anonymous
   procksi:x:510:dxb
200 4 Anonymous
   }}} 
201 4 Anonymous
   The members for group procksi are now: ''procksi'', ''dxb''
202 1 Anonymous
 * Generate home directory for ''procksi''
203 1 Anonymous
  
204 1 Anonymous
205 1 Anonymous
206 1 Anonymous
== Firewall ==
207 4 Anonymous
 All network traffic using the internal (private) network is trusted and considered to be secure. So no firewall is needed on the internal network interface (''eth1'').
208 1 Anonymous
209 4 Anonymous
 * Modify ''/etc/sysconfig/iptables'' on the head node and on each compute node. [[BR]] 
210 4 Anonymous
   If ''eth1'' is on the private network, add
211 4 Anonymous
   {{{
212 4 Anonymous
   -A RH-Firewall-1-INPUT -i eth1 -j ACCEPT
213 4 Anonymous
   }}}
214 4 Anonymous
   directly after
215 4 Anonymous
   {{{
216 4 Anonymous
    -A RH-Firewall-1-INPUT -i lo -j ACCEPT 
217 4 Anonymous
   }}}
218 4 Anonymous
 * Restart the firewall on the head node and on each compute node:
219 4 Anonymous
   {{{
220 4 Anonymous
   /sbin/service iptables restart
221 4 Anonymous
   }}}
222 1 Anonymous
223 4 Anonymous
 Changes in the firewall settings regarding the external network interface (''eth0'') will be described in other sections where necessary.
224 1 Anonymous
225 1 Anonymous
 
226 1 Anonymous
227 1 Anonymous
== Host Name Resolution ==
228 1 Anonymous
As each node consists of two network interfaces (= multihomed host), the host name resolution must be configured correctly in order to prioritize the internal, trusted network for communication between different nodes.
229 1 Anonymous
230 4 Anonymous
 * The official hostname for each node must be set to the ''internal'' name of the machine in ''/etc/sysconfig/network''. This is an example for the head node:
231 4 Anonymous
 {{{
232 4 Anonymous
 HOSTNAME=procksi0-priv.cs.nott.ac.uk
233 4 Anonymous
 }}}
234 1 Anonymous
235 4 Anonymous
 The compute nodes must be named and configured accordingly.
236 4 Anonymous
 * Add the following to ''/etc/hosts'' on the head node:
237 4 Anonymous
 {{{
238 4 Anonymous
 127.0.0.1  procksi0-priv.cs.nott.ac.uk  procksi0-privlocalhost.localdomain  localhost
239 4 Anonymous
 }}}
240 4 Anonymous
 and alter the line for each compute node (procksi1, procksi2) accordingly.
241 4 Anonymous
 * Add the following to ''/etc/hosts'' on the head node and each compute node:
242 4 Anonymous
 {{{
243 4 Anonymous
 192.168.199.10  procksi0-priv.cs.nott.ac.uk  procksi0-priv
244 4 Anonymous
 192.168.199.11  procksi1-priv.cs.nott.ac.uk  procksi1-priv
245 4 Anonymous
 192.168.199.12  procksi2-priv.cs.nott.ac.uk  procksi2-priv
246 4 Anonymous
 }}}
247 4 Anonymous
 Edit ''/etc/host.conf'' so that local setting in ''/etc/hosts'' take precedence over DNS:
248 4 Anonymous
 {{{
249 4 Anonymous
 order hosts,bind
250 4 Anonymous
 }}}
251 4 Anonymous
252 1 Anonymous
== Data Access ==
253 1 Anonymous
 The head node hosts a RAID system of hard disks that will store all data generated by ProCKSI on all compute nodes and the head node itself. This partition must be accessible by all nodes and is exported as a network file system (NFS) therefore. Executables used by ProCKSI must be installed locally on each compute node for better performance.
254 1 Anonymous
255 4 Anonymous
 * Add the following to the end of ''/etc/exports'' on the head node (''procksi0''):
256 4 Anonymous
   {{{
257 4 Anonymous
   /home/procksi  procksi?-priv.cs.nott.ac.uk(sync,rw,no_root_squash)
258 4 Anonymous
   }}}
259 4 Anonymous
 * Add the following to the end of ''/etc/fstab'' on each compute node (''procksi1'', ''procksi2''): 
260 4 Anonymous
   {{{
261 4 Anonymous
   procksi0:/home/procksi       /home/procksi       nfs  bg,hard,intr  0 0
262 4 Anonymous
   procksi0:/usr/local/procksi	/usr/local/procksi  nfs  bg,hard,intr  0 0
263 4 Anonymous
   }}}
264 11 Anonymous
 * Tune NFS by increasing the number of nfsd threads. Modify ''/etc/sysconfig/nfs'':
265 11 Anonymous
   {{{
266 11 Anonymous
   RPCNFSDCOUNT=32
267 11 Anonymous
   }}}
268 4 Anonymous
 * Make the NFS daemons start at bootup. Enter at the command line of the head node and each compute node: 
269 4 Anonymous
   {{{
270 4 Anonymous
   /sbin/chkconfig  nfs  on
271 4 Anonymous
   }}}
272 4 Anonymous
 * Start the NFS daemons. Enter at the command line on the head node and on each compute node:
273 4 Anonymous
   {{{
274 4 Anonymous
   /sbin/service  nfsd  start
275 4 Anonymous
   }}}
276 4 Anonymous
 * Generate the following temp directory on the head node and each compute node, at best on a separate partition:
277 4 Anonymous
   {{{ 
278 4 Anonymous
   mkdir /scratch
279 4 Anonymous
   }}}
280 1 Anonymous
281 1 Anonymous
282 1 Anonymous
== Time Synchronisation ==
283 4 Anonymous
 The system time on all nodes must be synchronized as 
284 4 Anonymous
 a) data is written/read on/from a common, shared file system or even expires after a certain period of time and must be deleted, and 
285 4 Anonymous
 b) system logs are maintained independently but entries must be able to be associated with each other.
286 1 Anonymous
287 4 Anonymous
 * Add your own time server to ''/etc/ntp/ntpservers'': 
288 4 Anonymous
   {{{   
289 15 Anonymous
   128.243.21.16 #marian.cs.nott.ac.uk
290 15 Anonymous
   128.243.21.17 #robin.cs.nott.ac.uk
291 15 Anonymous
   128.243.21.18 #tuck.cs.nott.ac.uk
292 15 Anonymous
   128.243.21.19 #pat.cs.nott.ac.uk
293 1 Anonymous
   }}}
294 15 Anonymous
295 15 Anonymous
 * Modify ''/etc/ntp.conf'' in order to permit systems on the subnet to synchronise with this time service:
296 1 Anonymous
   {{{
297 15 Anonymous
   # -- CLIENT NETWORK -------
298 15 Anonymous
   restrict 192.168.199.0 mask 255.255.255.0 nomodify notrap
299 15 Anonymous
   broadcastclient
300 15 Anonymous
   }}}
301 15 Anonymous
 * Modify ''/etc/ntp.conf'' and add further time servers:
302 15 Anonymous
   {{{
303 15 Anonymous
   # --- OUR TIMESERVERS -----
304 15 Anonymous
   server 128.243.21.16 #marian.cs.nott.ac.uk
305 15 Anonymous
   restrict 128.243.21.16 mask 255.255.255.255 nomodify notrap noquery
306 15 Anonymous
   server 128.243.21.17 #robin.cs.nott.ac.uk
307 15 Anonymous
   restrict 128.243.21.17 mask 255.255.255.255 nomodify notrap noquery
308 15 Anonymous
   server 128.243.21.18 #tuck.cs.nott.ac.uk
309 15 Anonymous
   restrict 128.243.21.18 mask 255.255.255.255 nomodify notrap noquery
310 15 Anonymous
   server 128.243.21.19 #pat.cs.nott.ac.uk
311 15 Anonymous
   restrict 128.243.21.19 mask 255.255.255.255 nomodify notrap noquery
312 4 Anonymous
   }}}
313 4 Anonymous
 * Make the NTP daemon start at bootup. Enter at the command line:
314 4 Anonymous
   {{{
315 4 Anonymous
   /sbin/chkconfig  ntpd  on
316 4 Anonymous
   }}}
317 4 Anonymous
 * Start the NTP daemon. Enter at the command line:
318 4 Anonymous
   {{{
319 4 Anonymous
   /sbin/service ntpd start
320 1 Anonymous
   }}}
321 1 Anonymous
322 1 Anonymous
323 1 Anonymous
324 1 Anonymous
= Queuing System =
325 1 Anonymous
 The queueing system (resource manager) is the heart of the distributed computing on a cluster. It consists of three parts, the server, the scheduler, and the machine-oriented mini-server (MOM) executing the jobs.
326 1 Anonymous
327 1 Anonymous
 
328 1 Anonymous
329 1 Anonymous
 We are assuming the following configuration:
330 1 Anonymous
331 1 Anonymous
 ||PBS TORQUE|| version 2.1.6           ||server, basic scheduler, mom 
332 1 Anonymous
 ||MAUI      || version 3.2.6.p18	||scheduler
333 1 Anonymous
334 1 Anonymous
335 1 Anonymous
 The sources can be obtained from:
336 1 Anonymous
337 1 Anonymous
 ||PBS TORQUE ||http://www.clusterresources.com/pages/products/torque-resource-manager.php
338 1 Anonymous
 ||MAUI       ||http://www.clusterresources.com/pages/products/maui-cluster-scheduler.php
339 1 Anonymous
 
340 1 Anonymous
341 1 Anonymous
 The install directories for ''TORQUE'' and ''MAUI'' will be:
342 1 Anonymous
343 1 Anonymous
 ||PBS TORQUE ||''/usr/local/torque''
344 1 Anonymous
 ||MAUI       ||''/usr/local/maui''
345 1 Anonymous
 
346 1 Anonymous
347 1 Anonymous
348 1 Anonymous
== TORQUE ==
349 1 Anonymous
350 1 Anonymous
=== Register new services ===
351 4 Anonymous
  Edit ''/etc/services'' and add at the end:
352 1 Anonymous
  {{{
353 1 Anonymous
  # PBS/Torque services
354 1 Anonymous
355 1 Anonymous
  pbs           15001/tcp    # pbs_server
356 1 Anonymous
  pbs           15001/udp    # pbs_server
357 1 Anonymous
  pbs_mom       15002/tcp    # pbs_mom <-> pbs_server
358 1 Anonymous
  pbs_mom       15002/udp    # pbs_mom <-> pbs_server
359 1 Anonymous
  pbs_resmom    15003/tcp    # pbs_mom resource management
360 1 Anonymous
  pbs_resmom    15003/udp    # pbs_mom resource management
361 1 Anonymous
  pbs_sched     15004/tcp    # pbs scheduler (pbs_sched)
362 4 Anonymous
  pbs_sched     15004/udp    # pbs scheduler (pbs_sched)
363 1 Anonymous
  }}}
364 1 Anonymous
  
365 1 Anonymous
366 1 Anonymous
  
367 1 Anonymous
368 1 Anonymous
369 4 Anonymous
=== Setup and Configuration on the Head Node ===
370 4 Anonymous
Extract and build the distribution TORQUE on the head node. Configure server, monitor and clients to use secure file transfer (scp).
371 4 Anonymous
{{{
372 4 Anonymous
export TORQUECFG=/usr/local/torque
373 4 Anonymous
tar -xzvf TORQUE.tar.gz
374 4 Anonymous
cd TORQUE
375 4 Anonymous
}}}
376 4 Anonymous
Configuration for a 64bit machine with the following compiler options:
377 13 Anonymous
{{{
378 13 Anonymous
FFLAGS   = "-m64 -march=nocona -O3 -fPIC"
379 13 Anonymous
CFLAGS   = "-m64 -march=nocona -O3 -fPIC"
380 13 Anonymous
CXXFLAGS = "-m64 -march=nocona -O3 -fPIC"
381 4 Anonymous
LDFLAGS  = "-L/usr/local/lib -L/usr/local/lib64"
382 4 Anonymous
}}}
383 4 Anonymous
Configure, build, and install:
384 4 Anonymous
{{{
385 4 Anonymous
./configure  --enable-server  --enable-monitor  --enable-clients
386 4 Anonymous
             --with-server-home=$TORQUECFG  --with-server-name
387 4 Anonymous
             --with-rcp=scp  --disable-filesync
388 4 Anonymous
make
389 4 Anonymous
make install 
390 1 Anonymous
}}}
391 1 Anonymous
 
392 4 Anonymous
393 4 Anonymous
If not configures otherwise, binaries are installed in ''/usr/local/bin'' and ''/usr/local/sbin''. You should have these directories included in your path. But you can configure TORQUE to have the binaries in the default system directory with
394 1 Anonymous
{{{
395 4 Anonymous
./configure --bindir=/usr/bin --sbindir=/usr/sbin
396 1 Anonymous
}}}
397 1 Anonymous
 
398 4 Anonymous
399 4 Anonymous
Initialise/configure the queueing system's server daemon (pbs_server):
400 4 Anonymous
{{{
401 4 Anonymous
pbs_server -t create
402 1 Anonymous
}}}
403 4 Anonymous
404 4 Anonymous
Set the PBS operator and manager (must be a valid user name). 
405 4 Anonymous
{{{
406 4 Anonymous
qmgr
407 4 Anonymous
> set server_name = procksi0-priv.cs.nott.ac.uk
408 4 Anonymous
> set server scheduling = true
409 4 Anonymous
> set server operators += “root@procksi.cs.nott.ac.uk"
410 4 Anonymous
> set server operators += “procksi@ procksi.cs.nott.ac.uk"
411 4 Anonymous
> set server managers  += “root@ procksi.cs.nott.ac.uk"
412 4 Anonymous
> set server managers  += “procksi@ procksi.cs.nott.ac.uk"
413 1 Anonymous
}}}
414 4 Anonymous
415 4 Anonymous
Allow only ''procksi'' and ''root'' to submit jobs into the queue:
416 4 Anonymous
{{{
417 4 Anonymous
> set server acl_users = “root, procksi" 
418 4 Anonymous
> set server acl_user_enable = true
419 1 Anonymous
}}}
420 4 Anonymous
 
421 4 Anonymous
Set the default queue to ''batch''
422 4 Anonymous
{{{
423 4 Anonymous
> set server default_queue=batch
424 1 Anonymous
}}}
425 1 Anonymous
 
426 4 Anonymous
427 4 Anonymous
Set email address for email that is sent by PBS:
428 4 Anonymous
{{{
429 4 Anonymous
> set mail_from = pbs@procksi.net	
430 1 Anonymous
}}}
431 1 Anonymous
 
432 4 Anonymous
433 4 Anonymous
Allow submissions from compute hosts (only):
434 4 Anonymous
{{{
435 4 Anonymous
> set server allow_node_submit = true
436 4 Anonymous
> set server submit_hosts = procksi0-priv.cs.nott.ac.uk
437 4 Anonymous
                            procksi1-priv.cs.nott.ac.uk
438 4 Anonymous
                            procksi2-priv.cs.nott.ac.uk
439 1 Anonymous
}}}
440 1 Anonymous
 
441 4 Anonymous
442 1 Anonymous
Restrict nodes that can access the PBS server:
443 4 Anonymous
{{{
444 4 Anonymous
> set server acl_hosts = procksi0-priv.cs.nott.ac.uk
445 4 Anonymous
                         procksi1-priv.cs.nott.ac.uk                           
446 4 Anonymous
                         procksi2-priv.cs.nott.ac.uk
447 1 Anonymous
> set acl_host_enable = true
448 4 Anonymous
}}}
449 4 Anonymous
And set in ''torque.cfg'' in order to use the internal interface:
450 4 Anonymous
{{{
451 4 Anonymous
SERVERHOST              procksi0-priv.cs.nott.ac.uk
452 4 Anonymous
ALLOWCOMPUTEHOSTSUBMIT  true
453 1 Anonymous
}}}
454 1 Anonymous
 
455 4 Anonymous
456 4 Anonymous
Configure the main queue ''batch'':
457 4 Anonymous
{{{
458 4 Anonymous
> create queue batch queue_type=execution
459 4 Anonymous
> set queue batch started=true
460 4 Anonymous
> set queue batch enabled=true
461 4 Anonymous
> set queue batch resources_default.nodes=1
462 1 Anonymous
}}}
463 4 Anonymous
464 1 Anonymous
Configure queue ''test ''accordingly''. 
465 4 Anonymous
466 4 Anonymous
Configure default node to be used (see below):
467 4 Anonymous
{{{
468 4 Anonymous
> set server default_node = slave
469 1 Anonymous
}}}
470 4 Anonymous
471 1 Anonymous
Specify all compute nodes to be used by creating/editing ''$TORQUECFG/server_priv/nodes.'' This may include the same machine where pbs_server will run. If the compute nodes have more than one processor, just add np=X after the name with X being the number of processors. Add node attributes so that a subset of nodes can be requested during the submission stage.
472 4 Anonymous
{{{
473 4 Anonymous
procksi0-priv.cs.nott.ac.uk  np=1  procksi  head
474 4 Anonymous
procksi1-priv.cs.nott.ac.uk  np=2  procksi  slave  slave1
475 1 Anonymous
procksi2-priv.cs.nott.ac.uk  np=2  procksi  slave  slave2
476 1 Anonymous
}}}
477 4 Anonymous
478 1 Anonymous
Although the head node (''procksi0'') has two processors as well, we only allow one processor to be used for the queueing system as the other processor will be used for handling all frontend communication and I/O. (Make sure that hyperthreading technology is disabled on the head node and all compute nodes!)
479 1 Anonymous
480 4 Anonymous
 
481 4 Anonymous
Build packages for the compute nodes and copy them to each compute node:
482 4 Anonymous
{{{
483 1 Anonymous
cd $TORQUE
484 4 Anonymous
make packages
485 4 Anonymous
scp torque-package-mom-linux-i686.sh     procksi1|procksi2
486 4 Anonymous
scp torque-package-clients-linux-i686.sh procksi1|procksi2
487 13 Anonymous
}}}
488 1 Anonymous
ATTENTION: Does only work for the same architecture! Thus, building on the Intel head node and deploying to AMD slaves does not work!
489 1 Anonymous
490 1 Anonymous
491 4 Anonymous
=== Setup and Configuration on the Compute Nodes ===
492 4 Anonymous
Install prepared packes. A directory similar to ''$TORQUECFG'' will be automatically created.
493 13 Anonymous
{{{
494 13 Anonymous
pdsh torque-package-mom-linux-i686.sh --install
495 4 Anonymous
pdsh torque-package-clients-linux-i686.sh --install
496 1 Anonymous
}}}
497 1 Anonymous
498 4 Anonymous
 
499 4 Anonymous
Check if the nodes know the head node
500 4 Anonymous
{{{
501 4 Anonymous
$TORQUECFG/server_name
502 4 Anonymous
#procksi0-priv.cs.nott.ac.uk
503 1 Anonymous
}}}
504 4 Anonymous
505 1 Anonymous
Configure the compute nodes by creating/editing ''$TORQUECFG/mom_priv/config''. The first line specifies the PBS server, the second line specifies hosts which can be trusted to access mom services as non-root, and the last line allows to copy data via NFS without using SCP.
506 1 Anonymous
507 4 Anonymous
{{{
508 4 Anonymous
$pbsserver   procksi0-priv.cs.nott.ac.uk
509 4 Anonymous
$loglevel    255
510 4 Anonymous
$restricted  procksi?-priv.cs.nott.ac.uk
511 1 Anonymous
$usecp       procksi0-priv.cs.nott.ac.uk:/home/procksi  /home/procksi
512 1 Anonymous
}}}
513 4 Anonymous
514 4 Anonymous
Start the queueing system (manually) in the correct order:
515 4 Anonymous
 * Start the mom:
516 4 Anonymous
 {{{
517 4 Anonymous
 /usr/local/sbin/pbs_mom
518 4 Anonymous
 }}}
519 4 Anonymous
 * Kill the server:
520 4 Anonymous
 {{{
521 4 Anonymous
 /usr/local/sbin/qterm -t quick
522 4 Anonymous
 }}}
523 4 Anonymous
 * Start the server:
524 4 Anonymous
 {{{ 
525 4 Anonymous
 /usr/local/sbin/pbs_server
526 4 Anonymous
 }}}	
527 4 Anonymous
 * Start the scheduler:		
528 4 Anonymous
 {{{
529 4 Anonymous
 /usr/local/sbin/pbs_sched
530 1 Anonymous
 }}} 
531 4 Anonymous
532 1 Anonymous
If you want to use MAUI as the final scheduler, keep in mind to kill ''pbs_sched'' after testing the TORQURE installation.
533 1 Anonymous
534 4 Anonymous
535 4 Anonymous
Check that all nodes are properly configured and correctly reporting
536 4 Anonymous
{{{
537 4 Anonymous
qstat  -q
538 4 Anonymous
pbsnodes -a
539 1 Anonymous
}}}
540 1 Anonymous
541 1 Anonymous
 
542 1 Anonymous
543 1 Anonymous
544 4 Anonymous
=== Prologue and Epilogue Scripts ===
545 4 Anonymous
The ''prologue'' script is executed just before the submitted job starts. Here, it generates a unique temp directory for each job in ''/scratch''. It must be installed on each node:
546 4 Anonymous
{{{
547 4 Anonymous
cp $PROCKSI/install/prologue $TORQUECFG/mom_priv
548 4 Anonymous
chmod 500 $TORQUECFG/mom_priv/prologue
549 1 Anonymous
}}}
550 4 Anonymous
 
551 4 Anonymous
The ''epilogue'' script is executed right after the submitted job has ended. Here, it deletes the job's temp directory from ''/scratch.'' It must be installed on each node:
552 4 Anonymous
{{{
553 4 Anonymous
cp $PROCKSI/install/epilogue $TORQUECFG/mom_priv
554 4 Anonymous
chmod 500 $TORQUECFG/mom_priv/epilogue
555 4 Anonymous
}}}
556 1 Anonymous
  
557 1 Anonymous
558 1 Anonymous
559 1 Anonymous
== MAUI ==
560 1 Anonymous
561 4 Anonymous
=== Register new services ===
562 1 Anonymous
Edit ''/etc/services'' and add at the end:
563 4 Anonymous
{{{ 
564 4 Anonymous
# PBS/MAUI services
565 4 Anonymous
pbs_maui  42559/tcp    # pbs scheduler (maui)
566 1 Anonymous
pbs_maui  42559/udp    # pbs scheduler (maui)
567 1 Anonymous
}}}
568 1 Anonymous
569 1 Anonymous
570 4 Anonymous
=== Setup and Configuration on the Head Node ===
571 4 Anonymous
Extract and build the distribution MAUI.
572 4 Anonymous
{{{
573 4 Anonymous
export MAUIDIR=/usr/local/maui
574 4 Anonymous
tar -xzvf MAUI.tar.gz
575 4 Anonymous
cd TORQUE
576 1 Anonymous
}}}
577 4 Anonymous
578 4 Anonymous
Configuration for a 64bit machine with the following compiler options:
579 4 Anonymous
{{{
580 18 Anonymous
FFLAGS   = “-m64 -march=[Add Architecture] -O3 -fPIC"
581 18 Anonymous
CFLAGS   = “-m64 -march=[Add Architecture] -O3 -fPIC"
582 18 Anonymous
CXXFLAGS = “-m64 -march=[Add Architecture] -O3 -fPIC"
583 4 Anonymous
LDFLAGS	 = “-L/usr/local/lib -L/usr/local/lib64"
584 4 Anonymous
}}}
585 18 Anonymous
Attention: For Intel Xenon processors use ''-march=nocona'', for AMD Opteron processors use ''-march=opteron''.
586 4 Anonymous
587 4 Anonymous
Configure, build, and install:
588 4 Anonymous
{{{
589 4 Anonymous
./configure --with-pbs=$TORQUECFG --with-spooldir=$MAUIDIR
590 4 Anonymous
make
591 4 Anonymous
make install 
592 1 Anonymous
}}}
593 4 Anonymous
594 1 Anonymous
Fine-tune MAUI in $''MAUIDIR/maui.cfg'':
595 4 Anonymous
{{{
596 1 Anonymous
SERVERHOST            procksi0-priv.cs.nott.ac.uk
597 4 Anonymous
598 4 Anonymous
# primary admin must be first in list
599 4 Anonymous
ADMIN1                procksi
600 1 Anonymous
ADMIN1                root
601 4 Anonymous
        
602 4 Anonymous
# Resource Manager Definition
603 4 Anonymous
RMCFG[PROCKSI0-PRIV.CS.NOTT.AC.UK]		
604 4 Anonymous
TYPE=PBS 			
605 4 Anonymous
HOST=PROCKSI0-PRIV.CS.NOTT.AC.UK 		
606 4 Anonymous
PORT=15001
607 4 Anonymous
EPORT=15004	[CAN BE ALTERNATIVELY: 15017 - TRY!!!]
608 4 Anonymous
SERVERPORT  42559
609 1 Anonymous
SERVERMODE  NORMAL
610 4 Anonymous
611 4 Anonymous
# Node Allocation:
612 4 Anonymous
NODEALLOCATIONPOLICY  PRIORITY
613 1 Anonymous
NODECFG[DEFAULT] PRIORITY='- JOBCOUNT'
614 1 Anonymous
}}}
615 4 Anonymous
616 4 Anonymous
Configure attributes of compute nodes:
617 4 Anonymous
{{{
618 4 Anonymous
qmgr
619 4 Anonymous
> set node procksi0.cs.nott.ac.uk properties = “procksi, head"
620 4 Anonymous
> set node procksi1.cs.nott.ac.uk properties = “procksi, slave"
621 4 Anonymous
> set node procksi0.cs.nott.ac.uk properties = “procksi, slave"
622 1 Anonymous
}}}
623 4 Anonymous
624 1 Anonymous
Request job to be run on specific nodes (on submission):
625 4 Anonymous
626 4 Anonymous
 * Run on any compute node: 	
627 4 Anonymous
 {{{
628 4 Anonymous
 qsub -q batch -l nodes=1:procksi
629 4 Anonymous
 }}}
630 4 Anonymous
 * Run on any slave node:	
631 4 Anonymous
 {{{
632 4 Anonymous
 qsub -q batch -l nodes=1:slave
633 4 Anonymous
 }}}
634 4 Anonymous
 * Run on head node:		
635 4 Anonymous
 {{{
636 4 Anonymous
 qsub -q batch -l nodes=1:head
637 1 Anonymous
 }}}
638 4 Anonymous
639 1 Anonymous
Start the MAUI scheduler manually. Make sure that pbs_sched is not running any longer.
640 4 Anonymous
641 4 Anonymous
 * Start the scheduler:
642 4 Anonymous
 {{{
643 4 Anonymous
 /usr/local/sbin/maui
644 1 Anonymous
 }}}
645 1 Anonymous
 
646 4 Anonymous
647 4 Anonymous
Make the entire queueing system start at bootup:
648 4 Anonymous
{{{
649 4 Anonymous
cp /home/procksi/latest/install/pbs_head-node /etc/init.d/pbs 
650 4 Anonymous
/sbin/chkconfig --add pbs
651 4 Anonymous
/sbin/chkconfig pbs on
652 1 Anonymous
}}}
653 1 Anonymous
654 1 Anonymous
655 4 Anonymous
=== Setup and Configuration on the Compute Nodes ===
656 1 Anonymous
__Attention:__ 	If the head node is a compute node itself, do NOT proceed with the following steps as the head node was configured in the previous step!
657 1 Anonymous
    
658 4 Anonymous
659 4 Anonymous
Make the entire queueing system start at bootup:
660 4 Anonymous
{{{
661 4 Anonymous
cp /home/procksi/latest/install/pbs_compute-node /etc/init.d/pbs
662 4 Anonymous
/sbin/chkconfig --add pbs
663 4 Anonymous
/sbin/chkconfig pbs on
664 1 Anonymous
}}}
665 1 Anonymous
666 1 Anonymous
667 1 Anonymous
= Cluster Monitoring =
668 1 Anonymous
669 4 Anonymous
== Ganglia ==
670 1 Anonymous
“Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids."
671 14 Anonymous
672 14 Anonymous
 * Download the latest release of the ''Ganglia Monitoring Core'' from [http://ganglia.sourceforge.net/ http://ganglia.sourceforge.net].
673 14 Anonymous
 * Install Ganglia into ''/usr/local/ganglia'', its web frontend into ''/usr/local/ganglia/html/', and its databases into ''/usr/local/ganglia/rrds/'.
674 1 Anonymous
 * Install the ''Ganglia Monitoring Daemon'' (gmond) on each node, and the ''Ganglia Meta Daemon'' (gmetad) on the the head node.
675 14 Anonymous
676 14 Anonymous
=== Ganglia Monitoring Daemon ===
677 14 Anonymous
 * Configure the ''Ganglia Moditoring Daemon'' in ''/etc/gmond.conf'':
678 14 Anonymous
  * Set the name of the cluster: 
679 14 Anonymous
  {{{
680 14 Anonymous
  cluster {
681 14 Anonymous
    name = "ProCKSI"
682 14 Anonymous
  }
683 14 Anonymous
  }}}
684 14 Anonymous
  * Set the IP address and port for multicast data exchange:
685 14 Anonymous
  {{{
686 14 Anonymous
  udp_send_channel {
687 14 Anonymous
    mcast_join = 239.2.11.71
688 14 Anonymous
    port = 8649
689 14 Anonymous
  }
690 14 Anonymous
  udp_recv_channel {
691 14 Anonymous
    mcast_join = 239.2.11.71
692 14 Anonymous
    port = 8649
693 14 Anonymous
    bind = 239.2.11.71
694 14 Anonymous
  }
695 14 Anonymous
  }}}
696 16 Anonymous
 * Add additional route for correct data exchange via multicast using the ''internal'' interface (''eth0''). Modify ''/etc/inid.d/gmond'':
697 1 Anonymous
  {{{
698 16 Anonymous
   #Add multicast route to internal interface
699 16 Anonymous
   /sbin/route add -host 239.2.11.71 dev eth0
700 16 Anonymous
   daemon $GMOND
701 16 Anonymous
  }}}
702 16 Anonymous
  {{{
703 16 Anonymous
   #Remove multicast route to internal interface
704 16 Anonymous
   /sbin/route delete -host 239.2.11.71 dev eth0
705 16 Anonymous
   killproc gmond
706 14 Anonymous
  }}}
707 14 Anonymous
 * Make the Ganglia Monitoring Daemon start at bootup.
708 14 Anonymous
  {{{
709 14 Anonymous
   /sbin/chkconfig  gmond  on
710 14 Anonymous
  }}}
711 14 Anonymous
 * Start the Ganglia Monitoring Daemon:
712 14 Anonymous
  {{{
713 14 Anonymous
   /sbin/service  gmond  start
714 14 Anonymous
  }}}
715 14 Anonymous
  
716 14 Anonymous
=== Ganglia Meta Daemon ===
717 14 Anonymous
 * Install and configure the ''Ganglia Meta Daeomn'' (gmetad) on the head node.
718 14 Anonymous
 * Make the Ganglia Meta Daemon start at bootup.
719 14 Anonymous
  {{{
720 14 Anonymous
   /sbin/chkconfig  gmetad  on
721 14 Anonymous
  }}}
722 14 Anonymous
 * Start the Meta Meta Daemon:
723 14 Anonymous
  {{{
724 14 Anonymous
   /sbin/service  gmetad  start
725 14 Anonymous
  }}}
726 14 Anonymous
727 1 Anonymous
728 6 Anonymous
  
729 6 Anonymous
=== Further Customisation ===
730 6 Anonymous
In order to display more fine-grained time intervals, edit the following files in ''/usr/local/ganglia/html/'':
731 6 Anonymous
 * '''header.php'''
732 6 Anonymous
 {{{
733 6 Anonymous
  if (!$physical) {
734 6 Anonymous
   $context_ranges[]="10 minutes";
735 6 Anonymous
   $context_ranges[]="20 minutes";
736 6 Anonymous
   $context_ranges[]="30 minutes";
737 6 Anonymous
   $context_ranges[]="1 hour";
738 6 Anonymous
   $context_ranges[]="2 hours";
739 6 Anonymous
   $context_ranges[]="4 hours";
740 6 Anonymous
   $context_ranges[]="8 hours";
741 6 Anonymous
   $context_ranges[]="12 hours";
742 6 Anonymous
   $context_ranges[]="1 day";
743 6 Anonymous
   $context_ranges[]="2 days";
744 6 Anonymous
   $context_ranges[]="week";
745 6 Anonymous
   $context_ranges[]="month";
746 6 Anonymous
   $context_ranges[]="year";
747 6 Anonymous
 }}}
748 6 Anonymous
749 6 Anonymous
 * '''get_context.php'''
750 6 Anonymous
 {{{
751 6 Anonymous
  switch ($range) {
752 6 Anonymous
   case "10 minutes":   $start = -600; break;
753 6 Anonymous
   case "20 minutes":   $start = -1200; break;
754 6 Anonymous
   case "30 minutes":   $start = -1800; break;
755 6 Anonymous
   case "1 hour":       $start = -3600; break;
756 6 Anonymous
   case "2 hours":      $start = -7200; break;
757 6 Anonymous
   case "4 hours":      $start = -14400; break;
758 6 Anonymous
   case "8 hours":      $start = -28800; break;
759 6 Anonymous
   case "12 hours":     $start = -43200; break;
760 6 Anonymous
   case "1 day":        $start = -86400; break;
761 6 Anonymous
   case "2 days":       $start = -172800; break;
762 6 Anonymous
   case "week":         $start = -604800; break;
763 6 Anonymous
   case "month":        $start = -2419200; break;
764 6 Anonymous
   case "year":         $start = -31449600; break;
765 1 Anonymous
 }}}
766 1 Anonymous
767 1 Anonymous
768 4 Anonymous
== !JobMonarch ==
769 1 Anonymous
!JobMonarch is an add-on to Ganglia which provides PBS job monitoring through the web browser.
770 4 Anonymous
771 1 Anonymous
See [http://subtrac.rc.sara.nl/oss/jobmonarch/wiki/Documentation http://subtrac.rc.sara.nl/oss/jobmonarch/wiki/Documentation] for information on requirements, configuration and installation.
772 1 Anonymous
773 1 Anonymous
774 1 Anonymous
= Additional Software  =
775 1 Anonymous
776 1 Anonymous
== PERL Libraries ==
777 1 Anonymous
Please make sure that the following libraries are installed in the official library directory and install all depending libraries, if necessary. For ''Image::Magick'', use the corresponding libraries that come with the main installation.
778 1 Anonymous
779 1 Anonymous
780 18 Anonymous
||Error||  0.17008
781 18 Anonymous
||Config::Simple||4.58
782 18 Anonymous
||DBI||1.53||Remember to install DBD::mysql from the OS sources, too!
783 18 Anonymous
||CGI||3.25
784 18 Anonymous
||CGI::Session||4.13
785 18 Anonymous
||Data::!FormValidator||4.40
786 18 Anonymous
||HTML::Template||2.8
787 18 Anonymous
||HTML::Template::Pro||0.64
788 18 Anonymous
||MIME::Lite||
789 18 Anonymous
||!FreezeThaw||
790 18 Anonymous
||Storable||
791 18 Anonymous
||Time::Format||
792 18 Anonymous
||IMAP::Client||
793 18 Anonymous
||Time::Local||
794 18 Anonymous
||Clone||
795 18 Anonymous
||SOAP::Lite||
796 18 Anonymous
||Inline::Python||
797 1 Anonymous
798 1 Anonymous
	
799 4 Anonymous
PERL modules are installed best with the CPAN shell:
800 4 Anonymous
{{{
801 4 Anonymous
perl -MCPAN -eshell
802 1 Anonymous
}}}
803 1 Anonymous
804 1 Anonymous
805 1 Anonymous
806 1 Anonymous
807 1 Anonymous
== Third Party Executables for ProCKSI ==
808 4 Anonymous
Generate the following directories on each compute node to contain important executables:
809 4 Anonymous
{{{
810 5 Anonymous
/usr/local/procksi/Cluster/
811 4 Anonymous
/usr/local/procksi/DaliLite
812 5 Anonymous
/usr/local/procksi/MaxCMO
813 4 Anonymous
/usr/local/procksi/MolScript
814 1 Anonymous
}}}
815 1 Anonymous
816 4 Anonymous
For the following installation of the ProCKSI server components, the following executables must be present:
817 4 Anonymous
{{{
818 5 Anonymous
/usr/local/procksi/Cluster/qclust
819 4 Anonymous
/usr/local/procksi/DaliLite/DaliLite
820 4 Anonymous
/usr/local/procksi/MaxCMO/ProtCompVNS
821 4 Anonymous
/usr/local/procksi/molauto
822 4 Anonymous
/usr/local/procksi/molscript
823 1 Anonymous
}}}
824 1 Anonymous
825 1 Anonymous
826 1 Anonymous
== Image Software ==
827 1 Anonymous
828 4 Anonymous
=== Installation ===
829 4 Anonymous
   * Install ''!ImageMagick'' from [http://www.imagemagick.org www.imagemagick.org] if not already installed.
830 1 Anonymous
   * Install ''!MolScript'' from [http://www.avatar.se/molscript www.avatar.se/molscript]. Please link the MesaGL libraries instead of the OpenGL libraries; a modified makefile can be found under [source:ProCKSI/install/Makefile.molscript]
831 1 Anonymous
 
832 1 Anonymous
833 1 Anonymous
834 1 Anonymous
=== Virtual Display  ===
835 1 Anonymous
!MolScript needs an X display in order to generate images (jpg, gif, …). Its possible to use the console X display for the OpenGL bits even if it is not logged in. Therefore, ''procksi'' must be authenticated and allowed to use this X display virtually.
836 1 Anonymous
837 19 Anonymous
Get and unpack the ProCKSI x-authentication patch from [repos:Externals/Cluster/xauth.tgz].
838 1 Anonymous
839 4 Anonymous
On each node copy magic cookie file for x-authentication:
840 4 Anonymous
{{{
841 19 Anonymous
cp :0.Xauth /var/gdm/:0.Xauth
842 1 Anonymous
}}}
843 1 Anonymous
844 4 Anonymous
On each node copy scripts for automatic x-authentication:
845 4 Anonymous
{{{
846 19 Anonymous
cp procksixauth /usr/local/sbin/procksixauth
847 19 Anonymous
cp :0 /etc/gdm/Init/:0
848 1 Anonymous
}}}
849 1 Anonymous
850 4 Anonymous
Restart the X display manager for the changes to take effect:
851 4 Anonymous
{{{
852 4 Anonymous
/usr/sbin/gdm-restart
853 1 Anonymous
}}}
854 1 Anonymous
855 4 Anonymous
856 4 Anonymous
The virtual X display can be used with unix socket '':0'', e.g.:
857 4 Anonymous
{{{
858 4 Anonymous
molauto protein.pdb | DISPLAY=unix:0.0 molscript -jpeg -out protein.jpeg
859 1 Anonymous
}}}
860 1 Anonymous
861 1 Anonymous
862 1 Anonymous
= ProCKSI Server Component =
863 1 Anonymous
864 1 Anonymous
== Installation and Basic Configuration ==
865 1 Anonymous
This section describes the installation and configuration of the ProCKSI server component. This includes the configuration of the web server and the database.
866 1 Anonymous
867 1 Anonymous
The server component will be installed into the home directory of the user ''procksi''. Therefore, make sure that it is on a separate partition / hard disk with much space. In the best case, this will be a RAID system.
868 1 Anonymous
869 4 Anonymous
Get the latest release of the server component, referred to in the following as ''RELEASE'', and extract it into ''/home/procksi/RELEASE.'' 
870 4 Anonymous
{{{
871 4 Anonymous
tar -xvzf RELEASE.tgz
872 1 Anonymous
}}}
873 1 Anonymous
874 4 Anonymous
Create a softlink from ''RELEASE'' to a generic directory ''/home/procksi/latest''. This will be accessed by the web server:
875 4 Anonymous
{{{
876 4 Anonymous
ln -s /home/procksi/RELEASE /home/procksi/latest
877 1 Anonymous
}}}
878 1 Anonymous
879 4 Anonymous
In order to test new versions, referred in the following as ''TEST'', before taking them officially online, create a softlink from ''TEST'' to a generic directory ''/home/procksi/test''. This will be accessed by the web server:
880 4 Anonymous
{{{
881 4 Anonymous
ln -s /home/procksi/TEST /home/procksi/test
882 1 Anonymous
}}}
883 1 Anonymous
884 1 Anonymous
In case that you want to bring the test version online, just delete the softlinks and repeat the previous steps for the new release. Please make sure that always both softlinks exist!
885 1 Anonymous
886 4 Anonymous
Change into the administrative directory and run the installation script. Change the server settings, database settings and directory settings if necessary.
887 4 Anonymous
{{{
888 4 Anonymous
cd /home/procksi/latest/admin
889 4 Anonymous
./configure.pl
890 1 Anonymous
}}}
891 1 Anonymous
892 1 Anonymous
893 1 Anonymous
== Database Configuration ==
894 4 Anonymous
Make sure that the MySQL daemon is running, and that it will start at boot time:
895 4 Anonymous
{{{
896 4 Anonymous
/sbin/service mysqld start
897 4 Anonymous
/sbin/chkconfig --add mysqld
898 4 Anonymous
/sbin/chkconfig mysqld on
899 1 Anonymous
}}}
900 1 Anonymous
901 4 Anonymous
Make sure that you have access to the MySQL database management as ''root'' and login as user ''root ''with the corresponding password:
902 4 Anonymous
{{{
903 4 Anonymous
mysql -u root -p
904 1 Anonymous
}}}
905 1 Anonymous
906 4 Anonymous
Create new mysql users ''procksi_user ''and ''procksi_admin'':
907 4 Anonymous
{{{
908 4 Anonymous
USE mysql;
909 4 Anonymous
INSERT INTO user SET host='localhost', user='procksi_user', password=PASSWORD('''password_procksi_user''');
910 4 Anonymous
INSERT INTO user SET host='localhost', user='procksi_admin', password=PASSWORD('''password_procksi_admin''');
911 4 Anonymous
FLUSH PRIVILEGES;
912 1 Anonymous
}}}
913 4 Anonymous
914 1 Anonymous
Repeat these steps analogously for ''procksi0-priv'', ''procksi1-priv'', and ''procksi2-priv.''
915 1 Anonymous
916 1 Anonymous
917 1 Anonymous
918 4 Anonymous
Create a new database:
919 4 Anonymous
{{{
920 4 Anonymous
CREATE DATABASE procksi_latest;
921 1 Anonymous
}}}
922 1 Anonymous
923 1 Anonymous
924 4 Anonymous
Give privileges to users  ''procksi_user ''and ''procksi_admin'' for all compute nodes:
925 4 Anonymous
{{{
926 4 Anonymous
GRANT ALL ON procksi_latest.* TO procksi_admin@localhost WITH GRANT OPTION;
927 4 Anonymous
GRANT SELECT, UPDATE, INSERT, DELETE ON procksi_latest.* TO procksi_user@localhost;
928 4 Anonymous
GRANT ALL ON procksi_latest.* TO procksi_admin@procksi0.cs.nott.ac.uk WITH GRANT OPTION;
929 4 Anonymous
GRANT SELECT, UPDATE, INSERT, DELETE ON procksi_latest.* TO procksi_user@procksi0.cs.nott.ac.uk;
930 4 Anonymous
GRANT SELECT, UPDATE, INSERT, DELETE ON procksi_latest.* TO procksi_user@procksi1.cs.nott.ac.uk;
931 4 Anonymous
GRANT SELECT, UPDATE, INSERT, DELETE ON procksi_latest.* TO procksi_user@procksi2.cs.nott.ac.uk;
932 4 Anonymous
FLUSH PRIVILEGES;
933 1 Anonymous
}}}
934 1 Anonymous
935 1 Anonymous
If you change the password for ''procksi_user'', please make sure that you also change it in ''/home/procksi/latest/config/main.ini''
936 1 Anonymous
937 4 Anonymous
Import the main database ''procksi_latest'' from the backup given in ''/home/procksi/RELEASE/admin'':
938 4 Anonymous
{{{
939 4 Anonymous
msysql -u procksi_admin -p procksi_latest < procksi_latest.sql
940 1 Anonymous
}}}
941 1 Anonymous
942 1 Anonymous
In order to create a database ''procksi_test'' for the test version, repeat the previous steps and set the privileges accordingly.
943 1 Anonymous
   
944 1 Anonymous
945 1 Anonymous
946 1 Anonymous
== Web Server Configuration ==
947 1 Anonymous
Make the following changes to the Apache configuration file (''/etc/httpd/conf/httpd.conf''):
948 4 Anonymous
{{{
949 4 Anonymous
User  			procksi
950 4 Anonymous
Group 			procksi
951 4 Anonymous
ServerAdmin		procksi@cs.nott.ac.uk
952 4 Anonymous
ServerName 		procksi.cs.nott.ac.uk
953 4 Anonymous
DocumentRoot /home/procksi/latest/html
954 4 Anonymous
<Directory /home/procksi/latest/html">
955 4 Anonymous
   AllowOverride AuthConfig
956 4 Anonymous
</Directory>
957 4 Anonymous
LogFormat "%t %h %l %u \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"" combined
958 4 Anonymous
LogFormat "%t %h %l %u \"%r\" %>s %b" common
959 4 Anonymous
LogFormat "%t %{Referer}i -> %U" referer
960 1 Anonymous
LogFormat "%t %{User-agent}i" agent
961 4 Anonymous
962 4 Anonymous
#Exclude Logging of Ganglia Requests
963 1 Anonymous
SetEnvIf Request_URI "ganglia" ganglia
964 4 Anonymous
965 4 Anonymous
#
966 4 Anonymous
# The location and format of the access logfile (Common Logfile Format).
967 4 Anonymous
# If you do not define any access logfiles within a <VirtualHost>
968 4 Anonymous
# container, they will be logged here.  Contrariwise, if you *do*
969 4 Anonymous
# define per-<VirtualHost> access logfiles, transactions will be
970 4 Anonymous
# logged therein and *not* in this file.
971 1 Anonymous
#
972 5 Anonymous
973 1 Anonymous
CustomLog /home/procksi/latest/logs/access.log common env=!ganglia
974 4 Anonymous
975 4 Anonymous
#
976 4 Anonymous
# If you would like to have agent and referer logfiles, uncomment the
977 4 Anonymous
# following directives.
978 1 Anonymous
#
979 4 Anonymous
980 4 Anonymous
CustomLog /home/procksi/latest/logs/referer.log referer env=!ganglia
981 1 Anonymous
CustomLog /home/procksi/latest/logs/agent.log agent env=!ganglia
982 4 Anonymous
983 4 Anonymous
#
984 4 Anonymous
# For a single logfile with access, agent, and referer information
985 4 Anonymous
# (Combined Logfile Format), use the following directive:
986 4 Anonymous
#
987 1 Anonymous
#CustomLog logs/access_log combined env=!ganglia
988 4 Anonymous
989 1 Anonymous
ScriptAlias /cgi-bin/ /home/procksi/latest/cgi-bin/
990 4 Anonymous
991 4 Anonymous
<Directory "/home/procksi/latest/cgi-bin">
992 4 Anonymous
    AllowOverride None
993 4 Anonymous
    Options None
994 4 Anonymous
    Order allow,deny
995 4 Anonymous
    Allow from all
996 1 Anonymous
</Directory>
997 4 Anonymous
998 4 Anonymous
Alias  /data/ 	  /home/procksi/latest/data/
999 4 Anonymous
Alias  /images/   /home/procksi/latest/images/
1000 4 Anonymous
Alias  /styles/   /home/procksi/latest/styles/
1001 4 Anonymous
Alias  /applets/  /home/procksi/latest/applets/
1002 7 Anonymous
Alias  /scripts/  /home/procksi/latest/scripts/
1003 7 Anonymous
Alias  /ganglia/  /usr/local/ganglia/html/
1004 7 Anonymous
1005 7 Anonymous
#Redirection
1006 1 Anonymous
Redirect /trac https://psiren.cs.nott.ac.uk/projects/procksi/
1007 4 Anonymous
1008 4 Anonymous
AddLanguage de .de
1009 4 Anonymous
AddLanguage en .en
1010 4 Anonymous
AddLanguage es .es
1011 4 Anonymous
AddLanguage fr .fr 
1012 1 Anonymous
LanguagePriority en es de fr
1013 4 Anonymous
1014 4 Anonymous
Alias /errordocs/ "/home/procksi/errordocs"
1015 4 Anonymous
<IfModule mod_negotiation.c>
1016 4 Anonymous
    <IfModule mod_include.c>
1017 4 Anonymous
        <Directory /home/procksi/errordocs>
1018 4 Anonymous
            AllowOverride none
1019 4 Anonymous
            Options MultiViews IncludesNoExec FollowSymLinks
1020 4 Anonymous
            AddType text/html .shtml
1021 5 Anonymous
            <FilesMatch "\.shtml[.$]">
1022 4 Anonymous
                SetOutputFilter INCLUDES
1023 4 Anonymous
            </FilesMatch>
1024 1 Anonymous
        </Directory>
1025 4 Anonymous
1026 4 Anonymous
        ErrorDocument 400 /errordocs/400_BAD_REQUEST
1027 4 Anonymous
        ErrorDocument 401 /errordocs/401_UNAUTHORIZED
1028 4 Anonymous
        ErrorDocument 403 /errordocs/403_FORBIDDEN
1029 4 Anonymous
        ErrorDocument 404 /errordocs/404_NOT_FOUND
1030 4 Anonymous
        ErrorDocument 405 /errordocs/405_METHOD_NOT_ALLOWED
1031 4 Anonymous
        ErrorDocument 406 /errordocs/406_NOT_ACCEPTABLE
1032 4 Anonymous
        ErrorDocument 408 /errordocs/408_REQUEST_TIMEOUT
1033 4 Anonymous
        ErrorDocument 410 /errordocs/410_GONE
1034 4 Anonymous
        ErrorDocument 411 /errordocs/411_LENGTH_REQUIRED
1035 4 Anonymous
        ErrorDocument 412 /errordocs/412_PRECONDITION_FAILED
1036 4 Anonymous
        ErrorDocument 413 /errordocs/413_REQUEST_ENTITY_TOO_LARGE
1037 4 Anonymous
        ErrorDocument 414 /errordocs/414_REQUEST_URI_TOO_LARGE
1038 4 Anonymous
        ErrorDocument 415 /errordocs/415_UNSUPPORTED_MEDIA_TYPE
1039 4 Anonymous
        ErrorDocument 500 /errordocs/500_INTERNAL_SERVER_ERROR
1040 4 Anonymous
        ErrorDocument 501 /errordocs/501_NOT_IMPLEMENTED
1041 4 Anonymous
        ErrorDocument 502 /errordocs/502_BAD_GATEWAY
1042 4 Anonymous
        ErrorDocument 503 /errordocs/503_SERVICE_UNAVAILABLE
1043 4 Anonymous
        ErrorDocument 506 /errordocs/506_VARIANT_ALSO_VARIES
1044 4 Anonymous
    </IfModule>
1045 1 Anonymous
</IfModule>
1046 4 Anonymous
1047 4 Anonymous
<Location /server-status>
1048 4 Anonymous
    SetHandler server-status
1049 4 Anonymous
    Order deny,allow
1050 4 Anonymous
    Deny from all
1051 4 Anonymous
    Allow from .cs.nott.ac.uk
1052 1 Anonymous
</Location>
1053 4 Anonymous
1054 4 Anonymous
<Location /server-info>
1055 4 Anonymous
    SetHandler server-info
1056 4 Anonymous
    Order deny,allow
1057 4 Anonymous
    Deny from all
1058 4 Anonymous
    Allow from .cs.nott.ac.uk
1059 1 Anonymous
</Location>
1060 1 Anonymous
}}} 
1061 4 Anonymous
1062 4 Anonymous
Make sure that the server accepts connections to port 80. Check the firewall settings in  ''/etc/sysconfig/iptables'' for the following entry:
1063 4 Anonymous
{{{
1064 4 Anonymous
-A RH-Firewall-1-INPUT -m state --state NEW -m tcp -p tcp --dport 80 -j ACCEPT
1065 1 Anonymous
}}}
1066 1 Anonymous
1067 1 Anonymous
1068 4 Anonymous
Make sure that the apache daemon is running, and that it will start at boot time:
1069 4 Anonymous
{{{
1070 4 Anonymous
/sbin/service httpd start
1071 4 Anonymous
/sbin/chkconfig --add httpd
1072 4 Anonymous
/sbin/chkconfig httpd on
1073 1 Anonymous
}}}
1074 1 Anonymous
1075 1 Anonymous
1076 1 Anonymous
1077 1 Anonymous
== Email Configuration ==
1078 1 Anonymous
The ProCKSI server component send emails to the user for several occasions. In order to make sure that they are delivered correctly even when the internet is temporarily not available, a local SMTP server (''postfix'') is set up. This will accect emails from the private network only, store them temporarily (if necessary), and forward them to an email relay server.
1079 1 Anonymous
1080 1 Anonymous
1081 1 Anonymous
1082 4 Anonymous
Make sure that ''postfix'' is the default mailing software (and not ''sendmail''!).
1083 4 Anonymous
{{{
1084 4 Anonymous
system-switch-mail -activate postfix
1085 1 Anonymous
}}}
1086 1 Anonymous
1087 1 Anonymous
Make the following changes to the ''postfix ''configuration file (''/etc/postfix/main.cf''):
1088 4 Anonymous
{{{
1089 4 Anonymous
myhostname = procksi0.cs.nott.ac.uk
1090 4 Anonymous
mydomain = cs.nott.ac.uk
1091 4 Anonymous
myorigin = $mydomai
1092 5 Anonymous
inet_interfaces = all
1093 4 Anonymous
mydestination = $myhostname, localhost.$mydomain, localhost
1094 4 Anonymous
mynetworks_style = subnet
1095 4 Anonymous
virtual_alias_maps = hash:/etc/postfix/virtual
1096 1 Anonymous
relayhost = marian.cs.nott.ac.uk
1097 1 Anonymous
}}}
1098 4 Anonymous
1099 1 Anonymous
Create or modify ''/etc/postfix/virtual'':
1100 4 Anonymous
{{{
1101 4 Anonymous
root        root@localhost
1102 4 Anonymous
postmaster  postmaster@localhost
1103 1 Anonymous
adm         root@localhost
1104 1 Anonymous
}}}
1105 1 Anonymous
1106 4 Anonymous
Generate the corresponding database file (''virtual.db''):
1107 4 Anonymous
{{{
1108 4 Anonymous
postmap /etc/postfix/virtual
1109 1 Anonymous
}}}
1110 1 Anonymous
1111 4 Anonymous
Make sure that the postfix daemon is running, and that it will start at boot time:
1112 4 Anonymous
{{{
1113 4 Anonymous
/sbin/service postfix start
1114 4 Anonymous
/sbin/chkconfig --add postfix
1115 4 Anonymous
/sbin/chkconfig postfix on
1116 1 Anonymous
}}}
1117 1 Anonymous
1118 1 Anonymous
Make sure that the firewall is not open for port 25 or port 28!
1119 1 Anonymous
1120 1 Anonymous
Check that the STMTP server in ''/home/procksi/latest/conf/main.ini'' is set correctly set to ''procksi0.cs.nott.ac.uk''
1121 1 Anonymous
1122 1 Anonymous
1123 1 Anonymous
1124 1 Anonymous
== Garbage Cleanup Scheduling ==
1125 1 Anonymous
After a certain period of time, given in ''/home/procksi/latest/conf/main.ini'', sessions and requests expire and must be deleted. 
1126 1 Anonymous
1127 1 Anonymous
Edit ''procksi's'' crontab file taking effect for the ''latest'' and ''test'' version:
1128 4 Anonymous
{{{
1129 4 Anonymous
crontab -e
1130 4 Anonymous
  0-59/1 * * * * /home/procksi/latest/cron/check_sessions.sh
1131 4 Anonymous
  1-59/1 * * * * /home/procksi/latest/cron/check_tasks.sh
1132 1 Anonymous
  2-59/1 * * * * /home/procksi/latest/cron/check_requests.sh
1133 1 Anonymous
}}}
1134 4 Anonymous
1135 1 Anonymous
Analogously for ''/home/procksi/test''.
1136 1 Anonymous
1137 1 Anonymous
1138 1 Anonymous
1139 1 Anonymous
== Linking External Software ==
1140 4 Anonymous
Make sure that all links in ''/home/procksi/latest/bin'' point to the correct files of the operating system: 
1141 4 Anonymous
{{{
1142 4 Anonymous
sh, compress, bzip2, gzip, zip, ppmz, qsub
1143 1 Anonymous
}}}
1144 1 Anonymous
1145 1 Anonymous
1146 4 Anonymous
Make sure that all further executable links in ''/home/procksi/latest/bin'' point to the correct files on the file system: 
1147 1 Anonymous
{{{
1148 4 Anonymous
 exec_cluster, exec_!DaliLite, exec_MaxCMO, exec_molauto, exec_molscript
1149 1 Anonymous
}}}