Difference between revisions of "HV HowTo for Experts"
Line 68: | Line 68: | ||
On the DVCS computer in the test lab, you have to start everything by hand. Go to ~/slowc and see the README file. | On the DVCS computer in the test lab, you have to start everything by hand. Go to ~/slowc and see the README file. | ||
− | == | + | == Restarting the Servers == |
On ahut* the servers are run from the ahut account. If you need to restart: | On ahut* the servers are run from the ahut account. If you need to restart: |
Revision as of 14:26, 14 February 2014
High Voltage in Hall A
First of all, please be aware of the simple instructions for users:
https://hallaweb.jlab.org/wiki/index.php?title=HowTo_for_Users
Contents
Overview of Architecture
The HV crates sit in the hall, e.g. on the detector stacks. A set of cards with (usually) 12 channels of negative or positive HV are inserted into the crates. A custom "serial board" (built by Javier Gomez and Jack Segal) talks to the cards. This "serial board" replaces an old, obsolete motherboard. (There are still a few crates with this motherboard, however -- e.g. the beamline crate.) A Perl "Shim" server (written by Brad Sawatzky) runs on a PC nearby the HV crate. The "Shim" server uses (via the Perl "Expect" module) a low-level C code written by Javier to talk to his serial card in the HV crate. On the User end, a Java GUI (written by Roman Pomatsalyuk) displays the HV information and provides User control. This Java GUI talks to the Shim server. Alternatively, the Java GUI can talk to the motherboard via a Portserver.
The portserver/motherboard alternative is being phased out, but still exists in two places at the moment: the beamline HV and one of the HV crates in the test lab DVCS setup, as I write this (Jan 2014).
All the elements in this chain must be working in order to have control and readback of HV.
There are three ways to control HV:
1. Using the Java GUI and the software chain mentioned above.
2. Using the "i2lchv_linux_XXX" code in shim/LecroyHV_shim/LecroyHV_FE. At the moment, XXX = bob on ahut1,2 / XXX = S2 on intelha3 / XXX = S0 on halladaq8
3. hvctrl code -- explained below.
Existing Crates
Here is a list of HV crates in Hall A and the Test Lab as of Jan 2014.
The crates that user portservers are talking to the old motherboard. The crates that use a PC with Shim have no motherboard. See Architecture above.
Location Portserver, or PC for Shim Config on ~/slowc How to start on adev Left HRS (1 crate) Intel PC: intelha3 LEFT cd ~adev/slowc ; ./hvs LEFT Right HRS (top crate) laptop: ahut1 RIGHT Right HRS (bottom crate) Intel PC: halladaq8 RIGHT cd ~adev/slowc ; ./hvs RIGHT (this starts both R-HRS crates) Beamline portserver hatsv4: 2003 BEAMLINE cd ~adev/slowc ; ./hvs BEAMLINE Test Lab 1 crate on portserver DVCS cd ~/slowc ; ./hvs DVCS 1 crate on a PC (see the README file)
Where the Servers are
The Intel PCs share the following which becomes their root partition /root/diskless/i386/Centos5.8/root/shim/LecroyHV_shim and the low-level C code is in ./LecroyHV_FE
Software that runs on the Intel PCs must, however, run from their local disk because output is not permitted on the root partition.
A cron script ensures that the server is running [root@intelha3 ~]# crontab -l
- Start the shim server for HV
2,10,20,30,40,50 * * * * /shim/scripts/prepHV
On the "ahut" laptops, it's a bit different. See /home/ahut/shim/LecroyHV_shim. An ahut* a cron script (prepHV) runs under ahut to ensure the HV server is running.
On the DVCS computer in the test lab, you have to start everything by hand. Go to ~/slowc and see the README file.
Restarting the Servers
On ahut* the servers are run from the ahut account. If you need to restart:
[ahut@ahut2 ~]$ ps awx | grep -i shim
and "kill -9" the PID. Then it will either restart on it's own within a few minutes (there's a cron job), or you can restart by hand:
/home/ahut/scripts/startHV
On the intelPCs we do not (yet) have an ordinary account -- only root. The simplest way to restart the server is to reboot the intelPC. However, if you know the root password, you can follow a similar procedure as above to kill and restart the HV. The HV is started on intelPC with a command /shim/scripts/prepHV which runs as a cron job under root.
hvctrl standalone local control
Note: The Java GUI is preferred ! Mainly because of SAFETY. You could get into trouble using hvctrl, but if you must ...
On the ahut computers, go to /home/ahut/hvctrl
On the IntelPC it's a little different. Use /hvctrl/hvctrl. The input/output files are on /root/hvctrl. These have to be different because /hvctrl is part of a disk area shared by all IntelPCs which, however, is not writeable, while /root/hvctrl is a writeable area which is local to each IntelPC.
If you run the code "hvctrl" it's usage is self-explanatory. The code talks to the HV crate to which that PC is connected via a serial cable.
Setting Up A New Installation
If you install on an Intel PC in Hall A, note that these share a root partition, so they all see the software. Suppose, however, that you want to install on a new PC like a laptop. That's what these instructions are for.
1. Need Redhat 5 or greater in order to have proper glibc.
2. Needed Perl 5.8.8 or later install. On intel PC this appears as /perl after being put (on adaql1) in the shared area.
3. Set write permission for /dev/ttyS0 (or is your PC using /dev/ttyS2 ?) Typically the permissions gets reset when the PC is rebooted, such that users cannot write there. A wrong write permission causes a silent failure of the software.
4. Need telnet server because the network connection is via telnet
Install telnetd server by typing "yum install telnet-server" and allow telnet as follows: /etc/xinetd.d/telnet needs "disable no" and you need to restart xinetd /sbin/service xinetd restart
Note, if telnet mysteriously stops working ... Kill cfengine, which is Computer Center's rather rude security script that deletes /etc/xinetd.d/telnet. Yes, telnet is an old, insecure protocol, but it's needed by this software. The Java code uses telnet to communicate with the Perl server.
On ahut1 or ahut2, I have a cron script /root/scripts/prepHV which takes care of running the server automatically. It also periodically restores the "telnet" file mentioned above, and periodically restarts xinetd
Here is a simple test of the telnet server: If you are on, for example, ahut2 and can "telnet ahut2" (i.e. telnet into yourself). Then yes, the server is running.
5. Install the software. Assuming you don't need the Java code because it runs on adaql1 in the counting room, there are two pieces: the Shim perl server and the low-level frontend C code, see "Architecture" above. You'll need to find the tar file for this, which
ahut1:/home/ahut//home/shim_laptop_8nov13.tar
I probably have this in /mss/home/rom, and you should always pick the latest date (which is written in the filename) because I tend to update / improve things.
What can go wrong ?
Common troublshooting items are listed in the User Guide (search for troubleshooting)
https://hallaweb.jlab.org/wiki/index.php?title=HV_HowTo_for_Users
Below is a list of other problems I've seen and their solution.
1. Power-cycling a HV crate has been known to help. Especially if more than one HV card does not respond. Then it's probably a crate problem not a card problem.
2. Cannot login to intelPC as root. Probably because you don't have /root, in which case the solution is to mount it by doing the following as superuser on adaql1: "cd /root ; ./mount_diskless". (Note, at the moment, /root/diskless disappears when you reboot adaql1 ! Hopefully this can get fixed.)
3. No connection to server. Try rebooting the PC.
4. If you try to run the Shim software by hand you might see "Can't connect to mainframe" and the Shim script dies ! This is a good one. Note that:
5. Restarting the servers.
On ahut* the servers are run from the ahut account. If you need to restart:
[ahut@ahut2 ~]$ ps awx | grep -i shim
and "kill -9" the PID. Then it will either restart on it's own within a few minutes (there's a cron job), or you can restart by hand:
/home/ahut/scripts/startHV
On the intelPCs we do not (yet) have an ordinary account -- only root. The simplest way to restart the server is to reboot the intelPC. However, if you know the root password, you can follow a similar procedure as above to kill and restart the HV. The HV is started on intelPC with a command /shim/scripts/prepHV which runs as a cron job under root.
On the IntelPCs, the /shim directory where the software exists is not writeable. You must run from /root/shim, or else the software cannot write its log file and will die very mysteriously ! But this should not affect ordinary users since the Shim software is running automatically.
Another possibility is that the mainframe really cannot be connected to. First thing to try is to run, by hand, ./shim/LecroyHV_shim/LecroyHV_FE/i2lchv_linux* depending on if you're connected to S0 or S2. If that runs ok, the HV crate is on and is talking. If not, then check your cabling and power status.