Running CODA on Spectrometers and Parity DAQ Systems



Robert Michaels, rom@jlab.org, Jefferson Lab Hall A, updated Jan 2003

This file: hallaweb.jlab.org/equipment/daq/guide2.html


I.  Running Spectrometer and Parity DAQ

This assumes that runcontrol is running. If not, see Cold Start in section III.

II.  Frequently Asked Questions


III.  REBOOTING STUFF

HOW TO SHUT DOWN or REBOOT WORKSTATIONS

Rarely, the workstations don't function properly and the simplest way out is to reboot. To reboot adaqs2 or adaqs3 (SunOS): log in as "adaq" and type "reboot". In a few minutes the workstation comes back alive. This solves most problems. Shutdown is necessary in case of power failure. To shut down, log in as "adaq" and type "shutdown". After several minutes the screen goes black; wait a minute more, then power off. When you power on again, the computer reboots. For Linux, Ole usually keeps some instructions posted near the PC terminal. One may hit Ctrl-Alt-F1 to go to console mode, then Ctrl-Alt-Del.

QUICK RESETS

Problems with CODA can usually be solved with a simple reset or with a Cold Start. If not, call Bob Michaels. You can also call Bodo Reitz for Spectrometer DAQ and Bryan Moffit for Parity DAQ. Do NOT waste an hour stuck on resets. I claim I can bring CODA up in less than 10 minutes 90% of the time.

If a ROC seems to be hung up, you can reboot it by going to the workspace "Components" and typing "reboot" at the vxWorks prompt (-> reboot). Wait 2 minutes and telnet back in to verify it's alive. The name of the ROC computer is normally written in the name of the xterm window. You need to know what subset of these computers is used for your configuration. For the GDH experiment, the following are used:

Left Arm (LeftHRS):
   1) ROC3-hallasfi3-(hatsv4, port 2)
   2) ROC4-hallasfi4-(hatsv4, port 3)
   3) ROC14-hallavme1-(hatsv5, port 14)
   4) TS1-hallavme4-(hatsv4, port 14)

Right Arm (RightHRS):
   1) ROC1-hallasfi1-(hatsv3, port 2)
   2) ROC2-hallasfi2-(hatsv3, port 8)
   3) TS0-hallavme2-(hatsv3, port 3)

Parity DAQ (one crate):
   1) ROC23-halladaq6-(hatsv9, 2004)

The notation means, for example, that ROC3 is the CODA component, hallasfi3 is the network name (hallasfi3.jlab.org), and the RS232 connection is on portserver hatsv4.jlab.org at port 2.

If a ROC seems really frozen, use the "Crate Resets" button in the General Tools EPICS screen on hacsbc. Here the notation is: roc14 = bpm/raster; roc1 and roc2 are tied together to SFI#1 (R-arm); roc3 and roc4 are tied to SFI#1 (L-arm); TS0 = R-arm Trig Supervisor; TS1 = L-arm Trig Supervisor. This GUI does not have the Parity crate, but that crate is located in the middle room of the counting house (see the big sign) and can be power cycled if necessary. To reset from the GUI, toggle the state of the button.

If you reboot the ROC, or if something on the workstation is hung up, try pushing the "Reset" button in runcontrol. Then Configure, Download, and StartRun as usual.
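
The crate notation above can be kept as a small lookup table, so that the subset of crates for one DAQ is easy to list. The following is only a sketch: the table is transcribed from the GDH list in this guide, while the names Crate, CRATES, crates_for and fqdn are hypothetical helpers, not part of CODA. Appending ".jlab.org" to every network name is an assumption extrapolated from the hallasfi3 example.

```python
from typing import NamedTuple


class Crate(NamedTuple):
    daq: str         # which DAQ uses the crate (informal label)
    host: str        # network name of the crate
    portserver: str  # RS232 portserver the crate console hangs on
    port: int        # port on that portserver


# Transcribed from the guide (GDH-era configuration); may be out of date.
CRATES = {
    "ROC3":  Crate("LeftHRS",  "hallasfi3", "hatsv4", 2),
    "ROC4":  Crate("LeftHRS",  "hallasfi4", "hatsv4", 3),
    "ROC14": Crate("LeftHRS",  "hallavme1", "hatsv5", 14),
    "TS1":   Crate("LeftHRS",  "hallavme4", "hatsv4", 14),
    "ROC1":  Crate("RightHRS", "hallasfi1", "hatsv3", 2),
    "ROC2":  Crate("RightHRS", "hallasfi2", "hatsv3", 8),
    "TS0":   Crate("RightHRS", "hallavme2", "hatsv3", 3),
    "ROC23": Crate("Parity",   "halladaq6", "hatsv9", 2004),
}


def crates_for(daq):
    """Return the CODA component names used by one DAQ, sorted by name."""
    return sorted(name for name, crate in CRATES.items() if crate.daq == daq)


def fqdn(component):
    """Full host name of a component (assumes the .jlab.org suffix applies)."""
    return CRATES[component].host + ".jlab.org"
```

For example, crates_for("RightHRS") lists the three crates to check before touching the R-arm, and nothing from the other DAQs.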

I emphasize that the procedure of just booting the one crashed ROC and Resetting runcontrol is the quickest solution to most crashes, and it is not necessary to reboot or reset a different DAQ, as they are decoupled. If this reset procedure doesn't work, try a Cold Start (see below).
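
The "wait and telnet back in" check after rebooting a single ROC could be sketched as a polling loop. This is an illustration only, not an existing tool: port_open and wait_until_up are hypothetical names, and it assumes the crate accepts telnet connections on the standard TCP port 23. The defaults (12 tries, 10 seconds apart) are chosen to span roughly the 2 minutes mentioned above.

```python
import socket
import time


def port_open(host, port=23, timeout=3.0):
    """True if a TCP connection to host:port succeeds (assumed: telnet on 23)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def wait_until_up(host, attempts=12, delay=10.0, probe=port_open, sleep=time.sleep):
    """Probe the host up to `attempts` times, sleeping `delay` seconds between tries.

    Returns True as soon as one probe succeeds, False if all of them fail.
    `probe` and `sleep` are injectable so the loop can be tried without a crate.
    """
    for i in range(attempts):
        if probe(host):
            return True
        if i < attempts - 1:
            sleep(delay)
    return False
```

A call such as wait_until_up("hallasfi3") only says that the crate answers on the network; it is still worth logging in to see the vxWorks prompt.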

COLD START

First kill all CODA processes on the workstation where CODA is running by typing "kcoda" from the relevant account and computer. This stops runcontrol, the event builder, the event recorder, and the runcontrol server, and cleans up the ET system. Now you can start everything again: look at the printout from the "kcoda" script, which tells you what else to do. Typically this involves starting "rcServer" and "coda_er" by hand, plus resetting runcontrol after connect and before download. Note: rcServer, coda_er, and coda_eb must be left in the foreground of a window.

Before downloading, it is a good idea to make sure the Fastbus and VME crates (i.e. the ROCs) are running. When resetting, one can frequently avoid rebooting the frontend crates and just restart runcontrol. But if you must reboot them, reset the relevant subset of crates as explained in the Quick Resets section above. A convenient way to reset (reboot) the ROCs is to go to the Components workspace and enter "reboot" at the vxWorks prompt -> of each ROC; one can also use the EPICS GUI as explained above. After a few minutes, telnet back in and verify they are up. Some of the crates take longer than others to reboot. Be patient.

To start everything on the workstation, do what was printed out by "kcoda". As it explains, start "rcServer" first. Then start "coda_er" and "coda_eb" interactively with the arguments that "kcoda" printed. CODA can also start these automatically, but running them interactively provides useful diagnostics (e.g. if coda_eb reports "event type mismatch" it is a sign of severe loss of synch). Finally, type "runcontrol".

In runcontrol, press the Connect button. After "connect", wait 10 seconds, then press "Run Type"; a dialog box pops up and you must choose the configuration you want, which is normally the first choice. Then press Download and wait about 30 seconds. If the download hangs up, press "Cancel", "Reset", and "Download" in runcontrol. After a successful download, press "Start Run" to start a run.
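
The order of operations in a Cold Start can be written down as a checklist. The sketch below only records the sequence described in this section; it does not run anything. The names COLD_START, checklist and step_after are hypothetical, the "shell" and "runcontrol" labels are informal, and the optional ROC reboot is left out because it is often unnecessary.

```python
# (where, action, note) in the order given in the text.
COLD_START = [
    ("shell", "kcoda", "kills CODA processes; read its printout"),
    ("shell", "rcServer", "start first; leave in the foreground of a window"),
    ("shell", "coda_er", "arguments as printed by kcoda; leave in foreground"),
    ("shell", "coda_eb", "arguments as printed by kcoda; leave in foreground"),
    ("shell", "runcontrol", "brings up the runcontrol GUI"),
    ("runcontrol", "Connect", "then wait about 10 seconds"),
    ("runcontrol", "Run Type", "choose the configuration, normally the first choice"),
    ("runcontrol", "Reset", "after connect and before download"),
    ("runcontrol", "Download", "wait about 30 seconds; if hung: Cancel, Reset, Download"),
    ("runcontrol", "Start Run", "only after a successful download"),
]


def checklist(steps=COLD_START):
    """Return the steps as numbered, printable lines."""
    return [
        f"{i}. [{where}] {action}: {note}"
        for i, (where, action, note) in enumerate(steps, start=1)
    ]


def step_after(action, steps=COLD_START):
    """Return the action that follows `action`, or None if it is the last one."""
    names = [name for _, name, _ in steps]
    i = names.index(action)
    return names[i + 1] if i + 1 < len(names) else None
```

Printing "\n".join(checklist()) gives a list that can be taped next to the terminal; step_after("Connect") returns "Run Type".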

After bringing up runcontrol from a crash it's a good idea to "Reset" after configuring the "Run Type". If it's ever necessary to reprogram the NVRAMs on the ROCs, see adaqs2:/home/adev/doc/vmeram.doc. Sometimes connection via the portserver (RS232 interface) is useful; see adaqs2:/home/adaq/doc/portserver.doc.

If you ever log out of the DAQ computer (e.g. adaqs2), or have rebooted it, here is how to restart the preferred setup. We'll run our two spectrometer DAQs and the parity DAQ from the adaqs2 terminal. Log in there as adev. Passwords are written on a piece of paper on the wall. Run runcontrol on the workspaces called "LeftHRS", "RightHRS", and "Parity" for the 3 DAQs. Log in to the respective accounts: adev on adaqs2 for L-arm (you're already there), a-onl on adaql2 for R-arm, and apar on adaql1 for Parity.

In the HRS workspaces, start emacs in the background: emacs ~/prescale/prescale.dat &
Also start datamon by typing "datamon". In the "Components" workspace, log in to all the frontend computers by typing, for example, "telnet hallasfi1", where the name of the computer (like hallasfi1) is in the name of the xterm window. If these xterm windows for components aren't there, type "setupxterms".

MSQL server: When you reboot adaqs2, you must start msql by hand by executing the following command from the adev account:

   /etc/rc3.d/S99coda

I don't know why the system doesn't start it for you.


IV.  THINGS THAT GO WRONG WITH DAQ or COMPUTERS

This page maintained by Robert Michaels, rom@jlab.org