Robert Michaels, rom@jlab.org, Jefferson Lab Hall A, updated Jan 2003
This file: hallaweb.jlab.org/equipment/daq/guide2.html
I. Running Spectrometer and Parity DAQ
This assumes that runcontrol is running. If not, see Cold Start in section III.
II. Frequently Asked Questions
It takes a couple hours for files to appear in MSS, and the files remain on adaq disk for typically 2 or 3 days.
III. REBOOTING STUFF
HOW TO SHUTDOWN or REBOOT WORKSTATIONS
Rarely, the workstations don't function properly and the simplest way out is to reboot. To reboot adaqs2,s3 (SunOS): Login as "adaq" and type "reboot". In a few minutes the workstation comes back alive. This solves most problems. Shutdown is necessary in case of power failure. To shutdown, login as "adaq" and type "shutdown". After several minutes screen goes black, wait a minute more, then power off. When you power on again, computer reboots. For Linux, Ole usually keeps some instructions posted near the PC terminal. One may hit Ctrl-Alt-F1 to go to console mode, then Ctrl-Alt-Del.
QUICK RESETS
Problems with CODA can usually be solved with a simple reset or with a Cold Start. If not call Bob Michaels. Also can call Bodo Reitz for Spectrometer DAQ and Bryan Moffit for Parity DAQ. Do NOT waste an hour stuck on resets. I claim I can bring CODA up in less than 10 minutes 90% of the time.
If a ROC seems to be hung up, you can reboot it by going to the workspace "Components" and typing "reboot" at the vxWorks prompt (-> reboot). Wait 2 minutes and telnet back in to verify its alive. The name of the ROC computer is normally written in the name of the xterm window. You need to know what subset of these computers are used for your configuration. For Spin Duality, the following are used: LeftHRS uses 1) ROC3-hallasfi3-(hatsv4, port2); 2) ROC14-hallavme1-(hatsv5, port14), 3) TS1-hallavme4-(hatsv4, port 14). Notation means for example ROC3 is the CODA component, hallasfi3 is the network name (hallasfi3.jlab.org), and the RS232 connection is on portserver hatsv4.jlab.org at port 2. Continuing, the RightHRS uses: 1) ROC1-hallasfi1-(hatsv3, port 2) 2) ROC2-hallasfi2-(hatsv3, port 8); TS0-hallavme2-(hatsv3, port 3). The Parity DAQ uses one crate: ROC23-halladaq6-(hatsv9, 2004). If a ROC seems really frozen, use the "Crate Resets" button in the General Tools EPICS screen on hacsbc. Here the notation is roc14=bpm/raster, roc1=R-arm fastbus1, roc2 on R-arm is also booted when you boot roc1. roc3=L-arm fastbus1, TS0=R-arm Trig Super, TS1=L-arm Trig Super. This GUI doesn't have the Parity crate, but that crate is located in the middle room of the counting house (see the big sign) and can be power cycled if necessary. To reset from the GUI, toggle the state of the button. If you reboot the ROC, or if something on the workstation is hung up, try pushing the "Reset" button in runcontrol. Then Configure, Download, and StartRun as usual.
I emphasize that the procedure of just booting the one crashed ROC and Resetting runcontrol is the quickest solution to most crashes, and it is not necessary to reboot or reset a different DAQ, as they are decoupled. If this reset procedure doesn't work, try a Cold Start (see below).
COLD START
First kill all CODA process on the workstation where CODA is running by typing from anywhere on the relevant account "kcoda". This stops runcontrol, the event builder, event recorder, the runcontrol server, and cleans up the ET system. Now you can start everything again by typing "runcontrol". The "kcoda" script will tell you what else to do, which typically involves starting "rcServer" and "coda_er" by hand, plus resetting runcontrol after connect and before download. Both rcServer and coda_er must be left in the foreground of a window.
Before downloading, it is first a good idea to make sure the fastbus and VME crates (i.e. the ROCs) are running. One finds that when resetting, one can frequently avoid rebooting the frontend crates, and just restart runcontrol, but if you must: Press the reset buttons for the relevant subset of crates as explained in the Reset section above. A convenient way to reset (reboot) the ROCs is to go to the Components workspace and enter "reboot" at the vxWorks prompt -> of each ROC. But one can also use the EPICS GUI as explained above. After a few minutes, telnet back in and verify they are up. Some of the crates take longer than others to reboot. Be pateient. To start everything on the workstation, type "runcontrol". Then press the Connect button. After "connect", wait 10 seconds, then press "Run Type"; a dialog box pops up and you must choose the configuration you want, which is presumably the first choice. Then press download and wait about 30 seconds. Now you can "Start Run" to start a run.
After bringing up runcontrol from a crash its a good idea to "Reset" after configuring the "Run Type". If its ever necessary to reprogram the NVRAMS on the ROCs, see adaqs2:/home/adev/doc/vmeram.doc. Sometimes connection via portserver (RS232 interface) is useful, see adaqs2:/home/adaq/doc/portserver.doc
If you ever logout of the DAQ computer (e.g. adaqs2), or have rebooted it, here is how to restart the preferred setup. We'll run our two spectrometer DAQs and parity DAQ from the adaqs2 terminal. Login there as adev. Passwords are written on a piece of paper on the wall. Run runcontrol on the workspaces called "LeftHRS", "RightHRS", and "Parity" for the 3 DAQs. Login to the respective accounts (adev account on adaqs2 is L-arm; you're already there. a-onl on adaql2 for R-arm, and apar on adaql1 for Parity). In the HRS workspaces, start emacs in background: emacs ~/prescale/ prescale.dat & Also start datamon by typing "datamon". In the "components" workspace, login to all the frontend computers by typing "telnet hallasfi1" where the name of the computer like hallasfi1 is in the name of the xterm window. If these xterm windows for components aren't there, type "setupxterms".
V. THINGS THAT GO WRONG WITH DAQ or COMPUTERS
The Event Recorder (ER) or Event Builder (EB) may complain something like "ER1 not responding" and you get no events. Solution: before running "runcontrol" one must run "rcServer" interactively from the login shell of the relevant account on the relevant computer.
CODA may get stuck downloading. This has been happening lately with Left HRS DAQ. Solution: Press "Cancel" in top left corner of runcontrol, then "Reset", and then "Download" again.
This page maintained by Robert Michaels rom@jlab.org