Robert Michaels, rom@jlab.org, Jefferson Lab Hall A, updated Sept 19, 2002
This file: hallaweb.jlab.org/equipment/daq/guide2.html
I. NORMAL DAQ OPERATIONS
This assumes that runcontrol is running. If not, see Cold Start in section III.
II. FREQUENTLY ASKED QUESTIONS
It takes a couple hours for files to appear in MSS, and the files remain on adaq disk for typically 2 or 3 days.
III. REBOOTING STUFF
HOW TO SHUTDOWN or REBOOT WORKSTATIONS
Rarely, the workstations don't function properly and the simplest way out is to reboot. To reboot adaqs2,s3 (SunOS): Login as "adaq" and type "reboot". In a few minutes the workstation comes back alive. To shutdown on SunOS, login as "adaq" and type "shutdown". After several minutes screen goes black, wait a minute more, then power off. For Linux, Ole usually keeps some instructions posted near the PC terminal. One may hit Ctrl-Alt-F1 to go to console mode, then Ctrl-Alt-Del.
QUICK RESETS
Problems with CODA 2.2 can usually be solved with a simple reset or with a Cold Start. If not call Bob Michaels or Bodo Reitz. Do NOT waste an hour stuck on resets.
If a ROC seems to be hung up, you can reboot it by going to the workspace "Components" and typing "reboot" at the vxWorks prompt (-> reboot). Wait 2 minutes and telnet back in to verify its alive. The name of the ROC computer is normally written in the name of the xterm (hallasfi1, hallasfi2, etc). You need to know what subset of these computers are used for your configuration. For example, for e00007 we use hallasfi1 (R-arm fastbus ROC1), hallasfi2 (R-arm fastbus ROC2), hallasfi3 (L-arm fastbus ROC3), hallasfi4 (L-arm fastbus ROC4), hallavme2 (R-arm Scaler Crate TS0), hallavme4 (L-arm Scaler Crate TS1), and perhaps hallavme1 (BPM/raster crate ROC14). If the ROC seems really frozen, use the "Crate Resets" button in the magnet EPICS screen on hac. Some labelling in this GUI: roc14=bpm/raster, roc1=R-arm fastbus1 (this also boots roc2), roc3=L-arm fastbus1, roc4=L-arm fastbus2, TS0=R-arm Trig Super, TS1=L-arm Trig Super. NOTE: both fastbbus crates in on an arm are reset with one button.. To reset from the GUI, toggle the state of the button.
If you reboot the ROC, or if something on the workstation is hung up, try pushing the "Reset" button in runcontrol. Then Configure, Download, and StartRun as usual.
If a ``quick'' reset doesn't work, try a Cold Start (see below).
COLD START
First kill all CODA process on the workstation where CODA is running by typing from anywhere on the relevant account and relevant computer "kcoda". This script stops runcontrol, the event builder, event recorder, the runcontrol server, and cleans up the ET system. Now you can start everything again by typing "runcontrol" NOTE: For Linux based DAQ as we are using for HRS, one should start "rcServer" interactively before starting "runcontrol". This avoids some problems like "ER not connected". Just type "rcServer" in a window and leave it. Since upgrading to Redhat 7.3 it has also become necessary to run ``coda_er'' interactively (otherwise it sometimes fails to open a data file, and we don't know why). The instructions are printed by the ``kcoda'' script when if finishes.
Before downloading, it is first a good idea to make sure the fastbus and VME crates (i.e. the ROCs) are running. One finds that when resetting, one can frequently avoid rebooting the frontend crates, and just restart runcontrol, but if you must: Reset the ROCs according to the reset procedure described in ``quick reset'' section above. An alternative and convenient way to reset (reboot) the ROCs is to go to the Components workspace and enter "reboot" at the vxWorks prompt -> of each ROC. After a few minutes, telnet back in and verify they are up. Some of the crates take longer than others to reboot. Be patient. However if the ROCs are really frozen, press the reset buttons (as explained above) instead.
To start runcontrol GUI on the workstation, type "runcontrol". Then press the Connect button. After "connect", wait 10 seconds, then press "Run Type"; a dialog box pops up and you must choose the configuration you want, which is probably "COINC" for buffered mode or "UNBUFF" for unbuffered mode. Then press download and wait about 30 seconds. Now you can "Start Run" to start a run.
Reminders: For Linux based DAQ, one must start "rcServer" by hand before starting "runcontrol". Also when recovering from a DAQ crash, you must press the "reset" button after configuring and before "download" in the runcontrol GUI. Lately we've needed to run coda_er interactively.
If you ever logout of the DAQ computer adaql2 or have rebooted it, here is how to restart the preferred setup. Login as "a-onl". Start emacs in background: emacs ~a-onl/prescale/prescale.dat & Also start datamon by typing "datamon". Next, in the "components" workspace, login to all the frontend computers by typing "telnet hallasfi1" where the name of the computer like hallasfi1 is in the name of the xterm window. Also in the name of these xterms is the portserver port where you can connect via RS232 (portserver instructions are at hallaweb.jlab.org/equipment/daq/portserver.html. So, an example name may be an xterm with title "ROC2--hallasfi2--hatsv3-port-8" meaning ROC2 is IP address hallasfi2 and on portserver hatsv3 at port 8. Note: If these xterm windows for components are not there, type "setupxterms" to bring them up.
V. THINGS THAT GO WRONG WITH DAQ or COMPUTERS
The Event Recorder (ER) or Event Builder (EB) may complain something like "ER1 not responding" on the Linux version of CODA, and you get no events. This may be because you forgot: Before running "runcontrol" one must run "rcServer" interactively from the login shell of the relevant account on the relevant computer ("a-onl" on "adaql2").
This page maintained by Robert Michaels rom@jlab.org