Robert Michaels, rom@jlab.org, Jefferson Lab Hall A, updated Jan 4, 2008
General info about JLab computers (e.g. what is a work or cache disk ?) can be learned from the computer center. This page pertains to the management of Hall A work and cache disks.
We have several Terabytes of scratch work disk on JLab's Common Unix Environment (CUE). Several partitions are dedicated to "contemporary experiments", while other partitions are for everyone (including contemporary experiments). Contemporary experiments include the one running, the one or two upcoming, and recent experiments. See table below for a list of work disks.
To get to your work area, contemporary or not, follow the link: For the example of e98108, you "cd /work/halla/e98108" on JLab computers. If you are in the Unix group a-e98108 you can write there. You must be in the Unix group corresponding to your experiment. Ask me (rom@jlab.org) to be added to a group or for other questions. Some experiments have multiple disk areas, like /work/halla/e97103/disk1 and /work/halla/e97103/disk2, usually one of these is the "dedicated" disk, the other points to a region on a big "shared" disk.
A "cleanup" script will maintain free space on the work disks. In simple terms, the largest and oldest files are deleted nightly, if necessary, to keep some space free. For the dedicated disks, the spokespersons can elect to have me turn off the cleanup script; but it will run on the shared disks. The cleanup script is described in the appendix below.
IMPORTANT: Work disks are scratch space and have no backups. Plan for the possibility that files on /work might be deleted or the disk might die. Keep important files in /home or in MSS.
Thanks to Ole Hansen, we also have several TB of work disk on the adaq Linux cluster (adaql machines, l = linux). Disks are /adaqlN/workM where N = 1,2... and M = 1,2,3... The running experiment may keep stuff there, e.g. huge root files, etc. But these disks are not backed up. Also, a cleanup script may operate there (see appendix). At the end of the experiment, the adaq work disks are cleaned up (files erased) to make room for the next experiment.
In addition to /work, there are two kinds of /cache disk. One kind is ``hidden'' from users and is used to feed the batch farm from data in MSS. Another kind is open to users and is the temporary repository for MSS data. The latter disks are /cache/halla/EXPERIMENT where EXPERIMENT corresponds to different experiments. See Computer Center Scicomp pages.
Here is my response to the common request "Please give me more disk immediately." A short technical justification via e-mail from the spokesperson is needed. Procurement cycles take two months but we might have some disk in our "back pocket". It is always best to plan ahead. The "disk is cheap" argument does not really work because: 1) if we bought very cheap disks the failure rate would be too high for the huge amount of disk we have; and 2) Significant costs include the server that houses the disk, space, power, and especially sysadmin time.
It is also possible to stage your huge output to MSS. Hall B does it that way, as it is impossible for them to store all output on /work.
Below is the table of CUE work disks assigned to hall A experiments.
Always follow the link (shown in middle column) because this could change to point to another mount point. To find out what Unix group you need to belong to, you can do the following, e.g. for e93049, "ls -la /work/halla | grep e93049", then realizing from this that it is a link to /w/work1601, do this: "ls -la /w/work1601 | grep e93049" and now you can see that you must be in Unix group a-e93049 to write to this disk. Ask me to be assigned to the group.
I remind you that a "dedicated" disk means the physical partition belongs to only that experiment, while a "shared" disk means the link points to an area on a partition which is shared by several other experiments.
| EXPERIMENT | DISK (link) | SHARED / DEDICATED |
| e06007 | /work/halla/e06007/disk1 | dedicated to e06007 |
| e03101 | /work/halla/e03101/disk1 | a shared disk |
| Ledex (e05004, e05103) | /work/halla/ledex/disk1 | dedicated to ledex |
| Ledex (e05004, e05103) | /work/halla/ledex/disk2-6 | shared with other expts |
| e03104 | /work/halla/e03104/disk1 | dedicated to e03104 |
| e03104 | /work/halla/e03104/disk2, disk3 | shared with others |
| e04018 | /work/halla/e04018/disk1 | dedicated to e04018 |
| e04018 | /work/halla/e04018/disk2, disk3, disk4, disk5 | shared with others |
| Coul. Sum Rule | /work/halla/csr/disk1 | shared disk |
| DVCS | /work/halla/dvcs/disk2 | dedicated to e03106 |
| DVCS | /work/halla/dvcs/diskN (N=3,4,6,7) | shared |
| DVCS | /work/halla/dvcs/disk5 | dedicated |
| e00007 | /work/halla/e00007/disk1 | dedicated |
| e00007 | /work/halla/e00007/diskN (N=2,3) | shared |
| e00102 | /work/halla/e00102/disk1 | dedicated |
| e00102 | /work/halla/e00102/diskN (N=2,3,4,5) | shared |
| e01012 | /work/halla/e01012/diskN (N=1,2) | shared |
| e01012 | /work/halla/e01012/disk3 | dedicated |
| e01001 | /work/halla/e01001 | dedicated |
| e01020 | /work/halla/e01020 | dedicated |
| e01020 | /work/halla/e01020-2 (dash-2) | shared |
| e89003 | /work/halla/e89003 | shared |
| e89033 | /work/halla/e89033 | shared |
| e89044 | /work/halla/e89044/diskN (N=1,2) | shared |
| e91004 | /work/halla/e91004 | shared |
| e91026 | /work/halla/e91026 | shared |
| e93027 | /work/halla/e93027 | shared |
| e93049 | /work/halla/e93049 | shared |
| e93050 | /work/halla/e93050 | shared |
| e93108 | /work/halla/e93108 | shared |
| e94010 | /work/halla/e94010/diskN (N=1-3) | shared |
| e94104 | /work/halla/e94104/diskN (N=1-3) | shared |
| e94107 | /work/halla/e94107 | dedicated |
| e94107 | /work/halla/e94107-2 e94107-3 | shared |
| e95001 | /work/halla/e95001 | shared |
| e97103 | /work/halla/e97103/disk1 | dedicated |
| e97103 | /work/halla/e97103/disk2 | shared |
| e97108 | /work/halla/e97108 | shared |
| e97111 | /work/halla/e97111 | shared |
| e98108 | /work/halla/e98108 | dedicated |
| e99007 | /work/halla/e99007 | dedicated |
| e99114 | /work/halla/e99114/disk1 | dedicated |
| e99114 | /work/halla/e99114/disk2, disk3 | shared |
| e99117 | /work/halla/e99117 | shared |
| gammap99 | /work/halla/gammap99 | shared |
| gdh | /work/halla/gdh | shared |
| gdh | /work/halla/gdh-2 | dedicated |
| gdh | /work/halla/gdh-3 gdh-4 gdh-5 | shared |
| ndelta | /work/halla/ndelta/diskN (N=1-3) | shared |
| ndelta | /work/halla/ndelta-2 | shared |
| HAPPEX (Hyd-I, II, He4) | /work/halla/parity/disk1 | dedicated |
| HAPPEX (Hyd-I, II, He4) | /work/halla/parity/diskN (N=2-4) | shared |
| HAPPEX (Hyd-I, II, He4) | /work/halla/parity/disk5 | dedicated |
| HAPPEX (Hyd-I, II, He4) | /work/halla/parity/diskN (N=6,7) | shared |
| e01015 (SRC) | /work/halla/e01015/disk1 | dedicated |
| e01015 (SRC) | /work/halla/e01015/diskN (N=2,3) | shared |
| e05110 | /work/halla/e05110/disk1 | dedicated |
| e05110 | /work/halla/e05110/disk2 | dedicated |
| e05015 | /work/halla/e05015/disk1 | dedicated |
| e06014 | /work/halla/e06014/disk1 | dedicated |
| bigbite | /work/halla/bigbite/disk1 | shared |
| e04007 | /work/halla/e04007/disk1 | shared |
| e04007 | /work/halla/e04007/disk2 | dedicated |
| transversity | /work/halla/transversity/disk1 | shared |
| transversity | /work/halla/transversity/disk2, disk3 | shared |
| e08007 | /work/halla/e08007/disk1, disk2 | shared |
| e02013 | /work/halla/e02013/disk1 | dedicated |
| e02013 | /work/halla/e02013/diskN (N=2,3,4,5,6,7) | shared |
APPENDIX -- Cleanup Script
The WORK Disk Cleanup Script is similar for CUE disks and ADAQ disks. Every night the disks are checked. If less than 95% full, nothing happens. If greater than 95% full, files will be deleted as follows: Initially, files greater than NB bytes are considered, then NB/(10**i), i=1,2,3...(loop) where NB is big. I.e., first we consider 10 Gbyte files, then 1 Gbyte, then 100 Mbyte, etc. At each level of filesize, files are sorted by last usage; oldest files considered first. A file is deleted only if it has not been used within NDAYS (= 10). After each deletion the disk usage is checked; when it falls below 90% the deletion stops. For dedicated CUE disks, the script can be disabled at request of spokesperson, but it cannot be disabled for any shared disks.