Main INDEX
Monthly INDEX
PREV
NEXT
User name R. Michaels
Log entry time 06:18:40 on June10,2005
Entry number 145723
keyword=adaql1 reboot (again!)
adaql1 needed a reboot (at least that's what revives it).
This is the 4th time in the past week. The symptoms:
1. adaql1 is alive at console. Can open new windows, type some
commands like "ls", "top".
2. Cannot ssh INTO adaql1, but you can ssh OUT.
3. Cannot "su" (I wanted to kill a zombie sshd process).
Type "su" and enter password, the prompt hangs.
4. cron jobs apparently stop. Hence no update of halog.
5. Typing "crontab -l" at the console is similar to #3 --
prompt hangs and never returns.
Possible hint:
Just prior to the hangup, the /var/log/messages said the
following. I doubt it matters but...
Jun 10 05:47:01 adaql1 audbin[726]: saving binary audit log /var/log/audit.d/bin
.2
Jun 10 05:47:01 adaql1 audbin[726]: threshold 20.00 exceeded for filesystem /var
/log/audit.d/. - free blocks down to 19.85%
Jun 10 05:47:01 adaql1 auditd[10458]: Notify command /usr/sbin/audbin -S /var/log/audit.d/save.%u -C -T 20% exited with status 1
Jun 10 05:47:01 adaql1 auditd[10458]: output error
Jun 10 05:47:01 adaql1 auditd[10458]: output error
Jun 10 05:47:01 adaql1 auditd[10458]: output error; suspending execution
Also note, we've been running xscaler on adaql4.
Recommend users keep #processes on adaql1 minimal.