    User name R. Michaels

    Log entry time 06:18:40 on June 10, 2005

    Entry number 145723

    keyword=adaql1 reboot (again!)

    adaql1 needed a reboot (at least, that is what revives it).
    This is the 4th time in the past week. The symptoms:

    1. adaql1 is alive at the console. I can open new windows and run
    some commands, such as "ls" and "top".
    2. Cannot ssh INTO adaql1, but can ssh OUT.
    3. Cannot "su" (I wanted to kill a zombie sshd process).
    After typing "su" and entering the password, the prompt hangs.
    4. cron jobs apparently stop, hence no update of halog.
    5. Typing "crontab -l" at the console behaves like #3:
    the prompt hangs and never returns.

    Possible hint:
    Just prior to the hang, /var/log/messages showed the
    following. I doubt it matters, but...

    Jun 10 05:47:01 adaql1 audbin[726]: saving binary audit log /var/log/audit.d/bin.2
    Jun 10 05:47:01 adaql1 audbin[726]: threshold 20.00 exceeded for filesystem /var/log/audit.d/. - free blocks down to 19.85%
    Jun 10 05:47:01 adaql1 auditd[10458]: Notify command /usr/sbin/audbin -S /var/log/audit.d/save.%u -C -T 20% exited with status 1
    Jun 10 05:47:01 adaql1 auditd[10458]: output error
    Jun 10 05:47:01 adaql1 auditd[10458]: output error
    Jun 10 05:47:01 adaql1 auditd[10458]: output error; suspending execution
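
    If the audit messages above do turn out to matter, a quick check
    along the following lines might help catch the condition before the
    next hang. This is only a rough sketch, not something running on
    adaql1. The function names find_auditd_suspends and free_percent are
    made up for illustration, and it is an assumption that audbin's
    "free blocks" figure corresponds to f_bfree / f_blocks from statvfs;
    it may be computed differently.

```python
import os
import re

# Matches syslog lines of the form seen in /var/log/messages, e.g.
#   Jun 10 05:47:01 adaql1 auditd[10458]: output error; suspending execution
SUSPEND_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+"
    r"auditd\[\d+\]:.*suspending execution"
)


def find_auditd_suspends(lines):
    """Return (timestamp, host) for each auditd 'suspending execution' line."""
    hits = []
    for line in lines:
        match = SUSPEND_RE.match(line.strip())
        if match:
            hits.append((match.group("stamp"), match.group("host")))
    return hits


def free_percent(path):
    """Percentage of free blocks on the filesystem holding `path`.

    Assumption: free blocks / total blocks, which may not be exactly
    what audbin reports.
    """
    st = os.statvfs(path)
    if st.f_blocks == 0:
        return 0.0
    return 100.0 * st.f_bfree / st.f_blocks


# Sample lines copied from the log excerpt in this entry.
sample = [
    "Jun 10 05:47:01 adaql1 auditd[10458]: output error",
    "Jun 10 05:47:01 adaql1 auditd[10458]: output error; suspending execution",
]
print(find_auditd_suspends(sample))
```

    In practice one would feed find_auditd_suspends the lines of
    /var/log/messages (reading it likely needs root) and compare
    free_percent("/var/log/audit.d") against the 20% threshold
    mentioned in the audbin message.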


    Also note, we've been running xscaler on adaql4.
    Recommend that users keep the number of processes on adaql1 to a minimum.