Sudden reboot solaris 9 sun 220R
| Kareem Mahgoub 2004-02-08, 8:33 pm |
| Hi List,
I have a strange problem with my system, and I hope someone can guide
me through it or point me to resources that would at least help
identify it.
I administer a Sun 220R with 1024 MB RAM, dual UltraSPARC-II 400 MHz
processors and two IBM SCSI hard disks.
The machine runs Solaris 9 with the latest Recommended patch cluster
from Sun, plus Sendmail-8.12-11, Qpopper 4.04 and Apache 1.29.
About a month ago, the machine started to reboot suddenly, without a
kernel panic or any other trace of a problem.
I suspected that SpamAssassin, which I was running on the machine, was
the cause of the problem, so I disabled it, yet the problem
persists.
I know that there are many possible reasons for such behaviour, but I
just want tools or resources to detect the cause of the reboots, if
that is possible.
Thanks in advance for your help and sorry for such a long post.
Best Regards,
Kareem Mahgoub
| |
| Thomas Nau 2004-02-09, 7:34 am |
I suggest you set the syslog level for kernel messages to debug, just
to make sure you don't miss any messages.
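A minimal sketch of such an entry, assuming the stock Solaris syslogd and
/etc/syslog.conf. The log file name /var/adm/kern-debug.log is only an
example and does not come from this thread. The selector and the action
should be separated by tabs, and the file has to exist before syslogd
will write to it.

```
# /etc/syslog.conf (fragment); the two fields are separated by a TAB
kern.debug	/var/adm/kern-debug.log
```

After creating the file (touch /var/adm/kern-debug.log), syslogd needs to
re-read its configuration, for example with pkill -HUP syslogd.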
One thing I have in mind but cannot quite recall is that Solaris 9 is able
to reboot a system if too many memory faults are detected. Did you
experience any? One thing remains unclear, at least to me: does the
machine shut down and reboot, or does it simply 'start all over again'?
Thomas
-----------------------------------------------------------------
PGP fingerprint: B1 EE D2 39 2C 82 26 DA A5 4D E0 50 35 75 9E ED
| |
| Kareem Mahgoub 2004-02-12, 2:36 am |
| Dear Thomas,
Thanks for your reply.
I have already added an entry in syslog.conf for *.debug in order to
catch any message from the OS.
To make things clear, the machine just starts all over again (no
shutdown and then start); I mean there is no clean shutdown.
One last thing: I had doubts about the memory chips, so I replaced
them with other chips. 2 days and 10 hours after the replacement,
things seem to be stable, yet I need more time to judge.
Waiting for any comments from you.
Best Regards,
Kareem Mahgoub
| |
| Kareem Mahgoub 2004-02-14, 7:33 pm |
| Just an update to the thread,
The problem still exists after the memory replacement.
I am now replacing the CPU to see whether that makes any difference.
One thing to add is that the machine sometimes hangs (no response
from any service, no response to keystrokes).
Regards,
Kareem Mahgoub
| |
| Thomas Nau 2004-02-15, 12:33 am |
Hmm, sorry, no additional hints. The only things I can think of are the
power supply or the CPU, with the power supply the more likely. With any
other kind of failure I would expect some log entries or console messages.
Thomas
| |
| Kareem Mahgoub 2004-03-02, 2:34 am |
| For the sake of anyone searching Google for a similar problem: the
cause was a faulty CPU. Since the CPU was replaced, the system has been
up for 15 days.
One last question: is there any tool to check CPU functionality?
Best Regards,
Kareem Mahgoub