AES:6.2:S8800 Server - IBM 3550 M2 Reboots Frequently - PCI Card Shows Amber Light On Hardware


Doc ID    SOLN215920
Version:    10.0
Status:    Published
Published date:    04 Oct 2019
Created Date:    06 Dec 2012
Author:   
Mark Hardwick
 

Details

S8800  IBM server for AES (version 5 and 6).  This issue affects any application software running on this server type.

 

Problem Clarification

 S8800 server  - IBM 3550 M2 reboots frequently. and PCI card shows amber light on hardware

Cause

The S8800 - IBM 3550 M2 has an issue with the add-in 1GB daughter NIC card that can become unseated and cause frequrent server reboots.

You can check the number of reboots by logging into dom0 and using last -x

HQ-AES01-AES: last -x
                  runlevel (to lvl 3)   2.6.18-128.7.1.e Thu Oct 10 19:08 - 22:55  (03:47)
                  reboot   system boot  2.6.18-128.7.1.e Thu Oct 10 19:08          (03:47)
                  runlevel (to lvl 3)   2.6.18-128.7.1.e Thu Oct 10 17:43 - 19:08  (01:25)
                  reboot   system boot  2.6.18-128.7.1.e Thu Oct 10 17:43          (05:12)
                  runlevel (to lvl 3)   2.6.18-128.7.1.e Thu Oct 10 17:28 - 17:43  (00:15)
                  reboot   system boot  2.6.18-128.7.1.e Thu Oct 10 17:28          (05:27)

Also:

/var/log/messages:
                  Oct 10 17:32:24 HQ-AES01 kernel: Dazed and confused, but trying to continue
                  Oct 10 19:04:40 HQ-AES01 kernel: Dazed and confused, but trying to continue

Server Type:

[root@HQ-AES01 tmp]# dmidecode -t system
                  # dmidecode 2.7
                  SMBIOS 2.5 present.
                 
                  Handle 0x0028, DMI type 1, 27 bytes.
                  System Information
                          Manufacturer: IBM
                          Product Name: System x3550 M2 -[7946PBU]-
                          Version: 00
                          Serial Number: 06G2087
                          UUID: B32A9BCC-D919-11DE-85B3-E41F132C29C8
                          Wake-up Type: Other
                          SKU Number: XxXxXxX
                          Family: System x

Solution

Replace the server using  PCN 1716H. (attached)

Additional Relevant Phrases

S8800 constantly rebooting Rebooting every 10 days

Avaya -- Proprietary. Use pursuant to the terms of your signed agreement or Avaya policy