Incomplete Insertion of a Hot Plug of Power Supplies in Sun Fire F6800/E6900 Servers may Result in a System Outage

Category: Availability
Release Phase: Resolved
Product: Sun Fire 6800 Server, Sun Fire E6900 Server
Bug Id: 6299217
Date of Workaround Release: 14-NOV-2005
Date of Resolved Release: 29-AUG-2006
Impact
In very rare cases, performing a hot plug of an A184 Power Supply (P/N 300-1595) into a Sun Fire F6800 or E6900 during a normal service operation may result in the loss of all domains on the system.
In these cases, the power supply insertion was not done in a continuous motion and resulted in a momentary drop in the 56V power line inside the server. Depending on the size of the drop, this can result in various boards powering off and interrupting domain operations.
A power cycle of the entire platform may be required to restore normal operations.
Note: The correct process to replace an A184 Power Supply in Sun Fire F6800/E6900 servers is to insert the replacement PSU with a single smooth action. As the Power Supply comes to the end of its travel, the physical resistance increases. If the installer pauses at this point, it is possible that not all of the PSU's pins will have connected to the power centerplane (inside the system). If the PSU is left in this state, a momentary drop in the 56V power line inside the server may occur.
Contributing Factors
This issue can occur on the following platforms:
- Sun Fire F6800/E6900 Server
The described issue only occurs while inserting an A184 Power Supply (P/N 300-1595), and only when the insertion is difficult because the power supply fits tightly.
Note: Even if the power supply fit is tight, this problem does not always occur.
Symptoms
If the described issue occurs, shortly after inserting an A184 Power Supply, errors similar to the following will be seen on the System Controller, and domains on the system will shut down:
str5-sp0:SC[engineering]> Oct 11 16:35:40 str5-sp0 Platform.SC:
Please check to make sure PS5 is plugged in and the power switch
is in the on position.
Oct 11 16:35:46 str5-sp0 Platform.SC: Repeater Board Slot 10
Device poll caused: sun.serengeti.HpuFailedException:
VoltageA2D.getOutputVoltage: sun.serengeti.I2cException:
I2cComm.busyWait:
busyWait() timeout waiting for WRDY, status=0x404b8008,
bus=17(RP0) ring=04 addr=4b
Oct 11 16:35:46 str5-sp0 Platform.SC: Device will not be polled
...
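As a rough aid for spotting this signature in a saved System Controller console log, the following is a minimal Python sketch. The function name, the regular expressions, and the 60-second window are illustrative assumptions derived only from the sample messages above. They are not part of any Sun tooling, and real logs may be worded differently.

```python
import re
from datetime import datetime, timedelta

# Patterns derived from the sample messages in this document (assumptions).
TIMESTAMP = re.compile(r"[A-Z][a-z]{2} +\d{1,2} \d{2}:\d{2}:\d{2}")
PS_WARNING = re.compile(r"make sure (PS\d+) is plugged in")
I2C_ERROR = re.compile(r"HpuFailedException|I2cException")


def find_hot_plug_signature(lines, window_seconds=60):
    """Return (power_supply, seconds_later) pairs where a 'plugged in'
    warning is followed by an I2C/HPU poll error within the window.
    The window length is a hypothetical default, not a documented value."""
    window = timedelta(seconds=window_seconds)
    current = None   # timestamp of the most recent stamped line
    warning = None   # (power supply name, timestamp) of the last warning
    hits = []
    for line in lines:
        stamp = TIMESTAMP.search(line)
        if stamp:
            # Syslog-style stamps carry no year; 2000 is a placeholder so
            # that only the time difference matters. %b assumes an
            # English locale.
            current = datetime.strptime(
                "2000 " + stamp.group(0), "%Y %b %d %H:%M:%S"
            )
        ps = PS_WARNING.search(line)
        if ps:
            # The warning text sits on a continuation line, so it inherits
            # the timestamp of the preceding stamped line.
            warning = (ps.group(1), current)
        elif I2C_ERROR.search(line) and warning and current and warning[1]:
            delta = current - warning[1]
            if timedelta(0) <= delta <= window:
                hits.append((warning[0], int(delta.total_seconds())))
                warning = None  # report each warning at most once
    return hits


sample = [
    "str5-sp0:SC[engineering]> Oct 11 16:35:40 str5-sp0 Platform.SC:",
    "Please check to make sure PS5 is plugged in and the power switch",
    "is in the on position.",
    "Oct 11 16:35:46 str5-sp0 Platform.SC: Repeater Board Slot 10",
    "Device poll caused: sun.serengeti.HpuFailedException:",
]
print(find_hot_plug_signature(sample))  # [('PS5', 6)]
```

A match only indicates that the log resembles the example above. It does not confirm the root cause.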
Workaround
To reduce the chance of the described issue occurring, insert power supplies in a single smooth, continuous motion. Avoid letting the insertion motion stop before the power supply is fully seated, as this is believed to increase the chance of the issue occurring.
Resolution
This issue is addressed with power supply 300-1595-03 and with the release of power supply 300-1930-01. The 300-1930 is the RoHS-compliant version of the 300-1595 power supply and can be used in place of 300-1595-03 if 300-1595-03 is not available.
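For sites that track spares in an inventory list, the part numbers above can be checked with a small helper. This is a sketch only: the function name is hypothetical, and it assumes that dash levels later than those named above also carry the fix, which this document does not state.

```python
import re

# Minimum dash levels named in the Resolution text. Treating later dash
# levels as also fixed is an assumption, not a statement from the vendor.
FIXED_MIN_DASH = {"300-1595": 3, "300-1930": 1}


def includes_fix(part_number):
    """Return True if the part number is at or above a dash level listed
    as addressing this issue; False if it is below it or cannot be judged."""
    match = re.fullmatch(r"(\d{3}-\d{4})-(\d{2})", part_number.strip())
    if not match:
        return False  # no dash level given, so the revision is unknown
    base, dash = match.group(1), int(match.group(2))
    minimum = FIXED_MIN_DASH.get(base)
    return minimum is not None and dash >= minimum


print(includes_fix("300-1595-03"))  # True
print(includes_fix("300-1595-02"))  # False
print(includes_fix("300-1930-01"))  # True
```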
Modification History
Date: 29-AUG-2006
- State Resolved
- Updated Resolution section
Attachments
This solution has no attachment