80 - Bcache Tag Address Parity Error | |
Indicator: | EI_STAT<bc_tperr> = 1 |
Description: | A Bcache read transaction encountered bad parity in the tag address RAM. |
Recovery: | No Recovery |
Analysis: | EI_ADDR - contains the physical address associated with this
error. BC_TAG_ADDR - contains the results of the Bcache tag read. |
Faulting FRU: | MLB, Bcache |
82 - Bcache Tag Control Parity Error | |
Indicator: | EI_STAT<bc_tc_perr> = 1 |
Description: | A Bcache read transaction encountered bad parity in the tag control RAM. |
Recovery: | No Recovery |
Analysis: | EI_ADDR - contains the physical address associated with this
error. BC_TAG_ADDR - contains the results of the Bcache tag read. |
Faulting FRU: | MLB, Bcache |
86 - CPU-Detected Correctable ECC Error | |
Indicator: | ISR<crd> = 1 (note that EI_STAT<unc_cor_err> is not used) |
Description: | The CPU detected a correctable ECC error. |
Recovery: | None required. The operating system may choose to log this error. |
Analysis: | EI_ADDR - Contains the physical address of the octaword
associated with this error. BC_TAG_ADDR - contains the results of the Bcache tag read:
|
Faulting FRU: | MLB, Bcache, DIMM |
88 - CPU-Detected Noncorrectable ECC Error | |
Indicator: | EI_STAT<unc_ecc_err> = 1 |
Description: | The CPU detected an uncorrectable ECC error. |
Recovery: | TBD |
Analysis: | EI_ADDR - Contains the physical address of the octaword
associated with this error. BC_TAG_ADDR - contains the results of the Bcache tag read; if BC_TAG_ADDR<hit> = 1, the error is from a Bcache read, otherwise it is from a memory read. FILL_SYN - contains the ECC syndrome bits for the two quadwords at this address. |
Faulting FRU: | MLB, Bcache, DIMM |
8A - Unknown Error | |
Indicator: | |
Description: | If none of the errors contain a bit that is set to indicate an error, an unknown error machine check will result. |
Recovery: | |
Analysis: | |
Faulting FRU: |
8E - Bugcheck | |
Indicator: | CALL_PAL BUGCHK |
Description: | The operating system requested a BUGCHECK. |
Recovery: | No recovery. |
Analysis: | |
Faulting FRU: | Software. |
90 - OS Bugcheck | |
Indicator: | |
Description: | This is the result of a CALL_PAL to perform queue manipulation and catastrophe strikes resulting in an operating system bugcheck. |
Recovery: | No recovery. |
Analysis: | |
Faulting FRU: | Software, PAL code |
92 - Dcache Parity Error | |
Indicator: | DC_PERR_STAT<5:2> not = 0 |
Description: | A parity error was received from Dcache. |
Recovery: | No Recovery |
Analysis: | VA - contains the
effective virtual address associated with the error. VA_FORM - contains the virtual page table entry address associated with the error. DC_PERR_STAT - indicates the type of error:
|
Faulting FRU: | MLB |
94 - Icache Parity Error | |
Indicator: | ICPERR_STAT<12:11> not = 0 |
Description: | A parity error was received from Icache. |
Recovery: | No recovery. |
Analysis: | ICPERR_STAT - indicates the type
of error:
|
Faulting FRU: | MLB |
96 - Istream Retryable Error | |
Indicator: | EI_STAT<fil_ird> = 1 |
Description: | TDB |
Recovery: | TDB |
Analysis: | TBD |
Faulting FRU: | TBD |
98 - CPU-Detected Machine Check | |
Indicator: |
ICPERR_STAT<tmr> = 1 DC_PERR_STAT<lock> = 1 EI_STAT<bc_tperr> = 1 EI_STAT<bc_tc_perr> = 1 EI_STAT<ei_par_err> = 1 EI_STAT<seo_hrd_err> = 1 EI_STAT<unc_ecc_err> = 1 .and. EI_STAT<fil_ird> = 0 SC_STAT<2:0> != 0 (sc_tperr) SC_STAT<10:3> != 0 .and. SC_STAT<15:11> != 00001b SC_STAT<scnd_err> = 1 |
Description: | Machine check trap. |
Recovery: | No recovery. |
Analysis: |
If ICPERR_STAT<tmr> is set, there was a timeout reset error or a cfail_h/no cack_h error If DC_PERR_STAT<lock> is set if a parity error was detected in the Dcache. Bits <5:2> are locked against further updates. If EI_STAT<bc_tperr> is set, there was a parity error reading Bcache tag address RAM. If EI_STAT<bc_tc_perr> is set, there was a parity error reading Bcache tag control RAM. If EI_STAT<ei_par_err> is set, there was a parity error in an address and command received by the CPU. If EI_STAT<seo_hrd_err> = 1, a fill from Bcache or main memory, or a system address/command received by the CPU has a hard error while one of the hard error bits is already set. If EI_STAT<unc_ecc_err> = 1 AND EI_STAT<fil_ird> = 0, an uncorrectable ECC error occured during a Dstream fill. If SC_STAT<2:0> not = 0 (sc_tperr), there was a parity error reading Scache tags. If SC_STAT<10:3> not = 0 AND SC_STAT<15:11> not = 00001b, there was a parity error resulting from an Scache read, but not during an IREAD command. If SC_STAT<scnd_err> = 1, an Scache transaction resulted in an error while SC_TPERR or SC_DPERR was already set. |
Faulting FRU: | MLB, Bcache |
201 - PYXIS-Detected Correctable ECC Error | |
Indicator: | PYXIS_ERR<cor_err> = 1 |
Description: | The PYXIS detected a correctable single-bit error on a read from memory. |
Recovery: | Pyxis returns the correct data. The operating system may choose to log this error. |
Analysis: | The octaword address of the error, EA, can be determined from
various registers: EA<33:32> = MESR<01:00> EA<31:06> = MEAR<31:06> EA<05:04> = (MEAR<05:04> + MESR<17:16>) MOD 4 The quadword in error can be determined, note that both quadwords can incur simultaneous single-bit errors. PYXIS_SYN<24> set ==> low quadword (EA<03>=0) PYXIS_SYN<25> set ==> high quadword (EA<03>=1) PYXIS_SYN<07:00> is the syndrome for the low quadword. PYXIS_SYN<15:08> is the syndrome for the high quadword. MESR<24:20> is the DATA_CYCLE_TYPE in progress at the time of the error. |
Faulting FRU: | MLB, Bcache, DIMM |
203 - PYXIS-Detected Noncorrectable ECC Error | |
Indicator: | PYXIS_ERR<un_cor_err> = 1 |
Description: | PYXIS detected an uncorrectable error on a read from memory |
Recovery: | No recovery |
Analysis: | The octaword address of
the error, EA, can be determined from various registers: EA<33:32> = MESR<01:00> EA<31:06> = MEAR<31:06> EA<05:04> = (MEAR<05:04> + MESR<17:16>) MOD 4 The quadword in error can be determined, note that both quadwords can incur simultaneous uncorrectable errors: PYXIS_SYN<26> set ==> low quadword (EA<03>=0) PYXIS_SYN<27> set ==> high quadword (EA<03>=1) MESR<24:20> is the DATA_CYCLE_TYPE in progress at the time of the error |
Faulting FRU: | MLB, Bcache, DIMM |
205 - PYXIS-Detected Command Bus Parity Error | |
Indicator: | PYXIS_ERR<cpu_pe> = 1 |
Description: | Parity error on the CPU/PYXIS command bus. |
Recovery: | No recovery |
Analysis: | PYXIS must not check parity on the CPU command bus, so either firmware failed to disable the check or PYXIS is broken. |
Faulting FRU: | Firmware, MLB |
207 - PYXIS-Detected Nonexistent Memory Reference | |
Indicator: | PYXIS_ERR<mem_nem> = 1 |
Description: | Indicates that a read or write occurred to an invalid address which does not map to any memory bank, CSR or I/O quadrant. |
Recovery: | No recovery |
Analysis: | The octaword address
of the error, EA, can be determined from various registers: EA<33:32> = MESR<01:00> EA<31:06> = MEAR<31:06> EA<05:04> = (MEAR<05:04> + MESR<17:16>) MOD 4 The type of memory access which caused the failure can be determined from the MESR register:
|
Faulting FRU: | MLB, software, firmware, PCI option, ISA option |
209 - PCI System Error (SERR) | |
Indicator: | PYXIS_ERR<pci_serr> = 1 |
Description: | A PCI SERR (system error) was reported. |
Recovery: | No recovery |
Analysis: | SERR is reported by any
PCI device when it detects a parity error on the PCI bus during an address cycle. (PYXIS reports an address parity error through PYXIS_ERR<6>.) These errors are hard to trace,
but the following registers may have relevant information:
|
Faulting FRU: | MLB, riser, PCI option |
20B - PCI Data Parity Error (PERR) | |
Indicator: | PYXIS_ERR<pci_perr> = 1 |
Description: | A PCI parity error occurred in the data phase of an I/O read/write transaction, or a PCI PERR (parity error) was reported. |
Recovery: | No recovery |
Analysis: | PERR is reported by
any PCI device when it detects a parity error on the PCI bus during a data cycle. PYXIS
reports a data parity error which it detects here, too. These errors are hard to trace, but
the following registers may have relevant information:
|
Faulting FRU: | PCI option, MLB, riser |
20D - PCI Address Parity Error | |
Indicator: | PYXIS_ERR<pci_addr_pe> = 1 |
Description: | PYXIS detected a parity error in the address phase of PCI bus transaction. |
Recovery: | No recovery |
Analysis: | PCI_ERR0<3:0>
(dma_cmd) holds the C/BE command for this DMA transaction PCI_ERR0<dac> is set if this DMA transaction is a dual-address cycle PCI_ERR1 contains the DMA address PYXIS_STAT might have pertinent data also. |
Faulting FRU: | PCI option, MLB, riser |
20F - PCI Master Abort | |
Indicator: | PYXIS_ERR<rcvd_mas_abt> = 1 |
Description: | PYXIS generated a PCI master abort cycle. Note that PYXIS does not report if a master ends a transaction with a target abort. |
Recovery: | No recovery |
Analysis: | TBD |
Faulting FRU: | PCI option, MLB, riser |
211 - PCI Target Abort | |
Indicator: | PYXIS_ERR<rcvd_tar_abt> = 1 |
Description: | A PCI target device ended a transaction using the PCI target abort protocol. Note that Pyxis does not report if it ends a transaction with a target abort. |
Recovery: | No recovery |
Analysis: | If PYXIS_STAT<0> = 1,
PYXIS PCI master state engine is active and:
|
Faulting FRU: | PCI option, ISA option |
213 - Invalid Scatter/Gather TLB Entry | |
Indicator: | PYXIS_ERR<pa_pte_inv> = 1 |
Description: | Invalid page table entry (PTE) on scatter/gather access. A read error on a TLB fill is reported elsewhere. |
Recovery: | No recovery |
Analysis: | PCI_ERR0<dma_cmd>
holds the C/BE command for this DMA transaction. PCI_ERR0<dma_dac> is set if this DMA transaction is a dual-address cycle. PCI_ERR1 contains the DMA address. |
Faulting FRU: | MLB, DIMM, PCI option, software |
215 - FlashROM Write Error | |
Indicator: | Not used on a-Series |
Description: | |
Recovery: | |
Analysis: | |
Faulting FRU: |
217 - PCI I/O Timeout | |
Indicator: | PYXIS_ERR<ioa_timeout> = 1 |
Description: | I/O read/write timed out. |
Recovery: | No recovery |
Analysis: | PCI_ERR0<pci_cmd> holds the
C/BE command for this transaction. PCI_ERR0<pci_dac> is set if this transaction is a dual-address cycle. PCI_ERR2 contains the PCI address. |
Faulting FRU: | PCI option, ISA option, MLB, riser, software |
219 - ISA NMI | |
Indicator: | Not used on a-Series |
Description: | |
Recovery: | |
Analysis: | |
Faulting FRU: |
221 - Unexpected Interrupt At IPL23 | |
Indicator: | |
Description: | The CPU detected an interrupt on a line that is not supposed to be connected in the Personal Workstation a-Series. |
Recovery: | |
Analysis: | |
Faulting FRU: | MLB, firmware, software |
223 - Fan Failure | |
Indicator: | INT_REQ<0> = 1 (fan fault) |
Description: | One or both of the system fans are either not properly connected to the riser, or have stopped spinning due to failure or obstruction. |
Recovery: | No Recovery. The operating system should attempt to log this error to disk, then report the failure to the console firmware which will turn off the power supply to prevent the system from overheating. |
Analysis: | |
Faulting FRU: | Fans, riser, MLB |