Sunday 29 September 2013

AIX OS errpt error output explained

errpt - The errpt command generates an error report from the entries in the system error log.
# errpt
982C78BF   0930152713 T S mir0           DISPLAY ADAPTER CONFIGURATION ERROR
0BA49C99   0930153113 T H scsi0          SCSI BUS ERROR
0BA49C99   0930153013 T H scsi0          SCSI BUS ERROR
49A83216   0930152913 T H hdisk0         DISK OPERATION ERROR
0BA49C99   0930152913 T H scsi0          SCSI BUS ERROR
0BA49C99   0930152813 T H scsi0          SCSI BUS ERROR
0BA49C99   0930152913 T H scsi0          SCSI BUS ERROR
7F88E76D   0929104913 P S console        SOFTWARE PROGRAM ERROR
9DBCFDEE   0929104613 T O errdemon       ERROR LOGGING TURNED ON
192AC071   0926092513 T O errdemon       ERROR LOGGING TURNED OFF
A6DF45AA   0925162713 I O RMCdaemon      The daemon is started.
1BA7DF4E   0925162713 P S SRC            SOFTWARE PROGRAM ERROR
BA431EB7   0925162713 P S SRC            SOFTWARE PROGRAM ERROR

Column-1 : ErrorID

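The error identifier selects which entries to report. Use it with the -j flag to get the full report for matching entries from the error log, or combine -j with the -t flag to read the matching template from the error-record template repository instead.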
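Because the same identifier usually appears many times in a summary, it can help to count entries per identifier before pulling a detailed report. The following is a minimal sketch, not part of errpt itself: count_by_id is a hypothetical helper name, and the here-document holds sample lines copied from the listing above. On a live system the input would be the output of errpt.

```shell
#!/bin/sh
# count_by_id: hypothetical helper, reads errpt summary lines on stdin and
# prints "<count> <identifier>", most frequent first.
count_by_id() {
    # Field 1 is the identifier. Skip the column header line that errpt
    # prints on a real system (it starts with the word IDENTIFIER).
    awk '$1 != "IDENTIFIER" { n[$1]++ }
         END { for (id in n) print n[id], id }' | sort -rn
}

# Sample lines from the summary above; replace with `errpt | count_by_id`.
count_by_id <<'EOF'
0BA49C99   0930153113 T H scsi0          SCSI BUS ERROR
0BA49C99   0930153013 T H scsi0          SCSI BUS ERROR
49A83216   0930152913 T H hdisk0         DISK OPERATION ERROR
0BA49C99   0930152913 T H scsi0          SCSI BUS ERROR
0BA49C99   0930152813 T H scsi0          SCSI BUS ERROR
EOF
```

With these five sample lines the SCSI bus error identifier is listed first with a count of 4, followed by the disk error with a count of 1. Any identifier from the list can then be passed to errpt -a -j for the detailed view.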

Column-2 : TIMESTAMP

    The timestamp is in MMDDhhmmYY format. For example:
    0930152913 : (09 = Month, 30 = Day, 15 = Hour, 29 = Minutes, 13 = Year)
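The field layout above can be decoded mechanically. This is a small sketch under stated assumptions: decode_errpt_ts is a hypothetical helper name, and the two-digit year is assumed to fall in the 2000s.

```shell
#!/bin/sh
# decode_errpt_ts: hypothetical helper that turns an errpt summary
# timestamp (MMDDhhmmYY) into "YYYY-MM-DD hh:mm".
# Assumption: the two-digit year belongs to the 2000s.
decode_errpt_ts() {
    printf '%s\n' "$1" | awk '
        # Accept only a string of exactly ten digits.
        length($0) == 10 && $0 ~ /^[0-9]+$/ {
            printf "20%s-%s-%s %s:%s\n",
                substr($0, 9, 2),   # YY
                substr($0, 1, 2),   # MM
                substr($0, 3, 2),   # DD
                substr($0, 5, 2),   # hh
                substr($0, 7, 2)    # mm
            ok = 1
        }
        # Non-zero exit status when the input was not a valid timestamp.
        END { exit (ok ? 0 : 1) }'
}

decode_errpt_ts 0930152913   # prints 2013-09-30 15:29
```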

Column-3 : TYPE OF ERROR

    P - Permanent
    T - Temporary
    I - Informational
    U - Unknown

Column-4 : ERRCLASS

    H - Hardware
    S - Software
    O - Operator Notice (initiated with the errlogger command)
    U - Undetermined

Column-5 : RESOURCE NAME

    The name of the resource that detected the error. It may not be the component having the problem.

Column-6 : DESCRIPTION

    A short description of the error, taken from the error-record template repository.
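Putting the six columns together, a summary line can be split on whitespace: the first five fields are single tokens and the description is everything that remains. The sketch below filters by class (column 4) and prints identifier, resource and description. filter_class is a hypothetical helper name; errpt can also filter by class natively with the -d flag (for example errpt -d H), so this is mainly useful on saved output.

```shell
#!/bin/sh
# filter_class: hypothetical helper, reads errpt summary lines on stdin and
# prints "identifier|resource|description" for entries whose class
# (column 4) equals the first argument.
filter_class() {
    awk -v cls="$1" '
        $1 == "IDENTIFIER" { next }          # skip the column header line
        $4 == cls {
            # Columns 6..NF together form the description.
            desc = $6
            for (i = 7; i <= NF; i++) desc = desc " " $i
            print $1 "|" $5 "|" desc
        }'
}

# Sample lines from the summary above; only the hardware entries are printed.
filter_class H <<'EOF'
982C78BF   0930152713 T S mir0           DISPLAY ADAPTER CONFIGURATION ERROR
0BA49C99   0930153113 T H scsi0          SCSI BUS ERROR
49A83216   0930152913 T H hdisk0         DISK OPERATION ERROR
7F88E76D   0929104913 P S console        SOFTWARE PROGRAM ERROR
EOF
```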

Detailed report for a single identifier - Sample Output

# errpt  -a -j 49A83216
LABEL:          DISK_ERR4
IDENTIFIER:     49A83216

Date/Time:       Thu Sep 30 15:29:45 CST 2013
Sequence Number: 25
Machine Id:      000788574C00
Node Id:         localhost
Class:           H
Type:            TEMP
Resource Name:   hdisk0          
Resource Class:  disk
Resource Type:   scsd
Location:        P1/Z1-A0
VPD:             
        Manufacturer................IBM     
        Machine Type and Model......ST318305LW      
        FRU Number..................09P4429     
        ROS Level and ID............43353042
        Serial Number...............000A2D99
        EC Level....................H11936    
        Part Number.................09P4428     
        Device Specific.(Z0)........000003129F00013E
        Device Specific.(Z1)........0501C50B
        Device Specific.(Z2)........1000
        Device Specific.(Z3)........02121
        Device Specific.(Z4)........0001
        Device Specific.(Z5)........22
        Device Specific.(Z6)........162870 C  

Description
DISK OPERATION ERROR

Probable Causes
MEDIA
DASD DEVICE

User Causes
MEDIA DEFECTIVE

        Recommended Actions
        FOR REMOVABLE MEDIA, CHANGE MEDIA AND RETRY
        PERFORM PROBLEM DETERMINATION PROCEDURES

Failure Causes
MEDIA
DISK DRIVE

        Recommended Actions
        FOR REMOVABLE MEDIA, CHANGE MEDIA AND RETRY
        PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
PATH ID
           0
SENSE DATA
0A00 0000 2800 00DB 67B0 0000 1800 0000 0200 0400 0000 0000 0000 0000 0000 0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 001A 0000 
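The header of the detailed report is a series of "Key: value" lines, so single fields are easy to pull out of saved output. This is a minimal sketch, assuming the layout shown in the sample above. get_field is a hypothetical helper name, and it splits on the first colon only so that values such as Date/Time, which contain colons themselves, stay intact.

```shell
#!/bin/sh
# get_field: hypothetical helper, reads `errpt -a` output on stdin and
# prints the value of the first "Key: value" line whose key matches
# the first argument exactly.
get_field() {
    awk -v want="$1" '
        {
            i = index($0, ":")               # position of the first colon
            if (i == 0) next                 # not a "Key: value" line
            key = substr($0, 1, i - 1)
            val = substr($0, i + 1)
            gsub(/^[ \t]+|[ \t]+$/, "", val) # trim surrounding whitespace
            if (key == want) { print val; exit }
        }'
}

# Sample header lines from the detailed report above.
get_field "Resource Name" <<'EOF'
LABEL:          DISK_ERR4
IDENTIFIER:     49A83216

Date/Time:       Thu Sep 30 15:29:45 CST 2013
Sequence Number: 25
Class:           H
Type:            TEMP
Resource Name:   hdisk0          
EOF
```

Here the call prints hdisk0. On a live system the input would come from a command such as errpt -a -j 49A83216.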

6 comments:

  1. Nice article, but is there a tool or website backed by a database that can be used to analyze the SENSE DATA in depth? As far as I know, there is only an IBM-internal analyzer tool, so one has to ask IBM AIX support.

  2. Is it possible to have a different identifier for "software program abnormally terminated" errors based on the program name? I would like a unique identifier for each application.

    A924A5FC 0513151615 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED

    PROGRAM NAME
    XMLForm

  3. How can I get the SRC, and what is the next step to fix it?
