04-Aug-2011

Crash while verifying boot options

While cleaning up some boot definitions with SYS$MANAGER:BOOT_OPTIONS.COM, in preparation for relocating the system disk from one EVA disk group to another, we experienced a system crash. The problem has been escalated to OpenVMS Engineering, as we are already running the latest and greatest update patch (UPDATE-V1000).

The crash footprint is as follows:



SDA> clue crash
Crashdump Summary Information:
------------------------------
Crash Time:        29-JUL-2011 12:21:38.02
Bugcheck Type:     SSRVEXCEPT, Unexpected system service exception
Node:              XXXXXX  (Cluster)
CPU Type:          HP BL860c  (1.59GHz/9.0MB)
VMS Version:       V8.3-1H1
Current Process:   xxxxxxxxx
Current Image:     DSA201:[SYS2.SYSCOMMON.][SYSEXE]EFI$BCFG.EXE;1
Failing PC:        00000000.00041431    EFI$BCFG+41431
Failing PS:        00000000.0000000B
Module:            EFI$BCFG
Offset:            00041431

Boot Time:         26-JUN-2011 11:28:50.00
System Uptime:              33 00:52:48.02
Crash/Primary CPU: 3./0.
System/CPU Type:   4020
Saved Processes:   1195
Pagesize:          8 KByte (8192 bytes)
Physical Memory:   20479 MByte (134742016 PFNs, discontiguous memory)
Dumpfile Pagelets: 11678841 blocks
Dump Flags:        olddump,writecomp,errlogcomp
Dump Type:         compressed,selective,dosd,shared_mem
EXE$GL_FLAGS:      poolpging,init,bugdump,tbchk
Paging Files:      1 Pagefile and 0 Swapfiles installed

Stack Pointers:
KSP = 00000000.7FF43750   ESP = 00000000.7FF68000   SSP = 00000000.7FFAC000
USP = 00000000.7AB01160

General Registers:
R0  = 00000000.00000000   GP  = FFFFFFFF.AD906C00   R2  = 00000000.00000233
R3  = 00000000.00000001   R4  = 00000000.7FF437F0   R5  = 00000000.7FF43A48
R6  = 00000000.7FF43AB0   R7  = 00000000.00000002   R8  = 00000000.7FF43810
R9  = 00000000.000005AC   R10 = 00000000.00000B1E   R11 = FFFFFFFF.FFFFFFE0
SP  = 00000000.00000000   TP  = 00000000.7B3581C8   R14 = 11D9008B.D487DDB4
R15 = FFFFFFFF.88045D90   R16 = 00020000.00000021   R17 = 00000000.000003C4
R18 = FFFFFFFF.886DF658   R19 = FFFFFFFF.8023B1C0   R20 = 000007FD.BFFB8000
R21 = 00000000.00000000   R22 = 00000000.00000001   R23 = 00000000.0004BFE0
R24 = 00000000.7FF43530   AI  = 00000000.00000002   RA  = FFFFFFFF.AD6215C0
PV  = 000007FD.BFFB87F0   R28 = FFFFFFFF.7FFFBBB8   FP  = 00000000.00000000
R30 = 000007FD.BFFB4020   R31 = 00000000.7FF43780

Exception Frame:
Exception taken at IP 00000000.00041430, slot 01 from Kernel mode
Trap Type   00000008 (Access control violation fault)
IVT Offset  00000800 (Data TLB Fault)

Control Registers:
CR0   Default Control Register (DCR)         00000000.00007F00
CR16  Processor Status Register (IPSR)       00001210.084A6010
CR17  Interrupt Status Register (ISR)        00000A04.00000000
CR19  Instruction Pointer (IIP)              00000000.00041430
CR20  Faulting Address (IFA)                 00000000.00000044
CR21  TLB Insertion Register (ITIR)          00000000.A144F634
CR22  Instruction Previous Address (IIPA)    00000000.00041430
CR23  Function State (IFS)                   80000000.00000489
CR24  Instruction immediate (IIM)            00000000.00000000
CR25  VHPT Hash Address (IHA)                FFFFFFFF.7FEA9EC0

Application Registers:
AR16  Register Stack Config Reg (RSC)        00000000.00000003
AR17  Backing Store Pointer (BSP)            00000000.7FF2E320
AR18  Backing Store for Mem Store (BSPSTORE) 00000000.7FF2E218
AR19  RSE NaT Collection Register (RNAT)     00000000.00000000
AR32  Compare/Exchange Comp Value Reg (CCV)  00070000.000A24E0
AR36  User NaT Collection Register (UNAT)    00000000.00000000
AR64  Previous Function State (PFS)          00000000.00000695
AR65  Loop Count Register (LC)               00000000.00000000
AR66  Epilog Count Register (EC)             00000000.00000000

Processor Status Register (IPSR):
AC = 0   MFL= 1   MFH= 0   IC = 1   I  = 1   DT = 1
DFL= 0   DFH= 1   RT = 1   CPL= 0   IT = 1   MC = 0   RI = 1
Interrupt Status Register (ISR):
Code 00000000     X  = 0   W  = 0   R  = 1   NA = 0   SP = 0
RS = 0   IR = 0   NI = 0   SO = 0   EI = 1   ED = 1

Branch Registers:
B0        00000000.0003F720
B1        00000000.00000000
B2        00000000.00000000
B3        00000000.00000000
B4        00000000.00000000
B5        00000000.00000000
B6        FFFFFFFF.8467F070
B7        FFFFFFFF.80D159F0

Floating Point Registers:          FPSR      0009804C.0270033F
F6        00000000.0001003E.00000000.00000A08
F7        00000000.0001003E.00000000.00000A08
F8        00000000.0001003E.00000000.00000001
F9        00000000.0001003E.00000000.00000A00
F10       00000000.0001003E.00000000.00000018
F11       00000000.0000FFDD.80000000.00000000

Miscellaneous Registers:
Interrupt Priority Level (IPL)                        00000000
Stack Align                                           000002D0
NaT Mask                                                  0016
PPrev Mode                                                  03
Previous Stack                                              00
Interrupt Depth                                             00
Preds                                        00000000.00750D53
Nats                                         00000000.00000000
Context                                      00000000.00718E13

General Registers:
R0   00000000.00000000     GP   00000000.00270000     R2   00000000.7FF43E80
R3   00000000.00000001     R4   FFFFFFFF.8B86D100     R5   00000000.00000088
R6   00000000.000805D8     R7   00000000.00000002     R8   00000000.00000000
R9   00000000.7FF43E18     R10  00000000.7FF43E20     R11  FFFFFFFF.886DF3C0
SP   00000000.7FF43D80     TP   00000000.7B3581C8     R14  11D9008B.D487DDB4
R15  00000000.7FF43D80     R16  FFFFFFFF.8467F070     R17  00000000.7AB013A5
R18  FFFFFFFF.886DF658     R19  00000000.00000044     R20  00000000.7FF43E44
R21  00000000.00000002     R22  00000000.00000000     R23  00000000.7FF43E58
R24  00000000.00000010     R25  00000000.00000003     R26  00000000.7FFFFFFF
R27  00000000.00000050     R28  FFFFFFFF.FFFFFF9F     R29  00000000.00000001
R30  00000000.00000000     R31  00000000.00000000


Signal Array:                            64-bit Signal Array:
Arg Count    = 00000005                  Arg Count      =          00000005
Condition    = 0000000C                  Condition      = 00000000.0000000C
Argument #2  = 00000000                  Argument #2    = 00000000.00000000
Argument #3  = 00000044                  Argument #3    = 00000000.00000044
Argument #4  = 00041431                  Argument #4    = 00000000.00041431
Argument #5  = 00000003                  Argument #5    = 00000000.00000003

Mechanism Array:
Arguments    = 00000049                  Establisher FP = 00000000.7AB01160
Flags        = 00000001                  Exception FP   = 00000000.7FF43AB0
Depth        = FFFFFFFD                  Signal Array   = 00000000.7FF43A48
Handler Data = 00000000.00000000         Signal64 Array = 00000000.7FF43A60

R0  = 00000000.00000000   R1  = 00000000.7FF43E18
F2  = 00000000.00000000   F3  = 00000000.00000000   F4  = 00000000.00000000
F5  = 00000000.00000000   F12 = 00000000.0000001F   F13 = F8000000.00000000
F14 = 80000000.00000000   F15 = 00000000.00000000   F16 = 00000000.00000000
F17 = 00000000.00000000   F18 = 00000000.00000000   F19 = 00000000.00000000
F20 = 00000000.00000000   F21 = 00000000.00000000   F22 = 00000000.00000000
F23 = 00000000.00000000   F24 = 00000000.00000000   F25 = 00000000.00000000
F26 = 00000000.00000000   F27 = 00000000.00000000   F28 = 00000000.00000000
F29 = 00000000.00000000   F30 = 00000000.00000000   F31 = 00000000.00000000

System Registers:
Page Table Base Register (PTBR)                           00000000.0023E5AE
Processor Base Register (PRBR)                            FFFFFFFF.881B6A80
Privileged Context Block Base (PCBB)                      FFFFFFFF.AED5E080
System Control Block Base (SCBB)                          00000000.00000004
Software Interrupt Summary Register (SISR)                00000000.00000000
Address Space Number (ASN)                                00000000.00A144F6
AST Summary / AST Enable (ASTSR_ASTEN)                    00000000.0000000F
Floating-Point Enable (FEN)                               00000000.00000001
Interrupt Priority Level (IPL)                            00000000.00000000
Machine Check Error Summary (MCES)                        00000000.00000000
Virtual Page Table Base Register (VPTB)                   00000000.00000000

Failing Instruction:
EFI$BCFG+41431:              ld4         r3 = [r19]

Instruction Stream (last 20 instructions):
SH$C_CIP_MASK+00FDC:              ld4         r23 = [r23] ;;
SH$C_CIP_MASK+00FDD:              st2         [r11] = r23
SH$C_CIP_MASK+00FDE:              nop.i       000000 ;;
SH$C_CIP_MASK+00FEC:              br.ret.sptk.many b0
SH$C_CIP_MASK+00FED:              nop.b       000000
SH$C_CIP_MASK+00FEE:              nop.b       000000 ;;
SH$C_CIP_MASK+00FFC:              alloc       r40 = ar.pfs, 09, 00, 00
SH$C_CIP_MASK+00FFD:              mov         r11 = r32
SH$C_CIP_MASK+00FFE:              mov         r10 = r33 ;;
EFI$BCFG+41410:              mov         r9 = r34
EFI$BCFG+41411:              add         r18 = 0298, r11 ;;
EFI$BCFG+41412:              nop.i       000000 ;;
EFI$BCFG+41420:              ld4         r8 = [r18] ;;
EFI$BCFG+41421:              nop.m       000000
EFI$BCFG+41422:              sxt4        r8 = r8 ;;
EFI$BCFG+41430:              add         r19 = 0044, r8 ;;
EFI$BCFG+41431:              ld4         r3 = [r19]
EFI$BCFG+41432:              nop.i       000000 ;;
EFI$BCFG+41440:              nop.m       000000
EFI$BCFG+41441:              sxt4        r3 = r3
EFI$BCFG+41442:              add         r20 = 0060, r8 ;;
EFI$BCFG+41450:              ld8         r20 = [r20]
EFI$BCFG+41451:              mov         r21 = 0000FF ;;
EFI$BCFG+41452:              and         r20 = r20, r21 ;;
EFI$BCFG+41460:              mov         r22 = 000038 ;;
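
Reading the footprint above, the fault looks like a read through a zero base pointer. R18 is R11 + 0298, the longword loaded from there into R8 is zero, and the failing ld4 then reads from R8 + 0044, which matches the faulting address (IFA) of 00000000.00000044 and signal array argument #3. The following is only a minimal sketch of that arithmetic in Python, using values copied from the dump. The function names are hypothetical, and the interpretation is my own reading of the dump, not something confirmed by Engineering.

```python
# Values copied from the crash dump above. The interpretation (a zero
# base pointer plus a field offset) is an assumption, not a confirmed
# diagnosis.
R11_BASE = 0xFFFFFFFF886DF3C0      # R11 in the exception-time registers
R18_EXPECTED = 0xFFFFFFFF886DF658  # R18 in the exception-time registers
R8_LOADED = 0x00000000             # R8 after "ld4 r8 = [r18]"
FIELD_OFFSET = 0x44                # immediate in "add r19 = 0044, r8"
FAULTING_ADDRESS = 0x44            # CR20 (IFA) and signal argument #3

MASK64 = 0xFFFFFFFFFFFFFFFF


def sign_extend_32(value: int) -> int:
    """Mimic sxt4: sign-extend the low 32 bits to a 64-bit value."""
    value &= 0xFFFFFFFF
    return value - (1 << 32) if value & 0x80000000 else value


def effective_address(base: int, offset: int) -> int:
    """Address formed by sxt4 on the base followed by add of the offset."""
    return (sign_extend_32(base) + offset) & MASK64


# "add r18 = 0298, r11" should reproduce the R18 value seen in the dump.
r18 = (R11_BASE + 0x298) & MASK64

# "sxt4 r8 = r8" then "add r19 = 0044, r8" gives the address that was read.
r19 = effective_address(R8_LOADED, FIELD_OFFSET)

print(f"r18 = {r18:016X}")
print(f"r19 = {r19:016X}")
print("matches IFA" if r19 == FAULTING_ADDRESS else "does not match IFA")
```

If that reading is right, the condition value 0000000C (SS$_ACCVIO) taken in kernel mode is what turns an ordinary access violation into the SSRVEXCEPT bugcheck.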


Posted at August 4, 2011 6:30 PM
Comments

I've received a patched version of EFI$BCFG.EXE with identification "X-36A1" from OpenVMS Engineering. This has been installed on our development systems with no adverse effects. Engineering tell me it will be included in a future EFI patch kit.

Hopefully, problem solved.

Posted by: Jim Duff at August 19, 2011 1:11 PM
