[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference orarep::nomahs::dbms

Title:	VAX DBMS
Notice:	THIS NOTESFILE IS NOT A FORMAL SUPPORT CHANNEL
Moderator:	SCARY::CHARLAND

Created:	Thu Feb 20 1986
Last Modified:	Tue Jun 03 1997
Last Successful Update:	Fri Jun 06 1997
Number of topics:	2642
Total number of notes:	11044

2630.0. "piofetch$within_db +448" by M5::MONEIL () Fri May 09 1997 14:11

    
      I have a MANMAN customer running 6.0-11 of DBMS, and VMS 6.1.  He is
    running an application that is consistently causing a bugcheck with the
    exception piofetch$within_db +448, with a cosi-f-bugcheck error.  This
    is on an Alpha by the way.  I found that the dio_cur_dbk is -1:-1:-1.
    I had seen other articles on this, but did not find anything concrete
    as an answer.  I had him run a full verify on a copy of this database
    from a recent backup, and it ran clean.  The verb from the parameter
    block was a fetch.
      Is this a possible memory glitch, and upgrading VMS and DBMS would
    help ?  If not, what should I look at next ?
    
    Mark

T.R	Title	User	Personal Name	Date	Lines
2630.1		ukvms3.uk.oracle.com::PJACKSON	Oracle UK Rdb Support	`Fri May 09 1997 14:42`	8
	> Is this a possible memory glitch, and upgrading VMS and DBMS would > help ? If not, what should I look at next ? A lot of things could cause this. The bugcheck needs to be examined more fully. The next thing I would look at are the routines on the call stack, to see what was being done. Peter
2630.2	Stack for bugcheck	M5::LSMITH		`Fri May 09 1997 22:30`	29
	Posting stack for Mark O'Neil. If anyone can provide some insight tonight, the customer would appreciate it. Thanks, Lisa Saved PC = 80006F04 : S0 address Saved PC = 9BDE42BC : S0 address ***** Exception at 00B43140 : PIOFETCH$WITHIN_DB + 00000448 %COSI-F-BUGCHECK, internal consistency failure Saved PC = 00B41064 : PIO$FETCH_RET + 000002BC Saved PC = 00B40A7C : PIO$FETCH + 000004BC Saved PC = 00B0855C : DIOFETCH$FETCH_VISIBLE_SEG + 00000498 Saved PC = 00B07B68 : DIOFETCH$FETCH_ONE_LINE + 0000048C Saved PC = 00B0901C : DIOFETCH$SCAN_ONE_PAGE + 000002D0 Saved PC = 00B0730C : DIO$FETCH_AREA + 00000194 Saved PC = 00AF2A88 : DSIFIND$SCAN_AREA + 00000068 Saved PC = 00AF2884 : DSI$FIND_WITHIN_AREA + 00000304 Saved PC = 00AD0B30 : LSIFETCH$WITHIN_REALM2 + 00000130 Saved PC = 00AD097C : LSIFETCH$WITHIN_REALM + 00000184 Saved PC = 00AD04F4 : LSIFETCH$RSE1 + 0000019C Saved PC = 00ACF99C : LSI$FETCH + 0000007C Saved PC = 009F9788 : DBM$DML + 000001A8 Saved PC = 009F8714 : DBM$32X + 00000314 Saved PC = 00D3590C : BLI$CALLG + 000000BC Saved PC = 00B21910 : KOD$SETSTK_AND_CONTINUE + 00000140 Saved PC = 00B216A0 : KODSTREAM$DO_THREAD + 00000188
2630.3	bad dbkey most likely	hotrdb.us.oracle.com::LASTOVICA	Can you be a closet claustrophobic?	`Fri May 09 1997 23:29`	6
	> ***** Exception at 00B43140 : PIOFETCH$WITHIN_DB + 00000448 this bugcheck means that a page number was requested that is past the end of a storage area. I'd first recommend that the database gets completely verified. From there, I dunno. Sounds like a bad DBKEY is probably getting passed around.
2630.4	more information from the customer	BROKE::BASTINE		`Sat May 10 1997 00:57`	12
	He did a complete verify and it came up clean. He also did a full backup and restore, all successful. Is there is anyway to locate such a bad dbkey if it is floating around? He did a full verify and it didn't pick up anything and he sees the problem in 2 different areas of the database. What can we do to find this? Thanks, Renee (for Mike)
2630.5		hotrdb.us.oracle.com::LASTOVICA	Can you be a closet claustrophobic?	`Sat May 10 1997 04:29`	2
	one would need to dig through the dump file to find the offending dbkey and from there try to determine where it originated.
2630.6		HOTRDB::PMEAD	Paul, pmead@us.oracle.com, 719-577-8032	`Sun May 11 1997 16:09`	2
	Post the stack with the register dumps and we probably can pinpoint the cause.
2630.7	Erroneous TSN	M5::DMACKENZ		`Tue May 20 1997 19:30`	29
	FYI, the cause of the bugcheck: The FETCH was for a sequential walk of the realm. The problem only occurs in batch retrieval jobs. When walking the realm, DBMS checked the TSN index and determined that it needed to fetch the snapshot page to get a visible version of the record. The snapshot page was empty, hence the bugcheck. All the snapshot pages were empty, as they should be, because the database was just restored. The problem was with the TSN number for this line in the TSN index. It was erroneous. The currency indicators in the bugcheck did not point to this problem page. The problem page was in the buffers seen in the bugcheck. To find the problem page, I started with the record's current dbkey (FETCH DBKEY), turned on the currency update display (SET CURSIG), and looped through fetching the next record in a batch retrieval transaction. Starting with the last dbkey shown before the bugcheck, I had to dump pages till I found the next record (the problem record). To fix the problem, we updated the record. This fixed the TSN number for the line in the TSN index. Per the customer, this problem record was current when the node crashed. Diane
2630.8		HOTRDB::PMEAD	Paul, pmead@us.oracle.com, 719-577-8032	`Tue May 20 1997 19:57`	1
	How erroneous was the TSN?