Return to BSD News archive
Newsgroups: comp.unix.bsd.freebsd.misc,comp.unix.bsd.bsdi.misc Path: euryale.cc.adfa.oz.au!newshost.carno.net.au!harbinger.cc.monash.edu.au!munnari.OZ.AU!news.mel.connect.com.au!news.mira.net.au!vic.news.telstra.net!act.news.telstra.net!psgrain!iafrica.com!uct.ac.za!und.ac.za!peacenjoy.mikom.csir.co.za!news.uoregon.edu!enews.sgi.com!www.nntp.primenet.com!nntp.primenet.com!howland.erols.net!news1.erols.com!uunet!in3.uu.net!news.new-york.net!spcuna!spcvxb!terry From: terry@spcvxb.spc.edu (Terry Kennedy, Operations Mgr.) Subject: Re: Why one should buy parity memory for reliability? Nntp-Posting-Host: spcvxa.spc.edu References: <32485B0D.41C6@austin.ibm.com> Sender: news@spcuna.spc.edu (Network News) Organization: St. Peter's College, US Date: Wed, 25 Sep 1996 21:31:46 GMT Message-ID: <1996Sep25.173146.1@spcvxb.spc.edu> Lines: 17 Xref: euryale.cc.adfa.oz.au comp.unix.bsd.freebsd.misc:28010 comp.unix.bsd.bsdi.misc:4988 In article <32485B0D.41C6@austin.ibm.com>, Tushar Patel <tpatel@austin.ibm.com> writes: > If the Board supports the parity memory and error occurs then in > theory the OS should be notified and the access should be reapeted. Unfortunately, the PC's design doesn't latch the failing address, so there isn't any easy way to determine it. I've seen implementations that scan all of memory sequentially looking for the error, but there's no guarantee that the error will happen again. The ECC designs (like in the Triton II chipset) save the failing address (and syndrome) which will make it much easier to do so something useful with the error. However, the upper layers of the OS still need to be modified to handle de-allocating bad memory, terminating a user process, etc. Terry Kennedy Operations Manager, Academic Computing terry@spcvxa.spc.edu St. Peter's College, Jersey City, NJ USA +1 201 915 9381 (voice) +1 201 435-3662 (FAX)