Computer had incident

Hon1nbo
Wed Aug 27 2014, 06:39PM
Hon1nbo Registered Member #902 Joined: Sun Jul 15 2007, 08:17PM
Location: North Texas
Posts: 1040
Hi all,

So I had my AC fail over the weekend, and I shut my computer off to protect it (my condo got over 100 F). When the AC was restored, I brought the computer back online. However, since it would take time to get the entire unit down to temp, I put a block of dry ice on my radiator (I'd taken measures to prevent the water from freezing).

Once the AC had cooled things down enough, I figured I'd benchmark an overclock while I was at it, since there was still plenty of dry ice left. After the CPU portion of the benchmark finished, I went into the next room for a couple of minutes while the GPU portion ran.

I came back to white screens. I shut the computer off, and I couldn't even get POST to start. I started disabling devices (the motherboard has a handy set of DIP switches that allow disabling controllers and PCIe devices for debugging, as well as a hex readout). I found that everything worked fine when the GPU on PCIe lane 1 was disabled. I know the water lines didn't freeze on me, as the water was still moving through the system with dry ice remaining, as indicated by a flow window I have.

I would chalk it up to a failed GPU (possible overheat), but I also found something that was much worse for me after the POST finally started: the incident had removed two drives from my RAID 5 array (which could only tolerate 1 drive failure out of the 4). I figured I'd start restoring from backup; maybe the system crashed as the benchmark started doing I/O. However, I found Windows Backup does not check the integrity of files before updating with an incremental (how?). I am currently having to restore all files by hand, and the process starts over if I encounter an invalid ZIP file.
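Since a single invalid ZIP restarts the manual restore, one option is to scan the whole backup set first and list the bad archives up front. Below is a minimal sketch, assuming the backup is a folder tree of ordinary `.zip` files; the function name `find_bad_zips` and the folder layout are made up for illustration and are not part of Windows Backup.

```python
import sys
import zipfile
import zlib
from pathlib import Path


def find_bad_zips(backup_dir):
    """Return (path, reason) for every .zip under backup_dir that fails a check."""
    bad = []
    for path in sorted(Path(backup_dir).rglob("*.zip")):
        try:
            with zipfile.ZipFile(path) as zf:
                # testzip() reads every member and returns the name of the
                # first one with a bad CRC or header, or None if all pass.
                first_bad = zf.testzip()
                if first_bad is not None:
                    bad.append((path, f"corrupt member: {first_bad}"))
        except (zipfile.BadZipFile, zlib.error, OSError) as exc:
            # The archive could not be opened or decompressed at all.
            bad.append((path, f"unreadable archive: {exc}"))
    return bad


if __name__ == "__main__":
    # Hypothetical usage: python check_zips.py <backup folder>
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    for path, reason in find_bad_zips(target):
        print(f"{path}: {reason}")
```

This only reports which archives are damaged so they can be skipped or pulled from an older backup set; it does not attempt any repair.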

So this leaves me with a couple questions for the community:

I am wary of assuming the GPU is the only issue, as the two drives were dropped from the array (though their SMART history shows no issues, it could have been an extreme case of a write hole, or the array may have been reinitializing during the crash).
I don't currently have a safe PCIe device to test if the lane is bad and not the GPU (or both are bad). I am afraid to put my other GPU in it until I can confirm the lane is good.
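For anyone wondering why a 4-drive RAID 5 survives one dropped drive but not two, here is a toy sketch of the parity arithmetic. It assumes a single stripe with three data blocks and one XOR parity block; real controllers rotate parity across drives and track member state in metadata, none of which is modeled here.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# One stripe on a 4-drive array: three data blocks plus one parity block.
data = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(data)
stripe = data + [parity]

# Lose any single drive: XOR of the three survivors rebuilds its block.
lost = 1
survivors = [blk for i, blk in enumerate(stripe) if i != lost]
rebuilt = xor_blocks(survivors)

# Lose two drives (0 and 1): the survivors only yield the XOR of the two
# missing blocks, which is not enough to recover either one on its own.
combined_missing = xor_blocks(stripe[2:])
```

This is only the arithmetic behind the "1 drive out of 4" limit; it says nothing about whether the data on the two dropped drives is actually damaged.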

But I am trying to think of any other things that could have happened. The mobo seems to be operating fine (though the 12V rail is registering low; maybe the PSU got tripped from not cooling off fully after the AC was restored? Or the benchmark drew more power than expected? I currently have a 750 W PSU, which should be enough, but I won't rule out a surge). This could also explain how the HDDs went out of sync with the RAID and dropped their membership.
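To put a number on "registering low": the ATX spec allows ±5% on the +12 V rail, i.e. roughly 11.4 V to 12.6 V. A quick sketch of that check follows; the helper name and the sample readings are hypothetical (not values from this machine), and software sensor readings can be off, so a multimeter on the rail is the better source.

```python
def rail_in_tolerance(measured_v, nominal_v=12.0, tolerance=0.05):
    """True if measured_v is within +/- tolerance of nominal_v (ATX +12 V: 5%)."""
    low = nominal_v * (1 - tolerance)
    high = nominal_v * (1 + tolerance)
    return low <= measured_v <= high


# Hypothetical readings, for illustration only.
for reading in (11.2, 11.8, 12.1):
    status = "OK" if rail_in_tolerance(reading) else "OUT OF SPEC"
    print(f"{reading:.2f} V: {status}")
```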


Any ideas?

-Jim

P.S: I am also looking for a new backup solution. I used to have Acronis, but it has issues with my RAMDisk drivers, and their hotfix doesn't work for me.
hen918
Wed Sept 03 2014, 06:22PM
hen918 Registered Member #11591 Joined: Wed Mar 20 2013, 08:20PM
Location: UK
Posts: 556
I had exactly the same issue with a GPU: playing Crysis too hard, I got a slight glitch and checked the GPU temp: 95 degrees C. Oops. I shut down the computer and let it cool, but it then failed to display POST and the GPU got hot very quickly. It must have been drawing a fair bit of power, so perhaps this might have caused power supply damage? I RMA'd the graphics card in the end.

Anyway, I can't see what caused the RAID failure. Possibly the under-voltage, but I don't think that's likely.

Good Luck!
Henry