Hi all,
I've been building my own PCs for 20 years and think of myself as fairly cluey with tech stuff, but I'm completely stumped by this particular PC crashing issue. Hoping the brains trust can help! Sorry for the long post, but there's a lot of detail to convey.
I built a new PC in late 2020, for gaming, running Win10 and with an MSI RTX 3090 and an 850W PSU (rest of specs at bottom of post). The system IS NOT OVERCLOCKED.
Previous problem:
For a long time with this setup I used to have an issue with instantaneous resets (as if you'd hard-pressed the reset button) while playing graphically demanding video games. This used to happen once every few days (I tend to game every evening). At the time I put this down to power spikes in the GPU, although I never confirmed this and didn't really investigate further.
About a year back I upgraded to Windows 11. I don't actually recall whether I upgraded in place or installed clean.
Since then my system has developed a different type of issue. I no longer get the spontaneous resets, but I get a weird freezing problem, which manifests in 2 ways:
Current problem:
1. If I'm not gaming but using the Win11 PC for productivity (watching YouTube, usually), I get a partial crash/freeze of the OS. If I'm watching YouTube, for example, the audio cuts out but the video keeps going for another 10 secs (whatever was cached). All the apps in Windows still respond, but only partially: I can switch between windows and scroll within them, but can't open any new app. Task Manager also won't open. Trying to reboot via the Windows restart menu causes a freeze, so the only remedy is a hard reset via the button on the PC case.
2. If I'm gaming, graphically intense games insta-freeze (audio and graphics), although I can still alt-tab out of them, with the rest of the behaviour the same as described in point #1. If the game is not graphically intense, I sometimes get a situation where the audio cuts out but I can still interact with the graphical parts of the game. Trying to create a savegame never works though, even though the game often tells me the save was a success. Checking later, the savegame doesn't exist (which makes me wonder if my whole problem is related to storage?).
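One way to poke at the storage theory above is a small write-and-read-back check, run both when the system is healthy and when it's in the half-frozen state. This is only a rough sketch: the function name `write_and_verify` and the test file name are made up for illustration, and a successful read-back can be served from the OS cache, so a pass doesn't prove the data reached the SSD. If the script hangs on the `os.fsync` call during a freeze, though, that might point towards the storage path.

```python
import os
import tempfile


def write_and_verify(path: str, payload: bytes) -> bool:
    """Write payload to path, ask the OS to commit it, then read it back."""
    with open(path, "wb") as f:
        f.write(payload)
        f.flush()             # push Python's buffer to the OS
        os.fsync(f.fileno())  # ask the OS to commit the data to the device
    if not os.path.exists(path):
        return False
    with open(path, "rb") as f:
        return f.read() == payload


if __name__ == "__main__":
    # Hypothetical test file in the temp directory; any writable path works.
    test_path = os.path.join(tempfile.gettempdir(), "write_check.bin")
    ok = write_and_verify(test_path, os.urandom(1024 * 1024))
    print("write verified" if ok else "write FAILED")
    os.remove(test_path)
```

Pointing the path at each of the two SSDs in turn could help narrow down whether one drive behaves differently from the other.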
These crashes sometimes happen once in a few days, and sometimes twice in 10 minutes. I've not been able to detect any pattern to what causes these more or less frequently.
Games have been both DX11 and DX12, and maybe even Vulkan, not sure.
Things I've tried:
Monitoring PC specs and logging them via MSI Afterburner. Nothing seems to spike around crash time, and there are no issues with CPU/GPU temperatures.
Checked Event Viewer for any errors. Nothing springs out at me, but I suspect the crash is such that no log entries can be written from the time of the crash, which makes this not very useful.
Have run a number of diagnostic tools such as memtest and the OCCT stability testing tool, with many passes of the GPU, VRAM, MEM, CPU & Power tests, and no crashes occurring during those tests.
Have checked the status of my SSDs (Samsung) with the Samsung Magician software. Everything seems fine.
Have reinstalled the Nvidia drivers clean with DDU. This hasn't helped.
Have updated all chipset drivers from my motherboard manufacturer (Asus). This hasn't helped.
Have used USB dongles for Bluetooth and Wi-Fi while disabling the NICs/Wi-Fi adapter & Bluetooth adapter on the motherboard to eliminate those as a potential cause. Still got crashes anyway.
Have tried using console commands to restart explorer.exe and reset the Windows audio and graphics services. Attempting either of these causes a hard freeze of the system.
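For the monitoring step in the list above, one thing that might help is pinning down exactly when logging stopped, since a freeze or hard reset should show up as a hole in the sample timestamps even when nothing else gets written. Below is a minimal sketch of that idea. It assumes the timestamps have already been pulled out of the monitoring log into a list (it does not parse Afterburner's own log format), and the function name `find_logging_gaps` and the sample data are invented for illustration.

```python
from datetime import datetime, timedelta


def find_logging_gaps(timestamps, max_gap=timedelta(seconds=5)):
    """Return (last_sample_before_gap, first_sample_after_gap) pairs."""
    gaps = []
    for prev, cur in zip(timestamps, timestamps[1:]):
        if cur - prev > max_gap:
            gaps.append((prev, cur))
    return gaps


# Made-up sample data: one reading per second, then a ~2 minute hole.
start = datetime(2024, 1, 1, 20, 0, 0)
samples = [start + timedelta(seconds=s) for s in (0, 1, 2, 3, 120, 121)]

for before, after in find_logging_gaps(samples):
    print(f"logging stopped at {before}, resumed at {after}")
```

The last sample before each gap is the one worth lining up against the sensor readings and against whatever Event Viewer recorded around the same time.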
Things I haven't tried:
Reinstalling Windows clean.
Replacing any hardware components (except the NIC/bluetooth test described above).
As you can see, this is an annoyingly complicated problem, as there are no error logs, error messages or crash dumps to go on. I'm completely stumped.
The ask:
Would any wise PC experts be able to give me any advice on these crashes please!?
thank you!
kommz
Full specs:
Asus X570 ROG Crosshair VIII Formula
AMD Ryzen 9 5950X
NZXT Kraken x73 AIO CPU water cooler
MSI Gaming X Trio RTX 3090 graphics
Corsair Dominator Platinum 32GB 4x8GB 3600MHz CL16
Samsung EVO 980 1TB M.2 x2 SSDs
Lian Li O11 Dynamic XL Case
Corsair RM850x Gold - White - 850W
LG 38GN950 38" Ultrawide Screen
I often use a Bluetooth headset while gaming, but I've also had crashes while using the PC speakers.
The system is NOT OVERCLOCKED, I'm using memory timings that are recommended by the memory vendor, and there are no specific changes to BIOS settings except for enabling REBAR.
FINAL UPDATE ~1 month later
I seem to have managed to achieve good but not perfect stability. It seems upping the DRAM voltage was the solution. I ended up bumping this to 1.46V, up from 1.35V, even though 1.35V is what the XMP/DOCP memory profile suggests. Strangely this only became a problem on Win11 and not Win10. I still get about 1 crash per week, but this is significantly improved from 3 crashes per evening, so that's a huge win in my book.
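To put rough numbers on the update above, here is a quick back-of-the-envelope calculation. It takes the "3 crashes per evening" figure at face value and assumes gaming on all 7 evenings of a week, which is a worst-case reading, so treat the result as an approximate upper bound.

```python
# Voltages from the update: XMP/DOCP profile value vs. the manual setting.
xmp_v = 1.35
new_v = 1.46
voltage_increase_pct = (new_v - xmp_v) / xmp_v * 100

# Assumption: 3 crashes per evening, 7 gaming evenings per week.
crashes_before_per_week = 3 * 7
crashes_after_per_week = 1
reduction_pct = (1 - crashes_after_per_week / crashes_before_per_week) * 100

print(f"DRAM voltage increase: {voltage_increase_pct:.1f}%")
print(f"Crash rate reduction:  {reduction_pct:.1f}%")
```

So a bump of roughly 8% over the profile voltage lines up with a drop in crash frequency of around 95% under those assumptions.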
Thanks very much to @7ekn00 for his suggestions - lifesaver!
For posterity - other things I've tried (in addition to those described above in this post) that were suggested by posters but didn't seem to yield any results:
DISM & sfc /scannow commands
Disabling CPU C-States in BIOS
Disabling REBAR in BIOS
What if you reduce the memory timings? And do you have a higher-wattage PSU you can use to test with?
Also try booting into a live Linux distro like Ubuntu and running some benchmarks/diagnostics so you can rule out software or driver issues.