Complete guide for testing, diagnosing, and replacing failing RAM in server/workstation systems.
This guide was created to help diagnose and fix memory failure issues on systems experiencing:
- Memory failure errors in system logs
- Random system crashes
- Data corruption
- Performance degradation due to memory errors
System administrators, developers, and power users who need to:
- Test RAM for hardware failures
- Identify failing memory modules
- Replace defective RAM
- Verify system stability after repairs
-
RAM_TESTING_GUIDE.md - Complete comprehensive guide
- Understanding memory failure issues
- Using Memtest86+
- Interpreting test results
- Identifying bad DIMMs
- Buying replacement RAM
- Installation and verification
-
RAM_TESTING_QUICKREF.txt - Quick reference card
- System specifications
- Step-by-step flow
- Slot layout diagram
- Installation checklist
- Critical commands
-
RAM_TESTING_LOG.txt - Testing log template
- Test result sheets
- DIMM test results table
- Verification checklist
sudo rebootThen select "Memory test (memtest86+x64.bin)" from the GRUB boot menu.
Wait for 4+ complete passes (2-4 hours) and take photos of any errors.
- No errors? RAM passed basic test, but may still need individual DIMM testing
- Errors found? Proceed to individual DIMM testing (see Section 5 of main guide)
See RAM_TESTING_GUIDE.md for complete instructions.
This guide was created for:
- CPU: Intel Xeon E5-2680 v4
- RAM: 96GB (6x 16GB) DDR4 RDIMM ECC Registered
- Issue: Memory hardware failures detected
However, the testing methodology applies to most DDR4 systems with Memtest86+ support.
β οΈ Always power off completely before handling RAMβ οΈ Use anti-static precautions (touch metal case)β οΈ Handle DIMMs by edges, avoid gold contactsβ οΈ Don't force RAM - if it doesn't fit, check alignment
This guide is provided as-is for educational purposes.
Feel free to adapt this guide for your specific hardware configuration.
Last Updated: 2026-02-23 Status: Memory failure detected - replacement required