Fixing PCIe AER Errors on Minisforum Venus Series (AMD Ryzen 9 7940HS)

Introduction

If you own a Minisforum Venus series mini PC with an AMD Ryzen 9 7940HS and run heavy workloads on the built-in Kingston OM8TAP41024K1-A00 NVMe SSD, you may encounter PCIe Advanced Error Reporting (AER) Correctable errors in Linux, especially when the system is set to 35 W performance mode.

While these errors don’t crash the system, they can cause data retransmissions, minor slowdowns, and leave you wondering if your SSD or motherboard is failing.

In this post, I’ll show:

  • The exact error messages I saw
  • The fix that eliminated the problem
  • Why it happens

  1. Problem Symptoms

During long, sustained workloads, such as scanning the AOSP source tree with ScanCode, my system log would show repeated correctable PCIe errors from the NVMe drive:

pcieport 0000:00:02.4: AER: Correctable error message received from 0000:03:00.0
nvme 0000:03:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
nvme 0000:03:00.0:   device [2646:502d] error status/mask=00000001/0000e000

System context:

  • Minisforum Venus series mini PC
  • AMD Ryzen 9 7940HS, 35 W mode
  • Built-in Kingston OM8TAP41024K1-A00, PCIe Gen4 x4, DRAM-less
  • Ubuntu 24.04

  1. Solution: Disable PCIe ASPM

The root trigger was ASPM (Active State Power Management), a PCIe feature that reduces power usage by changing link states. On my system, these power-state transitions were causing bit errors at Gen4 speeds.

  • Edit GRUB configuration
sudo vi /etc/default/grub

Find the line:

GRUB_CMDLINE_LINUX_DEFAULT="quiet splash"

Change it to:

GRUB_CMDLINE_LINUX_DEFAULT="quiet splash pcie_aspm=off"
  • Update GRUB and reboot
sudo update-grub
sudo reboot

  1. Why It Happens

Based on testing, the cause is likely a mix of:

  • Motherboard PCIe power management — Minisforum’s ASPM handling may not be fully stable at PCIe Gen4 x4.
  • DRAM-less SSD controller — Lower-cost controllers may be less tolerant to rapid low-power transitions.
  • 35 W mode power draw — Higher sustained power and heat may reduce PCIe signal margins.
  • Gen4 signal integrity — ASPM transitions stress link stability; disabling ASPM keeps the link at full power, avoiding glitches.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top