Bug 287389 - graphics/drm-61-kmod: GPU hang
Status: New
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s)
Version: Latest
Hardware: amd64 Any
Importance: --- Affects Only Me
Assignee: freebsd-x11 (Nobody)
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-06-08 21:24 UTC by Pero Oršolić
Modified: 2025-06-14 20:54 UTC

See Also:
bugzilla: maintainer-feedback? (x11)


Description Pero Oršolić 2025-06-08 21:24:10 UTC
FreeBSD 14.3-RC1, AMD Ryzen 4750G with integrated GPU, uptime of 5 days. I had been working in KiCAD for a few hours when both monitors (one HDMI, the other DP) went blank.
After 2-3 seconds they came back, but the picture was duplicated on both (the DP monitor's picture was duplicated onto the HDMI monitor).
The GUI is frozen, but I can ssh into the box and everything (except the GUI) works as usual.
For the last few days I have been experiencing GUI slowdowns: changing virtual desktops, alt-tabbing, and working in KiCAD were noticeably slower than after a fresh reboot.

% freebsd-version -ku
14.3-RC1
14.3-RC1

% pkg info -f drm\*kmod\*
drm-61-kmod-6.1.128.1403000_4
Name           : drm-61-kmod
Version        : 6.1.128.1403000_4
Installed on   : Sun Jun  1 18:44:40 2025 CEST
Origin         : graphics/drm-61-kmod
Architecture   : FreeBSD:14:amd64
Prefix         : /usr/local
Categories     : graphics kld
Licenses       : BSD2CLAUSE, MIT, GPLv2
Maintainer     : x11@FreeBSD.org
WWW            : https://github.com/freebsd/drm-kmod/
Comment        : DRM drivers modules
Annotations    :
        FreeBSD_version: 1403000
Flat size      : 17.2MiB
Description    :
amdgpu, i915, and radeon DRM drivers modules.
Currently corresponding to Linux 6.1 DRM.
This version is for FreeBSD 14-STABLE 1400508
and above.

% dmesg
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=38354975, emitted seq=38354977
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 325616 thread  pid 325616
drmn0: GPU reset begin!
[drm] psp gfx command UNLOAD_TA(0x2) failed and response status is (0x117)
drmn0: free PSP TMR buffer
drmn0: MODE2 reset
drmn0: GPU reset succeeded, trying to resume
[drm] PCIE GART of 1024M enabled.
[drm] PTB located at 0x000000F43FC00000
[drm] PSP is resuming...
[drm] reserve 0x400000 from 0xf43f800000 for PSP TMR
drmn0: RAS: optional ras ta ucode is not available
drmn0: RAP: optional rap ta ucode is not available
drmn0: SECUREDISPLAY: securedisplay ta ucode is not available
drmn0: SMU is resuming...
drmn0: SMU is resumed successfully!
[drm] DMUB hardware initialized: version=0x01010027
[drm] kiq ring mec 2 pipe 1 q 0
[drm] VCN decode and encode initialized successfully(under DPG Mode).
[drm] JPEG decode initialized successfully.
drmn0: ring gfx uses VM inv eng 0 on hub 0
drmn0: ring comp_1.0.0 uses VM inv eng 1 on hub 0
drmn0: ring comp_1.1.0 uses VM inv eng 4 on hub 0
drmn0: ring comp_1.2.0 uses VM inv eng 5 on hub 0
drmn0: ring comp_1.3.0 uses VM inv eng 6 on hub 0
drmn0: ring comp_1.0.1 uses VM inv eng 7 on hub 0
drmn0: ring comp_1.1.1 uses VM inv eng 8 on hub 0
drmn0: ring comp_1.2.1 uses VM inv eng 9 on hub 0
drmn0: ring comp_1.3.1 uses VM inv eng 10 on hub 0
drmn0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
drmn0: ring sdma0 uses VM inv eng 0 on hub 1
drmn0: ring vcn_dec uses VM inv eng 1 on hub 1
drmn0: ring vcn_enc0 uses VM inv eng 4 on hub 1
drmn0: ring vcn_enc1 uses VM inv eng 5 on hub 1
drmn0: ring jpeg_dec uses VM inv eng 6 on hub 1
drmn0: recover vram bo from shadow start
drmn0: recover vram bo from shadow done
drmn0: GPU reset(2) succeeded!
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=38354980, emitted seq=38354980
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 325616 thread  pid 325616
drmn0: GPU reset begin!
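
For comparing the hangs, the `signaled` and `emitted` sequence numbers in the `amdgpu_job_timedout` lines can be pulled out of saved dmesg text. The following is only a minimal sketch: the function name `parse_ring_timeouts`, the regular expression, and the `gap` field are illustrative assumptions, not anything provided by drm-kmod. The sample lines are copied from the log above.

```python
import re

# Matches lines such as:
# [drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=38354975, emitted seq=38354977
TIMEOUT_RE = re.compile(
    r"amdgpu_job_timedout\] ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeouts(text):
    """Return one dict per ring-timeout line found in the given dmesg text.

    "gap" is emitted minus signaled, i.e. how far the emitted counter
    is ahead of the signaled counter at the time of the timeout.
    """
    events = []
    for line in text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match is None:
            continue
        signaled = int(match.group("signaled"))
        emitted = int(match.group("emitted"))
        events.append(
            {
                "ring": match.group("ring"),
                "signaled": signaled,
                "emitted": emitted,
                "gap": emitted - signaled,
            }
        )
    return events


# Lines copied verbatim from the dmesg output in this report.
sample = """\
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=38354975, emitted seq=38354977
drmn0: GPU reset begin!
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=38354980, emitted seq=38354980
"""

for event in parse_ring_timeouts(sample):
    print(event["ring"], event["signaled"], event["emitted"], event["gap"])
```

On this log the first timeout has the emitted counter two ahead of the signaled one, while the timeout reported after the reset has both counters equal.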
Comment 1 Pero Oršolić 2025-06-09 19:27:39 UTC
These logs are from the 1st of June. The hang happened while viewing pictures with feh on FreeBSD 14.3-BETA4.

The HDMI monitor showed the picture from the DP monitor.
The image on both monitors was frozen.
Killing Xorg didn't change the image on the monitors.

[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=824673, emitted seq=824675
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 111579 thread  pid 111579
drmn0: GPU reset begin!
[drm] psp gfx command UNLOAD_TA(0x2) failed and response status is (0x117)
drmn0: free PSP TMR buffer
drmn0: MODE2 reset
drmn0: GPU reset succeeded, trying to resume
[drm] PCIE GART of 1024M enabled.
[drm] PTB located at 0x000000F43FC00000
[drm] PSP is resuming...
[drm] reserve 0x400000 from 0xf43f800000 for PSP TMR
drmn0: RAS: optional ras ta ucode is not available
drmn0: RAP: optional rap ta ucode is not available
drmn0: SECUREDISPLAY: securedisplay ta ucode is not available
drmn0: SMU is resuming...
drmn0: SMU is resumed successfully!
[drm] DMUB hardware initialized: version=0x01010027
[drm] kiq ring mec 2 pipe 1 q 0
[drm] VCN decode and encode initialized successfully(under DPG Mode).
[drm] JPEG decode initialized successfully.
drmn0: ring gfx uses VM inv eng 0 on hub 0
drmn0: ring comp_1.0.0 uses VM inv eng 1 on hub 0
drmn0: ring comp_1.1.0 uses VM inv eng 4 on hub 0
drmn0: ring comp_1.2.0 uses VM inv eng 5 on hub 0
drmn0: ring comp_1.3.0 uses VM inv eng 6 on hub 0
drmn0: ring comp_1.0.1 uses VM inv eng 7 on hub 0
drmn0: ring comp_1.1.1 uses VM inv eng 8 on hub 0
drmn0: ring comp_1.2.1 uses VM inv eng 9 on hub 0
drmn0: ring comp_1.3.1 uses VM inv eng 10 on hub 0
drmn0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
drmn0: ring sdma0 uses VM inv eng 0 on hub 1
drmn0: ring vcn_dec uses VM inv eng 1 on hub 1
drmn0: ring vcn_enc0 uses VM inv eng 4 on hub 1
drmn0: ring vcn_enc1 uses VM inv eng 5 on hub 1
drmn0: ring jpeg_dec uses VM inv eng 6 on hub 1
drmn0: recover vram bo from shadow start
drmn0: recover vram bo from shadow done
drmn0: GPU reset(2) succeeded!
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=824676, emitted seq=824676
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 111579 thread  pid 111579
drmn0: GPU reset begin!
Comment 2 Pero Oršolić 2025-06-14 20:54:34 UTC
It happened again. This time I was trying to play a video file with mpv.
Again both monitors lost the picture for 1-2 seconds; after that the DP monitor was all black and the HDMI monitor showed the picture that should be on the DP monitor.
Trying to kill Xorg returns success, but the process is not killed.
% ps aux | grep Xorg
root       40477    0.0  0.0       0     16  -  DE   Thu16      32:31.68 /usr/local/libexec/Xorg -nolisten tcp vt09 -auth /var/run/slim.auth
Uptime is 4 days, but the GUI slowdowns begin a few hours after a reboot. The system was sluggish before the freeze.
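
The `DE` in the STAT column of the ps output above can be decoded with the state characters documented in ps(1). This is a rough sketch: the table is deliberately a subset of the documented characters, the helper name `decode_ps_stat` is made up for illustration, and it assumes the default `ps aux` column order (USER PID %CPU %MEM VSZ RSS TT STAT STARTED TIME COMMAND).

```python
# Subset of the state characters described in FreeBSD ps(1); not exhaustive.
STATE_MEANINGS = {
    "D": "disk (or other short term, uninterruptible) wait",
    "I": "idle (sleeping for longer than about 20 seconds)",
    "R": "runnable",
    "S": "sleeping for less than about 20 seconds",
    "T": "stopped",
    "Z": "dead process (zombie)",
    "E": "trying to exit",
    "s": "session leader",
}


def decode_ps_stat(ps_line):
    """Return the STAT field of a `ps aux` line and a description per character.

    Assumes the default `ps aux` layout, where STAT is the eighth column.
    """
    fields = ps_line.split()
    stat = fields[7]
    meanings = [STATE_MEANINGS.get(ch, "not in this table") for ch in stat]
    return stat, meanings


# Line copied verbatim from the ps output in this comment.
xorg_line = (
    "root       40477    0.0  0.0       0     16  -  DE   Thu16      "
    "32:31.68 /usr/local/libexec/Xorg -nolisten tcp vt09 -auth /var/run/slim.auth"
)

stat, meanings = decode_ps_stat(xorg_line)
print(stat)
for meaning in meanings:
    print(" -", meaning)
```

Read this way, Xorg is in an uninterruptible wait while trying to exit, which would fit the kill being accepted although the process never goes away.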

Dmesg:
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=24277354, emitted seq=24277356
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 666259 thread  pid 666259
drmn0: GPU reset begin!
[drm] psp gfx command UNLOAD_TA(0x2) failed and response status is (0x117)
drmn0: free PSP TMR buffer
drmn0: MODE2 reset
drmn0: GPU reset succeeded, trying to resume
[drm] PCIE GART of 1024M enabled.
[drm] PTB located at 0x000000F43FC00000
[drm] PSP is resuming...
[drm] reserve 0x400000 from 0xf43f800000 for PSP TMR
drmn0: RAS: optional ras ta ucode is not available
drmn0: RAP: optional rap ta ucode is not available
drmn0: SECUREDISPLAY: securedisplay ta ucode is not available
drmn0: SMU is resuming...
drmn0: SMU is resumed successfully!
[drm] DMUB hardware initialized: version=0x01010027
[drm] kiq ring mec 2 pipe 1 q 0
[drm] VCN decode and encode initialized successfully(under DPG Mode).
[drm] JPEG decode initialized successfully.
drmn0: ring gfx uses VM inv eng 0 on hub 0
drmn0: ring comp_1.0.0 uses VM inv eng 1 on hub 0
drmn0: ring comp_1.1.0 uses VM inv eng 4 on hub 0
drmn0: ring comp_1.2.0 uses VM inv eng 5 on hub 0
drmn0: ring comp_1.3.0 uses VM inv eng 6 on hub 0
drmn0: ring comp_1.0.1 uses VM inv eng 7 on hub 0
drmn0: ring comp_1.1.1 uses VM inv eng 8 on hub 0
drmn0: ring comp_1.2.1 uses VM inv eng 9 on hub 0
drmn0: ring comp_1.3.1 uses VM inv eng 10 on hub 0
drmn0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
drmn0: ring sdma0 uses VM inv eng 0 on hub 1
drmn0: ring vcn_dec uses VM inv eng 1 on hub 1
drmn0: ring vcn_enc0 uses VM inv eng 4 on hub 1
drmn0: ring vcn_enc1 uses VM inv eng 5 on hub 1
drmn0: ring jpeg_dec uses VM inv eng 6 on hub 1
drmn0: [drm] *ERROR* drmn0: [drm] *ERROR* drmn0: ib ring test failed (-60).
drmn0: free PSP TMR buffer
drmn0: MODE2 reset
drmn0: GPU reset succeeded, trying to resume
[drm] PCIE GART of 1024M enabled.
[drm] PTB located at 0x000000F43FC00000
[drm] PSP is resuming...
[drm] reserve 0x400000 from 0xf43f800000 for PSP TMR
drmn0: RAS: optional ras ta ucode is not available
drmn0: RAP: optional rap ta ucode is not available
drmn0: SECUREDISPLAY: securedisplay ta ucode is not available
drmn0: SMU is resuming...
drmn0: SMU is resumed successfully!
[drm] DMUB hardware initialized: version=0x01010027
[drm] kiq ring mec 2 pipe 1 q 0
[drm] VCN decode and encode initialized successfully(under DPG Mode).
[drm] JPEG decode initialized successfully.
drmn0: ring gfx uses VM inv eng 0 on hub 0
drmn0: ring comp_1.0.0 uses VM inv eng 1 on hub 0
drmn0: ring comp_1.1.0 uses VM inv eng 4 on hub 0
drmn0: ring comp_1.2.0 uses VM inv eng 5 on hub 0
drmn0: ring comp_1.3.0 uses VM inv eng 6 on hub 0
drmn0: ring comp_1.0.1 uses VM inv eng 7 on hub 0
drmn0: ring comp_1.1.1 uses VM inv eng 8 on hub 0
drmn0: ring comp_1.2.1 uses VM inv eng 9 on hub 0
drmn0: ring comp_1.3.1 uses VM inv eng 10 on hub 0
drmn0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
drmn0: ring sdma0 uses VM inv eng 0 on hub 1
drmn0: ring vcn_dec uses VM inv eng 1 on hub 1
drmn0: ring vcn_enc0 uses VM inv eng 4 on hub 1
drmn0: ring vcn_enc1 uses VM inv eng 5 on hub 1
drmn0: ring jpeg_dec uses VM inv eng 6 on hub 1
drmn0: recover vram bo from shadow start
drmn0: recover vram bo from shadow done
drmn0: GPU reset(2) succeeded!
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=24277358, emitted seq=24277358
[drm ERROR :amdgpu_job_timedout] Process information: process  pid 666259 thread  pid 666259
drmn0: GPU reset begin!
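
To compare the three episodes, the reset-related messages in each dmesg excerpt can be tallied. The sketch below is an assumption-laden illustration: the marker names, the `summarize_resets` helper, and the idea of comparing "reset begin" against "reset succeeded" counts are mine; the substrings themselves are taken from the logs in this report. The sample is an abbreviated copy of the log in this comment, containing only lines that appear in it.

```python
from collections import Counter

# Substrings taken from the dmesg output in this report.
MARKERS = {
    "timeout": "amdgpu_job_timedout] ring",
    "reset_begin": "GPU reset begin!",
    "mode2_reset": "MODE2 reset",
    "ib_test_failed": "ib ring test failed",
    "reset_succeeded": "GPU reset(2) succeeded!",
}


def summarize_resets(text):
    """Count how often each marker substring occurs in the dmesg text."""
    counts = Counter()
    for line in text.splitlines():
        for name, needle in MARKERS.items():
            if needle in line:
                counts[name] += 1
    return counts


# Abbreviated from the dmesg output in this comment.
sample = """\
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=24277354, emitted seq=24277356
drmn0: GPU reset begin!
drmn0: MODE2 reset
drmn0: [drm] *ERROR* drmn0: [drm] *ERROR* drmn0: ib ring test failed (-60).
drmn0: MODE2 reset
drmn0: GPU reset succeeded!
[drm ERROR :amdgpu_job_timedout] ring gfx timeout, signaled seq=24277358, emitted seq=24277358
drmn0: GPU reset begin!
"""
# The "succeeded" line in the real log reads "GPU reset(2) succeeded!";
# restore it here so the sample matches the report exactly.
sample = sample.replace("GPU reset succeeded!", "GPU reset(2) succeeded!")

counts = summarize_resets(sample)
for name in MARKERS:
    print(name, counts[name])

# Resets that were started without a logged completion.
print("unfinished resets:", counts["reset_begin"] - counts["reset_succeeded"])
```

For this excerpt the tally shows two MODE2 resets inside one reset attempt, one failed ib ring test, and a final reset that begins but has no completion logged.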