Bug 264793 - graphics/drm-510-kmod: Panic: Resetting chip for stopped heartbeat on rcs0
Summary: graphics/drm-510-kmod: Panic: Resetting chip for stopped heartbeat on rcs0
Status: Closed Overcome By Events
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s) (show other bugs)
Version: Latest
Hardware: Any Any
: --- Affects Only Me
Assignee: freebsd-x11 (Nobody)
URL:
Keywords: crash, needs-qa
Depends on:
Blocks:
 
Reported: 2022-06-20 18:48 UTC by Cy Schubert
Modified: 2023-10-01 16:37 UTC (History)
2 users (show)

See Also:
koobs: maintainer-feedback? (x11)
koobs: merge-quarterly?


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Cy Schubert freebsd_committer freebsd_triage 2022-06-20 18:48:38 UTC
Panic in drm-510-kmod due to GPU hang:

Unread portion of the kernel message buffer:
drmn0: [drm] GPU HANG: ecode 6:1:97eefffd, in MainThread [100854]
drmn0: [drm] Resetting chip for stopped heartbeat on rcs0
drmn0: [drm] Xorg[100854] context reset due to GPU hang
drmn0: [drm] GPU HANG: ecode 6:1:b7eefffd, in MainThread [100854]


Fatal trap 12: page fault while in kernel mode
cpuid = 1; apic id = 01
fault virtual address   = 0x61
fault code              = supervisor read data, page not present
instruction pointer     = 0x20:0xffffffff809867e7
stack pointer           = 0x28:0xfffffe0082fd8b60
frame pointer           = 0x28:0xfffffe0082fd8ba0
code segment            = base rx0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 0 (linuxkpi_short_wq_4)
trap number             = 12
panic: page fault
cpuid = 1
time = 1655749895
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe0082fd8920
vpanic() at vpanic+0x151/frame 0xfffffe0082fd8970
panic() at panic+0x43/frame 0xfffffe0082fd89d0
trap_fatal() at trap_fatal+0x387/frame 0xfffffe0082fd8a30
trap_pfault() at trap_pfault+0x4f/frame 0xfffffe0082fd8a90
calltrap() at calltrap+0x8/frame 0xfffffe0082fd8a90
--- trap 0xc, rip = 0xffffffff809867e7, rsp = 0xfffffe0082fd8b60, rbp = 0xfffffe0082fd8ba0 ---
_kmem_unback() at _kmem_unback+0x77/frame 0xfffffe0082fd8ba0
kmem_free() at kmem_free+0x2d/frame 0xfffffe0082fd8bc0
__i915_gpu_coredump_free() at __i915_gpu_coredump_free+0x10d/frame 0xfffffe0082fd8c10
intel_gt_handle_error() at intel_gt_handle_error+0xa9/frame 0xfffffe0082fd8d90
heartbeat() at heartbeat+0x2c8/frame 0xfffffe0082fd8de0
linux_work_fn() at linux_work_fn+0xe3/frame 0xfffffe0082fd8e40
taskqueue_run_locked() at taskqueue_run_locked+0x191/frame 0xfffffe0082fd8ec0
taskqueue_thread_loop() at taskqueue_thread_loop+0xc3/frame 0xfffffe0082fd8ef0
fork_exit() at fork_exit+0x7e/frame 0xfffffe0082fd8f30
fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe0082fd8f30
--- trap 0xcb659aff, rip = 0x1d64681dd1940e88, rsp = 0x30b7b3bb66d96013, rbp = 0x56cb7c65cb709ad4 ---
Uptime: 3h1m53s
Dumping 1920 out of 7970 MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91%

__curthread () at /opt/src/git-src/sys/amd64/include/pcpu_aux.h:59
59              __asm("movq %%gs:%P1,%0" : "=r" (td) : "n" (offsetof(struct pcpu,
(kgdb) #0  __curthread () at /opt/src/git-src/sys/amd64/include/pcpu_aux.h:59
#1  dump_savectx () at /opt/src/git-src/sys/kern/kern_shutdown.c:401
#2  0xffffffff806bdb58 in dumpsys (di=0x0)
    at /opt/src/git-src/sys/x86/include/dump.h:87
#3  doadump (textdump=1) at /opt/src/git-src/sys/kern/kern_shutdown.c:430
#4  kern_reboot (howto=260) at /opt/src/git-src/sys/kern/kern_shutdown.c:537
#5  0xffffffff806be03e in vpanic (fmt=<optimized out>,
    ap=ap@entry=0xfffffe0082fd89b0)
    at /opt/src/git-src/sys/kern/kern_shutdown.c:975
#6  0xffffffff806bde73 in panic (fmt=<unavailable>)
    at /opt/src/git-src/sys/kern/kern_shutdown.c:899
#7  0xffffffff80a301a7 in trap_fatal (frame=0xfffffe0082fd8aa0, eva=97)
    at /opt/src/git-src/sys/amd64/amd64/trap.c:942
#8  0xffffffff80    usermode=false, signo=<optimized out>, ucode=<optimized out>)
    at /opt/src/git-src/sys/amd64/amd64/trap.c:761
#9  <signal handler called>
#10 vm_page_domain (m=0x0) at /opt/src/git-src/sys/vm/vm_page.h:1009
#11 _kmem_unback (object=0xffffffff80f74db0 <kernel_object_store>,
    addr=addr@entry=18446735283280392192, size=size@entry=4096)
    at /opt/src/git-src/sys/vm/vm_kern.c:584
#12 0xffffffff809868dd in kmem_free (addr=18446735283280392192, size=4096)
    at /opt/src/git-src/sys/vm/vm_kern.c:620
#13 0xffffffff8249982d in __i915_gpu_coredump_free ()
   from /boot/modules/i915kms.ko
#14 0xffffffff8246ad99 in intel_gt_handle_error ()
   from /boot/modules/i915kms.ko
#15 0xffffffff824561e8 in heartbeat () from /boot/modules/i915kms.ko
#16 0xffffffff808cfff3 in linux_work_fn (context=0x0, pending=<optimized out>)
    at /opt/src/git-src/sys/compat/linuxkpi/common/src/linux_work.c:299
#17 0xffffffff80723141 in taskqueue_run_locked (
    queue=queue@entry=0xfffff8000361a900)
    at /opt/src/git-src/sys/kern/subr_taskqueue.c:514
#18 0xffffffff80724403 in taskqueue_thread_loop (
    arg=arg@entry=0xfffff800030d1c40)
    at /opt/src/git-src/sys/kern/subr_taskqueue.c:826
#19 0xffffffff806780fe in fork_exit (
    callout=0xffffffff80724340 <taskqueue_thread_loop>,
    arg=0xfffff800030d1c40, frame=0xfffffe0082fd8f40)
    at /opt/src/git-src/sys/kern/kern_fork.c:1112
#20 <signal handler called>
#21 0x1d64681dd1940e88 in ?? ()
Backtrace stopped: Cannot access memory at address 0x30b7b3bb66d96013
a301ff in trap_pfault (frame=0xfffffe0082fd8aa0,

There were artifacts in mate-terminal and exmh2 prior to the GPU hang and panic.

Kernel core is available on request.
Comment 1 Cy Schubert freebsd_committer freebsd_triage 2022-06-20 19:21:18 UTC
slippy 14.0-CURRENT FreeBSD 14.0-CURRENT #20 komquats-n256182-ad0a7ea65084-dirty: Mon Jun 20 07:45:19 PDT 2022     root@cwsys:/export/obj/opt/src/git-src/amd64.amd64/sys/BREAK amd64

Local mods include:

- GOEM foot shooting patch
- preliminary bge(4) WOL support
- ipfilter jail patch
- heimdal security patch
- wpa_supplicant debug code

None of these affect graphical support.
Comment 2 Cy Schubert freebsd_committer freebsd_triage 2023-10-01 16:37:20 UTC
Using drm-515-kmod now. It does not produce this error, though the overall problem still exists.