Bug 267782 - AMD Ryzen 5 hangs on 13.2, fine on 12.4
Status: Closed Not A Bug
Alias: None
Product: Base System
Classification: Unclassified
Component: kern
Version: 13.2-RELEASE
Hardware: amd64 Any
Importance: --- Affects Only Me
Assignee: freebsd-bugs (Nobody)
 
Reported: 2022-11-15 12:53 UTC by Philip Homburg
Modified: 2023-04-21 17:17 UTC

Description Philip Homburg 2022-11-15 12:53:16 UTC
I'm trying to upgrade an AMD Ryzen 5 2600X system with a Gigabyte AB350-Gaming mainboard from 12.3 to 13.1.

The system was completely stable on 12.3. On 13.1 during a poudriere run it hangs. I captured the following message:

MCA: Bank 3, Status 0x9820000000000150
MCA: Global Cap 0x0000000000000117, Status 0x0000000000000000
MCA: Vendor "AuthenticAMD", ID 0x800f82, APIC ID 5
MCA: CPU 5 COR EN ICACHE L0 IRD error
MCA: Misc 0xd01b0fff00000000

I ran memtest, which found nothing. I installed microcode_update, but it doesn't seem to load anything.

I haven't upgraded the BIOS yet. It is currently listed as:
American Megatrends Inc.
        Version: F21
        BIOS Revision: 5.13
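
The status word in the MCA lines above can be decoded by hand. The following is a minimal sketch in Python, assuming the common architectural layout of a machine-check bank status register (flag bits 63 to 57, MCA error code in the low 16 bits) and the memory-hierarchy error code pattern `000F 0001 RRRR TTLL`. The function and table names are hypothetical and exist only for this illustration; vendor-specific bits are deliberately not decoded.

```python
# Flag bits of a machine-check bank status register (bit position -> name).
STATUS_FLAGS = {63: "VAL", 62: "OVER", 61: "UC", 60: "EN",
                59: "MISCV", 58: "ADDRV", 57: "PCC"}

# Fields of a memory-hierarchy error code: 000F 0001 RRRR TTLL.
REQUESTS = ["ERR", "RD", "WR", "DRD", "DWR", "IRD", "PREFETCH", "EVICT", "SNOOP"]
TYPES = ["I", "D", "G"]          # instruction, data, generic
LEVELS = ["L0", "L1", "L2", "LG"]


def decode_mca_status(status):
    """Decode the architectural parts of a 64-bit MCA bank status value.

    Hypothetical helper for illustration; not taken from any kernel source.
    """
    flags = [name for bit, name in STATUS_FLAGS.items() if (status >> bit) & 1]
    code = status & 0xFFFF  # low 16 bits: MCA error code
    result = {
        "flags": flags,
        "corrected": "UC" not in flags,  # no UC flag means a corrected error
        "error_code": code,
        "summary": None,                 # filled in only for cache errors
    }
    # Bit 12 (F) is masked out before matching the memory-hierarchy pattern.
    if (code & 0xEF00) == 0x0100:
        req = (code >> 4) & 0xF
        tt = (code >> 2) & 0x3
        ll = code & 0x3
        req_name = REQUESTS[req] if req < len(REQUESTS) else "?"
        tt_name = TYPES[tt] if tt < len(TYPES) else "?"
        result["summary"] = f"{tt_name}CACHE {LEVELS[ll]} {req_name} error"
    return result
```

Calling `decode_mca_status(0x9820000000000150)` gives the flags VAL, EN and MISCV with no UC flag, and the summary "ICACHE L0 IRD error". That agrees with the kernel's own line "COR EN ICACHE L0 IRD error": a corrected instruction-cache read error reported on CPU 5.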
Comment 1 kronenpj 2023-01-09 01:02:00 UTC
I believe I have a similar problem with OpnSense 22.7 (13.1) under LibVirt/QEMU on a Gigabyte Aorus Motherboard (Gigabyte Technology Co., Ltd. X570 AORUS MASTER/X570 AORUS MASTER, BIOS F30 09/07/2020)

FreeBSD 13.0-RELEASE works fine, as does OpnSense 22.1 (13.0 also). FreeBSD 13.1-RELEASE crashes in weird ways but OpnSense freezes during kernel init. I haven't been able to capture a screenshot - it flies by quickly.

The host is a Fedora 36 system.

CPU Info:
Architecture:            x86_64
  CPU op-mode(s):        32-bit, 64-bit
  Address sizes:         43 bits physical, 48 bits virtual
  Byte Order:            Little Endian
CPU(s):                  24
  On-line CPU(s) list:   0-23
Vendor ID:               AuthenticAMD
  Model name:            AMD Ryzen 9 3900XT 12-Core Processor
    CPU family:          23
    Model:               113
    Thread(s) per core:  2
    Core(s) per socket:  12
    Socket(s):           1
    Stepping:            0
    Frequency boost:     enabled
    CPU(s) scaling MHz:  46%
    CPU max MHz:         4775.9761
    CPU min MHz:         2200.0000
    BogoMIPS:            7599.92
    Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush
                         mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc
                         rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq
                         monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand
                         lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch
                         osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc
                         mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibpb stibp vmmcall fsgsbase bmi1
                         avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt
                         xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero
                         irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale
                         vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic
                         v_vmsave_vmload vgif v_spec_ctrl umip rdpid overflow_recov succor smca sev
                         sev_es
Virtualization features: 
  Virtualization:        AMD-V
Caches (sum of all):     
  L1d:                   384 KiB (12 instances)
  L1i:                   384 KiB (12 instances)
  L2:                    6 MiB (12 instances)
  L3:                    64 MiB (4 instances)
NUMA:                    
  NUMA node(s):          1
  NUMA node0 CPU(s):     0-23
Vulnerabilities:         
  Itlb multihit:         Not affected
  L1tf:                  Not affected
  Mds:                   Not affected
  Meltdown:              Not affected
  Mmio stale data:       Not affected
  Retbleed:              Mitigation; untrained return thunk; SMT enabled with STIBP protection
  Spec store bypass:     Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:            Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:            Mitigation; Retpolines, IBPB conditional, STIBP always-on, RSB filling, PBRSB-eIBRS Not affected
  Srbds:                 Not affected
  Tsx async abort:       Not affected
Comment 2 Philip Homburg 2023-01-26 19:44:26 UTC
I just tried 13.0 on the same system and it hangs when I run poudriere. Just a complete hang, nothing on the console.

12.4 runs fine.
Comment 3 Philip Homburg 2023-04-13 15:04:43 UTC
Also fails on 13.2
Comment 4 Philip Homburg 2023-04-21 09:46:02 UTC
A BIOS update seems to have solved the problem. I upgraded to BIOS version F31, which upgraded the AMI BIOS from 5.13 to 5.14. The system has now been compiling for about 24 hours without hanging.
Comment 5 Graham Perrin freebsd_committer freebsd_triage 2023-04-21 17:17:28 UTC
(In reply to Philip Homburg from comment #4)

Thanks for the follow-up.