Bug 27059

Summary: [sym] SCSI subsystem hangs under heavy load on (Serverworks) SMP
Product: Base System Reporter: lele <lele>
Component: kernAssignee: freebsd-scsi (Nobody) <scsi>
Status: Closed FIXED    
Severity: Affects Only Me    
Priority: Normal    
Version: 4.3-RELEASE   
Hardware: Any   
OS: Any   

Description lele 2001-05-03 14:40:01 UTC
When the system is under a heavy disk load the SCSI subsystem becomes completely irresponsive. The network stays up and processes continue to run as long the do not require access to the scsi disks.
Nothing is logged or displayed on the console.
Uniprocessor kernel with the same configuration run stable for weeks.
4.2-R had the same problem.



The mainboard is ASUS CUR-DLS


May  3 01:48:19 deimos /kernel: Copyright (c) 1992-2001 The FreeBSD Project.
May  3 01:48:19 deimos /kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
May  3 01:48:19 deimos /kernel: The Regents of the University of California. All rights reserved.
May  3 01:48:19 deimos /kernel: FreeBSD 4.3-RELEASE #0: Thu May  3 01:44:05 CEST 2001
May  3 01:48:19 deimos /kernel: root@deimos.intercom.it:/usr/src/sys/compile/IBNETWORK-SMP
May  3 01:48:19 deimos /kernel: Timecounter "i8254"  frequency 1193182 Hz
May  3 01:48:19 deimos /kernel: CPU: Pentium III/Pentium III Xeon/Celeron (800.03-MHz 686-class CPU)
May  3 01:48:19 deimos /kernel: Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
May  3 01:48:19 deimos /kernel: Features=0x387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,PN,MMX,FXSR,SSE>
May  3 01:48:19 deimos /kernel: real memory  = 536850432 (524268K bytes)
May  3 01:48:19 deimos /kernel: avail memory = 518393856 (506244K bytes)
May  3 01:48:19 deimos /kernel: Programming 16 pins in IOAPIC #0
May  3 01:48:19 deimos /kernel: IOAPIC #0 intpin 2 -> irq 0
May  3 01:48:19 deimos /kernel: Programming 16 pins in IOAPIC #1
May  3 01:48:19 deimos /kernel: FreeBSD/SMP: Multiprocessor motherboard
May  3 01:48:19 deimos /kernel: cpu0 (BSP): apic id:  3, version: 0x00040011, at 0xfee00000
May  3 01:48:19 deimos /kernel: cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee00000
May  3 01:48:19 deimos /kernel: io0 (APIC): apic id:  4, version: 0x000f0011, at 0xfec00000
May  3 01:48:19 deimos /kernel: io1 (APIC): apic id:  5, version: 0x000f0011, at 0xfec01000
May  3 01:48:19 deimos /kernel: Preloaded elf kernel "kernel" at 0xc036e000.
May  3 01:48:19 deimos /kernel: Preloaded userconfig_script "/boot/kernel.conf" at 0xc036e09c.
May  3 01:48:19 deimos /kernel: Pentium Pro MTRR support enabled
May  3 01:48:19 deimos /kernel: md0: Malloc disk
May  3 01:48:19 deimos /kernel: npx0: <math processor> on motherboard
May  3 01:48:19 deimos /kernel: npx0: INT 16 interface
May  3 01:48:19 deimos /kernel: pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
May  3 01:48:19 deimos /kernel: IOAPIC #1 intpin 4 -> irq 2
May  3 01:48:19 deimos /kernel: IOAPIC #1 intpin 14 -> irq 5
May  3 01:48:19 deimos /kernel: pci0: <PCI bus> on pcib0
May  3 01:48:19 deimos /kernel: fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd800-0xd83f mem 0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 2 at device 2.0 on pci0
May  3 01:48:19 deimos /kernel: fxp0: Ethernet address 00:e0:18:04:96:d8
May  3 01:48:19 deimos /kernel: pci0: <ATI Mach64-GR graphics accelerator> at 7.0
May  3 01:48:19 deimos /kernel: isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
May  3 01:48:19 deimos /kernel: isa0: <ISA bus> on isab0
May  3 01:48:19 deimos /kernel: atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xd000-0xd00f at device 15.1 on pci0
May  3 01:48:19 deimos /kernel: ata0: at 0x1f0 irq 14 on atapci0
May  3 01:48:19 deimos /kernel: ata1: at 0x170 irq 15 on atapci0
May  3 01:48:19 deimos /kernel: ohci0: <OHCI (generic) USB controller> mem 0xfb000000-0xfb000fff irq 5 at device 15.2 on pci0
May  3 01:48:19 deimos /kernel: usb0: OHCI version 1.0, legacy support
May  3 01:48:19 deimos /kernel: usb0: <OHCI (generic) USB controller> on ohci0
May  3 01:48:19 deimos /kernel: usb0: USB revision 1.0
May  3 01:48:19 deimos /kernel: uhub0: (unknown) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
May  3 01:48:19 deimos /kernel: uhub0: 4 ports with 4 removable, self powered
May  3 01:48:19 deimos /kernel: pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
May  3 01:48:19 deimos /kernel: IOAPIC #1 intpin 8 -> irq 6
May  3 01:48:19 deimos /kernel: IOAPIC #1 intpin 9 -> irq 10
May  3 01:48:19 deimos /kernel: pci1: <PCI bus> on pcib1
May  3 01:48:19 deimos /kernel: sym0: <896> port 0xb800-0xb8ff mem 0xfa000000-0xfa001fff,0xfa800000-0xfa8003ff irq 6 at device 5.0 on pci1
May  3 01:48:19 deimos /kernel: sym0: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
May  3 01:48:19 deimos /kernel: sym0: open drain IRQ line driver, using on-chip SRAM
May  3 01:48:19 deimos /kernel: sym0: using LOAD/STORE-based firmware.
May  3 01:48:19 deimos /kernel: sym0: handling phase mismatch from SCRIPTS.
May  3 01:48:19 deimos /kernel: sym0: SCAN AT BOOT disabled for targets 1 2 3 4 5 6 8 9 10 11 12 13 14 15.
May  3 01:48:19 deimos /kernel: sym1: <896> port 0xb400-0xb4ff mem 0xf9000000-0xf9001fff,0xf9800000-0xf98003ff irq 10 at device 5.1 on pci1
May  3 01:48:19 deimos /kernel: sym1: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
May  3 01:48:19 deimos /kernel: sym1: open drain IRQ line driver, using on-chip SRAM
May  3 01:48:19 deimos /kernel: sym1: using LOAD/STORE-based firmware.
May  3 01:48:19 deimos /kernel: sym1: handling phase mismatch from SCRIPTS.
May  3 01:48:19 deimos /kernel: sym1: SCAN AT BOOT disabled for targets 0 1 2 3 4 6 8 9 10 11 12 13 14 15.
May  3 01:48:19 deimos /kernel: fdc0: cannot reserve interrupt line
May  3 01:48:19 deimos /kernel: atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
May  3 01:48:19 deimos /kernel: atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
May  3 01:48:19 deimos /kernel: kbd0 at atkbd0
May  3 01:48:19 deimos /kernel: psm0: <PS/2 Mouse> irq 12 on atkbdc0
May  3 01:48:19 deimos /kernel: psm0: model Generic PS/2 mouse, device ID 0
May  3 01:48:19 deimos /kernel: vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
May  3 01:48:19 deimos /kernel: sc0: <System console> at flags 0x100 on isa0
May  3 01:48:19 deimos /kernel: sc0: VGA <16 virtual consoles, flags=0x300>
May  3 01:48:19 deimos /kernel: sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
May  3 01:48:19 deimos /kernel: sio0: type 16550A
May  3 01:48:19 deimos /kernel: sio1 at port 0x2f8-0x2ff irq 3 on isa0
May  3 01:48:19 deimos /kernel: sio1: type 16550A
May  3 01:48:19 deimos /kernel: ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
May  3 01:48:19 deimos /kernel: ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
May  3 01:48:19 deimos /kernel: ppc0: FIFO with 16/16/8 bytes threshold
May  3 01:48:19 deimos /kernel: lpt0: <Printer> on ppbus0
May  3 01:48:19 deimos /kernel: lpt0: Interrupt-driven port
May  3 01:48:19 deimos /kernel: ppi0: <Parallel I/O> on ppbus0
May  3 01:48:19 deimos /kernel: APIC_IO: Testing 8254 interrupt delivery
May  3 01:48:19 deimos /kernel: APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
May  3 01:48:19 deimos /kernel: APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
May  3 01:48:19 deimos /kernel: SMP: AP CPU #1 Launched!
May  3 01:48:19 deimos /kernel: ad0: 43979MB <IBM-DTLA-307045> [89355/16/63] at ata0-master UDMA33
May  3 01:48:19 deimos /kernel: acd0: CDROM <MATSHITA CR-586> at ata1-master using PIO4
May  3 01:48:19 deimos /kernel: Waiting 15 seconds for SCSI devices to settle
May  3 01:48:19 deimos /kernel: (noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
May  3 01:48:19 deimos /kernel: (noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
May  3 01:48:19 deimos /kernel: sa0 at sym1 bus 0 target 5 lun 0
May  3 01:48:19 deimos /kernel: sa0: <OnStream ADR50 Drive 2.33> Removable Sequential Access SCSI-2 device
May  3 01:48:19 deimos /kernel: sa0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
May  3 01:48:19 deimos /kernel: da0 at sym0 bus 0 target 0 lun 0
May  3 01:48:19 deimos /kernel: da0: <ACS 8600  > Fixed Direct Access SCSI-2 device
May  3 01:48:19 deimos /kernel: da0: 40.000MB/s transfers (20.000MHz, offset 16, 16bit), Tagged Queueing Enabled
May  3 01:48:19 deimos /kernel: da0: 156320MB (320143360 512 byte sectors: 255H 63S/T 19928C)
May  3 01:48:19 deimos /kernel: Mounting root from ufs:/dev/da0s1a

How-To-Repeat: Just cp or tar of a large amount of data from the scsi disks, usually hangs in few minutes. Reading from the scsi disk is enough (it's not necessary to write)
Comment 1 Matt Jacob freebsd_committer freebsd_triage 2001-10-02 03:08:20 UTC
State Changed
From-To: open->feedback

Is this still a problem? 


Comment 2 Matt Jacob freebsd_committer freebsd_triage 2001-10-02 03:08:20 UTC
Responsible Changed
From-To: freebsd-bugs->groudier

Gerard's driver
Comment 3 Mark Linimon freebsd_committer freebsd_triage 2004-08-26 04:13:20 UTC
Responsible Changed
From-To: groudier->freebsd-bugs

With bugmeister hat on, reassign from inactive committer. 

Is this still a problem with modern versions of FreeBSD?
Comment 4 Mark Linimon freebsd_committer freebsd_triage 2004-08-26 20:30:48 UTC
Responsible Changed
From-To: freebsd-bugs->freebsd-scsi

Reassign to appropriate mailing list.
Comment 5 Mark Linimon 2004-08-26 20:47:29 UTC
Does this problem still recur on modern versions of FreeBSD?
Comment 6 david 2004-10-07 20:20:51 UTC
This still appears on 4.10 and 5.2.1.  I've also tried installing an Adaptec
2940 (ahc) card and had similar lockups.  Large disk copies between IDE and
USB drives seemed to be OK.

 

I've got some ASUS AP1400R servers with CUR-DLS motherboards that experience
this.  I can make one available for testing if there is any interest.

 

Dave

 

----

David Grant,    

CanWeb Internet Services

 
Comment 7 Thomas.Hermann 2005-01-25 14:44:29 UTC
I'm running FreeBSD 5.3 SMP on a Dell Precision 530 and have experienced
similar behavior. Please email me if I can help resolve this issue.

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 5.3-RELEASE-p5 #3: Mon Jan 24 08:28:42 CST 2005
    root@:/usr/obj/usr/src/sys/ORVILLE
ACPI APIC Table: <DELL   WS 530 >
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2791.53-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7

Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
  Hyperthreading: 2 logical CPUs
real memory  = 1073180672 (1023 MB)
avail memory = 1036431360 (988 MB)
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  2
ioapic0: Changing APIC ID to 2
ioapic0 <Version 2.0> irqs 0-23 on motherboard
npx0: [FAST]
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <DELL WS 530 > on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <PCI-PCI bridge> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
nvidia0: <Quadro4 700 XGL> mem
0xf3f80000-0xf3ffffff,0xf4000000-0xf7ffffff,0xfc000000-0xfcffffff irq 16
at device 0.0 on pci1
nvidia0: [GIANT-LOCKED]
pcib2: <ACPI PCI-PCI bridge> at device 2.0 on pci0
pci2: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> at device 31.0 on pci2
pci3: <ACPI PCI bus> on pcib3
pci3: <base peripheral, interrupt controller> at device 0.0 (no driver
attached)
mpt0: <LSILogic 1030 Ultra4 Adapter> port 0xec00-0xecff mem
0xfe4a0000-0xfe4bffff,0xfe4c0000-0xfe4dffff irq 20 at device 12.0 on
pci3
mpt0: [GIANT-LOCKED]
mpt1: <LSILogic 1030 Ultra4 Adapter> port 0xe800-0xe8ff mem
0xfe460000-0xfe47ffff,0xfe480000-0xfe49ffff irq 21 at device 12.1 on
pci3
mpt1: [GIANT-LOCKED]
pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci4: <ACPI PCI bus> on pcib4
xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0xdc80-0xdcff mem
0xfe1ffc00-0xfe1ffc7f irq 23 at device 11.0 on pci4
miibus0: <MII bus> on xl0
ukphy0: <Generic IEEE 802.3u media interface> on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: Ethernet address: 00:08:74:39:6e:03
pci4: <serial bus, FireWire> at device 12.0 (no driver attached)
pcm0: <Creative EMU10K1> port 0xdc60-0xdc7f irq 17 at device 13.0 on
pci4
pcm0: <TriTech TR28602 AC97 Codec>
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel ICH2 UDMA100 controller> port
0xffa0-0xffaf,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
uhci0: <Intel 82801BA/BAM (ICH2) USB controller USB-A> port
0xff80-0xff9f irq 19 at device 31.2 on pci0
uhci0: [GIANT-LOCKED]
usb0: <Intel 82801BA/BAM (ICH2) USB controller USB-A> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
ums0: Logitech USB-PS/2 Optical Mouse, rev 2.00/18.00, addr 2, iclass
3/1
ums0: 7 buttons and Z dir.
pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
uhci1: <Intel 82801BA/BAM (ICH2) USB controller USB-B> port
0xff60-0xff7f irq 23 at device 31.4 on pci0
uhci1: [GIANT-LOCKED]
usb1: <Intel 82801BA/BAM (ICH2) USB controller USB-B> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
fdc0: <floppy drive controller> port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on
acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on
acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <ECP parallel printer port> port 0x778-0x77f,0x378-0x37f irq 7 on
acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem
0xd2000-0xd3fff,0xce000-0xd1fff,0xc0000-0xcdfff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on
isa0
Timecounters tick every 10.000 msec
acd0: CDROM <Lite-On LTN486S 48x Max/YDS4> at ata1-master UDMA33
acd1: CDRW <PHILIPS DVD+RW-D28/1.29> at ata1-slave UDMA33
Waiting 15 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
da0 at mpt1 bus 0 target 0 lun 0
da0: <SEAGATE ST336732LW 2223> Fixed Direct Access SCSI-3 device 
da0: 320.000MB/s transfers (160.000MHz, offset 63, 16bit), Tagged
Queueing Enabled
da0: 34732MB (71132959 512 byte sectors: 255H 63S/T 4427C)
da1 at mpt1 bus 0 target 1 lun 0
da1: <FUJITSU MAP3147NP 5605> Fixed Direct Access SCSI-3 device 
da1: 320.000MB/s transfers (160.000MHz, offset 127, 16bit), Tagged
Queueing Enabled
da1: 140014MB (286749480 512 byte sectors: 255H 63S/T 17849C)
Mounting root from ufs:/dev/da0s1a

Thank you,

Tom H.

-- 
--------------------------
-- Thomas M. Hermann
-- Research Associate
-- NIAR
--------------------------
Comment 8 Mark Linimon freebsd_committer freebsd_triage 2005-03-21 06:42:27 UTC
State Changed
From-To: feedback->open

Feedback was received some time ago.
Comment 9 Harrison Grundy 2007-03-27 16:33:02 UTC
Just confirming that this still occurs. Have you tried updating the 
BIOS/Firmware/etc of your system?
Comment 10 Remko Lodder freebsd_committer freebsd_triage 2007-03-27 20:40:46 UTC
State Changed
From-To: open->closed

The submitter informed Harrison that he no longer has the hardware, this 
was problably a old firmware issue. Closing the ticket (Thanks submitter 
and Harrison)