Bug 255359 - www/firefox: Fails to build with LTO enabled after recent update: LLVM ERROR: Broken module found
Summary: www/firefox: Fails to build with LTO enabled after recent update: LLVM ERROR:...
Status: Closed FIXED
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s) (show other bugs)
Version: Latest
Hardware: Any Any
: --- Affects Some People
Assignee: freebsd-gecko (Nobody)
Keywords: needs-qa
Depends on:
Reported: 2021-04-24 10:17 UTC by jakub_lach
Modified: 2021-05-11 18:11 UTC (History)
5 users (show)

See Also:
jbeich: maintainer-feedback+
koobs: maintainer-feedback? (dim)


Note You need to log in before you can comment on or make changes to this bug.
Description jakub_lach 2021-04-24 10:17:51 UTC

readonly align 4 dereferenceable(12) %2939) #26, !noalias !103756
Wrong types for attribute: inalloca nest noalias nocapture noundef nonnull readnone readonly signext zeroext byref(void) byval(void) preallocated(void) sret(void) align 1 derefe
ceable(1) dereferenceable_or_null(1)
  call fastcc align 8 void _ZN9wgpu_core2id1_75_$LT$impl$u20$serde..ser..Serialize$u20$for$u20$wgpu_core..id..SerialId$GT$9serialize17hf6dd92712563b176E"(%"wgpu_core::id::Seri
d::Id"* noalias nonnull readonly align 4 dereferenceable(12) %3, i64** nonnull align 8 dereferenceable(8) %1) #26, !noalias !103828
LLVM ERROR: Broken module found, compilation aborted!
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace.
#0 0x0000000803159779 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/usr/local/llvm12/lib/libLLVM-12.so+0x2959779)
#1 0x00000008031579b5 llvm::sys::RunSignalHandlers() (/usr/local/llvm12/lib/libLLVM-12.so+0x29579b5)
#2 0x000000080315a181 (/usr/local/llvm12/lib/libLLVM-12.so+0x295a181)
#3 0x000000080078fd20 (/lib/libthr.so.3+0x19d20)
Comment 1 Dimitry Andric freebsd_committer 2021-04-24 11:04:17 UTC
From the stack trace, it looks like this is using the devel/llvm12 port? I'm adding Brooks on CC, because if any fix is identified, it will have to be added to that port.
Comment 2 jakub_lach 2021-04-24 13:12:09 UTC
Yes, it's using llvm12 package (autoinstalled as a build dependency).

FWIW, I just have completed firefox built without LTO.
Comment 3 Jan Beich freebsd_committer 2021-04-25 00:51:13 UTC
Cross-language LTO likely requires Clang and rustc to use the same LLVM version. lang/rust at 1.51.0 bundles LLVM 11.0.1, so to use LTO add DEFAULT_VERSIONS+=llvm=11 to /etc/make.conf or www/firefox/Makefile.local. However, the next lang/rust update to 1.52.0 (scheduled on 2021-05-06) will have LLVM 12.0.0.

If LTO is important it needs to be enabled by default to avoid regressions due to changes in framework, dependencies or Firefox updates.
Comment 4 Dimitry Andric freebsd_committer 2021-04-26 08:13:26 UTC
(In reply to Jan Beich from comment #3)
Ah yes, the rust+llvm11 and firefox+llvm12 mismatch is probably the cause, then. On my latest poudriere run I see the same "Wrong types for attribute" errors, followed by lld SIGABRT'ing.

Indeed llvm12 was only pulled in because of Firefox, so I'll try with llvm11 instead.
Comment 5 parv 2021-04-30 22:27:32 UTC
I still see the problem with llvm12 (on FreeBSD 13).

An earlier attempt to build firefox with llvm12 with LTO option & MAKE_JOBS_UNSAFE=yes had failed.

Not using LTO option, there is no "broken module" error from llvm and firefox build succeeds.
Comment 6 jakub_lach 2021-05-03 18:27:39 UTC
(In reply to jakub_lach from comment #2)

I've just finished building firefox with LTO, with

.if ${.CURDIR:M*/www/firefox}
Comment 7 Dimitry Andric freebsd_committer 2021-05-05 10:02:41 UTC
Hm I've put DEFAULT_VERSIONS+=llvm=11 in my make.conf, but it seems at least the mesa-dri ports still hardcode llvm10. :) So that's a little annyoing, but I guess it can't be helped?
Comment 8 Evgeniy Khramtsov 2021-05-05 11:20:25 UTC
(In reply to Dimitry Andric from comment #7)

See: https://gitter.im/FreeBSDDesktop/Lobby?at=6021a47aaa6a6f319d1326b5

To override LLVM ver for mesa-libs, one should locally patch the ports tree. And if one patches the ports tree, it makes more sense to ditch mesa-libs and use mesa-devel instead.

I don't know why x11@ didn't go the bsd.gecko.mk route (ignore llvm default in ports but respect make.conf).

FWIW, LLVM dependency is optional with mesa-devel, which IMO is much more flexible/configurable/usable for users who consume ports the Gentoo-way.
Comment 9 Dimitry Andric freebsd_committer 2021-05-05 13:40:04 UTC
(In reply to Evgeniy Khramtsov from comment #8)
I'm unsure if mesa can "just" compile with different LLVM versions than what it was released for, as it may be that they depend on specific APIs.

The LLVM_DEFAULT=  10 in graphics/mesa-dri/Makefile.common was put there on 2020-10-12 here: https://cgit.freebsd.org/ports/commit/?id=57ce68a0c0cab56c3d08120e6f55c11a8c248f06

Note that it says "=" not "?=" which must be for a reason. :)

In any case, I can at least confirm that LTO works fine for www/firefox, if I set DEFAULT_VERSIONS+=llvm=11.

I think it would be handy if in general the LLVM versions for firefox and rust are kept the same?
Comment 10 Evgeniy Khramtsov 2021-05-05 14:06:02 UTC
(In reply to Dimitry Andric from comment #9)

I don't feel like hijacking the topic of this bug for discussion of x11@ policy, but I think I should clarify:

> if mesa can compile

Upstream accepts LLVM versions >= 8.0.0 https://gitlab.freedesktop.org/mesa/mesa/-/blob/mesa-20.2.3/meson.build#L1449-1455

For example, Arch Linux already builds against LLVM 11 (see llvm-libs version): https://archlinux.org/packages/extra/x86_64/mesa/

The current LLVM 10 for mesa-libs is outdated.

> Note that it says "=" not "?=" which must be for a reason. :)

x11@ favors apparent stability and the use of binary packages.

They also think that putting artificial limitations like non-overridable LLVM version for people who build from source is a good idea, maybe because handling possible reports is too hard. Not to mention the time and carbon footprint of the hardcoded LLVM version.
Comment 11 Evgeniy Khramtsov 2021-05-05 14:23:59 UTC
(In reply to Evgeniy Khramtsov from comment #10)

Note: Arch also built against 11 when mesa was at 20. This one is important.
Comment 12 Jan Beich freebsd_committer 2021-05-05 15:02:35 UTC
(In reply to Dimitry Andric from comment #9)
> I think it would be handy if in general the LLVM versions for firefox and rust are kept the same?

Without enabling LTO by default it'd double (or triple as LTO builds are longer) the amount of pre-commit testing. ports/ committers have to test changes on their own hardware as poudriere requires root which universe* machines[1] don't provide. Without poudriere[2] testing is fragile and hard to automate, so would likely trip on complex desktop ports with many dependencies (like www/firefox). Some folks experimented[3] with offloading testing to third-party CI.

For example, SNDIO used to break often until it was enabled by default.

[1] https://www.freebsd.org/internal/machines/
[2] https://people.freebsd.org/~dinoex/howto/README.txt
[3] https://bugs.freebsd.org/bugzilla/buglist.cgi?quicksearch=comment:pipelines+status:closed
Comment 13 Jan Beich freebsd_committer 2021-05-05 15:14:19 UTC
Kinda forgot, LLVM_DEFAULT desync can be caused by either gecko@ bumping too early and rust@ forgetting to bump.
Comment 14 commit-hook freebsd_committer 2021-05-05 15:47:34 UTC
A commit in branch main references this bug:

URL: https://cgit.FreeBSD.org/ports/commit/?id=b1c90afe23f93698805a22802f2dd02acbe4488a

commit b1c90afe23f93698805a22802f2dd02acbe4488a
Author:     Jan Beich <jbeich@FreeBSD.org>
AuthorDate: 2021-05-05 15:19:58 +0000
Commit:     Jan Beich <jbeich@FreeBSD.org>
CommitDate: 2021-05-05 15:46:08 +0000

    gecko: wait for rust before swiching llvm* next time

    Cross-language LTO in www/firefox requires devel/llvm* and bundled
    LLVM in lang/rust to be of the same major version. Adjust the comment
    to delay future bumps but until LTO is enabled desync can still happen.

    PR:             255359
    Suggested by:   dim

 Mk/bsd.gecko.mk | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)
Comment 15 Jan Beich freebsd_committer 2021-05-05 15:52:29 UTC
Can someone check LTO build after applying review D30099? Don't forget to remove DEFAULT_VERSION+=llvm=11 workaround.

$ rustc -vV
rustc 1.52.0
binary: rustc
commit-hash: unknown
commit-date: unknown
host: x86_64-unknown-freebsd
release: 1.52.0
LLVM version: 12.0.0   <--------- ATTENTION HERE
Comment 16 jakub_lach 2021-05-05 18:34:06 UTC
(In reply to Jan Beich from comment #13)

Yes, I've tried building now without llvm11 hack with LTO, wasn't successful.

LLVM ERROR: Broken module found, compilation aborted!
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace.
#0 0x0000000803159779 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) (/usr/local/llvm12/lib/libLLVM-12.so+0x2959779)
#1 0x00000008031579b5 llvm::sys::RunSignalHandlers() (/usr/local/llvm12/lib/libLLVM-12.so+0x29579b5)
#2 0x000000080315a181 (/usr/local/llvm12/lib/libLLVM-12.so+0x295a181)
#3 0x000000080078fe10 (/lib/libthr.so.3+0x19e10)
clang++: error: unable to execute command: Abort trap (core dumped)
clang++: error: linker command failed due to signal (use -v to see invocation)
gmake[5]: *** [/usr/obj/usr/ports/www/firefox/work/firefox-88.0.1/config/rules.mk:545: libxul.so] Error 254
gmake[5]: Leaving directory '/usr/obj/usr/ports/www/firefox/work/.build/toolkit/library/build'
gmake[4]: *** [/usr/obj/usr/ports/www/firefox/work/firefox-88.0.1/config/recurse.mk:72: toolkit/library/build/target] Error 2
gmake[4]: Leaving directory '/usr/obj/usr/ports/www/firefox/work/.build'
gmake[3]: *** [/usr/obj/usr/ports/www/firefox/work/firefox-88.0.1/config/recurse.mk:34: compile] Error 2
gmake[3]: Leaving directory '/usr/obj/usr/ports/www/firefox/work/.build'
gmake[2]: *** [/usr/obj/usr/ports/www/firefox/work/firefox-88.0.1/config/rules.mk:355: all] Error 2
gmake[2]: Leaving directory '/usr/obj/usr/ports/www/firefox/work/.build'
Comment 17 Dimitry Andric freebsd_committer 2021-05-06 11:19:24 UTC
(In reply to Jan Beich from comment #15)
How does review D30099 pull in LLVM 12 exactly? I don't see that detail anywhere in the patch. There doesn't seem to be any change to LLVM_DEFAULT, nor does the port have any dependency on another llvm port.

Maybe it's got its own vendored copy of llvm 12 embedded? That's awful of course, but probably unavoidable if they're doing incompatible changes to it.
Comment 18 Evgeniy Khramtsov 2021-05-06 15:27:46 UTC
(In reply to Jan Beich from comment #15)

I can't reproduce this with llvm=12 and new rust from this review. To mimic a more vanilla environment, I built firefox without DankBSD lightweight hardening and with X11 on. I can post a compressed poudriere log if you need this. Sorry for not posting yesterday, my build machine is kinda slow. The options were:

---Begin OPTIONS List---
===> The following configuration options are available for firefox-88.0.1,2:
     CANBERRA=off: Sound theme alerts
     DBUS=on: D-Bus IPC system support
     DEBUG=off: Build with debugging support
     FFMPEG=on: FFmpeg support (WMA, AIFF, AC3, APE...)
     LIBPROXY=off: Proxy support via libproxy
     LTO=on: Use Link-Time Optimization
     OPTIMIZED_CFLAGS=on: Use extra compiler optimizations
     PROFILE=off: Build with profiling support
     TEST=off: Build and/or run tests
====> Extra cubeb audio backends (OSS is always available)
     ALSA=off: ALSA audio architecture support
     JACK=off: JACK audio server support
     PULSEAUDIO=on: PulseAudio sound server support
     SNDIO=off: Sndio audio support
===> Use 'make config' to modify these settings
---End OPTIONS List---

LTO likely requires more memory to build, but I have 16 GB swap so the build passed. Note that profiling on may increase the requirements for an LTO build. The reporter should check if there are any out of memory messages in dmesg.
Comment 19 Charlie Li freebsd_committer 2021-05-07 01:33:14 UTC
(In reply to Dimitry Andric from comment #17)
rust 1.52.0's copy/fork of LLVM is 12. Remember that we had an OPTION to use ports LLVM but the rust folks added a target-features bit that the gecko@ ports need to build, per bug 223342 and bug 223300.

review D30099 finally finished building on my machine, so will check LTO builds for both www/firefox and mail/thunderbird soon.
Comment 20 Charlie Li freebsd_committer 2021-05-09 03:21:49 UTC
(In reply to Charlie Li from comment #19)
LTO with rust 1.52.0 and LLVM 12 works with www/firefox. The combination fails with mail/thunderbird for an unrelated reason (fails with rust 1.51.0 and LLVM 11 there).
Comment 21 Jan Beich freebsd_committer 2021-05-11 18:11:49 UTC
Thanks for confirming. Fixed by ports d8837418ea74 and ensured by ports 6ec985b72d58.