Bug 249120

Summary:

math/openblas: update to 0.3.10

Product:

Ports & Packages

Reporter:

Piotr Kubaj <pkubaj>

Component:

Individual Port(s)

Assignee:

Piotr Kubaj <pkubaj>

Status:

Closed FIXED

Severity:

Affects Only Me

CC:

fernape, phd_kimberlite

Priority:

---

Flags:

phd_kimberlite: maintainer-feedback+

Version:

Latest

Hardware:

Any

OS:

Any

URL:

https://github.com/xianyi/OpenBLAS/releases/tag/v0.3.10

Attachments:

Description	Flags
patch	pkubaj: maintainer-approval? (phd_kimberlite)

Description Piotr Kubaj freebsd_committer

2020-09-05 00:37:23 UTC

Created attachment 217758
patch

Update to the newest release.

Since POWER8 kernels now work on big-endian, make a new option for POWER8.

Comment 1 Fernando Apesteguía freebsd_committer

2020-09-06 17:03:31 UTC

^Triage: Reporter is committer, assign accordingly.

^Triage: If there is a changelog or release notes URL available for this version, please add it to the URL field.

Thanks!

Comment 2 commit-hook freebsd_committer

2020-09-06 17:53:08 UTC

A commit references this bug:

Author: pkubaj
Date: Sun Sep  6 17:52:26 UTC 2020
New revision: 547859
URL: https://svnweb.freebsd.org/changeset/ports/547859

Log:
  math/openblas: update to 0.3.10, add POWER8 option

  Changelog:

  common:

      Improved thread locking behaviour in blas_server and parallel getrf
      Imported bugfix 394 from LAPACK (spurious reference to "XERBL"
      due to overlong lines)
      Imported bugfix 403 from LAPACK (compile option "recursive" required
      for correctness with Intel and PGI)
      Imported bugfix 408 from LAPACK (wrong scaling in ZHEEQUB)
      Imported bugfix 411 from LAPACK (infinite loop in LARGV/LARTG/LARTGP)
      Fixed mismatches between BUFFERSIZE and GEMM_UNROLL parameters that
      could lead to crashes at large matrix sizes
      Restored internal soname in dynamic libraries on FreeBSD and Dragonfly
      Added API (openblas_setaffinity) to set thread affinity
      programmatically on Linux
      Added initial infrastructure for half-precision floating point
      (bfloat16) support with a generic implementation of SHGEMM
      Added CMAKE build system support for building the cblas_Xgemm3m
      functions
      Fixed CMAKE support for building in a path with embedded spaces
      Fixed CMAKE (non)handling of NO_EXPRECISION and MAX_STACK_ALLOC
      Fixed GCC version detection in the Makefiles
      Allowed overriding the names of AR, AS and LD in Makefile builds

  POWER:

      Fixed big-endian POWER8 ELFv2 builds on FreeBSD
      Fixed GCC version checks and DYNAMIC_ARCH builds on POWER9
      Fixed CMAKE build support for POWER9
      Fixed a potential race condition in the thread buffer allocation
      Worked around LAPACK test failures on PPC G4

  MIPS:

      Fixed a potential race condition in the thread buffer allocation
      Added support for MIPS 24K/24KE family based on P5600 kernels

  MIPS64:

      Fixed a potential race condition in the thread buffer allocation
      Added TARGET=GENERIC

  ARMV7:

      Fixed a race condition in the thread buffer allocation

  ARMV8:

      Fixed a race condition in the thread buffer allocation
      Fixed zero initialisation in the assembly for SGEMM and DGEMM BETA
      Improved performance of the ThunderX2 DAXPY kernel
      Added an optimized SGEMM kernel for Cortex A53
      Fixed Makefile support for INTERFACE64 (8-byte integer)

  x86_64:

      Fixed a syntax error in the CMAKE setup for SkylakeX
      Improved performance of STRSM on Haswell, SkylakeX and Ryzen
      Improved SGEMM performance for workloads where ldc is a
      multiple of 1024
      Improved DGEMM performance on Skylake X
      Fixed unwanted AVX512-dependency of SGEMM in DYNAMIC_ARCH
      builds created on SkylakeX
      Removed data alignment requirement in the SSE2 copy kernels
      that could cause spurious crashes
      Added a workaround for an optimizer bug in AppleClang 11.0.3
      Fixed LAPACK-TEST failures with Intel Fortran
      Fixed compilation and LAPACK test results with recent Flang
      and AMD AOCC
      Fixed DYNAMIC_ARCH builds with CMAKE on OS X
      Fixed missing exports of cblas_i?amin, cblas_i?min, cblas_i?max,
      cblas_?sum, cblas_?gemm3m in the shared library on OS X
      Fixed reporting of cpu name in DYNAMIC_ARCH builds (would sometimes
      show the name of an older generation chip supported by the same kernels)

  IBM Z:

      Improved performance of SGEMM/STRMM and DGEMM/DTRMM on Z14

  PR:		249120
  Approved by:	phd_kimberlite@yahoo.co.jp (maintainer)

Changes:
  head/math/openblas/Makefile
  head/math/openblas/distinfo