OpenBLAS v0.3.9 Release Notes

Release Date: 2020-03-01 // about 4 years ago
  • common:

    • ๐Ÿ›  Fixed a miscompilation of the GETRF functions with CMAKE
    • The size of the memory buffer used for splitting GEMM tasks across
      ๐Ÿ‘ท multiple threads can now be configured in the build system.
    • ๐Ÿ›  Imported bugfix 390 from LAPACK (missing NaN propagation in xCOMBSSQ)

    POWER:

    • ๐Ÿ›  fixed several compilation problems related to endianness and
      ๐Ÿ‘ ELF version support on POWER8 and POWER9.
    • ๐Ÿ›  fixed misuse of the absolute value IAMIN/IAMAX in place of IMIN/IMAX
    • ๐Ÿ›  fixed a race condition in the level3 blas code

    MIPS64:

    • ๐Ÿ›  fixed misuse of the absolute value IAMIN/IAMAX in place of IMIN/IMAX

    ARMV7:

    • ๐Ÿ›  fixed a race condition in the level3 blas code
    • ๐Ÿ›  fixed a compilation problem on Android

    ARMV8:

    • โž• Added support for Ampere EMAG8180
    • โž• Added support for Neoverse N1
    • ๐Ÿ‘Œ improved performance of the blas_lock function
    • ๐Ÿ›  fixed a race condition in the level3 blas code
    • ๐Ÿ›  Fixed a performance regression on TSV110 servers

    x86_64:

    • ๐Ÿ›  Fixed a long-standing error with undeclared register clobbers in
      the DSCAL microkernel for Haswell,SkylakeX and Zen exposed by gcc9.2
    • ๐Ÿ›  Fixed a long-standing bug in the SSE implementation of the IAMAX functions
    • Fixed a cmake build failure with DYNAMIC_ARCH on x86_64
    • ๐Ÿ›  Fixed an oversight in the cpu detection code for Intel Goldmont+,
      Cannon Lake and Ice Lake
    • ๐Ÿ›  Fixed compile failure on OSX when the compiler name contains a dash
      (e.g. gcc-9)
    • ๐Ÿ›  Fixed compilation with MinGW on SkylakeX
    • ๐Ÿ‘Œ Improved speed of the AVX512 GEMM3M code, added an AVX512 kernel for
      ๐ŸŽ STRMM and improved performance of the AVX2 GEMM kernels

    IBM Z:

    • ๐Ÿ›  fixed compilation of the DYNAMIC_ARCH code

    Download OpenBLAS