OpenBLAS v0.3.5 Release Notes

Release Date: 2018-12-31
    common:

    • loop unrolling in TRMV has been enabled again.
    • A domain error in the thread workload distribution for SYRK
      has been fixed.
    • gmake builds will now automatically add -fPIC to the build
      options if the platform requires it.
    • a pthreads key leakage (and associated crash on dlclose) in
      the USE_TLS codepath was fixed.
    • building of the utest cases on systems that do not provide
      an implementation of complex.h was fixed.

    x86_64:

    • the SkylakeX code was changed to compile on OSX.
    • unwanted application of the -march=skylake-avx512 option
      to the common code parts of a DYNAMIC_ARCH build was fixed.
    • improved performance of SGEMM for small workloads on SkylakeX.
    • performance of SGEMM and DGEMM was improved on Haswell.

    ARMV8:

    • a configuration error that broke the CNRM2 kernel was corrected.
    • compilation of the GEMM kernels with CMAKE was fixed.
    • DYNAMIC_ARCH builds are now available with CMAKE as well.
    • using CMAKE for cross-compilation to the new cpu TARGETs
      introduced in 0.3.4 now works.

    POWER:

    • a problem in cpu autodetection for AIX has been corrected.
