OpenBLAS v0.3.1 Release Notes

Release Date: 2018-07-01 // almost 6 years ago
  • common:

    • rewritten thread initialization code with significantly reduced overhead
    • โž• added CBLAS interfaces to the IxAMIN BLAS extension functions
    • ๐Ÿ›  fixed the lapack-test target
    • ๐Ÿ— CMAKE builds now create an OpenBLASConfig.cmake file
    • ZAXPY now uses a single thread for small input sizes
    • โšก๏ธ the LAPACK code was updated from Reference-LAPACK/lapack#253

    POWER:

    • corrected CROT and ZROT behaviour with zero INC_X

    ARMV7:

    • corrected xDOT behaviour with zero INC_X or INC_Y

    x86_64:

    • ๐Ÿ— retired some older targets of DYNAMIC_ARCH builds to a new option DYNAMIC_OLDER,
      this affects PENRYN,DUNNINGTON,OPTERON,OPTERON_SSE3,BOBCAT,ATOM and NANO
      ๐Ÿ‘ (which will still be supported via the slower PRESCOTT kernels when this option is not set)
    • added an option DYNAMIC_LIST that (used in conjunction with DYNAMIC_ARCH) allows
      ๐Ÿ‘ to specify the list of x86_64 targets to include. Any target not on the list will be supported by
      the Sandybridge or Nehalem kernels if available, or by Prescott.
    • ๐Ÿ‘Œ improved SWITCH_RATIO on Haswell for increased GEMM throughput
    • โž• added initial support for Intel Skylake X, including an AVX512 SGEMM kernel
    • โž• added autodetection of Intel Cannon Lake series as Skylake X
    • โž• added a default L2 cache size for hypervisors that return zero here (Chromebook)
    • ๐Ÿ›  fixed a name clash with recent Windows10 headers that broke the build with (at least)
      recent mingw from MSYS2
    • ๐Ÿ›  fixed a link error in mixed clang/gfortran builds with OpenMP
    • ๐Ÿš€ updated the OSX deployment target to 10.8
    • ๐Ÿ switched on parallel make for builds on MS Windows by default

    x86:

    • fixed SSWAP and DSWAP behaviour with zero INC_X and INC_Y

    Download OpenBLAS