OpenBLAS v0.3.1 Release Notes
Release Date: 2018-07-01 // almost 6 years ago-
common:
- rewritten thread initialization code with significantly reduced overhead
- โ added CBLAS interfaces to the IxAMIN BLAS extension functions
- ๐ fixed the lapack-test target
- ๐ CMAKE builds now create an OpenBLASConfig.cmake file
- ZAXPY now uses a single thread for small input sizes
- โก๏ธ the LAPACK code was updated from Reference-LAPACK/lapack#253
POWER:
- corrected CROT and ZROT behaviour with zero INC_X
ARMV7:
- corrected xDOT behaviour with zero INC_X or INC_Y
x86_64:
- ๐ retired some older targets of DYNAMIC_ARCH builds to a new option DYNAMIC_OLDER,
this affects PENRYN,DUNNINGTON,OPTERON,OPTERON_SSE3,BOBCAT,ATOM and NANO
๐ (which will still be supported via the slower PRESCOTT kernels when this option is not set) - added an option DYNAMIC_LIST that (used in conjunction with DYNAMIC_ARCH) allows
๐ to specify the list of x86_64 targets to include. Any target not on the list will be supported by
the Sandybridge or Nehalem kernels if available, or by Prescott. - ๐ improved SWITCH_RATIO on Haswell for increased GEMM throughput
- โ added initial support for Intel Skylake X, including an AVX512 SGEMM kernel
- โ added autodetection of Intel Cannon Lake series as Skylake X
- โ added a default L2 cache size for hypervisors that return zero here (Chromebook)
- ๐ fixed a name clash with recent Windows10 headers that broke the build with (at least)
recent mingw from MSYS2 - ๐ fixed a link error in mixed clang/gfortran builds with OpenMP
- ๐ updated the OSX deployment target to 10.8
- ๐ switched on parallel make for builds on MS Windows by default
x86:
- fixed SSWAP and DSWAP behaviour with zero INC_X and INC_Y