OpenBLAS v0.3.12 Release Notes
Release Date: 2020-10-24 // over 3 years ago-
common:
- ๐ Fixed missing BLAS/LAPACK functions (inadvertently dropped during
๐ท the build system restructuring to support selective compilation) - ๐ Fixed argument conversion macro in LAPACKE_zgesvdq (LAPACK #458)
POWER:
- โ Added optimized SCOPY/CCOPY kernels for POWER10
- 0๏ธโฃ Increased and unified the default size of the GEMM buffer
- ๐ Fixed building for POWER10 in DYNAMIC_ARCH mode
- โ POWER10 compatibility test now checks binutils version as well
- โ Cleaned up compiler warnings
x86_64:
- corrected compiler version checks for AVX2 compatibility
- โ added compiler option -mavx2 for building with flang
- ๐ fixed direct SGEMM pathway for small matrix sizes (broken by
๐จ the code refactoring in 0.3.11) - ๐ fixed unhandled partial register clobbers in several kernels
for AXPY,DOT,GEMV_N and GEMV_T flagged by gcc10 tree-vectorizer
ARMV8:
- ๐ improved Apple Vortex support to include cross-compiling
md5sums:
03bff4558fc701b7d0e689814055ecb2 OpenBLAS-0.3.12.zip
baf8c58c0ef6ebe0f9eb74a5c4acd662 OpenBLAS-0.3.12.tar.gz
4df4ebb7b5c4f1b5ec8fa58f48be6a51 OpenBLAS-0.3.12-x64.zip - ๐ Fixed missing BLAS/LAPACK functions (inadvertently dropped during