OpenBLAS v0.3.5 Release Notes
Release Date: 2018-12-31 // over 5 years ago-
common:
- loop unrolling in TRMV has been enabled again.
- A domain error in the thread workload distribution for SYRK
๐ has been fixed. - ๐ gmake builds will now automatically add -fPIC to the build
options if the platform requires it. - a pthreads key leakage (and associate crash on dlclose) in
๐ the USE_TLS codepath was fixed. - ๐ building of the utest cases on systems that do not provide
๐ an implementation of complex.h was fixed.
x86_64:
- the SkylakeX code was changed to compile on OSX.
- unwanted application of the -march=skylake-avx512 option
๐ to the common code parts of a DYNAMIC_ARCH build was fixed. - ๐ improved performance of SGEMM for small workloads on Skylake X.
- ๐ performance of SGEMM and DGEMM was improved on Haswell.
ARMV8:
- ๐ง a configuration error that broke the CNRM2 kernel was corrected.
- ๐ compilation of the GEMM kernels with CMAKE was fixed.
- ๐ DYNAMIC_ARCH builds are now available with CMAKE as well.
- using CMAKE for cross-compilation to the new cpu TARGETs
introduced in 0.3.4 now works.
POWER:
- a problem in cpu autodetection for AIX has been corrected.