ArrayFire v3.7.2 Release Notes

Release Date: 2020-07-13 // almost 4 years ago
  • v3.7.2

    👌 Improvements

    • Cache CUDA kernels to disk to improve load times(Thanks to @cschreib-ibex) #2848
    • Staticly link against cuda libraries #2785
    • 🏗 Make cuDNN an optional build dependency #2836
    • 👌 Improve support for different compilers and OS #2876 #2945 #2925 #2942 #2943 #2945
    • 👌 Improve performance of join and transpose on CPU #2849
    • 👌 Improve documentation #2816 #2821 #2846 #2918 #2928 #2947
    • ⬇️ Reduce binary size using NVRTC and template reducing instantiations #2849 #2861 #2890
    • 👌 Improve reduceByKey performance on OpenCL by using builtin functions #2851
    • 👌 Improve support for Intel OpenCL GPUs #2855
    • 👍 Allow staticly linking against MKL #2877 (Sponsered by SDL)
    • 👍 Better support for older CUDA toolkits #2923
    • ➕ Add support for CUDA 11 #2939
    • ➕ Add support for ccache for faster builds #2931
    • ➕ Add support for the conan package manager on linux #2875
    • 🏗 Propagate build errors up the stack in AFError exceptions #2948 #2957
    • 👌 Improve runtime dependency library loading #2954
    • 👌 Improved cuDNN runtime checks and warnings #2960
    • Document af_memory_manager_* native memory return values #2911
    • ➕ Add support for cuDNN 8 #2963

    🛠 Fixes

    • 🐛 Bug crash when allocating large arrays #2827
    • 🛠 Fix various compiler warnings #2827 #2849 #2872 #2876
    • 🛠 Fix minor leaks in OpenCL functions #2913
    • 🛠 Various continuous integration related fixes #2819
    • 🛠 Fix zero padding with convolv2NN #2820
    • Fix af_get_memory_pressure_threshold return value #2831
    • Increased the max filter length for morph
    • 🖐 Handle empty array inputs for LU, QR, and Rank functions #2838
    • 🛠 Fix FindMKL.cmake script for sequential threading library #2840
    • 🔨 Various internal refactoring #2839 #2861 #2864 #2873 #2890 #2891 #2913
    • 🛠 Fix OpenCL 2.0 builtin function name conflict #2851
    • 🛠 Fix error caused when releasing memory with multiple devices #2867
    • 🛠 Fix missing set stacktrace symbol from unified API #2915
    • 🛠 Fix zero padding issue in convolve2NN #2820
    • 🛠 Fixed bugs in ReduceByKey #2957
    • ➕ Add clblast patch to handle custom context with multiple devices #2967

    Contributions

    Special thanks to our contributors:
    Corentin Schreiber
    Jacob Kahn
    Paul Jurczak
    Christoph Junghans