Releases: vosen/ZLUDA

Version 5-preview.49

16 Jul 23:06
7c6b95a

Pre-release

What's Changed

Full Changelog: v5-preview.48...v5-preview.49

Version 5-preview.48

16 Jul 19:32
0396892

Pre-release

What's Changed

Full Changelog: v5-preview.47...v5-preview.48

Version 5-preview.47

16 Jul 18:54
777392f

Pre-release

What's Changed

New Contributors

Full Changelog: v5-preview.46...v5-preview.47

Version 5-preview.46

16 Jul 18:14
6fb09f3

Pre-release

What's Changed

Full Changelog: v5-preview.45...v5-preview.46

Version 5-preview.45

14 Jul 22:15
06b28cf

Pre-release

What's Changed

Full Changelog: v5-preview.44...v5-preview.45

Version 5-preview.44

10 Jul 19:57
373d6d9

Pre-release

What's Changed

Full Changelog: v5-preview.43...v5-preview.44

Version 5-preview.43

09 Jul 16:34
081f7d0

Pre-release

What's Changed

New Contributors

Full Changelog: v4...v5-preview.43

Version 4

31 Dec 15:19
de870db

This is the first release after the rollback and is very limited: only Geekbench is supported.

Version 3

12 Feb 14:09

Nobody expects the Red Team

Too many changes to list, but broadly:

  • Remove Intel GPU support from the compiler
  • Add AMD GPU support to the compiler
  • Remove Intel GPU host code
  • Add AMD GPU host code
  • More device instructions: from 40 to 68
  • More host functions: from 48 to 184
  • Add a proof-of-concept implementation of the OptiX framework
  • Add minimal support for cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
  • Improve ZLUDA launcher for Windows

Version 2

22 Feb 17:17
4d3e37b

The goal of version 2 has been to fix end-to-end execution of Geekbench and improve Windows support:

  • Several new host-side functions are supported now (e.g. cuModuleLoadDataEx)
  • Several bugs have been fixed on the kernel side (e.g. threadIdx/blockIdx are now handled correctly)
  • A minor improvement in generated code yielded better I/O performance when reading and writing vector objects, which improved performance by several percentage points in select Geekbench benchmarks
  • ZLUDA now ships its own injector (with_zluda.exe) which should make running ZLUDA on Windows much easier
  • Additionally, we have gained the ability to easily create traces of CUDA kernel execution, which makes enabling new workloads much easier
  • ZLUDA now has CI, which produces binaries on every pull request and commit

Special thanks to @take-cheeze, @nilsmartel and @ritschwumm for contributing to this release.