Download Latest Version ispc-v1.30.0-macOS.x86_64.tar.gz (112.2 MB)
Email in envelope

Get an email when there's a new version of ispc

Home / v1.30.0
Name Modified Size InfoDownloads / Week
Parent folder
ispc-examples-v1.30.0.tar.gz 2026-02-04 18.3 MB
ispc-v1.30.0-windows.zip.sig 2026-02-04 310 Bytes
ispc-v1.30.0-linux.tar.gz.sig 2026-02-04 310 Bytes
ispc-v1.30.0-macOS.universal.tar.gz.sig 2026-02-04 310 Bytes
ispc-v1.30.0-macOS.x86_64.tar.gz.sig 2026-02-04 310 Bytes
ispc-examples-v1.30.0.tar.gz.sig 2026-02-04 310 Bytes
ispc-v1.30.0-macOS.arm64.tar.gz.sig 2026-02-04 310 Bytes
ispc-examples-v1.30.0.zip.sig 2026-02-04 310 Bytes
ispc-v1.30.0-linux.aarch64.tar.gz.sig 2026-02-04 310 Bytes
ispc-v1.30.0-linux.tar.gz 2026-02-04 109.3 MB
ispc-v1.30.0-windows.zip 2026-02-04 137.9 MB
ispc-examples-v1.30.0.zip 2026-02-04 18.4 MB
ispc-v1.30.0-linux.aarch64.tar.gz 2026-02-04 64.7 MB
ispc-v1.30.0-macOS.universal.tar.gz 2026-02-04 216.9 MB
ispc-v1.30.0-macOS.arm64.tar.gz 2026-02-04 105.0 MB
ispc-v1.30.0-macOS.x86_64.tar.gz 2026-02-04 112.2 MB
=== v1.30.0 === (04 February 2026) source code.tar.gz 2026-02-04 20.1 MB
=== v1.30.0 === (04 February 2026) source code.zip 2026-02-04 21.4 MB
README.md 2026-02-04 3.2 kB
Totals: 19 Items   824.1 MB 10

ISPC release featuring Intel AMX (Advanced Matrix Extensions) support in the standard library, performance improvements for backward memory access patterns, and bug fixes. Based on a patched LLVM 21.1.8.

Standard Library:

  • Intel AMX (Advanced Matrix Extensions) support has been added to the standard library. AMX provides hardware acceleration for matrix operations, particularly useful for machine learning workloads. The new <amx.isph> header provides functions for tile configuration, data loading/storing, and matrix dot products for INT8, BF16, and FP16 data types. AMX is supported on avx512spr, avx512gnr, and avx10.2dmr targets.

Language Changes:

  • Integral type aliases (size_t, ptrdiff_t, intptr_t, uintptr_t) can now be used as non-type template parameters.

Performance:

  • Optimized backward memory access patterns (e.g., dst[size-1-i]) to use contiguous load/store operations with vector shuffle, providing 5-10x speedup compared to scatter/gather operations.

Deprecated Targets:

  • The sse2-i32x4 and sse2-i32x8 targets are no longer deprecated. Based on customer feedback indicating active use, we have decided to retain these targets and removed the deprecation warning.

Bug Fixes:

  • Fixed integral type aliases not being accepted as non-type template parameters.

  • Fixed varying control flow regression on NEON targets introduced in v1.26.0.

  • Fixed performance regression on Apple Silicon (and other ARM platforms) introduced in v1.26.0, which caused up to 30% slowdown in some workloads.

  • Fixed sub-optimal code generation when using extract() that caused unnecessary stack spills.

Build System:

  • Updated default LLVM version to 21.1.8.
  • Added support for building with LLVM 22.0 and LLVM 23.0.

Recommended versions of Runtime Dependencies when targeting GPU:

Linux:

Alternatively, you can use a validated gfx driver stack supporting Intel Arc(TM) available at https://dgpu-docs.intel.com/driver/installation.html

Windows:

Components revisions used in GPU-enabled build:

Source: README.md, updated 2026-02-04