Blog Posts
November 3 2024
A quick overview of the functionality of the new instructions introduced by AVX-10.2.
June 7 2024
Tackling the problem of creating correct and fast implementations of fmod, including considerations for SIMD vectorization.
June 13 2023
A discussion of how changing the memory layout of the table used in Levenshtein edit distance can make it more SIMD friendly and how this can be leveraged with AVX-512.
April 29 2023
Exploring how AVX-512VBMI can be used to perform Granlund-Montgomery division on 8-bit uints, and how a simpler more naive algorithm beats the hardware div instruction by up to ~30x .
January 9 2023
A look at the problem of integer averaging, techniques for implementing averaging while following various rounding schemes, and potential techniques for creating SIMD vectorized implementations thereof.