pub fn _mm512_add_ph(a: __m512h, b: __m512h) -> __m512h
Requires target feature: avx512fp16
Add packed half-precision (16-bit) floating-point elements in a and b, and store the results in dst (the returned vector).
Intel’s documentation
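To make the "packed" semantics concrete, the following is a minimal scalar sketch of what the description above implies: a 512-bit vector holds 32 lanes of 16 bits each, and every result lane depends only on the matching lanes of a and b. The names `LANES` and `add_ph_model` are hypothetical and are not part of `core::arch`. As a further assumption, `f32` stands in for the half-precision element type, so this model shows only the lane structure and does not reproduce 16-bit rounding.

```rust
/// Number of 16-bit lanes in a 512-bit vector (512 / 16).
/// Hypothetical helper constant, not part of `core::arch`.
const LANES: usize = 32;

/// Scalar model of a packed add (hypothetical, for illustration only).
///
/// Lane `i` of the result is `a[i] + b[i]`; no lane reads from any other lane.
/// `f32` is used as a stand-in element type, so results are NOT rounded to
/// half precision the way the hardware instruction would round them.
fn add_ph_model(a: [f32; LANES], b: [f32; LANES]) -> [f32; LANES] {
    std::array::from_fn(|i| a[i] + b[i])
}
```

Calling `add_ph_model` with every lane of a set to 1.5 and lane i of b set to i gives 1.5 in lane 0 and 32.5 in lane 31. The real intrinsic is expected to behave the same way per lane, except that inputs and results are 16-bit floats.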