Opcode/ Instruction |
Op/ En |
64/32 bit Mode Support |
CPUID Feature Flag |
Description |
EVEX.128.NP.MAP5.W0 5E /r VDIVPH xmm1{k1}{z}, xmm2, xmm3/ m128/m16bcst |
A |
V/V |
AVX512-FP16 AVX512VL |
Divide packed FP16 values in xmm2 by packed FP16 values in xmm3/m128/m16bcst, and store the result in xmm1 subject to writemask k1. |
EVEX.256.NP.MAP5.W0 5E /r VDIVPH ymm1{k1}{z}, ymm2, ymm3/ m256/m16bcst |
A |
V/V |
AVX512-FP16 AVX512VL |
Divide packed FP16 values in ymm2 by packed FP16 values in ymm3/m256/m16bcst, and store the result in ymm1 subject to writemask k1. |
EVEX.512.NP.MAP5.W0 5E /r VDIVPH zmm1{k1}{z}, zmm2, zmm3/ m512/m16bcst {er} |
A |
V/V |
AVX512-FP16 |
Divide packed FP16 values in zmm2 by packed FP16 values in zmm3/m512/m16bcst, and store the result in zmm1 subject to writemask k1. |
Op/En |
Tuple |
Operand 1 |
Operand 2 |
Operand 3 |
Operand 4 |
A |
Full |
ModRM:reg (w) |
VEX.vvvv (r) |
ModRM:r/m (r) |
N/A |
This instruction divides packed FP16 values from the first source operand by the corresponding elements in the second source operand, storing the packed FP16 result in the destination operand. The destination elements are updated according to the writemask.
VL = 128, 256 or 512 KL := VL/16 IF (VL = 512) AND (EVEX.b = 1): SET_RM(EVEX.RC) ELSE SET_RM(MXCSR.RC) FOR j := 0 TO KL-1: IF k1[j] OR *no writemask*: DEST.fp16[j] := SRC1.fp16[j] / SRC2.fp16[j] ELSE IF *zeroing*: DEST.fp16[j] := 0 // else dest.fp16[j] remains unchanged DEST[MAXVL-1:VL] := 0
VL = 128, 256 or 512 KL := VL/16 FOR j := 0 TO KL-1: IF k1[j] OR *no writemask*: IF EVEX.b = 1: DEST.fp16[j] := SRC1.fp16[j] / SRC2.fp16[0] ELSE: DEST.fp16[j] := SRC1.fp16[j] / SRC2.fp16[j] ELSE IF *zeroing*: DEST.fp16[j] := 0 // else dest.fp16[j] remains unchanged DEST[MAXVL-1:VL] := 0
VDIVPH __m128h _mm_div_ph (__m128h a, __m128h b); VDIVPH __m128h _mm_mask_div_ph (__m128h src, __mmask8 k, __m128h a, __m128h b); VDIVPH __m128h _mm_maskz_div_ph (__mmask8 k, __m128h a, __m128h b); VDIVPH __m256h _mm256_div_ph (__m256h a, __m256h b); VDIVPH __m256h _mm256_mask_div_ph (__m256h src, __mmask16 k, __m256h a, __m256h b); VDIVPH __m256h _mm256_maskz_div_ph (__mmask16 k, __m256h a, __m256h b); VDIVPH __m512h _mm512_div_ph (__m512h a, __m512h b); VDIVPH __m512h _mm512_mask_div_ph (__m512h src, __mmask32 k, __m512h a, __m512h b); VDIVPH __m512h _mm512_maskz_div_ph (__mmask32 k, __m512h a, __m512h b); VDIVPH __m512h _mm512_div_round_ph (__m512h a, __m512h b, int rounding); VDIVPH __m512h _mm512_mask_div_round_ph (__m512h src, __mmask32 k, __m512h a, __m512h b, int rounding); VDIVPH __m512h _mm512_maskz_div_round_ph (__mmask32 k, __m512h a, __m512h b, int rounding);
Invalid, Underflow, Overflow, Precision, Denormal, Zero.
EVEX-encoded instructions, see Table 2-46, "Type E2 Class Exception Conditions."