VGETEXPPH - Convert Exponents of Packed FP16 Values to FP16 Values

Opcode/ Instruction

Op/ En

64/32 bit Mode Support

CPUID Feature Flag

Description

EVEX.128.66.MAP6.W0 42 /r VGETEXPPH xmm1{k1}{z}, xmm2/ m128/m16bcst

A

V/V

AVX512-FP16 AVX512VL

Convert the exponent of FP16 values in the source operand to FP16 results representing unbiased integer exponents and stores the results in the destination register subject to writemask k1.

EVEX.256.66.MAP6.W0 42 /r VGETEXPPH ymm1{k1}{z}, ymm2/ m256/m16bcst

A

V/V

AVX512-FP16 AVX512VL

Convert the exponent of FP16 values in the source operand to FP16 results representing unbiased integer exponents and stores the results in the destination register subject to writemask k1.

EVEX.512.66.MAP6.W0 42 /r VGETEXPPH zmm1{k1}{z}, zmm2/ m512/m16bcst {sae}

A

V/V

AVX512-FP16

Convert the exponent of FP16 values in the source operand to FP16 results representing unbiased integer exponents and stores the results in the destination register subject to writemask k1.

Instruction Operand Encoding

Op/En

Tuple

Operand 1

Operand 2

Operand 3

Operand 4

A

Full

ModRM:reg (w)

ModRM:r/m (r)

N/A

N/A

Description

This instruction extracts the biased exponents from the normalized FP16 representation of each word element of the source operand (the second operand) as unbiased signed integer value, or convert the denormal representa- tion of input data to unbiased negative integer values. Each integer value of the unbiased exponent is converted to an FP16 value and written to the corresponding word elements of the destination operand (the first operand) as FP16 numbers.

The destination elements are updated according to the writemask.

Each GETEXP operation converts the exponent value into a floating-point number (permitting input value in denormal representation). Special cases of input values are listed in Table 5-5.

The formula is:

GETEXP(x) = floor(log2(|x|)) Notation floor(x) stands for maximal integer not exceeding real number x.

Software usage of VGETEXPxx and VGETMANTxx instructions generally involve a combination of GETEXP operation and GETMANT operation (see VGETMANTPH). Thus, the VGETEXPPH instruction does not require software to handle SIMD floating-point exceptions.

Table 5-6. VGETEXPPH/VGETEXPSH Special Cases

Input Operand

Result

Comments

src1 = NaN

QNaN(src1)

0 < |src1| < INF

floor(log2(|src1|))

If (SRC = SNaN), then #IE. If (SRC = denormal), then #DE.

| src1| = +INF

+INF

| src1| = 0

-INF

Operation

def normalize_exponent_tiny_fp16(src):
   jbit := 0
   // src & dst are FP16 numbers with sign(1b), exp(5b) and fraction (10b) fields
   dst.exp := 1                     // write bits 14:10
   dst.fraction := src.fraction // copy bits 9:0
   while jbit == 0:
       jbit := dst.fraction[9]         // msb of the fraction
       dst.fraction := dst.fraction << 1
       dst.exp := dst.exp - 1
   dst.fraction := 0
   return dst
def getexp_fp16(src):
   src.sign := 0                     // make positive
   exponent_all_ones := (src[14:10] == 0x1F)
   exponent_all_zeros := (src[14:10] == 0)
   mantissa_all_zeros := (src[9:0] == 0)
   zero := exponent_all_zeros and mantissa_all_zeros
   signaling_bit := src[9]
   nan := exponent_all_ones and not(mantissa_all_zeros)
   snan := nan and not(signaling_bit)
   qnan := nan and signaling_bit
   positive_infinity := not(negative) and exponent_all_ones and mantissa_all_zeros
   denormal := exponent_all_zeros and not(mantissa_all_zeros)
   if nan:
       if snan:
            MXCSR.IE := 1
       return qnan(src)             // convert snan to a qnan
   if positive_infinity:
       return src
   if zero:
       return -INF
   if denormal:
       tmp := normalize_exponent_tiny_fp16(src)
       MXCSR.DE := 1
   else:
       tmp := src
   tmp := SAR(tmp, 10)             // shift arithmetic right
   tmp := tmp - 15                  // subtract bias
   return convert_integer_to_fp16(tmp)

VGETEXPPH dest{k1}, src

VL = 128, 256 or 512
KL := VL/16
FOR i := 0 to KL-1:
   IF k1[i] or *no writemask*:
       IF SRC is memory and (EVEX.b = 1):
            tsrc := src.fp16[0]
       ELSE:
            tsrc := src.fp16[i]
       DEST.fp16[i] := getexp_fp16(tsrc)
   ELSE IF *zeroing*:
       DEST.fp16[i] := 0
   //else DEST.fp16[i] remains unchanged
DEST[MAXVL-1:VL] := 0 

Intel C/C++ Compiler Intrinsic Equivalent

VGETEXPPH __m128h _mm_getexp_ph (__m128h a);
VGETEXPPH __m128h _mm_mask_getexp_ph (__m128h src, __mmask8 k, __m128h a);
VGETEXPPH __m128h _mm_maskz_getexp_ph (__mmask8 k, __m128h a);
VGETEXPPH __m256h _mm256_getexp_ph (__m256h a);
VGETEXPPH __m256h _mm256_mask_getexp_ph (__m256h src, __mmask16 k, __m256h a);
VGETEXPPH __m256h _mm256_maskz_getexp_ph (__mmask16 k, __m256h a);
VGETEXPPH __m512h _mm512_getexp_ph (__m512h a);
VGETEXPPH __m512h _mm512_mask_getexp_ph (__m512h src, __mmask32 k, __m512h a);
VGETEXPPH __m512h _mm512_maskz_getexp_ph (__mmask32 k, __m512h a);
VGETEXPPH __m512h _mm512_getexp_round_ph (__m512h a, const int sae);
VGETEXPPH __m512h _mm512_mask_getexp_round_ph (__m512h src, __mmask32 k, __m512h a, const int sae);
VGETEXPPH __m512h _mm512_maskz_getexp_round_ph (__mmask32 k, __m512h a, const int sae);

SIMD Floating-Point Exceptions

Invalid, Denormal.

Other Exceptions

EVEX-encoded instructions, see Table 2-46, "Type E2 Class Exception Conditions."