BF1CVTLT, BF2CVTLT

8-bit floating-point convert to BFloat16 (top)

Convert each odd-numbered 8-bit floating-point element of the source vector to BFloat16 while downscaling the value, and place the results in the overlapping 16-bit elements of the destination vector. BF1CVTLT scales the values by 2-UInt(FPMR.LSCALE[5:0]). BF2CVTLT scales the values by 2-UInt(FPMR.LSCALE2[5:0]).

The 8-bit floating-point encoding format for BF1CVTLT is selected by FPMR.F8S1. The 8-bit floating-point encoding format for BF2CVTLT is selected by FPMR.F8S2.

This instruction is unpredicated.

Encoding: BF1CVTLT

Variants: (FEAT_SVE2 || FEAT_SME2) && FEAT_FP8 ((FEAT_SVE2 || FEAT_SME2) && FEAT_FP8)

313029282726252423222120191817161514131211109876543210
0110010100001001001110
LopcZnZd

BF1CVTLT <Zd>.H, <Zn>.B

Decoding algorithm

if ((!IsFeatureImplemented(FEAT_SVE2) && !IsFeatureImplemented(FEAT_SME2)) ||
      !IsFeatureImplemented(FEAT_FP8)) then EndOfDecode(Decode_UNDEF);
constant integer esize = 16;
constant integer n = UInt(Zn);
constant integer d = UInt(Zd);
constant boolean issrc2 = FALSE;

Encoding: BF2CVTLT

Variants: (FEAT_SVE2 || FEAT_SME2) && FEAT_FP8 ((FEAT_SVE2 || FEAT_SME2) && FEAT_FP8)

313029282726252423222120191817161514131211109876543210
0110010100001001001111
LopcZnZd

BF2CVTLT <Zd>.H, <Zn>.B

Decoding algorithm

if ((!IsFeatureImplemented(FEAT_SVE2) && !IsFeatureImplemented(FEAT_SME2)) ||
      !IsFeatureImplemented(FEAT_FP8)) then EndOfDecode(Decode_UNDEF);
constant integer esize = 16;
constant integer n = UInt(Zn);
constant integer d = UInt(Zd);
constant boolean issrc2 = TRUE;

Operation

CheckFPMREnabled();
if IsFeatureImplemented(FEAT_SME2) then CheckSVEEnabled(); else CheckNonStreamingSVEEnabled();
constant integer VL = CurrentVL;
constant integer elements = VL DIV esize;
constant bits(VL) operand = Z[n, VL];
bits(VL) result;
for e = 0 to elements-1
    constant bits(esize DIV 2) element = Elem[operand, 2*e + 1, esize DIV 2];
    Elem[result, e, esize] = FP8ConvertBF(element, issrc2, FPCR, FPMR);

Z[d, VL] = result;

Explanations

<Zd>: Is the name of the destination scalable vector register, encoded in the "Zd" field.
<Zn>: Is the name of the source scalable vector register, encoded in the "Zn" field.