From nobody Sat Feb  7 07:15:23 2026
Delivered-To: importer@patchew.org
Authentication-Results: mx.zohomail.com;
	dkim=pass;
	spf=pass (zohomail.com: domain of gnu.org designates 209.51.188.17 as
 permitted sender)
  smtp.mailfrom=qemu-devel-bounces+importer=patchew.org@nongnu.org;
	dmarc=pass(p=reject dis=none)  header.from=sifive.com
ARC-Seal: i=1; a=rsa-sha256; t=1770182357; cv=none;
	d=zohomail.com; s=zohoarc;
	b=YRxCDwq7RJG77ROv5ykJD5WQnJS4cEtM3OEwKZlwxjJyUZ1/0teDTBdG+41w1JfYReYUZk34qMSEEeW+wFi7ZwEY9t1iqCFuUZU/ZkgfzbYtXxO3xFupAfjowpQBm1T2aqbPBjeSKytRWP9fgP4vXu/QHB/Ui94eNNTMt7YMrcM=
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=zohomail.com;
 s=zohoarc;
	t=1770182357;
 h=Content-Transfer-Encoding:Cc:Cc:Date:Date:From:From:In-Reply-To:List-Subscribe:List-Post:List-Id:List-Archive:List-Help:List-Unsubscribe:MIME-Version:Message-ID:References:Sender:Subject:Subject:To:To:Message-Id:Reply-To;
	bh=NBm8cJa+NaJXvp8Xo1trq7kshynnI+Xb2Mp3kpd0S7w=;
	b=KCLCvd+G83OlsFyiEM1R3qgP0wKJoHI35x2uFnX0ecVPi+Ek//dc+N0DtZ1DWmWj1Vz1M7433D+Ln3h55pb9MThYnD08/IDT2vVLMOht7rjBA8YkjE6QpuuSzj7VGcSz11QrlgQUv748zTc09QnkUOF/+9Fluh7yGBpprFix/E0=
ARC-Authentication-Results: i=1; mx.zohomail.com;
	dkim=pass;
	spf=pass (zohomail.com: domain of gnu.org designates 209.51.188.17 as
 permitted sender)
  smtp.mailfrom=qemu-devel-bounces+importer=patchew.org@nongnu.org;
	dmarc=pass header.from=<max.chou@sifive.com> (p=reject dis=none)
Return-Path: <qemu-devel-bounces+importer=patchew.org@nongnu.org>
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) by
 mx.zohomail.com
	with SMTPS id 1770182357437343.90230310708114;
 Tue, 3 Feb 2026 21:19:17 -0800 (PST)
Received: from localhost ([::1] helo=lists1p.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.90_1)
	(envelope-from <qemu-devel-bounces@nongnu.org>)
	id 1vnVHU-0007TY-Ch; Wed, 04 Feb 2026 00:18:24 -0500
Received: from eggs.gnu.org ([2001:470:142:3::10])
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <max.chou@sifive.com>)
 id 1vnVHS-0007Ru-I7
 for qemu-devel@nongnu.org; Wed, 04 Feb 2026 00:18:22 -0500
Received: from mail-pj1-x1031.google.com ([2607:f8b0:4864:20::1031])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.90_1) (envelope-from <max.chou@sifive.com>)
 id 1vnVHP-0007yj-Lu
 for qemu-devel@nongnu.org; Wed, 04 Feb 2026 00:18:22 -0500
Received: by mail-pj1-x1031.google.com with SMTP id
 98e67ed59e1d1-35305538592so5380010a91.0
 for <qemu-devel@nongnu.org>; Tue, 03 Feb 2026 21:18:19 -0800 (PST)
Received: from duncan.localdomain (114-35-142-126.hinet-ip.hinet.net.
 [114.35.142.126]) by smtp.gmail.com with ESMTPSA id
 98e67ed59e1d1-35485eb8d90sm1266314a91.12.2026.02.03.21.18.15
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Tue, 03 Feb 2026 21:18:17 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=sifive.com; s=google; t=1770182298; x=1770787098; darn=nongnu.org;
 h=content-transfer-encoding:mime-version:references:in-reply-to
 :message-id:date:subject:cc:to:from:from:to:cc:subject:date
 :message-id:reply-to;
 bh=NBm8cJa+NaJXvp8Xo1trq7kshynnI+Xb2Mp3kpd0S7w=;
 b=AtWcanGNGv7TfmeAgkaFSLQO/8Zv15ckLLggAAXiXmV70l6eul9kdmk2VoukwNMGh/
 0IANx5BMDuSMou2gXAs7+qyfwk81uFtNN0DI9UBmn0JWzcvcF8cSpAnsXTQD/H031fU3
 uIVBbho9DNagdZ3yP2z2yS+dbWR5mIn3NHExM5/WGHajFq26EU+AZrndfykTd6qVW9nO
 dsw+pQVlKJI1hTpoQMQ3MywcFLUaKj82YkjrHlsV+lt+WbeyIu5LcG64qrFLcZ2M4Sxe
 /j/LztOUFo6EgFCSOUl7CHMg+Ukr447IBMNft186XpOshLvGsMA752PvxCOLPj8zL+yp
 qO1w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1770182298; x=1770787098;
 h=content-transfer-encoding:mime-version:references:in-reply-to
 :message-id:date:subject:cc:to:from:x-gm-gg:x-gm-message-state:from
 :to:cc:subject:date:message-id:reply-to;
 bh=NBm8cJa+NaJXvp8Xo1trq7kshynnI+Xb2Mp3kpd0S7w=;
 b=j4xf2/ENw5BhKNgn/LCvtIRE2gsnYbl8yHnnlhGH6s4l9JLbpSvkojPrq+0ISyUnas
 Y3paNjXY+eGU8OD7uAi9wSzVBreBTQmUNeYCOt7bJZHSysJwcG9JYRCm8hIrLFPHDbSb
 q5Ns7LgHmgh3B7bPw4l4k9fLWx6V2OXO+N6cATqrfikVWwZ1oc86G1Za9wrjzaz/RF2a
 zzaInOrggWJyIQXAhByeZwU/QP817IVnysP8Bq3pyKF+tMVbyHDMlo1BKKkWD2gPbiFO
 LD5DWnJMliDueWbOrO5Ecy5VjfSksaP2dPXcoWE5r/9dz/Wp7uZGTRoCEVBN5Sr5n0hg
 BRKQ==
X-Gm-Message-State: AOJu0YwMhKw5dCE9W/hCEalJ44zt2OD9T/iy2ZZ0pi95ibrOD04LYOiJ
 /eevdc6EhfrBoH2Po6G7WQTXoboXH4m2ayxTd791MJSpVhPUfsfsfZPU8xKQ1CsmP6UdcOV82BH
 O1CcEOdOEg0XwgC5ynrwAC8PeLeUS4SMWOpIbbKvYKFOa5KiC+gDZpTtUCo1Gss9J/p3vQDb6A7
 m46qeVVU2Re/gTCPbMmM73Ca682MiioH7OFpedu0Z6z21J
X-Gm-Gg: AZuq6aLZ/51F/Zpk5ZqY0pL2yKmLKQoePUnhDkSJVyep7MUSPYD1PSz61F66Fj9eHYT
 mDGmh/OrizRO9UpimVhCPpe+ZuxZ+USW3BlECYMY+UkmKiOooF1DJpea+hFJHrJFmMpS6LpL6lV
 dO8KRSnxSzP3FxkOfQ9u/9eNK4pW8222PpN59SKpoErwf1IODaqPpveVjwsDKJNqgYQzjTXWLXl
 MF49rTzI+NxBynOeLRkgfFXNiezt1PlndUfyuyEyDRxF1kFT7YZjtCA9Dv7gnm4UsIMvxa3LSZ2
 0cil99M0W/Xmb1g5AQ+QOGdlDckaaUGr1Muqwd72DEcqdueK6eDweggQYcrxEural/WxRZWvXVo
 WtDxPoz1Uz5WBJQYrk3emZO8LJvIFeMhGkCDz49qG8mbhK7ZKf0WqukrhJf++O1k57OJch2rKID
 X7idDoXquHK+SUmKwW6gt+U+WqHgTN2fOqNaxFQ3ViXSsw1lfkH7FdfD6vYcsycPfJ5C09Lfcim
 DtdMnyKM5AbcSA=
X-Received: by 2002:a17:90b:48c9:b0:349:5b1b:78be with SMTP id
 98e67ed59e1d1-354871ae4d5mr1905593a91.17.1770182297997;
 Tue, 03 Feb 2026 21:18:17 -0800 (PST)
From: Max Chou <max.chou@sifive.com>
To: qemu-devel@nongnu.org,
	qemu-riscv@nongnu.org
Cc: Palmer Dabbelt <palmer@dabbelt.com>,
 Alistair Francis <Alistair.Francis@wdc.com>,
 Aurelien Jarno <aurelien@aurel32.net>,
 Peter Maydell <peter.maydell@linaro.org>,
 =?UTF-8?q?Alex=20Benn=C3=A9e?= <alex.bennee@linaro.org>,
 Paolo Bonzini <pbonzini@redhat.com>,
 Richard Henderson <richard.henderson@linaro.org>,
 Eduardo Habkost <eduardo@habkost.net>, Weiwei Li <liwei1518@gmail.com>,
 Daniel Henrique Barboza <dbarboza@ventanamicro.com>,
 Liu Zhiwei <zhiwei_liu@linux.alibaba.com>, Max Chou <max.chou@sifive.com>
Subject: [PATCH v3 05/19] fpu/softfloat: Support OCP(Open Compute Project)
 OFP8 data type
Date: Wed,  4 Feb 2026 13:17:41 +0800
Message-ID: <20260204051756.667397-6-max.chou@sifive.com>
X-Mailer: git-send-email 2.43.0
In-Reply-To: <20260204051756.667397-1-max.chou@sifive.com>
References: <20260204051756.667397-1-max.chou@sifive.com>
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable
Received-SPF: pass (zohomail.com: domain of gnu.org designates 209.51.188.17
 as permitted sender) client-ip=209.51.188.17;
 envelope-from=qemu-devel-bounces+importer=patchew.org@nongnu.org;
 helo=lists.gnu.org;
Received-SPF: pass client-ip=2607:f8b0:4864:20::1031;
 envelope-from=max.chou@sifive.com; helo=mail-pj1-x1031.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=unavailable autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: qemu development <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+importer=patchew.org@nongnu.org
Sender: qemu-devel-bounces+importer=patchew.org@nongnu.org
X-ZohoMail-DKIM: pass (identity @sifive.com)
X-ZM-MESSAGEID: 1770182359755154100
Content-Type: text/plain; charset="utf-8"

This commit provides the implementation defined behavior flags and the basic
operation support for the OCP float8 data types(E4M3 & E5M2).

According to the definition in OFP8 spec, the conversion from a wider
format infinity depends on the saturation mode defined in the spec.

Signed-off-by: Max Chou <max.chou@sifive.com>
---
 fpu/softfloat-parts.c.inc      | 159 +++++++++++++++++++++------
 fpu/softfloat-specialize.c.inc |  62 +++++++++++
 fpu/softfloat.c                | 191 +++++++++++++++++++++++++++++++--
 include/fpu/softfloat-types.h  |  12 +++
 include/fpu/softfloat.h        |  81 ++++++++++++++
 5 files changed, 467 insertions(+), 38 deletions(-)

diff --git a/fpu/softfloat-parts.c.inc b/fpu/softfloat-parts.c.inc
index 5e0438fc0b..eee7daae4d 100644
--- a/fpu/softfloat-parts.c.inc
+++ b/fpu/softfloat-parts.c.inc
@@ -227,11 +227,28 @@ static void partsN(canonicalize)(FloatPartsN *p, floa=
t_status *status,
             p->exp =3D fmt->frac_shift - fmt->exp_bias
                    - shift + !has_pseudo_denormals;
         }
-    } else if (likely(p->exp < fmt->exp_max) || fmt->arm_althp) {
+    } else if (likely(p->exp < fmt->exp_max)) {
         p->cls =3D float_class_normal;
         p->exp -=3D fmt->exp_bias;
         frac_shl(p, fmt->frac_shift);
         p->frac_hi |=3D DECOMPOSED_IMPLICIT_BIT;
+    } else if (fmt->limited_nan) {
+        /*
+         * Formats with limited NaN encodings (E4M3, E2M1, ARM Alt HP).
+         */
+        frac_shl(p, fmt->frac_shift);
+        p->frac_hi |=3D DECOMPOSED_IMPLICIT_BIT;
+        if (fmt->normal_frac_max =3D=3D NORMAL_FRAC_MAX_ALL ||
+            p->frac_hi <=3D fmt->normal_frac_max) {
+            p->cls =3D float_class_normal;
+            p->exp -=3D fmt->exp_bias;
+        } else {
+            if (parts_is_snan_frac(p->frac_hi, status)) {
+                p->cls =3D float_class_snan;
+            } else {
+                p->cls =3D float_class_qnan;
+            }
+        }
     } else if (likely(frac_eqz(p))) {
         p->cls =3D float_class_inf;
     } else {
@@ -241,14 +258,39 @@ static void partsN(canonicalize)(FloatPartsN *p, floa=
t_status *status,
     }
 }
=20
+/*
+ * Set FloatPartsN to the maximum normal value for the given format.
+ * - IEEE formats (!no_infinity): exp =3D exp_max - 1, frac =3D all ones
+ * - Limited NaN formats (E4M3): exp =3D exp_max, frac =3D normal_frac_max
+ * - No NaN/InF formats (E2M1, ARM AHP): exp =3D exp_max, frac =3D all ones
+ */
+static void partsN(set_max_normal)(FloatPartsN *p, const FloatFmt *fmt)
+{
+    if (!fmt->no_infinity) {
+        p->exp =3D fmt->exp_max - 1;
+        frac_allones(p);
+    } else if (fmt->normal_frac_max !=3D NORMAL_FRAC_MAX_ALL) {
+        p->exp =3D fmt->exp_max;
+        frac_clear(p);
+        p->frac_hi =3D fmt->normal_frac_max;
+    } else {
+        p->exp =3D fmt->exp_max;
+        frac_allones(p);
+    }
+}
+
 /*
  * Round and uncanonicalize a floating-point number by parts. There
  * are FRAC_SHIFT bits that may require rounding at the bottom of the
  * fraction; these bits will be removed. The exponent will be biased
  * by EXP_BIAS and must be bounded by [EXP_MAX-1, 0].
+ *
+ * The saturate parameter controls saturation behavior for formats that
+ * support it (OCP FP8 E4M3/E5M2). When true, overflow produces max normal
+ * instead of infinity (E5M2) or NaN (E4M3).
  */
 static void partsN(uncanon_normal)(FloatPartsN *p, float_status *s,
-                                   const FloatFmt *fmt)
+                                   const FloatFmt *fmt, bool saturate)
 {
     const int exp_max =3D fmt->exp_max;
     const int frac_shift =3D fmt->frac_shift;
@@ -256,8 +298,8 @@ static void partsN(uncanon_normal)(FloatPartsN *p, floa=
t_status *s,
     const uint64_t frac_lsb =3D round_mask + 1;
     const uint64_t frac_lsbm1 =3D round_mask ^ (round_mask >> 1);
     const uint64_t roundeven_mask =3D round_mask | frac_lsb;
+    bool overflow_norm =3D saturate;
     uint64_t inc;
-    bool overflow_norm =3D false;
     int exp, flags =3D 0;
=20
     switch (s->float_rounding_mode) {
@@ -313,30 +355,64 @@ static void partsN(uncanon_normal)(FloatPartsN *p, fl=
oat_status *s,
             }
             p->frac_lo &=3D ~round_mask;
         }
+        p->exp =3D exp;
=20
-        if (fmt->arm_althp) {
-            /* ARM Alt HP eschews Inf and NaN for a wider exponent.  */
-            if (unlikely(exp > exp_max)) {
-                /* Overflow.  Return the maximum normal.  */
-                flags =3D float_flag_invalid;
-                exp =3D exp_max;
-                frac_allones(p);
-                p->frac_lo &=3D ~round_mask;
+        /*
+         * Unified overflow handling based on format capabilities.
+         * 1. Format has infinity -> overflow to infinity (or saturate)
+         * 2. Format has NaN but no infinity -> overflow to NaN (or satura=
te)
+         * 3. Format has neither -> always saturate
+         */
+        if (!fmt->no_infinity) {
+            if (unlikely(exp >=3D exp_max)) {
+                flags |=3D float_flag_overflow;
+                if (s->rebias_overflow) {
+                    exp -=3D fmt->exp_re_bias;
+                } else if (overflow_norm) {
+                    flags |=3D float_flag_inexact;
+                    parts_set_max_normal(p, fmt);
+                    exp =3D p->exp;
+                    p->frac_lo &=3D ~round_mask;
+                } else {
+                    flags |=3D float_flag_inexact;
+                    p->cls =3D float_class_inf;
+                    exp =3D exp_max;
+                    frac_clear(p);
+                }
             }
-        } else if (unlikely(exp >=3D exp_max)) {
-            flags |=3D float_flag_overflow;
-            if (s->rebias_overflow) {
-                exp -=3D fmt->exp_re_bias;
-            } else if (overflow_norm) {
+        } else if (fmt_has_nan_encoding(fmt)) {
+            bool is_overflow =3D (exp > exp_max) ||
+                               (exp =3D=3D exp_max &&
+                                p->frac_hi > fmt->normal_frac_max);
+
+            if (unlikely(is_overflow)) {
+                flags |=3D float_flag_overflow;
                 flags |=3D float_flag_inexact;
-                exp =3D exp_max - 1;
-                frac_allones(p);
+
+                if (overflow_norm) {
+                    parts_set_max_normal(p, fmt);
+                    exp =3D p->exp;
+                } else {
+                    uint8_t dnan =3D s->default_nan_pattern;
+                    p->cls =3D float_class_qnan;
+                    p->sign =3D dnan >> 7;
+                    exp =3D exp_max;
+                    frac_allones(p);
+                }
+            }
+        } else {
+            if (unlikely(exp > exp_max)) {
+                if (fmt->overflow_raises_invalid) {
+                    /* ARM Alt HP: raise Invalid, not Overflow */
+                    flags =3D float_flag_invalid;
+                } else {
+                    flags |=3D float_flag_overflow;
+                    flags |=3D float_flag_inexact;
+                }
+
+                parts_set_max_normal(p, fmt);
+                exp =3D p->exp;
                 p->frac_lo &=3D ~round_mask;
-            } else {
-                flags |=3D float_flag_inexact;
-                p->cls =3D float_class_inf;
-                exp =3D exp_max;
-                frac_clear(p);
             }
         }
         frac_shr(p, frac_shift);
@@ -422,11 +498,11 @@ static void partsN(uncanon_normal)(FloatPartsN *p, fl=
oat_status *s,
     float_raise(flags, s);
 }
=20
-static void partsN(uncanon)(FloatPartsN *p, float_status *s,
-                            const FloatFmt *fmt)
+static void partsN(uncanon_sat)(FloatPartsN *p, float_status *s,
+                                const FloatFmt *fmt, bool saturate)
 {
     if (likely(is_anynorm(p->cls))) {
-        parts_uncanon_normal(p, s, fmt);
+        parts_uncanon_normal(p, s, fmt, saturate);
     } else {
         switch (p->cls) {
         case float_class_zero:
@@ -434,13 +510,30 @@ static void partsN(uncanon)(FloatPartsN *p, float_sta=
tus *s,
             frac_clear(p);
             return;
         case float_class_inf:
-            g_assert(!fmt->arm_althp);
-            p->exp =3D fmt->exp_max;
-            frac_clear(p);
+            /*
+             * Unified infinity handling using format capabilities.
+             * Formats with no_infinity must convert infinity to something=
 else
+             */
+            if (!fmt->no_infinity) {
+                p->exp =3D fmt->exp_max;
+                frac_clear(p);
+            } else if (fmt_has_nan_encoding(fmt)) {
+                if (saturate) {
+                    parts_set_max_normal(p, fmt);
+                } else {
+                    uint8_t dnan =3D s->default_nan_pattern;
+                    p->cls =3D float_class_qnan;
+                    p->sign =3D dnan >> 7;
+                    p->exp =3D fmt->exp_max;
+                    frac_allones(p);
+                }
+            } else {
+                parts_set_max_normal(p, fmt);
+            }
             return;
         case float_class_qnan:
         case float_class_snan:
-            g_assert(!fmt->arm_althp);
+            g_assert(fmt_has_nan_encoding(fmt));
             p->exp =3D fmt->exp_max;
             frac_shr(p, fmt->frac_shift);
             return;
@@ -451,6 +544,12 @@ static void partsN(uncanon)(FloatPartsN *p, float_stat=
us *s,
     }
 }
=20
+static void partsN(uncanon)(FloatPartsN *p, float_status *s,
+                            const FloatFmt *fmt)
+{
+    partsN(uncanon_sat)(p, s, fmt, false);
+}
+
 /*
  * Returns the result of adding or subtracting the values of the
  * floating-point values `a' and `b'. The operation is performed
diff --git a/fpu/softfloat-specialize.c.inc b/fpu/softfloat-specialize.c.inc
index 9ed968c79b..40c574283f 100644
--- a/fpu/softfloat-specialize.c.inc
+++ b/fpu/softfloat-specialize.c.inc
@@ -226,6 +226,68 @@ floatx80 floatx80_default_inf(bool zSign, float_status=
 *status)
     return packFloatx80(zSign, 0x7fff, z ? 0 : (1ULL << 63));
 }
=20
+/*------------------------------------------------------------------------=
----
+| Determine if a OCP FP8 E4M3 NaN is signaling NaN.
+| E4M3 has only one NaN encoding, so classification is policy-based.
+*-------------------------------------------------------------------------=
---*/
+
+static bool float8_e4m3_nan_is_snan(float8_e4m3 a, float_status *status)
+{
+    if (no_signaling_nans(status)) {
+        return false;
+    }
+    return snan_bit_is_one(status);
+}
+
+/*------------------------------------------------------------------------=
----
+| Returns 1 if the OCP FP8 E4M3 value `a' is a quiet NaN; otherwise return=
s 0.
+*-------------------------------------------------------------------------=
---*/
+
+bool float8_e4m3_is_quiet_nan(float8_e4m3 a_, float_status *status)
+{
+    return float8_e4m3_is_any_nan(a_) && !float8_e4m3_nan_is_snan(a_, stat=
us);
+}
+
+/*------------------------------------------------------------------------=
----
+| Returns 1 if the OCP FP8 E4M3 value `a' is a signaling NaN; otherwise 0.
+*-------------------------------------------------------------------------=
---*/
+
+bool float8_e4m3_is_signaling_nan(float8_e4m3 a_, float_status *status)
+{
+    return float8_e4m3_is_any_nan(a_) && float8_e4m3_nan_is_snan(a_, statu=
s);
+}
+
+/*------------------------------------------------------------------------=
----
+| Determine if a OCP FP8 E5M2 NaN is signaling NaN.
+*-------------------------------------------------------------------------=
---*/
+
+static bool float8_e5m2_nan_is_snan(float8_e5m2 a, float_status *status)
+{
+    if (no_signaling_nans(status)) {
+        return false;
+    }
+    bool frac_msb_is_one =3D (a >> 1) & 1;
+    return frac_msb_is_one =3D=3D snan_bit_is_one(status);
+}
+
+/*------------------------------------------------------------------------=
----
+| Returns 1 if the OCP FP8 E5M2 value `a' is a quiet NaN; otherwise return=
s 0.
+*-------------------------------------------------------------------------=
---*/
+
+bool float8_e5m2_is_quiet_nan(float8_e5m2 a_, float_status *status)
+{
+    return float8_e5m2_is_any_nan(a_) && !float8_e5m2_nan_is_snan(a_, stat=
us);
+}
+
+/*------------------------------------------------------------------------=
----
+| Returns 1 if the OCP FP8 E5M2 value `a' is a signaling NaN; otherwise 0.
+*-------------------------------------------------------------------------=
---*/
+
+bool float8_e5m2_is_signaling_nan(float8_e5m2 a_, float_status *status)
+{
+    return float8_e5m2_is_any_nan(a_) && float8_e5m2_nan_is_snan(a_, statu=
s);
+}
+
 /*------------------------------------------------------------------------=
----
 | Determine if a float16 NaN is signaling NaN.
 *-------------------------------------------------------------------------=
---*/
diff --git a/fpu/softfloat.c b/fpu/softfloat.c
index 8094358c2e..533f96dcda 100644
--- a/fpu/softfloat.c
+++ b/fpu/softfloat.c
@@ -522,6 +522,13 @@ typedef struct {
 #define DECOMPOSED_BINARY_POINT    63
 #define DECOMPOSED_IMPLICIT_BIT    (1ull << DECOMPOSED_BINARY_POINT)
=20
+/*
+ * Sentinel value for normal_frac_max indicating "all fraction values at
+ * exp_max are normal" (i.e., the format has no NaN encoding at exp_max).
+ * Used by E2M1 and ARM Alternative Half Precision formats.
+ */
+#define NORMAL_FRAC_MAX_ALL        0
+
 /* Structure holding all of the relevant parameters for a format.
  *   exp_size: the size of the exponent field
  *   exp_bias: the offset applied to the exponent field
@@ -542,11 +549,39 @@ typedef struct {
     int exp_max;
     int frac_size;
     int frac_shift;
-    bool arm_althp;
     bool has_explicit_bit;
     uint64_t round_mask;
+    /*
+     * Format capability flags:
+     * no_infinity: Format has no infinity encoding. When true, exp=3Dexp_=
max
+     *   with frac=3D0 is NOT infinity - it's either NaN or max normal.
+     *
+     * limited_nan: Format has limited or no NaN patterns. When combined
+     *   with normal_frac_max, determines NaN encoding capability:
+     *   - limited_nan=3Dfalse: Standard IEEE NaN (exp=3Dexp_max, frac!=3D=
0)
+     *   - limited_nan=3Dtrue && normal_frac_max!=3D0: Limited NaN (E4M3)
+     *   - limited_nan=3Dtrue && normal_frac_max=3D=3D0: No NaN encoding (=
AHP, E2M1)
+     *
+     * overflow_raises_invalid: Raise Invalid (not Overflow) exception.
+     *   ARM Alt HP uses this to signal overflow as an invalid operation.
+     *
+     * normal_frac_max: For formats with limited_nan, the maximum fraction
+     *   value (after normalization shift, including implicit bit) that is
+     *   still considered normal at exp=3Dexp_max.
+     *   Use NORMAL_FRAC_MAX_ALL (0) to indicate all frac values at exp_max
+     *   are normal (E2M1, ARM Alt HP), which also implies no NaN encoding.
+     */
+    bool no_infinity;
+    bool limited_nan;
+    bool overflow_raises_invalid;
+    uint64_t normal_frac_max;
 } FloatFmt;
=20
+static inline bool fmt_has_nan_encoding(const FloatFmt *fmt)
+{
+    return !fmt->limited_nan || fmt->normal_frac_max !=3D NORMAL_FRAC_MAX_=
ALL;
+}
+
 /* Expand fields based on the size of exponent and fraction */
 #define FLOAT_PARAMS_(E)                                \
     .exp_size       =3D E,                                \
@@ -560,13 +595,27 @@ typedef struct {
     .frac_shift     =3D (-F - 1) & 63,                    \
     .round_mask     =3D (1ull << ((-F - 1) & 63)) - 1
=20
+static const FloatFmt float8_e4m3_params =3D {
+    FLOAT_PARAMS(4, 3),
+    .no_infinity =3D true,
+    .limited_nan =3D true,
+    .normal_frac_max =3D 0xE000000000000000ULL,
+};
+
+static const FloatFmt float8_e5m2_params =3D {
+    FLOAT_PARAMS(5, 2),
+};
+
 static const FloatFmt float16_params =3D {
     FLOAT_PARAMS(5, 10)
 };
=20
 static const FloatFmt float16_params_ahp =3D {
     FLOAT_PARAMS(5, 10),
-    .arm_althp =3D true
+    .no_infinity =3D true,
+    .limited_nan =3D true,
+    .overflow_raises_invalid =3D true,
+    .normal_frac_max =3D NORMAL_FRAC_MAX_ALL,
 };
=20
 static const FloatFmt bfloat16_params =3D {
@@ -614,6 +663,16 @@ static void unpack_raw64(FloatParts64 *r, const FloatF=
mt *fmt, uint64_t raw)
     };
 }
=20
+static void QEMU_FLATTEN float8_e4m3_unpack_raw(FloatParts64 *p, float8_e4=
m3 f)
+{
+    unpack_raw64(p, &float8_e4m3_params, f);
+}
+
+static void QEMU_FLATTEN float8_e5m2_unpack_raw(FloatParts64 *p, float8_e5=
m2 f)
+{
+    unpack_raw64(p, &float8_e5m2_params, f);
+}
+
 static void QEMU_FLATTEN float16_unpack_raw(FloatParts64 *p, float16 f)
 {
     unpack_raw64(p, &float16_params, f);
@@ -671,6 +730,16 @@ static uint64_t pack_raw64(const FloatParts64 *p, cons=
t FloatFmt *fmt)
     return ret;
 }
=20
+static float8_e4m3 QEMU_FLATTEN float8_e4m3_pack_raw(const FloatParts64 *p)
+{
+    return make_float8_e4m3(pack_raw64(p, &float8_e4m3_params));
+}
+
+static float8_e5m2 QEMU_FLATTEN float8_e5m2_pack_raw(const FloatParts64 *p)
+{
+    return make_float8_e5m2(pack_raw64(p, &float8_e5m2_params));
+}
+
 static float16 QEMU_FLATTEN float16_pack_raw(const FloatParts64 *p)
 {
     return make_float16(pack_raw64(p, &float16_params));
@@ -758,12 +827,26 @@ static void parts128_canonicalize(FloatParts128 *p, f=
loat_status *status,
     PARTS_GENERIC_64_128(canonicalize, A)(A, S, F)
=20
 static void parts64_uncanon_normal(FloatParts64 *p, float_status *status,
-                                   const FloatFmt *fmt);
+                                   const FloatFmt *fmt, bool saturate);
 static void parts128_uncanon_normal(FloatParts128 *p, float_status *status,
-                                    const FloatFmt *fmt);
+                                    const FloatFmt *fmt, bool saturate);
+
+#define parts_uncanon_normal(A, S, F, SAT) \
+    PARTS_GENERIC_64_128(uncanon_normal, A)(A, S, F, SAT)
=20
-#define parts_uncanon_normal(A, S, F) \
-    PARTS_GENERIC_64_128(uncanon_normal, A)(A, S, F)
+static void parts64_uncanon_sat(FloatParts64 *p, float_status *status,
+                                const FloatFmt *fmt, bool saturate);
+static void parts128_uncanon_sat(FloatParts128 *p, float_status *status,
+                                 const FloatFmt *fmt, bool saturate);
+
+#define parts_uncanon_sat(A, S, F, SAT) \
+    PARTS_GENERIC_64_128(uncanon_sat, A)(A, S, F, SAT)
+
+static void parts64_set_max_normal(FloatParts64 *p, const FloatFmt *fmt);
+static void parts128_set_max_normal(FloatParts128 *p, const FloatFmt *fmt);
+
+#define parts_set_max_normal(P, F) \
+    PARTS_GENERIC_64_128(set_max_normal, P)(P, F)
=20
 static void parts64_uncanon(FloatParts64 *p, float_status *status,
                             const FloatFmt *fmt);
@@ -1662,6 +1745,20 @@ static const uint16_t rsqrt_tab[128] =3D {
  * Pack/unpack routines with a specific FloatFmt.
  */
=20
+static void float8_e4m3_unpack_canonical(FloatParts64 *p, float8_e4m3 f,
+                                         float_status *s)
+{
+    float8_e4m3_unpack_raw(p, f);
+    parts_canonicalize(p, s, &float8_e4m3_params);
+}
+
+static void float8_e5m2_unpack_canonical(FloatParts64 *p, float8_e5m2 f,
+                                         float_status *s)
+{
+    float8_e5m2_unpack_raw(p, f);
+    parts_canonicalize(p, s, &float8_e5m2_params);
+}
+
 static void float16a_unpack_canonical(FloatParts64 *p, float16 f,
                                       float_status *s, const FloatFmt *par=
ams)
 {
@@ -1682,6 +1779,24 @@ static void bfloat16_unpack_canonical(FloatParts64 *=
p, bfloat16 f,
     parts_canonicalize(p, s, &bfloat16_params);
 }
=20
+static float8_e4m3 float8_e4m3_round_pack_canonical(FloatParts64 *p,
+                                                    float_status *status,
+                                                    const FloatFmt *params,
+                                                    const bool saturate)
+{
+    parts_uncanon_sat(p, status, params, saturate);
+    return float8_e4m3_pack_raw(p);
+}
+
+static float8_e5m2 float8_e5m2_round_pack_canonical(FloatParts64 *p,
+                                                    float_status *status,
+                                                    const FloatFmt *params,
+                                                    const bool saturate)
+{
+    parts_uncanon_sat(p, status, params, saturate);
+    return float8_e5m2_pack_raw(p);
+}
+
 static float16 float16a_round_pack_canonical(FloatParts64 *p,
                                              float_status *s,
                                              const FloatFmt *params)
@@ -1838,7 +1953,7 @@ static floatx80 floatx80_round_pack_canonical(FloatPa=
rts128 *p,
     case float_class_normal:
     case float_class_denormal:
         if (s->floatx80_rounding_precision =3D=3D floatx80_precision_x) {
-            parts_uncanon_normal(p, s, fmt);
+            parts_uncanon_normal(p, s, fmt, false);
             frac =3D p->frac_hi;
             exp =3D p->exp;
         } else {
@@ -1847,7 +1962,7 @@ static floatx80 floatx80_round_pack_canonical(FloatPa=
rts128 *p,
             p64.sign =3D p->sign;
             p64.exp =3D p->exp;
             frac_truncjam(&p64, p);
-            parts_uncanon_normal(&p64, s, fmt);
+            parts_uncanon_normal(&p64, s, fmt, false);
             frac =3D p64.frac;
             exp =3D p64.exp;
         }
@@ -2823,6 +2938,66 @@ static void parts_float_to_float_widen(FloatParts128=
 *a, FloatParts64 *b,
     }
 }
=20
+bfloat16 float8_e4m3_to_bfloat16(float8_e4m3 a, float_status *s)
+{
+    FloatParts64 p;
+
+    float8_e4m3_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+
+    return bfloat16_round_pack_canonical(&p, s);
+}
+
+bfloat16 float8_e5m2_to_bfloat16(float8_e5m2 a, float_status *s)
+{
+    FloatParts64 p;
+
+    float8_e5m2_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+
+    return bfloat16_round_pack_canonical(&p, s);
+}
+
+float8_e4m3 bfloat16_to_float8_e4m3(bfloat16 a, bool saturate, float_statu=
s *s)
+{
+    FloatParts64 p;
+
+    bfloat16_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+    return float8_e4m3_round_pack_canonical(&p, s, &float8_e4m3_params,
+                                            saturate);
+}
+
+float8_e5m2 bfloat16_to_float8_e5m2(bfloat16 a, bool saturate, float_statu=
s *s)
+{
+    FloatParts64 p;
+
+    bfloat16_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+    return float8_e5m2_round_pack_canonical(&p, s, &float8_e5m2_params,
+                                            saturate);
+}
+
+float8_e4m3 float32_to_float8_e4m3(float32 a, bool saturate, float_status =
*s)
+{
+    FloatParts64 p;
+
+    float32_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+    return float8_e4m3_round_pack_canonical(&p, s, &float8_e4m3_params,
+                                            saturate);
+}
+
+float8_e5m2 float32_to_float8_e5m2(float32 a, bool saturate, float_status =
*s)
+{
+    FloatParts64 p;
+
+    float32_unpack_canonical(&p, a, s);
+    parts_float_to_float(&p, s);
+    return float8_e5m2_round_pack_canonical(&p, s, &float8_e5m2_params,
+                                            saturate);
+}
+
 float32 float16_to_float32(float16 a, bool ieee, float_status *s)
 {
     const FloatFmt *fmt16 =3D ieee ? &float16_params : &float16_params_ahp;
diff --git a/include/fpu/softfloat-types.h b/include/fpu/softfloat-types.h
index 8f82fdfc97..b781bf10b7 100644
--- a/include/fpu/softfloat-types.h
+++ b/include/fpu/softfloat-types.h
@@ -119,6 +119,18 @@ typedef struct {
  */
 typedef uint16_t bfloat16;
=20
+/*
+ * Software OCP(Open Compute Project) floating point types
+ */
+typedef uint8_t float8_e4m3;
+typedef uint8_t float8_e5m2;
+#define float8_e4m3_val(x) (x)
+#define float8_e5m2_val(x) (x)
+#define make_float8_e4m3(x) (x)
+#define make_float8_e5m2(x) (x)
+#define const_float8_e4m3(x) (x)
+#define const_float8_e5m2(x) (x)
+
 /*
  * Software IEC/IEEE floating-point underflow tininess-detection mode.
  */
diff --git a/include/fpu/softfloat.h b/include/fpu/softfloat.h
index ac6a392375..7abbf92b7e 100644
--- a/include/fpu/softfloat.h
+++ b/include/fpu/softfloat.h
@@ -189,6 +189,87 @@ float128 int128_to_float128(Int128, float_status *stat=
us);
 float128 uint64_to_float128(uint64_t, float_status *status);
 float128 uint128_to_float128(Int128, float_status *status);
=20
+/*------------------------------------------------------------------------=
----
+| Software OCP conversion routines.
+*-------------------------------------------------------------------------=
---*/
+
+bfloat16 float8_e4m3_to_bfloat16(float8_e4m3, float_status *status);
+bfloat16 float8_e5m2_to_bfloat16(float8_e5m2, float_status *status);
+float8_e4m3 bfloat16_to_float8_e4m3(bfloat16, bool saturate, float_status =
*status);
+float8_e5m2 bfloat16_to_float8_e5m2(bfloat16, bool saturate, float_status =
*status);
+float8_e4m3 float32_to_float8_e4m3(float32, bool saturate, float_status *s=
tatus);
+float8_e5m2 float32_to_float8_e5m2(float32, bool saturate, float_status *s=
tatus);
+
+/*------------------------------------------------------------------------=
----
+| Software OCP operations.
+*-------------------------------------------------------------------------=
---*/
+
+bool float8_e4m3_is_quiet_nan(float8_e4m3, float_status *status);
+bool float8_e4m3_is_signaling_nan(float8_e4m3, float_status *status);
+bool float8_e5m2_is_quiet_nan(float8_e5m2, float_status *status);
+bool float8_e5m2_is_signaling_nan(float8_e5m2, float_status *status);
+
+static inline bool float8_e4m3_is_any_nan(float8_e4m3 a)
+{
+    return ((float8_e4m3_val(a) & ~0x80) =3D=3D 0x7f);
+}
+
+static inline bool float8_e5m2_is_any_nan(float8_e5m2 a)
+{
+    return ((float8_e5m2_val(a) & ~0x80) > 0x7c);
+}
+
+static inline bool float8_e4m3_is_neg(float8_e4m3 a)
+{
+    return float8_e4m3_val(a) >> 7;
+}
+
+static inline bool float8_e5m2_is_neg(float8_e5m2 a)
+{
+    return float8_e5m2_val(a) >> 7;
+}
+
+static inline bool float8_e4m3_is_infinity(float8_e4m3 a)
+{
+    return false;
+}
+
+static inline bool float8_e5m2_is_infinity(float8_e5m2 a)
+{
+    return (float8_e5m2_val(a) & 0x7f) =3D=3D 0x7c;
+}
+
+static inline bool float8_e4m3_is_zero(float8_e4m3 a)
+{
+    return (float8_e4m3_val(a) & 0x7f) =3D=3D 0;
+}
+
+static inline bool float8_e5m2_is_zero(float8_e5m2 a)
+{
+    return (float8_e5m2_val(a) & 0x7f) =3D=3D 0;
+}
+
+static inline bool float8_e4m3_is_zero_or_denormal(float8_e4m3 a)
+{
+    return (float8_e4m3_val(a) & 0x78) =3D=3D 0;
+}
+
+static inline bool float8_e5m2_is_zero_or_denormal(float8_e5m2 a)
+{
+    return (float8_e5m2_val(a) & 0x7c) =3D=3D 0;
+}
+
+static inline bool float8_e4m3_is_normal(float8_e4m3 a)
+{
+    uint8_t em =3D float8_e4m3_val(a) & 0x7f;
+    return em >=3D 0x8 && em <=3D 0x7e;
+}
+
+static inline bool float8_e5m2_is_normal(float8_e5m2 a)
+{
+    return (((float8_e5m2_val(a) >> 2) + 1) & 0x1f) >=3D 2;
+}
+
 /*------------------------------------------------------------------------=
----
 | Software half-precision conversion routines.
 *-------------------------------------------------------------------------=
---*/
--=20
2.52.0