Series comparison

-[PULL 00/32] target-arm queue
+[PULL 00/36] target-arm queue
-target-arm queue: the big stuff here is the final part of
+Hi; here's another arm pullreq; by volume most of this is
-rth's patches for Cortex-A76 and Neoverse-N1 support;
+refactoring from me, but there are also some bugfixes and
-also present are Gavin's NUMA series and a few other things.
+other bits and pieces here.
 thanks
 -- PMM
-The following changes since commit 554623226f800acf48a2ed568900c1c968ec9a8b:
+The following changes since commit ed734377ab3f3f3cc15d7aa301a87ab6370f2eed:
-  Merge tag 'qemu-sparc-20220508' of https://github.com/mcayland/qemu into staging (2022-05-08 17:03:26 -0500)
+  Merge tag 'linux-user-fix-gupnp-pull-request' of https://github.com/hdeller/qemu-hppa into staging (2025-01-24 14:43:07 -0500)
 are available in the Git repository at:
-  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20220509
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20250128-1
-for you to fetch changes up to ae9141d4a3265553503bf07d3574b40f84615a34:
+for you to fetch changes up to 664280abddcb3cacc9c6204706bb739fcc1316f7:
-  hw/acpi/aml-build: Use existing CPU topology to build PPTT table (2022-05-09 11:47:55 +0100)
+  hw/usb/canokey: Fix buffer overflow for OUT packet (2025-01-28 18:40:19 +0000)
 ----------------------------------------------------------------
 target-arm queue:
- * MAINTAINERS/.mailmap: update email for Leif Lindholm
+ * hw/arm: Remove various uses of first_cpu global
- * hw/arm: add version information to sbsa-ref machine DT
+ * hw/char/imx_serial: Fix reset value of UFCR register
- * Enable new features for -cpu max:
+ * hw/char/imx_serial: Update all state before restarting ageing timer
-   FEAT_Debugv8p2, FEAT_Debugv8p4, FEAT_RAS (minimal version only),
+ * hw/pci-host/designware: Expose MSI IRQ
-   FEAT_IESB, FEAT_CSV2, FEAT_CSV2_2, FEAT_CSV3, FEAT_DGH
+ * hw/arm/stellaris: refactoring, cleanup
- * Emulate Cortex-A76
+ * hw/arm/stellaris: map both I2C controllers
- * Emulate Neoverse-N1
+ * tests/functional: Add a test for the arm microbit machine
- * Fix the virt board default NUMA topology
+ * target/arm: arm_reset_sve_state() should set FPSR, not FPCR
  * target/arm: refactorings preparatory to FEAT_AFP implementation
  * fpu: Rename float_flag_input_denormal to float_flag_input_denormal_flushed
  * fpu: Rename float_flag_output_denormal to float_flag_output_denormal_flushed
  * hw/usb/canokey: Fix buffer overflow for OUT packet
 ----------------------------------------------------------------
-Gavin Shan (6):
+Bernhard Beschow (3):
-      qapi/machine.json: Add cluster-id
+      hw/char/imx_serial: Fix reset value of UFCR register
-      qtest/numa-test: Specify CPU topology in aarch64_numa_cpu()
+      hw/char/imx_serial: Update all state before restarting ageing timer
-      hw/arm/virt: Consider SMP configuration in CPU topology
+      hw/pci-host/designware: Expose MSI IRQ
       qtest/numa-test: Correct CPU and NUMA association in aarch64_numa_cpu()
       hw/arm/virt: Fix CPU's default NUMA node ID
       hw/acpi/aml-build: Use existing CPU topology to build PPTT table
-Leif Lindholm (2):
+Hongren Zheng (1):
-      MAINTAINERS/.mailmap: update email for Leif Lindholm
+      hw/usb/canokey: Fix buffer overflow for OUT packet
       hw/arm: add versioning to sbsa-ref machine DT
-Richard Henderson (24):
+Peter Maydell (22):
-      target/arm: Handle cpreg registration for missing EL
+      target/arm: arm_reset_sve_state() should set FPSR, not FPCR
-      target/arm: Drop EL3 no EL2 fallbacks
+      target/arm: Use FPSR_ constants in vfp_exceptbits_from_host()
-      target/arm: Merge zcr reginfo
+      target/arm: Use uint32_t in vfp_exceptbits_from_host()
-      target/arm: Adjust definition of CONTEXTIDR_EL2
+      target/arm: Define new fp_status_a32 and fp_status_a64
-      target/arm: Move cortex impdef sysregs to cpu_tcg.c
+      target/arm: Use vfp.fp_status_a64 in A64-only helper functions
-      target/arm: Update qemu-system-arm -cpu max to cortex-a57
+      target/arm: Use fp_status_a64 or fp_status_a32 in is_ebf()
-      target/arm: Set ID_DFR0.PerfMon for qemu-system-arm -cpu max
+      target/arm: Use fp_status_a32 in vjvct helper
-      target/arm: Split out aa32_max_features
+      target/arm: Use fp_status_a32 in vfp_cmp helpers
-      target/arm: Annotate arm_max_initfn with FEAT identifiers
+      target/arm: Use FPST_A32 in A32 decoder
-      target/arm: Use field names for manipulating EL2 and EL3 modes
+      target/arm: Use FPST_A64 in A64 decoder
-      target/arm: Enable FEAT_Debugv8p2 for -cpu max
+      target/arm: Remove now-unused vfp.fp_status and FPST_FPCR
-      target/arm: Enable FEAT_Debugv8p4 for -cpu max
+      target/arm: Define new fp_status_f16_a32 and fp_status_f16_a64
-      target/arm: Add minimal RAS registers
+      target/arm: Use fp_status_f16_a32 in AArch32-only helpers
-      target/arm: Enable SCR and HCR bits for RAS
+      target/arm: Use fp_status_f16_a64 in AArch64-only helpers
-      target/arm: Implement virtual SError exceptions
+      target/arm: Use FPST_A32_F16 in A32 decoder
-      target/arm: Implement ESB instruction
+      target/arm: Use FPST_A64_F16 in A64 decoder
-      target/arm: Enable FEAT_RAS for -cpu max
+      target/arm: Remove now-unused vfp.fp_status_f16 and FPST_FPCR_F16
-      target/arm: Enable FEAT_IESB for -cpu max
+      fpu: Rename float_flag_input_denormal to float_flag_input_denormal_flushed
-      target/arm: Enable FEAT_CSV2 for -cpu max
+      fpu: Rename float_flag_output_denormal to float_flag_output_denormal_flushed
-      target/arm: Enable FEAT_CSV2_2 for -cpu max
+      fpu: Fix a comment in softfloat-types.h
-      target/arm: Enable FEAT_CSV3 for -cpu max
+      target/arm: Remove redundant advsimd float16 helpers
-      target/arm: Enable FEAT_DGH for -cpu max
+      target/arm: Use FPST_A64_F16 for halfprec-to-other conversions
       target/arm: Define cortex-a76
       target/arm: Define neoverse-n1
- docs/system/arm/emulation.rst |  10 +
+Philippe Mathieu-Daudé (9):
- docs/system/arm/virt.rst      |   2 +
+      hw/arm/nrf51: Rename ARMv7MState 'cpu' -> 'armv7m'
- qapi/machine.json             |   6 +-
+      hw/arm/stellaris: Add 'armv7m' local variable
- target/arm/cpregs.h           |  11 +
+      hw/arm/v7m: Remove use of &first_cpu in machine_init()
- target/arm/cpu.h              |  23 ++
+      hw/arm/stellaris: Link each board schematic
- target/arm/helper.h           |   1 +
+      hw/arm/stellaris: Constify read-only arrays
- target/arm/internals.h        |  16 ++
+      hw/arm/stellaris: Remove incorrect unimplemented i2c-0 at 0x40002000
- target/arm/syndrome.h         |   5 +
+      hw/arm/stellaris: Replace magic numbers by definitions
- target/arm/a32.decode         |  16 +-
+      hw/arm/stellaris: Use DEVCAP macro to access DeviceCapability registers
- target/arm/t32.decode         |  18 +-
+      hw/arm/stellaris: Map both I2C controllers
- hw/acpi/aml-build.c           | 111 ++++----
- hw/arm/sbsa-ref.c             |  16 ++
+Thomas Huth (1):
- hw/arm/virt.c                 |  21 +-
+      tests/functional: Add a test for the arm microbit machine
- hw/core/machine-hmp-cmds.c    |   4 +
- hw/core/machine.c             |  16 ++
+ MAINTAINERS                           |   1 +
- target/arm/cpu.c              |  66 ++++-
+ hw/usb/canokey.h                      |   4 --
- target/arm/cpu64.c            | 353 ++++++++++++++-----------
+ include/fpu/softfloat-types.h         |  10 +--
- target/arm/cpu_tcg.c          | 227 +++++++++++-----
+ include/hw/arm/fsl-imx6.h             |   4 +-
- target/arm/helper.c           | 600 +++++++++++++++++++++++++-----------------
+ include/hw/arm/fsl-imx7.h             |   4 +-
- target/arm/op_helper.c        |  43 +++
+ include/hw/arm/nrf51_soc.h            |   2 +-
- target/arm/translate-a64.c    |  18 ++
+ include/hw/char/imx_serial.h          |   2 +-
- target/arm/translate.c        |  23 ++
+ include/hw/pci-host/designware.h      |   1 +
- tests/qtest/numa-test.c       |  19 +-
+ target/arm/cpu.h                      |  12 ++--
- .mailmap                      |   3 +-
+ target/arm/tcg/helper-a64.h           |   8 ---
- MAINTAINERS                   |   2 +-
+ target/arm/tcg/translate.h            |  32 ++++++---
-files changed, 1068 insertions(+), 562 deletions(-)
+ fpu/softfloat.c                       |   6 +-
  hw/arm/b-l475e-iot01a.c               |   2 +-
  hw/arm/fsl-imx6.c                     |  13 +++-
  hw/arm/fsl-imx7.c                     |  13 +++-
  hw/arm/microbit.c                     |   2 +-
  hw/arm/mps2-tz.c                      |   2 +-
  hw/arm/mps2.c                         |   2 +-
  hw/arm/msf2-som.c                     |   2 +-
  hw/arm/musca.c                        |   2 +-
  hw/arm/netduino2.c                    |   2 +-
  hw/arm/netduinoplus2.c                |   2 +-
  hw/arm/nrf51_soc.c                    |  18 ++---
  hw/arm/olimex-stm32-h405.c            |   2 +-
  hw/arm/stellaris.c                    | 118 +++++++++++++++++++-----------
  hw/arm/stm32vldiscovery.c             |   2 +-
  hw/char/imx_serial.c                  |   7 +-
  hw/pci-host/designware.c              |   7 +-
  hw/usb/canokey.c                      |   6 +-
  target/arm/cpu.c                      |   6 +-
  target/arm/helper.c                   |   2 +-
  target/arm/tcg/helper-a64.c           |   9 ---
  target/arm/tcg/sme_helper.c           |   6 +-
  target/arm/tcg/sve_helper.c           |   6 +-
  target/arm/tcg/translate-a64.c        | 103 ++++++++++++++-------------
  target/arm/tcg/translate-sme.c        |   4 +-
  target/arm/tcg/translate-sve.c        | 130 +++++++++++++++++-----------------
  target/arm/tcg/translate-vfp.c        |  78 ++++++++++----------
  target/arm/tcg/vec_helper.c           |  22 +++---
  target/arm/vfp_helper.c               |  73 +++++++++++--------
  target/i386/tcg/fpu_helper.c          |   8 +--
  target/m68k/fpu_helper.c              |   2 +-
  target/mips/tcg/msa_helper.c          |   4 +-
  target/rx/op_helper.c                 |   4 +-
  target/tricore/fpu_helper.c           |   6 +-
  fpu/softfloat-parts.c.inc             |   4 +-
  hw/arm/Kconfig                        |   2 +
  tests/functional/meson.build          |   1 +
  tests/functional/test_arm_microbit.py |  31 ++++++++
 files changed, 452 insertions(+), 337 deletions(-)
  create mode 100755 tests/functional/test_arm_microbit.py

-[PULL 11/32] target/arm: Use field names for manipulating EL2 and EL3 modes
+[PULL 01/36] hw/arm/nrf51: Rename ARMv7MState 'cpu' -> 'armv7m'
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Use FIELD_DP{32,64} to manipulate id_pfr1 and id_aa64pfr0
+The ARMv7MState object is not simply a CPU, it also
-during arm_cpu_realizefn.
+contains the NVIC, SysTick timer, and various MemoryRegions.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Rename the field as 'armv7m', like other Cortex-M boards.
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-11-richard.henderson@linaro.org
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
 Message-id: 20250112225614.33723-2-philmd@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.c | 22 +++++++++++++---------
+ include/hw/arm/nrf51_soc.h |  2 +-
- 1 file changed, 13 insertions(+), 9 deletions(-)
+ hw/arm/nrf51_soc.c         | 18 +++++++++---------
  2 files changed, 10 insertions(+), 10 deletions(-)
-diff --git a/target/arm/cpu.c b/target/arm/cpu.c
+diff --git a/include/hw/arm/nrf51_soc.h b/include/hw/arm/nrf51_soc.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.c
+--- a/include/hw/arm/nrf51_soc.h
-+++ b/target/arm/cpu.c
++++ b/include/hw/arm/nrf51_soc.h
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
+@@ -XXX,XX +XXX,XX @@ struct NRF51State {
-          */
+     SysBusDevice parent_obj;
-         unset_feature(env, ARM_FEATURE_EL3);
+     /*< public >*/
--        /* Disable the security extension feature bits in the processor feature
+-    ARMv7MState cpu;
--         * registers as well. These are id_pfr1[7:4] and id_aa64pfr0[15:12].
++    ARMv7MState armv7m;
-+        /*
-+         * Disable the security extension feature bits in the processor
+     NRF51UARTState uart;
-+         * feature registers as well.
+     NRF51RNGState rng;
-          */
+diff --git a/hw/arm/nrf51_soc.c b/hw/arm/nrf51_soc.c
--        cpu->isar.id_pfr1 &= ~0xf0;
+index XXXXXXX..XXXXXXX 100644
--        cpu->isar.id_aa64pfr0 &= ~0xf000;
+--- a/hw/arm/nrf51_soc.c
-+        cpu->isar.id_pfr1 = FIELD_DP32(cpu->isar.id_pfr1, ID_PFR1, SECURITY, 0);
++++ b/hw/arm/nrf51_soc.c
-+        cpu->isar.id_aa64pfr0 = FIELD_DP64(cpu->isar.id_aa64pfr0,
+@@ -XXX,XX +XXX,XX @@ static void nrf51_soc_realize(DeviceState *dev_soc, Error **errp)
 +                                           ID_AA64PFR0, EL3, 0);
      }
+     /* This clock doesn't need migration because it is fixed-frequency */
-     if (!cpu->has_el2) {
+     clock_set_hz(s->sysclk, HCLK_FRQ);
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
+-    qdev_connect_clock_in(DEVICE(&s->cpu), "cpuclk", s->sysclk);
 +    qdev_connect_clock_in(DEVICE(&s->armv7m), "cpuclk", s->sysclk);
      /*
       * This SoC has no systick device, so don't connect refclk.
       * TODO: model the lack of systick (currently the armv7m object
       * will always provide one).
       */
 -    object_property_set_link(OBJECT(&s->cpu), "memory", OBJECT(&s->container),
 +    object_property_set_link(OBJECT(&s->armv7m), "memory", OBJECT(&s->container),
                               &error_abort);
 -    if (!sysbus_realize(SYS_BUS_DEVICE(&s->cpu), errp)) {
 +    if (!sysbus_realize(SYS_BUS_DEVICE(&s->armv7m), errp)) {
          return;
      }
-     if (!arm_feature(env, ARM_FEATURE_EL2)) {
+@@ -XXX,XX +XXX,XX @@ static void nrf51_soc_realize(DeviceState *dev_soc, Error **errp)
--        /* Disable the hypervisor feature bits in the processor feature
+     mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(&s->uart), 0);
--         * registers if we don't have EL2. These are id_pfr1[15:12] and
+     memory_region_add_subregion_overlap(&s->container, NRF51_UART_BASE, mr, 0);
--         * id_aa64pfr0_el1[11:8].
+     sysbus_connect_irq(SYS_BUS_DEVICE(&s->uart), 0,
-+        /*
+-                       qdev_get_gpio_in(DEVICE(&s->cpu),
-+         * Disable the hypervisor feature bits in the processor feature
++                       qdev_get_gpio_in(DEVICE(&s->armv7m),
-+         * registers if we don't have EL2.
+                        BASE_TO_IRQ(NRF51_UART_BASE)));
-          */
--        cpu->isar.id_aa64pfr0 &= ~0xf00;
+     /* RNG */
--        cpu->isar.id_pfr1 &= ~0xf000;
+@@ -XXX,XX +XXX,XX @@ static void nrf51_soc_realize(DeviceState *dev_soc, Error **errp)
-+        cpu->isar.id_aa64pfr0 = FIELD_DP64(cpu->isar.id_aa64pfr0,
+     mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(&s->rng), 0);
-+                                           ID_AA64PFR0, EL2, 0);
+     memory_region_add_subregion_overlap(&s->container, NRF51_RNG_BASE, mr, 0);
-+        cpu->isar.id_pfr1 = FIELD_DP32(cpu->isar.id_pfr1,
+     sysbus_connect_irq(SYS_BUS_DEVICE(&s->rng), 0,
-+                                       ID_PFR1, VIRTUALIZATION, 0);
+-                       qdev_get_gpio_in(DEVICE(&s->cpu),
 +                       qdev_get_gpio_in(DEVICE(&s->armv7m),
                         BASE_TO_IRQ(NRF51_RNG_BASE)));
      /* UICR, FICR, NVMC, FLASH */
@@ -XXX,XX +XXX,XX @@ static void nrf51_soc_realize(DeviceState *dev_soc, Error **errp)
          sysbus_mmio_map(SYS_BUS_DEVICE(&s->timer[i]), 0, base_addr);
          sysbus_connect_irq(SYS_BUS_DEVICE(&s->timer[i]), 0,
 -                           qdev_get_gpio_in(DEVICE(&s->cpu),
 +                           qdev_get_gpio_in(DEVICE(&s->armv7m),
                                              BASE_TO_IRQ(base_addr)));
      }
- #ifndef CONFIG_USER_ONLY
+@@ -XXX,XX +XXX,XX @@ static void nrf51_soc_init(Object *obj)
      memory_region_init(&s->container, obj, "nrf51-container", UINT64_MAX);
 -    object_initialize_child(OBJECT(s), "armv6m", &s->cpu, TYPE_ARMV7M);
 -    qdev_prop_set_string(DEVICE(&s->cpu), "cpu-type",
 +    object_initialize_child(OBJECT(s), "armv6m", &s->armv7m, TYPE_ARMV7M);
 +    qdev_prop_set_string(DEVICE(&s->armv7m), "cpu-type",
                           ARM_CPU_TYPE_NAME("cortex-m0"));
 -    qdev_prop_set_uint32(DEVICE(&s->cpu), "num-irq", 32);
 +    qdev_prop_set_uint32(DEVICE(&s->armv7m), "num-irq", 32);
      object_initialize_child(obj, "uart", &s->uart, TYPE_NRF51_UART);
      object_property_add_alias(obj, "serial0", OBJECT(&s->uart), "chardev");
 --
-.25.1
+.34.1
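
The removed side of this comparison (old patch 11/32) replaces open-coded masks such as `id_pfr1 &= ~0xf0` with named field deposits. The sketch below illustrates why the two forms are equivalent for a 32-bit register. It is a minimal stand-alone illustration, not QEMU's FIELD_DP32 macro: the `Field` type, the `PFR1_*` constants and `field_deposit32()` are hypothetical names, and the bit positions are the ones given in the removed comments (id_pfr1[7:4] and id_pfr1[15:12]).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical field descriptor: bit offset and width within a register. */
typedef struct {
    unsigned shift;
    unsigned length;
} Field;

/* Bit positions taken from the removed comments in the old patch. */
static const Field PFR1_SECURITY = { 4, 4 };        /* id_pfr1[7:4]   */
static const Field PFR1_VIRTUALIZATION = { 12, 4 }; /* id_pfr1[15:12] */

/* Return 'reg' with the bits of field 'f' replaced by 'val'. */
static uint32_t field_deposit32(uint32_t reg, Field f, uint32_t val)
{
    /* This sketch only handles fields narrower than the register. */
    assert(f.length > 0 && f.length < 32 && f.shift + f.length <= 32);

    uint32_t mask = ((UINT32_C(1) << f.length) - 1) << f.shift;

    /* Clear the field, then insert the (truncated) new value. */
    return (reg & ~mask) | ((val << f.shift) & mask);
}
```

Depositing 0 into `PFR1_SECURITY` gives the same result as `reg & ~0xf0`, but the field name documents the intent and removes the need for a comment listing bit ranges. The 64-bit case (id_aa64pfr0) would follow the same pattern with `uint64_t`.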

-[PULL 30/32] qtest/numa-test: Correct CPU and NUMA association in aarch64_numa_cpu()
+[PULL 02/36] hw/arm/stellaris: Add 'armv7m' local variable
-From: Gavin Shan <gshan@redhat.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-In aarch64_numa_cpu(), the CPU and NUMA association is something
+While the TYPE_ARMV7M object forwards its NVIC interrupt lines,
-like below. Two threads in the same core/cluster/socket are
+it is somewhat misleading to name it 'nvic'. Add the 'armv7m'
-associated with two individual NUMA nodes, which is unreal as
+local variable for clarity, but also keep the 'nvic' variable
-Igor Mammedov mentioned. We don't expect the association to break
+behaving like before when used for wiring IRQ lines.
 NUMA-to-socket boundary, which matches with the real world.
-    NUMA-node  socket  cluster   core   thread
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-    ------------------------------------------
+Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-        1        0        0        0      0
+Message-id: 20250112225614.33723-3-philmd@linaro.org
         0        0        0        0      1
 This corrects the topology for CPUs and their association with
 NUMA nodes. After this patch is applied, the CPU and NUMA
 association becomes something like below, which looks real.
 Besides, socket/cluster/core/thread IDs are all checked when
 the NUMA node IDs are verified. It helps to check if the CPU
 topology is properly populated or not.
     NUMA-node  socket  cluster   core   thread
     ------------------------------------------
         0        1        0        0       0
         1        0        0        0       0
 Suggested-by: Igor Mammedov <imammedo@redhat.com>
 Signed-off-by: Gavin Shan <gshan@redhat.com>
 Acked-by: Igor Mammedov <imammedo@redhat.com>
 Message-id: 20220503140304.855514-5-gshan@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- tests/qtest/numa-test.c | 18 ++++++++++++------
+ hw/arm/stellaris.c | 21 +++++++++++----------
- 1 file changed, 12 insertions(+), 6 deletions(-)
+ 1 file changed, 11 insertions(+), 10 deletions(-)
-diff --git a/tests/qtest/numa-test.c b/tests/qtest/numa-test.c
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/tests/qtest/numa-test.c
+--- a/hw/arm/stellaris.c
-+++ b/tests/qtest/numa-test.c
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_numa_cpu(const void *data)
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     g_autofree char *cli = NULL;
+      */
-     cli = make_cli(data, "-machine "
+     Object *soc_container;
--        "smp.cpus=2,smp.sockets=1,smp.clusters=1,smp.cores=1,smp.threads=2 "
+-    DeviceState *gpio_dev[7], *nvic;
-+        "smp.cpus=2,smp.sockets=2,smp.clusters=1,smp.cores=1,smp.threads=1 "
++    DeviceState *gpio_dev[7], *armv7m, *nvic;
-         "-numa node,nodeid=0,memdev=ram -numa node,nodeid=1 "
+     qemu_irq gpio_in[7][8];
--        "-numa cpu,node-id=1,thread-id=0 "
+     qemu_irq gpio_out[7][8];
--        "-numa cpu,node-id=0,thread-id=1");
+     qemu_irq adc;
-+        "-numa cpu,node-id=0,socket-id=1,cluster-id=0,core-id=0,thread-id=0 "
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-+        "-numa cpu,node-id=1,socket-id=0,cluster-id=0,core-id=0,thread-id=0");
+     qdev_prop_set_uint32(ssys_dev, "dc4", board->dc4);
-     qts = qtest_init(cli);
+     sysbus_realize_and_unref(SYS_BUS_DEVICE(ssys_dev), &error_fatal);
-     cpus = get_cpus(qts, &resp);
-     g_assert(cpus);
+-    nvic = qdev_new(TYPE_ARMV7M);
+-    object_property_add_child(soc_container, "v7m", OBJECT(nvic));
-     while ((e = qlist_pop(cpus))) {
+-    qdev_prop_set_uint32(nvic, "num-irq", NUM_IRQ_LINES);
-         QDict *cpu, *props;
+-    qdev_prop_set_uint8(nvic, "num-prio-bits", NUM_PRIO_BITS);
--        int64_t thread, node;
+-    qdev_prop_set_string(nvic, "cpu-type", ms->cpu_type);
-+        int64_t socket, cluster, core, thread, node;
+-    qdev_prop_set_bit(nvic, "enable-bitband", true);
+-    qdev_connect_clock_in(nvic, "cpuclk",
-         cpu = qobject_to(QDict, e);
++    armv7m = qdev_new(TYPE_ARMV7M);
-         g_assert(qdict_haskey(cpu, "props"));
++    object_property_add_child(soc_container, "v7m", OBJECT(armv7m));
-@@ -XXX,XX +XXX,XX @@ static void aarch64_numa_cpu(const void *data)
++    qdev_prop_set_uint32(armv7m, "num-irq", NUM_IRQ_LINES);
++    qdev_prop_set_uint8(armv7m, "num-prio-bits", NUM_PRIO_BITS);
-         g_assert(qdict_haskey(props, "node-id"));
++    qdev_prop_set_string(armv7m, "cpu-type", ms->cpu_type);
-         node = qdict_get_int(props, "node-id");
++    qdev_prop_set_bit(armv7m, "enable-bitband", true);
-+        g_assert(qdict_haskey(props, "socket-id"));
++    qdev_connect_clock_in(armv7m, "cpuclk",
-+        socket = qdict_get_int(props, "socket-id");
+                           qdev_get_clock_out(ssys_dev, "SYSCLK"));
-+        g_assert(qdict_haskey(props, "cluster-id"));
+     /* This SoC does not connect the systick reference clock */
-+        cluster = qdict_get_int(props, "cluster-id");
+-    object_property_set_link(OBJECT(nvic), "memory",
-+        g_assert(qdict_haskey(props, "core-id"));
++    object_property_set_link(OBJECT(armv7m), "memory",
-+        core = qdict_get_int(props, "core-id");
+                              OBJECT(get_system_memory()), &error_abort);
-         g_assert(qdict_haskey(props, "thread-id"));
+     /* This will exit with an error if the user passed us a bad cpu_type */
-         thread = qdict_get_int(props, "thread-id");
+-    sysbus_realize_and_unref(SYS_BUS_DEVICE(nvic), &error_fatal);
++    sysbus_realize_and_unref(SYS_BUS_DEVICE(armv7m), &error_fatal);
--        if (thread == 0) {
++    nvic = armv7m;
-+        if (socket == 0 && cluster == 0 && core == 0 && thread == 0) {
-             g_assert_cmpint(node, ==, 1);
+     /* Now we can wire up the IRQ and MMIO of the system registers */
--        } else if (thread == 1) {
+     sysbus_mmio_map(SYS_BUS_DEVICE(ssys_dev), 0, 0x400fe000);
 +        } else if (socket == 1 && cluster == 0 && core == 0 && thread == 0) {
              g_assert_cmpint(node, ==, 0);
          } else {
              g_assert(false);
 --
-.25.1
+.34.1

-[PULL 29/32] hw/arm/virt: Consider SMP configuration in CPU topology
+[PULL 03/36] hw/arm/v7m: Remove use of &first_cpu in machine_init()
-From: Gavin Shan <gshan@redhat.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Currently, the SMP configuration isn't considered when the CPU
+When instantiating the machine model, the machine_init()
-topology is populated. In this case, it's impossible to provide
+implementations usually create the CPUs, so have access
-the default CPU-to-NUMA mapping or association based on the socket
+to its first CPU. Use that rather than the &first_cpu
-ID of the given CPU.
+global.
-This takes account of SMP configuration when the CPU topology
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-is populated. The die ID for the given CPU isn't assigned since
+Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-it's not supported on arm/virt machine. Besides, the used SMP
+Reviewed-by: Samuel Tardieu <sam@rfc1149.net>
-configuration in qtest/numa-test/aarch64_numa_cpu() is corrected
+Message-id: 20250112225614.33723-4-philmd@linaro.org
 to avoid a test failure.
 Signed-off-by: Gavin Shan <gshan@redhat.com>
 Reviewed-by: Yanan Wang <wangyanan55@huawei.com>
 Acked-by: Igor Mammedov <imammedo@redhat.com>
 Message-id: 20220503140304.855514-4-gshan@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/virt.c | 15 ++++++++++++++-
+ hw/arm/b-l475e-iot01a.c    | 2 +-
- 1 file changed, 14 insertions(+), 1 deletion(-)
+ hw/arm/microbit.c          | 2 +-
  hw/arm/mps2-tz.c           | 2 +-
  hw/arm/mps2.c              | 2 +-
  hw/arm/msf2-som.c          | 2 +-
  hw/arm/musca.c             | 2 +-
  hw/arm/netduino2.c         | 2 +-
  hw/arm/netduinoplus2.c     | 2 +-
  hw/arm/olimex-stm32-h405.c | 2 +-
  hw/arm/stellaris.c         | 2 +-
  hw/arm/stm32vldiscovery.c  | 2 +-
  11 files changed, 11 insertions(+), 11 deletions(-)
-diff --git a/hw/arm/virt.c b/hw/arm/virt.c
+diff --git a/hw/arm/b-l475e-iot01a.c b/hw/arm/b-l475e-iot01a.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/virt.c
+--- a/hw/arm/b-l475e-iot01a.c
-+++ b/hw/arm/virt.c
++++ b/hw/arm/b-l475e-iot01a.c
-@@ -XXX,XX +XXX,XX @@ static const CPUArchIdList *virt_possible_cpu_arch_ids(MachineState *ms)
+@@ -XXX,XX +XXX,XX @@ static void bl475e_init(MachineState *machine)
-     int n;
+     sysbus_realize(SYS_BUS_DEVICE(&s->soc), &error_fatal);
-     unsigned int max_cpus = ms->smp.max_cpus;
-     VirtMachineState *vms = VIRT_MACHINE(ms);
+     sc = STM32L4X5_SOC_GET_CLASS(&s->soc);
-+    MachineClass *mc = MACHINE_GET_CLASS(vms);
+-    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename, 0,
++    armv7m_load_kernel(s->soc.armv7m.cpu, machine->kernel_filename, 0,
-     if (ms->possible_cpus) {
+                        sc->flash_size);
-         assert(ms->possible_cpus->len == max_cpus);
-@@ -XXX,XX +XXX,XX @@ static const CPUArchIdList *virt_possible_cpu_arch_ids(MachineState *ms)
+     if (object_class_by_name(TYPE_DM163)) {
-         ms->possible_cpus->cpus[n].type = ms->cpu_type;
+diff --git a/hw/arm/microbit.c b/hw/arm/microbit.c
-         ms->possible_cpus->cpus[n].arch_id =
+index XXXXXXX..XXXXXXX 100644
-             virt_cpu_mp_affinity(vms, n);
+--- a/hw/arm/microbit.c
-+
++++ b/hw/arm/microbit.c
-+        assert(!mc->smp_props.dies_supported);
+@@ -XXX,XX +XXX,XX @@ static void microbit_init(MachineState *machine)
-+        ms->possible_cpus->cpus[n].props.has_socket_id = true;
+     memory_region_add_subregion_overlap(&s->nrf51.container, NRF51_TWI_BASE,
-+        ms->possible_cpus->cpus[n].props.socket_id =
+                                         mr, -1);
-+            n / (ms->smp.clusters * ms->smp.cores * ms->smp.threads);
-+        ms->possible_cpus->cpus[n].props.has_cluster_id = true;
+-    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
-+        ms->possible_cpus->cpus[n].props.cluster_id =
++    armv7m_load_kernel(s->nrf51.armv7m.cpu, machine->kernel_filename,
-+            (n / (ms->smp.cores * ms->smp.threads)) % ms->smp.clusters;
+                        0, s->nrf51.flash_size);
-+        ms->possible_cpus->cpus[n].props.has_core_id = true;
+ }
-+        ms->possible_cpus->cpus[n].props.core_id =
-+            (n / ms->smp.threads) % ms->smp.cores;
+diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
-         ms->possible_cpus->cpus[n].props.has_thread_id = true;
+index XXXXXXX..XXXXXXX 100644
--        ms->possible_cpus->cpus[n].props.thread_id = n;
+--- a/hw/arm/mps2-tz.c
-+        ms->possible_cpus->cpus[n].props.thread_id =
++++ b/hw/arm/mps2-tz.c
-+            n % ms->smp.threads;
+@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
                                      mms->remap_irq);
      }
-     return ms->possible_cpus;
 -    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
 +    armv7m_load_kernel(mms->iotkit.armv7m[0].cpu, machine->kernel_filename,
                         0, boot_ram_size(mms));
  }
 diff --git a/hw/arm/mps2.c b/hw/arm/mps2.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/mps2.c
 +++ b/hw/arm/mps2.c
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
                   qdev_get_gpio_in(armv7m,
                                    mmc->fpga_type == FPGA_AN511 ? 47 : 13));
 -    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
 +    armv7m_load_kernel(mms->armv7m.cpu, machine->kernel_filename,
                         0, 0x400000);
  }
 diff --git a/hw/arm/msf2-som.c b/hw/arm/msf2-som.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/msf2-som.c
 +++ b/hw/arm/msf2-som.c
@@ -XXX,XX +XXX,XX @@ static void emcraft_sf2_s2s010_init(MachineState *machine)
      cs_line = qdev_get_gpio_in_named(spi_flash, SSI_GPIO_CS, 0);
      sysbus_connect_irq(SYS_BUS_DEVICE(&soc->spi[0]), 1, cs_line);
 -    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
 +    armv7m_load_kernel(soc->armv7m.cpu, machine->kernel_filename,
                         0, soc->envm_size);
  }
 diff --git a/hw/arm/musca.c b/hw/arm/musca.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/musca.c
 +++ b/hw/arm/musca.c
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
                                                       "cfg_sec_resp", 0));
      }
 -    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
 +    armv7m_load_kernel(mms->sse.armv7m[0].cpu, machine->kernel_filename,
                         0, 0x2000000);
  }
 diff --git a/hw/arm/netduino2.c b/hw/arm/netduino2.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/netduino2.c
 +++ b/hw/arm/netduino2.c
@@ -XXX,XX +XXX,XX @@ static void netduino2_init(MachineState *machine)
      qdev_connect_clock_in(dev, "sysclk", sysclk);
      sysbus_realize_and_unref(SYS_BUS_DEVICE(dev), &error_fatal);
 -    armv7m_load_kernel(ARM_CPU(first_cpu), machine->kernel_filename,
 +    armv7m_load_kernel(STM32F205_SOC(dev)->armv7m.cpu, machine->kernel_filename,
                         0, FLASH_SIZE);
  }
 diff --git a/hw/arm/netduinoplus2.c b/hw/arm/netduinoplus2.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/netduinoplus2.c
 +++ b/hw/arm/netduinoplus2.c
@@ -XXX,XX +XXX,XX @@ static void netduinoplus2_init(MachineState *machine)
      qdev_connect_clock_in(dev, "sysclk", sysclk);
      sysbus_realize_and_unref(SYS_BUS_DEVICE(dev), &error_fatal);
 -    armv7m_load_kernel(ARM_CPU(first_cpu),
 +    armv7m_load_kernel(STM32F405_SOC(dev)->armv7m.cpu,
                         machine->kernel_filename,
                         0, FLASH_SIZE);
  }
 diff --git a/hw/arm/olimex-stm32-h405.c b/hw/arm/olimex-stm32-h405.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/olimex-stm32-h405.c
 +++ b/hw/arm/olimex-stm32-h405.c
@@ -XXX,XX +XXX,XX @@ static void olimex_stm32_h405_init(MachineState *machine)
      qdev_connect_clock_in(dev, "sysclk", sysclk);
      sysbus_realize_and_unref(SYS_BUS_DEVICE(dev), &error_fatal);
 -    armv7m_load_kernel(ARM_CPU(first_cpu),
 +    armv7m_load_kernel(STM32F405_SOC(dev)->armv7m.cpu,
                         machine->kernel_filename,
                         0, FLASH_SIZE);
  }
 diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/stellaris.c
 +++ b/hw/arm/stellaris.c
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
      create_unimplemented_device("hibernation", 0x400fc000, 0x1000);
      create_unimplemented_device("flash-control", 0x400fd000, 0x1000);
 -    armv7m_load_kernel(ARM_CPU(first_cpu), ms->kernel_filename, 0, flash_size);
 +    armv7m_load_kernel(ARMV7M(armv7m)->cpu, ms->kernel_filename, 0, flash_size);
  }
  /* FIXME: Figure out how to generate these from stellaris_boards.  */
 diff --git a/hw/arm/stm32vldiscovery.c b/hw/arm/stm32vldiscovery.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/stm32vldiscovery.c
 +++ b/hw/arm/stm32vldiscovery.c
@@ -XXX,XX +XXX,XX @@ static void stm32vldiscovery_init(MachineState *machine)
      qdev_connect_clock_in(dev, "sysclk", sysclk);
      sysbus_realize_and_unref(SYS_BUS_DEVICE(dev), &error_fatal);
 -    armv7m_load_kernel(ARM_CPU(first_cpu),
 +    armv7m_load_kernel(STM32F100_SOC(dev)->armv7m.cpu,
                         machine->kernel_filename,
 , FLASH_SIZE);
  }
 --
-.25.1
+.34.1

-[PULL 27/32] qapi/machine.json: Add cluster-id
+[PULL 04/36] hw/char/imx_serial: Fix reset value of UFCR register
-From: Gavin Shan <gshan@redhat.com>
+From: Bernhard Beschow <shentey@gmail.com>
-This adds cluster-id in CPU instance properties, which will be used
+The value of the UFCR register is respected when echoing characters to the
-by arm/virt machine. Besides, the cluster-id is also verified or
+terminal, but its reset value is reserved. Fix the reset value to the one
-dumped in various spots:
+documented in the datasheet.
-  * hw/core/machine.c::machine_set_cpu_numa_node() to associate
+While at it move the related attribute out of the section of unimplemented
-    CPU with its NUMA node.
+registers since its value is actually respected.
-  * hw/core/machine.c::machine_numa_finish_cpu_init() to record
+Signed-off-by: Bernhard Beschow <shentey@gmail.com>
-    CPU slots with no NUMA mapping set.
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
   * hw/core/machine-hmp-cmds.c::hmp_hotpluggable_cpus() to dump
     cluster-id.
 Signed-off-by: Gavin Shan <gshan@redhat.com>
 Reviewed-by: Yanan Wang <wangyanan55@huawei.com>
 Acked-by: Igor Mammedov <imammedo@redhat.com>
 Message-id: 20220503140304.855514-2-gshan@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
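A note on the new reset value, as a minimal sketch rather than the device code: the patch sets UFCR to BIT(11) | BIT(0) on reset. The helper names below are hypothetical, and the assumption that the receiver trigger level (RXTL) occupies bits 5:0 of UFCR is not stated in the patch. Under that assumption, the reset value gives a trigger level of 1, so the ">= rxtl" check seen in imx_put_data() only passes once at least one character is in the FIFO.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Local stand-in for QEMU's BIT() macro. */
#define TOY_BIT(n)     (UINT32_C(1) << (n))
/* Assumption: RXTL occupies bits 5:0 of UFCR. */
#define TOY_RXTL_MASK  UINT32_C(0x3f)

/* Reset value taken from the patch: BIT(11) | BIT(0). */
static inline uint32_t toy_ufcr_reset(void)
{
    return TOY_BIT(11) | TOY_BIT(0);
}

/* Receiver trigger level encoded in a UFCR value (assumed layout). */
static inline uint32_t toy_ufcr_rxtl(uint32_t ufcr)
{
    return ufcr & TOY_RXTL_MASK;
}

/* Same comparison as in imx_put_data(): ready once the FIFO holds
 * at least RXTL entries. */
static inline bool toy_rx_ready(uint32_t fifo_used, uint32_t ufcr)
{
    return fifo_used >= toy_ufcr_rxtl(ufcr);
}
```

With the reset value, toy_rx_ready() is false for an empty FIFO and true as soon as one entry is queued.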
- qapi/machine.json          |  6 ++++--
+ include/hw/char/imx_serial.h | 2 +-
- hw/core/machine-hmp-cmds.c |  4 ++++
+ hw/char/imx_serial.c         | 1 +
- hw/core/machine.c          | 16 ++++++++++++++++
+files changed, 2 insertions(+), 1 deletion(-)
 files changed, 24 insertions(+), 2 deletions(-)
-diff --git a/qapi/machine.json b/qapi/machine.json
+diff --git a/include/hw/char/imx_serial.h b/include/hw/char/imx_serial.h
 index XXXXXXX..XXXXXXX 100644
---- a/qapi/machine.json
+--- a/include/hw/char/imx_serial.h
-+++ b/qapi/machine.json
++++ b/include/hw/char/imx_serial.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ struct IMXSerialState {
- # @node-id: NUMA node ID the CPU belongs to
+     uint32_t ucr1;
- # @socket-id: socket number within node/board the CPU belongs to
+     uint32_t ucr2;
- # @die-id: die number within socket the CPU belongs to (since 4.1)
+     uint32_t uts1;
--# @core-id: core number within die the CPU belongs to
++    uint32_t ufcr;
-+# @cluster-id: cluster number within die the CPU belongs to (since 7.1)
-+# @core-id: core number within cluster the CPU belongs to
+     /*
- # @thread-id: thread number within core the CPU belongs to
+      * The registers below are implemented just so that the
- #
+      * guest OS sees what it has written
--# Note: currently there are 5 properties that could be present
+      */
-+# Note: currently there are 6 properties that could be present
+     uint32_t onems;
- #       but management should be prepared to pass through other
+-    uint32_t ufcr;
- #       properties with device_add command to allow for future
+     uint32_t ubmr;
- #       interface extension. This also requires the filed names to be kept in
+     uint32_t ubrc;
-@@ -XXX,XX +XXX,XX @@
+     uint32_t ucr3;
-   'data': { '*node-id': 'int',
+diff --git a/hw/char/imx_serial.c b/hw/char/imx_serial.c
              '*socket-id': 'int',
              '*die-id': 'int',
 +            '*cluster-id': 'int',
              '*core-id': 'int',
              '*thread-id': 'int'
    }
 diff --git a/hw/core/machine-hmp-cmds.c b/hw/core/machine-hmp-cmds.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/core/machine-hmp-cmds.c
+--- a/hw/char/imx_serial.c
-+++ b/hw/core/machine-hmp-cmds.c
++++ b/hw/char/imx_serial.c
-@@ -XXX,XX +XXX,XX @@ void hmp_hotpluggable_cpus(Monitor *mon, const QDict *qdict)
+@@ -XXX,XX +XXX,XX @@ static void imx_serial_reset(IMXSerialState *s)
-         if (c->has_die_id) {
+     s->ucr3 = 0x700;
-             monitor_printf(mon, "    die-id: \"%" PRIu64 "\"\n", c->die_id);
+     s->ubmr = 0;
-         }
+     s->ubrc = 4;
-+        if (c->has_cluster_id) {
++    s->ufcr = BIT(11) | BIT(0);
-+            monitor_printf(mon, "    cluster-id: \"%" PRIu64 "\"\n",
-+                           c->cluster_id);
+     fifo32_reset(&s->rx_fifo);
-+        }
+     timer_del(&s->ageing_timer);
          if (c->has_core_id) {
              monitor_printf(mon, "    core-id: \"%" PRIu64 "\"\n", c->core_id);
          }
 diff --git a/hw/core/machine.c b/hw/core/machine.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/core/machine.c
 +++ b/hw/core/machine.c
@@ -XXX,XX +XXX,XX @@ void machine_set_cpu_numa_node(MachineState *machine,
              return;
          }
 +        if (props->has_cluster_id && !slot->props.has_cluster_id) {
 +            error_setg(errp, "cluster-id is not supported");
 +            return;
 +        }
 +
          if (props->has_socket_id && !slot->props.has_socket_id) {
              error_setg(errp, "socket-id is not supported");
              return;
@@ -XXX,XX +XXX,XX @@ void machine_set_cpu_numa_node(MachineState *machine,
                  continue;
          }
 +        if (props->has_cluster_id &&
 +            props->cluster_id != slot->props.cluster_id) {
 +                continue;
 +        }
 +
          if (props->has_die_id && props->die_id != slot->props.die_id) {
                  continue;
          }
@@ -XXX,XX +XXX,XX @@ static char *cpu_slot_to_string(const CPUArchId *cpu)
          }
          g_string_append_printf(s, "die-id: %"PRId64, cpu->props.die_id);
      }
 +    if (cpu->props.has_cluster_id) {
 +        if (s->len) {
 +            g_string_append_printf(s, ", ");
 +        }
 +        g_string_append_printf(s, "cluster-id: %"PRId64, cpu->props.cluster_id);
 +    }
      if (cpu->props.has_core_id) {
          if (s->len) {
              g_string_append_printf(s, ", ");
 --
-.25.1
+.34.1

-[PULL 10/32] target/arm: Annotate arm_max_initfn with FEAT identifiers
+[PULL 05/36] hw/char/imx_serial: Update all state before restarting ageing timer
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Bernhard Beschow <shentey@gmail.com>
-Update the legacy feature names to the current names.
+Fixes characters to be "echoed" after each keystroke rather than after every
-Provide feature names for id changes that were not marked.
+other since imx_serial_rx_fifo_ageing_timer_restart() would see ~UTS1_RXEMPTY
-Sort the field updates into increasing bitfield order.
+only after every other keystroke.
+Signed-off-by: Bernhard Beschow <shentey@gmail.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-10-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
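The ordering problem described above can be illustrated with a deliberately reduced model. This is a sketch, not the device implementation: the struct and function names are hypothetical, and the only assumption about imx_serial_rx_fifo_ageing_timer_restart() is the one the commit message implies, namely that it arms the timer only when RXEMPTY is clear.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy model holding only the state relevant to the ordering issue. */
struct toy_uart {
    bool rx_empty;     /* stands in for the UTS1_RXEMPTY bit */
    bool timer_armed;  /* stands in for the ageing timer being scheduled */
};

/* Assumed behaviour: arm the timer only when the FIFO is non-empty. */
static inline void toy_timer_restart(struct toy_uart *u)
{
    assert(u);
    u->timer_armed = !u->rx_empty;
}

/* Pre-patch order: restart the timer, then record that data is present. */
static inline void toy_put_data_old(struct toy_uart *u)
{
    toy_timer_restart(u);
    u->rx_empty = false;
}

/* Post-patch order: update all state first, then restart the timer. */
static inline void toy_put_data_new(struct toy_uart *u)
{
    u->rx_empty = false;
    toy_timer_restart(u);
}
```

Starting from an empty FIFO, the old order leaves the timer unarmed after the first character and arms it only on the second, while the new order arms it on the first.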
- target/arm/cpu64.c   | 100 +++++++++++++++++++++----------------------
+ hw/char/imx_serial.c | 6 +++---
- target/arm/cpu_tcg.c |  48 ++++++++++-----------
+file changed, 3 insertions(+), 3 deletions(-)
 files changed, 74 insertions(+), 74 deletions(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+diff --git a/hw/char/imx_serial.c b/hw/char/imx_serial.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/hw/char/imx_serial.c
-+++ b/target/arm/cpu64.c
++++ b/hw/char/imx_serial.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static void imx_put_data(void *opaque, uint32_t value)
-     cpu->midr = t;
+     if (fifo32_num_used(&s->rx_fifo) >= rxtl) {
+         s->usr1 |= USR1_RRDY;
-     t = cpu->isar.id_aa64isar0;
+     }
--    t = FIELD_DP64(t, ID_AA64ISAR0, AES, 2); /* AES + PMULL */
+-
--    t = FIELD_DP64(t, ID_AA64ISAR0, SHA1, 1);
+-    imx_serial_rx_fifo_ageing_timer_restart(s);
--    t = FIELD_DP64(t, ID_AA64ISAR0, SHA2, 2); /* SHA512 */
+-
-+    t = FIELD_DP64(t, ID_AA64ISAR0, AES, 2);      /* FEAT_PMULL */
+     s->usr2 |= USR2_RDR;
-+    t = FIELD_DP64(t, ID_AA64ISAR0, SHA1, 1);     /* FEAT_SHA1 */
+     s->uts1 &= ~UTS1_RXEMPTY;
-+    t = FIELD_DP64(t, ID_AA64ISAR0, SHA2, 2);     /* FEAT_SHA512 */
+     if (value & URXD_BRK) {
-     t = FIELD_DP64(t, ID_AA64ISAR0, CRC32, 1);
+         s->usr2 |= USR2_BRCD;
--    t = FIELD_DP64(t, ID_AA64ISAR0, ATOMIC, 2);
+     }
--    t = FIELD_DP64(t, ID_AA64ISAR0, RDM, 1);
++
--    t = FIELD_DP64(t, ID_AA64ISAR0, SHA3, 1);
++    imx_serial_rx_fifo_ageing_timer_restart(s);
--    t = FIELD_DP64(t, ID_AA64ISAR0, SM3, 1);
++
--    t = FIELD_DP64(t, ID_AA64ISAR0, SM4, 1);
+     imx_update(s);
 -    t = FIELD_DP64(t, ID_AA64ISAR0, DP, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR0, FHM, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR0, TS, 2); /* v8.5-CondM */
 -    t = FIELD_DP64(t, ID_AA64ISAR0, TLB, 2); /* FEAT_TLBIRANGE */
 -    t = FIELD_DP64(t, ID_AA64ISAR0, RNDR, 1);
 +    t = FIELD_DP64(t, ID_AA64ISAR0, ATOMIC, 2);   /* FEAT_LSE */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, RDM, 1);      /* FEAT_RDM */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, SHA3, 1);     /* FEAT_SHA3 */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, SM3, 1);      /* FEAT_SM3 */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, SM4, 1);      /* FEAT_SM4 */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, DP, 1);       /* FEAT_DotProd */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, FHM, 1);      /* FEAT_FHM */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, TS, 2);       /* FEAT_FlagM2 */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, TLB, 2);      /* FEAT_TLBIRANGE */
 +    t = FIELD_DP64(t, ID_AA64ISAR0, RNDR, 1);     /* FEAT_RNG */
      cpu->isar.id_aa64isar0 = t;
      t = cpu->isar.id_aa64isar1;
 -    t = FIELD_DP64(t, ID_AA64ISAR1, DPB, 2);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, JSCVT, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, FCMA, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, SB, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, SPECRES, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, BF16, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, FRINTTS, 1);
 -    t = FIELD_DP64(t, ID_AA64ISAR1, LRCPC, 2); /* ARMv8.4-RCPC */
 -    t = FIELD_DP64(t, ID_AA64ISAR1, I8MM, 1);
 +    t = FIELD_DP64(t, ID_AA64ISAR1, DPB, 2);      /* FEAT_DPB2 */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, JSCVT, 1);    /* FEAT_JSCVT */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, FCMA, 1);     /* FEAT_FCMA */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, LRCPC, 2);    /* FEAT_LRCPC2 */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, FRINTTS, 1);  /* FEAT_FRINTTS */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, SB, 1);       /* FEAT_SB */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, SPECRES, 1);  /* FEAT_SPECRES */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, BF16, 1);     /* FEAT_BF16 */
 +    t = FIELD_DP64(t, ID_AA64ISAR1, I8MM, 1);     /* FEAT_I8MM */
      cpu->isar.id_aa64isar1 = t;
      t = cpu->isar.id_aa64pfr0;
 +    t = FIELD_DP64(t, ID_AA64PFR0, FP, 1);        /* FEAT_FP16 */
 +    t = FIELD_DP64(t, ID_AA64PFR0, ADVSIMD, 1);   /* FEAT_FP16 */
      t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
 -    t = FIELD_DP64(t, ID_AA64PFR0, FP, 1);
 -    t = FIELD_DP64(t, ID_AA64PFR0, ADVSIMD, 1);
 -    t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);
 -    t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);
 +    t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);      /* FEAT_SEL2 */
 +    t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);       /* FEAT_DIT */
      cpu->isar.id_aa64pfr0 = t;
      t = cpu->isar.id_aa64pfr1;
 -    t = FIELD_DP64(t, ID_AA64PFR1, BT, 1);
 -    t = FIELD_DP64(t, ID_AA64PFR1, SSBS, 2);
 +    t = FIELD_DP64(t, ID_AA64PFR1, BT, 1);        /* FEAT_BTI */
 +    t = FIELD_DP64(t, ID_AA64PFR1, SSBS, 2);      /* FEAT_SSBS2 */
      /*
       * Begin with full support for MTE. This will be downgraded to MTE=0
       * during realize if the board provides no tag memory, much like
       * we do for EL2 with the virtualization=on property.
       */
 -    t = FIELD_DP64(t, ID_AA64PFR1, MTE, 3);
 +    t = FIELD_DP64(t, ID_AA64PFR1, MTE, 3);       /* FEAT_MTE3 */
      cpu->isar.id_aa64pfr1 = t;
      t = cpu->isar.id_aa64mmfr0;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
      cpu->isar.id_aa64mmfr0 = t;
      t = cpu->isar.id_aa64mmfr1;
 -    t = FIELD_DP64(t, ID_AA64MMFR1, HPDS, 1); /* HPD */
 -    t = FIELD_DP64(t, ID_AA64MMFR1, LO, 1);
 -    t = FIELD_DP64(t, ID_AA64MMFR1, VH, 1);
 -    t = FIELD_DP64(t, ID_AA64MMFR1, PAN, 2); /* ATS1E1 */
 -    t = FIELD_DP64(t, ID_AA64MMFR1, VMIDBITS, 2); /* VMID16 */
 -    t = FIELD_DP64(t, ID_AA64MMFR1, XNX, 1); /* TTS2UXN */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, VMIDBITS, 2); /* FEAT_VMID16 */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, VH, 1);       /* FEAT_VHE */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, HPDS, 1);     /* FEAT_HPDS */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, LO, 1);       /* FEAT_LOR */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, PAN, 2);      /* FEAT_PAN2 */
 +    t = FIELD_DP64(t, ID_AA64MMFR1, XNX, 1);      /* FEAT_XNX */
      cpu->isar.id_aa64mmfr1 = t;
      t = cpu->isar.id_aa64mmfr2;
 -    t = FIELD_DP64(t, ID_AA64MMFR2, UAO, 1);
 -    t = FIELD_DP64(t, ID_AA64MMFR2, CNP, 1); /* TTCNP */
 -    t = FIELD_DP64(t, ID_AA64MMFR2, ST, 1); /* TTST */
 -    t = FIELD_DP64(t, ID_AA64MMFR2, VARANGE, 1); /* FEAT_LVA */
 -    t = FIELD_DP64(t, ID_AA64MMFR2, TTL, 1); /* FEAT_TTL */
 -    t = FIELD_DP64(t, ID_AA64MMFR2, BBM, 2); /* FEAT_BBM at level 2 */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, CNP, 1);      /* FEAT_TTCNP */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, UAO, 1);      /* FEAT_UAO */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, VARANGE, 1);  /* FEAT_LVA */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, ST, 1);       /* FEAT_TTST */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, TTL, 1);      /* FEAT_TTL */
 +    t = FIELD_DP64(t, ID_AA64MMFR2, BBM, 2);      /* FEAT_BBM at level 2 */
      cpu->isar.id_aa64mmfr2 = t;
      t = cpu->isar.id_aa64zfr0;
      t = FIELD_DP64(t, ID_AA64ZFR0, SVEVER, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, AES, 2);  /* PMULL */
 -    t = FIELD_DP64(t, ID_AA64ZFR0, BITPERM, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, BFLOAT16, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, SHA3, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, SM4, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, I8MM, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, F32MM, 1);
 -    t = FIELD_DP64(t, ID_AA64ZFR0, F64MM, 1);
 +    t = FIELD_DP64(t, ID_AA64ZFR0, AES, 2);       /* FEAT_SVE_PMULL128 */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, BITPERM, 1);   /* FEAT_SVE_BitPerm */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, BFLOAT16, 1);  /* FEAT_BF16 */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, SHA3, 1);      /* FEAT_SVE_SHA3 */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, SM4, 1);       /* FEAT_SVE_SM4 */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, I8MM, 1);      /* FEAT_I8MM */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, F32MM, 1);     /* FEAT_F32MM */
 +    t = FIELD_DP64(t, ID_AA64ZFR0, F64MM, 1);     /* FEAT_F64MM */
      cpu->isar.id_aa64zfr0 = t;
      t = cpu->isar.id_aa64dfr0;
 -    t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
 +    t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5);    /* FEAT_PMUv3p4 */
      cpu->isar.id_aa64dfr0 = t;
      /* Replicate the same data to the 32-bit id registers.  */
 diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu_tcg.c
 +++ b/target/arm/cpu_tcg.c
@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
      /* Add additional features supported by QEMU */
      t = cpu->isar.id_isar5;
 -    t = FIELD_DP32(t, ID_ISAR5, AES, 2);
 -    t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, AES, 2);          /* FEAT_PMULL */
 +    t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);         /* FEAT_SHA1 */
 +    t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);         /* FEAT_SHA256 */
      t = FIELD_DP32(t, ID_ISAR5, CRC32, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, RDM, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, RDM, 1);          /* FEAT_RDM */
 +    t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);         /* FEAT_FCMA */
      cpu->isar.id_isar5 = t;
      t = cpu->isar.id_isar6;
 -    t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, DP, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, FHM, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, SB, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, BF16, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);        /* FEAT_JSCVT */
 +    t = FIELD_DP32(t, ID_ISAR6, DP, 1);           /* FEAT_DotProd */
 +    t = FIELD_DP32(t, ID_ISAR6, FHM, 1);          /* FEAT_FHM */
 +    t = FIELD_DP32(t, ID_ISAR6, SB, 1);           /* FEAT_SB */
 +    t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);      /* FEAT_SPECRES */
 +    t = FIELD_DP32(t, ID_ISAR6, BF16, 1);         /* FEAT_AA32BF16 */
 +    t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);         /* FEAT_AA32I8MM */
      cpu->isar.id_isar6 = t;
      t = cpu->isar.mvfr1;
 -    t = FIELD_DP32(t, MVFR1, FPHP, 3);     /* v8.2-FP16 */
 -    t = FIELD_DP32(t, MVFR1, SIMDHP, 2);   /* v8.2-FP16 */
 +    t = FIELD_DP32(t, MVFR1, FPHP, 3);            /* FEAT_FP16 */
 +    t = FIELD_DP32(t, MVFR1, SIMDHP, 2);          /* FEAT_FP16 */
      cpu->isar.mvfr1 = t;
      t = cpu->isar.mvfr2;
 -    t = FIELD_DP32(t, MVFR2, SIMDMISC, 3); /* SIMD MaxNum */
 -    t = FIELD_DP32(t, MVFR2, FPMISC, 4);   /* FP MaxNum */
 +    t = FIELD_DP32(t, MVFR2, SIMDMISC, 3);        /* SIMD MaxNum */
 +    t = FIELD_DP32(t, MVFR2, FPMISC, 4);          /* FP MaxNum */
      cpu->isar.mvfr2 = t;
      t = cpu->isar.id_mmfr3;
 -    t = FIELD_DP32(t, ID_MMFR3, PAN, 2); /* ATS1E1 */
 +    t = FIELD_DP32(t, ID_MMFR3, PAN, 2);          /* FEAT_PAN2 */
      cpu->isar.id_mmfr3 = t;
      t = cpu->isar.id_mmfr4;
 -    t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
 -    t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 -    t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
 -    t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
 +    t = FIELD_DP32(t, ID_MMFR4, HPDS, 1);         /* FEAT_AA32HPD */
 +    t = FIELD_DP32(t, ID_MMFR4, AC2, 1);          /* ACTLR2, HACTLR2 */
 +    t = FIELD_DP32(t, ID_MMFR4, CNP, 1);          /* FEAT_TTCNP */
 +    t = FIELD_DP32(t, ID_MMFR4, XNX, 1);          /* FEAT_XNX */
      cpu->isar.id_mmfr4 = t;
      t = cpu->isar.id_pfr0;
 -    t = FIELD_DP32(t, ID_PFR0, DIT, 1);
 +    t = FIELD_DP32(t, ID_PFR0, DIT, 1);           /* FEAT_DIT */
      cpu->isar.id_pfr0 = t;
      t = cpu->isar.id_pfr2;
 -    t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
 +    t = FIELD_DP32(t, ID_PFR2, SSBS, 1);          /* FEAT_SSBS */
      cpu->isar.id_pfr2 = t;
      t = cpu->isar.id_dfr0;
 -    t = FIELD_DP32(t, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
 +    t = FIELD_DP32(t, ID_DFR0, PERFMON, 5);       /* FEAT_PMUv3p4 */
      cpu->isar.id_dfr0 = t;
  }
 --
-.25.1
+.34.1

-[PULL 04/32] target/arm: Merge zcr reginfo
+[PULL 06/36] hw/pci-host/designware: Expose MSI IRQ
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Bernhard Beschow <shentey@gmail.com>
-Drop zcr_no_el2_reginfo and merge the 3 registers into one array,
+Fixes INTD and MSI interrupts poking the same IRQ line without keeping track of
-now that ZCR_EL2 can be squashed to RES0 and ZCR_EL3 dropped
+each other's IRQ level. Furthermore, SoCs such as the i.MX 8M Plus don't share
-while registering.
+the MSI IRQ with the INTx lines, so expose it as a dedicated pin.
+Signed-off-by: Bernhard Beschow <shentey@gmail.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-4-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
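To illustrate why routing INTD and MSI through an OR gate matters, here is a small standalone sketch. It is not QEMU's or-irq device: the names are hypothetical, and it models only the idea that each input level is remembered and the output is the OR of all inputs. The input numbering follows the board wiring in this patch (input 0 for INTD, input 1 for MSI).

```c
#include <assert.h>
#include <stdbool.h>

#define TOY_OR_LINES 2  /* matches "num-lines" = 2 in the patch */

/* Toy OR gate: remembers every input level, output is their OR. */
struct toy_or_gate {
    bool levels[TOY_OR_LINES];
    bool out;
};

static inline void toy_or_set(struct toy_or_gate *g, int line, bool level)
{
    assert(line >= 0 && line < TOY_OR_LINES);
    g->levels[line] = level;
    g->out = false;
    for (int i = 0; i < TOY_OR_LINES; i++) {
        g->out = g->out || g->levels[i];
    }
}

/* Pre-patch situation: both sources drive one line directly, so the
 * last writer wins regardless of the other source's level. */
static inline void toy_shared_set(bool *line, bool level)
{
    *line = level;
}
```

If INTD and MSI are both raised and MSI is then lowered, the directly shared line drops to 0 even though INTD is still asserted, whereas the gate output stays at 1 until both inputs are low.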
- target/arm/helper.c | 55 ++++++++++++++-------------------------------
+ include/hw/arm/fsl-imx6.h        |  4 +++-
-file changed, 17 insertions(+), 38 deletions(-)
+ include/hw/arm/fsl-imx7.h        |  4 +++-
  include/hw/pci-host/designware.h |  1 +
  hw/arm/fsl-imx6.c                | 13 ++++++++++++-
  hw/arm/fsl-imx7.c                | 13 ++++++++++++-
  hw/pci-host/designware.c         |  7 +++----
  hw/arm/Kconfig                   |  2 ++
 files changed, 36 insertions(+), 8 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+diff --git a/include/hw/arm/fsl-imx6.h b/include/hw/arm/fsl-imx6.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/include/hw/arm/fsl-imx6.h
-+++ b/target/arm/helper.c
++++ b/include/hw/arm/fsl-imx6.h
-@@ -XXX,XX +XXX,XX @@ static void zcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
+@@ -XXX,XX +XXX,XX @@
  #include "hw/usb/chipidea.h"
  #include "hw/usb/imx-usb-phy.h"
  #include "hw/pci-host/designware.h"
 +#include "hw/or-irq.h"
  #include "exec/memory.h"
  #include "cpu.h"
  #include "qom/object.h"
@@ -XXX,XX +XXX,XX @@ struct FslIMX6State {
      ChipideaState      usb[FSL_IMX6_NUM_USBS];
      IMXFECState        eth;
      DesignwarePCIEHost pcie;
 +    OrIRQState         pcie4_msi_irq;
      MemoryRegion       rom;
      MemoryRegion       caam;
      MemoryRegion       ocram;
@@ -XXX,XX +XXX,XX @@ struct FslIMX6State {
  #define FSL_IMX6_PCIE1_IRQ 120
  #define FSL_IMX6_PCIE2_IRQ 121
  #define FSL_IMX6_PCIE3_IRQ 122
 -#define FSL_IMX6_PCIE4_IRQ 123
 +#define FSL_IMX6_PCIE4_MSI_IRQ 123
  #define FSL_IMX6_DCIC1_IRQ 124
  #define FSL_IMX6_DCIC2_IRQ 125
  #define FSL_IMX6_MLB150_HIGH_IRQ 126
 diff --git a/include/hw/arm/fsl-imx7.h b/include/hw/arm/fsl-imx7.h
 index XXXXXXX..XXXXXXX 100644
 --- a/include/hw/arm/fsl-imx7.h
 +++ b/include/hw/arm/fsl-imx7.h
@@ -XXX,XX +XXX,XX @@
  #include "hw/net/imx_fec.h"
  #include "hw/pci-host/designware.h"
  #include "hw/usb/chipidea.h"
 +#include "hw/or-irq.h"
  #include "cpu.h"
  #include "qom/object.h"
  #include "qemu/units.h"
@@ -XXX,XX +XXX,XX @@ struct FslIMX7State {
      IMX7GPRState       gpr;
      ChipideaState      usb[FSL_IMX7_NUM_USBS];
      DesignwarePCIEHost pcie;
 +    OrIRQState         pcie4_msi_irq;
      MemoryRegion       rom;
      MemoryRegion       caam;
      MemoryRegion       ocram;
@@ -XXX,XX +XXX,XX @@ enum FslIMX7IRQs {
      FSL_IMX7_PCI_INTA_IRQ = 125,
      FSL_IMX7_PCI_INTB_IRQ = 124,
      FSL_IMX7_PCI_INTC_IRQ = 123,
 -    FSL_IMX7_PCI_INTD_IRQ = 122,
 +    FSL_IMX7_PCI_INTD_MSI_IRQ = 122,
      FSL_IMX7_UART7_IRQ    = 126,
 diff --git a/include/hw/pci-host/designware.h b/include/hw/pci-host/designware.h
 index XXXXXXX..XXXXXXX 100644
 --- a/include/hw/pci-host/designware.h
 +++ b/include/hw/pci-host/designware.h
@@ -XXX,XX +XXX,XX @@ struct DesignwarePCIEHost {
          MemoryRegion io;
          qemu_irq     irqs[4];
 +        qemu_irq     msi;
      } pci;
      MemoryRegion mmio;
 diff --git a/hw/arm/fsl-imx6.c b/hw/arm/fsl-imx6.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/fsl-imx6.c
 +++ b/hw/arm/fsl-imx6.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx6_init(Object *obj)
      object_initialize_child(obj, "eth", &s->eth, TYPE_IMX_ENET);
      object_initialize_child(obj, "pcie", &s->pcie, TYPE_DESIGNWARE_PCIE_HOST);
 +    object_initialize_child(obj, "pcie4-msi-irq", &s->pcie4_msi_irq,
 +                            TYPE_OR_IRQ);
  }
  static void fsl_imx6_realize(DeviceState *dev, Error **errp)
@@ -XXX,XX +XXX,XX @@ static void fsl_imx6_realize(DeviceState *dev, Error **errp)
      sysbus_realize(SYS_BUS_DEVICE(&s->pcie), &error_abort);
      sysbus_mmio_map(SYS_BUS_DEVICE(&s->pcie), 0, FSL_IMX6_PCIe_REG_ADDR);
 +    object_property_set_int(OBJECT(&s->pcie4_msi_irq), "num-lines", 2,
 +                            &error_abort);
 +    qdev_realize(DEVICE(&s->pcie4_msi_irq), NULL, &error_abort);
 +
 +    irq = qdev_get_gpio_in(DEVICE(&s->a9mpcore), FSL_IMX6_PCIE4_MSI_IRQ);
 +    qdev_connect_gpio_out(DEVICE(&s->pcie4_msi_irq), 0, irq);
 +
      irq = qdev_get_gpio_in(DEVICE(&s->a9mpcore), FSL_IMX6_PCIE1_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 0, irq);
      irq = qdev_get_gpio_in(DEVICE(&s->a9mpcore), FSL_IMX6_PCIE2_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 1, irq);
      irq = qdev_get_gpio_in(DEVICE(&s->a9mpcore), FSL_IMX6_PCIE3_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 2, irq);
 -    irq = qdev_get_gpio_in(DEVICE(&s->a9mpcore), FSL_IMX6_PCIE4_IRQ);
 +    irq = qdev_get_gpio_in(DEVICE(&s->pcie4_msi_irq), 0);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 3, irq);
 +    irq = qdev_get_gpio_in(DEVICE(&s->pcie4_msi_irq), 1);
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 4, irq);
      /*
       * PCIe PHY
 diff --git a/hw/arm/fsl-imx7.c b/hw/arm/fsl-imx7.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/fsl-imx7.c
 +++ b/hw/arm/fsl-imx7.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx7_init(Object *obj)
       * PCIE
       */
      object_initialize_child(obj, "pcie", &s->pcie, TYPE_DESIGNWARE_PCIE_HOST);
 +    object_initialize_child(obj, "pcie4-msi-irq", &s->pcie4_msi_irq,
 +                            TYPE_OR_IRQ);
      /*
       * USBs
@@ -XXX,XX +XXX,XX @@ static void fsl_imx7_realize(DeviceState *dev, Error **errp)
      sysbus_realize(SYS_BUS_DEVICE(&s->pcie), &error_abort);
      sysbus_mmio_map(SYS_BUS_DEVICE(&s->pcie), 0, FSL_IMX7_PCIE_REG_ADDR);
 +    object_property_set_int(OBJECT(&s->pcie4_msi_irq), "num-lines", 2,
 +                            &error_abort);
 +    qdev_realize(DEVICE(&s->pcie4_msi_irq), NULL, &error_abort);
 +
 +    irq = qdev_get_gpio_in(DEVICE(&s->a7mpcore), FSL_IMX7_PCI_INTD_MSI_IRQ);
 +    qdev_connect_gpio_out(DEVICE(&s->pcie4_msi_irq), 0, irq);
 +
      irq = qdev_get_gpio_in(DEVICE(&s->a7mpcore), FSL_IMX7_PCI_INTA_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 0, irq);
      irq = qdev_get_gpio_in(DEVICE(&s->a7mpcore), FSL_IMX7_PCI_INTB_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 1, irq);
      irq = qdev_get_gpio_in(DEVICE(&s->a7mpcore), FSL_IMX7_PCI_INTC_IRQ);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 2, irq);
 -    irq = qdev_get_gpio_in(DEVICE(&s->a7mpcore), FSL_IMX7_PCI_INTD_IRQ);
 +    irq = qdev_get_gpio_in(DEVICE(&s->pcie4_msi_irq), 0);
      sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 3, irq);
 +    irq = qdev_get_gpio_in(DEVICE(&s->pcie4_msi_irq), 1);
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&s->pcie), 4, irq);
      /*
       * USBs
 diff --git a/hw/pci-host/designware.c b/hw/pci-host/designware.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/pci-host/designware.c
 +++ b/hw/pci-host/designware.c
@@ -XXX,XX +XXX,XX @@
  #define DESIGNWARE_PCIE_ATU_DEVFN(x)               (((x) >> 16) & 0xff)
  #define DESIGNWARE_PCIE_ATU_UPPER_TARGET           0x91C
 -#define DESIGNWARE_PCIE_IRQ_MSI                    3
 -
  static DesignwarePCIEHost *
  designware_pcie_root_to_host(DesignwarePCIERoot *root)
  {
@@ -XXX,XX +XXX,XX @@ static void designware_pcie_root_msi_write(void *opaque, hwaddr addr,
      root->msi.intr[0].status |= BIT(val) & root->msi.intr[0].enable;
      if (root->msi.intr[0].status & ~root->msi.intr[0].mask) {
 -        qemu_set_irq(host->pci.irqs[DESIGNWARE_PCIE_IRQ_MSI], 1);
 +        qemu_set_irq(host->pci.msi, 1);
      }
  }
--static const ARMCPRegInfo zcr_el1_reginfo = {
+@@ -XXX,XX +XXX,XX @@ static void designware_pcie_root_config_write(PCIDevice *d, uint32_t address,
--    .name = "ZCR_EL1", .state = ARM_CP_STATE_AA64,
+     case DESIGNWARE_PCIE_MSI_INTR0_STATUS:
--    .opc0 = 3, .opc1 = 0, .crn = 1, .crm = 2, .opc2 = 0,
+         root->msi.intr[0].status ^= val;
--    .access = PL1_RW, .type = ARM_CP_SVE,
+         if (!root->msi.intr[0].status) {
--    .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[1]),
+-            qemu_set_irq(host->pci.irqs[DESIGNWARE_PCIE_IRQ_MSI], 0);
--    .writefn = zcr_write, .raw_writefn = raw_write
++            qemu_set_irq(host->pci.msi, 0);
--};
+         }
--
+         break;
--static const ARMCPRegInfo zcr_el2_reginfo = {
--    .name = "ZCR_EL2", .state = ARM_CP_STATE_AA64,
+@@ -XXX,XX +XXX,XX @@ static void designware_pcie_host_realize(DeviceState *dev, Error **errp)
--    .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 2, .opc2 = 0,
+     for (i = 0; i < ARRAY_SIZE(s->pci.irqs); i++) {
--    .access = PL2_RW, .type = ARM_CP_SVE,
+         sysbus_init_irq(sbd, &s->pci.irqs[i]);
 -    .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[2]),
 -    .writefn = zcr_write, .raw_writefn = raw_write
 -};
 -
 -static const ARMCPRegInfo zcr_no_el2_reginfo = {
 -    .name = "ZCR_EL2", .state = ARM_CP_STATE_AA64,
 -    .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 2, .opc2 = 0,
 -    .access = PL2_RW, .type = ARM_CP_SVE,
 -    .readfn = arm_cp_read_zero, .writefn = arm_cp_write_ignore
 -};
 -
 -static const ARMCPRegInfo zcr_el3_reginfo = {
 -    .name = "ZCR_EL3", .state = ARM_CP_STATE_AA64,
 -    .opc0 = 3, .opc1 = 6, .crn = 1, .crm = 2, .opc2 = 0,
 -    .access = PL3_RW, .type = ARM_CP_SVE,
 -    .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[3]),
 -    .writefn = zcr_write, .raw_writefn = raw_write
 +static const ARMCPRegInfo zcr_reginfo[] = {
 +    { .name = "ZCR_EL1", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 0, .crn = 1, .crm = 2, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_SVE,
 +      .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[1]),
 +      .writefn = zcr_write, .raw_writefn = raw_write },
 +    { .name = "ZCR_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 2, .opc2 = 0,
 +      .access = PL2_RW, .type = ARM_CP_SVE,
 +      .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[2]),
 +      .writefn = zcr_write, .raw_writefn = raw_write },
 +    { .name = "ZCR_EL3", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 6, .crn = 1, .crm = 2, .opc2 = 0,
 +      .access = PL3_RW, .type = ARM_CP_SVE,
 +      .fieldoffset = offsetof(CPUARMState, vfp.zcr_el[3]),
 +      .writefn = zcr_write, .raw_writefn = raw_write },
  };
  void hw_watchpoint_update(ARMCPU *cpu, int n)
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
      }
++    sysbus_init_irq(sbd, &s->pci.msi);
-     if (cpu_isar_feature(aa64_sve, cpu)) {
--        define_one_arm_cp_reg(cpu, &zcr_el1_reginfo);
+     memory_region_init_io(&s->mmio,
--        if (arm_feature(env, ARM_FEATURE_EL2)) {
+                           OBJECT(s),
--            define_one_arm_cp_reg(cpu, &zcr_el2_reginfo);
+diff --git a/hw/arm/Kconfig b/hw/arm/Kconfig
--        } else {
+index XXXXXXX..XXXXXXX 100644
--            define_one_arm_cp_reg(cpu, &zcr_no_el2_reginfo);
+--- a/hw/arm/Kconfig
--        }
++++ b/hw/arm/Kconfig
--        if (arm_feature(env, ARM_FEATURE_EL3)) {
+@@ -XXX,XX +XXX,XX @@ config FSL_IMX6
--            define_one_arm_cp_reg(cpu, &zcr_el3_reginfo);
+     select PL310  # cache controller
--        }
+     select PCI_EXPRESS_DESIGNWARE
-+        define_arm_cp_regs(cpu, zcr_reginfo);
+     select SDHCI
-     }
++    select OR_IRQ
- #ifdef TARGET_AARCH64
+ config ASPEED_SOC
      bool
@@ -XXX,XX +XXX,XX @@ config FSL_IMX7
      select WDT_IMX2
      select PCI_EXPRESS_DESIGNWARE
      select SDHCI
 +    select OR_IRQ
      select UNIMP
  config ARM_SMMUV3
 --
-.25.1
+.34.1

-[PULL 03/32] target/arm: Drop EL3 no EL2 fallbacks
+[PULL 07/36] hw/arm/stellaris: Link each board schematic
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Drop el3_no_el2_cp_reginfo, el3_no_el2_v8_cp_reginfo, and the local
+Board schematic is useful to corroborate GPIOs/IRQs wiring.
 vpidr_regs definition, and rely on the squashing to ARM_CP_CONST
 while registering for v8.
-This is a behavior change for v7 cpus with Security Extensions and
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 without Virtualization Extensions, in that the virtualization cpregs
 are now correctly not present.  This would be a migration compatibility
 break, except that we have an existing bug in which migration of 32-bit
 cpus with Security Extensions enabled does not work.
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250110160204.74997-2-philmd@linaro.org
-Message-id: 20220506180242.216785-3-richard.henderson@linaro.org
+[PMM: Use https:// URLs]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper.c | 158 ++++----------------------------------------
+ hw/arm/stellaris.c | 8 ++++++++
-file changed, 13 insertions(+), 145 deletions(-)
+file changed, 8 insertions(+)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/hw/arm/stellaris.c
-+++ b/target/arm/helper.c
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
+@@ -XXX,XX +XXX,XX @@ static void lm3s6965evb_init(MachineState *machine)
-       .fieldoffset = offsetoflow32(CPUARMState, cp15.mdcr_el3) },
+     stellaris_init(machine, &stellaris_boards[1]);
  }
 +/*
 + * Stellaris LM3S811 Evaluation Board Schematics:
 + * https://www.ti.com/lit/ug/symlink/spmu030.pdf
 + */
  static void lm3s811evb_class_init(ObjectClass *oc, void *data)
  {
      MachineClass *mc = MACHINE_CLASS(oc);
@@ -XXX,XX +XXX,XX @@ static const TypeInfo lm3s811evb_type = {
      .class_init = lm3s811evb_class_init,
  };
--/* Used to describe the behaviour of EL2 regs when EL2 does not exist.  */
++/*
--static const ARMCPRegInfo el3_no_el2_cp_reginfo[] = {
++ * Stellaris: LM3S6965 Evaluation Board Schematics:
--    { .name = "VBAR_EL2", .state = ARM_CP_STATE_BOTH,
++ * https://www.ti.com/lit/ug/symlink/spmu029.pdf
--      .opc0 = 3, .opc1 = 4, .crn = 12, .crm = 0, .opc2 = 0,
++ */
--      .access = PL2_RW,
+ static void lm3s6965evb_class_init(ObjectClass *oc, void *data)
 -      .readfn = arm_cp_read_zero, .writefn = arm_cp_write_ignore },
 -    { .name = "HCR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 0,
 -      .access = PL2_RW,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "HACR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 7,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "ESR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 2, .opc2 = 0,
 -      .access = PL2_RW,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CPTR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 2,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "MAIR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 10, .crm = 2, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "HMAIR1", .state = ARM_CP_STATE_AA32,
 -      .cp = 15, .opc1 = 4, .crn = 10, .crm = 2, .opc2 = 1,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "AMAIR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 10, .crm = 3, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "HAMAIR1", .state = ARM_CP_STATE_AA32,
 -      .cp = 15, .opc1 = 4, .crn = 10, .crm = 3, .opc2 = 1,
 -      .access = PL2_RW, .type = ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "AFSR0_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 1, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "AFSR1_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 1, .opc2 = 1,
 -      .access = PL2_RW, .type = ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "TCR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 2, .crm = 0, .opc2 = 2,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "VTCR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 2, .crm = 1, .opc2 = 2,
 -      .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "VTTBR", .state = ARM_CP_STATE_AA32,
 -      .cp = 15, .opc1 = 6, .crm = 2,
 -      .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -      .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
 -    { .name = "VTTBR_EL2", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 4, .crn = 2, .crm = 1, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "SCTLR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 0, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "TPIDR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 13, .crm = 0, .opc2 = 2,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "TTBR0_EL2", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 4, .crn = 2, .crm = 0, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "HTTBR", .cp = 15, .opc1 = 4, .crm = 2,
 -      .access = PL2_RW, .type = ARM_CP_64BIT | ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "CNTHCTL_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 1, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CNTVOFF_EL2", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 0, .opc2 = 3,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CNTVOFF", .cp = 15, .opc1 = 4, .crm = 14,
 -      .access = PL2_RW, .type = ARM_CP_64BIT | ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "CNTHP_CVAL_EL2", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 2, .opc2 = 2,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CNTHP_CVAL", .cp = 15, .opc1 = 6, .crm = 14,
 -      .access = PL2_RW, .type = ARM_CP_64BIT | ARM_CP_CONST,
 -      .resetvalue = 0 },
 -    { .name = "CNTHP_TVAL_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 2, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CNTHP_CTL_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 2, .opc2 = 1,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "MDCR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 1,
 -      .access = PL2_RW, .accessfn = access_tda,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "HPFAR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 6, .crm = 0, .opc2 = 4,
 -      .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "HSTR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 3,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "FAR_EL2", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 4, .crn = 6, .crm = 0, .opc2 = 0,
 -      .access = PL2_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "HIFAR", .state = ARM_CP_STATE_AA32,
 -      .type = ARM_CP_CONST,
 -      .cp = 15, .opc1 = 4, .crn = 6, .crm = 0, .opc2 = 2,
 -      .access = PL2_RW, .resetvalue = 0 },
 -};
 -
 -/* Ditto, but for registers which exist in ARMv8 but not v7 */
 -static const ARMCPRegInfo el3_no_el2_v8_cp_reginfo[] = {
 -    { .name = "HCR2", .state = ARM_CP_STATE_AA32,
 -      .cp = 15, .opc1 = 4, .crn = 1, .crm = 1, .opc2 = 4,
 -      .access = PL2_RW,
 -      .type = ARM_CP_CONST, .resetvalue = 0 },
 -};
 -
  static void do_hcr_write(CPUARMState *env, uint64_t value, uint64_t valid_mask)
  {
-     ARMCPU *cpu = env_archcpu(env);
+     MachineClass *mc = MACHINE_CLASS(oc);
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
          define_arm_cp_regs(cpu, v8_idregs);
          define_arm_cp_regs(cpu, v8_cp_reginfo);
      }
 -    if (arm_feature(env, ARM_FEATURE_EL2)) {
 +
 +    /*
 +     * Register the base EL2 cpregs.
 +     * Pre v8, these registers are implemented only as part of the
 +     * Virtualization Extensions (EL2 present).  Beginning with v8,
 +     * if EL2 is missing but EL3 is enabled, mostly these become
 +     * RES0 from EL3, with some specific exceptions.
 +     */
 +    if (arm_feature(env, ARM_FEATURE_EL2)
 +        || (arm_feature(env, ARM_FEATURE_EL3)
 +            && arm_feature(env, ARM_FEATURE_V8))) {
          uint64_t vmpidr_def = mpidr_read_val(env);
          ARMCPRegInfo vpidr_regs[] = {
              { .name = "VPIDR", .state = ARM_CP_STATE_AA32,
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
              };
              define_one_arm_cp_reg(cpu, &rvbar);
          }
 -    } else {
 -        /* If EL2 is missing but higher ELs are enabled, we need to
 -         * register the no_el2 reginfos.
 -         */
 -        if (arm_feature(env, ARM_FEATURE_EL3)) {
 -            /* When EL3 exists but not EL2, VPIDR and VMPIDR take the value
 -             * of MIDR_EL1 and MPIDR_EL1.
 -             */
 -            ARMCPRegInfo vpidr_regs[] = {
 -                { .name = "VPIDR_EL2", .state = ARM_CP_STATE_BOTH,
 -                  .opc0 = 3, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 0,
 -                  .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -                  .type = ARM_CP_CONST, .resetvalue = cpu->midr,
 -                  .fieldoffset = offsetof(CPUARMState, cp15.vpidr_el2) },
 -                { .name = "VMPIDR_EL2", .state = ARM_CP_STATE_BOTH,
 -                  .opc0 = 3, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 5,
 -                  .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -                  .type = ARM_CP_NO_RAW,
 -                  .writefn = arm_cp_write_ignore, .readfn = mpidr_read },
 -            };
 -            define_arm_cp_regs(cpu, vpidr_regs);
 -            define_arm_cp_regs(cpu, el3_no_el2_cp_reginfo);
 -            if (arm_feature(env, ARM_FEATURE_V8)) {
 -                define_arm_cp_regs(cpu, el3_no_el2_v8_cp_reginfo);
 -            }
 -        }
      }
 +
 +    /* Register the base EL3 cpregs. */
      if (arm_feature(env, ARM_FEATURE_EL3)) {
          define_arm_cp_regs(cpu, el3_cp_reginfo);
          ARMCPRegInfo el3_regs[] = {
 --
-.25.1
+.34.1
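
The registration condition in the old 03/32 hunk above can be restated as a small predicate. This is a standalone sketch rather than QEMU code: the Features struct and registers_base_el2_cpregs() are hypothetical stand-ins for the arm_feature(env, ARM_FEATURE_*) tests in register_cp_regs_for_features().

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the ARM_FEATURE_* bits tested via arm_feature(). */
typedef struct {
    bool el2;   /* ARM_FEATURE_EL2 */
    bool el3;   /* ARM_FEATURE_EL3 */
    bool v8;    /* ARM_FEATURE_V8 */
} Features;

/*
 * Mirrors the condition in the hunk: the base EL2 cpregs are registered
 * when EL2 exists, or when EL3 exists on a v8 CPU (where, per the comment
 * in the patch, they mostly become RES0 from EL3).
 */
static bool registers_base_el2_cpregs(Features f)
{
    return f.el2 || (f.el3 && f.v8);
}
```

A v7 CPU with EL3 but without EL2 yields false here, which corresponds to the behaviour change the commit message describes: the virtualization cpregs are no longer present on such CPUs.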

-[PULL 25/32] target/arm: Define neoverse-n1
+[PULL 08/36] hw/arm/stellaris: Constify read-only arrays
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Enable the n1 for virt and sbsa board use.
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250110160204.74997-3-philmd@linaro.org
 Message-id: 20220506180242.216785-25-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- docs/system/arm/virt.rst |  1 +
+ hw/arm/stellaris.c | 6 +++---
- hw/arm/sbsa-ref.c        |  1 +
+file changed, 3 insertions(+), 3 deletions(-)
  hw/arm/virt.c            |  1 +
  target/arm/cpu64.c       | 66 ++++++++++++++++++++++++++++++++++++++++
 files changed, 69 insertions(+)
-diff --git a/docs/system/arm/virt.rst b/docs/system/arm/virt.rst
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/virt.rst
+--- a/hw/arm/stellaris.c
-+++ b/docs/system/arm/virt.rst
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ Supported guest CPU types:
+@@ -XXX,XX +XXX,XX @@ static void ssys_update(ssys_state *s)
- - ``cortex-a76`` (64-bit)
+   qemu_set_irq(s->irq, (s->int_status & s->int_mask) != 0);
- - ``a64fx`` (64-bit)
+ }
- - ``host`` (with KVM only)
-+- ``neoverse-n1`` (64-bit)
+-static uint32_t pllcfg_sandstorm[16] = {
- - ``max`` (same as ``host`` for KVM; best possible emulation with TCG)
++static const uint32_t pllcfg_sandstorm[16] = {
+     0x31c0, /* 1 Mhz */
- Note that the default is ``cortex-a15``, so for an AArch64 guest you must
+     0x1ae0, /* 1.8432 Mhz */
-diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
+     0x18c0, /* 2 Mhz */
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static uint32_t pllcfg_sandstorm[16] = {
---- a/hw/arm/sbsa-ref.c
+     0x585b /* 8.192 Mhz */
 +++ b/hw/arm/sbsa-ref.c
@@ -XXX,XX +XXX,XX @@ static const char * const valid_cpus[] = {
      ARM_CPU_TYPE_NAME("cortex-a57"),
      ARM_CPU_TYPE_NAME("cortex-a72"),
      ARM_CPU_TYPE_NAME("cortex-a76"),
 +    ARM_CPU_TYPE_NAME("neoverse-n1"),
      ARM_CPU_TYPE_NAME("max"),
  };
-diff --git a/hw/arm/virt.c b/hw/arm/virt.c
+-static uint32_t pllcfg_fury[16] = {
-index XXXXXXX..XXXXXXX 100644
++static const uint32_t pllcfg_fury[16] = {
---- a/hw/arm/virt.c
+     0x3200, /* 1 Mhz */
-+++ b/hw/arm/virt.c
+     0x1b20, /* 1.8432 Mhz */
-@@ -XXX,XX +XXX,XX @@ static const char *valid_cpus[] = {
+     0x1900, /* 2 Mhz */
-     ARM_CPU_TYPE_NAME("cortex-a72"),
+@@ -XXX,XX +XXX,XX @@ static void stellaris_adc_init(Object *obj)
      ARM_CPU_TYPE_NAME("cortex-a76"),
      ARM_CPU_TYPE_NAME("a64fx"),
 +    ARM_CPU_TYPE_NAME("neoverse-n1"),
      ARM_CPU_TYPE_NAME("host"),
      ARM_CPU_TYPE_NAME("max"),
  };
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_a76_initfn(Object *obj)
      cpu->isar.mvfr2 = 0x00000043;
  }
-+static void aarch64_neoverse_n1_initfn(Object *obj)
+ /* Board init.  */
-+{
+-static stellaris_board_info stellaris_boards[] = {
-+    ARMCPU *cpu = ARM_CPU(obj);
++static const stellaris_board_info stellaris_boards[] = {
-+
+   { "LM3S811EVB",
-+    cpu->dtb_compatible = "arm,neoverse-n1";
+,
-+    set_feature(&cpu->env, ARM_FEATURE_V8);
+     0x0032000e,
 +    set_feature(&cpu->env, ARM_FEATURE_NEON);
 +    set_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER);
 +    set_feature(&cpu->env, ARM_FEATURE_AARCH64);
 +    set_feature(&cpu->env, ARM_FEATURE_CBAR_RO);
 +    set_feature(&cpu->env, ARM_FEATURE_EL2);
 +    set_feature(&cpu->env, ARM_FEATURE_EL3);
 +    set_feature(&cpu->env, ARM_FEATURE_PMU);
 +
 +    /* Ordered by B2.4 AArch64 registers by functional group */
 +    cpu->clidr = 0x82000023;
 +    cpu->ctr = 0x8444c004;
 +    cpu->dcz_blocksize = 4;
 +    cpu->isar.id_aa64dfr0  = 0x0000000110305408ull;
 +    cpu->isar.id_aa64isar0 = 0x0000100010211120ull;
 +    cpu->isar.id_aa64isar1 = 0x0000000000100001ull;
 +    cpu->isar.id_aa64mmfr0 = 0x0000000000101125ull;
 +    cpu->isar.id_aa64mmfr1 = 0x0000000010212122ull;
 +    cpu->isar.id_aa64mmfr2 = 0x0000000000001011ull;
 +    cpu->isar.id_aa64pfr0  = 0x1100000010111112ull; /* GIC filled in later */
 +    cpu->isar.id_aa64pfr1  = 0x0000000000000020ull;
 +    cpu->id_afr0       = 0x00000000;
 +    cpu->isar.id_dfr0  = 0x04010088;
 +    cpu->isar.id_isar0 = 0x02101110;
 +    cpu->isar.id_isar1 = 0x13112111;
 +    cpu->isar.id_isar2 = 0x21232042;
 +    cpu->isar.id_isar3 = 0x01112131;
 +    cpu->isar.id_isar4 = 0x00010142;
 +    cpu->isar.id_isar5 = 0x01011121;
 +    cpu->isar.id_isar6 = 0x00000010;
 +    cpu->isar.id_mmfr0 = 0x10201105;
 +    cpu->isar.id_mmfr1 = 0x40000000;
 +    cpu->isar.id_mmfr2 = 0x01260000;
 +    cpu->isar.id_mmfr3 = 0x02122211;
 +    cpu->isar.id_mmfr4 = 0x00021110;
 +    cpu->isar.id_pfr0  = 0x10010131;
 +    cpu->isar.id_pfr1  = 0x00010000; /* GIC filled in later */
 +    cpu->isar.id_pfr2  = 0x00000011;
 +    cpu->midr = 0x414fd0c1;          /* r4p1 */
 +    cpu->revidr = 0;
 +
 +    /* From B2.23 CCSIDR_EL1 */
 +    cpu->ccsidr[0] = 0x701fe01a; /* 64KB L1 dcache */
 +    cpu->ccsidr[1] = 0x201fe01a; /* 64KB L1 icache */
 +    cpu->ccsidr[2] = 0x70ffe03a; /* 1MB L2 cache */
 +
 +    /* From B2.98 SCTLR_EL3 */
 +    cpu->reset_sctlr = 0x30c50838;
 +
 +    /* From B4.23 ICH_VTR_EL2 */
 +    cpu->gic_num_lrs = 4;
 +    cpu->gic_vpribits = 5;
 +    cpu->gic_vprebits = 5;
 +
 +    /* From B5.1 AdvSIMD AArch64 register summary */
 +    cpu->isar.mvfr0 = 0x10110222;
 +    cpu->isar.mvfr1 = 0x13211111;
 +    cpu->isar.mvfr2 = 0x00000043;
 +}
 +
  void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
  {
      /*
@@ -XXX,XX +XXX,XX @@ static const ARMCPUInfo aarch64_cpus[] = {
      { .name = "cortex-a72",         .initfn = aarch64_a72_initfn },
      { .name = "cortex-a76",         .initfn = aarch64_a76_initfn },
      { .name = "a64fx",              .initfn = aarch64_a64fx_initfn },
 +    { .name = "neoverse-n1",        .initfn = aarch64_neoverse_n1_initfn },
      { .name = "max",                .initfn = aarch64_max_initfn },
  #if defined(CONFIG_KVM) || defined(CONFIG_HVF)
      { .name = "host",               .initfn = aarch64_host_initfn },
 --
-.25.1
+.34.1
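
The cache-size comments next to the ccsidr[] values in the neoverse-n1 hunk can be cross-checked by decoding the register fields. The sketch below assumes the 32-bit CCSIDR layout used when FEAT_CCIDX is absent (LineSize in bits [2:0], Associativity in bits [12:3], NumSets in bits [27:13]); ccsidr_cache_bytes() is a hypothetical helper, not a QEMU function.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Decode a 32-bit CCSIDR value (layout without FEAT_CCIDX, assumed here)
 * and return the cache size in bytes.
 */
static uint32_t ccsidr_cache_bytes(uint32_t ccsidr)
{
    /* LineSize encodes log2(bytes per line) - 4. */
    uint32_t line_bytes = 1u << ((ccsidr & 0x7) + 4);
    /* Associativity and NumSets are both stored minus one. */
    uint32_t ways = ((ccsidr >> 3) & 0x3ff) + 1;
    uint32_t sets = ((ccsidr >> 13) & 0x7fff) + 1;

    return line_bytes * ways * sets;
}
```

Under that assumption, 0x701fe01a and 0x201fe01a both decode to 64 bytes x 4 ways x 256 sets, and 0x70ffe03a decodes to 64 bytes x 8 ways x 2048 sets, matching the 64KB and 1MB comments in the hunk.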

-[PULL 28/32] qtest/numa-test: Specify CPU topology in aarch64_numa_cpu()
+[PULL 09/36] hw/arm/stellaris: Remove incorrect unimplemented i2c-0 at 0x40002000
-From: Gavin Shan <gshan@redhat.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-The CPU topology isn't enabled on arm/virt machine yet, but we're
+There is nothing mapped at 0x40002000.
 going to do it in next patch. After the CPU topology is enabled by
 next patch, "thread-id=1" becomes invalid because the CPU core is
 preferred on arm/virt machine. It means these two CPUs have 0/1
 as their core IDs, but their thread IDs are all 0. It will trigger
 test failure as the following message indicates:
-  [14/21 qemu:qtest+qtest-aarch64 / qtest-aarch64/numa-test  ERROR
+I2C#0 is already mapped at 0x40021000.
 .48s   killed by signal 6 SIGABRT
   >>> G_TEST_DBUS_DAEMON=/home/gavin/sandbox/qemu.main/tests/dbus-vmstate-daemon.sh \
       QTEST_QEMU_STORAGE_DAEMON_BINARY=./storage-daemon/qemu-storage-daemon         \
       QTEST_QEMU_BINARY=./qemu-system-aarch64                                       \
       QTEST_QEMU_IMG=./qemu-img MALLOC_PERTURB_=83                                  \
       /home/gavin/sandbox/qemu.main/build/tests/qtest/numa-test --tap -k
   ――――――――――――――――――――――――――――――――――――――――――――――
   stderr:
   qemu-system-aarch64: -numa cpu,node-id=0,thread-id=1: no match found
-This fixes the issue by providing comprehensive SMP configurations
+Remove the invalid mapping added in commits aecfbbc97a2 & 394c8bbfb7a.
 in aarch64_numa_cpu(). The SMP configurations aren't used before
 the CPU topology is enabled in next patch.
-Signed-off-by: Gavin Shan <gshan@redhat.com>
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Reviewed-by: Yanan Wang <wangyanan55@huawei.com>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Message-id: 20220503140304.855514-3-gshan@redhat.com
+Message-id: 20250110160204.74997-4-philmd@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- tests/qtest/numa-test.c | 3 ++-
+ hw/arm/stellaris.c | 2 --
-file changed, 2 insertions(+), 1 deletion(-)
+file changed, 2 deletions(-)
-diff --git a/tests/qtest/numa-test.c b/tests/qtest/numa-test.c
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/tests/qtest/numa-test.c
+--- a/hw/arm/stellaris.c
-+++ b/tests/qtest/numa-test.c
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_numa_cpu(const void *data)
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     QTestState *qts;
+      * http://www.ti.com/lit/ds/symlink/lm3s6965.pdf
-     g_autofree char *cli = NULL;
+      *
+      * 40000000 wdtimer
--    cli = make_cli(data, "-machine smp.cpus=2 "
+-     * 40002000 i2c (unimplemented)
-+    cli = make_cli(data, "-machine "
+      * 40004000 GPIO
-+        "smp.cpus=2,smp.sockets=1,smp.clusters=1,smp.cores=1,smp.threads=2 "
+      * 40005000 GPIO
-         "-numa node,nodeid=0,memdev=ram -numa node,nodeid=1 "
+      * 40006000 GPIO
-         "-numa cpu,node-id=1,thread-id=0 "
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-         "-numa cpu,node-id=0,thread-id=1");
+     /* Add dummy regions for the devices we don't implement yet,
       * so guest accesses don't cause unlogged crashes.
       */
 -    create_unimplemented_device("i2c-0", 0x40002000, 0x1000);
      create_unimplemented_device("i2c-2", 0x40021000, 0x1000);
      create_unimplemented_device("PWM", 0x40028000, 0x1000);
      create_unimplemented_device("QEI-0", 0x4002c000, 0x1000);
 --
-.25.1
+.34.1

-[PULL 32/32] hw/acpi/aml-build: Use existing CPU topology to build PPTT table
+[PULL 10/36] hw/arm/stellaris: Replace magic numbers by definitions
-From: Gavin Shan <gshan@redhat.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-When the PPTT table is built, the CPU topology is re-calculated, but
+Add definitions for the number of controllers.
 it's unnecessary because the CPU topology has been populated in
 virt_possible_cpu_arch_ids() on arm/virt machine.
-This reworks build_pptt() to avoid that by reusing the existing IDs in
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-ms->possible_cpus. Currently, the only user of build_pptt() is
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-arm/virt machine.
+Message-id: 20250110160204.74997-5-philmd@linaro.org
 Signed-off-by: Gavin Shan <gshan@redhat.com>
 Tested-by: Yanan Wang <wangyanan55@huawei.com>
 Reviewed-by: Yanan Wang <wangyanan55@huawei.com>
 Acked-by: Igor Mammedov <imammedo@redhat.com>
 Acked-by: Michael S. Tsirkin <mst@redhat.com>
 Message-id: 20220503140304.855514-7-gshan@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/acpi/aml-build.c | 111 +++++++++++++++++++-------------------------
+ hw/arm/stellaris.c | 25 +++++++++++++++----------
-file changed, 48 insertions(+), 63 deletions(-)
+file changed, 15 insertions(+), 10 deletions(-)
-diff --git a/hw/acpi/aml-build.c b/hw/acpi/aml-build.c
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/acpi/aml-build.c
+--- a/hw/arm/stellaris.c
-+++ b/hw/acpi/aml-build.c
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ void build_pptt(GArray *table_data, BIOSLinker *linker, MachineState *ms,
+@@ -XXX,XX +XXX,XX @@
-                 const char *oem_id, const char *oem_table_id)
+ #define NUM_IRQ_LINES 64
  #define NUM_PRIO_BITS 3
 +#define NUM_GPIO    7
 +#define NUM_UART    4
 +#define NUM_GPTM    4
 +#define NUM_I2C     2
 +
  typedef const struct {
      const char *name;
      uint32_t did0;
@@ -XXX,XX +XXX,XX @@ static const stellaris_board_info stellaris_boards[] = {
  static void stellaris_init(MachineState *ms, stellaris_board_info *board)
  {
-     MachineClass *mc = MACHINE_GET_CLASS(ms);
+-    static const int uart_irq[] = {5, 6, 33, 34};
--    GQueue *list = g_queue_new();
+-    static const int timer_irq[] = {19, 21, 23, 35};
--    guint pptt_start = table_data->len;
+-    static const uint32_t gpio_addr[7] =
--    guint parent_offset;
++    static const int uart_irq[NUM_UART] = {5, 6, 33, 34};
--    guint length, i;
++    static const int timer_irq[NUM_GPTM] = {19, 21, 23, 35};
--    int uid = 0;
++    static const uint32_t gpio_addr[NUM_GPIO] =
--    int socket;
+       { 0x40004000, 0x40005000, 0x40006000, 0x40007000,
-+    CPUArchIdList *cpus = ms->possible_cpus;
+         0x40024000, 0x40025000, 0x40026000};
-+    int64_t socket_id = -1, cluster_id = -1, core_id = -1;
+-    static const int gpio_irq[7] = {0, 1, 2, 3, 4, 30, 31};
-+    uint32_t socket_offset = 0, cluster_offset = 0, core_offset = 0;
++    static const int gpio_irq[NUM_GPIO] = {0, 1, 2, 3, 4, 30, 31};
-+    uint32_t pptt_start = table_data->len;
-+    int n;
+     /* Memory map of SoC devices, from
-     AcpiTable table = { .sig = "PPTT", .rev = 2,
+      * Stellaris LM3S6965 Microcontroller Data Sheet (rev I)
-                         .oem_id = oem_id, .oem_table_id = oem_table_id };
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
+      */
-     acpi_table_begin(&table, table_data);
+     Object *soc_container;
--    for (socket = 0; socket < ms->smp.sockets; socket++) {
+-    DeviceState *gpio_dev[7], *armv7m, *nvic;
--        g_queue_push_tail(list,
+-    qemu_irq gpio_in[7][8];
--            GUINT_TO_POINTER(table_data->len - pptt_start));
+-    qemu_irq gpio_out[7][8];
--        build_processor_hierarchy_node(
++    DeviceState *gpio_dev[NUM_GPIO], *armv7m, *nvic;
--            table_data,
++    qemu_irq gpio_in[NUM_GPIO][8];
--            /*
++    qemu_irq gpio_out[NUM_GPIO][8];
--             * Physical package - represents the boundary
+     qemu_irq adc;
--             * of a physical package
+     int sram_size;
--             */
+     int flash_size;
--            (1 << 0),
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
--            0, socket, NULL, 0);
+     } else {
--    }
+         adc = NULL;
--
+     }
--    if (mc->smp_props.clusters_supported) {
+-    for (i = 0; i < 4; i++) {
--        length = g_queue_get_length(list);
++    for (i = 0; i < NUM_GPTM; i++) {
--        for (i = 0; i < length; i++) {
+         if (board->dc2 & (0x10000 << i)) {
--            int cluster;
+             SysBusDevice *sbd;
--
--            parent_offset = GPOINTER_TO_UINT(g_queue_pop_head(list));
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
--            for (cluster = 0; cluster < ms->smp.clusters; cluster++) {
+     }
--                g_queue_push_tail(list,
--                    GUINT_TO_POINTER(table_data->len - pptt_start));
--                build_processor_hierarchy_node(
+-    for (i = 0; i < 7; i++) {
--                    table_data,
++    for (i = 0; i < NUM_GPIO; i++) {
--                    (0 << 0), /* not a physical package */
+         if (board->dc4 & (1 << i)) {
--                    parent_offset, cluster, NULL, 0);
+             gpio_dev[i] = sysbus_create_simple("pl061_luminary", gpio_addr[i],
--            }
+                                                qdev_get_gpio_in(nvic,
-+    /*
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
 +     * This works with the assumption that cpus[n].props.*_id has been
 +     * sorted from top to down levels in mc->possible_cpu_arch_ids().
 +     * Otherwise, the unexpected and duplicated containers will be
 +     * created.
 +     */
 +    for (n = 0; n < cpus->len; n++) {
 +        if (cpus->cpus[n].props.socket_id != socket_id) {
 +            assert(cpus->cpus[n].props.socket_id > socket_id);
 +            socket_id = cpus->cpus[n].props.socket_id;
 +            cluster_id = -1;
 +            core_id = -1;
 +            socket_offset = table_data->len - pptt_start;
 +            build_processor_hierarchy_node(table_data,
 +                (1 << 0), /* Physical package */
 +                0, socket_id, NULL, 0);
          }
 -    }
 -    length = g_queue_get_length(list);
 -    for (i = 0; i < length; i++) {
 -        int core;
 -
 -        parent_offset = GPOINTER_TO_UINT(g_queue_pop_head(list));
 -        for (core = 0; core < ms->smp.cores; core++) {
 -            if (ms->smp.threads > 1) {
 -                g_queue_push_tail(list,
 -                    GUINT_TO_POINTER(table_data->len - pptt_start));
 -                build_processor_hierarchy_node(
 -                    table_data,
 -                    (0 << 0), /* not a physical package */
 -                    parent_offset, core, NULL, 0);
 -            } else {
 -                build_processor_hierarchy_node(
 -                    table_data,
 -                    (1 << 1) | /* ACPI Processor ID valid */
 -                    (1 << 3),  /* Node is a Leaf */
 -                    parent_offset, uid++, NULL, 0);
 +        if (mc->smp_props.clusters_supported) {
 +            if (cpus->cpus[n].props.cluster_id != cluster_id) {
 +                assert(cpus->cpus[n].props.cluster_id > cluster_id);
 +                cluster_id = cpus->cpus[n].props.cluster_id;
 +                core_id = -1;
 +                cluster_offset = table_data->len - pptt_start;
 +                build_processor_hierarchy_node(table_data,
 +                    (0 << 0), /* Not a physical package */
 +                    socket_offset, cluster_id, NULL, 0);
              }
 +        } else {
 +            cluster_offset = socket_offset;
          }
 -    }
 -    length = g_queue_get_length(list);
 -    for (i = 0; i < length; i++) {
 -        int thread;
 +        if (ms->smp.threads == 1) {
 +            build_processor_hierarchy_node(table_data,
 +                (1 << 1) | /* ACPI Processor ID valid */
 +                (1 << 3),  /* Node is a Leaf */
 +                cluster_offset, n, NULL, 0);
 +        } else {
 +            if (cpus->cpus[n].props.core_id != core_id) {
 +                assert(cpus->cpus[n].props.core_id > core_id);
 +                core_id = cpus->cpus[n].props.core_id;
 +                core_offset = table_data->len - pptt_start;
 +                build_processor_hierarchy_node(table_data,
 +                    (0 << 0), /* Not a physical package */
 +                    cluster_offset, core_id, NULL, 0);
 +            }
 -        parent_offset = GPOINTER_TO_UINT(g_queue_pop_head(list));
 -        for (thread = 0; thread < ms->smp.threads; thread++) {
 -            build_processor_hierarchy_node(
 -                table_data,
 +            build_processor_hierarchy_node(table_data,
                  (1 << 1) | /* ACPI Processor ID valid */
                  (1 << 2) | /* Processor is a Thread */
                  (1 << 3),  /* Node is a Leaf */
 -                parent_offset, uid++, NULL, 0);
 +                core_offset, n, NULL, 0);
          }
      }
--    g_queue_free(list);
+-    for (i = 0; i < 4; i++) {
-     acpi_table_end(linker, &table);
++    for (i = 0; i < NUM_UART; i++) {
- }
+         if (board->dc2 & (1 << i)) {
              SysBusDevice *sbd;
 --
-.25.1
+.34.1
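
The reworked build_pptt() loop in the old 32/32 hunk emits a container node each time a socket, cluster, or core ID changes while walking a sorted CPU list. The sketch below illustrates only that change-detection walk by counting nodes; CpuIds, count_hierarchy_nodes(), and example_cpus are hypothetical names, and the real code writes ACPI table entries instead of incrementing a counter.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the IDs held in cpus->cpus[n].props. */
typedef struct {
    int64_t socket_id;
    int64_t cluster_id;
    int64_t core_id;
} CpuIds;

/* One socket, one cluster, two cores, two threads per core (sorted). */
static const CpuIds example_cpus[4] = {
    { 0, 0, 0 }, { 0, 0, 0 }, { 0, 0, 1 }, { 0, 0, 1 },
};

/*
 * Count the hierarchy nodes the walk would emit. The input must be sorted
 * from the top level down, as the comment in the patch requires.
 */
static int count_hierarchy_nodes(const CpuIds *cpus, int len,
                                 bool clusters_supported, int threads)
{
    int64_t socket_id = -1, cluster_id = -1, core_id = -1;
    int nodes = 0;

    for (int n = 0; n < len; n++) {
        if (cpus[n].socket_id != socket_id) {
            socket_id = cpus[n].socket_id;
            cluster_id = -1;    /* lower levels restart under a new socket */
            core_id = -1;
            nodes++;            /* socket container */
        }
        if (clusters_supported && cpus[n].cluster_id != cluster_id) {
            cluster_id = cpus[n].cluster_id;
            core_id = -1;
            nodes++;            /* cluster container */
        }
        if (threads > 1 && cpus[n].core_id != core_id) {
            core_id = cpus[n].core_id;
            nodes++;            /* core container, only when threads exist */
        }
        nodes++;                /* leaf: a core (threads == 1) or a thread */
    }
    return nodes;
}
```

For example_cpus with clusters supported and two threads per core, the walk yields one socket, one cluster, two core containers, and four thread leaves.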

-[PULL 02/32] target/arm: Handle cpreg registration for missing EL
+[PULL 11/36] hw/arm/stellaris: Use DEVCAP macro to access DeviceCapability registers
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-More gracefully handle cpregs when EL2 and/or EL3 are missing.
+Add definitions (DCx_periph) for the DeviceCapability bits,
-If the reg is entirely inaccessible, do not register it at all.
+replace direct bitmask checks with the DEV_CAP() macro,
-If the reg is for EL2, and EL3 is present but EL2 is not,
+which use the extract/deposit API.
 either discard, squash to res0, const, or keep unchanged.
-Per rule RJFFP, mark the 4 aarch32 hypervisor access registers
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 with ARM_CP_EL3_NO_EL2_KEEP, and mark all of the EL2 address
 translation and tlb invalidation "regs" ARM_CP_EL3_NO_EL2_UNDEF.
 Mark the 2 virtualization processor id regs ARM_CP_EL3_NO_EL2_C_NZ.
 This will simplify cpreg registration for conditional arm features.
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250110160204.74997-6-philmd@linaro.org
 Message-id: 20220506180242.216785-2-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpregs.h |  11 +++
+ hw/arm/stellaris.c | 37 +++++++++++++++++++++++++++++--------
- target/arm/helper.c | 178 ++++++++++++++++++++++++++++++--------------
+file changed, 29 insertions(+), 8 deletions(-)
 files changed, 133 insertions(+), 56 deletions(-)
-diff --git a/target/arm/cpregs.h b/target/arm/cpregs.h
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpregs.h
+--- a/hw/arm/stellaris.c
-+++ b/target/arm/cpregs.h
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ enum {
+@@ -XXX,XX +XXX,XX @@
-     ARM_CP_SVE                   = 1 << 14,
+  */
-     /* Flag: Do not expose in gdb sysreg xml. */
-     ARM_CP_NO_GDB                = 1 << 15,
+ #include "qemu/osdep.h"
-+    /*
++#include "qemu/bitops.h"
-+     * Flags: If EL3 but not EL2...
+ #include "qapi/error.h"
-+     *   - UNDEF: discard the cpreg,
+ #include "hw/core/split-irq.h"
-+     *   -  KEEP: retain the cpreg as is,
+ #include "hw/sysbus.h"
-+     *   -  C_NZ: set const on the cpreg, but retain resetvalue,
+@@ -XXX,XX +XXX,XX @@
-+     *   -  else: set const on the cpreg, zero resetvalue, aka RES0.
+ #define NUM_GPTM    4
-+     * See rule RJFFP in section D1.1.3 of DDI0487H.a.
+ #define NUM_I2C     2
-+     */
-+    ARM_CP_EL3_NO_EL2_UNDEF      = 1 << 16,
++/*
-+    ARM_CP_EL3_NO_EL2_KEEP       = 1 << 17,
++ * See Stellaris Data Sheet chapter 5.2.5 "System Control",
-+    ARM_CP_EL3_NO_EL2_C_NZ       = 1 << 18,
++ * Register 13 .. 17: Device Capabilities 0 .. 4 (DC0 .. DC4).
- };
++ */
++#define DC1_WDT          3
- /*
++#define DC1_HIB          6
-diff --git a/target/arm/helper.c b/target/arm/helper.c
++#define DC1_MPU          7
-index XXXXXXX..XXXXXXX 100644
++#define DC1_ADC          16
---- a/target/arm/helper.c
++#define DC1_PWM          20
-+++ b/target/arm/helper.c
++#define DC2_UART(n)     (n)
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
++#define DC2_SSI          4
-       .access = PL1_RW, .readfn = spsel_read, .writefn = spsel_write },
++#define DC2_QEI(n)      (8 + n)
-     { .name = "FPEXC32_EL2", .state = ARM_CP_STATE_AA64,
++#define DC2_I2C(n)      (12 + 2 * n)
-       .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 3, .opc2 = 0,
++#define DC2_GPTM(n)     (16 + n)
--      .access = PL2_RW, .type = ARM_CP_ALIAS | ARM_CP_FPU,
++#define DC2_COMP(n)     (24 + n)
-+      .access = PL2_RW,
++#define DC4_GPIO(n)     (n)
-+      .type = ARM_CP_ALIAS | ARM_CP_FPU | ARM_CP_EL3_NO_EL2_KEEP,
++#define DC4_EMAC         28
-       .fieldoffset = offsetof(CPUARMState, vfp.xregs[ARM_VFP_FPEXC]) },
++
-     { .name = "DACR32_EL2", .state = ARM_CP_STATE_AA64,
++#define DEV_CAP(_dc, _cap) extract32(board->dc##_dc, DC##_dc##_##_cap, 1)
-       .opc0 = 3, .opc1 = 4, .crn = 3, .crm = 0, .opc2 = 0,
++
--      .access = PL2_RW, .resetvalue = 0,
+ typedef const struct {
-+      .access = PL2_RW, .resetvalue = 0, .type = ARM_CP_EL3_NO_EL2_KEEP,
+     const char *name;
-       .writefn = dacr_write, .raw_writefn = raw_write,
+     uint32_t did0;
-       .fieldoffset = offsetof(CPUARMState, cp15.dacr32_el2) },
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     { .name = "IFSR32_EL2", .state = ARM_CP_STATE_AA64,
+     sysbus_mmio_map(SYS_BUS_DEVICE(ssys_dev), 0, 0x400fe000);
-       .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 0, .opc2 = 1,
+     sysbus_connect_irq(SYS_BUS_DEVICE(ssys_dev), 0, qdev_get_gpio_in(nvic, 28));
--      .access = PL2_RW, .resetvalue = 0,
-+      .access = PL2_RW, .resetvalue = 0, .type = ARM_CP_EL3_NO_EL2_KEEP,
+-    if (board->dc1 & (1 << 16)) {
-       .fieldoffset = offsetof(CPUARMState, cp15.ifsr32_el2) },
++    if (DEV_CAP(1, ADC)) {
-     { .name = "SPSR_IRQ", .state = ARM_CP_STATE_AA64,
+         dev = sysbus_create_varargs(TYPE_STELLARIS_ADC, 0x40038000,
-       .type = ARM_CP_ALIAS,
+                                     qdev_get_gpio_in(nvic, 14),
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
+                                     qdev_get_gpio_in(nvic, 15),
-       .writefn = tlbimva_hyp_is_write },
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     { .name = "TLBI_ALLE2", .state = ARM_CP_STATE_AA64,
+         adc = NULL;
-       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 7, .opc2 = 0,
+     }
--      .type = ARM_CP_NO_RAW, .access = PL2_W,
+     for (i = 0; i < NUM_GPTM; i++) {
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
+-        if (board->dc2 & (0x10000 << i)) {
-       .writefn = tlbi_aa64_alle2_write },
++        if (DEV_CAP(2, GPTM(i))) {
-     { .name = "TLBI_VAE2", .state = ARM_CP_STATE_AA64,
+             SysBusDevice *sbd;
-       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 7, .opc2 = 1,
--      .type = ARM_CP_NO_RAW, .access = PL2_W,
+             dev = qdev_new(TYPE_STELLARIS_GPTM);
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
        .writefn = tlbi_aa64_vae2_write },
      { .name = "TLBI_VALE2", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 7, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_vae2_write },
      { .name = "TLBI_ALLE2IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 3, .opc2 = 0,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_alle2is_write },
      { .name = "TLBI_VAE2IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 3, .opc2 = 1,
 -      .type = ARM_CP_NO_RAW, .access = PL2_W,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_vae2is_write },
      { .name = "TLBI_VALE2IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 3, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_vae2is_write },
  #ifndef CONFIG_USER_ONLY
      /* Unlike the other EL2-related AT operations, these must
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
      { .name = "AT_S1E2R", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 0,
        .access = PL2_W, .accessfn = at_s1e2_access,
 -      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
 +      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC | ARM_CP_EL3_NO_EL2_UNDEF,
 +      .writefn = ats_write64 },
      { .name = "AT_S1E2W", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 1,
        .access = PL2_W, .accessfn = at_s1e2_access,
 -      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
 +      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC | ARM_CP_EL3_NO_EL2_UNDEF,
 +      .writefn = ats_write64 },
      /* The AArch32 ATS1H* operations are CONSTRAINED UNPREDICTABLE
       * if EL2 is not implemented; we choose to UNDEF. Behaviour at EL3
       * with SCR.NS == 0 outside Monitor mode is UNPREDICTABLE; we choose
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo debug_cp_reginfo[] = {
      { .name = "DBGVCR32_EL2", .state = ARM_CP_STATE_AA64,
        .opc0 = 2, .opc1 = 4, .crn = 0, .crm = 7, .opc2 = 0,
        .access = PL2_RW, .accessfn = access_tda,
 -      .type = ARM_CP_NOP },
 +      .type = ARM_CP_NOP | ARM_CP_EL3_NO_EL2_KEEP },
      /* Dummy MDCCINT_EL1, since we don't implement the Debug Communications
       * Channel but Linux may try to access this register. The 32-bit
       * alias is DBGDCCINT.
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo tlbirange_reginfo[] = {
        .access = PL2_W, .type = ARM_CP_NOP },
      { .name = "TLBI_RVAE2IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 2, .opc2 = 1,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2is_write },
     { .name = "TLBI_RVALE2IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 2, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2is_write },
      { .name = "TLBI_RIPAS2E1", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 2,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo tlbirange_reginfo[] = {
        .access = PL2_W, .type = ARM_CP_NOP },
     { .name = "TLBI_RVAE2OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 5, .opc2 = 1,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2is_write },
     { .name = "TLBI_RVALE2OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 5, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2is_write },
      { .name = "TLBI_RVAE2", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 6, .opc2 = 1,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2_write },
     { .name = "TLBI_RVALE2", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 6, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_rvae2_write },
     { .name = "TLBI_RVAE3IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 6, .crn = 8, .crm = 2, .opc2 = 1,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo tlbios_reginfo[] = {
        .writefn = tlbi_aa64_vae1is_write },
      { .name = "TLBI_ALLE2OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 1, .opc2 = 0,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_alle2is_write },
      { .name = "TLBI_VAE2OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 1, .opc2 = 1,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_vae2is_write },
     { .name = "TLBI_ALLE1OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 1, .opc2 = 4,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo tlbios_reginfo[] = {
        .writefn = tlbi_aa64_alle1is_write },
      { .name = "TLBI_VALE2OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 1, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 +      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_EL3_NO_EL2_UNDEF,
        .writefn = tlbi_aa64_vae2is_write },
      { .name = "TLBI_VMALLS12E1OS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 1, .opc2 = 6,
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
              { .name = "VPIDR", .state = ARM_CP_STATE_AA32,
                .cp = 15, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 0,
                .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -              .resetvalue = cpu->midr, .type = ARM_CP_ALIAS,
 +              .resetvalue = cpu->midr,
 +              .type = ARM_CP_ALIAS | ARM_CP_EL3_NO_EL2_C_NZ,
                .fieldoffset = offsetoflow32(CPUARMState, cp15.vpidr_el2) },
              { .name = "VPIDR_EL2", .state = ARM_CP_STATE_AA64,
                .opc0 = 3, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 0,
                .access = PL2_RW, .resetvalue = cpu->midr,
 +              .type = ARM_CP_EL3_NO_EL2_C_NZ,
                .fieldoffset = offsetof(CPUARMState, cp15.vpidr_el2) },
              { .name = "VMPIDR", .state = ARM_CP_STATE_AA32,
                .cp = 15, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 5,
                .access = PL2_RW, .accessfn = access_el3_aa32ns,
 -              .resetvalue = vmpidr_def, .type = ARM_CP_ALIAS,
 +              .resetvalue = vmpidr_def,
 +              .type = ARM_CP_ALIAS | ARM_CP_EL3_NO_EL2_C_NZ,
                .fieldoffset = offsetoflow32(CPUARMState, cp15.vmpidr_el2) },
              { .name = "VMPIDR_EL2", .state = ARM_CP_STATE_AA64,
                .opc0 = 3, .opc1 = 4, .crn = 0, .crm = 0, .opc2 = 5,
 -              .access = PL2_RW,
 -              .resetvalue = vmpidr_def,
 +              .access = PL2_RW, .resetvalue = vmpidr_def,
 +              .type = ARM_CP_EL3_NO_EL2_C_NZ,
                .fieldoffset = offsetof(CPUARMState, cp15.vmpidr_el2) },
          };
          define_arm_cp_regs(cpu, vpidr_regs);
@@ -XXX,XX +XXX,XX @@ static void add_cpreg_to_hashtable(ARMCPU *cpu, const ARMCPRegInfo *r,
                                     int crm, int opc1, int opc2,
                                     const char *name)
  {
 +    CPUARMState *env = &cpu->env;
      uint32_t key;
      ARMCPRegInfo *r2;
      bool is64 = r->type & ARM_CP_64BIT;
      bool ns = secstate & ARM_CP_SECSTATE_NS;
      int cp = r->cp;
 -    bool isbanked;
      size_t name_len;
 +    bool make_const;
      switch (state) {
      case ARM_CP_STATE_AA32:
@@ -XXX,XX +XXX,XX @@ static void add_cpreg_to_hashtable(ARMCPU *cpu, const ARMCPRegInfo *r,
          }
      }
-+    /*
+-    if (board->dc1 & (1 << 3)) { /* watchdog present */
-+     * Eliminate registers that are not present because the EL is missing.
++    if (DEV_CAP(1, WDT)) {
-+     * Doing this here makes it easier to put all registers for a given
+         dev = qdev_new(TYPE_LUMINARY_WATCHDOG);
-+     * feature into the same ARMCPRegInfo array and define them all at once.
+         object_property_add_child(soc_container, "wdg", OBJECT(dev));
-+     */
+         qdev_connect_clock_in(dev, "WDOGCLK",
-+    make_const = false;
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-+    if (arm_feature(env, ARM_FEATURE_EL3)) {
-+        /*
-+         * An EL2 register without EL2 but with EL3 is (usually) RES0.
+     for (i = 0; i < NUM_GPIO; i++) {
-+         * See rule RJFFP in section D1.1.3 of DDI0487H.a.
+-        if (board->dc4 & (1 << i)) {
-+         */
++        if (DEV_CAP(4, GPIO(i))) {
-+        int min_el = ctz32(r->access) / 2;
+             gpio_dev[i] = sysbus_create_simple("pl061_luminary", gpio_addr[i],
-+        if (min_el == 2 && !arm_feature(env, ARM_FEATURE_EL2)) {
+                                                qdev_get_gpio_in(nvic,
-+            if (r->type & ARM_CP_EL3_NO_EL2_UNDEF) {
+                                                                 gpio_irq[i]));
-+                return;
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
 +            }
 +            make_const = !(r->type & ARM_CP_EL3_NO_EL2_KEEP);
 +        }
 +    } else {
 +        CPAccessRights max_el = (arm_feature(env, ARM_FEATURE_EL2)
 +                                 ? PL2_RW : PL1_RW);
 +        if ((r->access & max_el) == 0) {
 +            return;
 +        }
 +    }
 +
      /* Combine cpreg and name into one allocation. */
      name_len = strlen(name) + 1;
      r2 = g_malloc(sizeof(*r2) + name_len);
@@ -XXX,XX +XXX,XX @@ static void add_cpreg_to_hashtable(ARMCPU *cpu, const ARMCPRegInfo *r,
          r2->opaque = opaque;
      }
 -    isbanked = r->bank_fieldoffsets[0] && r->bank_fieldoffsets[1];
 -    if (isbanked) {
 +    if (make_const) {
 +        /* This should not have been a very special register to begin. */
 +        int old_special = r2->type & ARM_CP_SPECIAL_MASK;
 +        assert(old_special == 0 || old_special == ARM_CP_NOP);
          /*
 -         * Register is banked (using both entries in array).
 -         * Overwriting fieldoffset as the array is only used to define
 -         * banked registers but later only fieldoffset is used.
 +         * Set the special function to CONST, retaining the other flags.
 +         * This is important for e.g. ARM_CP_SVE so that we still
 +         * take the SVE trap if CPTR_EL3.EZ == 0.
           */
 -        r2->fieldoffset = r->bank_fieldoffsets[ns];
 -    }
 +        r2->type = (r2->type & ~ARM_CP_SPECIAL_MASK) | ARM_CP_CONST;
 +        /*
 +         * Usually, these registers become RES0, but there are a few
 +         * special cases like VPIDR_EL2 which have a constant non-zero
 +         * value with writes ignored.
 +         */
 +        if (!(r->type & ARM_CP_EL3_NO_EL2_C_NZ)) {
 +            r2->resetvalue = 0;
 +        }
 +        /*
 +         * ARM_CP_CONST has precedence, so removing the callbacks and
 +         * offsets are not strictly necessary, but it is potentially
 +         * less confusing to debug later.
 +         */
 +        r2->readfn = NULL;
 +        r2->writefn = NULL;
 +        r2->raw_readfn = NULL;
 +        r2->raw_writefn = NULL;
 +        r2->resetfn = NULL;
 +        r2->fieldoffset = 0;
 +        r2->bank_fieldoffsets[0] = 0;
 +        r2->bank_fieldoffsets[1] = 0;
 +    } else {
 +        bool isbanked = r->bank_fieldoffsets[0] && r->bank_fieldoffsets[1];
 -    if (state == ARM_CP_STATE_AA32) {
          if (isbanked) {
              /*
 -             * If the register is banked then we don't need to migrate or
 -             * reset the 32-bit instance in certain cases:
 -             *
 -             * 1) If the register has both 32-bit and 64-bit instances then we
 -             *    can count on the 64-bit instance taking care of the
 -             *    non-secure bank.
 -             * 2) If ARMv8 is enabled then we can count on a 64-bit version
 -             *    taking care of the secure bank.  This requires that separate
 -             *    32 and 64-bit definitions are provided.
 +             * Register is banked (using both entries in array).
 +             * Overwriting fieldoffset as the array is only used to define
 +             * banked registers but later only fieldoffset is used.
               */
 -            if ((r->state == ARM_CP_STATE_BOTH && ns) ||
 -                (arm_feature(&cpu->env, ARM_FEATURE_V8) && !ns)) {
 +            r2->fieldoffset = r->bank_fieldoffsets[ns];
 +        }
 +        if (state == ARM_CP_STATE_AA32) {
 +            if (isbanked) {
 +                /*
 +                 * If the register is banked then we don't need to migrate or
 +                 * reset the 32-bit instance in certain cases:
 +                 *
 +                 * 1) If the register has both 32-bit and 64-bit instances
 +                 *    then we can count on the 64-bit instance taking care
 +                 *    of the non-secure bank.
 +                 * 2) If ARMv8 is enabled then we can count on a 64-bit
 +                 *    version taking care of the secure bank.  This requires
 +                 *    that separate 32 and 64-bit definitions are provided.
 +                 */
 +                if ((r->state == ARM_CP_STATE_BOTH && ns) ||
 +                    (arm_feature(env, ARM_FEATURE_V8) && !ns)) {
 +                    r2->type |= ARM_CP_ALIAS;
 +                }
 +            } else if ((secstate != r->secure) && !ns) {
 +                /*
 +                 * The register is not banked so we only want to allow
 +                 * migration of the non-secure instance.
 +                 */
                  r2->type |= ARM_CP_ALIAS;
              }
 -        } else if ((secstate != r->secure) && !ns) {
 -            /*
 -             * The register is not banked so we only want to allow migration
 -             * of the non-secure instance.
 -             */
 -            r2->type |= ARM_CP_ALIAS;
 -        }
 -        if (HOST_BIG_ENDIAN &&
 -            r->state == ARM_CP_STATE_BOTH && r2->fieldoffset) {
 -            r2->fieldoffset += sizeof(uint32_t);
 +            if (HOST_BIG_ENDIAN &&
 +                r->state == ARM_CP_STATE_BOTH && r2->fieldoffset) {
 +                r2->fieldoffset += sizeof(uint32_t);
 +            }
          }
      }
-@@ -XXX,XX +XXX,XX @@ static void add_cpreg_to_hashtable(ARMCPU *cpu, const ARMCPRegInfo *r,
+-    if (board->dc2 & (1 << 12)) {
-      * multiple times. Special registers (ie NOP/WFI) are
++    if (DEV_CAP(2, I2C(0))) {
-      * never migratable and not even raw-accessible.
+         dev = sysbus_create_simple(TYPE_STELLARIS_I2C, 0x40020000,
-      */
+                                    qdev_get_gpio_in(nvic, 8));
--    if (r->type & ARM_CP_SPECIAL_MASK) {
+         i2c = (I2CBus *)qdev_get_child_bus(dev, "i2c");
-+    if (r2->type & ARM_CP_SPECIAL_MASK) {
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
          r2->type |= ARM_CP_NO_RAW;
      }
-     if (((r->crm == CP_ANY) && crm != 0) ||
      for (i = 0; i < NUM_UART; i++) {
 -        if (board->dc2 & (1 << i)) {
 +        if (DEV_CAP(2, UART(i))) {
              SysBusDevice *sbd;
              dev = qdev_new("pl011_luminary");
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
              sysbus_connect_irq(sbd, 0, qdev_get_gpio_in(nvic, uart_irq[i]));
          }
      }
 -    if (board->dc2 & (1 << 4)) {
 +    if (DEV_CAP(2, SSI)) {
          dev = sysbus_create_simple("pl022", 0x40008000,
                                     qdev_get_gpio_in(nvic, 7));
          if (board->peripherals & BP_OLED_SSI) {
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
              qemu_irq_raise(gpio_out[GPIO_D][0]);
          }
      }
 -    if (board->dc4 & (1 << 28)) {
 +    if (DEV_CAP(4, EMAC)) {
          DeviceState *enet;
          enet = qdev_new("stellaris_enet");
 --
-.25.1
+.34.1
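
For readers unfamiliar with the single-bit extract pattern used by DEV_CAP()
in the stellaris patch above, here is a minimal standalone sketch. It is not
QEMU code: extract32() is re-implemented locally as a stand-in for the helper
in "qemu/bitops.h", and demo_board_info, the demo_has_*() helpers and the
register values used with them are hypothetical. Only the DCx_* bit positions
and the shape of the macro are taken from the patch.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Local stand-in for QEMU's extract32() (assumption: same semantics,
 * i.e. return 'length' bits of 'value' starting at bit 'start').
 */
static inline uint32_t extract32(uint32_t value, int start, int length)
{
    assert(start >= 0 && length > 0 && length <= 32 - start);
    return (value >> start) & (~0U >> (32 - length));
}

/* Hypothetical, trimmed-down board description: only DC1 and DC2. */
typedef struct {
    uint32_t dc1;
    uint32_t dc2;
} demo_board_info;

/* Bit positions copied from the patch. */
#define DC1_WDT          3
#define DC1_ADC          16
#define DC2_I2C(n)      (12 + 2 * n)
#define DC2_GPTM(n)     (16 + n)

/* Same shape as the patch's macro; needs a pointer named "board" in scope. */
#define DEV_CAP(_dc, _cap) extract32(board->dc##_dc, DC##_dc##_##_cap, 1)

/* Hypothetical wrappers so the macro can be exercised outside stellaris_init(). */
static int demo_has_wdt(const demo_board_info *board)
{
    return DEV_CAP(1, WDT);
}

static int demo_has_adc(const demo_board_info *board)
{
    return DEV_CAP(1, ADC);
}

static int demo_has_i2c(const demo_board_info *board, int i)
{
    return DEV_CAP(2, I2C(i));
}

static int demo_has_gptm(const demo_board_info *board, int i)
{
    return DEV_CAP(2, GPTM(i));
}
```

Each helper should agree with the open-coded check it replaces, for example
demo_has_adc() with (dc1 & (1 << 16)) != 0 and demo_has_gptm(b, i) with
(dc2 & (0x10000 << i)) != 0. The practical gain is that the bit positions
are named once instead of being repeated as magic numbers at each call site.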

-[PULL 26/32] hw/arm: add versioning to sbsa-ref machine DT
+[PULL 12/36] hw/arm/stellaris: Map both I2C controllers
-From: Leif Lindholm <quic_llindhol@quicinc.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-The sbsa-ref machine is continuously evolving. Some of the changes we
+There are 2 I2C controllers, map them both, removing
-want to make in the near future, to align with real components (e.g.
+the unimplemented one. Keep the OLED controller on the
-the GIC-700), will break compatibility for existing firmware.
+first I2C bus.
-Introduce two new properties to the DT generated on machine generation:
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 - machine-version-major
   To be incremented when a platform change makes the machine
   incompatible with existing firmware.
 - machine-version-minor
   To be incremented when functionality is added to the machine
   without causing incompatibility with existing firmware.
   To be reset to 0 when machine-version-major is incremented.
 This versioning scheme is *neither*:
 - A QEMU versioned machine type; a given version of QEMU will emulate
   a given version of the platform.
 - A reflection of level of SBSA (now SystemReady SR) support provided.
 The version will increment on guest-visible functional changes only,
 akin to a revision ID register found on a physical platform.
 These properties are both introduced with the value 0.
 (Hence, a machine where the DT is lacking these nodes is equivalent
 to version 0.0.)
 Signed-off-by: Leif Lindholm <quic_llindhol@quicinc.com>
 Message-id: 20220505113947.75714-1-quic_llindhol@quicinc.com
 Cc: Peter Maydell <peter.maydell@linaro.org>
 Cc: Radoslaw Biernacki <rad@semihalf.com>
 Cc: Cédric Le Goater <clg@kaod.org>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Message-id: 20250110160204.74997-7-philmd@linaro.org
+[PMM: tweak to appease maybe-use-uninitialized warning]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/sbsa-ref.c | 14 ++++++++++++++
+ hw/arm/stellaris.c | 21 +++++++++++++--------
- 1 file changed, 14 insertions(+)
+ 1 file changed, 13 insertions(+), 8 deletions(-)
-diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/sbsa-ref.c
+--- a/hw/arm/stellaris.c
-+++ b/hw/arm/sbsa-ref.c
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@ static void create_fdt(SBSAMachineState *sms)
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     qemu_fdt_setprop_cell(fdt, "/", "#address-cells", 0x2);
+       { 0x40004000, 0x40005000, 0x40006000, 0x40007000,
-     qemu_fdt_setprop_cell(fdt, "/", "#size-cells", 0x2);
+         0x40024000, 0x40025000, 0x40026000};
+     static const int gpio_irq[NUM_GPIO] = {0, 1, 2, 3, 4, 30, 31};
-+    /*
++    static const uint32_t i2c_addr[NUM_I2C] = {0x40020000, 0x40021000};
-+     * This versioning scheme is for informing platform fw only. It is neither:
++    static const int i2c_irq[NUM_I2C] = {8, 37};
-+     * - A QEMU versioned machine type; a given version of QEMU will emulate
-+     *   a given version of the platform.
+     /* Memory map of SoC devices, from
-+     * - A reflection of level of SBSA (now SystemReady SR) support provided.
+      * Stellaris LM3S6965 Microcontroller Data Sheet (rev I)
-+     *
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-+     * machine-version-major: updated when changes breaking fw compatibility
+     qemu_irq adc;
-+     *                        are introduced.
+     int sram_size;
-+     * machine-version-minor: updated when features are added that don't break
+     int flash_size;
-+     *                        fw compatibility.
+-    I2CBus *i2c;
-+     */
++    DeviceState *i2c_dev[NUM_I2C] = { };
-+    qemu_fdt_setprop_cell(fdt, "/", "machine-version-major", 0);
+     DeviceState *dev;
-+    qemu_fdt_setprop_cell(fdt, "/", "machine-version-minor", 0);
+     DeviceState *ssys_dev;
      int i;
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
          }
      }
 -    if (DEV_CAP(2, I2C(0))) {
 -        dev = sysbus_create_simple(TYPE_STELLARIS_I2C, 0x40020000,
 -                                   qdev_get_gpio_in(nvic, 8));
 -        i2c = (I2CBus *)qdev_get_child_bus(dev, "i2c");
 -        if (board->peripherals & BP_OLED_I2C) {
 -            i2c_slave_create_simple(i2c, "ssd0303", 0x3d);
 +    for (i = 0; i < NUM_I2C; i++) {
 +        if (DEV_CAP(2, I2C(i))) {
 +            i2c_dev[i] = sysbus_create_simple(TYPE_STELLARIS_I2C, i2c_addr[i],
 +                                              qdev_get_gpio_in(nvic,
 +                                                               i2c_irq[i]));
          }
      }
 +    if (board->peripherals & BP_OLED_I2C) {
 +        I2CBus *bus = (I2CBus *)qdev_get_child_bus(i2c_dev[0], "i2c");
 +
-     if (ms->numa_state->have_numa_distance) {
++        i2c_slave_create_simple(bus, "ssd0303", 0x3d);
-         int size = nb_numa_nodes * nb_numa_nodes * 3 * sizeof(uint32_t);
++    }
-         uint32_t *matrix = g_malloc0(size);
      for (i = 0; i < NUM_UART; i++) {
          if (DEV_CAP(2, UART(i))) {
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
      /* Add dummy regions for the devices we don't implement yet,
       * so guest accesses don't cause unlogged crashes.
       */
 -    create_unimplemented_device("i2c-2", 0x40021000, 0x1000);
      create_unimplemented_device("PWM", 0x40028000, 0x1000);
      create_unimplemented_device("QEI-0", 0x4002c000, 0x1000);
      create_unimplemented_device("QEI-1", 0x4002d000, 0x1000);
 --
-.25.1
+.34.1
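
The table-driven mapping introduced by the I2C patch above can be sketched in
isolation as follows. This is only an illustration under stated assumptions:
demo_i2c, demo_dc2_has_i2c() and demo_map_i2c() are hypothetical names, and
the sketch records base address and IRQ in a plain struct rather than
creating sysbus devices. The addresses, the IRQ numbers and the DC2 bit
positions (12 + 2 * n) are the ones that appear in the patches.

```c
#include <stdint.h>

#define NUM_I2C 2

/* Values taken from the patch. */
static const uint32_t i2c_addr[NUM_I2C] = {0x40020000, 0x40021000};
static const int i2c_irq[NUM_I2C] = {8, 37};

/* Hypothetical record standing in for a created I2C controller. */
typedef struct {
    uint32_t base;  /* 0 means "not mapped" in this sketch */
    int irq;        /* -1 means "not mapped" in this sketch */
} demo_i2c;

/* DC2 capability bit for I2C controller n (bit 12 + 2 * n). */
static int demo_dc2_has_i2c(uint32_t dc2, int n)
{
    return (dc2 >> (12 + 2 * n)) & 1;
}

/*
 * Fill dev[] for every controller advertised in dc2 and return how many
 * were mapped. Controllers that are not advertised keep the "not mapped"
 * markers, much as the patch leaves their i2c_dev[] slot NULL.
 */
static int demo_map_i2c(uint32_t dc2, demo_i2c dev[NUM_I2C])
{
    int mapped = 0;

    for (int i = 0; i < NUM_I2C; i++) {
        dev[i].base = 0;
        dev[i].irq = -1;
        if (demo_dc2_has_i2c(dc2, i)) {
            dev[i].base = i2c_addr[i];
            dev[i].irq = i2c_irq[i];
            mapped++;
        }
    }
    return mapped;
}
```

Keeping the per-controller data in parallel arrays indexed by controller
number means a further controller would only need another table entry, not
another copy of the creation code.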

-[PULL 01/32] MAINTAINERS/.mailmap: update email for Leif Lindholm
+[PULL 13/36] tests/functional: Add a test for the arm microbit machine
-From: Leif Lindholm <quic_llindhol@quicinc.com>
+From: Thomas Huth <thuth@redhat.com>
-NUVIA was acquired by Qualcomm in March 2021, but kept functioning on
+We don't have any functional tests for this machine yet, thus let's
-separate infrastructure for a transitional period. We've now switched
+add a test with a MicroPython binary that is available online
-over to contributing as Qualcomm Innovation Center (quicinc), so update
+(thanks to Joel Stanley for providing it, see:
-my email address to reflect this.
+ https://www.mail-archive.com/qemu-devel@nongnu.org/msg606064.html ).
-Signed-off-by: Leif Lindholm <quic_llindhol@quicinc.com>
+Signed-off-by: Thomas Huth <thuth@redhat.com>
-Message-id: 20220505113740.75565-1-quic_llindhol@quicinc.com
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
-Cc: Leif Lindholm <leif@nuviainc.com>
+Message-id: 20250124101709.1591761-1-thuth@redhat.com
 Cc: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 [Fixed commit message typo]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- .mailmap    | 3 ++-
+ MAINTAINERS                           |  1 +
- MAINTAINERS | 2 +-
+ tests/functional/meson.build          |  1 +
- 2 files changed, 3 insertions(+), 2 deletions(-)
+ tests/functional/test_arm_microbit.py | 31 +++++++++++++++++++++++++++
  3 files changed, 33 insertions(+)
  create mode 100755 tests/functional/test_arm_microbit.py
-diff --git a/.mailmap b/.mailmap
-index XXXXXXX..XXXXXXX 100644
---- a/.mailmap
-+++ b/.mailmap
-@@ -XXX,XX +XXX,XX @@ Greg Kurz <groug@kaod.org> <gkurz@linux.vnet.ibm.com>
- Huacai Chen <chenhuacai@kernel.org> <chenhc@lemote.com>
- Huacai Chen <chenhuacai@kernel.org> <chenhuacai@loongson.cn>
- James Hogan <jhogan@kernel.org> <james.hogan@imgtec.com>
--Leif Lindholm <leif@nuviainc.com> <leif.lindholm@linaro.org>
-+Leif Lindholm <quic_llindhol@quicinc.com> <leif.lindholm@linaro.org>
-+Leif Lindholm <quic_llindhol@quicinc.com> <leif@nuviainc.com>
- Radoslaw Biernacki <rad@semihalf.com> <radoslaw.biernacki@linaro.org>
- Paul Burton <paulburton@kernel.org> <paul.burton@mips.com>
- Paul Burton <paulburton@kernel.org> <paul.burton@imgtec.com>
 diff --git a/MAINTAINERS b/MAINTAINERS
 index XXXXXXX..XXXXXXX 100644
 --- a/MAINTAINERS
 +++ b/MAINTAINERS
-@@ -XXX,XX +XXX,XX @@ F: include/hw/ssi/imx_spi.h
+@@ -XXX,XX +XXX,XX @@ F: hw/*/microbit*.c
- SBSA-REF
+ F: include/hw/*/nrf51*.h
- M: Radoslaw Biernacki <rad@semihalf.com>
+ F: include/hw/*/microbit*.h
- M: Peter Maydell <peter.maydell@linaro.org>
+ F: tests/qtest/microbit-test.c
--R: Leif Lindholm <leif@nuviainc.com>
++F: tests/functional/test_arm_microbit.py
-+R: Leif Lindholm <quic_llindhol@quicinc.com>
+ F: docs/system/arm/nrf.rst
- L: qemu-arm@nongnu.org
- S: Maintained
+ ARM PL011 Rust device
- F: hw/arm/sbsa-ref.c
+diff --git a/tests/functional/meson.build b/tests/functional/meson.build
 index XXXXXXX..XXXXXXX 100644
 --- a/tests/functional/meson.build
 +++ b/tests/functional/meson.build
@@ -XXX,XX +XXX,XX @@ tests_arm_system_thorough = [
    'arm_cubieboard',
    'arm_emcraft_sf2',
    'arm_integratorcp',
 +  'arm_microbit',
    'arm_orangepi',
    'arm_quanta_gsj',
    'arm_raspi2',
 diff --git a/tests/functional/test_arm_microbit.py b/tests/functional/test_arm_microbit.py
 new file mode 100755
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/tests/functional/test_arm_microbit.py
@@ -XXX,XX +XXX,XX @@
 +#!/usr/bin/env python3
 +#
 +# SPDX-License-Identifier: GPL-2.0-or-later
 +#
 +# Copyright 2025, The QEMU Project Developers.
 +#
 +# A functional test that runs MicroPython on the arm microbit machine.
 +
 +from qemu_test import QemuSystemTest, Asset, exec_command_and_wait_for_pattern
 +from qemu_test import wait_for_console_pattern
 +
 +
 +class MicrobitMachine(QemuSystemTest):
 +
 +    ASSET_MICRO = Asset('https://ozlabs.org/~joel/microbit-micropython.hex',
 +        '021641f93dfb11767d4978dbb3ca7f475d1b13c69e7f4aec3382f212636bffd6')
 +
 +    def test_arm_microbit(self):
 +        self.set_machine('microbit')
 +
 +        micropython = self.ASSET_MICRO.fetch()
 +        self.vm.set_console()
 +        self.vm.add_args('-device', f'loader,file={micropython}')
 +        self.vm.launch()
 +        wait_for_console_pattern(self, 'Type "help()" for more information.')
 +        exec_command_and_wait_for_pattern(self, 'import machine as mch', '>>>')
 +        exec_command_and_wait_for_pattern(self, 'mch.reset()', 'MicroPython')
 +        wait_for_console_pattern(self, '>>>')
 +
 +if __name__ == '__main__':
 +    QemuSystemTest.main()
 --
-.25.1
+.34.1

-[PULL 05/32] target/arm: Adjust definition of CONTEXTIDR_EL2
+[PULL 14/36] target/arm: arm_reset_sve_state() should set FPSR, not FPCR
-From: Richard Henderson <richard.henderson@linaro.org>
+The pseudocode ResetSVEState() does:
     FPSR = ZeroExtend(0x0800009f<31:0>, 64);
 but QEMU's arm_reset_sve_state() called vfp_set_fpcr() by accident.
-This register is present for either VHE or Debugv8p2.
+Before the advent of FEAT_AFP, this was only setting a collection of
 RES0 bits, which vfp_set_fpsr() would then ignore, so the only effect
 was that we didn't actually set the FPSR the way we are supposed to
 do.  Once FEAT_AFP is implemented, setting the bottom bits of FPSR
 will change the floating point behaviour.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Call vfp_set_fpsr(), as we ought to.
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-5-richard.henderson@linaro.org
+(Note for stable backports: commit 7f2a01e7368f9 moved this function
 from sme_helper.c to helper.c, but it had the same bug before the
 move too.)
 Cc: qemu-stable@nongnu.org
 Fixes: f84734b87461 ("target/arm: Implement SMSTART, SMSTOP")
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-4-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 15 +++++++++++----
+ target/arm/helper.c | 2 +-
-file changed, 11 insertions(+), 4 deletions(-)
+file changed, 1 insertion(+), 1 deletion(-)
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo jazelle_regs[] = {
+@@ -XXX,XX +XXX,XX @@ static void arm_reset_sve_state(CPUARMState *env)
-       .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+     memset(env->vfp.zregs, 0, sizeof(env->vfp.zregs));
- };
+     /* Recall that FFR is stored as pregs[16]. */
+     memset(env->vfp.pregs, 0, sizeof(env->vfp.pregs));
-+static const ARMCPRegInfo contextidr_el2 = {
+-    vfp_set_fpcr(env, 0x0800009f);
-+    .name = "CONTEXTIDR_EL2", .state = ARM_CP_STATE_AA64,
++    vfp_set_fpsr(env, 0x0800009f);
-+    .opc0 = 3, .opc1 = 4, .crn = 13, .crm = 0, .opc2 = 1,
+ }
-+    .access = PL2_RW,
-+    .fieldoffset = offsetof(CPUARMState, cp15.contextidr_el[2])
+ void aarch64_set_svcr(CPUARMState *env, uint64_t new, uint64_t mask)
 +};
 +
  static const ARMCPRegInfo vhe_reginfo[] = {
 -    { .name = "CONTEXTIDR_EL2", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 4, .crn = 13, .crm = 0, .opc2 = 1,
 -      .access = PL2_RW,
 -      .fieldoffset = offsetof(CPUARMState, cp15.contextidr_el[2]) },
      { .name = "TTBR1_EL2", .state = ARM_CP_STATE_AA64,
        .opc0 = 3, .opc1 = 4, .crn = 2, .crm = 0, .opc2 = 1,
        .access = PL2_RW, .writefn = vmsa_tcr_ttbr_el2_write,
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
          define_one_arm_cp_reg(cpu, &ssbs_reginfo);
      }
 +    if (cpu_isar_feature(aa64_vh, cpu) ||
 +        cpu_isar_feature(aa64_debugv8p2, cpu)) {
 +        define_one_arm_cp_reg(cpu, &contextidr_el2);
 +    }
      if (arm_feature(env, ARM_FEATURE_EL2) && cpu_isar_feature(aa64_vh, cpu)) {
          define_arm_cp_regs(cpu, vhe_reginfo);
      }
 --
-.25.1
+.34.1
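Note on the FPSR reset value: the sketch below is a standalone illustration, not QEMU code. `ToyVfp` and `toy_reset_sve_state()` are hypothetical stand-ins for the CPU state and arm_reset_sve_state(). It assumes the architectural FPSR bit layout, under which 0x0800009f is QC plus the six cumulative exception flags, and shows the value landing in the status register while the control register is left alone.

```c
#include <assert.h>
#include <stdint.h>

/* Architectural FPSR bits that make up the reset value 0x0800009f. */
#define FPSR_IOC (1u << 0)   /* invalid operation */
#define FPSR_DZC (1u << 1)   /* divide by zero */
#define FPSR_OFC (1u << 2)   /* overflow */
#define FPSR_UFC (1u << 3)   /* underflow */
#define FPSR_IXC (1u << 4)   /* inexact */
#define FPSR_IDC (1u << 7)   /* input denormal */
#define FPSR_QC  (1u << 27)  /* cumulative saturation */

#define SVE_RESET_FPSR \
    (FPSR_QC | FPSR_IDC | FPSR_IXC | FPSR_UFC | FPSR_OFC | FPSR_DZC | FPSR_IOC)

/* Compile-time check that the named bits add up to the pseudocode value. */
_Static_assert(SVE_RESET_FPSR == 0x0800009fu, "reset value mismatch");

/* Hypothetical toy state: separate control and status registers. */
typedef struct ToyVfp {
    uint32_t fpcr;
    uint32_t fpsr;
} ToyVfp;

/* Mirrors the fixed behaviour: write the status register, not FPCR. */
static void toy_reset_sve_state(ToyVfp *v)
{
    v->fpsr = SVE_RESET_FPSR;
}
```

With the pre-fix behaviour the same constant would have been written to `fpcr` instead, leaving `fpsr` unchanged.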

-[PULL 13/32] target/arm: Enable FEAT_Debugv8p4 for -cpu max
+[PULL 15/36] target/arm: Use FPSR_ constants in vfp_exceptbits_from_host()
-From: Richard Henderson <richard.henderson@linaro.org>
+Use the FPSR_ named constants in vfp_exceptbits_from_host(),
 rather than hardcoded magic numbers.
-This extension concerns changes to the External Debug interface,
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-with Secure and Non-secure access to the debug registers, and all
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-of it is outside the scope of QEMU.  Indicating support for this
+Message-id: 20250124162836.2332150-5-peter.maydell@linaro.org
-is mandatory with FEAT_SEL2, which we do implement.
+---
  target/arm/vfp_helper.c | 12 ++++++------
 file changed, 6 insertions(+), 6 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220506180242.216785-13-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  docs/system/arm/emulation.rst | 1 +
  target/arm/cpu64.c            | 2 +-
  target/arm/cpu_tcg.c          | 4 ++--
 files changed, 4 insertions(+), 3 deletions(-)
 diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/vfp_helper.c
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/vfp_helper.c
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ static inline int vfp_exceptbits_from_host(int host_bits)
- - FEAT_DIT (Data Independent Timing instructions)
+     int target_bits = 0;
- - FEAT_DPB (DC CVAP instruction)
- - FEAT_Debugv8p2 (Debug changes for v8.2)
+     if (host_bits & float_flag_invalid) {
-+- FEAT_Debugv8p4 (Debug changes for v8.4)
+-        target_bits |= 1;
- - FEAT_DotProd (Advanced SIMD dot product instructions)
++        target_bits |= FPSR_IOC;
- - FEAT_FCMA (Floating-point complex number instructions)
+     }
- - FEAT_FHM (Floating-point half-precision multiplication instructions)
+     if (host_bits & float_flag_divbyzero) {
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+-        target_bits |= 2;
-index XXXXXXX..XXXXXXX 100644
++        target_bits |= FPSR_DZC;
---- a/target/arm/cpu64.c
+     }
-+++ b/target/arm/cpu64.c
+     if (host_bits & float_flag_overflow) {
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+-        target_bits |= 4;
-     cpu->isar.id_aa64zfr0 = t;
++        target_bits |= FPSR_OFC;
+     }
-     t = cpu->isar.id_aa64dfr0;
+     if (host_bits & (float_flag_underflow | float_flag_output_denormal)) {
--    t = FIELD_DP64(t, ID_AA64DFR0, DEBUGVER, 8);  /* FEAT_Debugv8p2 */
+-        target_bits |= 8;
-+    t = FIELD_DP64(t, ID_AA64DFR0, DEBUGVER, 9);  /* FEAT_Debugv8p4 */
++        target_bits |= FPSR_UFC;
-     t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5);    /* FEAT_PMUv3p4 */
+     }
-     cpu->isar.id_aa64dfr0 = t;
+     if (host_bits & float_flag_inexact) {
+-        target_bits |= 0x10;
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
++        target_bits |= FPSR_IXC;
-index XXXXXXX..XXXXXXX 100644
+     }
---- a/target/arm/cpu_tcg.c
+     if (host_bits & float_flag_input_denormal) {
-+++ b/target/arm/cpu_tcg.c
+-        target_bits |= 0x80;
-@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
++        target_bits |= FPSR_IDC;
-     cpu->isar.id_pfr2 = t;
+     }
+     return target_bits;
      t = cpu->isar.id_dfr0;
 -    t = FIELD_DP32(t, ID_DFR0, COPDBG, 8);        /* FEAT_Debugv8p2 */
 -    t = FIELD_DP32(t, ID_DFR0, COPSDBG, 8);       /* FEAT_Debugv8p2 */
 +    t = FIELD_DP32(t, ID_DFR0, COPDBG, 9);        /* FEAT_Debugv8p4 */
 +    t = FIELD_DP32(t, ID_DFR0, COPSDBG, 9);       /* FEAT_Debugv8p4 */
      t = FIELD_DP32(t, ID_DFR0, PERFMON, 5);       /* FEAT_PMUv3p4 */
      cpu->isar.id_dfr0 = t;
  }
 --
-.25.1
+.34.1
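Note on the named constants: a minimal standalone sketch of the same mapping, assuming the FPSR_ values implied by the magic numbers they replace in the diff (1, 2, 4, 8, 0x10, 0x80). The `TOY_FLAG_*` host values and `toy_exceptbits_from_host()` are hypothetical and chosen only for illustration; the real softfloat float_flag_* values may differ.

```c
#include <assert.h>

/* Target-side FPSR cumulative exception bits. */
#define FPSR_IOC 0x01
#define FPSR_DZC 0x02
#define FPSR_OFC 0x04
#define FPSR_UFC 0x08
#define FPSR_IXC 0x10
#define FPSR_IDC 0x80

/* Placeholder host flag values, deliberately laid out differently. */
enum {
    TOY_FLAG_INVALID         = 0x01,
    TOY_FLAG_DIVBYZERO       = 0x04,
    TOY_FLAG_OVERFLOW        = 0x08,
    TOY_FLAG_UNDERFLOW       = 0x10,
    TOY_FLAG_INEXACT         = 0x20,
    TOY_FLAG_INPUT_DENORMAL  = 0x40,
    TOY_FLAG_OUTPUT_DENORMAL = 0x80,
};

/* Translate host exception flags into FPSR form. */
static int toy_exceptbits_from_host(int host_bits)
{
    int target_bits = 0;

    if (host_bits & TOY_FLAG_INVALID) {
        target_bits |= FPSR_IOC;
    }
    if (host_bits & TOY_FLAG_DIVBYZERO) {
        target_bits |= FPSR_DZC;
    }
    if (host_bits & TOY_FLAG_OVERFLOW) {
        target_bits |= FPSR_OFC;
    }
    /* Underflow and output-denormal both report as UFC. */
    if (host_bits & (TOY_FLAG_UNDERFLOW | TOY_FLAG_OUTPUT_DENORMAL)) {
        target_bits |= FPSR_UFC;
    }
    if (host_bits & TOY_FLAG_INEXACT) {
        target_bits |= FPSR_IXC;
    }
    if (host_bits & TOY_FLAG_INPUT_DENORMAL) {
        target_bits |= FPSR_IDC;
    }
    return target_bits;
}
```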

-New patch
+[PULL 16/36] target/arm: Use uint32_t in vfp_exceptbits_from_host()
+In vfp_exceptbits_from_host(), we accumulate the FPSR flags in
+an "int", and our return type is also "int". However, the only
+callsite returns the same information as a uint32_t, and
+more generally we handle FPSR values in the code as uint32_t,
+not int. Bring this function into line with that convention.
+There is no behaviour change because none of the FPSR bits
+we set in this function are bit 31. The input argument to
+the function remains 'int' because that is the return type
+of the softfloat get_float_exception_flags().
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-6-peter.maydell@linaro.org
+---
+ target/arm/vfp_helper.c | 4 ++--
+file changed, 2 insertions(+), 2 deletions(-)
+diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/vfp_helper.c
++++ b/target/arm/vfp_helper.c
+@@ -XXX,XX +XXX,XX @@
+ #ifdef CONFIG_TCG
+ /* Convert host exception flags to vfp form.  */
+-static inline int vfp_exceptbits_from_host(int host_bits)
++static inline uint32_t vfp_exceptbits_from_host(int host_bits)
+ {
+-    int target_bits = 0;
++    uint32_t target_bits = 0;
+     if (host_bits & float_flag_invalid) {
+         target_bits |= FPSR_IOC;
+--
+.34.1
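Note on why bit 31 is the interesting case: the sketch below, using hypothetical helper names, shows how a flags word widens differently from a signed and an unsigned 32-bit type once bit 31 is set. The flags set in vfp_exceptbits_from_host() all sit in bits 0 to 7, so both forms agree there, which is consistent with the "no behaviour change" statement above.

```c
#include <assert.h>
#include <stdint.h>

/* Widening from a signed type sign-extends, so bit 31 smears upward. */
static uint64_t toy_widen_signed(int32_t flags)
{
    return (uint64_t)flags;
}

/* Widening from an unsigned type zero-extends. */
static uint64_t toy_widen_unsigned(uint32_t flags)
{
    return flags;
}
```

For a value such as 0x9f the two helpers return the same result; they diverge only when bit 31 is set.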

-[PULL 12/32] target/arm: Enable FEAT_Debugv8p2 for -cpu max
+[PULL 17/36] target/arm: Define new fp_status_a32 and fp_status_a64
-From: Richard Henderson <richard.henderson@linaro.org>
+We want to split the existing fp_status in the Arm CPUState into
 separate float_status fields for AArch32 and AArch64.  (This is
 because new control bits defined by FEAT_AFP only have an effect for
 AArch64, not AArch32.) To make this split we will:
  * define new fp_status_a32 and fp_status_a64 which have
    identical behaviour to the existing fp_status
  * move existing uses of fp_status to fp_status_a32 or
    fp_status_a64 as appropriate
  * delete the old fp_status when it has no uses left
-The only portion of FEAT_Debugv8p2 that is relevant to QEMU
+In this patch we add the new float_status fields.
 is CONTEXTIDR_EL2, which is also conditionally implemented
 with FEAT_VHE.  The rest of the debug extension concerns the
 External debug interface, which is outside the scope of QEMU.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+We will also need to split fp_status_f16, but we will do that
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+as a separate series of patches.
-Message-id: 20220506180242.216785-12-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-7-peter.maydell@linaro.org
 ---
- docs/system/arm/emulation.rst | 1 +
+ target/arm/cpu.h           |  4 ++++
- target/arm/cpu.c              | 1 +
+ target/arm/tcg/translate.h | 12 ++++++++++++
- target/arm/cpu64.c            | 1 +
+ target/arm/cpu.c           |  2 ++
- target/arm/cpu_tcg.c          | 2 ++
+ target/arm/vfp_helper.c    | 12 ++++++++++++
-files changed, 5 insertions(+)
+files changed, 30 insertions(+)
-diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
+diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/cpu.h
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
- - FEAT_BTI (Branch Target Identification)
+         /* There are a number of distinct float control structures:
- - FEAT_DIT (Data Independent Timing instructions)
+          *
- - FEAT_DPB (DC CVAP instruction)
+          *  fp_status: is the "normal" fp status.
-+- FEAT_Debugv8p2 (Debug changes for v8.2)
++         *  fp_status_a32: is the "normal" fp status for AArch32 insns
- - FEAT_DotProd (Advanced SIMD dot product instructions)
++         *  fp_status_a64: is the "normal" fp status for AArch64 insns
- - FEAT_FCMA (Floating-point complex number instructions)
+          *  fp_status_fp16: used for half-precision calculations
- - FEAT_FHM (Floating-point half-precision multiplication instructions)
+          *  standard_fp_status : the ARM "Standard FPSCR Value"
           *  standard_fp_status_fp16 : used for half-precision
@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
           * an explicit FPSCR read.
           */
          float_status fp_status;
 +        float_status fp_status_a32;
 +        float_status fp_status_a64;
          float_status fp_status_f16;
          float_status standard_fp_status;
          float_status standard_fp_status_f16;
 diff --git a/target/arm/tcg/translate.h b/target/arm/tcg/translate.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/tcg/translate.h
 +++ b/target/arm/tcg/translate.h
@@ -XXX,XX +XXX,XX @@ static inline CPUARMTBFlags arm_tbflags_from_tb(const TranslationBlock *tb)
   */
  typedef enum ARMFPStatusFlavour {
      FPST_FPCR,
 +    FPST_A32,
 +    FPST_A64,
      FPST_FPCR_F16,
      FPST_STD,
      FPST_STD_F16,
@@ -XXX,XX +XXX,XX @@ typedef enum ARMFPStatusFlavour {
   *
   * FPST_FPCR
   *   for non-FP16 operations controlled by the FPCR
 + * FPST_A32
 + *   for AArch32 non-FP16 operations controlled by the FPCR
 + * FPST_A64
 + *   for AArch64 non-FP16 operations controlled by the FPCR
   * FPST_FPCR_F16
   *   for operations controlled by the FPCR where FPCR.FZ16 is to be used
   * FPST_STD
@@ -XXX,XX +XXX,XX @@ static inline TCGv_ptr fpstatus_ptr(ARMFPStatusFlavour flavour)
      case FPST_FPCR:
          offset = offsetof(CPUARMState, vfp.fp_status);
          break;
 +    case FPST_A32:
 +        offset = offsetof(CPUARMState, vfp.fp_status_a32);
 +        break;
 +    case FPST_A64:
 +        offset = offsetof(CPUARMState, vfp.fp_status_a64);
 +        break;
      case FPST_FPCR_F16:
          offset = offsetof(CPUARMState, vfp.fp_status_f16);
          break;
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset_hold(Object *obj, ResetType type)
-          * feature registers as well.
+     set_default_nan_mode(1, &env->vfp.standard_fp_status);
-          */
+     set_default_nan_mode(1, &env->vfp.standard_fp_status_f16);
-         cpu->isar.id_pfr1 = FIELD_DP32(cpu->isar.id_pfr1, ID_PFR1, SECURITY, 0);
+     arm_set_default_fp_behaviours(&env->vfp.fp_status);
-+        cpu->isar.id_dfr0 = FIELD_DP32(cpu->isar.id_dfr0, ID_DFR0, COPSDBG, 0);
++    arm_set_default_fp_behaviours(&env->vfp.fp_status_a32);
-         cpu->isar.id_aa64pfr0 = FIELD_DP64(cpu->isar.id_aa64pfr0,
++    arm_set_default_fp_behaviours(&env->vfp.fp_status_a64);
-                                            ID_AA64PFR0, EL3, 0);
+     arm_set_default_fp_behaviours(&env->vfp.standard_fp_status);
      arm_set_default_fp_behaviours(&env->vfp.fp_status_f16);
      arm_set_default_fp_behaviours(&env->vfp.standard_fp_status_f16);
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/vfp_helper.c
 +++ b/target/arm/vfp_helper.c
@@ -XXX,XX +XXX,XX @@ static uint32_t vfp_get_fpsr_from_host(CPUARMState *env)
      uint32_t i;
      i = get_float_exception_flags(&env->vfp.fp_status);
 +    i |= get_float_exception_flags(&env->vfp.fp_status_a32);
 +    i |= get_float_exception_flags(&env->vfp.fp_status_a64);
      i |= get_float_exception_flags(&env->vfp.standard_fp_status);
      /* FZ16 does not generate an input denormal exception.  */
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16)
@@ -XXX,XX +XXX,XX @@ static void vfp_clear_float_status_exc_flags(CPUARMState *env)
       * be the architecturally up-to-date exception flag information first.
       */
      set_float_exception_flags(0, &env->vfp.fp_status);
 +    set_float_exception_flags(0, &env->vfp.fp_status_a32);
 +    set_float_exception_flags(0, &env->vfp.fp_status_a64);
      set_float_exception_flags(0, &env->vfp.fp_status_f16);
      set_float_exception_flags(0, &env->vfp.standard_fp_status);
      set_float_exception_flags(0, &env->vfp.standard_fp_status_f16);
@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
              break;
          }
          set_float_rounding_mode(i, &env->vfp.fp_status);
 +        set_float_rounding_mode(i, &env->vfp.fp_status_a32);
 +        set_float_rounding_mode(i, &env->vfp.fp_status_a64);
          set_float_rounding_mode(i, &env->vfp.fp_status_f16);
      }
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+     if (changed & FPCR_FZ16) {
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
---- a/target/arm/cpu64.c
+         bool ftz_enabled = val & FPCR_FZ;
-+++ b/target/arm/cpu64.c
+         set_flush_to_zero(ftz_enabled, &env->vfp.fp_status);
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+         set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status);
-     cpu->isar.id_aa64zfr0 = t;
++        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_a32);
++        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_a32);
-     t = cpu->isar.id_aa64dfr0;
++        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_a64);
-+    t = FIELD_DP64(t, ID_AA64DFR0, DEBUGVER, 8);  /* FEAT_Debugv8p2 */
++        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_a64);
-     t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5);    /* FEAT_PMUv3p4 */
+     }
-     cpu->isar.id_aa64dfr0 = t;
+     if (changed & FPCR_DN) {
+         bool dnan_enabled = val & FPCR_DN;
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status);
-index XXXXXXX..XXXXXXX 100644
++        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a32);
---- a/target/arm/cpu_tcg.c
++        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a64);
-+++ b/target/arm/cpu_tcg.c
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16);
-@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
+     }
      cpu->isar.id_pfr2 = t;
      t = cpu->isar.id_dfr0;
 +    t = FIELD_DP32(t, ID_DFR0, COPDBG, 8);        /* FEAT_Debugv8p2 */
 +    t = FIELD_DP32(t, ID_DFR0, COPSDBG, 8);       /* FEAT_Debugv8p2 */
      t = FIELD_DP32(t, ID_DFR0, PERFMON, 5);       /* FEAT_PMUv3p4 */
      cpu->isar.id_dfr0 = t;
  }
 --
-.25.1
+.34.1
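Note on the transitional state: while both the old and the new fields exist, every FPCR-driven setting has to be mirrored into all of them, and exception flags have to be collected from all of them. The sketch below models that with hypothetical `ToyFloatStatus` and `ToyVfp` types; it is an illustration of the pattern in this patch, not the QEMU float_status API.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical, much reduced stand-in for a softfloat status block. */
typedef struct ToyFloatStatus {
    int rounding_mode;
    bool flush_to_zero;
    bool default_nan;
    int exception_flags;
} ToyFloatStatus;

typedef struct ToyVfp {
    ToyFloatStatus fp_status;     /* legacy field, deleted once unused */
    ToyFloatStatus fp_status_a32; /* AArch32 instructions */
    ToyFloatStatus fp_status_a64; /* AArch64 instructions */
} ToyVfp;

/* A control-register change must reach every copy. */
static void toy_set_rounding_mode(ToyVfp *v, int mode)
{
    v->fp_status.rounding_mode = mode;
    v->fp_status_a32.rounding_mode = mode;
    v->fp_status_a64.rounding_mode = mode;
}

/* Reading the status register ORs together the flags from every copy. */
static int toy_get_exception_flags(const ToyVfp *v)
{
    return v->fp_status.exception_flags
         | v->fp_status_a32.exception_flags
         | v->fp_status_a64.exception_flags;
}

/* Writing the status register clears the flags in every copy. */
static void toy_clear_exception_flags(ToyVfp *v)
{
    v->fp_status.exception_flags = 0;
    v->fp_status_a32.exception_flags = 0;
    v->fp_status_a64.exception_flags = 0;
}
```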

-New patch
+[PULL 18/36] target/arm: Use vfp.fp_status_a64 in A64-only helper functions
+Switch from vfp.fp_status to vfp.fp_status_a64 for helpers which:
+ * directly reference an fp_status field
+ * are called only from the A64 decoder
+ * are not called inside a set_rmode/restore_rmode sequence
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Message-id: 20250124162836.2332150-8-peter.maydell@linaro.org
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+---
+ target/arm/tcg/sme_helper.c | 2 +-
+ target/arm/tcg/vec_helper.c | 8 ++++----
+files changed, 5 insertions(+), 5 deletions(-)
+diff --git a/target/arm/tcg/sme_helper.c b/target/arm/tcg/sme_helper.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/tcg/sme_helper.c
++++ b/target/arm/tcg/sme_helper.c
+@@ -XXX,XX +XXX,XX @@ void HELPER(sme_fmopa_h)(void *vza, void *vzn, void *vzm, void *vpn,
+      * round-to-odd -- see above.
+      */
+     fpst_f16 = env->vfp.fp_status_f16;
+-    fpst_std = env->vfp.fp_status;
++    fpst_std = env->vfp.fp_status_a64;
+     set_default_nan_mode(true, &fpst_std);
+     set_default_nan_mode(true, &fpst_f16);
+     fpst_odd = fpst_std;
+diff --git a/target/arm/tcg/vec_helper.c b/target/arm/tcg/vec_helper.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/tcg/vec_helper.c
++++ b/target/arm/tcg/vec_helper.c
+@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_a32)(void *vd, void *vn, void *vm,
+ void HELPER(gvec_fmlal_a64)(void *vd, void *vn, void *vm,
+                             CPUARMState *env, uint32_t desc)
+ {
+-    do_fmlal(vd, vn, vm, &env->vfp.fp_status, desc,
++    do_fmlal(vd, vn, vm, &env->vfp.fp_status_a64, desc,
+              get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
+ }
+@@ -XXX,XX +XXX,XX @@ void HELPER(sve2_fmlal_zzzw_s)(void *vd, void *vn, void *vm, void *va,
+     intptr_t i, oprsz = simd_oprsz(desc);
+     uint16_t negn = extract32(desc, SIMD_DATA_SHIFT, 1) << 15;
+     intptr_t sel = extract32(desc, SIMD_DATA_SHIFT + 1, 1) * sizeof(float16);
+-    float_status *status = &env->vfp.fp_status;
++    float_status *status = &env->vfp.fp_status_a64;
+     bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16);
+     for (i = 0; i < oprsz; i += sizeof(float32)) {
+@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_idx_a32)(void *vd, void *vn, void *vm,
+ void HELPER(gvec_fmlal_idx_a64)(void *vd, void *vn, void *vm,
+                                 CPUARMState *env, uint32_t desc)
+ {
+-    do_fmlal_idx(vd, vn, vm, &env->vfp.fp_status, desc,
++    do_fmlal_idx(vd, vn, vm, &env->vfp.fp_status_a64, desc,
+                  get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
+ }
+@@ -XXX,XX +XXX,XX @@ void HELPER(sve2_fmlal_zzxw_s)(void *vd, void *vn, void *vm, void *va,
+     uint16_t negn = extract32(desc, SIMD_DATA_SHIFT, 1) << 15;
+     intptr_t sel = extract32(desc, SIMD_DATA_SHIFT + 1, 1) * sizeof(float16);
+     intptr_t idx = extract32(desc, SIMD_DATA_SHIFT + 2, 3) * sizeof(float16);
+-    float_status *status = &env->vfp.fp_status;
++    float_status *status = &env->vfp.fp_status_a64;
+     bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16);
+     for (i = 0; i < oprsz; i += 16) {
+--
+.34.1

-New patch
+[PULL 19/36] target/arm: Use fp_status_a64 or fp_status_a32 in is_ebf()
+In is_ebf(), we might be called for A64 or A32, but we have
+the CPUARMState* so we can select fp_status_a64 or
+fp_status_a32 accordingly.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+---
+ target/arm/tcg/vec_helper.c | 2 +-
+file changed, 1 insertion(+), 1 deletion(-)
+diff --git a/target/arm/tcg/vec_helper.c b/target/arm/tcg/vec_helper.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/tcg/vec_helper.c
++++ b/target/arm/tcg/vec_helper.c
+@@ -XXX,XX +XXX,XX @@ bool is_ebf(CPUARMState *env, float_status *statusp, float_status *oddstatusp)
+      */
+     bool ebf = is_a64(env) && env->vfp.fpcr & FPCR_EBF;
+-    *statusp = env->vfp.fp_status;
++    *statusp = is_a64(env) ? env->vfp.fp_status_a64 : env->vfp.fp_status_a32;
+     set_default_nan_mode(true, statusp);
+     if (ebf) {
+--
+.34.1

-[PULL 22/32] target/arm: Enable FEAT_CSV3 for -cpu max
+[PULL 20/36] target/arm: Use fp_status_a32 in vjcvt helper
-From: Richard Henderson <richard.henderson@linaro.org>
+Use fp_status_a32 in the vjcvt helper function; this is called only
 from the A32/T32 decoder and is not used inside a
 set_rmode/restore_rmode sequence.
-This extension concerns cache speculation, which TCG does
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-not implement.  Thus we can trivially enable this feature.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20250124162836.2332150-9-peter.maydell@linaro.org
 ---
  target/arm/vfp_helper.c | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220506180242.216785-22-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  docs/system/arm/emulation.rst | 1 +
  target/arm/cpu64.c            | 1 +
  target/arm/cpu_tcg.c          | 1 +
 files changed, 3 insertions(+)
 diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/vfp_helper.c
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/vfp_helper.c
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(fjcvtzs)(float64 value, float_status *status)
- - FEAT_CSV2_1p1 (Cache speculation variant 2, version 1.1)
- - FEAT_CSV2_1p2 (Cache speculation variant 2, version 1.2)
+ uint32_t HELPER(vjcvt)(float64 value, CPUARMState *env)
- - FEAT_CSV2_2 (Cache speculation variant 2, version 2)
+ {
-+- FEAT_CSV3 (Cache speculation variant 3)
+-    uint64_t pair = HELPER(fjcvtzs)(value, &env->vfp.fp_status);
- - FEAT_DIT (Data Independent Timing instructions)
++    uint64_t pair = HELPER(fjcvtzs)(value, &env->vfp.fp_status_a32);
- - FEAT_DPB (DC CVAP instruction)
+     uint32_t result = pair;
- - FEAT_Debugv8p2 (Debug changes for v8.2)
+     uint32_t z = (pair >> 32) == 0;
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
      t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);      /* FEAT_SEL2 */
      t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);       /* FEAT_DIT */
      t = FIELD_DP64(t, ID_AA64PFR0, CSV2, 2);      /* FEAT_CSV2_2 */
 +    t = FIELD_DP64(t, ID_AA64PFR0, CSV3, 1);      /* FEAT_CSV3 */
      cpu->isar.id_aa64pfr0 = t;
      t = cpu->isar.id_aa64pfr1;
 diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu_tcg.c
 +++ b/target/arm/cpu_tcg.c
@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
      cpu->isar.id_pfr0 = t;
      t = cpu->isar.id_pfr2;
 +    t = FIELD_DP32(t, ID_PFR2, CSV3, 1);          /* FEAT_CSV3 */
      t = FIELD_DP32(t, ID_PFR2, SSBS, 1);          /* FEAT_SSBS */
      cpu->isar.id_pfr2 = t;
 --
-.25.1
+.34.1

-New patch
+[PULL 21/36] target/arm: Use fp_status_a32 in vfp_cmp helpers
+The helpers vfp_cmps, vfp_cmpes, vfp_cmpd, vfp_cmped are used only from
+the A32 decoder; the A64 decoder uses separate vfp_cmps_a64 etc helpers
+(because for A64 we update the main NZCV flags and for A32 we update
+the FPSCR NZCV flags). So we can make these helpers use the fp_status_a32
+field instead of fp_status.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-10-peter.maydell@linaro.org
+---
+ target/arm/vfp_helper.c | 4 ++--
+file changed, 2 insertions(+), 2 deletions(-)
+diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/vfp_helper.c
++++ b/target/arm/vfp_helper.c
+@@ -XXX,XX +XXX,XX @@ void VFP_HELPER(cmpe, P)(ARGTYPE a, ARGTYPE b, CPUARMState *env) \
+         FLOATTYPE ## _compare(a, b, &env->vfp.FPST)); \
+ }
+ DO_VFP_cmp(h, float16, dh_ctype_f16, fp_status_f16)
+-DO_VFP_cmp(s, float32, float32, fp_status)
+-DO_VFP_cmp(d, float64, float64, fp_status)
++DO_VFP_cmp(s, float32, float32, fp_status_a32)
++DO_VFP_cmp(d, float64, float64, fp_status_a32)
+ #undef DO_VFP_cmp
+ /* Integer to float and float to integer conversions */
+--
+.34.1
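Note on the macro pattern: DO_VFP_cmp takes the status field name as a macro argument, so switching a family of helpers to a different float_status is a change to one argument per instantiation. The sketch below shows the same token-pasting technique with hypothetical `Toy*` types and a `DO_TOY_CMP` macro; it is not the QEMU helper definition.

```c
#include <assert.h>

/* Hypothetical status block that just counts how often it was used. */
typedef struct ToyStatus {
    int compares;
} ToyStatus;

typedef struct ToyEnv {
    ToyStatus fp_status_a32;
    ToyStatus fp_status_f16;
} ToyEnv;

/* Three-way compare that records which status block it was given. */
static int toy_compare(double a, double b, ToyStatus *st)
{
    st->compares++;
    return (a > b) - (a < b);
}

/* P names the helper suffix, FPST names the field inside ToyEnv. */
#define DO_TOY_CMP(P, ARGTYPE, FPST)                          \
static int toy_cmp##P(ARGTYPE a, ARGTYPE b, ToyEnv *env)      \
{                                                             \
    return toy_compare(a, b, &env->FPST);                     \
}

DO_TOY_CMP(s, float, fp_status_a32)
DO_TOY_CMP(d, double, fp_status_a32)
#undef DO_TOY_CMP
```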

-[PULL 23/32] target/arm: Enable FEAT_DGH for -cpu max
+[PULL 22/36] target/arm: Use FPST_A32 in A32 decoder
-From: Richard Henderson <richard.henderson@linaro.org>
+In the A32 decoder, use FPST_A32 rather than FPST_FPCR.  By
 doing an automated conversion of the whole file we avoid possibly
 using more than one fpst value in a set_rmode/op/restore_rmode
 sequence.
-This extension concerns not merging memory access, which TCG does
+Patch created with
-not implement.  Thus we can trivially enable this feature.
+  perl -p -i -e 's/FPST_FPCR(?!_)/FPST_A32/g' target/arm/tcg/translate-vfp.c
 Add a comment to handle_hint for the DGH instruction, but no code.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-23-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-11-peter.maydell@linaro.org
 ---
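Note on the conversion pattern: the `(?!_)` negative lookahead in the perl one-liner restricts the substitution to FPST_FPCR when it is not followed by an underscore, so FPST_FPCR_F16 is left untouched. The sketch below expresses the same matching rule in C with a hypothetical helper, `count_plain_fpst_fpcr()`, that counts the occurrences the substitution would rewrite.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Count occurrences of "FPST_FPCR" that are not immediately followed
 * by '_', mirroring the (?!_) negative lookahead.
 */
static size_t count_plain_fpst_fpcr(const char *text)
{
    static const char needle[] = "FPST_FPCR";
    const size_t len = sizeof(needle) - 1;
    size_t count = 0;
    const char *p = text;

    while ((p = strstr(p, needle)) != NULL) {
        /* p[len] is '\0' when the match ends the string, which counts. */
        if (p[len] != '_') {
            count++;
        }
        p += len;
    }
    return count;
}
```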
- docs/system/arm/emulation.rst | 1 +
+ target/arm/tcg/translate-vfp.c | 54 +++++++++++++++++-----------------
- target/arm/cpu64.c            | 1 +
+file changed, 27 insertions(+), 27 deletions(-)
  target/arm/translate-a64.c    | 1 +
 files changed, 3 insertions(+)
-diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
+diff --git a/target/arm/tcg/translate-vfp.c b/target/arm/tcg/translate-vfp.c
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/tcg/translate-vfp.c
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/tcg/translate-vfp.c
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
- - FEAT_CSV2_1p2 (Cache speculation variant 2, version 1.2)
+     if (sz == 1) {
- - FEAT_CSV2_2 (Cache speculation variant 2, version 2)
+         fpst = fpstatus_ptr(FPST_FPCR_F16);
- - FEAT_CSV3 (Cache speculation variant 3)
+     } else {
-+- FEAT_DGH (Data gathering hint)
+-        fpst = fpstatus_ptr(FPST_FPCR);
- - FEAT_DIT (Data Independent Timing instructions)
++        fpst = fpstatus_ptr(FPST_A32);
- - FEAT_DPB (DC CVAP instruction)
+     }
- - FEAT_Debugv8p2 (Debug changes for v8.2)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+     tcg_rmode = gen_set_rmode(rounding, fpst);
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
---- a/target/arm/cpu64.c
+     if (sz == 1) {
-+++ b/target/arm/cpu64.c
+         fpst = fpstatus_ptr(FPST_FPCR_F16);
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+     } else {
-     t = FIELD_DP64(t, ID_AA64ISAR1, SB, 1);       /* FEAT_SB */
+-        fpst = fpstatus_ptr(FPST_FPCR);
-     t = FIELD_DP64(t, ID_AA64ISAR1, SPECRES, 1);  /* FEAT_SPECRES */
++        fpst = fpstatus_ptr(FPST_A32);
-     t = FIELD_DP64(t, ID_AA64ISAR1, BF16, 1);     /* FEAT_BF16 */
+     }
-+    t = FIELD_DP64(t, ID_AA64ISAR1, DGH, 1);      /* FEAT_DGH */
-     t = FIELD_DP64(t, ID_AA64ISAR1, I8MM, 1);     /* FEAT_I8MM */
+     tcg_shift = tcg_constant_i32(0);
-     cpu->isar.id_aa64isar1 = t;
+@@ -XXX,XX +XXX,XX @@ static bool do_vfp_3op_sp(DisasContext *s, VFPGen3OpSPFn *fn,
+     f0 = tcg_temp_new_i32();
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+     f1 = tcg_temp_new_i32();
-index XXXXXXX..XXXXXXX 100644
+     fd = tcg_temp_new_i32();
---- a/target/arm/translate-a64.c
+-    fpst = fpstatus_ptr(FPST_FPCR);
-+++ b/target/arm/translate-a64.c
++    fpst = fpstatus_ptr(FPST_A32);
-@@ -XXX,XX +XXX,XX @@ static void handle_hint(DisasContext *s, uint32_t insn,
-         break;
+     vfp_load_reg32(f0, vn);
-     case 0b00100: /* SEV */
+     vfp_load_reg32(f1, vm);
-     case 0b00101: /* SEVL */
+@@ -XXX,XX +XXX,XX @@ static bool do_vfp_3op_dp(DisasContext *s, VFPGen3OpDPFn *fn,
-+    case 0b00110: /* DGH */
+     f0 = tcg_temp_new_i64();
-         /* we treat all as NOP at least for now */
+     f1 = tcg_temp_new_i64();
-         break;
+     fd = tcg_temp_new_i64();
-     case 0b00111: /* XPACLRI */
+-    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      vfp_load_reg64(f0, vn);
      vfp_load_reg64(f1, vm);
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_sp(DisasContext *s, arg_VFMA_sp *a, bool neg_n, bool neg_d)
          /* VFNMA, VFNMS */
          gen_vfp_negs(vd, vd);
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_vfp_muladds(vd, vn, vm, vd, fpst);
      vfp_store_reg32(vd, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_dp(DisasContext *s, arg_VFMA_dp *a, bool neg_n, bool neg_d)
          /* VFNMA, VFNMS */
          gen_vfp_negd(vd, vd);
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_vfp_muladdd(vd, vn, vm, vd, fpst);
      vfp_store_reg64(vd, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static void gen_VSQRT_hp(TCGv_i32 vd, TCGv_i32 vm)
  static void gen_VSQRT_sp(TCGv_i32 vd, TCGv_i32 vm)
  {
 -    gen_helper_vfp_sqrts(vd, vm, fpstatus_ptr(FPST_FPCR));
 +    gen_helper_vfp_sqrts(vd, vm, fpstatus_ptr(FPST_A32));
  }
  static void gen_VSQRT_dp(TCGv_i64 vd, TCGv_i64 vm)
  {
 -    gen_helper_vfp_sqrtd(vd, vm, fpstatus_ptr(FPST_FPCR));
 +    gen_helper_vfp_sqrtd(vd, vm, fpstatus_ptr(FPST_A32));
  }
  DO_VFP_2OP(VSQRT, hp, gen_VSQRT_hp, aa32_fp16_arith)
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_f32_f16(DisasContext *s, arg_VCVT_f32_f16 *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      ahp_mode = get_ahp_flag();
      tmp = tcg_temp_new_i32();
      /* The T bit tells us if we want the low or high 16 bits of Vm */
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_f64_f16(DisasContext *s, arg_VCVT_f64_f16 *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      ahp_mode = get_ahp_flag();
      tmp = tcg_temp_new_i32();
      /* The T bit tells us if we want the low or high 16 bits of Vm */
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_b16_f32(DisasContext *s, arg_VCVT_b16_f32 *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      tmp = tcg_temp_new_i32();
      vfp_load_reg32(tmp, a->vm);
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_f16_f32(DisasContext *s, arg_VCVT_f16_f32 *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      ahp_mode = get_ahp_flag();
      tmp = tcg_temp_new_i32();
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_f16_f64(DisasContext *s, arg_VCVT_f16_f64 *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      ahp_mode = get_ahp_flag();
      tmp = tcg_temp_new_i32();
      vm = tcg_temp_new_i64();
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTR_sp(DisasContext *s, arg_VRINTR_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg32(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_rints(tmp, tmp, fpst);
      vfp_store_reg32(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTR_dp(DisasContext *s, arg_VRINTR_dp *a)
      tmp = tcg_temp_new_i64();
      vfp_load_reg64(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_rintd(tmp, tmp, fpst);
      vfp_store_reg64(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTZ_sp(DisasContext *s, arg_VRINTZ_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg32(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      tcg_rmode = gen_set_rmode(FPROUNDING_ZERO, fpst);
      gen_helper_rints(tmp, tmp, fpst);
      gen_restore_rmode(tcg_rmode, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTZ_dp(DisasContext *s, arg_VRINTZ_dp *a)
      tmp = tcg_temp_new_i64();
      vfp_load_reg64(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      tcg_rmode = gen_set_rmode(FPROUNDING_ZERO, fpst);
      gen_helper_rintd(tmp, tmp, fpst);
      gen_restore_rmode(tcg_rmode, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTX_sp(DisasContext *s, arg_VRINTX_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg32(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_rints_exact(tmp, tmp, fpst);
      vfp_store_reg32(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTX_dp(DisasContext *s, arg_VRINTX_dp *a)
      tmp = tcg_temp_new_i64();
      vfp_load_reg64(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      gen_helper_rintd_exact(tmp, tmp, fpst);
      vfp_store_reg64(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_sp(DisasContext *s, arg_VCVT_sp *a)
      vm = tcg_temp_new_i32();
      vd = tcg_temp_new_i64();
      vfp_load_reg32(vm, a->vm);
 -    gen_helper_vfp_fcvtds(vd, vm, fpstatus_ptr(FPST_FPCR));
 +    gen_helper_vfp_fcvtds(vd, vm, fpstatus_ptr(FPST_A32));
      vfp_store_reg64(vd, a->vd);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_dp(DisasContext *s, arg_VCVT_dp *a)
      vd = tcg_temp_new_i32();
      vm = tcg_temp_new_i64();
      vfp_load_reg64(vm, a->vm);
 -    gen_helper_vfp_fcvtsd(vd, vm, fpstatus_ptr(FPST_FPCR));
 +    gen_helper_vfp_fcvtsd(vd, vm, fpstatus_ptr(FPST_A32));
      vfp_store_reg32(vd, a->vd);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_int_sp(DisasContext *s, arg_VCVT_int_sp *a)
      vm = tcg_temp_new_i32();
      vfp_load_reg32(vm, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      if (a->s) {
          /* i32 -> f32 */
          gen_helper_vfp_sitos(vm, vm, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_int_dp(DisasContext *s, arg_VCVT_int_dp *a)
      vm = tcg_temp_new_i32();
      vd = tcg_temp_new_i64();
      vfp_load_reg32(vm, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      if (a->s) {
          /* i32 -> f64 */
          gen_helper_vfp_sitod(vd, vm, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_fix_sp(DisasContext *s, arg_VCVT_fix_sp *a)
      vd = tcg_temp_new_i32();
      vfp_load_reg32(vd, a->vd);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      shift = tcg_constant_i32(frac_bits);
      /* Switch on op:U:sx bits */
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_fix_dp(DisasContext *s, arg_VCVT_fix_dp *a)
      vd = tcg_temp_new_i64();
      vfp_load_reg64(vd, a->vd);
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      shift = tcg_constant_i32(frac_bits);
      /* Switch on op:U:sx bits */
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_sp_int(DisasContext *s, arg_VCVT_sp_int *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      vm = tcg_temp_new_i32();
      vfp_load_reg32(vm, a->vm);
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_dp_int(DisasContext *s, arg_VCVT_dp_int *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A32);
      vm = tcg_temp_new_i64();
      vd = tcg_temp_new_i32();
      vfp_load_reg64(vm, a->vm);
 --
-.25.1
+.34.1

-[PULL 20/32] target/arm: Enable FEAT_CSV2 for -cpu max
+[PULL 23/36] target/arm: Use FPST_A64 in A64 decoder
-From: Richard Henderson <richard.henderson@linaro.org>
+In the A64 decoder, use FPST_A64 rather than FPST_FPCR.  By
 doing an automated conversion of the whole file we avoid possibly
 using more than one fpst value in a set_rmode/op/restore_rmode
 sequence.
-This extension concerns branch speculation, which TCG does
+Patch created with
 not implement.  Thus we can trivially enable this feature.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+  perl -p -i -e 's/FPST_FPCR(?!_)/FPST_A64/g' target/arm/tcg/translate-{a64,sve,sme}.c
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220506180242.216785-20-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-12-peter.maydell@linaro.org
 ---
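Note on the substitution above: the (?!_) in the perl expression is a
negative lookahead, so only a bare FPST_FPCR is rewritten and
FPST_FPCR_F16 is left alone. Because every bare use in a file is
converted in one pass, a set_rmode/op/restore_rmode sequence cannot end
up with a mix of old and new fpst values. The sketch below is not part
of the patch. It reproduces the same regex with Python's re module
(an assumption made for illustration; the patch itself used perl) and
applies it to a fragment copied from the do_fp1_scalar hunk. The
convert() helper is a hypothetical name.

```python
import re

# Same pattern as the perl one-liner in the commit message.
# (?!_) is a negative lookahead: FPST_FPCR matches only when the next
# character is not an underscore, so FPST_FPCR_F16 is never touched.
PATTERN = re.compile(r"FPST_FPCR(?!_)")


def convert(source: str) -> str:
    """Hypothetical helper: rewrite every bare FPST_FPCR to FPST_A64."""
    return PATTERN.sub("FPST_A64", source)


# Fragment taken from the do_fp1_scalar hunk (pre-conversion form).
before = """\
    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
    if (rmode >= 0) {
        tcg_rmode = gen_set_rmode(rmode, fpst);
    }
"""

after = convert(before)
print(after)
```

The lookahead is what keeps the conversion safe to run over a whole
file: a plain s/FPST_FPCR/FPST_A64/ would also turn FPST_FPCR_F16 into
FPST_A64_F16.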
- docs/system/arm/emulation.rst | 1 +
+ target/arm/tcg/translate-a64.c |  70 +++++++++++-----------
- target/arm/cpu64.c            | 1 +
+ target/arm/tcg/translate-sme.c |   4 +-
- target/arm/cpu_tcg.c          | 1 +
+ target/arm/tcg/translate-sve.c | 106 ++++++++++++++++-----------------
-files changed, 3 insertions(+)
+files changed, 90 insertions(+), 90 deletions(-)
-diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
+diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/tcg/translate-a64.c
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/tcg/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ static void gen_gvec_op3_fpst(DisasContext *s, bool is_q, int rd, int rn,
- - FEAT_BBM at level 2 (Translation table break-before-make levels)
+                               int rm, bool is_fp16, int data,
- - FEAT_BF16 (AArch64 BFloat16 instructions)
+                               gen_helper_gvec_3_ptr *fn)
- - FEAT_BTI (Branch Target Identification)
+ {
-+- FEAT_CSV2 (Cache speculation variant 2)
+-    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_FPCR);
- - FEAT_DIT (Data Independent Timing instructions)
++    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
- - FEAT_DPB (DC CVAP instruction)
+     tcg_gen_gvec_3_ptr(vec_full_reg_offset(s, rd),
- - FEAT_Debugv8p2 (Debug changes for v8.2)
+                        vec_full_reg_offset(s, rn),
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+                        vec_full_reg_offset(s, rm), fpst,
@@ -XXX,XX +XXX,XX @@ static void gen_gvec_op4_fpst(DisasContext *s, bool is_q, int rd, int rn,
                                int rm, int ra, bool is_fp16, int data,
                                gen_helper_gvec_4_ptr *fn)
  {
 -    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
      tcg_gen_gvec_4_ptr(vec_full_reg_offset(s, rd),
                         vec_full_reg_offset(s, rn),
                         vec_full_reg_offset(s, rm),
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar(DisasContext *s, arg_rrr_e *a, const FPScalar *f)
          if (fp_access_check(s)) {
              TCGv_i64 t0 = read_fp_dreg(s, a->rn);
              TCGv_i64 t1 = read_fp_dreg(s, a->rm);
 -            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_dreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar(DisasContext *s, arg_rrr_e *a, const FPScalar *f)
          if (fp_access_check(s)) {
              TCGv_i32 t0 = read_fp_sreg(s, a->rn);
              TCGv_i32 t1 = read_fp_sreg(s, a->rm);
 -            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fcmp0_s(DisasContext *s, arg_rr_e *a,
              TCGv_i64 t0 = read_fp_dreg(s, a->rn);
              TCGv_i64 t1 = tcg_constant_i64(0);
              if (swap) {
 -                f->gen_d(t0, t1, t0, fpstatus_ptr(FPST_FPCR));
 +                f->gen_d(t0, t1, t0, fpstatus_ptr(FPST_A64));
              } else {
 -                f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +                f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_A64));
              }
              write_fp_dreg(s, a->rd, t0);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fcmp0_s(DisasContext *s, arg_rr_e *a,
              TCGv_i32 t0 = read_fp_sreg(s, a->rn);
              TCGv_i32 t1 = tcg_constant_i32(0);
              if (swap) {
 -                f->gen_s(t0, t1, t0, fpstatus_ptr(FPST_FPCR));
 +                f->gen_s(t0, t1, t0, fpstatus_ptr(FPST_A64));
              } else {
 -                f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +                f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_A64));
              }
              write_fp_sreg(s, a->rd, t0);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_idx(DisasContext *s, arg_rrx_e *a, const FPScalar *f)
              TCGv_i64 t1 = tcg_temp_new_i64();
              read_vec_element(s, t1, a->rm, a->idx, MO_64);
 -            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_dreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_idx(DisasContext *s, arg_rrx_e *a, const FPScalar *f)
              TCGv_i32 t1 = tcg_temp_new_i32();
              read_vec_element_i32(s, t1, a->rm, a->idx, MO_32);
 -            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fmla_scalar_idx(DisasContext *s, arg_rrx_e *a, bool neg)
              if (neg) {
                  gen_vfp_negd(t1, t1);
              }
 -            gen_helper_vfp_muladdd(t0, t1, t2, t0, fpstatus_ptr(FPST_FPCR));
 +            gen_helper_vfp_muladdd(t0, t1, t2, t0, fpstatus_ptr(FPST_A64));
              write_fp_dreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fmla_scalar_idx(DisasContext *s, arg_rrx_e *a, bool neg)
              if (neg) {
                  gen_vfp_negs(t1, t1);
              }
 -            gen_helper_vfp_muladds(t0, t1, t2, t0, fpstatus_ptr(FPST_FPCR));
 +            gen_helper_vfp_muladds(t0, t1, t2, t0, fpstatus_ptr(FPST_A64));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_pair(DisasContext *s, arg_rr_e *a, const FPScalar *f)
              read_vec_element(s, t0, a->rn, 0, MO_64);
              read_vec_element(s, t1, a->rn, 1, MO_64);
 -            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_d(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_dreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_pair(DisasContext *s, arg_rr_e *a, const FPScalar *f)
              read_vec_element_i32(s, t0, a->rn, 0, MO_32);
              read_vec_element_i32(s, t1, a->rn, 1, MO_32);
 -            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_FPCR));
 +            f->gen_s(t0, t0, t1, fpstatus_ptr(FPST_A64));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fmadd(DisasContext *s, arg_rrrr_e *a, bool neg_a, bool neg_n)
              if (neg_n) {
                  gen_vfp_negd(tn, tn);
              }
 -            fpst = fpstatus_ptr(FPST_FPCR);
 +            fpst = fpstatus_ptr(FPST_A64);
              gen_helper_vfp_muladdd(ta, tn, tm, ta, fpst);
              write_fp_dreg(s, a->rd, ta);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fmadd(DisasContext *s, arg_rrrr_e *a, bool neg_a, bool neg_n)
              if (neg_n) {
                  gen_vfp_negs(tn, tn);
              }
 -            fpst = fpstatus_ptr(FPST_FPCR);
 +            fpst = fpstatus_ptr(FPST_A64);
              gen_helper_vfp_muladds(ta, tn, tm, ta, fpst);
              write_fp_sreg(s, a->rd, ta);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fp_reduction(DisasContext *s, arg_qrr_e *a,
      if (fp_access_check(s)) {
          MemOp esz = a->esz;
          int elts = (a->q ? 16 : 8) >> esz;
 -        TCGv_ptr fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +        TCGv_ptr fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
          TCGv_i32 res = do_reduction_op(s, a->rn, esz, 0, elts, fpst, fn);
          write_fp_sreg(s, a->rd, res);
      }
@@ -XXX,XX +XXX,XX @@ static void handle_fp_compare(DisasContext *s, int size,
                                bool cmp_with_zero, bool signal_all_nans)
  {
      TCGv_i64 tcg_flags = tcg_temp_new_i64();
 -    TCGv_ptr fpst = fpstatus_ptr(size == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    TCGv_ptr fpst = fpstatus_ptr(size == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      if (size == MO_64) {
          TCGv_i64 tcg_vn, tcg_vm;
@@ -XXX,XX +XXX,XX @@ static bool do_fp1_scalar(DisasContext *s, arg_rr_e *a,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      if (rmode >= 0) {
          tcg_rmode = gen_set_rmode(rmode, fpst);
      }
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_ds(DisasContext *s, arg_rr *a)
      if (fp_access_check(s)) {
          TCGv_i32 tcg_rn = read_fp_sreg(s, a->rn);
          TCGv_i64 tcg_rd = tcg_temp_new_i64();
 -        TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
          gen_helper_vfp_fcvtds(tcg_rd, tcg_rn, fpst);
          write_fp_dreg(s, a->rd, tcg_rd);
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_hs(DisasContext *s, arg_rr *a)
      if (fp_access_check(s)) {
          TCGv_i32 tmp = read_fp_sreg(s, a->rn);
          TCGv_i32 ahp = get_ahp_flag();
 -        TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
          gen_helper_vfp_fcvt_f32_to_f16(tmp, tmp, fpst, ahp);
          /* write_fp_sreg is OK here because top half of result is zero */
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_sd(DisasContext *s, arg_rr *a)
      if (fp_access_check(s)) {
          TCGv_i64 tcg_rn = read_fp_dreg(s, a->rn);
          TCGv_i32 tcg_rd = tcg_temp_new_i32();
 -        TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
          gen_helper_vfp_fcvtsd(tcg_rd, tcg_rn, fpst);
          write_fp_sreg(s, a->rd, tcg_rd);
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_hd(DisasContext *s, arg_rr *a)
          TCGv_i64 tcg_rn = read_fp_dreg(s, a->rn);
          TCGv_i32 tcg_rd = tcg_temp_new_i32();
          TCGv_i32 ahp = get_ahp_flag();
 -        TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
          gen_helper_vfp_fcvt_f64_to_f16(tcg_rd, tcg_rn, fpst, ahp);
          /* write_fp_sreg is OK here because top half of tcg_rd is zero */
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_sh(DisasContext *s, arg_rr *a)
      if (fp_access_check(s)) {
          TCGv_i32 tcg_rn = read_fp_hreg(s, a->rn);
          TCGv_i32 tcg_rd = tcg_temp_new_i32();
 -        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64);
          TCGv_i32 tcg_ahp = get_ahp_flag();
          gen_helper_vfp_fcvt_f16_to_f32(tcg_rd, tcg_rn, tcg_fpst, tcg_ahp);
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_dh(DisasContext *s, arg_rr *a)
      if (fp_access_check(s)) {
          TCGv_i32 tcg_rn = read_fp_hreg(s, a->rn);
          TCGv_i64 tcg_rd = tcg_temp_new_i64();
 -        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64);
          TCGv_i32 tcg_ahp = get_ahp_flag();
          gen_helper_vfp_fcvt_f16_to_f64(tcg_rd, tcg_rn, tcg_fpst, tcg_ahp);
@@ -XXX,XX +XXX,XX @@ static bool do_cvtf_scalar(DisasContext *s, MemOp esz, int rd, int shift,
      TCGv_i32 tcg_shift, tcg_single;
      TCGv_i64 tcg_double;
 -    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      tcg_shift = tcg_constant_i32(shift);
      switch (esz) {
@@ -XXX,XX +XXX,XX @@ static void do_fcvt_scalar(DisasContext *s, MemOp out, MemOp esz,
      TCGv_ptr tcg_fpstatus;
      TCGv_i32 tcg_shift, tcg_rmode, tcg_single;
 -    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      tcg_shift = tcg_constant_i32(shift);
      tcg_rmode = gen_set_rmode(rmode, tcg_fpstatus);
@@ -XXX,XX +XXX,XX @@ static bool trans_FJCVTZS(DisasContext *s, arg_FJCVTZS *a)
      }
      if (fp_access_check(s)) {
          TCGv_i64 t = read_fp_dreg(s, a->rn);
 -        TCGv_ptr fpstatus = fpstatus_ptr(FPST_FPCR);
 +        TCGv_ptr fpstatus = fpstatus_ptr(FPST_A64);
          gen_helper_fjcvtzs(t, t, fpstatus);
@@ -XXX,XX +XXX,XX @@ static void gen_fcvtxn_sd(TCGv_i64 d, TCGv_i64 n)
       * with von Neumann rounding (round to odd)
       */
      TCGv_i32 tmp = tcg_temp_new_i32();
 -    gen_helper_fcvtx_f64_to_f32(tmp, n, fpstatus_ptr(FPST_FPCR));
 +    gen_helper_fcvtx_f64_to_f32(tmp, n, fpstatus_ptr(FPST_A64));
      tcg_gen_extu_i32_i64(d, tmp);
  }
@@ -XXX,XX +XXX,XX @@ static void gen_fcvtn_hs(TCGv_i64 d, TCGv_i64 n)
  {
      TCGv_i32 tcg_lo = tcg_temp_new_i32();
      TCGv_i32 tcg_hi = tcg_temp_new_i32();
 -    TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +    TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
      TCGv_i32 ahp = get_ahp_flag();
      tcg_gen_extr_i64_i32(tcg_lo, tcg_hi, n);
@@ -XXX,XX +XXX,XX @@ static void gen_fcvtn_hs(TCGv_i64 d, TCGv_i64 n)
  static void gen_fcvtn_sd(TCGv_i64 d, TCGv_i64 n)
  {
      TCGv_i32 tmp = tcg_temp_new_i32();
 -    TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +    TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
      gen_helper_vfp_fcvtsd(tmp, n, fpst);
      tcg_gen_extu_i32_i64(d, tmp);
@@ -XXX,XX +XXX,XX @@ TRANS(FCVTXN_v, do_2misc_narrow_vector, a, f_scalar_fcvtxn)
  static void gen_bfcvtn_hs(TCGv_i64 d, TCGv_i64 n)
  {
 -    TCGv_ptr fpst = fpstatus_ptr(FPST_FPCR);
 +    TCGv_ptr fpst = fpstatus_ptr(FPST_A64);
      TCGv_i32 tmp = tcg_temp_new_i32();
      gen_helper_bfcvt_pair(tmp, n, fpst);
      tcg_gen_extu_i32_i64(d, tmp);
@@ -XXX,XX +XXX,XX @@ static bool do_fp1_vector(DisasContext *s, arg_qrr_e *a,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      if (rmode >= 0) {
          tcg_rmode = gen_set_rmode(rmode, fpst);
      }
@@ -XXX,XX +XXX,XX @@ static bool do_gvec_op2_fpst(DisasContext *s, MemOp esz, bool is_q,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      tcg_gen_gvec_2_ptr(vec_full_reg_offset(s, rd),
                         vec_full_reg_offset(s, rn), fpst,
                         is_q ? 16 : 8, vec_full_reg_size(s),
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVTL_v(DisasContext *s, arg_qrr_e *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR);
 +    fpst = fpstatus_ptr(FPST_A64);
      if (a->esz == MO_64) {
          /* 32 -> 64 bit fp conversion */
          TCGv_i64 tcg_res[2];
 diff --git a/target/arm/tcg/translate-sme.c b/target/arm/tcg/translate-sme.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/arm/tcg/translate-sme.c
-+++ b/target/arm/cpu64.c
++++ b/target/arm/tcg/translate-sme.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static bool do_outprod_env(DisasContext *s, arg_op *a, MemOp esz,
-     t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
+ TRANS_FEAT(FMOPA_h, aa64_sme, do_outprod_env, a,
-     t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);      /* FEAT_SEL2 */
+            MO_32, gen_helper_sme_fmopa_h)
-     t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);       /* FEAT_DIT */
+ TRANS_FEAT(FMOPA_s, aa64_sme, do_outprod_fpst, a,
-+    t = FIELD_DP64(t, ID_AA64PFR0, CSV2, 1);      /* FEAT_CSV2 */
+-           MO_32, FPST_FPCR, gen_helper_sme_fmopa_s)
-     cpu->isar.id_aa64pfr0 = t;
++           MO_32, FPST_A64, gen_helper_sme_fmopa_s)
+ TRANS_FEAT(FMOPA_d, aa64_sme_f64f64, do_outprod_fpst, a,
-     t = cpu->isar.id_aa64pfr1;
+-           MO_64, FPST_FPCR, gen_helper_sme_fmopa_d)
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
++           MO_64, FPST_A64, gen_helper_sme_fmopa_d)
  TRANS_FEAT(BFMOPA, aa64_sme, do_outprod_env, a, MO_32, gen_helper_sme_bfmopa)
 diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu_tcg.c
+--- a/target/arm/tcg/translate-sve.c
-+++ b/target/arm/cpu_tcg.c
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
+@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zz(DisasContext *s, gen_helper_gvec_2_ptr *fn,
-     cpu->isar.id_mmfr4 = t;
+                                  arg_rr_esz *a, int data)
+ {
-     t = cpu->isar.id_pfr0;
+     return gen_gvec_fpst_zz(s, fn, a->rd, a->rn, data,
-+    t = FIELD_DP32(t, ID_PFR0, CSV2, 2);          /* FEAT_CVS2 */
+-                            a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
-     t = FIELD_DP32(t, ID_PFR0, DIT, 1);           /* FEAT_DIT */
++                            a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-     t = FIELD_DP32(t, ID_PFR0, RAS, 1);           /* FEAT_RAS */
+ }
-     cpu->isar.id_pfr0 = t;
  /* Invoke an out-of-line helper on 3 Zregs. */
@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zzz(DisasContext *s, gen_helper_gvec_3_ptr *fn,
                                    arg_rrr_esz *a, int data)
  {
      return gen_gvec_fpst_zzz(s, fn, a->rd, a->rn, a->rm, data,
 -                             a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +                             a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
  }
  /* Invoke an out-of-line helper on 4 Zregs. */
@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zpzz(DisasContext *s, gen_helper_gvec_4_ptr *fn,
                                     arg_rprr_esz *a)
  {
      return gen_gvec_fpst_zzzp(s, fn, a->rd, a->rn, a->rm, a->pg, 0,
 -                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
  }
  /* Invoke a vector expander on two Zregs and an immediate.  */
@@ -XXX,XX +XXX,XX @@ static bool do_FMLA_zzxz(DisasContext *s, arg_rrxr_esz *a, bool sub)
      };
      return gen_gvec_fpst_zzzz(s, fns[a->esz], a->rd, a->rn, a->rm, a->ra,
                                (a->index << 1) | sub,
 -                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
  }
  TRANS_FEAT(FMLA_zzxz, aa64_sve, do_FMLA_zzxz, a, false)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const fmul_idx_fns[4] = {
  };
  TRANS_FEAT(FMUL_zzx, aa64_sve, gen_gvec_fpst_zzz,
             fmul_idx_fns[a->esz], a->rd, a->rn, a->rm, a->index,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  /*
   *** SVE Floating Point Fast Reduction Group
@@ -XXX,XX +XXX,XX @@ static bool do_reduce(DisasContext *s, arg_rpr_esz *a,
      tcg_gen_addi_ptr(t_zn, tcg_env, vec_full_reg_offset(s, a->rn));
      tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, a->pg));
 -    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      fn(temp, t_zn, t_pg, status, t_desc);
@@ -XXX,XX +XXX,XX @@ static bool do_ppz_fp(DisasContext *s, arg_rpr_esz *a,
      if (sve_access_check(s)) {
          unsigned vsz = vec_full_reg_size(s);
          TCGv_ptr status =
 -            fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +            fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
          tcg_gen_gvec_3_ptr(pred_full_reg_offset(s, a->rd),
                             vec_full_reg_offset(s, a->rn),
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const ftmad_fns[4] = {
  };
  TRANS_FEAT_NONSTREAMING(FTMAD, aa64_sve, gen_gvec_fpst_zzz,
                          ftmad_fns[a->esz], a->rd, a->rn, a->rm, a->imm,
 -                        a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +                        a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  /*
   *** SVE Floating Point Accumulating Reduction Group
@@ -XXX,XX +XXX,XX @@ static bool trans_FADDA(DisasContext *s, arg_rprr_esz *a)
      t_pg = tcg_temp_new_ptr();
      tcg_gen_addi_ptr(t_rm, tcg_env, vec_full_reg_offset(s, a->rm));
      tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, a->pg));
 -    t_fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    t_fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      t_desc = tcg_constant_i32(simd_desc(vsz, vsz, 0));
      fns[a->esz - 1](t_val, t_val, t_rm, t_pg, t_fpst, t_desc);
@@ -XXX,XX +XXX,XX @@ static void do_fp_scalar(DisasContext *s, int zd, int zn, int pg, bool is_fp16,
      tcg_gen_addi_ptr(t_zn, tcg_env, vec_full_reg_offset(s, zn));
      tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, pg));
 -    status = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    status = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
      desc = tcg_constant_i32(simd_desc(vsz, vsz, 0));
      fn(t_zd, t_zn, t_pg, scalar, status, desc);
  }
@@ -XXX,XX +XXX,XX @@ static bool do_fp_cmp(DisasContext *s, arg_rprr_esz *a,
      }
      if (sve_access_check(s)) {
          unsigned vsz = vec_full_reg_size(s);
 -        TCGv_ptr status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +        TCGv_ptr status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
          tcg_gen_gvec_4_ptr(pred_full_reg_offset(s, a->rd),
                             vec_full_reg_offset(s, a->rn),
                             vec_full_reg_offset(s, a->rm),
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_4_ptr * const fcadd_fns[] = {
  };
  TRANS_FEAT(FCADD, aa64_sve, gen_gvec_fpst_zzzp, fcadd_fns[a->esz],
             a->rd, a->rn, a->rm, a->pg, a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  #define DO_FMLA(NAME, name) \
      static gen_helper_gvec_5_ptr * const name##_fns[4] = {              \
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(FCADD, aa64_sve, gen_gvec_fpst_zzzp, fcadd_fns[a->esz],
      };                                                                  \
      TRANS_FEAT(NAME, aa64_sve, gen_gvec_fpst_zzzzp, name##_fns[a->esz], \
                 a->rd, a->rn, a->rm, a->ra, a->pg, 0,                    \
 -               a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +               a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  DO_FMLA(FMLA_zpzzz, fmla_zpzzz)
  DO_FMLA(FMLS_zpzzz, fmls_zpzzz)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_5_ptr * const fcmla_fns[4] = {
  };
  TRANS_FEAT(FCMLA_zpzzz, aa64_sve, gen_gvec_fpst_zzzzp, fcmla_fns[a->esz],
             a->rd, a->rn, a->rm, a->ra, a->pg, a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  static gen_helper_gvec_4_ptr * const fcmla_idx_fns[4] = {
      NULL, gen_helper_gvec_fcmlah_idx, gen_helper_gvec_fcmlas_idx, NULL
  };
  TRANS_FEAT(FCMLA_zzxz, aa64_sve, gen_gvec_fpst_zzzz, fcmla_idx_fns[a->esz],
             a->rd, a->rn, a->rm, a->ra, a->index * 4 + a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  /*
   *** SVE Floating Point Unary Operations Predicated Group
   */
  TRANS_FEAT(FCVT_sh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_sh, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_sh, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_hs, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_hs, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_hs, a, 0, FPST_A64)
  TRANS_FEAT(BFCVT, aa64_sve_bf16, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_bfcvt, a, 0, FPST_FPCR)
 +           gen_helper_sve_bfcvt, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_dh, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_dh, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_hd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_hd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_hd, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_ds, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvt_sd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZS_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvtzs_hh, a, 0, FPST_FPCR_F16)
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(FCVTZU_hd, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvtzu_hd, a, 0, FPST_FPCR_F16)
  TRANS_FEAT(FCVTZS_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_ss, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzs_ss, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZU_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_ss, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzu_ss, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZS_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzs_sd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZU_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzu_sd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZS_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzs_ds, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZU_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzu_ds, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZS_dd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_dd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzs_dd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZU_dd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_dd, a, 0, FPST_FPCR)
 +           gen_helper_sve_fcvtzu_dd, a, 0, FPST_A64)
  static gen_helper_gvec_3_ptr * const frint_fns[] = {
      NULL,
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frint_fns[] = {
      gen_helper_sve_frint_d
  };
  TRANS_FEAT(FRINTI, aa64_sve, gen_gvec_fpst_arg_zpz, frint_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  static gen_helper_gvec_3_ptr * const frintx_fns[] = {
      NULL,
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frintx_fns[] = {
      gen_helper_sve_frintx_d
  };
  TRANS_FEAT(FRINTX, aa64_sve, gen_gvec_fpst_arg_zpz, frintx_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
  static bool do_frint_mode(DisasContext *s, arg_rpr_esz *a,
                            ARMFPRounding mode, gen_helper_gvec_3_ptr *fn)
@@ -XXX,XX +XXX,XX @@ static bool do_frint_mode(DisasContext *s, arg_rpr_esz *a,
      }
      vsz = vec_full_reg_size(s);
 -    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR);
 +    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
      tmode = gen_set_rmode(mode, status);
      tcg_gen_gvec_3_ptr(vec_full_reg_offset(s, a->rd),
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frecpx_fns[] = {
      gen_helper_sve_frecpx_s, gen_helper_sve_frecpx_d,
  };
  TRANS_FEAT(FRECPX, aa64_sve, gen_gvec_fpst_arg_zpz, frecpx_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  static gen_helper_gvec_3_ptr * const fsqrt_fns[] = {
      NULL,                   gen_helper_sve_fsqrt_h,
      gen_helper_sve_fsqrt_s, gen_helper_sve_fsqrt_d,
  };
  TRANS_FEAT(FSQRT, aa64_sve, gen_gvec_fpst_arg_zpz, fsqrt_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  TRANS_FEAT(SCVTF_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_scvt_hh, a, 0, FPST_FPCR_F16)
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(SCVTF_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_scvt_dh, a, 0, FPST_FPCR_F16)
  TRANS_FEAT(SCVTF_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_ss, a, 0, FPST_FPCR)
 +           gen_helper_sve_scvt_ss, a, 0, FPST_A64)
  TRANS_FEAT(SCVTF_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve_scvt_ds, a, 0, FPST_A64)
  TRANS_FEAT(SCVTF_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve_scvt_sd, a, 0, FPST_A64)
  TRANS_FEAT(SCVTF_dd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_dd, a, 0, FPST_FPCR)
 +           gen_helper_sve_scvt_dd, a, 0, FPST_A64)
  TRANS_FEAT(UCVTF_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_ucvt_hh, a, 0, FPST_FPCR_F16)
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(UCVTF_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_ucvt_dh, a, 0, FPST_FPCR_F16)
  TRANS_FEAT(UCVTF_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_ss, a, 0, FPST_FPCR)
 +           gen_helper_sve_ucvt_ss, a, 0, FPST_A64)
  TRANS_FEAT(UCVTF_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve_ucvt_ds, a, 0, FPST_A64)
  TRANS_FEAT(UCVTF_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve_ucvt_sd, a, 0, FPST_A64)
  TRANS_FEAT(UCVTF_dd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_dd, a, 0, FPST_FPCR)
 +           gen_helper_sve_ucvt_dd, a, 0, FPST_A64)
  /*
   *** SVE Memory - 32-bit Gather and Unsized Contiguous Group
@@ -XXX,XX +XXX,XX @@ DO_ZPZZ_FP(FMINP, aa64_sve2, sve2_fminp_zpzz)
  TRANS_FEAT_NONSTREAMING(FMMLA_s, aa64_sve_f32mm, gen_gvec_fpst_zzzz,
                          gen_helper_fmmla_s, a->rd, a->rn, a->rm, a->ra,
 -                        0, FPST_FPCR)
 +                        0, FPST_A64)
  TRANS_FEAT_NONSTREAMING(FMMLA_d, aa64_sve_f64mm, gen_gvec_fpst_zzzz,
                          gen_helper_fmmla_d, a->rd, a->rn, a->rm, a->ra,
 -                        0, FPST_FPCR)
 +                        0, FPST_A64)
  static gen_helper_gvec_4 * const sqdmlal_zzzw_fns[] = {
      NULL,                           gen_helper_sve2_sqdmlal_zzzw_h,
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT_NONSTREAMING(RAX1, aa64_sve2_sha3, gen_gvec_fn_arg_zzz,
                          gen_gvec_rax1, a)
  TRANS_FEAT(FCVTNT_sh, aa64_sve2, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve2_fcvtnt_sh, a, 0, FPST_FPCR)
 +           gen_helper_sve2_fcvtnt_sh, a, 0, FPST_A64)
  TRANS_FEAT(FCVTNT_ds, aa64_sve2, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve2_fcvtnt_ds, a, 0, FPST_FPCR)
 +           gen_helper_sve2_fcvtnt_ds, a, 0, FPST_A64)
  TRANS_FEAT(BFCVTNT, aa64_sve_bf16, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_bfcvtnt, a, 0, FPST_FPCR)
 +           gen_helper_sve_bfcvtnt, a, 0, FPST_A64)
  TRANS_FEAT(FCVTLT_hs, aa64_sve2, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve2_fcvtlt_hs, a, 0, FPST_FPCR)
 +           gen_helper_sve2_fcvtlt_hs, a, 0, FPST_A64)
  TRANS_FEAT(FCVTLT_sd, aa64_sve2, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve2_fcvtlt_sd, a, 0, FPST_FPCR)
 +           gen_helper_sve2_fcvtlt_sd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTX_ds, aa64_sve2, do_frint_mode, a,
             FPROUNDING_ODD, gen_helper_sve_fcvt_ds)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const flogb_fns[] = {
      gen_helper_flogb_s, gen_helper_flogb_d
  };
  TRANS_FEAT(FLOGB, aa64_sve2, gen_gvec_fpst_arg_zpz, flogb_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_FPCR)
 +           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
  static bool do_FMLAL_zzzw(DisasContext *s, arg_rrrr_esz *a, bool sub, bool sel)
  {
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT_NONSTREAMING(BFMMLA, aa64_sve_bf16, gen_gvec_env_arg_zzzz,
  static bool do_BFMLAL_zzzw(DisasContext *s, arg_rrrr_esz *a, bool sel)
  {
      return gen_gvec_fpst_zzzz(s, gen_helper_gvec_bfmlal,
 -                              a->rd, a->rn, a->rm, a->ra, sel, FPST_FPCR);
 +                              a->rd, a->rn, a->rm, a->ra, sel, FPST_A64);
  }
  TRANS_FEAT(BFMLALB_zzzw, aa64_sve_bf16, do_BFMLAL_zzzw, a, false)
@@ -XXX,XX +XXX,XX @@ static bool do_BFMLAL_zzxw(DisasContext *s, arg_rrxr_esz *a, bool sel)
  {
      return gen_gvec_fpst_zzzz(s, gen_helper_gvec_bfmlal_idx,
                                a->rd, a->rn, a->rm, a->ra,
 -                              (a->index << 1) | sel, FPST_FPCR);
 +                              (a->index << 1) | sel, FPST_A64);
  }
  TRANS_FEAT(BFMLALB_zzxw, aa64_sve_bf16, do_BFMLAL_zzxw, a, false)
 --
-.25.1
+.34.1

-[PULL 14/32] target/arm: Add minimal RAS registers
+[PULL 24/36] target/arm: Remove now-unused vfp.fp_status and FPST_FPCR
-From: Richard Henderson <richard.henderson@linaro.org>
+Now that all the uses of vfp.fp_status and FPST_FPCR have moved
 to either the A32 or A64 fields, we can remove these.
-Add only the system registers required to implement zero error
-records.  This means that all values for ERRSELR are out of range,
-which means that it and all of the indexed error record registers
-need not be implemented.
-Add the EL2 registers required for injecting virtual SError.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-14-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-13-peter.maydell@linaro.org
 ---
- target/arm/cpu.h    |  5 +++
+ target/arm/cpu.h           | 2 --
- target/arm/helper.c | 84 +++++++++++++++++++++++++++++++++++++++++++++
+ target/arm/tcg/translate.h | 6 ------
-files changed, 89 insertions(+)
+ target/arm/cpu.c           | 1 -
  target/arm/vfp_helper.c    | 8 +-------
 files changed, 1 insertion(+), 16 deletions(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
 @@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-         uint64_t tfsr_el[4]; /* tfsre0_el1 is index 0.  */
-         uint64_t gcr_el1;
+         /* There are a number of distinct float control structures:
-         uint64_t rgsr_el1;
+          *
-+
+-         *  fp_status: is the "normal" fp status.
-+        /* Minimal RAS registers */
+          *  fp_status_a32: is the "normal" fp status for AArch32 insns
-+        uint64_t disr_el1;
+          *  fp_status_a64: is the "normal" fp status for AArch64 insns
-+        uint64_t vdisr_el2;
+          *  fp_status_fp16: used for half-precision calculations
-+        uint64_t vsesr_el2;
+@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-     } cp15;
+          * only thing which needs to read the exception flags being
+          * an explicit FPSCR read.
-     struct {
+          */
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+-        float_status fp_status;
          float_status fp_status_a32;
          float_status fp_status_a64;
          float_status fp_status_f16;
 diff --git a/target/arm/tcg/translate.h b/target/arm/tcg/translate.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/target/arm/tcg/translate.h
-+++ b/target/arm/helper.c
++++ b/target/arm/tcg/translate.h
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo debug_lpae_cp_reginfo[] = {
+@@ -XXX,XX +XXX,XX @@ static inline CPUARMTBFlags arm_tbflags_from_tb(const TranslationBlock *tb)
-       .access = PL0_R, .type = ARM_CP_CONST|ARM_CP_64BIT, .resetvalue = 0 },
+  * Enum for argument to fpstatus_ptr().
- };
+  */
+ typedef enum ARMFPStatusFlavour {
-+/*
+-    FPST_FPCR,
-+ * Check for traps to RAS registers, which are controlled
+     FPST_A32,
-+ * by HCR_EL2.TERR and SCR_EL3.TERR.
+     FPST_A64,
-+ */
+     FPST_FPCR_F16,
-+static CPAccessResult access_terr(CPUARMState *env, const ARMCPRegInfo *ri,
+@@ -XXX,XX +XXX,XX @@ typedef enum ARMFPStatusFlavour {
-+                                  bool isread)
+  * been set up to point to the requested field in the CPU state struct.
-+{
+  * The options are:
-+    int el = arm_current_el(env);
+  *
-+
+- * FPST_FPCR
-+    if (el < 2 && (arm_hcr_el2_eff(env) & HCR_TERR)) {
+- *   for non-FP16 operations controlled by the FPCR
-+        return CP_ACCESS_TRAP_EL2;
+  * FPST_A32
-+    }
+  *   for AArch32 non-FP16 operations controlled by the FPCR
-+    if (el < 3 && (env->cp15.scr_el3 & SCR_TERR)) {
+  * FPST_A64
-+        return CP_ACCESS_TRAP_EL3;
+@@ -XXX,XX +XXX,XX @@ static inline TCGv_ptr fpstatus_ptr(ARMFPStatusFlavour flavour)
-+    }
+     int offset;
-+    return CP_ACCESS_OK;
-+}
+     switch (flavour) {
-+
+-    case FPST_FPCR:
-+static uint64_t disr_read(CPUARMState *env, const ARMCPRegInfo *ri)
+-        offset = offsetof(CPUARMState, vfp.fp_status);
-+{
+-        break;
-+    int el = arm_current_el(env);
+     case FPST_A32:
-+
+         offset = offsetof(CPUARMState, vfp.fp_status_a32);
-+    if (el < 2 && (arm_hcr_el2_eff(env) & HCR_AMO)) {
+         break;
-+        return env->cp15.vdisr_el2;
+diff --git a/target/arm/cpu.c b/target/arm/cpu.c
-+    }
+index XXXXXXX..XXXXXXX 100644
-+    if (el < 3 && (env->cp15.scr_el3 & SCR_EA)) {
+--- a/target/arm/cpu.c
-+        return 0; /* RAZ/WI */
++++ b/target/arm/cpu.c
-+    }
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset_hold(Object *obj, ResetType type)
-+    return env->cp15.disr_el1;
+     set_flush_inputs_to_zero(1, &env->vfp.standard_fp_status);
-+}
+     set_default_nan_mode(1, &env->vfp.standard_fp_status);
-+
+     set_default_nan_mode(1, &env->vfp.standard_fp_status_f16);
-+static void disr_write(CPUARMState *env, const ARMCPRegInfo *ri, uint64_t val)
+-    arm_set_default_fp_behaviours(&env->vfp.fp_status);
-+{
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_a32);
-+    int el = arm_current_el(env);
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_a64);
-+
+     arm_set_default_fp_behaviours(&env->vfp.standard_fp_status);
-+    if (el < 2 && (arm_hcr_el2_eff(env) & HCR_AMO)) {
+diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
-+        env->cp15.vdisr_el2 = val;
+index XXXXXXX..XXXXXXX 100644
-+        return;
+--- a/target/arm/vfp_helper.c
-+    }
++++ b/target/arm/vfp_helper.c
-+    if (el < 3 && (env->cp15.scr_el3 & SCR_EA)) {
+@@ -XXX,XX +XXX,XX @@ static inline uint32_t vfp_exceptbits_from_host(int host_bits)
-+        return; /* RAZ/WI */
-+    }
+ static uint32_t vfp_get_fpsr_from_host(CPUARMState *env)
-+    env->cp15.disr_el1 = val;
+ {
-+}
+-    uint32_t i;
-+
++    uint32_t i = 0;
-+/*
-+ * Minimal RAS implementation with no Error Records.
+-    i = get_float_exception_flags(&env->vfp.fp_status);
-+ * Which means that all of the Error Record registers:
+     i |= get_float_exception_flags(&env->vfp.fp_status_a32);
-+ *   ERXADDR_EL1
+     i |= get_float_exception_flags(&env->vfp.fp_status_a64);
-+ *   ERXCTLR_EL1
+     i |= get_float_exception_flags(&env->vfp.standard_fp_status);
-+ *   ERXFR_EL1
+@@ -XXX,XX +XXX,XX @@ static void vfp_clear_float_status_exc_flags(CPUARMState *env)
-+ *   ERXMISC0_EL1
+      * values. The caller should have arranged for env->vfp.fpsr to
-+ *   ERXMISC1_EL1
+      * be the architecturally up-to-date exception flag information first.
-+ *   ERXMISC2_EL1
+      */
-+ *   ERXMISC3_EL1
+-    set_float_exception_flags(0, &env->vfp.fp_status);
-+ *   ERXPFGCDN_EL1  (RASv1p1)
+     set_float_exception_flags(0, &env->vfp.fp_status_a32);
-+ *   ERXPFGCTL_EL1  (RASv1p1)
+     set_float_exception_flags(0, &env->vfp.fp_status_a64);
-+ *   ERXPFGF_EL1    (RASv1p1)
+     set_float_exception_flags(0, &env->vfp.fp_status_f16);
-+ *   ERXSTATUS_EL1
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
-+ * and
+             i = float_round_to_zero;
-+ *   ERRSELR_EL1
+             break;
-+ * may generate UNDEFINED, which is the effect we get by not
+         }
-+ * listing them at all.
+-        set_float_rounding_mode(i, &env->vfp.fp_status);
-+ */
+         set_float_rounding_mode(i, &env->vfp.fp_status_a32);
-+static const ARMCPRegInfo minimal_ras_reginfo[] = {
+         set_float_rounding_mode(i, &env->vfp.fp_status_a64);
-+    { .name = "DISR_EL1", .state = ARM_CP_STATE_BOTH,
+         set_float_rounding_mode(i, &env->vfp.fp_status_f16);
-+      .opc0 = 3, .opc1 = 0, .crn = 12, .crm = 1, .opc2 = 1,
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
 +      .access = PL1_RW, .fieldoffset = offsetof(CPUARMState, cp15.disr_el1),
 +      .readfn = disr_read, .writefn = disr_write, .raw_writefn = raw_write },
 +    { .name = "ERRIDR_EL1", .state = ARM_CP_STATE_BOTH,
 +      .opc0 = 3, .opc1 = 0, .crn = 5, .crm = 3, .opc2 = 0,
 +      .access = PL1_R, .accessfn = access_terr,
 +      .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "VDISR_EL2", .state = ARM_CP_STATE_BOTH,
 +      .opc0 = 3, .opc1 = 4, .crn = 12, .crm = 1, .opc2 = 1,
 +      .access = PL2_RW, .fieldoffset = offsetof(CPUARMState, cp15.vdisr_el2) },
 +    { .name = "VSESR_EL2", .state = ARM_CP_STATE_BOTH,
 +      .opc0 = 3, .opc1 = 4, .crn = 5, .crm = 2, .opc2 = 3,
 +      .access = PL2_RW, .fieldoffset = offsetof(CPUARMState, cp15.vsesr_el2) },
 +};
 +
  /* Return the exception level to which exceptions should be taken
   * via SVEAccessTrap.  If an exception should be routed through
   * AArch64.AdvSIMDFPAccessTrap, return 0; fp_exception_el should
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
      if (cpu_isar_feature(aa64_ssbs, cpu)) {
          define_one_arm_cp_reg(cpu, &ssbs_reginfo);
      }
-+    if (cpu_isar_feature(any_ras, cpu)) {
+     if (changed & FPCR_FZ) {
-+        define_arm_cp_regs(cpu, minimal_ras_reginfo);
+         bool ftz_enabled = val & FPCR_FZ;
-+    }
+-        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status);
+-        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status);
-     if (cpu_isar_feature(aa64_vh, cpu) ||
+         set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_a32);
-         cpu_isar_feature(aa64_debugv8p2, cpu)) {
+         set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_a32);
          set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_a64);
@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
      }
      if (changed & FPCR_DN) {
          bool dnan_enabled = val & FPCR_DN;
 -        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status);
          set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a32);
          set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a64);
          set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16);
 --
-.25.1
+.34.1

-[PULL 16/32] target/arm: Implement virtual SError exceptions
+[PULL 25/36] target/arm: Define new fp_status_f16_a32 and fp_status_f16_a64
-From: Richard Henderson <richard.henderson@linaro.org>
+As the first part of splitting the existing fp_status_f16
 into separate float_status fields for AArch32 and AArch64
 (so that we can make FEAT_AFP control bits apply only
 for AArch64), define the two new fp_status_f16_a32 and
 fp_status_f16_a64 fields, but don't use them yet.
-Virtual SError exceptions are raised by setting HCR_EL2.VSE,
-and are routed to EL1 just like other virtual exceptions.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-16-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-14-peter.maydell@linaro.org
 ---
- target/arm/cpu.h       |  2 ++
+ target/arm/cpu.h           |  4 ++++
- target/arm/internals.h |  8 ++++++++
+ target/arm/tcg/translate.h | 12 ++++++++++++
- target/arm/syndrome.h  |  5 +++++
+ target/arm/cpu.c           |  2 ++
- target/arm/cpu.c       | 38 +++++++++++++++++++++++++++++++++++++-
+ target/arm/vfp_helper.c    | 14 ++++++++++++++
- target/arm/helper.c    | 40 +++++++++++++++++++++++++++++++++++++++-
+files changed, 32 insertions(+)
 files changed, 91 insertions(+), 2 deletions(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
- #define EXCP_LSERR          21   /* v8M LSERR SecureFault */
+          *  fp_status_a32: is the "normal" fp status for AArch32 insns
- #define EXCP_UNALIGNED      22   /* v7M UNALIGNED UsageFault */
+          *  fp_status_a64: is the "normal" fp status for AArch64 insns
- #define EXCP_DIVBYZERO      23   /* v7M DIVBYZERO UsageFault */
+          *  fp_status_fp16: used for half-precision calculations
-+#define EXCP_VSERR          24
++         *  fp_status_fp16_a32: used for AArch32 half-precision calculations
- /* NB: add new EXCP_ defines to the array in arm_log_exception() too */
++         *  fp_status_fp16_a64: used for AArch64 half-precision calculations
+          *  standard_fp_status : the ARM "Standard FPSCR Value"
- #define ARMV7M_EXCP_RESET   1
+          *  standard_fp_status_fp16 : used for half-precision
-@@ -XXX,XX +XXX,XX @@ enum {
+          *       calculations with the ARM "Standard FPSCR Value"
- #define CPU_INTERRUPT_FIQ   CPU_INTERRUPT_TGT_EXT_1
+@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
- #define CPU_INTERRUPT_VIRQ  CPU_INTERRUPT_TGT_EXT_2
+         float_status fp_status_a32;
- #define CPU_INTERRUPT_VFIQ  CPU_INTERRUPT_TGT_EXT_3
+         float_status fp_status_a64;
-+#define CPU_INTERRUPT_VSERR CPU_INTERRUPT_TGT_INT_0
+         float_status fp_status_f16;
++        float_status fp_status_f16_a32;
- /* The usual mapping for an AArch64 system register to its AArch32
++        float_status fp_status_f16_a64;
-  * counterpart is for the 32 bit world to have access to the lower
+         float_status standard_fp_status;
-diff --git a/target/arm/internals.h b/target/arm/internals.h
+         float_status standard_fp_status_f16;
 diff --git a/target/arm/tcg/translate.h b/target/arm/tcg/translate.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/internals.h
+--- a/target/arm/tcg/translate.h
-+++ b/target/arm/internals.h
++++ b/target/arm/tcg/translate.h
-@@ -XXX,XX +XXX,XX @@ void arm_cpu_update_virq(ARMCPU *cpu);
+@@ -XXX,XX +XXX,XX @@ typedef enum ARMFPStatusFlavour {
-  */
+     FPST_A32,
- void arm_cpu_update_vfiq(ARMCPU *cpu);
+     FPST_A64,
+     FPST_FPCR_F16,
-+/**
++    FPST_A32_F16,
-+ * arm_cpu_update_vserr: Update CPU_INTERRUPT_VSERR bit
++    FPST_A64_F16,
-+ *
+     FPST_STD,
-+ * Update the CPU_INTERRUPT_VSERR bit in cs->interrupt_request,
+     FPST_STD_F16,
-+ * following a change to the HCR_EL2.VSE bit.
+ } ARMFPStatusFlavour;
-+ */
+@@ -XXX,XX +XXX,XX @@ typedef enum ARMFPStatusFlavour {
-+void arm_cpu_update_vserr(ARMCPU *cpu);
+  *   for AArch64 non-FP16 operations controlled by the FPCR
-+
+  * FPST_FPCR_F16
- /**
+  *   for operations controlled by the FPCR where FPCR.FZ16 is to be used
-  * arm_mmu_idx_el:
++ * FPST_A32_F16
-  * @env: The cpu environment
++ *   for AArch32 operations controlled by the FPCR where FPCR.FZ16 is to be used
-diff --git a/target/arm/syndrome.h b/target/arm/syndrome.h
++ * FPST_A64_F16
-index XXXXXXX..XXXXXXX 100644
++ *   for AArch64 operations controlled by the FPCR where FPCR.FZ16 is to be used
---- a/target/arm/syndrome.h
+  * FPST_STD
-+++ b/target/arm/syndrome.h
+  *   for A32/T32 Neon operations using the "standard FPSCR value"
-@@ -XXX,XX +XXX,XX @@ static inline uint32_t syn_pcalignment(void)
+  * FPST_STD_F16
-     return (EC_PCALIGNMENT << ARM_EL_EC_SHIFT) | ARM_EL_IL;
+@@ -XXX,XX +XXX,XX @@ static inline TCGv_ptr fpstatus_ptr(ARMFPStatusFlavour flavour)
- }
+     case FPST_FPCR_F16:
+         offset = offsetof(CPUARMState, vfp.fp_status_f16);
-+static inline uint32_t syn_serror(uint32_t extra)
+         break;
-+{
++    case FPST_A32_F16:
-+    return (EC_SERROR << ARM_EL_EC_SHIFT) | ARM_EL_IL | extra;
++        offset = offsetof(CPUARMState, vfp.fp_status_f16_a32);
-+}
++        break;
-+
++    case FPST_A64_F16:
- #endif /* TARGET_ARM_SYNDROME_H */
++        offset = offsetof(CPUARMState, vfp.fp_status_f16_a64);
 +        break;
      case FPST_STD:
          offset = offsetof(CPUARMState, vfp.standard_fp_status);
          break;
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static bool arm_cpu_has_work(CPUState *cs)
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset_hold(Object *obj, ResetType type)
-     return (cpu->power_state != PSCI_OFF)
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_a64);
-         && cs->interrupt_request &
+     arm_set_default_fp_behaviours(&env->vfp.standard_fp_status);
-         (CPU_INTERRUPT_FIQ | CPU_INTERRUPT_HARD
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_f16);
--         | CPU_INTERRUPT_VFIQ | CPU_INTERRUPT_VIRQ
++    arm_set_default_fp_behaviours(&env->vfp.fp_status_f16_a32);
-+         | CPU_INTERRUPT_VFIQ | CPU_INTERRUPT_VIRQ | CPU_INTERRUPT_VSERR
++    arm_set_default_fp_behaviours(&env->vfp.fp_status_f16_a64);
-          | CPU_INTERRUPT_EXITTB);
+     arm_set_default_fp_behaviours(&env->vfp.standard_fp_status_f16);
  #ifndef CONFIG_USER_ONLY
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/vfp_helper.c
 +++ b/target/arm/vfp_helper.c
@@ -XXX,XX +XXX,XX @@ static uint32_t vfp_get_fpsr_from_host(CPUARMState *env)
      /* FZ16 does not generate an input denormal exception.  */
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16)
            & ~float_flag_input_denormal);
 +    i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a32)
 +          & ~float_flag_input_denormal);
 +    i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a64)
 +          & ~float_flag_input_denormal);
      i |= (get_float_exception_flags(&env->vfp.standard_fp_status_f16)
            & ~float_flag_input_denormal);
      return vfp_exceptbits_from_host(i);
@@ -XXX,XX +XXX,XX @@ static void vfp_clear_float_status_exc_flags(CPUARMState *env)
      set_float_exception_flags(0, &env->vfp.fp_status_a32);
      set_float_exception_flags(0, &env->vfp.fp_status_a64);
      set_float_exception_flags(0, &env->vfp.fp_status_f16);
 +    set_float_exception_flags(0, &env->vfp.fp_status_f16_a32);
 +    set_float_exception_flags(0, &env->vfp.fp_status_f16_a64);
      set_float_exception_flags(0, &env->vfp.standard_fp_status);
      set_float_exception_flags(0, &env->vfp.standard_fp_status_f16);
  }
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
-@@ -XXX,XX +XXX,XX @@ static inline bool arm_excp_unmasked(CPUState *cs, unsigned int excp_idx,
+         set_float_rounding_mode(i, &env->vfp.fp_status_a32);
-             return false;
+         set_float_rounding_mode(i, &env->vfp.fp_status_a64);
-         }
+         set_float_rounding_mode(i, &env->vfp.fp_status_f16);
-         return !(env->daif & PSTATE_I);
++        set_float_rounding_mode(i, &env->vfp.fp_status_f16_a32);
-+    case EXCP_VSERR:
++        set_float_rounding_mode(i, &env->vfp.fp_status_f16_a64);
 +        if (!(hcr_el2 & HCR_AMO) || (hcr_el2 & HCR_TGE)) {
 +            /* VIRQs are only taken when hypervized.  */
 +            return false;
 +        }
 +        return !(env->daif & PSTATE_A);
      default:
          g_assert_not_reached();
      }
-@@ -XXX,XX +XXX,XX @@ static bool arm_cpu_exec_interrupt(CPUState *cs, int interrupt_request)
+     if (changed & FPCR_FZ16) {
-             goto found;
+         bool ftz_enabled = val & FPCR_FZ16;
-         }
+         set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
 +        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a32);
 +        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a64);
          set_flush_to_zero(ftz_enabled, &env->vfp.standard_fp_status_f16);
          set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
 +        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a32);
 +        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a64);
          set_flush_inputs_to_zero(ftz_enabled, &env->vfp.standard_fp_status_f16);
      }
-+    if (interrupt_request & CPU_INTERRUPT_VSERR) {
+     if (changed & FPCR_FZ) {
-+        excp_idx = EXCP_VSERR;
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
-+        target_el = 1;
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a32);
-+        if (arm_excp_unmasked(cs, excp_idx, target_el,
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a64);
-+                              cur_el, secure, hcr_el2)) {
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16);
-+            /* Taking a virtual abort clears HCR_EL2.VSE */
++        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16_a32);
-+            env->cp15.hcr_el2 &= ~HCR_VSE;
++        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16_a64);
 +            cpu_reset_interrupt(cs, CPU_INTERRUPT_VSERR);
 +            goto found;
 +        }
 +    }
      return false;
   found:
@@ -XXX,XX +XXX,XX @@ void arm_cpu_update_vfiq(ARMCPU *cpu)
      }
  }
-+void arm_cpu_update_vserr(ARMCPU *cpu)
-+{
-+    /*
-+     * Update the interrupt level for VSERR, which is the HCR_EL2.VSE bit.
-+     */
-+    CPUARMState *env = &cpu->env;
-+    CPUState *cs = CPU(cpu);
-+
-+    bool new_state = env->cp15.hcr_el2 & HCR_VSE;
-+
-+    if (new_state != ((cs->interrupt_request & CPU_INTERRUPT_VSERR) != 0)) {
-+        if (new_state) {
-+            cpu_interrupt(cs, CPU_INTERRUPT_VSERR);
-+        } else {
-+            cpu_reset_interrupt(cs, CPU_INTERRUPT_VSERR);
-+        }
-+    }
-+}
-+
- #ifndef CONFIG_USER_ONLY
- static void arm_cpu_set_irq(void *opaque, int irq, int level)
- {
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static uint64_t isr_read(CPUARMState *env, const ARMCPRegInfo *ri)
-         }
-     }
--    /* External aborts are not possible in QEMU so A bit is always clear */
-+    if (hcr_el2 & HCR_AMO) {
-+        if (cs->interrupt_request & CPU_INTERRUPT_VSERR) {
-+            ret |= CPSR_A;
-+        }
-+    }
-+
-     return ret;
- }
-@@ -XXX,XX +XXX,XX @@ static void do_hcr_write(CPUARMState *env, uint64_t value, uint64_t valid_mask)
-     g_assert(qemu_mutex_iothread_locked());
-     arm_cpu_update_virq(cpu);
-     arm_cpu_update_vfiq(cpu);
-+    arm_cpu_update_vserr(cpu);
- }
- static void hcr_write(CPUARMState *env, const ARMCPRegInfo *ri, uint64_t value)
-@@ -XXX,XX +XXX,XX @@ void arm_log_exception(CPUState *cs)
-             [EXCP_LSERR] = "v8M LSERR UsageFault",
-             [EXCP_UNALIGNED] = "v7M UNALIGNED UsageFault",
-             [EXCP_DIVBYZERO] = "v7M DIVBYZERO UsageFault",
-+            [EXCP_VSERR] = "Virtual SERR",
-         };
-         if (idx >= 0 && idx < ARRAY_SIZE(excnames)) {
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_do_interrupt_aarch32(CPUState *cs)
-         mask = CPSR_A | CPSR_I | CPSR_F;
-         offset = 4;
-         break;
-+    case EXCP_VSERR:
-+        {
-+            /*
-+             * Note that this is reported as a data abort, but the DFAR
-+             * has an UNKNOWN value.  Construct the SError syndrome from
-+             * AET and ExT fields.
-+             */
-+            ARMMMUFaultInfo fi = { .type = ARMFault_AsyncExternal, };
-+
-+            if (extended_addresses_enabled(env)) {
-+                env->exception.fsr = arm_fi_to_lfsc(&fi);
-+            } else {
-+                env->exception.fsr = arm_fi_to_sfsc(&fi);
-+            }
-+            env->exception.fsr |= env->cp15.vsesr_el2 & 0xd000;
-+            A32_BANKED_CURRENT_REG_SET(env, dfsr, env->exception.fsr);
-+            qemu_log_mask(CPU_LOG_INT, "...with IFSR 0x%x\n",
-+                          env->exception.fsr);
-+
-+            new_mode = ARM_CPU_MODE_ABT;
-+            addr = 0x10;
-+            mask = CPSR_A | CPSR_I;
-+            offset = 8;
-+        }
-+        break;
-     case EXCP_SMC:
-         new_mode = ARM_CPU_MODE_MON;
-         addr = 0x08;
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_do_interrupt_aarch64(CPUState *cs)
-     case EXCP_VFIQ:
-         addr += 0x100;
-         break;
-+    case EXCP_VSERR:
-+        addr += 0x180;
-+        /* Construct the SError syndrome from IDS and ISS fields. */
-+        env->exception.syndrome = syn_serror(env->cp15.vsesr_el2 & 0x1ffffff);
-+        env->cp15.esr_el[new_el] = env->exception.syndrome;
-+        break;
-     default:
-         cpu_abort(cs, "Unhandled exception 0x%x\n", cs->exception_index);
-     }
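The two EXCP_VSERR cases above build the reported syndrome by masking VSESR_EL2. A minimal sketch of that masking follows, assuming the usual ESR_ELx layout (EC in bits [31:26], IL at bit 25, EC value 0x2f for SError). The `sketch_*` helpers and the macro names are hypothetical illustrations, not QEMU functions; only the two mask constants are taken from the patch.

```c
#include <stdint.h>

/* Masks copied from the patch. */
#define VSESR_A32_MASK 0xd000u     /* AArch32: AET [15:14] and ExT [12] */
#define VSESR_A64_MASK 0x1ffffffu  /* AArch64: IDS [24] and ISS [23:0] */

/* Assumed ESR_ELx layout; these names are illustrative only. */
#define ESR_EC_SHIFT   26
#define ESR_IL         (1u << 25)
#define ESR_EC_SERROR  0x2fu

/* Hypothetical stand-in for syn_serror(): EC | IL | caller-supplied bits. */
static uint32_t sketch_syn_serror(uint32_t extra)
{
    return (ESR_EC_SERROR << ESR_EC_SHIFT) | ESR_IL | extra;
}

/* AArch64 target: the syndrome carries IDS and ISS from VSESR_EL2. */
static uint32_t sketch_vserr_esr(uint64_t vsesr_el2)
{
    return sketch_syn_serror((uint32_t)(vsesr_el2 & VSESR_A64_MASK));
}

/* AArch32 target: OR the AET and ExT bits into an already-built fault status. */
static uint32_t sketch_vserr_dfsr(uint32_t base_fsr, uint64_t vsesr_el2)
{
    return base_fsr | (uint32_t)(vsesr_el2 & VSESR_A32_MASK);
}
```

In both cases the guest-visible value is a fixed encoding plus whatever the hypervisor programmed into VSESR_EL2, restricted to the fields the architecture allows it to supply.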
 --
-.25.1
+.34.1

-[PULL 31/32] hw/arm/virt: Fix CPU's default NUMA node ID
+[PULL 26/36] target/arm: Use fp_status_f16_a32 in AArch32-only helpers
-From: Gavin Shan <gshan@redhat.com>
+We directly use fp_status_f16 in a handful of helpers that
 are AArch32-specific; switch to fp_status_f16_a32 for these.
-When CPU-to-NUMA association isn't explicitly provided by users,
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-the default one is given by mc->get_default_cpu_node_id(). However,
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-the CPU topology isn't fully considered in the default association
+Message-id: 20250124162836.2332150-15-peter.maydell@linaro.org
-and this causes CPU topology broken warnings on booting Linux guest.
+---
  target/arm/tcg/vec_helper.c | 4 ++--
  target/arm/vfp_helper.c     | 2 +-
 files changed, 3 insertions(+), 3 deletions(-)
-For example, the following warning messages are observed when the
+diff --git a/target/arm/tcg/vec_helper.c b/target/arm/tcg/vec_helper.c
 Linux guest is booted with the following command lines.
   /home/gavin/sandbox/qemu.main/build/qemu-system-aarch64 \
   -accel kvm -machine virt,gic-version=host               \
   -cpu host                                               \
   -smp 6,sockets=2,cores=3,threads=1                      \
   -m 1024M,slots=16,maxmem=64G                            \
   -object memory-backend-ram,id=mem0,size=128M            \
   -object memory-backend-ram,id=mem1,size=128M            \
   -object memory-backend-ram,id=mem2,size=128M            \
   -object memory-backend-ram,id=mem3,size=128M            \
   -object memory-backend-ram,id=mem4,size=128M            \
   -object memory-backend-ram,id=mem5,size=384M            \
   -numa node,nodeid=0,memdev=mem0                         \
   -numa node,nodeid=1,memdev=mem1                         \
   -numa node,nodeid=2,memdev=mem2                         \
   -numa node,nodeid=3,memdev=mem3                         \
   -numa node,nodeid=4,memdev=mem4                         \
   -numa node,nodeid=5,memdev=mem5
          :
   alternatives: patching kernel code
   BUG: arch topology borken
   the CLS domain not a subset of the MC domain
   <the above error log repeats>
   BUG: arch topology borken
   the DIE domain not a subset of the NODE domain
 With the current implementation of mc->get_default_cpu_node_id(),
 CPU#0 to CPU#5 are associated with NODE#0 to NODE#5 respectively.
 That's incorrect because CPU#0/1/2 should be associated with the same
 NUMA node, since they sit in the same socket.
 This fixes the issue by considering the socket ID when the default
 CPU-to-NUMA association is provided in virt_possible_cpu_arch_ids().
 With this applied, no more CPU topology broken warnings are seen
 from the Linux guest. The 6 CPUs are associated with NODE#0/1, but
 there are no CPUs associated with NODE#2/3/4/5.
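The difference between the old and new default mapping can be sketched as follows. This is a minimal illustration, assuming CPUs are numbered socket by socket so that the socket ID is `cpu_index / (cores * threads)`; the `SmpTopology` type and all function names are hypothetical and are not QEMU APIs.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for the -smp topology; not a QEMU type. */
typedef struct {
    unsigned sockets;
    unsigned cores;    /* cores per socket */
    unsigned threads;  /* threads per core */
} SmpTopology;

/* Assumed linear layout: CPUs are numbered socket by socket. */
static unsigned cpu_socket_id(const SmpTopology *t, unsigned cpu_index)
{
    assert(t->cores > 0 && t->threads > 0);
    return cpu_index / (t->cores * t->threads);
}

/* Old default: round-robin over the nodes, ignoring the socket. */
static int64_t node_by_index(unsigned cpu_index, unsigned num_nodes)
{
    assert(num_nodes > 0);
    return cpu_index % num_nodes;
}

/* New default: every CPU in a socket lands on the same node. */
static int64_t node_by_socket(const SmpTopology *t, unsigned cpu_index,
                              unsigned num_nodes)
{
    assert(num_nodes > 0);
    return cpu_socket_id(t, cpu_index) % num_nodes;
}
```

With the `-smp 6,sockets=2,cores=3,threads=1` example and six NUMA nodes, `node_by_index` gives each CPU its own node, while `node_by_socket` places CPU#0/1/2 on node 0 and CPU#3/4/5 on node 1.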
 Signed-off-by: Gavin Shan <gshan@redhat.com>
 Reviewed-by: Igor Mammedov <imammedo@redhat.com>
 Reviewed-by: Yanan Wang <wangyanan55@huawei.com>
 Message-id: 20220503140304.855514-6-gshan@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  hw/arm/virt.c | 4 +++-
 file changed, 3 insertions(+), 1 deletion(-)
 diff --git a/hw/arm/virt.c b/hw/arm/virt.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/virt.c
+--- a/target/arm/tcg/vec_helper.c
-+++ b/hw/arm/virt.c
++++ b/target/arm/tcg/vec_helper.c
-@@ -XXX,XX +XXX,XX @@ virt_cpu_index_to_props(MachineState *ms, unsigned cpu_index)
+@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_a32)(void *vd, void *vn, void *vm,
+                             CPUARMState *env, uint32_t desc)
  static int64_t virt_get_default_cpu_node_id(const MachineState *ms, int idx)
  {
--    return idx % ms->numa_state->num_nodes;
+     do_fmlal(vd, vn, vm, &env->vfp.standard_fp_status, desc,
-+    int64_t socket_id = ms->possible_cpus->cpus[idx].props.socket_id;
+-             get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
-+
++             get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a32));
 +    return socket_id % ms->numa_state->num_nodes;
  }
- static const CPUArchIdList *virt_possible_cpu_arch_ids(MachineState *ms)
+ void HELPER(gvec_fmlal_a64)(void *vd, void *vn, void *vm,
@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_idx_a32)(void *vd, void *vn, void *vm,
                                  CPUARMState *env, uint32_t desc)
  {
      do_fmlal_idx(vd, vn, vm, &env->vfp.standard_fp_status, desc,
 -                 get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
 +                 get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a32));
  }
  void HELPER(gvec_fmlal_idx_a64)(void *vd, void *vn, void *vm,
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/vfp_helper.c
 +++ b/target/arm/vfp_helper.c
@@ -XXX,XX +XXX,XX @@ void VFP_HELPER(cmpe, P)(ARGTYPE a, ARGTYPE b, CPUARMState *env) \
      softfloat_to_vfp_compare(env, \
          FLOATTYPE ## _compare(a, b, &env->vfp.FPST)); \
  }
 -DO_VFP_cmp(h, float16, dh_ctype_f16, fp_status_f16)
 +DO_VFP_cmp(h, float16, dh_ctype_f16, fp_status_f16_a32)
  DO_VFP_cmp(s, float32, float32, fp_status_a32)
  DO_VFP_cmp(d, float64, float64, fp_status_a32)
  #undef DO_VFP_cmp
 --
-.25.1
+.34.1

-[PULL 19/32] target/arm: Enable FEAT_IESB for -cpu max
+[PULL 27/36] target/arm: Use fp_status_f16_a64 in AArch64-only helpers
-From: Richard Henderson <richard.henderson@linaro.org>
+We directly use fp_status_f16 in a handful of helpers that are
 AArch64-specific; switch to fp_status_f16_a64 for these.
-This feature is AArch64 only, and applies to physical SErrors,
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-which QEMU does not implement, thus the feature is a nop.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20250124162836.2332150-16-peter.maydell@linaro.org
 ---
  target/arm/tcg/sme_helper.c | 4 ++--
  target/arm/tcg/vec_helper.c | 8 ++++----
 files changed, 6 insertions(+), 6 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/target/arm/tcg/sme_helper.c b/target/arm/tcg/sme_helper.c
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220506180242.216785-19-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  docs/system/arm/emulation.rst | 1 +
  target/arm/cpu64.c            | 1 +
 files changed, 2 insertions(+)
 diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
+--- a/target/arm/tcg/sme_helper.c
-+++ b/docs/system/arm/emulation.rst
++++ b/target/arm/tcg/sme_helper.c
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
+@@ -XXX,XX +XXX,XX @@ void HELPER(sme_fmopa_h)(void *vza, void *vzn, void *vzm, void *vpn,
- - FEAT_FlagM2 (Enhancements to flag manipulation instructions)
+     float_status fpst_odd, fpst_std, fpst_f16;
- - FEAT_HPDS (Hierarchical permission disables)
- - FEAT_I8MM (AArch64 Int8 matrix multiplication instructions)
+     /*
-+- FEAT_IESB (Implicit error synchronization event)
+-     * Make copies of fp_status and fp_status_f16, because this operation
- - FEAT_JSCVT (JavaScript conversion instructions)
++     * Make copies of the fp status fields we use, because this operation
- - FEAT_LOR (Limited ordering regions)
+      * does not update the cumulative fp exception status.  It also
- - FEAT_LPA (Large Physical Address space)
+      * produces default NaNs. We also need a second copy of fp_status with
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+      * round-to-odd -- see above.
       */
 -    fpst_f16 = env->vfp.fp_status_f16;
 +    fpst_f16 = env->vfp.fp_status_f16_a64;
      fpst_std = env->vfp.fp_status_a64;
      set_default_nan_mode(true, &fpst_std);
      set_default_nan_mode(true, &fpst_f16);
 diff --git a/target/arm/tcg/vec_helper.c b/target/arm/tcg/vec_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/arm/tcg/vec_helper.c
-+++ b/target/arm/cpu64.c
++++ b/target/arm/tcg/vec_helper.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_a64)(void *vd, void *vn, void *vm,
-     t = cpu->isar.id_aa64mmfr2;
+                             CPUARMState *env, uint32_t desc)
-     t = FIELD_DP64(t, ID_AA64MMFR2, CNP, 1);      /* FEAT_TTCNP */
+ {
-     t = FIELD_DP64(t, ID_AA64MMFR2, UAO, 1);      /* FEAT_UAO */
+     do_fmlal(vd, vn, vm, &env->vfp.fp_status_a64, desc,
-+    t = FIELD_DP64(t, ID_AA64MMFR2, IESB, 1);     /* FEAT_IESB */
+-             get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
-     t = FIELD_DP64(t, ID_AA64MMFR2, VARANGE, 1);  /* FEAT_LVA */
++             get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a64));
-     t = FIELD_DP64(t, ID_AA64MMFR2, ST, 1);       /* FEAT_TTST */
+ }
-     t = FIELD_DP64(t, ID_AA64MMFR2, TTL, 1);      /* FEAT_TTL */
  void HELPER(sve2_fmlal_zzzw_s)(void *vd, void *vn, void *vm, void *va,
@@ -XXX,XX +XXX,XX @@ void HELPER(sve2_fmlal_zzzw_s)(void *vd, void *vn, void *vm, void *va,
      uint16_t negn = extract32(desc, SIMD_DATA_SHIFT, 1) << 15;
      intptr_t sel = extract32(desc, SIMD_DATA_SHIFT + 1, 1) * sizeof(float16);
      float_status *status = &env->vfp.fp_status_a64;
 -    bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16);
 +    bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a64);
      for (i = 0; i < oprsz; i += sizeof(float32)) {
          float16 nn_16 = *(float16 *)(vn + H1_2(i + sel)) ^ negn;
@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fmlal_idx_a64)(void *vd, void *vn, void *vm,
                                  CPUARMState *env, uint32_t desc)
  {
      do_fmlal_idx(vd, vn, vm, &env->vfp.fp_status_a64, desc,
 -                 get_flush_inputs_to_zero(&env->vfp.fp_status_f16));
 +                 get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a64));
  }
  void HELPER(sve2_fmlal_zzxw_s)(void *vd, void *vn, void *vm, void *va,
@@ -XXX,XX +XXX,XX @@ void HELPER(sve2_fmlal_zzxw_s)(void *vd, void *vn, void *vm, void *va,
      intptr_t sel = extract32(desc, SIMD_DATA_SHIFT + 1, 1) * sizeof(float16);
      intptr_t idx = extract32(desc, SIMD_DATA_SHIFT + 2, 3) * sizeof(float16);
      float_status *status = &env->vfp.fp_status_a64;
 -    bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16);
 +    bool fz16 = get_flush_inputs_to_zero(&env->vfp.fp_status_f16_a64);
      for (i = 0; i < oprsz; i += 16) {
          float16 mm_16 = *(float16 *)(vm + i + idx);
 --
-.25.1
+.34.1

-[PULL 07/32] target/arm: Update qemu-system-arm -cpu max to cortex-a57
+[PULL 28/36] target/arm: Use FPST_A32_F16 in A32 decoder
-From: Richard Henderson <richard.henderson@linaro.org>
+In the A32 decoder, use FPST_A32_F16 rather than FPST_FPCR_F16.
 By doing an automated conversion of the whole file we avoid possibly
 using more than one fpst value in a set_rmode/op/restore_rmode
 sequence.
-Instead of starting with cortex-a15 and adding v8 features to
+Patch created with
-a v7 cpu, begin with a v8 cpu stripped of its aarch64 features.
+  perl -p -i -e 's/FPST_FPCR_F16(?!_)/FPST_A32_F16/g' target/arm/tcg/translate-vfp.c
 This fixes the long-standing to-do where we only enabled v8
 features for user-only.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-7-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-17-peter.maydell@linaro.org
 ---
- target/arm/cpu_tcg.c | 151 ++++++++++++++++++++++++++-----------------
+ target/arm/tcg/translate-vfp.c | 24 ++++++++++++------------
-file changed, 92 insertions(+), 59 deletions(-)
+file changed, 12 insertions(+), 12 deletions(-)
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
+diff --git a/target/arm/tcg/translate-vfp.c b/target/arm/tcg/translate-vfp.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu_tcg.c
+--- a/target/arm/tcg/translate-vfp.c
-+++ b/target/arm/cpu_tcg.c
++++ b/target/arm/tcg/translate-vfp.c
-@@ -XXX,XX +XXX,XX @@ static void arm_v7m_class_init(ObjectClass *oc, void *data)
+@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
- static void arm_max_initfn(Object *obj)
+     }
      if (sz == 1) {
 -        fpst = fpstatus_ptr(FPST_FPCR_F16);
 +        fpst = fpstatus_ptr(FPST_A32_F16);
      } else {
          fpst = fpstatus_ptr(FPST_A32);
      }
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
      }
      if (sz == 1) {
 -        fpst = fpstatus_ptr(FPST_FPCR_F16);
 +        fpst = fpstatus_ptr(FPST_A32_F16);
      } else {
          fpst = fpstatus_ptr(FPST_A32);
      }
@@ -XXX,XX +XXX,XX @@ static bool do_vfp_3op_hp(DisasContext *s, VFPGen3OpSPFn *fn,
      /*
       * Do a half-precision operation. Functionally this is
       * the same as do_vfp_3op_sp(), except:
 -     *  - it uses the FPST_FPCR_F16
 +     *  - it uses the FPST_A32_F16
       *  - it doesn't need the VFP vector handling (fp16 is a
       *    v8 feature, and in v8 VFP vectors don't exist)
       *  - it does the aa32_fp16_arith feature test
@@ -XXX,XX +XXX,XX @@ static bool do_vfp_3op_hp(DisasContext *s, VFPGen3OpSPFn *fn,
      f0 = tcg_temp_new_i32();
      f1 = tcg_temp_new_i32();
      fd = tcg_temp_new_i32();
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      vfp_load_reg16(f0, vn);
      vfp_load_reg16(f1, vm);
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_hp(DisasContext *s, arg_VFMA_sp *a, bool neg_n, bool neg_d)
          /* VFNMA, VFNMS */
          gen_vfp_negh(vd, vd);
      }
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      gen_helper_vfp_muladdh(vd, vn, vm, vd, fpst);
      vfp_store_reg32(vd, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ DO_VFP_2OP(VNEG, dp, gen_vfp_negd, aa32_fpdp_v2)
  static void gen_VSQRT_hp(TCGv_i32 vd, TCGv_i32 vm)
  {
-     ARMCPU *cpu = ARM_CPU(obj);
+-    gen_helper_vfp_sqrth(vd, vm, fpstatus_ptr(FPST_FPCR_F16));
-+    uint32_t t;
++    gen_helper_vfp_sqrth(vd, vm, fpstatus_ptr(FPST_A32_F16));
 -    cortex_a15_initfn(obj);
 +    /* aarch64_a57_initfn, advertising none of the aarch64 features */
 +    cpu->dtb_compatible = "arm,cortex-a57";
 +    set_feature(&cpu->env, ARM_FEATURE_V8);
 +    set_feature(&cpu->env, ARM_FEATURE_NEON);
 +    set_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER);
 +    set_feature(&cpu->env, ARM_FEATURE_CBAR_RO);
 +    set_feature(&cpu->env, ARM_FEATURE_EL2);
 +    set_feature(&cpu->env, ARM_FEATURE_EL3);
 +    set_feature(&cpu->env, ARM_FEATURE_PMU);
 +    cpu->midr = 0x411fd070;
 +    cpu->revidr = 0x00000000;
 +    cpu->reset_fpsid = 0x41034070;
 +    cpu->isar.mvfr0 = 0x10110222;
 +    cpu->isar.mvfr1 = 0x12111111;
 +    cpu->isar.mvfr2 = 0x00000043;
 +    cpu->ctr = 0x8444c004;
 +    cpu->reset_sctlr = 0x00c50838;
 +    cpu->isar.id_pfr0 = 0x00000131;
 +    cpu->isar.id_pfr1 = 0x00011011;
 +    cpu->isar.id_dfr0 = 0x03010066;
 +    cpu->id_afr0 = 0x00000000;
 +    cpu->isar.id_mmfr0 = 0x10101105;
 +    cpu->isar.id_mmfr1 = 0x40000000;
 +    cpu->isar.id_mmfr2 = 0x01260000;
 +    cpu->isar.id_mmfr3 = 0x02102211;
 +    cpu->isar.id_isar0 = 0x02101110;
 +    cpu->isar.id_isar1 = 0x13112111;
 +    cpu->isar.id_isar2 = 0x21232042;
 +    cpu->isar.id_isar3 = 0x01112131;
 +    cpu->isar.id_isar4 = 0x00011142;
 +    cpu->isar.id_isar5 = 0x00011121;
 +    cpu->isar.id_isar6 = 0;
 +    cpu->isar.dbgdidr = 0x3516d000;
 +    cpu->clidr = 0x0a200023;
 +    cpu->ccsidr[0] = 0x701fe00a; /* 32KB L1 dcache */
 +    cpu->ccsidr[1] = 0x201fe012; /* 48KB L1 icache */
 +    cpu->ccsidr[2] = 0x70ffe07a; /* 2048KB L2 cache */
 +    define_cortex_a72_a57_a53_cp_reginfo(cpu);
 -    /* old-style VFP short-vector support */
 -    cpu->isar.mvfr0 = FIELD_DP32(cpu->isar.mvfr0, MVFR0, FPSHVEC, 1);
 +    /* Add additional features supported by QEMU */
 +    t = cpu->isar.id_isar5;
 +    t = FIELD_DP32(t, ID_ISAR5, AES, 2);
 +    t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, CRC32, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, RDM, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);
 +    cpu->isar.id_isar5 = t;
 +
 +    t = cpu->isar.id_isar6;
 +    t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, DP, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, FHM, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, SB, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, BF16, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);
 +    cpu->isar.id_isar6 = t;
 +
 +    t = cpu->isar.mvfr1;
 +    t = FIELD_DP32(t, MVFR1, FPHP, 3);     /* v8.2-FP16 */
 +    t = FIELD_DP32(t, MVFR1, SIMDHP, 2);   /* v8.2-FP16 */
 +    cpu->isar.mvfr1 = t;
 +
 +    t = cpu->isar.mvfr2;
 +    t = FIELD_DP32(t, MVFR2, SIMDMISC, 3); /* SIMD MaxNum */
 +    t = FIELD_DP32(t, MVFR2, FPMISC, 4);   /* FP MaxNum */
 +    cpu->isar.mvfr2 = t;
 +
 +    t = cpu->isar.id_mmfr3;
 +    t = FIELD_DP32(t, ID_MMFR3, PAN, 2); /* ATS1E1 */
 +    cpu->isar.id_mmfr3 = t;
 +
 +    t = cpu->isar.id_mmfr4;
 +    t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
 +    t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 +    t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
 +    t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
 +    cpu->isar.id_mmfr4 = t;
 +
 +    t = cpu->isar.id_pfr0;
 +    t = FIELD_DP32(t, ID_PFR0, DIT, 1);
 +    cpu->isar.id_pfr0 = t;
 +
 +    t = cpu->isar.id_pfr2;
 +    t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
 +    cpu->isar.id_pfr2 = t;
  #ifdef CONFIG_USER_ONLY
      /*
 -     * We don't set these in system emulation mode for the moment,
 -     * since we don't correctly set (all of) the ID registers to
 -     * advertise them.
 +     * Break with true ARMv8 and add back old-style VFP short-vector support.
 +     * Only do this for user-mode, where -cpu max is the default, so that
 +     * older v6 and v7 programs are more likely to work without adjustment.
       */
 -    set_feature(&cpu->env, ARM_FEATURE_V8);
 -    {
 -        uint32_t t;
 -
 -        t = cpu->isar.id_isar5;
 -        t = FIELD_DP32(t, ID_ISAR5, AES, 2);
 -        t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);
 -        t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);
 -        t = FIELD_DP32(t, ID_ISAR5, CRC32, 1);
 -        t = FIELD_DP32(t, ID_ISAR5, RDM, 1);
 -        t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);
 -        cpu->isar.id_isar5 = t;
 -
 -        t = cpu->isar.id_isar6;
 -        t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, DP, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, FHM, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, SB, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, BF16, 1);
 -        t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);
 -        cpu->isar.id_isar6 = t;
 -
 -        t = cpu->isar.mvfr1;
 -        t = FIELD_DP32(t, MVFR1, FPHP, 3);     /* v8.2-FP16 */
 -        t = FIELD_DP32(t, MVFR1, SIMDHP, 2);   /* v8.2-FP16 */
 -        cpu->isar.mvfr1 = t;
 -
 -        t = cpu->isar.mvfr2;
 -        t = FIELD_DP32(t, MVFR2, SIMDMISC, 3); /* SIMD MaxNum */
 -        t = FIELD_DP32(t, MVFR2, FPMISC, 4);   /* FP MaxNum */
 -        cpu->isar.mvfr2 = t;
 -
 -        t = cpu->isar.id_mmfr3;
 -        t = FIELD_DP32(t, ID_MMFR3, PAN, 2); /* ATS1E1 */
 -        cpu->isar.id_mmfr3 = t;
 -
 -        t = cpu->isar.id_mmfr4;
 -        t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
 -        t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 -        t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
 -        t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
 -        cpu->isar.id_mmfr4 = t;
 -
 -        t = cpu->isar.id_pfr0;
 -        t = FIELD_DP32(t, ID_PFR0, DIT, 1);
 -        cpu->isar.id_pfr0 = t;
 -
 -        t = cpu->isar.id_pfr2;
 -        t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
 -        cpu->isar.id_pfr2 = t;
 -    }
 -#endif /* CONFIG_USER_ONLY */
 +    cpu->isar.mvfr0 = FIELD_DP32(cpu->isar.mvfr0, MVFR0, FPSHVEC, 1);
 +#endif
  }
- #endif /* !TARGET_AARCH64 */
  static void gen_VSQRT_sp(TCGv_i32 vd, TCGv_i32 vm)
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTR_hp(DisasContext *s, arg_VRINTR_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg16(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      gen_helper_rinth(tmp, tmp, fpst);
      vfp_store_reg32(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTZ_hp(DisasContext *s, arg_VRINTZ_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg16(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      tcg_rmode = gen_set_rmode(FPROUNDING_ZERO, fpst);
      gen_helper_rinth(tmp, tmp, fpst);
      gen_restore_rmode(tcg_rmode, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTX_hp(DisasContext *s, arg_VRINTX_sp *a)
      tmp = tcg_temp_new_i32();
      vfp_load_reg16(tmp, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      gen_helper_rinth_exact(tmp, tmp, fpst);
      vfp_store_reg32(tmp, a->vd);
      return true;
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_int_hp(DisasContext *s, arg_VCVT_int_sp *a)
      vm = tcg_temp_new_i32();
      vfp_load_reg32(vm, a->vm);
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      if (a->s) {
          /* i32 -> f16 */
          gen_helper_vfp_sitoh(vm, vm, fpst);
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_fix_hp(DisasContext *s, arg_VCVT_fix_sp *a)
      vd = tcg_temp_new_i32();
      vfp_load_reg32(vd, a->vd);
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      shift = tcg_constant_i32(frac_bits);
      /* Switch on op:U:sx bits */
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_hp_int(DisasContext *s, arg_VCVT_sp_int *a)
          return true;
      }
 -    fpst = fpstatus_ptr(FPST_FPCR_F16);
 +    fpst = fpstatus_ptr(FPST_A32_F16);
      vm = tcg_temp_new_i32();
      vfp_load_reg16(vm, a->vm);
 --
-.25.1
+.34.1

-[PULL 06/32] target/arm: Move cortex impdef sysregs to cpu_tcg.c
+[PULL 29/36] target/arm: Use FPST_A64_F16 in A64 decoder
-From: Richard Henderson <richard.henderson@linaro.org>
+In the A64 decoder, use FPST_A64_F16 rather than FPST_FPCR_F16.
 By doing an automated conversion of the whole file we avoid possibly
 using more than one fpst value in a set_rmode/op/restore_rmode
 sequence.
-Previously we were defining some of these in user-only mode,
+Patch created with
-but none of them are accessible from user-only, therefore
+  perl -p -i -e 's/FPST_FPCR_F16(?!_)/FPST_A64_F16/g' target/arm/tcg/translate-{a64,sve,sme}.c
 define them only in system mode.
-This will shortly be used from cpu_tcg.c also.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20250124162836.2332150-18-peter.maydell@linaro.org
 ---
  target/arm/tcg/translate-a64.c | 32 ++++++++---------
  target/arm/tcg/translate-sve.c | 66 +++++++++++++++++-----------------
 files changed, 49 insertions(+), 49 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220506180242.216785-6-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/internals.h |  6 ++++
  target/arm/cpu64.c     | 64 +++---------------------------------------
  target/arm/cpu_tcg.c   | 59 ++++++++++++++++++++++++++++++++++++++
 files changed, 69 insertions(+), 60 deletions(-)
 diff --git a/target/arm/internals.h b/target/arm/internals.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/internals.h
+--- a/target/arm/tcg/translate-a64.c
-+++ b/target/arm/internals.h
++++ b/target/arm/tcg/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ int aarch64_fpu_gdb_get_reg(CPUARMState *env, GByteArray *buf, int reg);
+@@ -XXX,XX +XXX,XX @@ static void gen_gvec_op3_fpst(DisasContext *s, bool is_q, int rd, int rn,
- int aarch64_fpu_gdb_set_reg(CPUARMState *env, uint8_t *buf, int reg);
+                               int rm, bool is_fp16, int data,
- #endif
+                               gen_helper_gvec_3_ptr *fn)
+ {
-+#ifdef CONFIG_USER_ONLY
+-    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
-+static inline void define_cortex_a72_a57_a53_cp_reginfo(ARMCPU *cpu) { }
++    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_A64_F16 : FPST_A64);
-+#else
+     tcg_gen_gvec_3_ptr(vec_full_reg_offset(s, rd),
-+void define_cortex_a72_a57_a53_cp_reginfo(ARMCPU *cpu);
+                        vec_full_reg_offset(s, rn),
-+#endif
+                        vec_full_reg_offset(s, rm), fpst,
-+
+@@ -XXX,XX +XXX,XX @@ static void gen_gvec_op4_fpst(DisasContext *s, bool is_q, int rd, int rn,
- #endif
+                               int rm, int ra, bool is_fp16, int data,
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+                               gen_helper_gvec_4_ptr *fn)
  {
 -    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
 +    TCGv_ptr fpst = fpstatus_ptr(is_fp16 ? FPST_A64_F16 : FPST_A64);
      tcg_gen_gvec_4_ptr(vec_full_reg_offset(s, rd),
                         vec_full_reg_offset(s, rn),
                         vec_full_reg_offset(s, rm),
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar(DisasContext *s, arg_rrr_e *a, const FPScalar *f)
          if (fp_access_check(s)) {
              TCGv_i32 t0 = read_fp_hreg(s, a->rn);
              TCGv_i32 t1 = read_fp_hreg(s, a->rm);
 -            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_FPCR_F16));
 +            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_A64_F16));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fcmp0_s(DisasContext *s, arg_rr_e *a,
              TCGv_i32 t0 = read_fp_hreg(s, a->rn);
              TCGv_i32 t1 = tcg_constant_i32(0);
              if (swap) {
 -                f->gen_h(t0, t1, t0, fpstatus_ptr(FPST_FPCR_F16));
 +                f->gen_h(t0, t1, t0, fpstatus_ptr(FPST_A64_F16));
              } else {
 -                f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_FPCR_F16));
 +                f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_A64_F16));
              }
              write_fp_sreg(s, a->rd, t0);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_idx(DisasContext *s, arg_rrx_e *a, const FPScalar *f)
              TCGv_i32 t1 = tcg_temp_new_i32();
              read_vec_element_i32(s, t1, a->rm, a->idx, MO_16);
 -            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_FPCR_F16));
 +            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_A64_F16));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fmla_scalar_idx(DisasContext *s, arg_rrx_e *a, bool neg)
                  gen_vfp_negh(t1, t1);
              }
              gen_helper_advsimd_muladdh(t0, t1, t2, t0,
 -                                       fpstatus_ptr(FPST_FPCR_F16));
 +                                       fpstatus_ptr(FPST_A64_F16));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fp3_scalar_pair(DisasContext *s, arg_rr_e *a, const FPScalar *f)
              read_vec_element_i32(s, t0, a->rn, 0, MO_16);
              read_vec_element_i32(s, t1, a->rn, 1, MO_16);
 -            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_FPCR_F16));
 +            f->gen_h(t0, t0, t1, fpstatus_ptr(FPST_A64_F16));
              write_fp_sreg(s, a->rd, t0);
          }
          break;
@@ -XXX,XX +XXX,XX @@ static bool do_fmadd(DisasContext *s, arg_rrrr_e *a, bool neg_a, bool neg_n)
              if (neg_n) {
                  gen_vfp_negh(tn, tn);
              }
 -            fpst = fpstatus_ptr(FPST_FPCR_F16);
 +            fpst = fpstatus_ptr(FPST_A64_F16);
              gen_helper_advsimd_muladdh(ta, tn, tm, ta, fpst);
              write_fp_sreg(s, a->rd, ta);
          }
@@ -XXX,XX +XXX,XX @@ static bool do_fp_reduction(DisasContext *s, arg_qrr_e *a,
      if (fp_access_check(s)) {
          MemOp esz = a->esz;
          int elts = (a->q ? 16 : 8) >> esz;
 -        TCGv_ptr fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +        TCGv_ptr fpst = fpstatus_ptr(esz == MO_16 ? FPST_A64_F16 : FPST_A64);
          TCGv_i32 res = do_reduction_op(s, a->rn, esz, 0, elts, fpst, fn);
          write_fp_sreg(s, a->rd, res);
      }
@@ -XXX,XX +XXX,XX @@ static void handle_fp_compare(DisasContext *s, int size,
                                bool cmp_with_zero, bool signal_all_nans)
  {
      TCGv_i64 tcg_flags = tcg_temp_new_i64();
 -    TCGv_ptr fpst = fpstatus_ptr(size == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    TCGv_ptr fpst = fpstatus_ptr(size == MO_16 ? FPST_A64_F16 : FPST_A64);
      if (size == MO_64) {
          TCGv_i64 tcg_vn, tcg_vm;
@@ -XXX,XX +XXX,XX @@ static bool do_fp1_scalar(DisasContext *s, arg_rr_e *a,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      if (rmode >= 0) {
          tcg_rmode = gen_set_rmode(rmode, fpst);
      }
@@ -XXX,XX +XXX,XX @@ static bool do_cvtf_scalar(DisasContext *s, MemOp esz, int rd, int shift,
      TCGv_i32 tcg_shift, tcg_single;
      TCGv_i64 tcg_double;
 -    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      tcg_shift = tcg_constant_i32(shift);
      switch (esz) {
@@ -XXX,XX +XXX,XX @@ static void do_fcvt_scalar(DisasContext *s, MemOp out, MemOp esz,
      TCGv_ptr tcg_fpstatus;
      TCGv_i32 tcg_shift, tcg_rmode, tcg_single;
 -    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    tcg_fpstatus = fpstatus_ptr(esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      tcg_shift = tcg_constant_i32(shift);
      tcg_rmode = gen_set_rmode(rmode, tcg_fpstatus);
@@ -XXX,XX +XXX,XX @@ static bool do_fp1_vector(DisasContext *s, arg_qrr_e *a,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      if (rmode >= 0) {
          tcg_rmode = gen_set_rmode(rmode, fpst);
      }
@@ -XXX,XX +XXX,XX @@ static bool do_gvec_op2_fpst(DisasContext *s, MemOp esz, bool is_q,
          return check == 0;
      }
 -    fpst = fpstatus_ptr(esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    fpst = fpstatus_ptr(esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      tcg_gen_gvec_2_ptr(vec_full_reg_offset(s, rd),
                         vec_full_reg_offset(s, rn), fpst,
                         is_q ? 16 : 8, vec_full_reg_size(s),
 diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/arm/tcg/translate-sve.c
-+++ b/target/arm/cpu64.c
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zz(DisasContext *s, gen_helper_gvec_2_ptr *fn,
- #include "hvf_arm.h"
+                                  arg_rr_esz *a, int data)
- #include "qapi/visitor.h"
+ {
- #include "hw/qdev-properties.h"
+     return gen_gvec_fpst_zz(s, fn, a->rd, a->rn, data,
--#include "cpregs.h"
+-                            a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+#include "internals.h"
++                            a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
 -#ifndef CONFIG_USER_ONLY
 -static uint64_t a57_a53_l2ctlr_read(CPUARMState *env, const ARMCPRegInfo *ri)
 -{
 -    ARMCPU *cpu = env_archcpu(env);
 -
 -    /* Number of cores is in [25:24]; otherwise we RAZ */
 -    return (cpu->core_count - 1) << 24;
 -}
 -#endif
 -
 -static const ARMCPRegInfo cortex_a72_a57_a53_cp_reginfo[] = {
 -#ifndef CONFIG_USER_ONLY
 -    { .name = "L2CTLR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 11, .crm = 0, .opc2 = 2,
 -      .access = PL1_RW, .readfn = a57_a53_l2ctlr_read,
 -      .writefn = arm_cp_write_ignore },
 -    { .name = "L2CTLR",
 -      .cp = 15, .opc1 = 1, .crn = 9, .crm = 0, .opc2 = 2,
 -      .access = PL1_RW, .readfn = a57_a53_l2ctlr_read,
 -      .writefn = arm_cp_write_ignore },
 -#endif
 -    { .name = "L2ECTLR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 11, .crm = 0, .opc2 = 3,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "L2ECTLR",
 -      .cp = 15, .opc1 = 1, .crn = 9, .crm = 0, .opc2 = 3,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "L2ACTLR", .state = ARM_CP_STATE_BOTH,
 -      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 0, .opc2 = 0,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CPUACTLR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 0,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CPUACTLR",
 -      .cp = 15, .opc1 = 0, .crm = 15,
 -      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
 -    { .name = "CPUECTLR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 1,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CPUECTLR",
 -      .cp = 15, .opc1 = 1, .crm = 15,
 -      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
 -    { .name = "CPUMERRSR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 2,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "CPUMERRSR",
 -      .cp = 15, .opc1 = 2, .crm = 15,
 -      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
 -    { .name = "L2MERRSR_EL1", .state = ARM_CP_STATE_AA64,
 -      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 3,
 -      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 -    { .name = "L2MERRSR",
 -      .cp = 15, .opc1 = 3, .crm = 15,
 -      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
 -};
 -
  static void aarch64_a57_initfn(Object *obj)
  {
      ARMCPU *cpu = ARM_CPU(obj);
@@ -XXX,XX +XXX,XX @@ static void aarch64_a57_initfn(Object *obj)
      cpu->gic_num_lrs = 4;
      cpu->gic_vpribits = 5;
      cpu->gic_vprebits = 5;
 -    define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
 +    define_cortex_a72_a57_a53_cp_reginfo(cpu);
  }
- static void aarch64_a53_initfn(Object *obj)
+ /* Invoke an out-of-line helper on 3 Zregs. */
-@@ -XXX,XX +XXX,XX @@ static void aarch64_a53_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zzz(DisasContext *s, gen_helper_gvec_3_ptr *fn,
-     cpu->gic_num_lrs = 4;
+                                   arg_rrr_esz *a, int data)
-     cpu->gic_vpribits = 5;
+ {
-     cpu->gic_vprebits = 5;
+     return gen_gvec_fpst_zzz(s, fn, a->rd, a->rn, a->rm, data,
--    define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
+-                             a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+    define_cortex_a72_a57_a53_cp_reginfo(cpu);
++                             a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
  }
- static void aarch64_a72_initfn(Object *obj)
+ /* Invoke an out-of-line helper on 4 Zregs. */
-@@ -XXX,XX +XXX,XX @@ static void aarch64_a72_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static bool gen_gvec_fpst_arg_zpzz(DisasContext *s, gen_helper_gvec_4_ptr *fn,
-     cpu->gic_num_lrs = 4;
+                                    arg_rprr_esz *a)
-     cpu->gic_vpribits = 5;
+ {
-     cpu->gic_vprebits = 5;
+     return gen_gvec_fpst_zzzp(s, fn, a->rd, a->rn, a->rm, a->pg, 0,
--    define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
+-                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+    define_cortex_a72_a57_a53_cp_reginfo(cpu);
++                              a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
  }
- void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
+ /* Invoke a vector expander on two Zregs and an immediate.  */
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
+@@ -XXX,XX +XXX,XX @@ static bool do_FMLA_zzxz(DisasContext *s, arg_rrxr_esz *a, bool sub)
-index XXXXXXX..XXXXXXX 100644
+     };
---- a/target/arm/cpu_tcg.c
+     return gen_gvec_fpst_zzzz(s, fns[a->esz], a->rd, a->rn, a->rm, a->ra,
-+++ b/target/arm/cpu_tcg.c
+                               (a->index << 1) | sub,
-@@ -XXX,XX +XXX,XX @@
+-                              a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
- #endif
++                              a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
- #include "cpregs.h"
+ }
-+#ifndef CONFIG_USER_ONLY
+ TRANS_FEAT(FMLA_zzxz, aa64_sve, do_FMLA_zzxz, a, false)
-+static uint64_t l2ctlr_read(CPUARMState *env, const ARMCPRegInfo *ri)
+@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const fmul_idx_fns[4] = {
-+{
+ };
-+    ARMCPU *cpu = env_archcpu(env);
+ TRANS_FEAT(FMUL_zzx, aa64_sve, gen_gvec_fpst_zzz,
-+
+            fmul_idx_fns[a->esz], a->rd, a->rn, a->rm, a->index,
-+    /* Number of cores is in [25:24]; otherwise we RAZ */
+-           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
-+    return (cpu->core_count - 1) << 24;
++           a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
-+}
-+
+ /*
-+static const ARMCPRegInfo cortex_a72_a57_a53_cp_reginfo[] = {
+  *** SVE Floating Point Fast Reduction Group
-+    { .name = "L2CTLR_EL1", .state = ARM_CP_STATE_AA64,
+@@ -XXX,XX +XXX,XX @@ static bool do_reduce(DisasContext *s, arg_rpr_esz *a,
-+      .opc0 = 3, .opc1 = 1, .crn = 11, .crm = 0, .opc2 = 2,
-+      .access = PL1_RW, .readfn = l2ctlr_read,
+     tcg_gen_addi_ptr(t_zn, tcg_env, vec_full_reg_offset(s, a->rn));
-+      .writefn = arm_cp_write_ignore },
+     tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, a->pg));
-+    { .name = "L2CTLR",
+-    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+      .cp = 15, .opc1 = 1, .crn = 9, .crm = 0, .opc2 = 2,
++    status = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
-+      .access = PL1_RW, .readfn = l2ctlr_read,
-+      .writefn = arm_cp_write_ignore },
+     fn(temp, t_zn, t_pg, status, t_desc);
-+    { .name = "L2ECTLR_EL1", .state = ARM_CP_STATE_AA64,
-+      .opc0 = 3, .opc1 = 1, .crn = 11, .crm = 0, .opc2 = 3,
+@@ -XXX,XX +XXX,XX @@ static bool do_ppz_fp(DisasContext *s, arg_rpr_esz *a,
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+     if (sve_access_check(s)) {
-+    { .name = "L2ECTLR",
+         unsigned vsz = vec_full_reg_size(s);
-+      .cp = 15, .opc1 = 1, .crn = 9, .crm = 0, .opc2 = 3,
+         TCGv_ptr status =
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+-            fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+    { .name = "L2ACTLR", .state = ARM_CP_STATE_BOTH,
++            fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
-+      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 0, .opc2 = 0,
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+         tcg_gen_gvec_3_ptr(pred_full_reg_offset(s, a->rd),
-+    { .name = "CPUACTLR_EL1", .state = ARM_CP_STATE_AA64,
+                            vec_full_reg_offset(s, a->rn),
-+      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 0,
+@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const ftmad_fns[4] = {
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+ };
-+    { .name = "CPUACTLR",
+ TRANS_FEAT_NONSTREAMING(FTMAD, aa64_sve, gen_gvec_fpst_zzz,
-+      .cp = 15, .opc1 = 0, .crm = 15,
+                         ftmad_fns[a->esz], a->rd, a->rn, a->rm, a->imm,
-+      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
+-                        a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
-+    { .name = "CPUECTLR_EL1", .state = ARM_CP_STATE_AA64,
++                        a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
-+      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 1,
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+ /*
-+    { .name = "CPUECTLR",
+  *** SVE Floating Point Accumulating Reduction Group
-+      .cp = 15, .opc1 = 1, .crm = 15,
+@@ -XXX,XX +XXX,XX @@ static bool trans_FADDA(DisasContext *s, arg_rprr_esz *a)
-+      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
+     t_pg = tcg_temp_new_ptr();
-+    { .name = "CPUMERRSR_EL1", .state = ARM_CP_STATE_AA64,
+     tcg_gen_addi_ptr(t_rm, tcg_env, vec_full_reg_offset(s, a->rm));
-+      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 2,
+     tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, a->pg));
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+-    t_fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
-+    { .name = "CPUMERRSR",
++    t_fpst = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
-+      .cp = 15, .opc1 = 2, .crm = 15,
+     t_desc = tcg_constant_i32(simd_desc(vsz, vsz, 0));
-+      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
-+    { .name = "L2MERRSR_EL1", .state = ARM_CP_STATE_AA64,
+     fns[a->esz - 1](t_val, t_val, t_rm, t_pg, t_fpst, t_desc);
-+      .opc0 = 3, .opc1 = 1, .crn = 15, .crm = 2, .opc2 = 3,
+@@ -XXX,XX +XXX,XX @@ static void do_fp_scalar(DisasContext *s, int zd, int zn, int pg, bool is_fp16,
-+      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
+     tcg_gen_addi_ptr(t_zn, tcg_env, vec_full_reg_offset(s, zn));
-+    { .name = "L2MERRSR",
+     tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, pg));
-+      .cp = 15, .opc1 = 3, .crm = 15,
-+      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
+-    status = fpstatus_ptr(is_fp16 ? FPST_FPCR_F16 : FPST_A64);
-+};
++    status = fpstatus_ptr(is_fp16 ? FPST_A64_F16 : FPST_A64);
-+
+     desc = tcg_constant_i32(simd_desc(vsz, vsz, 0));
-+void define_cortex_a72_a57_a53_cp_reginfo(ARMCPU *cpu)
+     fn(t_zd, t_zn, t_pg, scalar, status, desc);
-+{
+ }
-+    define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
+@@ -XXX,XX +XXX,XX @@ static bool do_fp_cmp(DisasContext *s, arg_rprr_esz *a,
-+}
+     }
-+#endif /* !CONFIG_USER_ONLY */
+     if (sve_access_check(s)) {
-+
+         unsigned vsz = vec_full_reg_size(s);
- /* CPU models. These are not needed for the AArch64 linux-user build. */
+-        TCGv_ptr status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
- #if !defined(CONFIG_USER_ONLY) || !defined(TARGET_AARCH64)
++        TCGv_ptr status = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
+         tcg_gen_gvec_4_ptr(pred_full_reg_offset(s, a->rd),
                             vec_full_reg_offset(s, a->rn),
                             vec_full_reg_offset(s, a->rm),
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_4_ptr * const fcadd_fns[] = {
  };
  TRANS_FEAT(FCADD, aa64_sve, gen_gvec_fpst_zzzp, fcadd_fns[a->esz],
             a->rd, a->rn, a->rm, a->pg, a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  #define DO_FMLA(NAME, name) \
      static gen_helper_gvec_5_ptr * const name##_fns[4] = {              \
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(FCADD, aa64_sve, gen_gvec_fpst_zzzp, fcadd_fns[a->esz],
      };                                                                  \
      TRANS_FEAT(NAME, aa64_sve, gen_gvec_fpst_zzzzp, name##_fns[a->esz], \
                 a->rd, a->rn, a->rm, a->ra, a->pg, 0,                    \
 -               a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +               a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  DO_FMLA(FMLA_zpzzz, fmla_zpzzz)
  DO_FMLA(FMLS_zpzzz, fmls_zpzzz)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_5_ptr * const fcmla_fns[4] = {
  };
  TRANS_FEAT(FCMLA_zpzzz, aa64_sve, gen_gvec_fpst_zzzzp, fcmla_fns[a->esz],
             a->rd, a->rn, a->rm, a->ra, a->pg, a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  static gen_helper_gvec_4_ptr * const fcmla_idx_fns[4] = {
      NULL, gen_helper_gvec_fcmlah_idx, gen_helper_gvec_fcmlas_idx, NULL
  };
  TRANS_FEAT(FCMLA_zzxz, aa64_sve, gen_gvec_fpst_zzzz, fcmla_idx_fns[a->esz],
             a->rd, a->rn, a->rm, a->ra, a->index * 4 + a->rot,
 -           a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  /*
   *** SVE Floating Point Unary Operations Predicated Group
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(FCVT_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvt_sd, a, 0, FPST_A64)
  TRANS_FEAT(FCVTZS_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_hh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzs_hh, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZU_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_hh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzu_hh, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZS_hs, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_hs, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzs_hs, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZU_hs, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_hs, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzu_hs, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZS_hd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzs_hd, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzs_hd, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZU_hd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvtzu_hd, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_fcvtzu_hd, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVTZS_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvtzs_ss, a, 0, FPST_A64)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frint_fns[] = {
      gen_helper_sve_frint_d
  };
  TRANS_FEAT(FRINTI, aa64_sve, gen_gvec_fpst_arg_zpz, frint_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a, 0, a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  static gen_helper_gvec_3_ptr * const frintx_fns[] = {
      NULL,
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frintx_fns[] = {
      gen_helper_sve_frintx_d
  };
  TRANS_FEAT(FRINTX, aa64_sve, gen_gvec_fpst_arg_zpz, frintx_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +           a, 0, a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
  static bool do_frint_mode(DisasContext *s, arg_rpr_esz *a,
                            ARMFPRounding mode, gen_helper_gvec_3_ptr *fn)
@@ -XXX,XX +XXX,XX @@ static bool do_frint_mode(DisasContext *s, arg_rpr_esz *a,
      }
      vsz = vec_full_reg_size(s);
 -    status = fpstatus_ptr(a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64);
 +    status = fpstatus_ptr(a->esz == MO_16 ? FPST_A64_F16 : FPST_A64);
      tmode = gen_set_rmode(mode, status);
      tcg_gen_gvec_3_ptr(vec_full_reg_offset(s, a->rd),
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const frecpx_fns[] = {
      gen_helper_sve_frecpx_s, gen_helper_sve_frecpx_d,
  };
  TRANS_FEAT(FRECPX, aa64_sve, gen_gvec_fpst_arg_zpz, frecpx_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a, 0, a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  static gen_helper_gvec_3_ptr * const fsqrt_fns[] = {
      NULL,                   gen_helper_sve_fsqrt_h,
      gen_helper_sve_fsqrt_s, gen_helper_sve_fsqrt_d,
  };
  TRANS_FEAT(FSQRT, aa64_sve, gen_gvec_fpst_arg_zpz, fsqrt_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a, 0, a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  TRANS_FEAT(SCVTF_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_hh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_scvt_hh, a, 0, FPST_A64_F16)
  TRANS_FEAT(SCVTF_sh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_sh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_scvt_sh, a, 0, FPST_A64_F16)
  TRANS_FEAT(SCVTF_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_scvt_dh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_scvt_dh, a, 0, FPST_A64_F16)
  TRANS_FEAT(SCVTF_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_scvt_ss, a, 0, FPST_A64)
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(SCVTF_dd, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_scvt_dd, a, 0, FPST_A64)
  TRANS_FEAT(UCVTF_hh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_hh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_ucvt_hh, a, 0, FPST_A64_F16)
  TRANS_FEAT(UCVTF_sh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_sh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_ucvt_sh, a, 0, FPST_A64_F16)
  TRANS_FEAT(UCVTF_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_ucvt_dh, a, 0, FPST_FPCR_F16)
 +           gen_helper_sve_ucvt_dh, a, 0, FPST_A64_F16)
  TRANS_FEAT(UCVTF_ss, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_ucvt_ss, a, 0, FPST_A64)
@@ -XXX,XX +XXX,XX @@ static gen_helper_gvec_3_ptr * const flogb_fns[] = {
      gen_helper_flogb_s, gen_helper_flogb_d
  };
  TRANS_FEAT(FLOGB, aa64_sve2, gen_gvec_fpst_arg_zpz, flogb_fns[a->esz],
 -           a, 0, a->esz == MO_16 ? FPST_FPCR_F16 : FPST_A64)
 +           a, 0, a->esz == MO_16 ? FPST_A64_F16 : FPST_A64)
  static bool do_FMLAL_zzzw(DisasContext *s, arg_rrrr_esz *a, bool sub, bool sel)
  {
 --
-.25.1
+.34.1

-[PULL 21/32] target/arm: Enable FEAT_CSV2_2 for -cpu max
+[PULL 30/36] target/arm: Remove now-unused vfp.fp_status_f16 and FPST_FPCR_F16
-From: Richard Henderson <richard.henderson@linaro.org>
+Now we have moved all the uses of vfp.fp_status_f16 and FPST_FPCR_F16
 to the new A32 or A64 fields, we can remove these.
-There is no branch prediction in TCG, therefore there is no
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-need to actually include the context number into the predictor.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Therefore all we need to do is add the state for SCXTNUM_ELx.
+Message-id: 20250124162836.2332150-19-peter.maydell@linaro.org
 ---
  target/arm/cpu.h           | 2 --
  target/arm/tcg/translate.h | 6 ------
  target/arm/cpu.c           | 1 -
  target/arm/vfp_helper.c    | 7 -------
 files changed, 16 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-21-richard.henderson@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- docs/system/arm/emulation.rst |  3 ++
- target/arm/cpu.h              | 16 +++++++++
- target/arm/cpu.c              |  5 +++
- target/arm/cpu64.c            |  3 +-
- target/arm/helper.c           | 61 ++++++++++++++++++++++++++++++++++-
-files changed, 86 insertions(+), 2 deletions(-)
-diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
-index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/emulation.rst
-+++ b/docs/system/arm/emulation.rst
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
- - FEAT_BF16 (AArch64 BFloat16 instructions)
- - FEAT_BTI (Branch Target Identification)
- - FEAT_CSV2 (Cache speculation variant 2)
-+- FEAT_CSV2_1p1 (Cache speculation variant 2, version 1.1)
-+- FEAT_CSV2_1p2 (Cache speculation variant 2, version 1.2)
-+- FEAT_CSV2_2 (Cache speculation variant 2, version 2)
- - FEAT_DIT (Data Independent Timing instructions)
- - FEAT_DPB (DC CVAP instruction)
- - FEAT_Debugv8p2 (Debug changes for v8.2)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
 @@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-         ARMPACKey apdb;
+          *
-         ARMPACKey apga;
+          *  fp_status_a32: is the "normal" fp status for AArch32 insns
-     } keys;
+          *  fp_status_a64: is the "normal" fp status for AArch64 insns
-+
+-         *  fp_status_fp16: used for half-precision calculations
-+    uint64_t scxtnum_el[4];
+          *  fp_status_fp16_a32: used for AArch32 half-precision calculations
- #endif
+          *  fp_status_fp16_a64: used for AArch64 half-precision calculations
+          *  standard_fp_status : the ARM "Standard FPSCR Value"
- #if defined(CONFIG_USER_ONLY)
+@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-@@ -XXX,XX +XXX,XX @@ void pmu_init(ARMCPU *cpu);
+          */
- #define SCTLR_WXN     (1U << 19)
+         float_status fp_status_a32;
- #define SCTLR_ST      (1U << 20) /* up to ??, RAZ in v6 */
+         float_status fp_status_a64;
- #define SCTLR_UWXN    (1U << 20) /* v7 onward, AArch32 only */
+-        float_status fp_status_f16;
-+#define SCTLR_TSCXT   (1U << 20) /* FEAT_CSV2_1p2, AArch64 only */
+         float_status fp_status_f16_a32;
- #define SCTLR_FI      (1U << 21) /* up to v7, v8 RES0 */
+         float_status fp_status_f16_a64;
- #define SCTLR_IESB    (1U << 21) /* v8.2-IESB, AArch64 only */
+         float_status standard_fp_status;
- #define SCTLR_U       (1U << 22) /* up to v6, RAO in v7 */
+diff --git a/target/arm/tcg/translate.h b/target/arm/tcg/translate.h
-@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa64_dit(const ARMISARegisters *id)
+index XXXXXXX..XXXXXXX 100644
-     return FIELD_EX64(id->id_aa64pfr0, ID_AA64PFR0, DIT) != 0;
+--- a/target/arm/tcg/translate.h
- }
++++ b/target/arm/tcg/translate.h
+@@ -XXX,XX +XXX,XX @@ static inline CPUARMTBFlags arm_tbflags_from_tb(const TranslationBlock *tb)
-+static inline bool isar_feature_aa64_scxtnum(const ARMISARegisters *id)
+ typedef enum ARMFPStatusFlavour {
-+{
+     FPST_A32,
-+    int key = FIELD_EX64(id->id_aa64pfr0, ID_AA64PFR0, CSV2);
+     FPST_A64,
-+    if (key >= 2) {
+-    FPST_FPCR_F16,
-+        return true;      /* FEAT_CSV2_2 */
+     FPST_A32_F16,
-+    }
+     FPST_A64_F16,
-+    if (key == 1) {
+     FPST_STD,
-+        key = FIELD_EX64(id->id_aa64pfr1, ID_AA64PFR1, CSV2_FRAC);
+@@ -XXX,XX +XXX,XX @@ typedef enum ARMFPStatusFlavour {
-+        return key >= 2;  /* FEAT_CSV2_1p2 */
+  *   for AArch32 non-FP16 operations controlled by the FPCR
-+    }
+  * FPST_A64
-+    return false;
+  *   for AArch64 non-FP16 operations controlled by the FPCR
-+}
+- * FPST_FPCR_F16
-+
+- *   for operations controlled by the FPCR where FPCR.FZ16 is to be used
- static inline bool isar_feature_aa64_ssbs(const ARMISARegisters *id)
+  * FPST_A32_F16
- {
+  *   for AArch32 operations controlled by the FPCR where FPCR.FZ16 is to be used
-     return FIELD_EX64(id->id_aa64pfr1, ID_AA64PFR1, SSBS) != 0;
+  * FPST_A64_F16
@@ -XXX,XX +XXX,XX @@ static inline TCGv_ptr fpstatus_ptr(ARMFPStatusFlavour flavour)
      case FPST_A64:
          offset = offsetof(CPUARMState, vfp.fp_status_a64);
          break;
 -    case FPST_FPCR_F16:
 -        offset = offsetof(CPUARMState, vfp.fp_status_f16);
 -        break;
      case FPST_A32_F16:
          offset = offsetof(CPUARMState, vfp.fp_status_f16_a32);
          break;
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset(DeviceState *dev)
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset_hold(Object *obj, ResetType type)
-              */
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_a32);
-             env->cp15.gcr_el1 = 0x1ffff;
+     arm_set_default_fp_behaviours(&env->vfp.fp_status_a64);
      arm_set_default_fp_behaviours(&env->vfp.standard_fp_status);
 -    arm_set_default_fp_behaviours(&env->vfp.fp_status_f16);
      arm_set_default_fp_behaviours(&env->vfp.fp_status_f16_a32);
      arm_set_default_fp_behaviours(&env->vfp.fp_status_f16_a64);
      arm_set_default_fp_behaviours(&env->vfp.standard_fp_status_f16);
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/vfp_helper.c
 +++ b/target/arm/vfp_helper.c
@@ -XXX,XX +XXX,XX @@ static uint32_t vfp_get_fpsr_from_host(CPUARMState *env)
      i |= get_float_exception_flags(&env->vfp.fp_status_a64);
      i |= get_float_exception_flags(&env->vfp.standard_fp_status);
      /* FZ16 does not generate an input denormal exception.  */
 -    i |= (get_float_exception_flags(&env->vfp.fp_status_f16)
 -          & ~float_flag_input_denormal);
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a32)
            & ~float_flag_input_denormal);
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a64)
@@ -XXX,XX +XXX,XX @@ static void vfp_clear_float_status_exc_flags(CPUARMState *env)
       */
      set_float_exception_flags(0, &env->vfp.fp_status_a32);
      set_float_exception_flags(0, &env->vfp.fp_status_a64);
 -    set_float_exception_flags(0, &env->vfp.fp_status_f16);
      set_float_exception_flags(0, &env->vfp.fp_status_f16_a32);
      set_float_exception_flags(0, &env->vfp.fp_status_f16_a64);
      set_float_exception_flags(0, &env->vfp.standard_fp_status);
@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
          }
-+        /*
+         set_float_rounding_mode(i, &env->vfp.fp_status_a32);
-+         * Disable access to SCXTNUM_EL0 from CSV2_1p2.
+         set_float_rounding_mode(i, &env->vfp.fp_status_a64);
-+         * This is not yet exposed from the Linux kernel in any way.
+-        set_float_rounding_mode(i, &env->vfp.fp_status_f16);
-+         */
+         set_float_rounding_mode(i, &env->vfp.fp_status_f16_a32);
-+        env->cp15.sctlr_el[1] |= SCTLR_TSCXT;
+         set_float_rounding_mode(i, &env->vfp.fp_status_f16_a64);
  #else
          /* Reset into the highest available EL */
          if (arm_feature(env, ARM_FEATURE_EL3)) {
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
      t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
      t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);      /* FEAT_SEL2 */
      t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);       /* FEAT_DIT */
 -    t = FIELD_DP64(t, ID_AA64PFR0, CSV2, 1);      /* FEAT_CSV2 */
 +    t = FIELD_DP64(t, ID_AA64PFR0, CSV2, 2);      /* FEAT_CSV2_2 */
      cpu->isar.id_aa64pfr0 = t;
      t = cpu->isar.id_aa64pfr1;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
       * we do for EL2 with the virtualization=on property.
       */
      t = FIELD_DP64(t, ID_AA64PFR1, MTE, 3);       /* FEAT_MTE3 */
 +    t = FIELD_DP64(t, ID_AA64PFR1, CSV2_FRAC, 0); /* FEAT_CSV2_2 */
      cpu->isar.id_aa64pfr1 = t;
      t = cpu->isar.id_aa64mmfr0;
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static void scr_write(CPUARMState *env, const ARMCPRegInfo *ri, uint64_t value)
          if (cpu_isar_feature(aa64_mte, cpu)) {
              valid_mask |= SCR_ATA;
          }
 +        if (cpu_isar_feature(aa64_scxtnum, cpu)) {
 +            valid_mask |= SCR_ENSCXT;
 +        }
      } else {
          valid_mask &= ~(SCR_RW | SCR_ST);
          if (cpu_isar_feature(aa32_ras, cpu)) {
@@ -XXX,XX +XXX,XX @@ static void do_hcr_write(CPUARMState *env, uint64_t value, uint64_t valid_mask)
          if (cpu_isar_feature(aa64_mte, cpu)) {
              valid_mask |= HCR_ATA | HCR_DCT | HCR_TID5;
          }
 +        if (cpu_isar_feature(aa64_scxtnum, cpu)) {
 +            valid_mask |= HCR_ENSCXT;
 +        }
      }
+     if (changed & FPCR_FZ16) {
-     /* Clear RES0 bits.  */
+         bool ftz_enabled = val & FPCR_FZ16;
-@@ -XXX,XX +XXX,XX @@ static void define_arm_vh_e2h_redirects_aliases(ARMCPU *cpu)
+-        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
-         { K(3, 0,  5, 6, 0), K(3, 4,  5, 6, 0), K(3, 5, 5, 6, 0),
+         set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a32);
-           "TFSR_EL1", "TFSR_EL2", "TFSR_EL12", isar_feature_aa64_mte },
+         set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a64);
+         set_flush_to_zero(ftz_enabled, &env->vfp.standard_fp_status_f16);
-+        { K(3, 0, 13, 0, 7), K(3, 4, 13, 0, 7), K(3, 5, 13, 0, 7),
+-        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
-+          "SCXTNUM_EL1", "SCXTNUM_EL2", "SCXTNUM_EL12",
+         set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a32);
-+          isar_feature_aa64_scxtnum },
+         set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16_a64);
-+
+         set_flush_inputs_to_zero(ftz_enabled, &env->vfp.standard_fp_status_f16);
-         /* TODO: ARMv8.2-SPE -- PMSCR_EL2 */
+@@ -XXX,XX +XXX,XX @@ static void vfp_set_fpcr_to_host(CPUARMState *env, uint32_t val, uint32_t mask)
-         /* TODO: ARMv8.4-Trace -- TRFCR_EL2 */
+         bool dnan_enabled = val & FPCR_DN;
-     };
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a32);
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo mte_el0_cacheop_reginfo[] = {
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_a64);
-     },
+-        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16);
- };
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16_a32);
+         set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16_a64);
 -#endif
 +static CPAccessResult access_scxtnum(CPUARMState *env, const ARMCPRegInfo *ri,
 +                                     bool isread)
 +{
 +    uint64_t hcr = arm_hcr_el2_eff(env);
 +    int el = arm_current_el(env);
 +
 +    if (el == 0 && !((hcr & HCR_E2H) && (hcr & HCR_TGE))) {
 +        if (env->cp15.sctlr_el[1] & SCTLR_TSCXT) {
 +            if (hcr & HCR_TGE) {
 +                return CP_ACCESS_TRAP_EL2;
 +            }
 +            return CP_ACCESS_TRAP;
 +        }
 +    } else if (el < 2 && (env->cp15.sctlr_el[2] & SCTLR_TSCXT)) {
 +        return CP_ACCESS_TRAP_EL2;
 +    }
 +    if (el < 2 && arm_is_el2_enabled(env) && !(hcr & HCR_ENSCXT)) {
 +        return CP_ACCESS_TRAP_EL2;
 +    }
 +    if (el < 3
 +        && arm_feature(env, ARM_FEATURE_EL3)
 +        && !(env->cp15.scr_el3 & SCR_ENSCXT)) {
 +        return CP_ACCESS_TRAP_EL3;
 +    }
 +    return CP_ACCESS_OK;
 +}
 +
 +static const ARMCPRegInfo scxtnum_reginfo[] = {
 +    { .name = "SCXTNUM_EL0", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 3, .crn = 13, .crm = 0, .opc2 = 7,
 +      .access = PL0_RW, .accessfn = access_scxtnum,
 +      .fieldoffset = offsetof(CPUARMState, scxtnum_el[0]) },
 +    { .name = "SCXTNUM_EL1", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 0, .crn = 13, .crm = 0, .opc2 = 7,
 +      .access = PL1_RW, .accessfn = access_scxtnum,
 +      .fieldoffset = offsetof(CPUARMState, scxtnum_el[1]) },
 +    { .name = "SCXTNUM_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 13, .crm = 0, .opc2 = 7,
 +      .access = PL2_RW, .accessfn = access_scxtnum,
 +      .fieldoffset = offsetof(CPUARMState, scxtnum_el[2]) },
 +    { .name = "SCXTNUM_EL3", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 6, .crn = 13, .crm = 0, .opc2 = 7,
 +      .access = PL3_RW,
 +      .fieldoffset = offsetof(CPUARMState, scxtnum_el[3]) },
 +};
 +#endif /* TARGET_AARCH64 */
  static CPAccessResult access_predinv(CPUARMState *env, const ARMCPRegInfo *ri,
                                       bool isread)
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
          define_arm_cp_regs(cpu, mte_tco_ro_reginfo);
          define_arm_cp_regs(cpu, mte_el0_cacheop_reginfo);
      }
-+
-+    if (cpu_isar_feature(aa64_scxtnum, cpu)) {
-+        define_arm_cp_regs(cpu, scxtnum_reginfo);
-+    }
- #endif
-     if (cpu_isar_feature(any_predinv, cpu)) {
 --
-.25.1
+.34.1

-[PULL 18/32] target/arm: Enable FEAT_RAS for -cpu max
+[PULL 31/36] fpu: Rename float_flag_input_denormal to float_flag_input_denormal_flushed
-From: Richard Henderson <richard.henderson@linaro.org>
+Our float_flag_input_denormal exception flag is set when the fpu code
+flushes an input denormal to zero.  This is what many guest
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+architectures (e.g. classic Arm behaviour) require, but it is not the
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+only denormal-related reason we might want to set an exception flag.
-Message-id: 20220506180242.216785-18-richard.henderson@linaro.org
+The x86 behaviour (which we do not currently model correctly) wants
 to see an exception flag when a denormal input is *not* flushed to
 zero and is actually used in an arithmetic operation. Arm's FEAT_AFP
 also wants these semantics.
 Rename float_flag_input_denormal to float_flag_input_denormal_flushed
 to make it clearer when it is set and to allow us to add a new
 float_flag_input_denormal_used next to it for the x86/FEAT_AFP
 semantics.
 Commit created with
  for f in `git grep -l float_flag_input_denormal`; do sed -i -e 's/float_flag_input_denormal/float_flag_input_denormal_flushed/' $f; done
 and manual editing of softfloat-types.h and softfloat.c to clean
 up the indentation afterwards and to fix a comment which wasn't
 using the full name of the flag.
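
The distinction the message above draws between the two input-denormal flags can be sketched as a toy model. This is not QEMU code: the toy_ names are invented for illustration, only the 0x0020 value comes from the patch, and the bit used for the planned "used" flag is an assumption, since the message only says such a flag will be added later.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Toy model, not QEMU code. 0x0020 matches the value in the patch;
 * the bit chosen for the "used" flag is purely an assumption.
 */
enum {
    toy_flag_input_denormal_flushed = 0x0020,
    toy_flag_input_denormal_used    = 0x0800, /* hypothetical value */
};

typedef struct {
    bool flush_inputs_to_zero;   /* mirrors the float_status field name */
    unsigned flags;              /* accumulated exception flags */
} toy_status;

/* A binary32 value is denormal when its exponent is 0 and its fraction is not. */
static bool toy_f32_is_denormal(uint32_t bits)
{
    return ((bits >> 23) & 0xffu) == 0 && (bits & 0x007fffffu) != 0;
}

/* Prepare one input operand and record which denormal case applied. */
static uint32_t toy_f32_prepare_input(uint32_t bits, toy_status *s)
{
    assert(s != NULL);
    if (!toy_f32_is_denormal(bits)) {
        return bits;
    }
    if (s->flush_inputs_to_zero) {
        /* Classic Arm style: the denormal is replaced by a signed zero. */
        s->flags |= toy_flag_input_denormal_flushed;
        return bits & 0x80000000u;
    }
    /* x86/FEAT_AFP style: the denormal is consumed as-is. */
    s->flags |= toy_flag_input_denormal_used;
    return bits;
}
```

With flush_inputs_to_zero set, a denormal operand comes back as a signed zero and only the "flushed" bit is raised. With it clear, the operand is returned unchanged and only the "used" bit is raised. The two conditions are mutually exclusive for any one operand, which is why a single flag name could not describe both.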
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-20-peter.maydell@linaro.org
 ---
- docs/system/arm/emulation.rst | 1 +
+ include/fpu/softfloat-types.h |  5 +++--
- target/arm/cpu64.c            | 1 +
+ fpu/softfloat.c               |  4 ++--
- target/arm/cpu_tcg.c          | 1 +
+ target/arm/tcg/sve_helper.c   |  6 +++---
-files changed, 3 insertions(+)
+ target/arm/vfp_helper.c       | 10 +++++-----
+ target/i386/tcg/fpu_helper.c  |  6 +++---
-diff --git a/docs/system/arm/emulation.rst b/docs/system/arm/emulation.rst
+ target/mips/tcg/msa_helper.c  |  2 +-
-index XXXXXXX..XXXXXXX 100644
+ target/rx/op_helper.c         |  2 +-
---- a/docs/system/arm/emulation.rst
+ fpu/softfloat-parts.c.inc     |  2 +-
-+++ b/docs/system/arm/emulation.rst
+files changed, 19 insertions(+), 18 deletions(-)
-@@ -XXX,XX +XXX,XX @@ the following architecture extensions:
- - FEAT_PMULL (PMULL, PMULL2 instructions)
+diff --git a/include/fpu/softfloat-types.h b/include/fpu/softfloat-types.h
- - FEAT_PMUv3p1 (PMU Extensions v3.1)
+index XXXXXXX..XXXXXXX 100644
- - FEAT_PMUv3p4 (PMU Extensions v3.4)
+--- a/include/fpu/softfloat-types.h
-+- FEAT_RAS (Reliability, availability, and serviceability)
++++ b/include/fpu/softfloat-types.h
- - FEAT_RDM (Advanced SIMD rounding double multiply accumulate instructions)
+@@ -XXX,XX +XXX,XX @@ enum {
- - FEAT_RNG (Random number generator)
+     float_flag_overflow        = 0x0004,
- - FEAT_SB (Speculation Barrier)
+     float_flag_underflow       = 0x0008,
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+     float_flag_inexact         = 0x0010,
-index XXXXXXX..XXXXXXX 100644
+-    float_flag_input_denormal  = 0x0020,
---- a/target/arm/cpu64.c
++    /* We flushed an input denormal to 0 (because of flush_inputs_to_zero) */
-+++ b/target/arm/cpu64.c
++    float_flag_input_denormal_flushed = 0x0020,
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+     float_flag_output_denormal = 0x0040,
-     t = cpu->isar.id_aa64pfr0;
+     float_flag_invalid_isi     = 0x0080,  /* inf - inf */
-     t = FIELD_DP64(t, ID_AA64PFR0, FP, 1);        /* FEAT_FP16 */
+     float_flag_invalid_imz     = 0x0100,  /* inf * 0 */
-     t = FIELD_DP64(t, ID_AA64PFR0, ADVSIMD, 1);   /* FEAT_FP16 */
+@@ -XXX,XX +XXX,XX @@ typedef struct float_status {
-+    t = FIELD_DP64(t, ID_AA64PFR0, RAS, 1);       /* FEAT_RAS */
+     bool tininess_before_rounding;
-     t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
+     /* should denormalised results go to zero and set the inexact flag? */
-     t = FIELD_DP64(t, ID_AA64PFR0, SEL2, 1);      /* FEAT_SEL2 */
+     bool flush_to_zero;
-     t = FIELD_DP64(t, ID_AA64PFR0, DIT, 1);       /* FEAT_DIT */
+-    /* should denormalised inputs go to zero and set the input_denormal flag? */
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
++    /* should denormalised inputs go to zero and set input_denormal_flushed? */
-index XXXXXXX..XXXXXXX 100644
+     bool flush_inputs_to_zero;
---- a/target/arm/cpu_tcg.c
+     bool default_nan_mode;
-+++ b/target/arm/cpu_tcg.c
+     /*
-@@ -XXX,XX +XXX,XX @@ void aa32_max_features(ARMCPU *cpu)
+diff --git a/fpu/softfloat.c b/fpu/softfloat.c
+index XXXXXXX..XXXXXXX 100644
-     t = cpu->isar.id_pfr0;
+--- a/fpu/softfloat.c
-     t = FIELD_DP32(t, ID_PFR0, DIT, 1);           /* FEAT_DIT */
++++ b/fpu/softfloat.c
-+    t = FIELD_DP32(t, ID_PFR0, RAS, 1);           /* FEAT_RAS */
+@@ -XXX,XX +XXX,XX @@ this code that are retained.
-     cpu->isar.id_pfr0 = t;
+         if (unlikely(soft_t ## _is_denormal(*a))) {                     \
+             *a = soft_t ## _set_sign(soft_t ## _zero,                   \
-     t = cpu->isar.id_pfr2;
+                                      soft_t ## _is_neg(*a));            \
 -            float_raise(float_flag_input_denormal, s);                  \
 +            float_raise(float_flag_input_denormal_flushed, s);          \
          }                                                               \
      }
@@ -XXX,XX +XXX,XX @@ float128 float128_silence_nan(float128 a, float_status *status)
  static bool parts_squash_denormal(FloatParts64 p, float_status *status)
  {
      if (p.exp == 0 && p.frac != 0) {
 -        float_raise(float_flag_input_denormal, status);
 +        float_raise(float_flag_input_denormal_flushed, status);
          return true;
      }
 diff --git a/target/arm/tcg/sve_helper.c b/target/arm/tcg/sve_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/tcg/sve_helper.c
 +++ b/target/arm/tcg/sve_helper.c
@@ -XXX,XX +XXX,XX @@ static int16_t do_float16_logb_as_int(float16 a, float_status *s)
                  return -15 - clz32(frac);
              }
              /* flush to zero */
 -            float_raise(float_flag_input_denormal, s);
 +            float_raise(float_flag_input_denormal_flushed, s);
          }
      } else if (unlikely(exp == 0x1f)) {
          if (frac == 0) {
@@ -XXX,XX +XXX,XX @@ static int32_t do_float32_logb_as_int(float32 a, float_status *s)
                  return -127 - clz32(frac);
              }
              /* flush to zero */
 -            float_raise(float_flag_input_denormal, s);
 +            float_raise(float_flag_input_denormal_flushed, s);
          }
      } else if (unlikely(exp == 0xff)) {
          if (frac == 0) {
@@ -XXX,XX +XXX,XX @@ static int64_t do_float64_logb_as_int(float64 a, float_status *s)
                  return -1023 - clz64(frac);
              }
              /* flush to zero */
 -            float_raise(float_flag_input_denormal, s);
 +            float_raise(float_flag_input_denormal_flushed, s);
          }
      } else if (unlikely(exp == 0x7ff)) {
          if (frac == 0) {
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/vfp_helper.c
 +++ b/target/arm/vfp_helper.c
@@ -XXX,XX +XXX,XX @@ static inline uint32_t vfp_exceptbits_from_host(int host_bits)
      if (host_bits & float_flag_inexact) {
          target_bits |= FPSR_IXC;
      }
 -    if (host_bits & float_flag_input_denormal) {
 +    if (host_bits & float_flag_input_denormal_flushed) {
          target_bits |= FPSR_IDC;
      }
      return target_bits;
@@ -XXX,XX +XXX,XX @@ static uint32_t vfp_get_fpsr_from_host(CPUARMState *env)
      i |= get_float_exception_flags(&env->vfp.standard_fp_status);
      /* FZ16 does not generate an input denormal exception.  */
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a32)
 -          & ~float_flag_input_denormal);
 +          & ~float_flag_input_denormal_flushed);
      i |= (get_float_exception_flags(&env->vfp.fp_status_f16_a64)
 -          & ~float_flag_input_denormal);
 +          & ~float_flag_input_denormal_flushed);
      i |= (get_float_exception_flags(&env->vfp.standard_fp_status_f16)
 -          & ~float_flag_input_denormal);
 +          & ~float_flag_input_denormal_flushed);
      return vfp_exceptbits_from_host(i);
  }
@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(fjcvtzs)(float64 value, float_status *status)
      /* Normal inexact, denormal with flush-to-zero, or overflow or NaN */
      inexact = e_new & (float_flag_inexact |
 -                       float_flag_input_denormal |
 +                       float_flag_input_denormal_flushed |
                         float_flag_invalid);
      /* While not inexact for IEEE FP, -0.0 is inexact for JavaScript. */
 diff --git a/target/i386/tcg/fpu_helper.c b/target/i386/tcg/fpu_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/i386/tcg/fpu_helper.c
 +++ b/target/i386/tcg/fpu_helper.c
@@ -XXX,XX +XXX,XX @@ static void merge_exception_flags(CPUX86State *env, uint8_t old_flags)
                         (new_flags & float_flag_overflow ? FPUS_OE : 0) |
                         (new_flags & float_flag_underflow ? FPUS_UE : 0) |
                         (new_flags & float_flag_inexact ? FPUS_PE : 0) |
 -                       (new_flags & float_flag_input_denormal ? FPUS_DE : 0)));
 +                       (new_flags & float_flag_input_denormal_flushed ? FPUS_DE : 0)));
  }
  static inline floatx80 helper_fdiv(CPUX86State *env, floatx80 a, floatx80 b)
@@ -XXX,XX +XXX,XX @@ void helper_fxtract(CPUX86State *env)
              int shift = clz64(temp.l.lower);
              temp.l.lower <<= shift;
              expdif = 1 - EXPBIAS - shift;
 -            float_raise(float_flag_input_denormal, &env->fp_status);
 +            float_raise(float_flag_input_denormal_flushed, &env->fp_status);
          } else {
              expdif = EXPD(temp) - EXPBIAS;
          }
@@ -XXX,XX +XXX,XX @@ void update_mxcsr_from_sse_status(CPUX86State *env)
      uint8_t flags = get_float_exception_flags(&env->sse_status);
      /*
       * The MXCSR denormal flag has opposite semantics to
 -     * float_flag_input_denormal (the softfloat code sets that flag
 +     * float_flag_input_denormal_flushed (the softfloat code sets that flag
       * only when flushing input denormals to zero, but SSE sets it
       * only when not flushing them to zero), so is not converted
       * here.
 diff --git a/target/mips/tcg/msa_helper.c b/target/mips/tcg/msa_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/mips/tcg/msa_helper.c
 +++ b/target/mips/tcg/msa_helper.c
@@ -XXX,XX +XXX,XX @@ static inline int update_msacsr(CPUMIPSState *env, int action, int denormal)
      enable = GET_FP_ENABLE(env->active_tc.msacsr) | FP_UNIMPLEMENTED;
      /* Set Inexact (I) when flushing inputs to zero */
 -    if ((ieee_exception_flags & float_flag_input_denormal) &&
 +    if ((ieee_exception_flags & float_flag_input_denormal_flushed) &&
              (env->active_tc.msacsr & MSACSR_FS_MASK) != 0) {
          if (action & CLEAR_IS_INEXACT) {
              mips_exception_flags &= ~FP_INEXACT;
 diff --git a/target/rx/op_helper.c b/target/rx/op_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/rx/op_helper.c
 +++ b/target/rx/op_helper.c
@@ -XXX,XX +XXX,XX @@ static void update_fpsw(CPURXState *env, float32 ret, uintptr_t retaddr)
          if (xcpt & float_flag_inexact) {
              SET_FPSW(X);
          }
 -        if ((xcpt & (float_flag_input_denormal
 +        if ((xcpt & (float_flag_input_denormal_flushed
                       | float_flag_output_denormal))
              && !FIELD_EX32(env->fpsw, FPSW, DN)) {
              env->fpsw = FIELD_DP32(env->fpsw, FPSW, CE, 1);
 diff --git a/fpu/softfloat-parts.c.inc b/fpu/softfloat-parts.c.inc
 index XXXXXXX..XXXXXXX 100644
 --- a/fpu/softfloat-parts.c.inc
 +++ b/fpu/softfloat-parts.c.inc
@@ -XXX,XX +XXX,XX @@ static void partsN(canonicalize)(FloatPartsN *p, float_status *status,
          if (likely(frac_eqz(p))) {
              p->cls = float_class_zero;
          } else if (status->flush_inputs_to_zero) {
 -            float_raise(float_flag_input_denormal, status);
 +            float_raise(float_flag_input_denormal_flushed, status);
              p->cls = float_class_zero;
              frac_clear(p);
          } else {
 --
-.25.1
+.34.1

-[PULL 24/32] target/arm: Define cortex-a76
+[PULL 32/36] fpu: Rename float_flag_output_denormal to float_flag_output_denormal_flushed
-From: Richard Henderson <richard.henderson@linaro.org>
+Our float_flag_output_denormal exception flag is set when
 the fpu code flushes an output denormal to zero. Rename
 it to float_flag_output_denormal_flushed:
  * this keeps it parallel with the flag for flushing
    input denormals, which we just renamed
  * it makes it clearer that it doesn't mean "set when
    the output is a denormal"
-Enable the a76 for virt and sbsa board use.
+Commit created with
  for f in `git grep -l float_flag_output_denormal`; do sed -i -e 's/float_flag_output_denormal/float_flag_output_denormal_flushed/' $f; done
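
As a rough illustration of how a target consumes the two renamed flags, the sketch below mirrors the shape of the vfp_exceptbits_from_host hunks in this series. It is a standalone toy, not the QEMU function: the toy_ and TOY_ names are invented here, the flag values are copied from the softfloat-types.h hunk, and the FPSR bit positions are those of the Arm architecture's cumulative exception bits.

```c
#include <stdint.h>

/* Host-side flag values as they appear in the softfloat-types.h hunk. */
enum {
    toy_flag_overflow                = 0x0004,
    toy_flag_underflow               = 0x0008,
    toy_flag_inexact                 = 0x0010,
    toy_flag_input_denormal_flushed  = 0x0020,
    toy_flag_output_denormal_flushed = 0x0040,
};

/* Arm FPSR cumulative exception bits (architectural bit positions). */
#define TOY_FPSR_OFC (1u << 2)
#define TOY_FPSR_UFC (1u << 3)
#define TOY_FPSR_IXC (1u << 4)
#define TOY_FPSR_IDC (1u << 7)

/* Translate accumulated host flags into guest-visible FPSR bits. */
static uint32_t toy_exceptbits_from_host(int host_bits)
{
    uint32_t target_bits = 0;

    if (host_bits & toy_flag_overflow) {
        target_bits |= TOY_FPSR_OFC;
    }
    /* A flushed output denormal is reported to the guest as underflow. */
    if (host_bits & (toy_flag_underflow | toy_flag_output_denormal_flushed)) {
        target_bits |= TOY_FPSR_UFC;
    }
    if (host_bits & toy_flag_inexact) {
        target_bits |= TOY_FPSR_IXC;
    }
    /* A flushed input denormal sets the input-denormal cumulative bit. */
    if (host_bits & toy_flag_input_denormal_flushed) {
        target_bits |= TOY_FPSR_IDC;
    }
    return target_bits;
}
```

Both renames leave the numeric values untouched, so a mapping of this shape behaves identically before and after the series; only the identifiers change.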
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-24-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-21-peter.maydell@linaro.org
 ---
- docs/system/arm/virt.rst |  1 +
+ include/fpu/softfloat-types.h | 3 ++-
- hw/arm/sbsa-ref.c        |  1 +
+ fpu/softfloat.c               | 2 +-
- hw/arm/virt.c            |  1 +
+ target/arm/vfp_helper.c       | 2 +-
- target/arm/cpu64.c       | 66 ++++++++++++++++++++++++++++++++++++++++
+ target/i386/tcg/fpu_helper.c  | 2 +-
-files changed, 69 insertions(+)
+ target/m68k/fpu_helper.c      | 2 +-
  target/mips/tcg/msa_helper.c  | 2 +-
  target/rx/op_helper.c         | 2 +-
  target/tricore/fpu_helper.c   | 6 +++---
  fpu/softfloat-parts.c.inc     | 2 +-
 files changed, 12 insertions(+), 11 deletions(-)
-diff --git a/docs/system/arm/virt.rst b/docs/system/arm/virt.rst
+diff --git a/include/fpu/softfloat-types.h b/include/fpu/softfloat-types.h
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/virt.rst
+--- a/include/fpu/softfloat-types.h
-+++ b/docs/system/arm/virt.rst
++++ b/include/fpu/softfloat-types.h
-@@ -XXX,XX +XXX,XX @@ Supported guest CPU types:
+@@ -XXX,XX +XXX,XX @@ enum {
- - ``cortex-a53`` (64-bit)
+     float_flag_inexact         = 0x0010,
- - ``cortex-a57`` (64-bit)
+     /* We flushed an input denormal to 0 (because of flush_inputs_to_zero) */
- - ``cortex-a72`` (64-bit)
+     float_flag_input_denormal_flushed = 0x0020,
-+- ``cortex-a76`` (64-bit)
+-    float_flag_output_denormal = 0x0040,
- - ``a64fx`` (64-bit)
++    /* We flushed an output denormal to 0 (because of flush_to_zero) */
- - ``host`` (with KVM only)
++    float_flag_output_denormal_flushed = 0x0040,
- - ``max`` (same as ``host`` for KVM; best possible emulation with TCG)
+     float_flag_invalid_isi     = 0x0080,  /* inf - inf */
-diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
+     float_flag_invalid_imz     = 0x0100,  /* inf * 0 */
      float_flag_invalid_idi     = 0x0200,  /* inf / inf */
 diff --git a/fpu/softfloat.c b/fpu/softfloat.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/sbsa-ref.c
+--- a/fpu/softfloat.c
-+++ b/hw/arm/sbsa-ref.c
++++ b/fpu/softfloat.c
-@@ -XXX,XX +XXX,XX @@ static const int sbsa_ref_irqmap[] = {
+@@ -XXX,XX +XXX,XX @@ floatx80 roundAndPackFloatx80(FloatX80RoundPrec roundingPrecision, bool zSign,
- static const char * const valid_cpus[] = {
+         }
-     ARM_CPU_TYPE_NAME("cortex-a57"),
+         if ( zExp <= 0 ) {
-     ARM_CPU_TYPE_NAME("cortex-a72"),
+             if (status->flush_to_zero) {
-+    ARM_CPU_TYPE_NAME("cortex-a76"),
+-                float_raise(float_flag_output_denormal, status);
-     ARM_CPU_TYPE_NAME("max"),
++                float_raise(float_flag_output_denormal_flushed, status);
- };
+                 return packFloatx80(zSign, 0, 0);
+             }
-diff --git a/hw/arm/virt.c b/hw/arm/virt.c
+             isTiny = status->tininess_before_rounding
 diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/virt.c
+--- a/target/arm/vfp_helper.c
-+++ b/hw/arm/virt.c
++++ b/target/arm/vfp_helper.c
-@@ -XXX,XX +XXX,XX @@ static const char *valid_cpus[] = {
+@@ -XXX,XX +XXX,XX @@ static inline uint32_t vfp_exceptbits_from_host(int host_bits)
-     ARM_CPU_TYPE_NAME("cortex-a53"),
+     if (host_bits & float_flag_overflow) {
-     ARM_CPU_TYPE_NAME("cortex-a57"),
+         target_bits |= FPSR_OFC;
-     ARM_CPU_TYPE_NAME("cortex-a72"),
+     }
-+    ARM_CPU_TYPE_NAME("cortex-a76"),
+-    if (host_bits & (float_flag_underflow | float_flag_output_denormal)) {
-     ARM_CPU_TYPE_NAME("a64fx"),
++    if (host_bits & (float_flag_underflow | float_flag_output_denormal_flushed)) {
-     ARM_CPU_TYPE_NAME("host"),
+         target_bits |= FPSR_UFC;
-     ARM_CPU_TYPE_NAME("max"),
+     }
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+     if (host_bits & float_flag_inexact) {
 diff --git a/target/i386/tcg/fpu_helper.c b/target/i386/tcg/fpu_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/i386/tcg/fpu_helper.c
-+++ b/target/arm/cpu64.c
++++ b/target/i386/tcg/fpu_helper.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_a72_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ void update_mxcsr_from_sse_status(CPUX86State *env)
-     define_cortex_a72_a57_a53_cp_reginfo(cpu);
+                    (flags & float_flag_overflow ? FPUS_OE : 0) |
                     (flags & float_flag_underflow ? FPUS_UE : 0) |
                     (flags & float_flag_inexact ? FPUS_PE : 0) |
 -                   (flags & float_flag_output_denormal ? FPUS_UE | FPUS_PE :
 +                   (flags & float_flag_output_denormal_flushed ? FPUS_UE | FPUS_PE :
 ));
  }
-+static void aarch64_a76_initfn(Object *obj)
+diff --git a/target/m68k/fpu_helper.c b/target/m68k/fpu_helper.c
-+{
+index XXXXXXX..XXXXXXX 100644
-+    ARMCPU *cpu = ARM_CPU(obj);
+--- a/target/m68k/fpu_helper.c
-+
++++ b/target/m68k/fpu_helper.c
-+    cpu->dtb_compatible = "arm,cortex-a76";
+@@ -XXX,XX +XXX,XX @@ static int cpu_m68k_exceptbits_from_host(int host_bits)
-+    set_feature(&cpu->env, ARM_FEATURE_V8);
+     if (host_bits & float_flag_overflow) {
-+    set_feature(&cpu->env, ARM_FEATURE_NEON);
+         target_bits |= 0x40;
-+    set_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER);
+     }
-+    set_feature(&cpu->env, ARM_FEATURE_AARCH64);
+-    if (host_bits & (float_flag_underflow | float_flag_output_denormal)) {
-+    set_feature(&cpu->env, ARM_FEATURE_CBAR_RO);
++    if (host_bits & (float_flag_underflow | float_flag_output_denormal_flushed)) {
-+    set_feature(&cpu->env, ARM_FEATURE_EL2);
+         target_bits |= 0x20;
-+    set_feature(&cpu->env, ARM_FEATURE_EL3);
+     }
-+    set_feature(&cpu->env, ARM_FEATURE_PMU);
+     if (host_bits & float_flag_divbyzero) {
-+
+diff --git a/target/mips/tcg/msa_helper.c b/target/mips/tcg/msa_helper.c
-+    /* Ordered by B2.4 AArch64 registers by functional group */
+index XXXXXXX..XXXXXXX 100644
-+    cpu->clidr = 0x82000023;
+--- a/target/mips/tcg/msa_helper.c
-+    cpu->ctr = 0x8444C004;
++++ b/target/mips/tcg/msa_helper.c
-+    cpu->dcz_blocksize = 4;
+@@ -XXX,XX +XXX,XX @@ static inline int update_msacsr(CPUMIPSState *env, int action, int denormal)
-+    cpu->isar.id_aa64dfr0  = 0x0000000010305408ull;
+     }
-+    cpu->isar.id_aa64isar0 = 0x0000100010211120ull;
-+    cpu->isar.id_aa64isar1 = 0x0000000000100001ull;
+     /* Set Inexact (I) and Underflow (U) when flushing outputs to zero */
-+    cpu->isar.id_aa64mmfr0 = 0x0000000000101122ull;
+-    if ((ieee_exception_flags & float_flag_output_denormal) &&
-+    cpu->isar.id_aa64mmfr1 = 0x0000000010212122ull;
++    if ((ieee_exception_flags & float_flag_output_denormal_flushed) &&
-+    cpu->isar.id_aa64mmfr2 = 0x0000000000001011ull;
+             (env->active_tc.msacsr & MSACSR_FS_MASK) != 0) {
-+    cpu->isar.id_aa64pfr0  = 0x1100000010111112ull; /* GIC filled in later */
+         mips_exception_flags |= FP_INEXACT;
-+    cpu->isar.id_aa64pfr1  = 0x0000000000000010ull;
+         if (action & CLEAR_FS_UNDERFLOW) {
-+    cpu->id_afr0       = 0x00000000;
+diff --git a/target/rx/op_helper.c b/target/rx/op_helper.c
-+    cpu->isar.id_dfr0  = 0x04010088;
+index XXXXXXX..XXXXXXX 100644
-+    cpu->isar.id_isar0 = 0x02101110;
+--- a/target/rx/op_helper.c
-+    cpu->isar.id_isar1 = 0x13112111;
++++ b/target/rx/op_helper.c
-+    cpu->isar.id_isar2 = 0x21232042;
+@@ -XXX,XX +XXX,XX @@ static void update_fpsw(CPURXState *env, float32 ret, uintptr_t retaddr)
-+    cpu->isar.id_isar3 = 0x01112131;
+             SET_FPSW(X);
-+    cpu->isar.id_isar4 = 0x00010142;
+         }
-+    cpu->isar.id_isar5 = 0x01011121;
+         if ((xcpt & (float_flag_input_denormal_flushed
-+    cpu->isar.id_isar6 = 0x00000010;
+-                     | float_flag_output_denormal))
-+    cpu->isar.id_mmfr0 = 0x10201105;
++                     | float_flag_output_denormal_flushed))
-+    cpu->isar.id_mmfr1 = 0x40000000;
+             && !FIELD_EX32(env->fpsw, FPSW, DN)) {
-+    cpu->isar.id_mmfr2 = 0x01260000;
+             env->fpsw = FIELD_DP32(env->fpsw, FPSW, CE, 1);
-+    cpu->isar.id_mmfr3 = 0x02122211;
+         }
-+    cpu->isar.id_mmfr4 = 0x00021110;
+diff --git a/target/tricore/fpu_helper.c b/target/tricore/fpu_helper.c
-+    cpu->isar.id_pfr0  = 0x10010131;
+index XXXXXXX..XXXXXXX 100644
-+    cpu->isar.id_pfr1  = 0x00010000; /* GIC filled in later */
+--- a/target/tricore/fpu_helper.c
-+    cpu->isar.id_pfr2  = 0x00000011;
++++ b/target/tricore/fpu_helper.c
-+    cpu->midr = 0x414fd0b1;          /* r4p1 */
+@@ -XXX,XX +XXX,XX @@ static inline uint8_t f_get_excp_flags(CPUTriCoreState *env)
-+    cpu->revidr = 0;
+            & (float_flag_invalid
-+
+               | float_flag_overflow
-+    /* From B2.18 CCSIDR_EL1 */
+               | float_flag_underflow
-+    cpu->ccsidr[0] = 0x701fe01a; /* 64KB L1 dcache */
+-              | float_flag_output_denormal
-+    cpu->ccsidr[1] = 0x201fe01a; /* 64KB L1 icache */
++              | float_flag_output_denormal_flushed
-+    cpu->ccsidr[2] = 0x707fe03a; /* 512KB L2 cache */
+               | float_flag_divbyzero
-+
+               | float_flag_inexact);
-+    /* From B2.93 SCTLR_EL3 */
+ }
-+    cpu->reset_sctlr = 0x30c50838;
+@@ -XXX,XX +XXX,XX @@ static void f_update_psw_flags(CPUTriCoreState *env, uint8_t flags)
-+
+         some_excp = 1;
-+    /* From B4.23 ICH_VTR_EL2 */
+     }
-+    cpu->gic_num_lrs = 4;
-+    cpu->gic_vpribits = 5;
+-    if (flags & float_flag_underflow || flags & float_flag_output_denormal) {
-+    cpu->gic_vprebits = 5;
++    if (flags & float_flag_underflow || flags & float_flag_output_denormal_flushed) {
-+
+         env->FPU_FU = 1 << 31;
-+    /* From B5.1 AdvSIMD AArch64 register summary */
+         some_excp = 1;
-+    cpu->isar.mvfr0 = 0x10110222;
+     }
-+    cpu->isar.mvfr1 = 0x13211111;
+@@ -XXX,XX +XXX,XX @@ static void f_update_psw_flags(CPUTriCoreState *env, uint8_t flags)
-+    cpu->isar.mvfr2 = 0x00000043;
+         some_excp = 1;
-+}
+     }
-+
- void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
+-    if (flags & float_flag_inexact || flags & float_flag_output_denormal) {
- {
++    if (flags & float_flag_inexact || flags & float_flag_output_denormal_flushed) {
-     /*
+         env->PSW |= 1 << 26;
-@@ -XXX,XX +XXX,XX @@ static const ARMCPUInfo aarch64_cpus[] = {
+         some_excp = 1;
-     { .name = "cortex-a57",         .initfn = aarch64_a57_initfn },
+     }
-     { .name = "cortex-a53",         .initfn = aarch64_a53_initfn },
+diff --git a/fpu/softfloat-parts.c.inc b/fpu/softfloat-parts.c.inc
-     { .name = "cortex-a72",         .initfn = aarch64_a72_initfn },
+index XXXXXXX..XXXXXXX 100644
-+    { .name = "cortex-a76",         .initfn = aarch64_a76_initfn },
+--- a/fpu/softfloat-parts.c.inc
-     { .name = "a64fx",              .initfn = aarch64_a64fx_initfn },
++++ b/fpu/softfloat-parts.c.inc
-     { .name = "max",                .initfn = aarch64_max_initfn },
+@@ -XXX,XX +XXX,XX @@ static void partsN(uncanon_normal)(FloatPartsN *p, float_status *s,
- #if defined(CONFIG_KVM) || defined(CONFIG_HVF)
+         }
          frac_shr(p, frac_shift);
      } else if (s->flush_to_zero) {
 -        flags |= float_flag_output_denormal;
 +        flags |= float_flag_output_denormal_flushed;
          p->cls = float_class_zero;
          exp = 0;
          frac_clear(p);
 --
-.25.1
+.34.1

-[PULL 09/32] target/arm: Split out aa32_max_features
+[PULL 33/36] fpu: Fix a comment in softfloat-types.h
-From: Richard Henderson <richard.henderson@linaro.org>
+In softfloat-types.h a comment documents that if the float_status
 field flush_to_zero is set then we flush denormalised results to 0
 and set the inexact flag.  This isn't correct: the status flag that
 we set when flush_to_zero causes us to flush an output to zero is
 float_flag_output_denormal_flushed.
-Share the code to set AArch32 max features so that we no
+Correct the comment.
 longer have code drift between qemu{-system,}-{arm,aarch64}.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-9-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-22-peter.maydell@linaro.org
 ---
- target/arm/internals.h |   2 +
+ include/fpu/softfloat-types.h | 2 +-
- target/arm/cpu64.c     |  50 +-----------------
+file changed, 1 insertion(+), 1 deletion(-)
  target/arm/cpu_tcg.c   | 114 ++++++++++++++++++++++-------------------
 files changed, 65 insertions(+), 101 deletions(-)
-diff --git a/target/arm/internals.h b/target/arm/internals.h
+diff --git a/include/fpu/softfloat-types.h b/include/fpu/softfloat-types.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/internals.h
+--- a/include/fpu/softfloat-types.h
-+++ b/target/arm/internals.h
++++ b/include/fpu/softfloat-types.h
-@@ -XXX,XX +XXX,XX @@ static inline void define_cortex_a72_a57_a53_cp_reginfo(ARMCPU *cpu) { }
+@@ -XXX,XX +XXX,XX @@ typedef struct float_status {
- void define_cortex_a72_a57_a53_cp_reginfo(ARMCPU *cpu);
+     Float3NaNPropRule float_3nan_prop_rule;
- #endif
+     FloatInfZeroNaNRule float_infzeronan_rule;
+     bool tininess_before_rounding;
-+void aa32_max_features(ARMCPU *cpu);
+-    /* should denormalised results go to zero and set the inexact flag? */
-+
++    /* should denormalised results go to zero and set output_denormal_flushed? */
- #endif
+     bool flush_to_zero;
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+     /* should denormalised inputs go to zero and set input_denormal_flushed? */
-index XXXXXXX..XXXXXXX 100644
+     bool flush_inputs_to_zero;
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
  {
      ARMCPU *cpu = ARM_CPU(obj);
      uint64_t t;
 -    uint32_t u;
      if (kvm_enabled() || hvf_enabled()) {
          /* With KVM or HVF, '-cpu max' is identical to '-cpu host' */
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
      t = FIELD_DP64(t, ID_AA64ZFR0, F64MM, 1);
      cpu->isar.id_aa64zfr0 = t;
 -    /* Replicate the same data to the 32-bit id registers.  */
 -    u = cpu->isar.id_isar5;
 -    u = FIELD_DP32(u, ID_ISAR5, AES, 2); /* AES + PMULL */
 -    u = FIELD_DP32(u, ID_ISAR5, SHA1, 1);
 -    u = FIELD_DP32(u, ID_ISAR5, SHA2, 1);
 -    u = FIELD_DP32(u, ID_ISAR5, CRC32, 1);
 -    u = FIELD_DP32(u, ID_ISAR5, RDM, 1);
 -    u = FIELD_DP32(u, ID_ISAR5, VCMA, 1);
 -    cpu->isar.id_isar5 = u;
 -
 -    u = cpu->isar.id_isar6;
 -    u = FIELD_DP32(u, ID_ISAR6, JSCVT, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, DP, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, FHM, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, SB, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, SPECRES, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, BF16, 1);
 -    u = FIELD_DP32(u, ID_ISAR6, I8MM, 1);
 -    cpu->isar.id_isar6 = u;
 -
 -    u = cpu->isar.id_pfr0;
 -    u = FIELD_DP32(u, ID_PFR0, DIT, 1);
 -    cpu->isar.id_pfr0 = u;
 -
 -    u = cpu->isar.id_pfr2;
 -    u = FIELD_DP32(u, ID_PFR2, SSBS, 1);
 -    cpu->isar.id_pfr2 = u;
 -
 -    u = cpu->isar.id_mmfr3;
 -    u = FIELD_DP32(u, ID_MMFR3, PAN, 2); /* ATS1E1 */
 -    cpu->isar.id_mmfr3 = u;
 -
 -    u = cpu->isar.id_mmfr4;
 -    u = FIELD_DP32(u, ID_MMFR4, HPDS, 1); /* AA32HPD */
 -    u = FIELD_DP32(u, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 -    u = FIELD_DP32(u, ID_MMFR4, CNP, 1); /* TTCNP */
 -    u = FIELD_DP32(u, ID_MMFR4, XNX, 1); /* TTS2UXN */
 -    cpu->isar.id_mmfr4 = u;
 -
      t = cpu->isar.id_aa64dfr0;
      t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
      cpu->isar.id_aa64dfr0 = t;
 -    u = cpu->isar.id_dfr0;
 -    u = FIELD_DP32(u, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
 -    cpu->isar.id_dfr0 = u;
 -
 -    u = cpu->isar.mvfr1;
 -    u = FIELD_DP32(u, MVFR1, FPHP, 3);      /* v8.2-FP16 */
 -    u = FIELD_DP32(u, MVFR1, SIMDHP, 2);    /* v8.2-FP16 */
 -    cpu->isar.mvfr1 = u;
 +    /* Replicate the same data to the 32-bit id registers.  */
 +    aa32_max_features(cpu);
  #ifdef CONFIG_USER_ONLY
      /*
 diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu_tcg.c
 +++ b/target/arm/cpu_tcg.c
@@ -XXX,XX +XXX,XX @@
  #endif
  #include "cpregs.h"
 +
 +/* Share AArch32 -cpu max features with AArch64. */
 +void aa32_max_features(ARMCPU *cpu)
 +{
 +    uint32_t t;
 +
 +    /* Add additional features supported by QEMU */
 +    t = cpu->isar.id_isar5;
 +    t = FIELD_DP32(t, ID_ISAR5, AES, 2);
 +    t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, CRC32, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, RDM, 1);
 +    t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);
 +    cpu->isar.id_isar5 = t;
 +
 +    t = cpu->isar.id_isar6;
 +    t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, DP, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, FHM, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, SB, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, BF16, 1);
 +    t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);
 +    cpu->isar.id_isar6 = t;
 +
 +    t = cpu->isar.mvfr1;
 +    t = FIELD_DP32(t, MVFR1, FPHP, 3);     /* v8.2-FP16 */
 +    t = FIELD_DP32(t, MVFR1, SIMDHP, 2);   /* v8.2-FP16 */
 +    cpu->isar.mvfr1 = t;
 +
 +    t = cpu->isar.mvfr2;
 +    t = FIELD_DP32(t, MVFR2, SIMDMISC, 3); /* SIMD MaxNum */
 +    t = FIELD_DP32(t, MVFR2, FPMISC, 4);   /* FP MaxNum */
 +    cpu->isar.mvfr2 = t;
 +
 +    t = cpu->isar.id_mmfr3;
 +    t = FIELD_DP32(t, ID_MMFR3, PAN, 2); /* ATS1E1 */
 +    cpu->isar.id_mmfr3 = t;
 +
 +    t = cpu->isar.id_mmfr4;
 +    t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
 +    t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 +    t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
 +    t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
 +    cpu->isar.id_mmfr4 = t;
 +
 +    t = cpu->isar.id_pfr0;
 +    t = FIELD_DP32(t, ID_PFR0, DIT, 1);
 +    cpu->isar.id_pfr0 = t;
 +
 +    t = cpu->isar.id_pfr2;
 +    t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
 +    cpu->isar.id_pfr2 = t;
 +
 +    t = cpu->isar.id_dfr0;
 +    t = FIELD_DP32(t, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
 +    cpu->isar.id_dfr0 = t;
 +}
 +
  #ifndef CONFIG_USER_ONLY
  static uint64_t l2ctlr_read(CPUARMState *env, const ARMCPRegInfo *ri)
  {
@@ -XXX,XX +XXX,XX @@ static void arm_v7m_class_init(ObjectClass *oc, void *data)
  static void arm_max_initfn(Object *obj)
  {
      ARMCPU *cpu = ARM_CPU(obj);
 -    uint32_t t;
      /* aarch64_a57_initfn, advertising none of the aarch64 features */
      cpu->dtb_compatible = "arm,cortex-a57";
@@ -XXX,XX +XXX,XX @@ static void arm_max_initfn(Object *obj)
      cpu->ccsidr[2] = 0x70ffe07a; /* 2048KB L2 cache */
      define_cortex_a72_a57_a53_cp_reginfo(cpu);
 -    /* Add additional features supported by QEMU */
 -    t = cpu->isar.id_isar5;
 -    t = FIELD_DP32(t, ID_ISAR5, AES, 2);
 -    t = FIELD_DP32(t, ID_ISAR5, SHA1, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, SHA2, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, CRC32, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, RDM, 1);
 -    t = FIELD_DP32(t, ID_ISAR5, VCMA, 1);
 -    cpu->isar.id_isar5 = t;
 -
 -    t = cpu->isar.id_isar6;
 -    t = FIELD_DP32(t, ID_ISAR6, JSCVT, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, DP, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, FHM, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, SB, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, SPECRES, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, BF16, 1);
 -    t = FIELD_DP32(t, ID_ISAR6, I8MM, 1);
 -    cpu->isar.id_isar6 = t;
 -
 -    t = cpu->isar.mvfr1;
 -    t = FIELD_DP32(t, MVFR1, FPHP, 3);     /* v8.2-FP16 */
 -    t = FIELD_DP32(t, MVFR1, SIMDHP, 2);   /* v8.2-FP16 */
 -    cpu->isar.mvfr1 = t;
 -
 -    t = cpu->isar.mvfr2;
 -    t = FIELD_DP32(t, MVFR2, SIMDMISC, 3); /* SIMD MaxNum */
 -    t = FIELD_DP32(t, MVFR2, FPMISC, 4);   /* FP MaxNum */
 -    cpu->isar.mvfr2 = t;
 -
 -    t = cpu->isar.id_mmfr3;
 -    t = FIELD_DP32(t, ID_MMFR3, PAN, 2); /* ATS1E1 */
 -    cpu->isar.id_mmfr3 = t;
 -
 -    t = cpu->isar.id_mmfr4;
 -    t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
 -    t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
 -    t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
 -    t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
 -    cpu->isar.id_mmfr4 = t;
 -
 -    t = cpu->isar.id_pfr0;
 -    t = FIELD_DP32(t, ID_PFR0, DIT, 1);
 -    cpu->isar.id_pfr0 = t;
 -
 -    t = cpu->isar.id_pfr2;
 -    t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
 -    cpu->isar.id_pfr2 = t;
 -
 -    t = cpu->isar.id_dfr0;
 -    t = FIELD_DP32(t, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
 -    cpu->isar.id_dfr0 = t;
 +    aa32_max_features(cpu);
  #ifdef CONFIG_USER_ONLY
      /*
 --
-.25.1
+.34.1
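
The aa32_max_features() split above moves a run of ID-register field updates into one function that both the 32-bit and the 64-bit "max" CPU init paths call. Below is a minimal sketch of that pattern, not the QEMU code: demo_deposit32(), DemoIsar, the init functions, the baseline register value, and the field positions are all hypothetical stand-ins chosen for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: write `field` into value[shift +: length]. */
static uint32_t demo_deposit32(uint32_t value, unsigned shift,
                               unsigned length, uint32_t field)
{
    assert(length > 0 && length < 32 && shift + length <= 32);
    uint32_t mask = ((1u << length) - 1u) << shift;
    return (value & ~mask) | ((field << shift) & mask);
}

/* Hypothetical cut-down ID register set. */
typedef struct {
    uint32_t id_dfr0;
    uint32_t mvfr1;
} DemoIsar;

/* Field positions are assumptions made for this sketch. */
#define DEMO_PERFMON_SHIFT 24
#define DEMO_FPHP_SHIFT    24
#define DEMO_SIMDHP_SHIFT  20
#define DEMO_FIELD_LEN     4

/* The shared part: every "max" flavour applies the same field updates. */
static void demo_aa32_max_features(DemoIsar *isar)
{
    isar->id_dfr0 = demo_deposit32(isar->id_dfr0, DEMO_PERFMON_SHIFT,
                                   DEMO_FIELD_LEN, 5);
    isar->mvfr1 = demo_deposit32(isar->mvfr1, DEMO_FPHP_SHIFT,
                                 DEMO_FIELD_LEN, 3);
    isar->mvfr1 = demo_deposit32(isar->mvfr1, DEMO_SIMDHP_SHIFT,
                                 DEMO_FIELD_LEN, 2);
}

/* Two hypothetical init paths that start from the same arbitrary baseline. */
static void demo_arm_max_init(DemoIsar *isar)
{
    isar->id_dfr0 = 0x03010066u; /* arbitrary baseline for the sketch */
    isar->mvfr1 = 0;
    demo_aa32_max_features(isar);
}

static void demo_aarch64_max_init(DemoIsar *isar)
{
    isar->id_dfr0 = 0x03010066u; /* same arbitrary baseline */
    isar->mvfr1 = 0;
    /* ... 64-bit-only registers would be set up here ... */
    demo_aa32_max_features(isar);
}
```

Because both init paths go through one function, a field added later (as ID_DFR0.PerfMon was) cannot be set in one emulator binary and forgotten in the other.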

-[PULL 17/32] target/arm: Implement ESB instruction
+[PULL 34/36] target/arm: Remove redundant advsimd float16 helpers
-From: Richard Henderson <richard.henderson@linaro.org>
+The advsimd_addh etc helpers defined in helper-a64.c are identical to
 the vfp_addh etc helpers defined in helper-vfp.c: both take two
 float16 inputs (in a uint32_t type) plus a float_status* and are
 simple wrappers around the softfloat float16_* functions.
-Check for and defer any pending virtual SError.
+(The duplication seems to be a historical accident: we added the
 advsimd helpers in 2018 as part of the A64 implementation, and at
 that time there was no f16 emulation in A32.  Then later we added the
 A32 f16 handling by extending the existing VFP helper macros to
 generate f16 versions as well as f32 and f64, and didn't realise we
 could clean things up.)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Remove the now-unnecessary advsimd helpers and make the places that
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+generated calls to them use the vfp helpers instead. Many of the
-Message-id: 20220506180242.216785-17-richard.henderson@linaro.org
+helper functions were already unused.
 (The remaining advsimd_ helpers are those which don't have vfp
 versions.)
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-26-peter.maydell@linaro.org
 ---
- target/arm/helper.h        |  1 +
+ target/arm/tcg/helper-a64.h    |  8 --------
- target/arm/a32.decode      | 16 ++++++++------
+ target/arm/tcg/helper-a64.c    |  9 ---------
- target/arm/t32.decode      | 18 ++++++++--------
+ target/arm/tcg/translate-a64.c | 16 ++++++++--------
- target/arm/op_helper.c     | 43 ++++++++++++++++++++++++++++++++++++++
+files changed, 8 insertions(+), 25 deletions(-)
  target/arm/translate-a64.c | 17 +++++++++++++++
  target/arm/translate.c     | 23 ++++++++++++++++++++
 files changed, 103 insertions(+), 15 deletions(-)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
+diff --git a/target/arm/tcg/helper-a64.h b/target/arm/tcg/helper-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
+--- a/target/arm/tcg/helper-a64.h
-+++ b/target/arm/helper.h
++++ b/target/arm/tcg/helper-a64.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_1(wfe, void, env)
+@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_2(frecpx_f16, TCG_CALL_NO_RWG, f16, f16, fpst)
- DEF_HELPER_1(yield, void, env)
+ DEF_HELPER_FLAGS_2(fcvtx_f64_to_f32, TCG_CALL_NO_RWG, f32, f64, fpst)
- DEF_HELPER_1(pre_hvc, void, env)
+ DEF_HELPER_FLAGS_3(crc32_64, TCG_CALL_NO_RWG_SE, i64, i64, i64, i32)
- DEF_HELPER_2(pre_smc, void, env, i32)
+ DEF_HELPER_FLAGS_3(crc32c_64, TCG_CALL_NO_RWG_SE, i64, i64, i64, i32)
-+DEF_HELPER_1(vesb, void, env)
+-DEF_HELPER_FLAGS_3(advsimd_maxh, TCG_CALL_NO_RWG, f16, f16, f16, fpst)
+-DEF_HELPER_FLAGS_3(advsimd_minh, TCG_CALL_NO_RWG, f16, f16, f16, fpst)
- DEF_HELPER_3(cpsr_write, void, env, i32, i32)
+-DEF_HELPER_FLAGS_3(advsimd_maxnumh, TCG_CALL_NO_RWG, f16, f16, f16, fpst)
- DEF_HELPER_2(cpsr_write_eret, void, env, i32)
+-DEF_HELPER_FLAGS_3(advsimd_minnumh, TCG_CALL_NO_RWG, f16, f16, f16, fpst)
-diff --git a/target/arm/a32.decode b/target/arm/a32.decode
+-DEF_HELPER_3(advsimd_addh, f16, f16, f16, fpst)
 -DEF_HELPER_3(advsimd_subh, f16, f16, f16, fpst)
 -DEF_HELPER_3(advsimd_mulh, f16, f16, f16, fpst)
 -DEF_HELPER_3(advsimd_divh, f16, f16, f16, fpst)
  DEF_HELPER_3(advsimd_ceq_f16, i32, f16, f16, fpst)
  DEF_HELPER_3(advsimd_cge_f16, i32, f16, f16, fpst)
  DEF_HELPER_3(advsimd_cgt_f16, i32, f16, f16, fpst)
 diff --git a/target/arm/tcg/helper-a64.c b/target/arm/tcg/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/a32.decode
+--- a/target/arm/tcg/helper-a64.c
-+++ b/target/arm/a32.decode
++++ b/target/arm/tcg/helper-a64.c
-@@ -XXX,XX +XXX,XX @@ SMULTT           .... 0001 0110 .... 0000 .... 1110 ....      @rd0mn
+@@ -XXX,XX +XXX,XX @@ uint32_t ADVSIMD_HELPER(name, h)(uint32_t a, uint32_t b, float_status *fpst) \
+     return float16_ ## name(a, b, fpst);    \
- {
+ }
-   {
--    YIELD        ---- 0011 0010 0000 1111 ---- 0000 0001
+-ADVSIMD_HALFOP(add)
--    WFE          ---- 0011 0010 0000 1111 ---- 0000 0010
+-ADVSIMD_HALFOP(sub)
--    WFI          ---- 0011 0010 0000 1111 ---- 0000 0011
+-ADVSIMD_HALFOP(mul)
-+    [
+-ADVSIMD_HALFOP(div)
-+      YIELD      ---- 0011 0010 0000 1111 ---- 0000 0001
+-ADVSIMD_HALFOP(min)
-+      WFE        ---- 0011 0010 0000 1111 ---- 0000 0010
+-ADVSIMD_HALFOP(max)
-+      WFI        ---- 0011 0010 0000 1111 ---- 0000 0011
+-ADVSIMD_HALFOP(minnum)
+-ADVSIMD_HALFOP(maxnum)
--    # TODO: Implement SEV, SEVL; may help SMP performance.
+-
--    # SEV        ---- 0011 0010 0000 1111 ---- 0000 0100
+ #define ADVSIMD_TWOHALFOP(name)                                         \
--    # SEVL       ---- 0011 0010 0000 1111 ---- 0000 0101
+ uint32_t ADVSIMD_HELPER(name, 2h)(uint32_t two_a, uint32_t two_b,       \
-+      # TODO: Implement SEV, SEVL; may help SMP performance.
+                                   float_status *fpst)                   \
-+      # SEV      ---- 0011 0010 0000 1111 ---- 0000 0100
+diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
 +      # SEVL     ---- 0011 0010 0000 1111 ---- 0000 0101
 +
 +      ESB        ---- 0011 0010 0000 1111 ---- 0001 0000
 +    ]
      # The canonical nop ends in 00000000, but the whole of the
      # rest of the space executes as nop if otherwise unsupported.
 diff --git a/target/arm/t32.decode b/target/arm/t32.decode
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/t32.decode
+--- a/target/arm/tcg/translate-a64.c
-+++ b/target/arm/t32.decode
++++ b/target/arm/tcg/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ CLZ              1111 1010 1011 ---- 1111 .... 1000 ....      @rdm
+@@ -XXX,XX +XXX,XX @@ static const FPScalar f_scalar_fmul = {
-   [
+ TRANS(FMUL_s, do_fp3_scalar, a, &f_scalar_fmul)
-     # Hints, and CPS
-     {
+ static const FPScalar f_scalar_fmax = {
--      YIELD      1111 0011 1010 1111 1000 0000 0000 0001
+-    gen_helper_advsimd_maxh,
--      WFE        1111 0011 1010 1111 1000 0000 0000 0010
++    gen_helper_vfp_maxh,
--      WFI        1111 0011 1010 1111 1000 0000 0000 0011
+     gen_helper_vfp_maxs,
-+      [
+     gen_helper_vfp_maxd,
-+        YIELD    1111 0011 1010 1111 1000 0000 0000 0001
+ };
-+        WFE      1111 0011 1010 1111 1000 0000 0000 0010
+ TRANS(FMAX_s, do_fp3_scalar, a, &f_scalar_fmax)
-+        WFI      1111 0011 1010 1111 1000 0000 0000 0011
+ static const FPScalar f_scalar_fmin = {
--      # TODO: Implement SEV, SEVL; may help SMP performance.
+-    gen_helper_advsimd_minh,
--      # SEV      1111 0011 1010 1111 1000 0000 0000 0100
++    gen_helper_vfp_minh,
--      # SEVL     1111 0011 1010 1111 1000 0000 0000 0101
+     gen_helper_vfp_mins,
-+        # TODO: Implement SEV, SEVL; may help SMP performance.
+     gen_helper_vfp_mind,
-+        # SEV    1111 0011 1010 1111 1000 0000 0000 0100
+ };
-+        # SEVL   1111 0011 1010 1111 1000 0000 0000 0101
+ TRANS(FMIN_s, do_fp3_scalar, a, &f_scalar_fmin)
--      # For M-profile minimal-RAS ESB can be a NOP, which is the
+ static const FPScalar f_scalar_fmaxnm = {
--      # default behaviour since it is in the hint space.
+-    gen_helper_advsimd_maxnumh,
--      # ESB      1111 0011 1010 1111 1000 0000 0001 0000
++    gen_helper_vfp_maxnumh,
-+        ESB      1111 0011 1010 1111 1000 0000 0001 0000
+     gen_helper_vfp_maxnums,
-+      ]
+     gen_helper_vfp_maxnumd,
+ };
-       # The canonical nop ends in 0000 0000, but the whole rest
+ TRANS(FMAXNM_s, do_fp3_scalar, a, &f_scalar_fmaxnm)
-       # of the space is "reserved hint, behaves as nop".
-diff --git a/target/arm/op_helper.c b/target/arm/op_helper.c
+ static const FPScalar f_scalar_fminnm = {
-index XXXXXXX..XXXXXXX 100644
+-    gen_helper_advsimd_minnumh,
---- a/target/arm/op_helper.c
++    gen_helper_vfp_minnumh,
-+++ b/target/arm/op_helper.c
+     gen_helper_vfp_minnums,
-@@ -XXX,XX +XXX,XX @@ void HELPER(probe_access)(CPUARMState *env, target_ulong ptr,
+     gen_helper_vfp_minnumd,
-                      access_type, mmu_idx, ra);
+ };
-     }
+@@ -XXX,XX +XXX,XX @@ static bool do_fp_reduction(DisasContext *s, arg_qrr_e *a,
  }
 +
 +/*
 + * This function corresponds to AArch64.vESBOperation().
 + * Note that the AArch32 version is not functionally different.
 + */
 +void HELPER(vesb)(CPUARMState *env)
 +{
 +    /*
 +     * The EL2Enabled() check is done inside arm_hcr_el2_eff,
 +     * and will return HCR_EL2.VSE == 0, so nothing happens.
 +     */
 +    uint64_t hcr = arm_hcr_el2_eff(env);
 +    bool enabled = !(hcr & HCR_TGE) && (hcr & HCR_AMO);
 +    bool pending = enabled && (hcr & HCR_VSE);
 +    bool masked  = (env->daif & PSTATE_A);
 +
 +    /* If VSE pending and masked, defer the exception.  */
 +    if (pending && masked) {
 +        uint32_t syndrome;
 +
 +        if (arm_el_is_aa64(env, 1)) {
 +            /* Copy across IDS and ISS from VSESR. */
 +            syndrome = env->cp15.vsesr_el2 & 0x1ffffff;
 +        } else {
 +            ARMMMUFaultInfo fi = { .type = ARMFault_AsyncExternal };
 +
 +            if (extended_addresses_enabled(env)) {
 +                syndrome = arm_fi_to_lfsc(&fi);
 +            } else {
 +                syndrome = arm_fi_to_sfsc(&fi);
 +            }
 +            /* Copy across AET and ExT from VSESR. */
 +            syndrome |= env->cp15.vsesr_el2 & 0xd000;
 +        }
 +
 +        /* Set VDISR_EL2.A along with the syndrome. */
 +        env->cp15.vdisr_el2 = syndrome | (1u << 31);
 +
 +        /* Clear pending virtual SError */
 +        env->cp15.hcr_el2 &= ~HCR_VSE;
 +        cpu_reset_interrupt(env_cpu(env), CPU_INTERRUPT_VSERR);
 +    }
 +}
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void handle_hint(DisasContext *s, uint32_t insn,
              gen_helper_autib(cpu_X[17], cpu_env, cpu_X[17], cpu_X[16]);
          }
          break;
 +    case 0b10000: /* ESB */
 +        /* Without RAS, we must implement this as NOP. */
 +        if (dc_isar_feature(aa64_ras, s)) {
 +            /*
 +             * QEMU does not have a source of physical SErrors,
 +             * so we are only concerned with virtual SErrors.
 +             * The pseudocode in the ARM for this case is
 +             *   if PSTATE.EL IN {EL0, EL1} && EL2Enabled() then
 +             *      AArch64.vESBOperation();
 +             * Most of the condition can be evaluated at translation time.
 +             * Test for EL2 present, and defer test for SEL2 to runtime.
 +             */
 +            if (s->current_el <= 1 && arm_dc_feature(s, ARM_FEATURE_EL2)) {
 +                gen_helper_vesb(cpu_env);
 +            }
 +        }
 +        break;
      case 0b11000: /* PACIAZ */
          if (s->pauth_active) {
              gen_helper_pacia(cpu_X[30], cpu_env, cpu_X[30],
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static bool trans_WFI(DisasContext *s, arg_WFI *a)
      return true;
  }
-+static bool trans_ESB(DisasContext *s, arg_ESB *a)
+-TRANS_FEAT(FMAXNMV_h, aa64_fp16, do_fp_reduction, a, gen_helper_advsimd_maxnumh)
-+{
+-TRANS_FEAT(FMINNMV_h, aa64_fp16, do_fp_reduction, a, gen_helper_advsimd_minnumh)
-+    /*
+-TRANS_FEAT(FMAXV_h, aa64_fp16, do_fp_reduction, a, gen_helper_advsimd_maxh)
-+     * For M-profile, minimal-RAS ESB can be a NOP.
+-TRANS_FEAT(FMINV_h, aa64_fp16, do_fp_reduction, a, gen_helper_advsimd_minh)
-+     * Without RAS, we must implement this as NOP.
++TRANS_FEAT(FMAXNMV_h, aa64_fp16, do_fp_reduction, a, gen_helper_vfp_maxnumh)
-+     */
++TRANS_FEAT(FMINNMV_h, aa64_fp16, do_fp_reduction, a, gen_helper_vfp_minnumh)
-+    if (!arm_dc_feature(s, ARM_FEATURE_M) && dc_isar_feature(aa32_ras, s)) {
++TRANS_FEAT(FMAXV_h, aa64_fp16, do_fp_reduction, a, gen_helper_vfp_maxh)
-+        /*
++TRANS_FEAT(FMINV_h, aa64_fp16, do_fp_reduction, a, gen_helper_vfp_minh)
-+         * QEMU does not have a source of physical SErrors,
-+         * so we are only concerned with virtual SErrors.
+ TRANS(FMAXNMV_s, do_fp_reduction, a, gen_helper_vfp_maxnums)
-+         * The pseudocode in the ARM for this case is
+ TRANS(FMINNMV_s, do_fp_reduction, a, gen_helper_vfp_minnums)
 +         *   if PSTATE.EL IN {EL0, EL1} && EL2Enabled() then
 +         *      AArch32.vESBOperation();
 +         * Most of the condition can be evaluated at translation time.
 +         * Test for EL2 present, and defer test for SEL2 to runtime.
 +         */
 +        if (s->current_el <= 1 && arm_dc_feature(s, ARM_FEATURE_EL2)) {
 +            gen_helper_vesb(cpu_env);
 +        }
 +    }
 +    return true;
 +}
 +
  static bool trans_NOP(DisasContext *s, arg_NOP *a)
  {
      return true;
 --
-.25.1
+.34.1
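
The "Remove redundant advsimd float16 helpers" message above describes two macro families that each generate a thin wrapper around the same float16 routine. The following is a rough sketch of why such wrappers are interchangeable. Everything prefixed demo_ or DEMO_ is hypothetical, and the arithmetic is a placeholder on raw integers, not IEEE half-precision and not softfloat.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a float_status object. */
typedef struct {
    int flags;
} DemoStatus;

/* Placeholder "float16" operations; real code would call softfloat. */
static uint32_t demo_f16_add(uint32_t a, uint32_t b, DemoStatus *st)
{
    (void)st;
    return (a + b) & 0xffff;
}

static uint32_t demo_f16_sub(uint32_t a, uint32_t b, DemoStatus *st)
{
    (void)st;
    return (a - b) & 0xffff;
}

/* First wrapper family, in the style of the removed helpers. */
#define DEMO_ADVSIMD_HALFOP(name)                                      \
static uint32_t demo_advsimd_##name##h(uint32_t a, uint32_t b,         \
                                       DemoStatus *st)                 \
{                                                                      \
    return demo_f16_##name(a, b, st);                                  \
}

/* Second wrapper family, in the style of the helpers that remain. */
#define DEMO_VFP_HALFOP(name)                                          \
static uint32_t demo_vfp_##name##h(uint32_t a, uint32_t b,             \
                                   DemoStatus *st)                     \
{                                                                      \
    return demo_f16_##name(a, b, st);                                  \
}

DEMO_ADVSIMD_HALFOP(add)
DEMO_ADVSIMD_HALFOP(sub)
DEMO_VFP_HALFOP(add)
DEMO_VFP_HALFOP(sub)

typedef uint32_t DemoHalfFn(uint32_t, uint32_t, DemoStatus *);

/* Returns 1 when two wrappers produce the same result for one input pair. */
static int demo_same_result(DemoHalfFn *x, DemoHalfFn *y,
                            uint32_t a, uint32_t b)
{
    DemoStatus st = { 0 };
    return x(a, b, &st) == y(a, b, &st);
}
```

Since the two families share a signature and a body, a caller holding a function pointer can switch from one to the other without any other change, which is what the translate-a64.c hunks in that patch do.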

-[PULL 15/32] target/arm: Enable SCR and HCR bits for RAS
+[PULL 35/36] target/arm: Use FPST_A64_F16 for halfprec-to-other conversions
-From: Richard Henderson <richard.henderson@linaro.org>
+We should be using the F16-specific float_status for conversions from
 half-precision, because halfprec inputs never set Input Denormal.
-Enable writes to the TERR and TEA bits when RAS is enabled.
+Without FEAT_AHP, using the wrong fpst here had no effect, because
-These bits are otherwise RES0.
+the only difference between the A64_F16 and A64 fpst is its handling
 of flush-to-zero on input and output, and the helper functions
 vfp_fcvt_f16_to_* and vfp_fcvt_*_to_f16 all explicitly squash the
 relevant flushing flags, and flush_inputs_to_zero was the only way
 that IDC could be set.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+With FEAT_AHP, the FPCR.AH=1 behaviour sets IDC for
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+input_denormal_used, which we will only ignore in
-Message-id: 20220506180242.216785-15-richard.henderson@linaro.org
+vfp_get_fpsr_from_host() for the A64_F16 fpst; so it matters that we
 use that one for f16 inputs (and the normal one for single/double to
 f16 conversions).
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250124162836.2332150-27-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 9 +++++++++
+ target/arm/tcg/translate-a64.c | 9 ++++++---
-file changed, 9 insertions(+)
+ target/arm/tcg/translate-sve.c | 4 ++--
 files changed, 8 insertions(+), 5 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/target/arm/tcg/translate-a64.c
-+++ b/target/arm/helper.c
++++ b/target/arm/tcg/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ static void scr_write(CPUARMState *env, const ARMCPRegInfo *ri, uint64_t value)
+@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_sh(DisasContext *s, arg_rr *a)
-         }
+     if (fp_access_check(s)) {
-         valid_mask &= ~SCR_NET;
+         TCGv_i32 tcg_rn = read_fp_hreg(s, a->rn);
+         TCGv_i32 tcg_rd = tcg_temp_new_i32();
-+        if (cpu_isar_feature(aa64_ras, cpu)) {
+-        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64);
-+            valid_mask |= SCR_TERR;
++        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64_F16);
-+        }
+         TCGv_i32 tcg_ahp = get_ahp_flag();
-         if (cpu_isar_feature(aa64_lor, cpu)) {
-             valid_mask |= SCR_TLOR;
+         gen_helper_vfp_fcvt_f16_to_f32(tcg_rd, tcg_rn, tcg_fpst, tcg_ahp);
-         }
+@@ -XXX,XX +XXX,XX @@ static bool trans_FCVT_s_dh(DisasContext *s, arg_rr *a)
-@@ -XXX,XX +XXX,XX @@ static void scr_write(CPUARMState *env, const ARMCPRegInfo *ri, uint64_t value)
+     if (fp_access_check(s)) {
-         }
+         TCGv_i32 tcg_rn = read_fp_hreg(s, a->rn);
-     } else {
+         TCGv_i64 tcg_rd = tcg_temp_new_i64();
-         valid_mask &= ~(SCR_RW | SCR_ST);
+-        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64);
-+        if (cpu_isar_feature(aa32_ras, cpu)) {
++        TCGv_ptr tcg_fpst = fpstatus_ptr(FPST_A64_F16);
-+            valid_mask |= SCR_TERR;
+         TCGv_i32 tcg_ahp = get_ahp_flag();
-+        }
          gen_helper_vfp_fcvt_f16_to_f64(tcg_rd, tcg_rn, tcg_fpst, tcg_ahp);
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVTL_v(DisasContext *s, arg_qrr_e *a)
          return true;
      }
-     if (!arm_feature(env, ARM_FEATURE_EL2)) {
+-    fpst = fpstatus_ptr(FPST_A64);
-@@ -XXX,XX +XXX,XX @@ static void do_hcr_write(CPUARMState *env, uint64_t value, uint64_t valid_mask)
+     if (a->esz == MO_64) {
-         if (cpu_isar_feature(aa64_vh, cpu)) {
+         /* 32 -> 64 bit fp conversion */
-             valid_mask |= HCR_E2H;
+         TCGv_i64 tcg_res[2];
-         }
+         TCGv_i32 tcg_op = tcg_temp_new_i32();
-+        if (cpu_isar_feature(aa64_ras, cpu)) {
+         int srcelt = a->q ? 2 : 0;
-+            valid_mask |= HCR_TERR | HCR_TEA;
-+        }
++        fpst = fpstatus_ptr(FPST_A64);
-         if (cpu_isar_feature(aa64_lor, cpu)) {
++
-             valid_mask |= HCR_TLOR;
+         for (pass = 0; pass < 2; pass++) {
-         }
+             tcg_res[pass] = tcg_temp_new_i64();
              read_vec_element_i32(s, tcg_op, a->rn, srcelt + pass, MO_32);
@@ -XXX,XX +XXX,XX @@ static bool trans_FCVTL_v(DisasContext *s, arg_qrr_e *a)
          TCGv_i32 tcg_res[4];
          TCGv_i32 ahp = get_ahp_flag();
 +        fpst = fpstatus_ptr(FPST_A64_F16);
 +
          for (pass = 0; pass < 4; pass++) {
              tcg_res[pass] = tcg_temp_new_i32();
              read_vec_element_i32(s, tcg_res[pass], a->rn, srcelt + pass, MO_16);
 diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/tcg/translate-sve.c
 +++ b/target/arm/tcg/translate-sve.c
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(FCMLA_zzxz, aa64_sve, gen_gvec_fpst_zzzz, fcmla_idx_fns[a->esz],
  TRANS_FEAT(FCVT_sh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvt_sh, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_hs, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_hs, a, 0, FPST_A64)
 +           gen_helper_sve_fcvt_hs, a, 0, FPST_A64_F16)
  TRANS_FEAT(BFCVT, aa64_sve_bf16, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_bfcvt, a, 0, FPST_A64)
@@ -XXX,XX +XXX,XX @@ TRANS_FEAT(BFCVT, aa64_sve_bf16, gen_gvec_fpst_arg_zpz,
  TRANS_FEAT(FCVT_dh, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvt_dh, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_hd, aa64_sve, gen_gvec_fpst_arg_zpz,
 -           gen_helper_sve_fcvt_hd, a, 0, FPST_A64)
 +           gen_helper_sve_fcvt_hd, a, 0, FPST_A64_F16)
  TRANS_FEAT(FCVT_ds, aa64_sve, gen_gvec_fpst_arg_zpz,
             gen_helper_sve_fcvt_ds, a, 0, FPST_A64)
  TRANS_FEAT(FCVT_sd, aa64_sve, gen_gvec_fpst_arg_zpz,
 --
-.25.1
+.34.1
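
The rule stated in the "Use FPST_A64_F16 for halfprec-to-other conversions" message above is that the status object is chosen by the source format of the conversion. Here is a small sketch of that rule, assuming a simplified model: the Demo* enums and both functions are hypothetical, and demo_reports_idc() only restates the commit message's description of the FPCR.AH=1 case rather than the real vfp_get_fpsr_from_host() logic.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical mirror of the two status flavours named in the patch. */
typedef enum {
    DEMO_FPST_A64,
    DEMO_FPST_A64_F16
} DemoFPST;

/* Hypothetical source formats for a conversion. */
typedef enum {
    DEMO_F16,
    DEMO_F32,
    DEMO_F64
} DemoFmt;

/* Half-precision inputs use the F16 status; everything else the normal one. */
static DemoFPST demo_fpst_for_conversion(DemoFmt src)
{
    return src == DEMO_F16 ? DEMO_FPST_A64_F16 : DEMO_FPST_A64;
}

/*
 * Assumption taken from the commit message: when an input denormal was
 * used, only the normal status reports IDC; the F16 status ignores it.
 */
static bool demo_reports_idc(DemoFPST fpst, bool input_denormal_used)
{
    return input_denormal_used && fpst != DEMO_FPST_A64_F16;
}
```

Under this model a half-to-single conversion with a denormal input never reports IDC, while a single-to-half conversion still can, which matches the behaviour the patch is aiming for.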

-[PULL 08/32] target/arm: Set ID_DFR0.PerfMon for qemu-system-arm -cpu max
+[PULL 36/36] hw/usb/canokey: Fix buffer overflow for OUT packet
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Hongren Zheng <i@zenithal.me>
-We set this for qemu-system-aarch64, but failed to do so
+When a USBPacket in the OUT direction has a larger payload
-for the strictly 32-bit emulation.
+than the ep_out_buffer (of size 512), a buffer overflow
 would occur.
-Fixes: 3bec78447a9 ("target/arm: Provide ARMv8.4-PMU in '-cpu max'")
+Fix this by limiting the size passed to usb_packet_copy
 to at most the buffer size. As a further optimization, get rid
 of the ep_out_buffer and use ep_out directly as the target
 buffer.
 This was reported by a security researcher who artificially
 constructed an OUT packet of size 2047. The report has gone
 through the QEMU security process; because this device is for
 testing purposes and no deployment of it in a virtualization
 environment has been observed, it was triaged as not a security bug.
 Cc: qemu-stable@nongnu.org
 Fixes: d7d34918551dc48 ("hw/usb: Add CanoKey Implementation")
 Reported-by: Juan Jose Lopez Jaimez <thatjiaozi@gmail.com>
 Signed-off-by: Hongren Zheng <i@zenithal.me>
 Message-id: Z4TfMOrZz6IQYl_h@Sun
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220506180242.216785-8-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu_tcg.c | 4 ++++
+ hw/usb/canokey.h | 4 ----
-file changed, 4 insertions(+)
+ hw/usb/canokey.c | 6 +++---
 files changed, 3 insertions(+), 7 deletions(-)
-diff --git a/target/arm/cpu_tcg.c b/target/arm/cpu_tcg.c
+diff --git a/hw/usb/canokey.h b/hw/usb/canokey.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu_tcg.c
+--- a/hw/usb/canokey.h
-+++ b/target/arm/cpu_tcg.c
++++ b/hw/usb/canokey.h
-@@ -XXX,XX +XXX,XX @@ static void arm_max_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@
-     t = FIELD_DP32(t, ID_PFR2, SSBS, 1);
+ #define CANOKEY_EP_NUM 3
-     cpu->isar.id_pfr2 = t;
+ /* BULK/INTR IN can be up to 1352 bytes, e.g. get key info */
+ #define CANOKEY_EP_IN_BUFFER_SIZE 2048
-+    t = cpu->isar.id_dfr0;
+-/* BULK OUT can be up to 270 bytes, e.g. PIV import cert */
-+    t = FIELD_DP32(t, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
+-#define CANOKEY_EP_OUT_BUFFER_SIZE 512
-+    cpu->isar.id_dfr0 = t;
-+
+ typedef enum {
- #ifdef CONFIG_USER_ONLY
+     CANOKEY_EP_IN_WAIT,
-     /*
+@@ -XXX,XX +XXX,XX @@ typedef struct CanoKeyState {
-      * Break with true ARMv8 and add back old-style VFP short-vector support.
+     /* OUT pointer to canokey recv buffer */
      uint8_t *ep_out[CANOKEY_EP_NUM];
      uint32_t ep_out_size[CANOKEY_EP_NUM];
 -    /* For large BULK OUT, multiple write to ep_out is needed */
 -    uint8_t ep_out_buffer[CANOKEY_EP_NUM][CANOKEY_EP_OUT_BUFFER_SIZE];
      /* Properties */
      char *file; /* canokey-file */
 diff --git a/hw/usb/canokey.c b/hw/usb/canokey.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/usb/canokey.c
 +++ b/hw/usb/canokey.c
@@ -XXX,XX +XXX,XX @@ static void canokey_handle_data(USBDevice *dev, USBPacket *p)
      switch (p->pid) {
      case USB_TOKEN_OUT:
          trace_canokey_handle_data_out(ep_out, p->iov.size);
 -        usb_packet_copy(p, key->ep_out_buffer[ep_out], p->iov.size);
          out_pos = 0;
 +        /* segment packet into (possibly multiple) ep_out */
          while (out_pos != p->iov.size) {
              /*
               * key->ep_out[ep_out] set by prepare_receive
@@ -XXX,XX +XXX,XX @@ static void canokey_handle_data(USBDevice *dev, USBPacket *p)
               * to be the buffer length
               */
              out_len = MIN(p->iov.size - out_pos, key->ep_out_size[ep_out]);
 -            memcpy(key->ep_out[ep_out],
 -                    key->ep_out_buffer[ep_out] + out_pos, out_len);
 +            /* usb_packet_copy advances the packet's offset internally */
 +            usb_packet_copy(p, key->ep_out[ep_out], out_len);
              out_pos += out_len;
              /* update ep_out_size to actual len */
              key->ep_out_size[ep_out] = out_len;
 --
-.25.1
+.34.1
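
The CanoKey fix above replaces one unbounded copy into a fixed staging buffer with a loop that copies at most one receive-buffer's worth per iteration. What follows is a self-contained sketch of that loop under simplifying assumptions: DemoPacket, demo_packet_copy() and demo_drain() are hypothetical names, and the destination buffer is simply overwritten on each pass, whereas the real device hands each chunk on before the next receive buffer is prepared.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a USB packet: payload plus a read cursor. */
typedef struct {
    const unsigned char *data;
    size_t size;
    size_t pos;
} DemoPacket;

/* Copy exactly len bytes from the cursor and advance it. */
static void demo_packet_copy(DemoPacket *p, unsigned char *dst, size_t len)
{
    assert(p->pos + len <= p->size);
    memcpy(dst, p->data + p->pos, len);
    p->pos += len;
}

/*
 * Drain the packet in chunks no larger than the receive buffer.
 * Returns the number of chunks delivered.
 */
static size_t demo_drain(DemoPacket *p, unsigned char *dst, size_t dst_size)
{
    size_t chunks = 0;

    if (dst_size == 0) {
        return 0; /* nothing can be delivered; avoid looping forever */
    }
    while (p->pos != p->size) {
        size_t remaining = p->size - p->pos;
        size_t len = remaining < dst_size ? remaining : dst_size;

        /* The copy length is bounded by dst_size, so dst cannot overflow. */
        demo_packet_copy(p, dst, len);
        chunks++;
    }
    return chunks;
}
```

With a 2047-byte payload and a 512-byte destination, the loop makes four bounded copies instead of one oversized one, and no write ever exceeds the destination size.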

target-arm queue: the big stuff here is the final part of
rth's patches for Cortex-A76 and Neoverse-N1 support;
also present are Gavin's NUMA series and a few other things.

thanks
-- PMM

The following changes since commit 554623226f800acf48a2ed568900c1c968ec9a8b:

Merge tag 'qemu-sparc-20220508' of https://github.com/mcayland/qemu into staging (2022-05-08 17:03:26 -0500)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20220509

for you to fetch changes up to ae9141d4a3265553503bf07d3574b40f84615a34:

hw/acpi/aml-build: Use existing CPU topology to build PPTT table (2022-05-09 11:47:55 +0100)

----------------------------------------------------------------
target-arm queue:
 * MAINTAINERS/.mailmap: update email for Leif Lindholm
 * hw/arm: add version information to sbsa-ref machine DT
 * Enable new features for -cpu max:
   FEAT_Debugv8p2, FEAT_Debugv8p4, FEAT_RAS (minimal version only),
   FEAT_IESB, FEAT_CSV2, FEAT_CSV2_2, FEAT_CSV3, FEAT_DGH
 * Emulate Cortex-A76
 * Emulate Neoverse-N1
 * Fix the virt board default NUMA topology

----------------------------------------------------------------
Gavin Shan (6):
      qapi/machine.json: Add cluster-id
      qtest/numa-test: Specify CPU topology in aarch64_numa_cpu()
      hw/arm/virt: Consider SMP configuration in CPU topology
      qtest/numa-test: Correct CPU and NUMA association in aarch64_numa_cpu()
      hw/arm/virt: Fix CPU's default NUMA node ID
      hw/acpi/aml-build: Use existing CPU topology to build PPTT table

Leif Lindholm (2):
      MAINTAINERS/.mailmap: update email for Leif Lindholm
      hw/arm: add versioning to sbsa-ref machine DT

Richard Henderson (24):
      target/arm: Handle cpreg registration for missing EL
      target/arm: Drop EL3 no EL2 fallbacks
      target/arm: Merge zcr reginfo
      target/arm: Adjust definition of CONTEXTIDR_EL2
      target/arm: Move cortex impdef sysregs to cpu_tcg.c
      target/arm: Update qemu-system-arm -cpu max to cortex-a57
      target/arm: Set ID_DFR0.PerfMon for qemu-system-arm -cpu max
      target/arm: Split out aa32_max_features
      target/arm: Annotate arm_max_initfn with FEAT identifiers
      target/arm: Use field names for manipulating EL2 and EL3 modes
      target/arm: Enable FEAT_Debugv8p2 for -cpu max
      target/arm: Enable FEAT_Debugv8p4 for -cpu max
      target/arm: Add minimal RAS registers
      target/arm: Enable SCR and HCR bits for RAS
      target/arm: Implement virtual SError exceptions
      target/arm: Implement ESB instruction
      target/arm: Enable FEAT_RAS for -cpu max
      target/arm: Enable FEAT_IESB for -cpu max
      target/arm: Enable FEAT_CSV2 for -cpu max
      target/arm: Enable FEAT_CSV2_2 for -cpu max
      target/arm: Enable FEAT_CSV3 for -cpu max
      target/arm: Enable FEAT_DGH for -cpu max
      target/arm: Define cortex-a76
      target/arm: Define neoverse-n1

From: Leif Lindholm <quic_llindhol@quicinc.com>

NUVIA was acquired by Qualcomm in March 2021, but kept functioning on
separate infrastructure for a transitional period. We've now switched
over to contributing as Qualcomm Innovation Center (quicinc), so update
my email address to reflect this.

Signed-off-by: Leif Lindholm <quic_llindhol@quicinc.com>
Message-id: 20220505113740.75565-1-quic_llindhol@quicinc.com
Cc: Leif Lindholm <leif@nuviainc.com>
Cc: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
[Fixed commit message typo]
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 .mailmap    | 3 ++-
 MAINTAINERS | 2 +-
 2 files changed, 3 insertions(+), 2 deletions(-)

diff --git a/.mailmap b/.mailmap
index XXXXXXX..XXXXXXX 100644
--- a/.mailmap
+++ b/.mailmap
@@ -XXX,XX +XXX,XX @@ Greg Kurz <groug@kaod.org> <gkurz@linux.vnet.ibm.com>
 Huacai Chen <chenhuacai@kernel.org> <chenhc@lemote.com>
 Huacai Chen <chenhuacai@kernel.org> <chenhuacai@loongson.cn>
 James Hogan <jhogan@kernel.org> <james.hogan@imgtec.com>
-Leif Lindholm <leif@nuviainc.com> <leif.lindholm@linaro.org>
+Leif Lindholm <quic_llindhol@quicinc.com> <leif.lindholm@linaro.org>
+Leif Lindholm <quic_llindhol@quicinc.com> <leif@nuviainc.com>
 Radoslaw Biernacki <rad@semihalf.com> <radoslaw.biernacki@linaro.org>
 Paul Burton <paulburton@kernel.org> <paul.burton@mips.com>
 Paul Burton <paulburton@kernel.org> <paul.burton@imgtec.com>
diff --git a/MAINTAINERS b/MAINTAINERS
index XXXXXXX..XXXXXXX 100644
--- a/MAINTAINERS
+++ b/MAINTAINERS
@@ -XXX,XX +XXX,XX @@ F: include/hw/ssi/imx_spi.h
 SBSA-REF
 M: Radoslaw Biernacki <rad@semihalf.com>
 M: Peter Maydell <peter.maydell@linaro.org>
-R: Leif Lindholm <leif@nuviainc.com>
+R: Leif Lindholm <quic_llindhol@quicinc.com>
 L: qemu-arm@nongnu.org
 S: Maintained
 F: hw/arm/sbsa-ref.c
-- 
2.25.1