Series comparison

-[PULL 00/39] target-arm queue
+[PULL 00/35] target-arm queue
-Most of this is the Neon decodetree patches, followed by Edgar's versal cleanups.
+The following changes since commit 5767815218efd3cbfd409505ed824d5f356044ae:
-thanks
+  Merge tag 'for_upstream' of https://git.kernel.org/pub/scm/virt/kvm/mst/qemu into staging (2024-02-14 15:45:52 +0000)
 -- PMM
 The following changes since commit 2ef486e76d64436be90f7359a3071fb2a56ce835:
   Merge remote-tracking branch 'remotes/marcel/tags/rdma-pull-request' into staging (2020-05-03 14:12:56 +0100)
 are available in the Git repository at:
-  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20200504
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20240215
-for you to fetch changes up to 9aefc6cf9b73f66062d2f914a0136756e7a28211:
+for you to fetch changes up to f780e63fe731b058fe52d43653600d8729a1b5f2:
-  target/arm: Move gen_ function typedefs to translate.h (2020-05-04 12:59:26 +0100)
+  docs: Add documentation for the mps3-an536 board (2024-02-15 14:32:39 +0000)
 ----------------------------------------------------------------
 target-arm queue:
- * Start of conversion of Neon insns to decodetree
+ * hw/arm/xilinx_zynq: Wire FIQ between CPU <> GIC
- * versal board: support SD and RTC
+ * linux-user/aarch64: Choose SYNC as the preferred MTE mode
- * Implement ARMv8.2-TTS2UXN
+ * Fix some errors in SVE/SME handling of MTE tags
- * Make VQDMULL undefined when U=1
+ * hw/pci-host/raven.c: Mark raven_io_ops as implementing unaligned accesses
- * Some minor code cleanups
+ * hw/block/tc58128: Don't emit deprecation warning under qtest
  * tests/qtest: Fix handling of npcm7xx and GMAC tests
  * hw/arm/virt: Wire up non-secure EL2 virtual timer IRQ
  * tests/qtest/npcm7xx_emc-test: Connect all NICs to a backend
  * Don't assert on vmload/vmsave of M-profile CPUs
  * hw/arm/smmuv3: add support for stage 1 access fault
  * hw/arm/stellaris: QOM cleanups
  * Use new CBAR encoding for all v8 CPUs, not all aarch64 CPUs
  * Improve Cortex_R52 IMPDEF sysreg modelling
  * Allow access to SPSR_hyp from hyp mode
  * New board model mps3-an536 (Cortex-R52)
 ----------------------------------------------------------------
-Edgar E. Iglesias (11):
+Luc Michel (1):
-      hw/arm: versal: Remove inclusion of arm_gicv3_common.h
+      hw/arm/smmuv3: add support for stage 1 access fault
       hw/arm: versal: Move misplaced comment
       hw/arm: versal-virt: Fix typo xlnx-ve -> xlnx-versal
       hw/arm: versal: Embed the UARTs into the SoC type
       hw/arm: versal: Embed the GEMs into the SoC type
       hw/arm: versal: Embed the ADMAs into the SoC type
       hw/arm: versal: Embed the APUs into the SoC type
       hw/arm: versal: Add support for SD
       hw/arm: versal: Add support for the RTC
       hw/arm: versal-virt: Add support for SD
       hw/arm: versal-virt: Add support for the RTC
-Fredrik Strupe (1):
+Nabih Estefan (1):
-      target/arm: Make VQDMULL undefined when U=1
+      tests/qtest: Fix GMAC test to run on a machine in upstream QEMU
-Peter Maydell (25):
+Peter Maydell (22):
-      target/arm: Don't use a TLB for ARMMMUIdx_Stage2
+      hw/pci-host/raven.c: Mark raven_io_ops as implementing unaligned accesses
-      target/arm: Use enum constant in get_phys_addr_lpae() call
+      hw/block/tc58128: Don't emit deprecation warning under qtest
-      target/arm: Add new 's1_is_el0' argument to get_phys_addr_lpae()
+      tests/qtest/meson.build: Don't include qtests_npcm7xx in qtests_aarch64
-      target/arm: Implement ARMv8.2-TTS2UXN
+      tests/qtest/bios-tables-test: Allow changes to virt GTDT
-      target/arm: Use correct variable for setting 'max' cpu's ID_AA64DFR0
+      hw/arm/virt: Wire up non-secure EL2 virtual timer IRQ
-      target/arm/translate-vfp.inc.c: Remove duplicate simd_r32 check
+      tests/qtest/bios-tables-tests: Update virt golden reference
-      target/arm: Don't allow Thumb Neon insns without FEATURE_NEON
+      hw/arm/npcm7xx: Call qemu_configure_nic_device() for GMAC modules
-      target/arm: Add stubs for AArch32 Neon decodetree
+      tests/qtest/npcm7xx_emc-test: Connect all NICs to a backend
-      target/arm: Convert VCMLA (vector) to decodetree
+      target/arm: Don't get MDCR_EL2 in pmu_counter_enabled() before checking ARM_FEATURE_PMU
-      target/arm: Convert VCADD (vector) to decodetree
+      target/arm: Use new CBAR encoding for all v8 CPUs, not all aarch64 CPUs
-      target/arm: Convert V[US]DOT (vector) to decodetree
+      target/arm: The Cortex-R52 has a read-only CBAR
-      target/arm: Convert VFM[AS]L (vector) to decodetree
+      target/arm: Add Cortex-R52 IMPDEF sysregs
-      target/arm: Convert VCMLA (scalar) to decodetree
+      target/arm: Allow access to SPSR_hyp from hyp mode
-      target/arm: Convert V[US]DOT (scalar) to decodetree
+      hw/misc/mps2-scc: Fix condition for CFG3 register
-      target/arm: Convert VFM[AS]L (scalar) to decodetree
+      hw/misc/mps2-scc: Factor out which-board conditionals
-      target/arm: Convert Neon load/store multiple structures to decodetree
+      hw/misc/mps2-scc: Make changes needed for AN536 FPGA image
-      target/arm: Convert Neon 'load single structure to all lanes' to decodetree
+      hw/arm/mps3r: Initial skeleton for mps3-an536 board
-      target/arm: Convert Neon 'load/store single structure' to decodetree
+      hw/arm/mps3r: Add CPUs, GIC, and per-CPU RAM
-      target/arm: Convert Neon 3-reg-same VADD/VSUB to decodetree
+      hw/arm/mps3r: Add UARTs
-      target/arm: Convert Neon 3-reg-same logic ops to decodetree
+      hw/arm/mps3r: Add GPIO, watchdog, dual-timer, I2C devices
-      target/arm: Convert Neon 3-reg-same VMAX/VMIN to decodetree
+      hw/arm/mps3r: Add remaining devices
-      target/arm: Convert Neon 3-reg-same comparisons to decodetree
+      docs: Add documentation for the mps3-an536 board
       target/arm: Convert Neon 3-reg-same VQADD/VQSUB to decodetree
       target/arm: Convert Neon 3-reg-same VMUL, VMLA, VMLS, VSHL to decodetree
       target/arm: Move gen_ function typedefs to translate.h
-Philippe Mathieu-Daudé (2):
+Philippe Mathieu-Daudé (5):
-      hw/arm/mps2-tz: Use TYPE_IOTKIT instead of hardcoded string
+      hw/arm/xilinx_zynq: Wire FIQ between CPU <> GIC
-      target/arm: Use uint64_t for midr field in CPU state struct
+      hw/arm/stellaris: Convert ADC controller to Resettable interface
       hw/arm/stellaris: Convert I2C controller to Resettable interface
       hw/arm/stellaris: Add missing QOM 'machine' parent
       hw/arm/stellaris: Add missing QOM 'SoC' parent
- include/hw/arm/xlnx-versal.h    |  31 +-
+Richard Henderson (6):
- target/arm/cpu-param.h          |   2 +-
+      linux-user/aarch64: Choose SYNC as the preferred MTE mode
- target/arm/cpu.h                |  38 ++-
+      target/arm: Fix nregs computation in do_{ld,st}_zpa
- target/arm/translate-a64.h      |   9 -
+      target/arm: Adjust and validate mtedesc sizem1
- target/arm/translate.h          |  26 ++
+      target/arm: Split out make_svemte_desc
- target/arm/neon-dp.decode       |  86 +++++
+      target/arm: Handle mte in do_ldrq, do_ldro
- target/arm/neon-ls.decode       |  52 +++
+      target/arm: Fix SVE/SME gross MTE suppression checks
  target/arm/neon-shared.decode   |  66 ++++
  hw/arm/mps2-tz.c                |   2 +-
  hw/arm/xlnx-versal-virt.c       |  74 ++++-
  hw/arm/xlnx-versal.c            | 115 +++++--
  target/arm/cpu.c                |   3 +-
  target/arm/cpu64.c              |   8 +-
  target/arm/helper.c             | 183 ++++------
  target/arm/translate-a64.c      |  17 -
  target/arm/translate-neon.inc.c | 714 +++++++++++++++++++++++++++++++++++++++
  target/arm/translate-vfp.inc.c  |   6 -
  target/arm/translate.c          | 716 +++-------------------------------------
  target/arm/Makefile.objs        |  18 +
 files changed, 1302 insertions(+), 864 deletions(-)
  create mode 100644 target/arm/neon-dp.decode
  create mode 100644 target/arm/neon-ls.decode
  create mode 100644 target/arm/neon-shared.decode
  create mode 100644 target/arm/translate-neon.inc.c
+ MAINTAINERS                             |   3 +-
+ docs/system/arm/mps2.rst                |  37 +-
+ configs/devices/arm-softmmu/default.mak |   1 +
+ hw/arm/smmuv3-internal.h                |   1 +
+ include/hw/arm/smmu-common.h            |   1 +
+ include/hw/arm/virt.h                   |   2 +
+ include/hw/misc/mps2-scc.h              |   1 +
+ linux-user/aarch64/target_prctl.h       |  29 +-
+ target/arm/internals.h                  |   2 +-
+ target/arm/tcg/translate-a64.h          |   2 +
+ hw/arm/mps3r.c                          | 640 ++++++++++++++++++++++++++++++++
+ hw/arm/npcm7xx.c                        |   1 +
+ hw/arm/smmu-common.c                    |  11 +
+ hw/arm/smmuv3.c                         |   1 +
+ hw/arm/stellaris.c                      |  47 ++-
+ hw/arm/virt-acpi-build.c                |  20 +-
+ hw/arm/virt.c                           |  60 ++-
+ hw/arm/xilinx_zynq.c                    |   2 +
+ hw/block/tc58128.c                      |   4 +-
+ hw/misc/mps2-scc.c                      | 138 ++++++-
+ hw/pci-host/raven.c                     |   1 +
+ target/arm/helper.c                     |  14 +-
+ target/arm/tcg/cpu32.c                  | 109 ++++++
+ target/arm/tcg/op_helper.c              |  43 ++-
+ target/arm/tcg/sme_helper.c             |   8 +-
+ target/arm/tcg/sve_helper.c             |  12 +-
+ target/arm/tcg/translate-sme.c          |  15 +-
+ target/arm/tcg/translate-sve.c          |  83 +++--
+ target/arm/tcg/translate.c              |  19 +-
+ tests/qtest/npcm7xx_emc-test.c          |   5 +-
+ tests/qtest/npcm_gmac-test.c            |  84 +----
+ hw/arm/Kconfig                          |   5 +
+ hw/arm/meson.build                      |   1 +
+ tests/data/acpi/virt/FACP               | Bin 276 -> 276 bytes
+ tests/data/acpi/virt/GTDT               | Bin 96 -> 104 bytes
+ tests/qtest/meson.build                 |   4 +-
+files changed, 1184 insertions(+), 222 deletions(-)
+ create mode 100644 hw/arm/mps3r.c

-[PULL 01/39] target/arm: Make VQDMULL undefined when U=1
+Deleted patch
-From: Fredrik Strupe <fredrik@strupe.net>
-According to Arm ARM, VQDMULL is only valid when U=0, while having
-U=1 is unallocated.
-Signed-off-by: Fredrik Strupe <fredrik@strupe.net>
-Fixes: 695272dcb976 ("target-arm: Handle UNDEF cases for Neon 3-regs-different-widths")
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/translate.c | 2 +-
-file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/target/arm/translate.c b/target/arm/translate.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.c
-+++ b/target/arm/translate.c
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
-                     {0, 0, 0, 0}, /* VMLSL */
-                     {0, 0, 0, 9}, /* VQDMLSL */
-                     {0, 0, 0, 0}, /* Integer VMULL */
--                    {0, 0, 0, 1}, /* VQDMULL */
-+                    {0, 0, 0, 9}, /* VQDMULL */
-                     {0, 0, 0, 0xa}, /* Polynomial VMULL */
-                     {0, 0, 0, 7}, /* Reserved: always UNDEF */
-                 };
---
-.20.1

-[PULL 08/39] target/arm: Use uint64_t for midr field in CPU state struct
+[PULL 01/35] hw/arm/xilinx_zynq: Wire FIQ between CPU <> GIC
-From: Philippe Mathieu-Daudé <f4bug@amsat.org>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-MIDR_EL1 is a 64-bit system register with the top 32-bit being RES0.
+Similarly to commits dadbb58f59..5ae79fe825 for other ARM boards,
-Represent it in QEMU's ARMCPU struct with a uint64_t, not a
+connect FIQ output of the GIC CPU interfaces to the CPU.
 uint32_t.
-This fixes an error when compiling with -Werror=conversion
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-because we were manipulating the register value using a
+Message-id: 20240130152548.17855-1-philmd@linaro.org
 local uint64_t variable:
   target/arm/cpu64.c: In function ‘aarch64_max_initfn’:
   target/arm/cpu64.c:628:21: error: conversion from ‘uint64_t’ {aka ‘long unsigned int’} to ‘uint32_t’ {aka ‘unsigned int’} may change value [-Werror=conversion]
 |         cpu->midr = t;
         |                     ^
 and future-proofs us against a possible future architecture
 change using some of the top 32 bits.
 Suggested-by: Laurent Desnogues <laurent.desnogues@gmail.com>
 Suggested-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
 Message-id: 20200428172634.29707-1-f4bug@amsat.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h | 2 +-
+ hw/arm/xilinx_zynq.c | 2 ++
- target/arm/cpu.c | 2 +-
+file changed, 2 insertions(+)
 files changed, 2 insertions(+), 2 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
+diff --git a/hw/arm/xilinx_zynq.c b/hw/arm/xilinx_zynq.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/hw/arm/xilinx_zynq.c
-+++ b/target/arm/cpu.h
++++ b/hw/arm/xilinx_zynq.c
-@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
+@@ -XXX,XX +XXX,XX @@ static void zynq_init(MachineState *machine)
-         uint64_t id_aa64dfr0;
+     sysbus_mmio_map(busdev, 0, MPCORE_PERIPHBASE);
-         uint64_t id_aa64dfr1;
+     sysbus_connect_irq(busdev, 0,
-     } isar;
+                        qdev_get_gpio_in(DEVICE(cpu), ARM_CPU_IRQ));
--    uint32_t midr;
++    sysbus_connect_irq(busdev, 1,
-+    uint64_t midr;
++                       qdev_get_gpio_in(DEVICE(cpu), ARM_CPU_FIQ));
-     uint32_t revidr;
-     uint32_t reset_fpsid;
+     for (n = 0; n < 64; n++) {
-     uint32_t ctr;
+         pic[n] = qdev_get_gpio_in(dev, n);
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static const ARMCPUInfo arm_cpus[] = {
  static Property arm_cpu_properties[] = {
      DEFINE_PROP_BOOL("start-powered-off", ARMCPU, start_powered_off, false),
      DEFINE_PROP_UINT32("psci-conduit", ARMCPU, psci_conduit, 0),
 -    DEFINE_PROP_UINT32("midr", ARMCPU, midr, 0),
 +    DEFINE_PROP_UINT64("midr", ARMCPU, midr, 0),
      DEFINE_PROP_UINT64("mp-affinity", ARMCPU,
                          mp_affinity, ARM64_AFFINITY_INVALID),
      DEFINE_PROP_INT32("node-id", ARMCPU, node_id, CPU_UNSET_NUMA_NODE_ID),
 --
-.20.1
+.34.1

-[PULL 15/39] hw/arm: versal: Embed the APUs into the SoC type
+[PULL 02/35] linux-user/aarch64: Choose SYNC as the preferred MTE mode
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Richard Henderson <richard.henderson@linaro.org>
-Embed the APUs into the SoC type.
+The API does not generate an error for setting ASYNC | SYNC; that merely
 constrains the selection vs the per-cpu default.  For qemu linux-user,
 choose SYNC as the default.
-Suggested-by: Peter Maydell <peter.maydell@linaro.org>
+Cc: qemu-stable@nongnu.org
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Reported-by: Gustavo Romero <gustavo.romero@linaro.org>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Message-id: 20240207025210.8837-2-richard.henderson@linaro.org
 Message-id: 20200427181649.26851-8-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h |  2 +-
+ linux-user/aarch64/target_prctl.h | 29 +++++++++++++++++------------
- hw/arm/xlnx-versal-virt.c    |  4 ++--
+file changed, 17 insertions(+), 12 deletions(-)
  hw/arm/xlnx-versal.c         | 19 +++++--------------
 files changed, 8 insertions(+), 17 deletions(-)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/linux-user/aarch64/target_prctl.h b/linux-user/aarch64/target_prctl.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/linux-user/aarch64/target_prctl.h
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/linux-user/aarch64/target_prctl.h
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+@@ -XXX,XX +XXX,XX @@ static abi_long do_prctl_set_tagged_addr_ctrl(CPUArchState *env, abi_long arg2)
-     struct {
+     env->tagged_addr_enable = arg2 & PR_TAGGED_ADDR_ENABLE;
-         struct {
-             MemoryRegion mr;
+     if (cpu_isar_feature(aa64_mte, cpu)) {
--            ARMCPU *cpu[XLNX_VERSAL_NR_ACPUS];
+-        switch (arg2 & PR_MTE_TCF_MASK) {
-+            ARMCPU cpu[XLNX_VERSAL_NR_ACPUS];
+-        case PR_MTE_TCF_NONE:
-             GICv3State gic;
+-        case PR_MTE_TCF_SYNC:
-         } apu;
+-        case PR_MTE_TCF_ASYNC:
-     } fpd;
+-            break;
-diff --git a/hw/arm/xlnx-versal-virt.c b/hw/arm/xlnx-versal-virt.c
+-        default:
-index XXXXXXX..XXXXXXX 100644
+-            return -EINVAL;
 --- a/hw/arm/xlnx-versal-virt.c
 +++ b/hw/arm/xlnx-versal-virt.c
@@ -XXX,XX +XXX,XX @@ static void versal_virt_init(MachineState *machine)
      s->binfo.get_dtb = versal_virt_get_dtb;
      s->binfo.modify_dtb = versal_virt_modify_dtb;
      if (machine->kernel_filename) {
 -        arm_load_kernel(s->soc.fpd.apu.cpu[0], machine, &s->binfo);
 +        arm_load_kernel(&s->soc.fpd.apu.cpu[0], machine, &s->binfo);
      } else {
 -        AddressSpace *as = arm_boot_address_space(s->soc.fpd.apu.cpu[0],
 +        AddressSpace *as = arm_boot_address_space(&s->soc.fpd.apu.cpu[0],
                                                    &s->binfo);
          /* Some boot-loaders (e.g u-boot) don't like blobs at address 0 (NULL).
           * Offset things by 4K.  */
 diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/xlnx-versal.c
 +++ b/hw/arm/xlnx-versal.c
@@ -XXX,XX +XXX,XX @@ static void versal_create_apu_cpus(Versal *s)
      for (i = 0; i < ARRAY_SIZE(s->fpd.apu.cpu); i++) {
          Object *obj;
 -        char *name;
 -
 -        obj = object_new(XLNX_VERSAL_ACPU_TYPE);
 -        if (!obj) {
 -            error_report("Unable to create apu.cpu[%d] of type %s",
 -                         i, XLNX_VERSAL_ACPU_TYPE);
 -            exit(EXIT_FAILURE);
 -        }
 -
--        name = g_strdup_printf("apu-cpu[%d]", i);
+         /*
--        object_property_add_child(OBJECT(s), name, obj, &error_fatal);
+          * Write PR_MTE_TCF to SCTLR_EL1[TCF0].
--        g_free(name);
+-         * Note that the syscall values are consistent with hw.
++         *
-+        object_initialize_child(OBJECT(s), "apu-cpu[*]",
++         * The kernel has a per-cpu configuration for the sysadmin,
-+                                &s->fpd.apu.cpu[i], sizeof(s->fpd.apu.cpu[i]),
++         * /sys/devices/system/cpu/cpu<N>/mte_tcf_preferred,
-+                                XLNX_VERSAL_ACPU_TYPE, &error_abort, NULL);
++         * which qemu does not implement.
-+        obj = OBJECT(&s->fpd.apu.cpu[i]);
++         *
-         object_property_set_int(obj, s->cfg.psci_conduit,
++         * Because there is no performance difference between the modes, and
-                                 "psci-conduit", &error_abort);
++         * because SYNC is most useful for debugging MTE errors, choose SYNC
-         if (i) {
++         * as the preferred mode.  With this preference, and the way the API
-@@ -XXX,XX +XXX,XX @@ static void versal_create_apu_cpus(Versal *s)
++         * uses only two bits, there is no way for the program to select
-         object_property_set_link(obj, OBJECT(&s->fpd.apu.mr), "memory",
++         * ASYMM mode.
-                                  &error_abort);
+          */
-         object_property_set_bool(obj, true, "realized", &error_fatal);
+-        env->cp15.sctlr_el[1] =
--        s->fpd.apu.cpu[i] = ARM_CPU(obj);
+-            deposit64(env->cp15.sctlr_el[1], 38, 2, arg2 >> PR_MTE_TCF_SHIFT);
-     }
++        unsigned tcf = 0;
- }
++        if (arg2 & PR_MTE_TCF_SYNC) {
++            tcf = 1;
-@@ -XXX,XX +XXX,XX @@ static void versal_create_apu_gic(Versal *s, qemu_irq *pic)
++        } else if (arg2 & PR_MTE_TCF_ASYNC) {
-     }
++            tcf = 2;
++        }
-     for (i = 0; i < nr_apu_cpus; i++) {
++        env->cp15.sctlr_el[1] = deposit64(env->cp15.sctlr_el[1], 38, 2, tcf);
--        DeviceState *cpudev = DEVICE(s->fpd.apu.cpu[i]);
-+        DeviceState *cpudev = DEVICE(&s->fpd.apu.cpu[i]);
+         /*
-         int ppibase = XLNX_VERSAL_NR_IRQS + i * GIC_INTERNAL + GIC_NR_SGIS;
+          * Write PR_MTE_TAG to GCR_EL1[Exclude].
          qemu_irq maint_irq;
          int ti;
 --
-.20.1
+.34.1

-[PULL 02/39] hw/arm/mps2-tz: Use TYPE_IOTKIT instead of hardcoded string
+[PULL 03/35] target/arm: Fix nregs computation in do_{ld,st}_zpa
-From: Philippe Mathieu-Daudé <f4bug@amsat.org>
+From: Richard Henderson <richard.henderson@linaro.org>
-By using the TYPE_* definitions for devices, we can:
+The field is encoded as [0-3], which is convenient for
- - quickly find where devices are used with 'git-grep'
+indexing our array of function pointers, but the true
- - easily rename a device (one-line change).
+value is [1-4].  Adjust before calling do_mem_zpa.
-Signed-off-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Add an assert, and move the comment re passing ZT to
-Message-id: 20200428154650.21991-1-f4bug@amsat.org
+the helper back next to the relevant code.
 Cc: qemu-stable@nongnu.org
 Fixes: 206adacfb8d ("target/arm: Add mte helpers for sve scalar + int loads")
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
 Message-id: 20240207025210.8837-3-richard.henderson@linaro.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/mps2-tz.c | 2 +-
+ target/arm/tcg/translate-sve.c | 16 ++++++++--------
-file changed, 1 insertion(+), 1 deletion(-)
+file changed, 8 insertions(+), 8 deletions(-)
-diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
+diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/mps2-tz.c
+--- a/target/arm/tcg/translate-sve.c
-+++ b/hw/arm/mps2-tz.c
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
+@@ -XXX,XX +XXX,XX @@ static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
-         exit(EXIT_FAILURE);
+     TCGv_ptr t_pg;
      int desc = 0;
 -    /*
 -     * For e.g. LD4, there are not enough arguments to pass all 4
 -     * registers as pointers, so encode the regno into the data field.
 -     * For consistency, do this even for LD1.
 -     */
 +    assert(mte_n >= 1 && mte_n <= 4);
      if (s->mte_active[0]) {
          int msz = dtype_msz(dtype);
@@ -XXX,XX +XXX,XX @@ static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
          addr = clean_data_tbi(s, addr);
      }
--    sysbus_init_child_obj(OBJECT(machine), "iotkit", &mms->iotkit,
++    /*
-+    sysbus_init_child_obj(OBJECT(machine), TYPE_IOTKIT, &mms->iotkit,
++     * For e.g. LD4, there are not enough arguments to pass all 4
-                           sizeof(mms->iotkit), mmc->armsse_type);
++     * registers as pointers, so encode the regno into the data field.
-     iotkitdev = DEVICE(&mms->iotkit);
++     * For consistency, do this even for LD1.
-     object_property_set_link(OBJECT(&mms->iotkit), OBJECT(system_memory),
++     */
      desc = simd_desc(vsz, vsz, zt | desc);
      t_pg = tcg_temp_new_ptr();
@@ -XXX,XX +XXX,XX @@ static void do_ld_zpa(DisasContext *s, int zt, int pg,
       * accessible via the instruction encoding.
       */
      assert(fn != NULL);
 -    do_mem_zpa(s, zt, pg, addr, dtype, nreg, false, fn);
 +    do_mem_zpa(s, zt, pg, addr, dtype, nreg + 1, false, fn);
  }
  static bool trans_LD_zprr(DisasContext *s, arg_rprr_load *a)
@@ -XXX,XX +XXX,XX @@ static void do_st_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
      if (nreg == 0) {
          /* ST1 */
          fn = fn_single[s->mte_active[0]][be][msz][esz];
 -        nreg = 1;
      } else {
          /* ST2, ST3, ST4 -- msz == esz, enforced by encoding */
          assert(msz == esz);
          fn = fn_multiple[s->mte_active[0]][be][nreg - 1][msz];
      }
      assert(fn != NULL);
 -    do_mem_zpa(s, zt, pg, addr, msz_dtype(s, msz), nreg, true, fn);
 +    do_mem_zpa(s, zt, pg, addr, msz_dtype(s, msz), nreg + 1, true, fn);
  }
  static bool trans_ST_zprr(DisasContext *s, arg_rprr_store *a)
 --
-.20.1
+.34.1

-[PULL 12/39] hw/arm: versal: Embed the UARTs into the SoC type
+[PULL 04/35] target/arm: Adjust and validate mtedesc sizem1
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Richard Henderson <richard.henderson@linaro.org>
-Embed the UARTs into the SoC type.
+When we added SVE_MTEDESC_SHIFT, we effectively limited the
 maximum size of MTEDESC.  Adjust SIZEM1 to consume the remaining
 bits (32 - 10 - 5 - 12 == 5).  Assert that the data to be stored
 fits within the field (expecting 8 * 4 - 1 == 31, exact fit).
-Suggested-by: Peter Maydell <peter.maydell@linaro.org>
+Cc: qemu-stable@nongnu.org
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Message-id: 20240207025210.8837-4-richard.henderson@linaro.org
 Message-id: 20200427181649.26851-5-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h |  3 ++-
+ target/arm/internals.h         | 2 +-
- hw/arm/xlnx-versal.c         | 12 ++++++------
+ target/arm/tcg/translate-sve.c | 7 ++++---
-files changed, 8 insertions(+), 7 deletions(-)
+files changed, 5 insertions(+), 4 deletions(-)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/target/arm/internals.h b/target/arm/internals.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/target/arm/internals.h
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/target/arm/internals.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ FIELD(MTEDESC, TBI,   4, 2)
- #include "hw/sysbus.h"
+ FIELD(MTEDESC, TCMA,  6, 2)
- #include "hw/arm/boot.h"
+ FIELD(MTEDESC, WRITE, 8, 1)
- #include "hw/intc/arm_gicv3.h"
+ FIELD(MTEDESC, ALIGN, 9, 3)
-+#include "hw/char/pl011.h"
+-FIELD(MTEDESC, SIZEM1, 12, SIMD_DATA_BITS - 12)  /* size - 1 */
++FIELD(MTEDESC, SIZEM1, 12, SIMD_DATA_BITS - SVE_MTEDESC_SHIFT - 12)  /* size - 1 */
- #define TYPE_XLNX_VERSAL "xlnx-versal"
- #define XLNX_VERSAL(obj) OBJECT_CHECK(Versal, (obj), TYPE_XLNX_VERSAL)
+ bool mte_probe(CPUARMState *env, uint32_t desc, uint64_t ptr);
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+ uint64_t mte_check(CPUARMState *env, uint32_t desc, uint64_t ptr, uintptr_t ra);
-         MemoryRegion mr_ocm;
+diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
          struct {
 -            SysBusDevice *uart[XLNX_VERSAL_NR_UARTS];
 +            PL011State uart[XLNX_VERSAL_NR_UARTS];
              SysBusDevice *gem[XLNX_VERSAL_NR_GEMS];
              SysBusDevice *adma[XLNX_VERSAL_NR_ADMAS];
          } iou;
 diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal.c
+--- a/target/arm/tcg/translate-sve.c
-+++ b/hw/arm/xlnx-versal.c
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
- #include "kvm_arm.h"
+ {
- #include "hw/misc/unimp.h"
+     unsigned vsz = vec_full_reg_size(s);
- #include "hw/arm/xlnx-versal.h"
+     TCGv_ptr t_pg;
--#include "hw/char/pl011.h"
++    uint32_t sizem1;
+     int desc = 0;
- #define XLNX_VERSAL_ACPU_TYPE ARM_CPU_TYPE_NAME("cortex-a72")
- #define GEM_REVISION        0x40070106
+     assert(mte_n >= 1 && mte_n <= 4);
-@@ -XXX,XX +XXX,XX @@ static void versal_create_uarts(Versal *s, qemu_irq *pic)
++    sizem1 = (mte_n << dtype_msz(dtype)) - 1;
-         DeviceState *dev;
++    assert(sizem1 <= R_MTEDESC_SIZEM1_MASK >> R_MTEDESC_SIZEM1_SHIFT);
-         MemoryRegion *mr;
+     if (s->mte_active[0]) {
+-        int msz = dtype_msz(dtype);
--        dev = qdev_create(NULL, TYPE_PL011);
+-
--        s->lpd.iou.uart[i] = SYS_BUS_DEVICE(dev);
+         desc = FIELD_DP32(desc, MTEDESC, MIDX, get_mem_index(s));
-+        sysbus_init_child_obj(OBJECT(s), name,
+         desc = FIELD_DP32(desc, MTEDESC, TBI, s->tbid);
-+                              &s->lpd.iou.uart[i], sizeof(s->lpd.iou.uart[i]),
+         desc = FIELD_DP32(desc, MTEDESC, TCMA, s->tcma);
-+                              TYPE_PL011);
+         desc = FIELD_DP32(desc, MTEDESC, WRITE, is_write);
-+        dev = DEVICE(&s->lpd.iou.uart[i]);
+-        desc = FIELD_DP32(desc, MTEDESC, SIZEM1, (mte_n << msz) - 1);
-         qdev_prop_set_chr(dev, "chardev", serial_hd(i));
++        desc = FIELD_DP32(desc, MTEDESC, SIZEM1, sizem1);
--        object_property_add_child(OBJECT(s), name, OBJECT(dev), &error_fatal);
+         desc <<= SVE_MTEDESC_SHIFT;
-         qdev_init_nofail(dev);
+     } else {
+         addr = clean_data_tbi(s, addr);
 -        mr = sysbus_mmio_get_region(s->lpd.iou.uart[i], 0);
 +        mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(dev), 0);
          memory_region_add_subregion(&s->mr_ps, addrs[i], mr);
 -        sysbus_connect_irq(s->lpd.iou.uart[i], 0, pic[irqs[i]]);
 +        sysbus_connect_irq(SYS_BUS_DEVICE(dev), 0, pic[irqs[i]]);
          g_free(name);
      }
  }
 --
-.20.1
+.34.1

-[PULL 17/39] hw/arm: versal: Add support for the RTC
+[PULL 05/35] target/arm: Split out make_svemte_desc
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Richard Henderson <richard.henderson@linaro.org>
-hw/arm: versal: Add support for the RTC.
+Share code that creates mtedesc and embeds within simd_desc.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Cc: qemu-stable@nongnu.org
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
-Message-id: 20200427181649.26851-10-edgar.iglesias@gmail.com
+Message-id: 20240207025210.8837-5-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h |  8 ++++++++
+ target/arm/tcg/translate-a64.h |  2 ++
- hw/arm/xlnx-versal.c         | 21 +++++++++++++++++++++
+ target/arm/tcg/translate-sme.c | 15 +++--------
-files changed, 29 insertions(+)
+ target/arm/tcg/translate-sve.c | 47 ++++++++++++++++++----------------
 files changed, 31 insertions(+), 33 deletions(-)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/target/arm/tcg/translate-a64.h b/target/arm/tcg/translate-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/target/arm/tcg/translate-a64.h
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/target/arm/tcg/translate-a64.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ bool logic_imm_decode_wmask(uint64_t *result, unsigned int immn,
- #include "hw/char/pl011.h"
+ bool sve_access_check(DisasContext *s);
- #include "hw/dma/xlnx-zdma.h"
+ bool sme_enabled_check(DisasContext *s);
- #include "hw/net/cadence_gem.h"
+ bool sme_enabled_check_with_svcr(DisasContext *s, unsigned);
-+#include "hw/rtc/xlnx-zynqmp-rtc.h"
++uint32_t make_svemte_desc(DisasContext *s, unsigned vsz, uint32_t nregs,
++                          uint32_t msz, bool is_write, uint32_t data);
- #define TYPE_XLNX_VERSAL "xlnx-versal"
- #define XLNX_VERSAL(obj) OBJECT_CHECK(Versal, (obj), TYPE_XLNX_VERSAL)
+ /* This function corresponds to CheckStreamingSVEEnabled. */
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+ static inline bool sme_sm_enabled_check(DisasContext *s)
-         struct {
+diff --git a/target/arm/tcg/translate-sme.c b/target/arm/tcg/translate-sme.c
-             SDHCIState sd[XLNX_VERSAL_NR_SDS];
+index XXXXXXX..XXXXXXX 100644
-         } iou;
+--- a/target/arm/tcg/translate-sme.c
 +++ b/target/arm/tcg/translate-sme.c
@@ -XXX,XX +XXX,XX @@ static bool trans_LDST1(DisasContext *s, arg_LDST1 *a)
      TCGv_ptr t_za, t_pg;
      TCGv_i64 addr;
 -    int svl, desc = 0;
 +    uint32_t desc;
      bool be = s->be_data == MO_BE;
      bool mte = s->mte_active[0];
@@ -XXX,XX +XXX,XX @@ static bool trans_LDST1(DisasContext *s, arg_LDST1 *a)
      tcg_gen_shli_i64(addr, cpu_reg(s, a->rm), a->esz);
      tcg_gen_add_i64(addr, addr, cpu_reg_sp(s, a->rn));
 -    if (mte) {
 -        desc = FIELD_DP32(desc, MTEDESC, MIDX, get_mem_index(s));
 -        desc = FIELD_DP32(desc, MTEDESC, TBI, s->tbid);
 -        desc = FIELD_DP32(desc, MTEDESC, TCMA, s->tcma);
 -        desc = FIELD_DP32(desc, MTEDESC, WRITE, a->st);
 -        desc = FIELD_DP32(desc, MTEDESC, SIZEM1, (1 << a->esz) - 1);
 -        desc <<= SVE_MTEDESC_SHIFT;
 -    } else {
 +    if (!mte) {
          addr = clean_data_tbi(s, addr);
      }
 -    svl = streaming_vec_reg_size(s);
 -    desc = simd_desc(svl, svl, desc);
 +
-+        XlnxZynqMPRTC rtc;
++    desc = make_svemte_desc(s, streaming_vec_reg_size(s), 1, a->esz, a->st, 0);
-     } pmc;
+     fns[a->esz][be][a->v][mte][a->st](tcg_env, t_za, t_pg, addr,
-     struct {
+                                       tcg_constant_i32(desc));
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
  #define VERSAL_GEM1_IRQ_0          58
  #define VERSAL_GEM1_WAKE_IRQ_0     59
  #define VERSAL_ADMA_IRQ_0          60
 +#define VERSAL_RTC_APB_ERR_IRQ     121
  #define VERSAL_SD0_IRQ_0           126
 +#define VERSAL_RTC_ALARM_IRQ       142
 +#define VERSAL_RTC_SECONDS_IRQ     143
  /* Architecturally reserved IRQs suitable for virtualization.  */
  #define VERSAL_RSVD_IRQ_FIRST 111
@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
  #define MM_PMC_SD0_SIZE             0x10000
  #define MM_PMC_CRP                  0xf1260000U
  #define MM_PMC_CRP_SIZE             0x10000
 +#define MM_PMC_RTC                  0xf12a0000
 +#define MM_PMC_RTC_SIZE             0x10000
  #endif
 diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal.c
+--- a/target/arm/tcg/translate-sve.c
-+++ b/hw/arm/xlnx-versal.c
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@ static void versal_create_sds(Versal *s, qemu_irq *pic)
+@@ -XXX,XX +XXX,XX @@ static const uint8_t dtype_esz[16] = {
-     }
+, 2, 1, 3
- }
+ };
-+static void versal_create_rtc(Versal *s, qemu_irq *pic)
+-static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
-+{
+-                       int dtype, uint32_t mte_n, bool is_write,
-+    SysBusDevice *sbd;
+-                       gen_helper_gvec_mem *fn)
-+    MemoryRegion *mr;
++uint32_t make_svemte_desc(DisasContext *s, unsigned vsz, uint32_t nregs,
 +                          uint32_t msz, bool is_write, uint32_t data)
  {
 -    unsigned vsz = vec_full_reg_size(s);
 -    TCGv_ptr t_pg;
      uint32_t sizem1;
 -    int desc = 0;
 +    uint32_t desc = 0;
 -    assert(mte_n >= 1 && mte_n <= 4);
 -    sizem1 = (mte_n << dtype_msz(dtype)) - 1;
 +    /* Assert all of the data fits, with or without MTE enabled. */
 +    assert(nregs >= 1 && nregs <= 4);
 +    sizem1 = (nregs << msz) - 1;
      assert(sizem1 <= R_MTEDESC_SIZEM1_MASK >> R_MTEDESC_SIZEM1_SHIFT);
 +    assert(data < 1u << SVE_MTEDESC_SHIFT);
 +
-+    sysbus_init_child_obj(OBJECT(s), "rtc", &s->pmc.rtc, sizeof(s->pmc.rtc),
+     if (s->mte_active[0]) {
-+                          TYPE_XLNX_ZYNQMP_RTC);
+         desc = FIELD_DP32(desc, MTEDESC, MIDX, get_mem_index(s));
-+    sbd = SYS_BUS_DEVICE(&s->pmc.rtc);
+         desc = FIELD_DP32(desc, MTEDESC, TBI, s->tbid);
-+    qdev_init_nofail(DEVICE(sbd));
+@@ -XXX,XX +XXX,XX @@ static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
-+
+         desc = FIELD_DP32(desc, MTEDESC, WRITE, is_write);
-+    mr = sysbus_mmio_get_region(sbd, 0);
+         desc = FIELD_DP32(desc, MTEDESC, SIZEM1, sizem1);
-+    memory_region_add_subregion(&s->mr_ps, MM_PMC_RTC, mr);
+         desc <<= SVE_MTEDESC_SHIFT;
-+
+-    } else {
-+    /*
++    }
-+     * TODO: Connect the ALARM and SECONDS interrupts once our RTC model
++    return simd_desc(vsz, vsz, desc | data);
 +     * supports them.
 +     */
 +    sysbus_connect_irq(sbd, 1, pic[VERSAL_RTC_APB_ERR_IRQ]);
 +}
 +
- /* This takes the board allocated linear DDR memory and creates aliases
++static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
-  * for each split DDR range/aperture on the Versal address map.
++                       int dtype, uint32_t nregs, bool is_write,
-  */
++                       gen_helper_gvec_mem *fn)
-@@ -XXX,XX +XXX,XX @@ static void versal_realize(DeviceState *dev, Error **errp)
++{
-     versal_create_gems(s, pic);
++    TCGv_ptr t_pg;
-     versal_create_admas(s, pic);
++    uint32_t desc;
-     versal_create_sds(s, pic);
++
-+    versal_create_rtc(s, pic);
++    if (!s->mte_active[0]) {
-     versal_map_ddr(s);
+         addr = clean_data_tbi(s, addr);
-     versal_unimp(s);
+     }
@@ -XXX,XX +XXX,XX @@ static void do_mem_zpa(DisasContext *s, int zt, int pg, TCGv_i64 addr,
       * registers as pointers, so encode the regno into the data field.
       * For consistency, do this even for LD1.
       */
 -    desc = simd_desc(vsz, vsz, zt | desc);
 +    desc = make_svemte_desc(s, vec_full_reg_size(s), nregs,
 +                            dtype_msz(dtype), is_write, zt);
      t_pg = tcg_temp_new_ptr();
      tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, pg));
@@ -XXX,XX +XXX,XX @@ static void do_mem_zpz(DisasContext *s, int zt, int pg, int zm,
                         int scale, TCGv_i64 scalar, int msz, bool is_write,
                         gen_helper_gvec_mem_scatter *fn)
  {
 -    unsigned vsz = vec_full_reg_size(s);
      TCGv_ptr t_zm = tcg_temp_new_ptr();
      TCGv_ptr t_pg = tcg_temp_new_ptr();
      TCGv_ptr t_zt = tcg_temp_new_ptr();
 -    int desc = 0;
 -
 -    if (s->mte_active[0]) {
 -        desc = FIELD_DP32(desc, MTEDESC, MIDX, get_mem_index(s));
 -        desc = FIELD_DP32(desc, MTEDESC, TBI, s->tbid);
 -        desc = FIELD_DP32(desc, MTEDESC, TCMA, s->tcma);
 -        desc = FIELD_DP32(desc, MTEDESC, WRITE, is_write);
 -        desc = FIELD_DP32(desc, MTEDESC, SIZEM1, (1 << msz) - 1);
 -        desc <<= SVE_MTEDESC_SHIFT;
 -    }
 -    desc = simd_desc(vsz, vsz, desc | scale);
 +    uint32_t desc;
      tcg_gen_addi_ptr(t_pg, tcg_env, pred_full_reg_offset(s, pg));
      tcg_gen_addi_ptr(t_zm, tcg_env, vec_full_reg_offset(s, zm));
      tcg_gen_addi_ptr(t_zt, tcg_env, vec_full_reg_offset(s, zt));
 +
 +    desc = make_svemte_desc(s, vec_full_reg_size(s), 1, msz, is_write, scale);
      fn(tcg_env, t_zt, t_pg, t_zm, scalar, tcg_constant_i32(desc));
  }
 --
-.20.1
+.34.1

-[PULL 35/39] target/arm: Convert Neon 3-reg-same VMAX/VMIN to decodetree
+[PULL 06/35] target/arm: Handle mte in do_ldrq, do_ldro
-Convert the Neon 3-reg-same VMAX and VMIN insns to decodetree.
+From: Richard Henderson <richard.henderson@linaro.org>
+These functions "use the standard load helpers", but
+fail to clean_data_tbi or populate mtedesc.
+Cc: qemu-stable@nongnu.org
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
+Message-id: 20240207025210.8837-6-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200430181003.21682-17-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       |  5 +++++
+ target/arm/tcg/translate-sve.c | 15 +++++++++++++--
- target/arm/translate-neon.inc.c | 14 ++++++++++++++
+file changed, 13 insertions(+), 2 deletions(-)
  target/arm/translate.c          | 21 ++-------------------
 files changed, 21 insertions(+), 19 deletions(-)
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+diff --git a/target/arm/tcg/translate-sve.c b/target/arm/tcg/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+--- a/target/arm/tcg/translate-sve.c
-+++ b/target/arm/neon-dp.decode
++++ b/target/arm/tcg/translate-sve.c
-@@ -XXX,XX +XXX,XX @@ VBSL_3s          1111 001 1 0 . 01 .... .... 0001 ... 1 .... @3same_logic
+@@ -XXX,XX +XXX,XX @@ static void do_ldrq(DisasContext *s, int zt, int pg, TCGv_i64 addr, int dtype)
- VBIT_3s          1111 001 1 0 . 10 .... .... 0001 ... 1 .... @3same_logic
+     unsigned vsz = vec_full_reg_size(s);
- VBIF_3s          1111 001 1 0 . 11 .... .... 0001 ... 1 .... @3same_logic
+     TCGv_ptr t_pg;
+     int poff;
-+VMAX_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 0 .... @3same
++    uint32_t desc;
-+VMAX_U_3s        1111 001 1 0 . .. .... .... 0110 . . . 0 .... @3same
-+VMIN_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 1 .... @3same
+     /* Load the first quadword using the normal predicated load helpers.  */
-+VMIN_U_3s        1111 001 1 0 . .. .... .... 0110 . . . 1 .... @3same
++    if (!s->mte_active[0]) {
-+
++        addr = clean_data_tbi(s, addr);
  VADD_3s          1111 001 0 0 . .. .... .... 1000 . . . 0 .... @3same
  VSUB_3s          1111 001 1 0 . .. .... .... 1000 . . . 0 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ DO_3SAME(VEOR, tcg_gen_gvec_xor)
  DO_3SAME_BITSEL(VBSL, rd_ofs, rn_ofs, rm_ofs)
  DO_3SAME_BITSEL(VBIT, rm_ofs, rn_ofs, rd_ofs)
  DO_3SAME_BITSEL(VBIF, rm_ofs, rd_ofs, rn_ofs)
 +
 +#define DO_3SAME_NO_SZ_3(INSN, FUNC)                                    \
 +    static bool trans_##INSN##_3s(DisasContext *s, arg_3same *a)        \
 +    {                                                                   \
 +        if (a->size == 3) {                                             \
 +            return false;                                               \
 +        }                                                               \
 +        return do_3same(s, a, FUNC);                                    \
 +    }
 +
-+DO_3SAME_NO_SZ_3(VMAX_S, tcg_gen_gvec_smax)
+     poff = pred_full_reg_offset(s, pg);
-+DO_3SAME_NO_SZ_3(VMAX_U, tcg_gen_gvec_umax)
+     if (vsz > 16) {
-+DO_3SAME_NO_SZ_3(VMIN_S, tcg_gen_gvec_smin)
+         /*
-+DO_3SAME_NO_SZ_3(VMIN_U, tcg_gen_gvec_umin)
+@@ -XXX,XX +XXX,XX @@ static void do_ldrq(DisasContext *s, int zt, int pg, TCGv_i64 addr, int dtype)
-diff --git a/target/arm/translate.c b/target/arm/translate.c
-index XXXXXXX..XXXXXXX 100644
+     gen_helper_gvec_mem *fn
---- a/target/arm/translate.c
+         = ldr_fns[s->mte_active[0]][s->be_data == MO_BE][dtype][0];
-+++ b/target/arm/translate.c
+-    fn(tcg_env, t_pg, addr, tcg_constant_i32(simd_desc(16, 16, zt)));
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
++    desc = make_svemte_desc(s, 16, 1, dtype_msz(dtype), false, zt);
-                              rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size);
++    fn(tcg_env, t_pg, addr, tcg_constant_i32(desc));
-             return 0;
+     /* Replicate that first quadword.  */
--        case NEON_3R_VMAX:
+     if (vsz > 16) {
--            if (u) {
+@@ -XXX,XX +XXX,XX @@ static void do_ldro(DisasContext *s, int zt, int pg, TCGv_i64 addr, int dtype)
--                tcg_gen_gvec_umax(size, rd_ofs, rn_ofs, rm_ofs,
+     unsigned vsz_r32;
--                                  vec_size, vec_size);
+     TCGv_ptr t_pg;
--            } else {
+     int poff, doff;
--                tcg_gen_gvec_smax(size, rd_ofs, rn_ofs, rm_ofs,
++    uint32_t desc;
--                                  vec_size, vec_size);
--            }
+     if (vsz < 32) {
--            return 0;
+         /*
--        case NEON_3R_VMIN:
+@@ -XXX,XX +XXX,XX @@ static void do_ldro(DisasContext *s, int zt, int pg, TCGv_i64 addr, int dtype)
--            if (u) {
+     }
--                tcg_gen_gvec_umin(size, rd_ofs, rn_ofs, rm_ofs,
--                                  vec_size, vec_size);
+     /* Load the first octaword using the normal predicated load helpers.  */
--            } else {
++    if (!s->mte_active[0]) {
--                tcg_gen_gvec_smin(size, rd_ofs, rn_ofs, rm_ofs,
++        addr = clean_data_tbi(s, addr);
--                                  vec_size, vec_size);
++    }
--            }
--            return 0;
+     poff = pred_full_reg_offset(s, pg);
--
+     if (vsz > 32) {
-         case NEON_3R_VSHL:
+@@ -XXX,XX +XXX,XX @@ static void do_ldro(DisasContext *s, int zt, int pg, TCGv_i64 addr, int dtype)
-             /* Note the operation is vshl vd,vm,vn */
-             tcg_gen_gvec_3(rd_ofs, rm_ofs, rn_ofs, vec_size, vec_size,
+     gen_helper_gvec_mem *fn
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
+         = ldr_fns[s->mte_active[0]][s->be_data == MO_BE][dtype][0];
+-    fn(tcg_env, t_pg, addr, tcg_constant_i32(simd_desc(32, 32, zt)));
-         case NEON_3R_VADD_VSUB:
++    desc = make_svemte_desc(s, 32, 1, dtype_msz(dtype), false, zt);
-         case NEON_3R_LOGIC:
++    fn(tcg_env, t_pg, addr, tcg_constant_i32(desc));
-+        case NEON_3R_VMAX:
-+        case NEON_3R_VMIN:
+     /*
-             /* Already handled by decodetree */
+      * Replicate that first octaword.
              return 1;
          }
 --
-.20.1
+.34.1

-[PULL 11/39] hw/arm: versal-virt: Fix typo xlnx-ve -> xlnx-versal
+[PULL 07/35] target/arm: Fix SVE/SME gross MTE suppression checks
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Richard Henderson <richard.henderson@linaro.org>
-Fix typo xlnx-ve -> xlnx-versal.
+The TBI and TCMA bits are located within mtedesc, not desc.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Cc: qemu-stable@nongnu.org
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Tested-by: Gustavo Romero <gustavo.romero@linaro.org>
-Message-id: 20200427181649.26851-4-edgar.iglesias@gmail.com
+Message-id: 20240207025210.8837-7-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/xlnx-versal-virt.c | 2 +-
+ target/arm/tcg/sme_helper.c |  8 ++++----
-file changed, 1 insertion(+), 1 deletion(-)
+ target/arm/tcg/sve_helper.c | 12 ++++++------
 files changed, 10 insertions(+), 10 deletions(-)
-diff --git a/hw/arm/xlnx-versal-virt.c b/hw/arm/xlnx-versal-virt.c
+diff --git a/target/arm/tcg/sme_helper.c b/target/arm/tcg/sme_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal-virt.c
+--- a/target/arm/tcg/sme_helper.c
-+++ b/hw/arm/xlnx-versal-virt.c
++++ b/target/arm/tcg/sme_helper.c
-@@ -XXX,XX +XXX,XX @@ static void versal_virt_init(MachineState *machine)
+@@ -XXX,XX +XXX,XX @@ void sme_ld1_mte(CPUARMState *env, void *za, uint64_t *vg,
-         psci_conduit = QEMU_PSCI_CONDUIT_SMC;
+     desc = extract32(desc, 0, SIMD_DATA_SHIFT + SVE_MTEDESC_SHIFT);
      /* Perform gross MTE suppression early. */
 -    if (!tbi_check(desc, bit55) ||
 -        tcma_check(desc, bit55, allocation_tag_from_addr(addr))) {
 +    if (!tbi_check(mtedesc, bit55) ||
 +        tcma_check(mtedesc, bit55, allocation_tag_from_addr(addr))) {
          mtedesc = 0;
      }
--    sysbus_init_child_obj(OBJECT(machine), "xlnx-ve", &s->soc,
+@@ -XXX,XX +XXX,XX @@ void sme_st1_mte(CPUARMState *env, void *za, uint64_t *vg, target_ulong addr,
-+    sysbus_init_child_obj(OBJECT(machine), "xlnx-versal", &s->soc,
+     desc = extract32(desc, 0, SIMD_DATA_SHIFT + SVE_MTEDESC_SHIFT);
-                           sizeof(s->soc), TYPE_XLNX_VERSAL);
-     object_property_set_link(OBJECT(&s->soc), OBJECT(machine->ram),
+     /* Perform gross MTE suppression early. */
-                              "ddr", &error_abort);
+-    if (!tbi_check(desc, bit55) ||
 -        tcma_check(desc, bit55, allocation_tag_from_addr(addr))) {
 +    if (!tbi_check(mtedesc, bit55) ||
 +        tcma_check(mtedesc, bit55, allocation_tag_from_addr(addr))) {
          mtedesc = 0;
      }
 diff --git a/target/arm/tcg/sve_helper.c b/target/arm/tcg/sve_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/tcg/sve_helper.c
 +++ b/target/arm/tcg/sve_helper.c
@@ -XXX,XX +XXX,XX @@ void sve_ldN_r_mte(CPUARMState *env, uint64_t *vg, target_ulong addr,
      desc = extract32(desc, 0, SIMD_DATA_SHIFT + SVE_MTEDESC_SHIFT);
      /* Perform gross MTE suppression early. */
 -    if (!tbi_check(desc, bit55) ||
 -        tcma_check(desc, bit55, allocation_tag_from_addr(addr))) {
 +    if (!tbi_check(mtedesc, bit55) ||
 +        tcma_check(mtedesc, bit55, allocation_tag_from_addr(addr))) {
          mtedesc = 0;
      }
@@ -XXX,XX +XXX,XX @@ void sve_ldnfff1_r_mte(CPUARMState *env, void *vg, target_ulong addr,
      desc = extract32(desc, 0, SIMD_DATA_SHIFT + SVE_MTEDESC_SHIFT);
      /* Perform gross MTE suppression early. */
 -    if (!tbi_check(desc, bit55) ||
 -        tcma_check(desc, bit55, allocation_tag_from_addr(addr))) {
 +    if (!tbi_check(mtedesc, bit55) ||
 +        tcma_check(mtedesc, bit55, allocation_tag_from_addr(addr))) {
          mtedesc = 0;
      }
@@ -XXX,XX +XXX,XX @@ void sve_stN_r_mte(CPUARMState *env, uint64_t *vg, target_ulong addr,
      desc = extract32(desc, 0, SIMD_DATA_SHIFT + SVE_MTEDESC_SHIFT);
      /* Perform gross MTE suppression early. */
 -    if (!tbi_check(desc, bit55) ||
 -        tcma_check(desc, bit55, allocation_tag_from_addr(addr))) {
 +    if (!tbi_check(mtedesc, bit55) ||
 +        tcma_check(mtedesc, bit55, allocation_tag_from_addr(addr))) {
          mtedesc = 0;
      }
 --
-.20.1
+.34.1

-[PULL 39/39] target/arm: Move gen_ function typedefs to translate.h
+[PULL 08/35] hw/pci-host/raven.c: Mark raven_io_ops as implementing unaligned accesses
-We're going to want at least some of the NeonGen* typedefs
+The raven_io_ops MemoryRegionOps is the only one in the source tree
-for the refactored 32-bit Neon decoder, so move them all
+which sets .valid.unaligned to indicate that it should support
-to translate.h since it makes more sense to keep them in
+unaligned accesses and which does not also set .impl.unaligned to
-one group.
+indicate that its read and write functions can do the unaligned
 handling themselves.  This is a problem, because at the moment the
 core memory system does not implement the support for handling
 unaligned accesses by doing a series of aligned accesses and
 combining them (system/memory.c:access_with_adjusted_size() has a
 TODO comment noting this).
+Fortunately raven_io_read() and raven_io_write() will correctly deal
+with the case of being passed an unaligned address, so we can fix the
+missing unaligned access support by setting .impl.unaligned in the
+MemoryRegionOps struct.
+Fixes: 9a1839164c9c8f06 ("raven: Implement non-contiguous I/O region")
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Tested-by: Cédric Le Goater <clg@redhat.com>
-Message-id: 20200430181003.21682-23-peter.maydell@linaro.org
+Reviewed-by: Cédric Le Goater <clg@redhat.com>
 Message-id: 20240112134640.1775041-1-peter.maydell@linaro.org
 ---
- target/arm/translate.h     | 17 +++++++++++++++++
+ hw/pci-host/raven.c | 1 +
- target/arm/translate-a64.c | 17 -----------------
+file changed, 1 insertion(+)
 files changed, 17 insertions(+), 17 deletions(-)
-diff --git a/target/arm/translate.h b/target/arm/translate.h
+diff --git a/hw/pci-host/raven.c b/hw/pci-host/raven.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.h
+--- a/hw/pci-host/raven.c
-+++ b/target/arm/translate.h
++++ b/hw/pci-host/raven.c
-@@ -XXX,XX +XXX,XX @@ typedef void GVecGen3Fn(unsigned, uint32_t, uint32_t,
+@@ -XXX,XX +XXX,XX @@ static const MemoryRegionOps raven_io_ops = {
- typedef void GVecGen4Fn(unsigned, uint32_t, uint32_t, uint32_t,
+     .write = raven_io_write,
-                         uint32_t, uint32_t, uint32_t);
+     .endianness = DEVICE_LITTLE_ENDIAN,
+     .impl.max_access_size = 4,
-+/* Function prototype for gen_ functions for calling Neon helpers */
++    .impl.unaligned = true,
-+typedef void NeonGenOneOpEnvFn(TCGv_i32, TCGv_ptr, TCGv_i32);
+     .valid.unaligned = true,
-+typedef void NeonGenTwoOpFn(TCGv_i32, TCGv_i32, TCGv_i32);
+ };
-+typedef void NeonGenTwoOpEnvFn(TCGv_i32, TCGv_ptr, TCGv_i32, TCGv_i32);
 +typedef void NeonGenTwo64OpFn(TCGv_i64, TCGv_i64, TCGv_i64);
 +typedef void NeonGenTwo64OpEnvFn(TCGv_i64, TCGv_ptr, TCGv_i64, TCGv_i64);
 +typedef void NeonGenNarrowFn(TCGv_i32, TCGv_i64);
 +typedef void NeonGenNarrowEnvFn(TCGv_i32, TCGv_ptr, TCGv_i64);
 +typedef void NeonGenWidenFn(TCGv_i64, TCGv_i32);
 +typedef void NeonGenTwoSingleOPFn(TCGv_i32, TCGv_i32, TCGv_i32, TCGv_ptr);
 +typedef void NeonGenTwoDoubleOPFn(TCGv_i64, TCGv_i64, TCGv_i64, TCGv_ptr);
 +typedef void NeonGenOneOpFn(TCGv_i64, TCGv_i64);
 +typedef void CryptoTwoOpFn(TCGv_ptr, TCGv_ptr);
 +typedef void CryptoThreeOpIntFn(TCGv_ptr, TCGv_ptr, TCGv_i32);
 +typedef void CryptoThreeOpFn(TCGv_ptr, TCGv_ptr, TCGv_ptr);
 +typedef void AtomicThreeOpFn(TCGv_i64, TCGv_i64, TCGv_i64, TCGArg, MemOp);
 +
  #endif /* TARGET_ARM_TRANSLATE_H */
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ typedef struct AArch64DecodeTable {
      AArch64DecodeFn *disas_fn;
  } AArch64DecodeTable;
 -/* Function prototype for gen_ functions for calling Neon helpers */
 -typedef void NeonGenOneOpEnvFn(TCGv_i32, TCGv_ptr, TCGv_i32);
 -typedef void NeonGenTwoOpFn(TCGv_i32, TCGv_i32, TCGv_i32);
 -typedef void NeonGenTwoOpEnvFn(TCGv_i32, TCGv_ptr, TCGv_i32, TCGv_i32);
 -typedef void NeonGenTwo64OpFn(TCGv_i64, TCGv_i64, TCGv_i64);
 -typedef void NeonGenTwo64OpEnvFn(TCGv_i64, TCGv_ptr, TCGv_i64, TCGv_i64);
 -typedef void NeonGenNarrowFn(TCGv_i32, TCGv_i64);
 -typedef void NeonGenNarrowEnvFn(TCGv_i32, TCGv_ptr, TCGv_i64);
 -typedef void NeonGenWidenFn(TCGv_i64, TCGv_i32);
 -typedef void NeonGenTwoSingleOPFn(TCGv_i32, TCGv_i32, TCGv_i32, TCGv_ptr);
 -typedef void NeonGenTwoDoubleOPFn(TCGv_i64, TCGv_i64, TCGv_i64, TCGv_ptr);
 -typedef void NeonGenOneOpFn(TCGv_i64, TCGv_i64);
 -typedef void CryptoTwoOpFn(TCGv_ptr, TCGv_ptr);
 -typedef void CryptoThreeOpIntFn(TCGv_ptr, TCGv_ptr, TCGv_i32);
 -typedef void CryptoThreeOpFn(TCGv_ptr, TCGv_ptr, TCGv_ptr);
 -typedef void AtomicThreeOpFn(TCGv_i64, TCGv_i64, TCGv_i64, TCGArg, MemOp);
 -
  /* initialize TCG globals.  */
  void a64_translate_init(void)
  {
 --
-.20.1
+.34.1

-[PULL 37/39] target/arm: Convert Neon 3-reg-same VQADD/VQSUB to decodetree
+[PULL 09/35] hw/block/tc58128: Don't emit deprecation warning under qtest
-Convert the Neon VQADD/VQSUB insns in the 3-reg-same grouping
+Suppress the deprecation warning when we're running under qtest,
-to decodetree.
+to avoid "make check" including warning messages in its output.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-19-peter.maydell@linaro.org
+Message-id: 20240206154151.155620-1-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       |  6 ++++++
+ hw/block/tc58128.c | 4 +++-
- target/arm/translate-neon.inc.c | 15 +++++++++++++++
+file changed, 3 insertions(+), 1 deletion(-)
  target/arm/translate.c          | 14 ++------------
 files changed, 23 insertions(+), 12 deletions(-)
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+diff --git a/hw/block/tc58128.c b/hw/block/tc58128.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+--- a/hw/block/tc58128.c
-+++ b/target/arm/neon-dp.decode
++++ b/hw/block/tc58128.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static sh7750_io_device tc58128 = {
- @3same           .... ... . . . size:2 .... .... .... . q:1 . . .... \
-                  &3same vm=%vm_dp vn=%vn_dp vd=%vd_dp
+ int tc58128_init(struct SH7750State *s, const char *zone1, const char *zone2)
+ {
-+VQADD_S_3s       1111 001 0 0 . .. .... .... 0000 . . . 1 .... @3same
+-    warn_report_once("The TC58128 flash device is deprecated");
-+VQADD_U_3s       1111 001 1 0 . .. .... .... 0000 . . . 1 .... @3same
++    if (!qtest_enabled()) {
-+
++        warn_report_once("The TC58128 flash device is deprecated");
- @3same_logic     .... ... . . . .. .... .... .... . q:1 .. .... \
++    }
-                  &3same vm=%vm_dp vn=%vn_dp vd=%vd_dp size=0
+     init_dev(&tc58128_devs[0], zone1);
+     init_dev(&tc58128_devs[1], zone2);
-@@ -XXX,XX +XXX,XX @@ VBSL_3s          1111 001 1 0 . 01 .... .... 0001 ... 1 .... @3same_logic
+     return sh7750_register_io_device(s, &tc58128);
  VBIT_3s          1111 001 1 0 . 10 .... .... 0001 ... 1 .... @3same_logic
  VBIF_3s          1111 001 1 0 . 11 .... .... 0001 ... 1 .... @3same_logic
 +VQSUB_S_3s       1111 001 0 0 . .. .... .... 0010 . . . 1 .... @3same
 +VQSUB_U_3s       1111 001 1 0 . .. .... .... 0010 . . . 1 .... @3same
 +
  VCGT_S_3s        1111 001 0 0 . .. .... .... 0011 . . . 0 .... @3same
  VCGT_U_3s        1111 001 1 0 . .. .... .... 0011 . . . 0 .... @3same
  VCGE_S_3s        1111 001 0 0 . .. .... .... 0011 . . . 1 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static void gen_VTST_3s(unsigned vece, uint32_t rd_ofs, uint32_t rn_ofs,
      tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz, &cmtst_op[vece]);
  }
  DO_3SAME_NO_SZ_3(VTST, gen_VTST_3s)
 +
 +#define DO_3SAME_GVEC4(INSN, OPARRAY)                                   \
 +    static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
 +                                uint32_t rn_ofs, uint32_t rm_ofs,       \
 +                                uint32_t oprsz, uint32_t maxsz)         \
 +    {                                                                   \
 +        tcg_gen_gvec_4(rd_ofs, offsetof(CPUARMState, vfp.qc),           \
 +                       rn_ofs, rm_ofs, oprsz, maxsz, &OPARRAY[vece]);   \
 +    }                                                                   \
 +    DO_3SAME(INSN, gen_##INSN##_3s)
 +
 +DO_3SAME_GVEC4(VQADD_S, sqadd_op)
 +DO_3SAME_GVEC4(VQADD_U, uqadd_op)
 +DO_3SAME_GVEC4(VQSUB_S, sqsub_op)
 +DO_3SAME_GVEC4(VQSUB_U, uqsub_op)
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
              }
              return 1;
 -        case NEON_3R_VQADD:
 -            tcg_gen_gvec_4(rd_ofs, offsetof(CPUARMState, vfp.qc),
 -                           rn_ofs, rm_ofs, vec_size, vec_size,
 -                           (u ? uqadd_op : sqadd_op) + size);
 -            return 0;
 -
 -        case NEON_3R_VQSUB:
 -            tcg_gen_gvec_4(rd_ofs, offsetof(CPUARMState, vfp.qc),
 -                           rn_ofs, rm_ofs, vec_size, vec_size,
 -                           (u ? uqsub_op : sqsub_op) + size);
 -            return 0;
 -
          case NEON_3R_VMUL: /* VMUL */
              if (u) {
                  /* Polynomial case allows only P8.  */
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
          case NEON_3R_VTST_VCEQ:
          case NEON_3R_VCGT:
          case NEON_3R_VCGE:
 +        case NEON_3R_VQADD:
 +        case NEON_3R_VQSUB:
              /* Already handled by decodetree */
              return 1;
          }
 --
-.20.1
+.34.1

-[PULL 36/39] target/arm: Convert Neon 3-reg-same comparisons to decodetree
+[PULL 10/35] tests/qtest/meson.build: Don't include qtests_npcm7xx in qtests_aarch64
-Convert the Neon comparison ops in the 3-reg-same grouping
+We deliberately don't include qtests_npcm7xx in qtests_aarch64,
-to decodetree.
+because we already get the coverage of those tests via qtests_arm,
 and we don't want to use extra CI minutes testing them twice.
+In commit 327b680877b79c4b we added it to qtests_aarch64; revert
+that change.
+Fixes: 327b680877b79c4b ("tests/qtest: Creating qtest for GMAC Module")
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-18-peter.maydell@linaro.org
+Message-id: 20240206163043.315535-1-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       |  8 ++++++++
+ tests/qtest/meson.build | 1 -
- target/arm/translate-neon.inc.c | 22 ++++++++++++++++++++++
+file changed, 1 deletion(-)
  target/arm/translate.c          | 23 +++--------------------
 files changed, 33 insertions(+), 20 deletions(-)
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+--- a/tests/qtest/meson.build
-+++ b/target/arm/neon-dp.decode
++++ b/tests/qtest/meson.build
-@@ -XXX,XX +XXX,XX @@ VBSL_3s          1111 001 1 0 . 01 .... .... 0001 ... 1 .... @3same_logic
+@@ -XXX,XX +XXX,XX @@ qtests_aarch64 = \
- VBIT_3s          1111 001 1 0 . 10 .... .... 0001 ... 1 .... @3same_logic
+   (config_all_devices.has_key('CONFIG_RASPI') ? ['bcm2835-dma-test'] : []) +  \
- VBIF_3s          1111 001 1 0 . 11 .... .... 0001 ... 1 .... @3same_logic
+   (config_all_accel.has_key('CONFIG_TCG') and                                            \
+    config_all_devices.has_key('CONFIG_TPM_TIS_I2C') ? ['tpm-tis-i2c-test'] : []) + \
-+VCGT_S_3s        1111 001 0 0 . .. .... .... 0011 . . . 0 .... @3same
+-  (config_all_devices.has_key('CONFIG_NPCM7XX') ? qtests_npcm7xx : []) + \
-+VCGT_U_3s        1111 001 1 0 . .. .... .... 0011 . . . 0 .... @3same
+   ['arm-cpu-features',
-+VCGE_S_3s        1111 001 0 0 . .. .... .... 0011 . . . 1 .... @3same
+    'numa-test',
-+VCGE_U_3s        1111 001 1 0 . .. .... .... 0011 . . . 1 .... @3same
+    'boot-serial-test',
 +
  VMAX_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 0 .... @3same
  VMAX_U_3s        1111 001 1 0 . .. .... .... 0110 . . . 0 .... @3same
  VMIN_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 1 .... @3same
@@ -XXX,XX +XXX,XX @@ VMIN_U_3s        1111 001 1 0 . .. .... .... 0110 . . . 1 .... @3same
  VADD_3s          1111 001 0 0 . .. .... .... 1000 . . . 0 .... @3same
  VSUB_3s          1111 001 1 0 . .. .... .... 1000 . . . 0 .... @3same
 +
 +VTST_3s          1111 001 0 0 . .. .... .... 1000 . . . 1 .... @3same
 +VCEQ_3s          1111 001 1 0 . .. .... .... 1000 . . . 1 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ DO_3SAME_NO_SZ_3(VMAX_S, tcg_gen_gvec_smax)
  DO_3SAME_NO_SZ_3(VMAX_U, tcg_gen_gvec_umax)
  DO_3SAME_NO_SZ_3(VMIN_S, tcg_gen_gvec_smin)
  DO_3SAME_NO_SZ_3(VMIN_U, tcg_gen_gvec_umin)
 +
 +#define DO_3SAME_CMP(INSN, COND)                                        \
 +    static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
 +                                uint32_t rn_ofs, uint32_t rm_ofs,       \
 +                                uint32_t oprsz, uint32_t maxsz)         \
 +    {                                                                   \
 +        tcg_gen_gvec_cmp(COND, vece, rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz); \
 +    }                                                                   \
 +    DO_3SAME_NO_SZ_3(INSN, gen_##INSN##_3s)
 +
 +DO_3SAME_CMP(VCGT_S, TCG_COND_GT)
 +DO_3SAME_CMP(VCGT_U, TCG_COND_GTU)
 +DO_3SAME_CMP(VCGE_S, TCG_COND_GE)
 +DO_3SAME_CMP(VCGE_U, TCG_COND_GEU)
 +DO_3SAME_CMP(VCEQ, TCG_COND_EQ)
 +
 +static void gen_VTST_3s(unsigned vece, uint32_t rd_ofs, uint32_t rn_ofs,
 +                         uint32_t rm_ofs, uint32_t oprsz, uint32_t maxsz)
 +{
 +    tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz, &cmtst_op[vece]);
 +}
 +DO_3SAME_NO_SZ_3(VTST, gen_VTST_3s)
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
                             u ? &mls_op[size] : &mla_op[size]);
              return 0;
 -        case NEON_3R_VTST_VCEQ:
 -            if (u) { /* VCEQ */
 -                tcg_gen_gvec_cmp(TCG_COND_EQ, size, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -            } else { /* VTST */
 -                tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs,
 -                               vec_size, vec_size, &cmtst_op[size]);
 -            }
 -            return 0;
 -
 -        case NEON_3R_VCGT:
 -            tcg_gen_gvec_cmp(u ? TCG_COND_GTU : TCG_COND_GT, size,
 -                             rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size);
 -            return 0;
 -
 -        case NEON_3R_VCGE:
 -            tcg_gen_gvec_cmp(u ? TCG_COND_GEU : TCG_COND_GE, size,
 -                             rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size);
 -            return 0;
 -
          case NEON_3R_VSHL:
              /* Note the operation is vshl vd,vm,vn */
              tcg_gen_gvec_3(rd_ofs, rm_ofs, rn_ofs, vec_size, vec_size,
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
          case NEON_3R_LOGIC:
          case NEON_3R_VMAX:
          case NEON_3R_VMIN:
 +        case NEON_3R_VTST_VCEQ:
 +        case NEON_3R_VCGT:
 +        case NEON_3R_VCGE:
              /* Already handled by decodetree */
              return 1;
          }
 --
-.20.1
+.34.1

-[PULL 34/39] target/arm: Convert Neon 3-reg-same logic ops to decodetree
+[PULL 11/35] tests/qtest/bios-tables-test: Allow changes to virt GTDT
-Convert the Neon logic ops in the 3-reg-same grouping to decodetree.
+Allow changes to the virt GTDT -- we are going to add the IRQ
-Note that for the logic ops the 'size' field forms part of their
+entry for a new timer to it.
 decode and the actual operations are always bitwise.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Ard Biesheuvel <ardb@kernel.org>
-Message-id: 20200430181003.21682-16-peter.maydell@linaro.org
+Message-id: 20240122143537.233498-2-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       | 12 +++++++++++
+ tests/qtest/bios-tables-test-allowed-diff.h | 2 ++
- target/arm/translate-neon.inc.c | 19 +++++++++++++++++
+file changed, 2 insertions(+)
  target/arm/translate.c          | 38 +--------------------------------
 files changed, 32 insertions(+), 37 deletions(-)
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+diff --git a/tests/qtest/bios-tables-test-allowed-diff.h b/tests/qtest/bios-tables-test-allowed-diff.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+--- a/tests/qtest/bios-tables-test-allowed-diff.h
-+++ b/target/arm/neon-dp.decode
++++ b/tests/qtest/bios-tables-test-allowed-diff.h
-@@ -XXX,XX +XXX,XX @@
+@@ -1 +1,3 @@
- @3same           .... ... . . . size:2 .... .... .... . q:1 . . .... \
+ /* List of comma-separated changed AML files to ignore */
-                  &3same vm=%vm_dp vn=%vn_dp vd=%vd_dp
++"tests/data/acpi/virt/FACP",
++"tests/data/acpi/virt/GTDT",
 +@3same_logic     .... ... . . . .. .... .... .... . q:1 .. .... \
 +                 &3same vm=%vm_dp vn=%vn_dp vd=%vd_dp size=0
 +
 +VAND_3s          1111 001 0 0 . 00 .... .... 0001 ... 1 .... @3same_logic
 +VBIC_3s          1111 001 0 0 . 01 .... .... 0001 ... 1 .... @3same_logic
 +VORR_3s          1111 001 0 0 . 10 .... .... 0001 ... 1 .... @3same_logic
 +VORN_3s          1111 001 0 0 . 11 .... .... 0001 ... 1 .... @3same_logic
 +VEOR_3s          1111 001 1 0 . 00 .... .... 0001 ... 1 .... @3same_logic
 +VBSL_3s          1111 001 1 0 . 01 .... .... 0001 ... 1 .... @3same_logic
 +VBIT_3s          1111 001 1 0 . 10 .... .... 0001 ... 1 .... @3same_logic
 +VBIF_3s          1111 001 1 0 . 11 .... .... 0001 ... 1 .... @3same_logic
 +
  VADD_3s          1111 001 0 0 . .. .... .... 1000 . . . 0 .... @3same
  VSUB_3s          1111 001 1 0 . .. .... .... 1000 . . . 0 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool do_3same(DisasContext *s, arg_3same *a, GVecGen3Fn fn)
  DO_3SAME(VADD, tcg_gen_gvec_add)
  DO_3SAME(VSUB, tcg_gen_gvec_sub)
 +DO_3SAME(VAND, tcg_gen_gvec_and)
 +DO_3SAME(VBIC, tcg_gen_gvec_andc)
 +DO_3SAME(VORR, tcg_gen_gvec_or)
 +DO_3SAME(VORN, tcg_gen_gvec_orc)
 +DO_3SAME(VEOR, tcg_gen_gvec_xor)
 +
 +/* These insns are all gvec_bitsel but with the inputs in various orders. */
 +#define DO_3SAME_BITSEL(INSN, O1, O2, O3)                               \
 +    static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
 +                                uint32_t rn_ofs, uint32_t rm_ofs,       \
 +                                uint32_t oprsz, uint32_t maxsz)         \
 +    {                                                                   \
 +        tcg_gen_gvec_bitsel(vece, rd_ofs, O1, O2, O3, oprsz, maxsz);    \
 +    }                                                                   \
 +    DO_3SAME(INSN, gen_##INSN##_3s)
 +
 +DO_3SAME_BITSEL(VBSL, rd_ofs, rn_ofs, rm_ofs)
 +DO_3SAME_BITSEL(VBIT, rm_ofs, rn_ofs, rd_ofs)
 +DO_3SAME_BITSEL(VBIF, rm_ofs, rd_ofs, rn_ofs)
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
              }
              return 1;
 -        case NEON_3R_LOGIC: /* Logic ops.  */
 -            switch ((u << 2) | size) {
 -            case 0: /* VAND */
 -                tcg_gen_gvec_and(0, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -                break;
 -            case 1: /* VBIC */
 -                tcg_gen_gvec_andc(0, rd_ofs, rn_ofs, rm_ofs,
 -                                  vec_size, vec_size);
 -                break;
 -            case 2: /* VORR */
 -                tcg_gen_gvec_or(0, rd_ofs, rn_ofs, rm_ofs,
 -                                vec_size, vec_size);
 -                break;
 -            case 3: /* VORN */
 -                tcg_gen_gvec_orc(0, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -                break;
 -            case 4: /* VEOR */
 -                tcg_gen_gvec_xor(0, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -                break;
 -            case 5: /* VBSL */
 -                tcg_gen_gvec_bitsel(MO_8, rd_ofs, rd_ofs, rn_ofs, rm_ofs,
 -                                    vec_size, vec_size);
 -                break;
 -            case 6: /* VBIT */
 -                tcg_gen_gvec_bitsel(MO_8, rd_ofs, rm_ofs, rn_ofs, rd_ofs,
 -                                    vec_size, vec_size);
 -                break;
 -            case 7: /* VBIF */
 -                tcg_gen_gvec_bitsel(MO_8, rd_ofs, rm_ofs, rd_ofs, rn_ofs,
 -                                    vec_size, vec_size);
 -                break;
 -            }
 -            return 0;
 -
          case NEON_3R_VQADD:
              tcg_gen_gvec_4(rd_ofs, offsetof(CPUARMState, vfp.qc),
                             rn_ofs, rm_ofs, vec_size, vec_size,
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
              return 0;
          case NEON_3R_VADD_VSUB:
 +        case NEON_3R_LOGIC:
              /* Already handled by decodetree */
              return 1;
          }
 --
-.20.1
+.34.1

-[PULL 38/39] target/arm: Convert Neon 3-reg-same VMUL, VMLA, VMLS, VSHL to decodetree
+[PULL 12/35] hw/arm/virt: Wire up non-secure EL2 virtual timer IRQ
-Convert the Neon VMUL, VMLA, VMLS and VSHL insns in the
+Armv8.1+ CPUs have the Virtual Host Extension (VHE) which adds a
--reg-same grouping to decodetree.
+non-secure EL2 virtual timer.  We implemented the timer itself in the
 CPU model, but never wired up its IRQ line to the GIC.
 Wire up the IRQ line (this is always safe whether the CPU has the
 interrupt or not, since it always creates the outbound IRQ line).
 Report it to the guest via dtb and ACPI if the CPU has the feature.
 The DTB binding is documented in the kernel's
 Documentation/devicetree/bindings/timer/arm\,arch_timer.yaml
 and the ACPI table entries are documented in the ACPI specification
 version 6.3 or later.
 Because the IRQ line ACPI binding is new in 6.3, we need to bump the
 FADT table rev to show that we might be using 6.3 features.
 Note that exposing this IRQ in the DTB will trigger a bug in EDK2
 versions prior to edk2-stable202311, for users who use the virt board
 with 'virtualization=on' to enable EL2 emulation and are booting an
 EDK2 guest BIOS, if that EDK2 has assertions enabled.  The effect is
 that EDK2 will assert on bootup:
  ASSERT [ArmTimerDxe] /home/kraxel/projects/qemu/roms/edk2/ArmVirtPkg/Library/ArmVirtTimerFdtClientLib/ArmVirtTimerFdtClientLib.c(72): PropSize == 36 || PropSize == 48
 If you see that assertion you should do one of:
  * update your EDK2 binaries to edk2-stable202311 or newer
  * use the 'virt-8.2' versioned machine type
  * not use 'virtualization=on'
 (The versions shipped with QEMU itself have the fix.)
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Ard Biesheuvel <ardb@kernel.org>
-Message-id: 20200430181003.21682-20-peter.maydell@linaro.org
+Message-id: 20240122143537.233498-3-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       |  9 +++++++
+ include/hw/arm/virt.h    |  2 ++
- target/arm/translate-neon.inc.c | 44 +++++++++++++++++++++++++++++++++
+ hw/arm/virt-acpi-build.c | 20 ++++++++++----
- target/arm/translate.c          | 28 +++------------------
+ hw/arm/virt.c            | 60 ++++++++++++++++++++++++++++++++++------
-files changed, 56 insertions(+), 25 deletions(-)
+files changed, 67 insertions(+), 15 deletions(-)
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+diff --git a/include/hw/arm/virt.h b/include/hw/arm/virt.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+--- a/include/hw/arm/virt.h
-+++ b/target/arm/neon-dp.decode
++++ b/include/hw/arm/virt.h
-@@ -XXX,XX +XXX,XX @@ VCGT_U_3s        1111 001 1 0 . .. .... .... 0011 . . . 0 .... @3same
+@@ -XXX,XX +XXX,XX @@ struct VirtMachineClass {
- VCGE_S_3s        1111 001 0 0 . .. .... .... 0011 . . . 1 .... @3same
+     /* Machines < 6.2 have no support for describing cpu topology to guest */
- VCGE_U_3s        1111 001 1 0 . .. .... .... 0011 . . . 1 .... @3same
+     bool no_cpu_topology;
+     bool no_tcg_lpa2;
-+VSHL_S_3s        1111 001 0 0 . .. .... .... 0100 . . . 0 .... @3same
++    bool no_ns_el2_virt_timer_irq;
-+VSHL_U_3s        1111 001 1 0 . .. .... .... 0100 . . . 0 .... @3same
+ };
-+
- VMAX_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 0 .... @3same
+ struct VirtMachineState {
- VMAX_U_3s        1111 001 1 0 . .. .... .... 0110 . . . 0 .... @3same
+@@ -XXX,XX +XXX,XX @@ struct VirtMachineState {
- VMIN_S_3s        1111 001 0 0 . .. .... .... 0110 . . . 1 .... @3same
+     PCIBus *bus;
-@@ -XXX,XX +XXX,XX @@ VSUB_3s          1111 001 1 0 . .. .... .... 1000 . . . 0 .... @3same
+     char *oem_id;
+     char *oem_table_id;
- VTST_3s          1111 001 0 0 . .. .... .... 1000 . . . 1 .... @3same
++    bool ns_el2_virt_timer_irq;
- VCEQ_3s          1111 001 1 0 . .. .... .... 1000 . . . 1 .... @3same
+ };
-+
-+VMLA_3s          1111 001 0 0 . .. .... .... 1001 . . . 0 .... @3same
+ #define VIRT_ECAM_ID(high) (high ? VIRT_HIGH_PCIE_ECAM : VIRT_PCIE_ECAM)
-+VMLS_3s          1111 001 1 0 . .. .... .... 1001 . . . 0 .... @3same
+diff --git a/hw/arm/virt-acpi-build.c b/hw/arm/virt-acpi-build.c
 +
 +VMUL_3s          1111 001 0 0 . .. .... .... 1001 . . . 1 .... @3same
 +VMUL_p_3s        1111 001 1 0 . .. .... .... 1001 . . . 1 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-neon.inc.c
+--- a/hw/arm/virt-acpi-build.c
-+++ b/target/arm/translate-neon.inc.c
++++ b/hw/arm/virt-acpi-build.c
-@@ -XXX,XX +XXX,XX @@ DO_3SAME_NO_SZ_3(VMAX_S, tcg_gen_gvec_smax)
+@@ -XXX,XX +XXX,XX @@ build_srat(GArray *table_data, BIOSLinker *linker, VirtMachineState *vms)
- DO_3SAME_NO_SZ_3(VMAX_U, tcg_gen_gvec_umax)
+ }
- DO_3SAME_NO_SZ_3(VMIN_S, tcg_gen_gvec_smin)
- DO_3SAME_NO_SZ_3(VMIN_U, tcg_gen_gvec_umin)
+ /*
-+DO_3SAME_NO_SZ_3(VMUL, tcg_gen_gvec_mul)
+- * ACPI spec, Revision 5.1
+- * 5.2.24 Generic Timer Description Table (GTDT)
- #define DO_3SAME_CMP(INSN, COND)                                        \
++ * ACPI spec, Revision 6.5
-     static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
++ * 5.2.25 Generic Timer Description Table (GTDT)
-@@ -XXX,XX +XXX,XX @@ DO_3SAME_GVEC4(VQADD_S, sqadd_op)
+  */
- DO_3SAME_GVEC4(VQADD_U, uqadd_op)
+ static void
- DO_3SAME_GVEC4(VQSUB_S, sqsub_op)
+ build_gtdt(GArray *table_data, BIOSLinker *linker, VirtMachineState *vms)
- DO_3SAME_GVEC4(VQSUB_U, uqsub_op)
+@@ -XXX,XX +XXX,XX @@ build_gtdt(GArray *table_data, BIOSLinker *linker, VirtMachineState *vms)
-+
+     uint32_t irqflags = vmc->claim_edge_triggered_timers ?
-+static void gen_VMUL_p_3s(unsigned vece, uint32_t rd_ofs, uint32_t rn_ofs,
+: /* Interrupt is Edge triggered */
-+                           uint32_t rm_ofs, uint32_t oprsz, uint32_t maxsz)
+;  /* Interrupt is Level triggered  */
 -    AcpiTable table = { .sig = "GTDT", .rev = 2, .oem_id = vms->oem_id,
 +    AcpiTable table = { .sig = "GTDT", .rev = 3, .oem_id = vms->oem_id,
                          .oem_table_id = vms->oem_table_id };
      acpi_table_begin(&table, table_data);
@@ -XXX,XX +XXX,XX @@ build_gtdt(GArray *table_data, BIOSLinker *linker, VirtMachineState *vms)
      build_append_int_noprefix(table_data, 0, 4);
      /* Platform Timer Offset */
      build_append_int_noprefix(table_data, 0, 4);
 -
 +    if (vms->ns_el2_virt_timer_irq) {
 +        /* Virtual EL2 Timer GSIV */
 +        build_append_int_noprefix(table_data, ARCH_TIMER_NS_EL2_VIRT_IRQ, 4);
 +        /* Virtual EL2 Timer Flags */
 +        build_append_int_noprefix(table_data, irqflags, 4);
 +    } else {
 +        build_append_int_noprefix(table_data, 0, 4);
 +        build_append_int_noprefix(table_data, 0, 4);
 +    }
      acpi_table_end(linker, &table);
  }
@@ -XXX,XX +XXX,XX @@ build_madt(GArray *table_data, BIOSLinker *linker, VirtMachineState *vms)
  static void build_fadt_rev6(GArray *table_data, BIOSLinker *linker,
                              VirtMachineState *vms, unsigned dsdt_tbl_offset)
  {
 -    /* ACPI v6.0 */
 +    /* ACPI v6.3 */
      AcpiFadtData fadt = {
          .rev = 6,
 -        .minor_ver = 0,
 +        .minor_ver = 3,
          .flags = 1 << ACPI_FADT_F_HW_REDUCED_ACPI,
          .xdsdt_tbl_offset = &dsdt_tbl_offset,
      };
 diff --git a/hw/arm/virt.c b/hw/arm/virt.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/virt.c
 +++ b/hw/arm/virt.c
@@ -XXX,XX +XXX,XX @@ static void create_randomness(MachineState *ms, const char *node)
      qemu_fdt_setprop(ms->fdt, node, "rng-seed", seed.rng, sizeof(seed.rng));
  }
 +/*
 + * The CPU object always exposes the NS EL2 virt timer IRQ line,
 + * but we don't want to advertise it to the guest in the dtb or ACPI
 + * table unless it's really going to do something.
 + */
 +static bool ns_el2_virt_timer_present(void)
 +{
-+    tcg_gen_gvec_3_ool(rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz,
++    ARMCPU *cpu = ARM_CPU(qemu_get_cpu(0));
-+                       0, gen_helper_gvec_pmul_b);
++    CPUARMState *env = &cpu->env;
 +
 +    return arm_feature(env, ARM_FEATURE_AARCH64) &&
 +        arm_feature(env, ARM_FEATURE_EL2) && cpu_isar_feature(aa64_vh, cpu);
 +}
 +
-+static bool trans_VMUL_p_3s(DisasContext *s, arg_3same *a)
+ static void create_fdt(VirtMachineState *vms)
-+{
+ {
-+    if (a->size != 0) {
+     MachineState *ms = MACHINE(vms);
-+        return false;
+@@ -XXX,XX +XXX,XX @@ static void fdt_add_timer_nodes(const VirtMachineState *vms)
                                  "arm,armv7-timer");
      }
      qemu_fdt_setprop(ms->fdt, "/timer", "always-on", NULL, 0);
 -    qemu_fdt_setprop_cells(ms->fdt, "/timer", "interrupts",
 -                           GIC_FDT_IRQ_TYPE_PPI,
 -                           INTID_TO_PPI(ARCH_TIMER_S_EL1_IRQ), irqflags,
 -                           GIC_FDT_IRQ_TYPE_PPI,
 -                           INTID_TO_PPI(ARCH_TIMER_NS_EL1_IRQ), irqflags,
 -                           GIC_FDT_IRQ_TYPE_PPI,
 -                           INTID_TO_PPI(ARCH_TIMER_VIRT_IRQ), irqflags,
 -                           GIC_FDT_IRQ_TYPE_PPI,
 -                           INTID_TO_PPI(ARCH_TIMER_NS_EL2_IRQ), irqflags);
 +    if (vms->ns_el2_virt_timer_irq) {
 +        qemu_fdt_setprop_cells(ms->fdt, "/timer", "interrupts",
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_S_EL1_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_NS_EL1_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_VIRT_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_NS_EL2_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_NS_EL2_VIRT_IRQ), irqflags);
 +    } else {
 +        qemu_fdt_setprop_cells(ms->fdt, "/timer", "interrupts",
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_S_EL1_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_NS_EL1_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_VIRT_IRQ), irqflags,
 +                               GIC_FDT_IRQ_TYPE_PPI,
 +                               INTID_TO_PPI(ARCH_TIMER_NS_EL2_IRQ), irqflags);
 +    }
-+    return do_3same(s, a, gen_VMUL_p_3s);
+ }
-+}
-+
+ static void fdt_add_cpu_nodes(const VirtMachineState *vms)
-+#define DO_3SAME_GVEC3_NO_SZ_3(INSN, OPARRAY)                           \
+@@ -XXX,XX +XXX,XX @@ static void create_gic(VirtMachineState *vms, MemoryRegion *mem)
-+    static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
+             [GTIMER_VIRT] = ARCH_TIMER_VIRT_IRQ,
-+                                uint32_t rn_ofs, uint32_t rm_ofs,       \
+             [GTIMER_HYP]  = ARCH_TIMER_NS_EL2_IRQ,
-+                                uint32_t oprsz, uint32_t maxsz)         \
+             [GTIMER_SEC]  = ARCH_TIMER_S_EL1_IRQ,
-+    {                                                                   \
++            [GTIMER_HYPVIRT] = ARCH_TIMER_NS_EL2_VIRT_IRQ,
-+        tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs,                          \
+         };
-+                       oprsz, maxsz, &OPARRAY[vece]);                   \
-+    }                                                                   \
+         for (unsigned irq = 0; irq < ARRAY_SIZE(timer_irq); irq++) {
-+    DO_3SAME_NO_SZ_3(INSN, gen_##INSN##_3s)
+@@ -XXX,XX +XXX,XX @@ static void machvirt_init(MachineState *machine)
-+
+         qdev_realize(DEVICE(cpuobj), NULL, &error_fatal);
-+
+         object_unref(cpuobj);
-+DO_3SAME_GVEC3_NO_SZ_3(VMLA, mla_op)
+     }
-+DO_3SAME_GVEC3_NO_SZ_3(VMLS, mls_op)
++
-+
++    /* Now we've created the CPUs we can see if they have the hypvirt timer */
-+#define DO_3SAME_GVEC3_SHIFT(INSN, OPARRAY)                             \
++    vms->ns_el2_virt_timer_irq = ns_el2_virt_timer_present() &&
-+    static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs,         \
++        !vmc->no_ns_el2_virt_timer_irq;
-+                                uint32_t rn_ofs, uint32_t rm_ofs,       \
++
-+                                uint32_t oprsz, uint32_t maxsz)         \
+     fdt_add_timer_nodes(vms);
-+    {                                                                   \
+     fdt_add_cpu_nodes(vms);
-+        /* Note the operation is vshl vd,vm,vn */                       \
-+        tcg_gen_gvec_3(rd_ofs, rm_ofs, rn_ofs,                          \
+@@ -XXX,XX +XXX,XX @@ DEFINE_VIRT_MACHINE_AS_LATEST(9, 0)
-+                       oprsz, maxsz, &OPARRAY[vece]);                   \
-+    }                                                                   \
+ static void virt_machine_8_2_options(MachineClass *mc)
-+    DO_3SAME(INSN, gen_##INSN##_3s)
+ {
-+
++    VirtMachineClass *vmc = VIRT_MACHINE_CLASS(OBJECT_CLASS(mc));
-+DO_3SAME_GVEC3_SHIFT(VSHL_S, sshl_op)
++
-+DO_3SAME_GVEC3_SHIFT(VSHL_U, ushl_op)
+     virt_machine_9_0_options(mc);
-diff --git a/target/arm/translate.c b/target/arm/translate.c
+     compat_props_add(mc->compat_props, hw_compat_8_2, hw_compat_8_2_len);
-index XXXXXXX..XXXXXXX 100644
++    /*
---- a/target/arm/translate.c
++     * Don't expose NS_EL2_VIRT timer IRQ in DTB on ACPI on 8.2 and
-+++ b/target/arm/translate.c
++     * earlier machines. (Exposing it tickles a bug in older EDK2
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
++     * guest BIOS binaries.)
-             }
++     */
-             return 1;
++    vmc->no_ns_el2_virt_timer_irq = true;
+ }
--        case NEON_3R_VMUL: /* VMUL */
+ DEFINE_VIRT_MACHINE(8, 2)
--            if (u) {
 -                /* Polynomial case allows only P8.  */
 -                if (size != 0) {
 -                    return 1;
 -                }
 -                tcg_gen_gvec_3_ool(rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size,
 -                                   0, gen_helper_gvec_pmul_b);
 -            } else {
 -                tcg_gen_gvec_mul(size, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -            }
 -            return 0;
 -
 -        case NEON_3R_VML: /* VMLA, VMLS */
 -            tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size,
 -                           u ? &mls_op[size] : &mla_op[size]);
 -            return 0;
 -
 -        case NEON_3R_VSHL:
 -            /* Note the operation is vshl vd,vm,vn */
 -            tcg_gen_gvec_3(rd_ofs, rm_ofs, rn_ofs, vec_size, vec_size,
 -                           u ? &ushl_op[size] : &sshl_op[size]);
 -            return 0;
 -
          case NEON_3R_VADD_VSUB:
          case NEON_3R_LOGIC:
          case NEON_3R_VMAX:
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
          case NEON_3R_VCGE:
          case NEON_3R_VQADD:
          case NEON_3R_VQSUB:
 +        case NEON_3R_VMUL:
 +        case NEON_3R_VML:
 +        case NEON_3R_VSHL:
              /* Already handled by decodetree */
              return 1;
          }
 --
-.20.1
+.34.1

-[PULL 33/39] target/arm: Convert Neon 3-reg-same VADD/VSUB to decodetree
+[PULL 13/35] tests/qtest/bios-tables-tests: Update virt golden reference
-Convert the Neon 3-reg-same VADD and VSUB insns to decodetree.
+Update the virt golden reference files to say that the FACP is ACPI
+v6.3, and the GTDT table is a revision 3 table with space for the
-Note that we don't need the neon_3r_sizes[op] check here because all
+virtual EL2 timer.
-size values are OK for VADD and VSUB; we'll add this when we convert
-the first insn that has size restrictions.
+Diffs from iasl:
-For this we need one of the GVecGen*Fn typedefs currently in
+@@ -XXX,XX +XXX,XX @@
-translate-a64.h; move them all to translate.h as a block so they
+ /*
-are visible to the 32-bit decoder.
+  * Intel ACPI Component Architecture
   * AML/ASL+ Disassembler version 20200925 (64-bit version)
   * Copyright (c) 2000 - 2020 Intel Corporation
   *
 - * Disassembly of tests/data/acpi/virt/FACP, Mon Jan 22 13:48:40 2024
 + * Disassembly of /tmp/aml-W8RZH2, Mon Jan 22 13:48:40 2024
   *
   * ACPI Data Table [FACP]
   *
   * Format: [HexOffset DecimalOffset ByteLength]  FieldName : FieldValue
   */
  [000h 0000   4]                    Signature : "FACP"    [Fixed ACPI Description Table (FADT)]
  [004h 0004   4]                 Table Length : 00000114
  [008h 0008   1]                     Revision : 06
 -[009h 0009   1]                     Checksum : 15
 +[009h 0009   1]                     Checksum : 12
  [00Ah 0010   6]                       Oem ID : "BOCHS "
  [010h 0016   8]                 Oem Table ID : "BXPC    "
  [018h 0024   4]                 Oem Revision : 00000001
  [01Ch 0028   4]              Asl Compiler ID : "BXPC"
  [020h 0032   4]        Asl Compiler Revision : 00000001
  [024h 0036   4]                 FACS Address : 00000000
  [028h 0040   4]                 DSDT Address : 00000000
  [02Ch 0044   1]                        Model : 00
  [02Dh 0045   1]                   PM Profile : 00 [Unspecified]
  [02Eh 0046   2]                SCI Interrupt : 0000
  [030h 0048   4]             SMI Command Port : 00000000
  [034h 0052   1]            ACPI Enable Value : 00
  [035h 0053   1]           ACPI Disable Value : 00
  [036h 0054   1]               S4BIOS Command : 00
  [037h 0055   1]              P-State Control : 00
@@ -XXX,XX +XXX,XX @@
       Use APIC Physical Destination Mode (V4) : 0
                         Hardware Reduced (V5) : 1
                        Low Power S0 Idle (V5) : 0
  [074h 0116  12]               Reset Register : [Generic Address Structure]
  [074h 0116   1]                     Space ID : 00 [SystemMemory]
  [075h 0117   1]                    Bit Width : 00
  [076h 0118   1]                   Bit Offset : 00
  [077h 0119   1]         Encoded Access Width : 00 [Undefined/Legacy]
  [078h 0120   8]                      Address : 0000000000000000
  [080h 0128   1]         Value to cause reset : 00
  [081h 0129   2]    ARM Flags (decoded below) : 0003
                                PSCI Compliant : 1
                         Must use HVC for PSCI : 1
 -[083h 0131   1]          FADT Minor Revision : 00
 +[083h 0131   1]          FADT Minor Revision : 03
  [084h 0132   8]                 FACS Address : 0000000000000000
  [08Ch 0140   8]                 DSDT Address : 0000000000000000
  [094h 0148  12]             PM1A Event Block : [Generic Address Structure]
  [094h 0148   1]                     Space ID : 00 [SystemMemory]
  [095h 0149   1]                    Bit Width : 00
  [096h 0150   1]                   Bit Offset : 00
  [097h 0151   1]         Encoded Access Width : 00 [Undefined/Legacy]
  [098h 0152   8]                      Address : 0000000000000000
  [0A0h 0160  12]             PM1B Event Block : [Generic Address Structure]
  [0A0h 0160   1]                     Space ID : 00 [SystemMemory]
  [0A1h 0161   1]                    Bit Width : 00
  [0A2h 0162   1]                   Bit Offset : 00
  [0A3h 0163   1]         Encoded Access Width : 00 [Undefined/Legacy]
  [0A4h 0164   8]                      Address : 0000000000000000
@@ -XXX,XX +XXX,XX @@
  [0F5h 0245   1]                    Bit Width : 00
  [0F6h 0246   1]                   Bit Offset : 00
  [0F7h 0247   1]         Encoded Access Width : 00 [Undefined/Legacy]
  [0F8h 0248   8]                      Address : 0000000000000000
  [100h 0256  12]        Sleep Status Register : [Generic Address Structure]
  [100h 0256   1]                     Space ID : 00 [SystemMemory]
  [101h 0257   1]                    Bit Width : 00
  [102h 0258   1]                   Bit Offset : 00
  [103h 0259   1]         Encoded Access Width : 00 [Undefined/Legacy]
  [104h 0260   8]                      Address : 0000000000000000
  [10Ch 0268   8]                Hypervisor ID : 00000000554D4551
  Raw Table Data: Length 276 (0x114)
 -    0000: 46 41 43 50 14 01 00 00 06 15 42 4F 43 48 53 20  // FACP......BOCHS
 +    0000: 46 41 43 50 14 01 00 00 06 12 42 4F 43 48 53 20  // FACP......BOCHS
 : 42 58 50 43 20 20 20 20 01 00 00 00 42 58 50 43  // BXPC    ....BXPC
 : 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 10 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 -    0080: 00 03 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 +    0080: 00 03 00 03 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 A0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 B0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 C0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 D0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 E0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 F0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  // ................
 : 00 00 00 00 00 00 00 00 00 00 00 00 51 45 4D 55  // ............QEMU
 : 00 00 00 00                                      // ....
@@ -XXX,XX +XXX,XX @@
  /*
   * Intel ACPI Component Architecture
   * AML/ASL+ Disassembler version 20200925 (64-bit version)
   * Copyright (c) 2000 - 2020 Intel Corporation
   *
 - * Disassembly of tests/data/acpi/virt/GTDT, Mon Jan 22 13:48:40 2024
 + * Disassembly of /tmp/aml-XDSZH2, Mon Jan 22 13:48:40 2024
   *
   * ACPI Data Table [GTDT]
   *
   * Format: [HexOffset DecimalOffset ByteLength]  FieldName : FieldValue
   */
  [000h 0000   4]                    Signature : "GTDT"    [Generic Timer Description Table]
 -[004h 0004   4]                 Table Length : 00000060
 -[008h 0008   1]                     Revision : 02
 -[009h 0009   1]                     Checksum : 9C
 +[004h 0004   4]                 Table Length : 00000068
 +[008h 0008   1]                     Revision : 03
 +[009h 0009   1]                     Checksum : 93
  [00Ah 0010   6]                       Oem ID : "BOCHS "
  [010h 0016   8]                 Oem Table ID : "BXPC    "
  [018h 0024   4]                 Oem Revision : 00000001
  [01Ch 0028   4]              Asl Compiler ID : "BXPC"
  [020h 0032   4]        Asl Compiler Revision : 00000001
  [024h 0036   8]        Counter Block Address : FFFFFFFFFFFFFFFF
  [02Ch 0044   4]                     Reserved : 00000000
  [030h 0048   4]         Secure EL1 Interrupt : 0000001D
  [034h 0052   4]    EL1 Flags (decoded below) : 00000000
                                  Trigger Mode : 0
                                      Polarity : 0
                                     Always On : 0
  [038h 0056   4]     Non-Secure EL1 Interrupt : 0000001E
@@ -XXX,XX +XXX,XX @@
  [040h 0064   4]      Virtual Timer Interrupt : 0000001B
  [044h 0068   4]     VT Flags (decoded below) : 00000000
                                  Trigger Mode : 0
                                      Polarity : 0
                                     Always On : 0
  [048h 0072   4]     Non-Secure EL2 Interrupt : 0000001A
  [04Ch 0076   4]   NEL2 Flags (decoded below) : 00000000
                                  Trigger Mode : 0
                                      Polarity : 0
                                     Always On : 0
  [050h 0080   8]   Counter Read Block Address : FFFFFFFFFFFFFFFF
  [058h 0088   4]         Platform Timer Count : 00000000
  [05Ch 0092   4]        Platform Timer Offset : 00000000
 +[060h 0096   4]       Virtual EL2 Timer GSIV : 00000000
 +[064h 0100   4]      Virtual EL2 Timer Flags : 00000000
 -Raw Table Data: Length 96 (0x60)
 +Raw Table Data: Length 104 (0x68)
 -    0000: 47 54 44 54 60 00 00 00 02 9C 42 4F 43 48 53 20  // GTDT`.....BOCHS
 +    0000: 47 54 44 54 68 00 00 00 03 93 42 4F 43 48 53 20  // GTDTh.....BOCHS
 : 42 58 50 43 20 20 20 20 01 00 00 00 42 58 50 43  // BXPC    ....BXPC
 : 01 00 00 00 FF FF FF FF FF FF FF FF 00 00 00 00  // ................
 : 1D 00 00 00 00 00 00 00 1E 00 00 00 04 00 00 00  // ................
 : 1B 00 00 00 00 00 00 00 1A 00 00 00 00 00 00 00  // ................
 : FF FF FF FF FF FF FF FF 00 00 00 00 00 00 00 00  // ................
 +    0060: 00 00 00 00 00 00 00 00                          // ........
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Ard Biesheuvel <ardb@kernel.org>
-Message-id: 20200430181003.21682-15-peter.maydell@linaro.org
+Message-id: 20240122143537.233498-4-peter.maydell@linaro.org
 ---
- target/arm/translate-a64.h      |  9 --------
+ tests/qtest/bios-tables-test-allowed-diff.h |   2 --
- target/arm/translate.h          |  9 ++++++++
+ tests/data/acpi/virt/FACP                   | Bin 276 -> 276 bytes
- target/arm/neon-dp.decode       | 17 +++++++++++++++
+ tests/data/acpi/virt/GTDT                   | Bin 96 -> 104 bytes
- target/arm/translate-neon.inc.c | 38 +++++++++++++++++++++++++++++++++
+files changed, 2 deletions(-)
- target/arm/translate.c          | 14 ++++--------
-files changed, 68 insertions(+), 19 deletions(-)
+diff --git a/tests/qtest/bios-tables-test-allowed-diff.h b/tests/qtest/bios-tables-test-allowed-diff.h
 diff --git a/target/arm/translate-a64.h b/target/arm/translate-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.h
+--- a/tests/qtest/bios-tables-test-allowed-diff.h
-+++ b/target/arm/translate-a64.h
++++ b/tests/qtest/bios-tables-test-allowed-diff.h
-@@ -XXX,XX +XXX,XX @@ static inline int vec_full_reg_size(DisasContext *s)
+@@ -1,3 +1 @@
+ /* List of comma-separated changed AML files to ignore */
- bool disas_sve(DisasContext *, uint32_t);
+-"tests/data/acpi/virt/FACP",
+-"tests/data/acpi/virt/GTDT",
--/* Note that the gvec expanders operate on offsets + sizes.  */
+diff --git a/tests/data/acpi/virt/FACP b/tests/data/acpi/virt/FACP
 -typedef void GVecGen2Fn(unsigned, uint32_t, uint32_t, uint32_t, uint32_t);
 -typedef void GVecGen2iFn(unsigned, uint32_t, uint32_t, int64_t,
 -                         uint32_t, uint32_t);
 -typedef void GVecGen3Fn(unsigned, uint32_t, uint32_t,
 -                        uint32_t, uint32_t, uint32_t);
 -typedef void GVecGen4Fn(unsigned, uint32_t, uint32_t, uint32_t,
 -                        uint32_t, uint32_t, uint32_t);
 -
  #endif /* TARGET_ARM_TRANSLATE_A64_H */
 diff --git a/target/arm/translate.h b/target/arm/translate.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.h
+GIT binary patch
-+++ b/target/arm/translate.h
+delta 25
-@@ -XXX,XX +XXX,XX @@ void gen_sshl_i64(TCGv_i64 d, TCGv_i64 a, TCGv_i64 b);
+gcmbQjG=+)F&CxkPgpq-PO=u!l<;2F$$vli407<0<)c^nh
- #define dc_isar_feature(name, ctx) \
-     ({ DisasContext *ctx_ = (ctx); isar_feature_##name(ctx_->isar); })
+delta 28
+kcmbQjG=+)F&CxkPgpq-PO>`nx<-|!<6Akz$^DuG%0AAS!ssI20
-+/* Note that the gvec expanders operate on offsets + sizes.  */
-+typedef void GVecGen2Fn(unsigned, uint32_t, uint32_t, uint32_t, uint32_t);
+diff --git a/tests/data/acpi/virt/GTDT b/tests/data/acpi/virt/GTDT
 +typedef void GVecGen2iFn(unsigned, uint32_t, uint32_t, int64_t,
 +                         uint32_t, uint32_t);
 +typedef void GVecGen3Fn(unsigned, uint32_t, uint32_t,
 +                        uint32_t, uint32_t, uint32_t);
 +typedef void GVecGen4Fn(unsigned, uint32_t, uint32_t, uint32_t,
 +                        uint32_t, uint32_t, uint32_t);
 +
  #endif /* TARGET_ARM_TRANSLATE_H */
 diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-dp.decode
+GIT binary patch
-+++ b/target/arm/neon-dp.decode
+delta 25
-@@ -XXX,XX +XXX,XX @@
+bcmYeu;BpUf3CUn!U|^m+kt>V?$N&QXMtB4L
- #
- # This file is processed by scripts/decodetree.py
+delta 16
- #
+Xcmc~u;BpUf2}xjJU|^avkt+-UB60)u
-+# VFP/Neon register fields; same as vfp.decode
 +%vm_dp  5:1 0:4
 +%vn_dp  7:1 16:4
 +%vd_dp  22:1 12:4
  # Encodings for Neon data processing instructions where the T32 encoding
  # is a simple transformation of the A32 encoding.
@@ -XXX,XX +XXX,XX @@
  #   0b111p_1111_qqqq_qqqq_qqqq_qqqq_qqqq_qqqq
  # This file works on the A32 encoding only; calling code for T32 has to
  # transform the insn into the A32 version first.
 +
 +######################################################################
 +# 3-reg-same grouping:
 +# 1111 001 U 0 D sz:2 Vn:4 Vd:4 opc:4 N Q M op Vm:4
 +######################################################################
 +
 +&3same vm vn vd q size
 +
 +@3same           .... ... . . . size:2 .... .... .... . q:1 . . .... \
 +                 &3same vm=%vm_dp vn=%vn_dp vd=%vd_dp
 +
 +VADD_3s          1111 001 0 0 . .. .... .... 1000 . . . 0 .... @3same
 +VSUB_3s          1111 001 1 0 . .. .... .... 1000 . . . 0 .... @3same
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VLDST_single(DisasContext *s, arg_VLDST_single *a)
      return true;
  }
 +
 +static bool do_3same(DisasContext *s, arg_3same *a, GVecGen3Fn fn)
 +{
 +    int vec_size = a->q ? 16 : 8;
 +    int rd_ofs = neon_reg_offset(a->vd, 0);
 +    int rn_ofs = neon_reg_offset(a->vn, 0);
 +    int rm_ofs = neon_reg_offset(a->vm, 0);
 +
 +    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist. */
 +    if (!dc_isar_feature(aa32_simd_r32, s) &&
 +        ((a->vd | a->vn | a->vm) & 0x10)) {
 +        return false;
 +    }
 +
 +    if ((a->vn | a->vm | a->vd) & a->q) {
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    fn(a->size, rd_ofs, rn_ofs, rm_ofs, vec_size, vec_size);
 +    return true;
 +}
 +
 +#define DO_3SAME(INSN, FUNC)                                            \
 +    static bool trans_##INSN##_3s(DisasContext *s, arg_3same *a)        \
 +    {                                                                   \
 +        return do_3same(s, a, FUNC);                                    \
 +    }
 +
 +DO_3SAME(VADD, tcg_gen_gvec_add)
 +DO_3SAME(VSUB, tcg_gen_gvec_sub)
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
              }
              return 0;
 -        case NEON_3R_VADD_VSUB:
 -            if (u) {
 -                tcg_gen_gvec_sub(size, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -            } else {
 -                tcg_gen_gvec_add(size, rd_ofs, rn_ofs, rm_ofs,
 -                                 vec_size, vec_size);
 -            }
 -            return 0;
 -
          case NEON_3R_VQADD:
              tcg_gen_gvec_4(rd_ofs, offsetof(CPUARMState, vfp.qc),
                             rn_ofs, rm_ofs, vec_size, vec_size,
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
              tcg_gen_gvec_3(rd_ofs, rm_ofs, rn_ofs, vec_size, vec_size,
                             u ? &ushl_op[size] : &sshl_op[size]);
              return 0;
 +
 +        case NEON_3R_VADD_VSUB:
 +            /* Already handled by decodetree */
 +            return 1;
          }
          if (size == 3) {
 --
-.20.1
+.34.1

-[PULL 29/39] target/arm: Convert VFM[AS]L (scalar) to decodetree
+[PULL 14/35] hw/arm/npcm7xx: Call qemu_configure_nic_device() for GMAC modules
-Convert the VFM[AS]L (scalar) insns in the 2reg-scalar-ext group
+The patchset adding the GMAC ethernet to this SoC crossed in the
-to decodetree. These are the last ones in the group so we can remove
+mail with the patchset cleaning up the NIC handling. When we
-all the legacy decode for the group.
+create the GMAC modules we must call qemu_configure_nic_device()
 so that the user has the opportunity to use the -nic commandline
 option to create a network backend and connect it to the GMACs.
-Note that in disas_thumb2_insn() the parts of this encoding space
+Add the missing call.
 where the decodetree decoder returns false will correctly be directed
 to illegal_op by the "(insn & (1 << 28))" check so they won't fall
 into disas_coproc_insn() by mistake.
+Fixes: 21e5326a7c ("hw/arm: Add GMAC devices to NPCM7XX SoC")
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: David Woodhouse <dwmw@amazon.co.uk>
-Message-id: 20200430181003.21682-11-peter.maydell@linaro.org
+Message-id: 20240206171231.396392-2-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |   7 +++
+ hw/arm/npcm7xx.c | 1 +
- target/arm/translate-neon.inc.c |  32 ++++++++++
+file changed, 1 insertion(+)
  target/arm/translate.c          | 107 +-------------------------------
 files changed, 40 insertions(+), 106 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/hw/arm/npcm7xx.c b/hw/arm/npcm7xx.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/hw/arm/npcm7xx.c
-+++ b/target/arm/neon-shared.decode
++++ b/hw/arm/npcm7xx.c
-@@ -XXX,XX +XXX,XX @@ VCMLA_scalar   1111 1110 1 . rot:2 .... .... 1000 . q:1 . 0 .... \
+@@ -XXX,XX +XXX,XX @@ static void npcm7xx_realize(DeviceState *dev, Error **errp)
+     for (i = 0; i < ARRAY_SIZE(s->gmac); i++) {
- VDOT_scalar    1111 1110 0 . 10 .... .... 1101 . q:1 index:1 u:1 rm:4 \
+         SysBusDevice *sbd = SYS_BUS_DEVICE(&s->gmac[i]);
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp
-+
++        qemu_configure_nic_device(DEVICE(sbd), false, NULL);
-+%vfml_scalar_q0_rm 0:3 5:1
+         /*
-+%vfml_scalar_q1_index 5:1 3:1
+          * The device exists regardless of whether it's connected to a QEMU
-+VFML_scalar    1111 1110 0 . 0 s:1 .... .... 1000 . 0 . 1 index:1 ... \
+          * netdev backend. So always instantiate it even if there is no
 +               rm=%vfml_scalar_q0_rm vn=%vn_sp vd=%vd_dp q=0
 +VFML_scalar    1111 1110 0 . 0 s:1 .... .... 1000 . 1 . 1 . rm:3 \
 +               index=%vfml_scalar_q1_index vn=%vn_dp vd=%vd_dp q=1
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VDOT_scalar(DisasContext *s, arg_VDOT_scalar *a)
      tcg_temp_free_ptr(fpst);
      return true;
  }
 +
 +static bool trans_VFML_scalar(DisasContext *s, arg_VFML_scalar *a)
 +{
 +    int opr_sz;
 +
 +    if (!dc_isar_feature(aa32_fhm, s)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist. */
 +    if (!dc_isar_feature(aa32_simd_r32, s) &&
 +        ((a->vd & 0x10) || (a->q && (a->vn & 0x10)))) {
 +        return false;
 +    }
 +
 +    if (a->vd & a->q) {
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    opr_sz = (1 + a->q) * 8;
 +    tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(a->q, a->vn),
 +                       vfp_reg_offset(a->q, a->rm),
 +                       cpu_env, opr_sz, opr_sz,
 +                       (a->index << 2) | a->s, /* is_2 == 0 */
 +                       gen_helper_gvec_fmlal_idx_a32);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_dsp_insn(DisasContext *s, uint32_t insn)
  }
  #define VFP_REG_SHR(x, n) (((n) > 0) ? (x) >> (n) : (x) << -(n))
 -#define VFP_SREG(insn, bigbit, smallbit) \
 -  ((VFP_REG_SHR(insn, bigbit - 1) & 0x1e) | (((insn) >> (smallbit)) & 1))
  #define VFP_DREG(reg, insn, bigbit, smallbit) do { \
      if (dc_isar_feature(aa32_simd_r32, s)) { \
          reg = (((insn) >> (bigbit)) & 0x0f) \
@@ -XXX,XX +XXX,XX @@ static int disas_dsp_insn(DisasContext *s, uint32_t insn)
          reg = ((insn) >> (bigbit)) & 0x0f; \
      }} while (0)
 -#define VFP_SREG_D(insn) VFP_SREG(insn, 12, 22)
  #define VFP_DREG_D(reg, insn) VFP_DREG(reg, insn, 12, 22)
 -#define VFP_SREG_N(insn) VFP_SREG(insn, 16,  7)
  #define VFP_DREG_N(reg, insn) VFP_DREG(reg, insn, 16,  7)
 -#define VFP_SREG_M(insn) VFP_SREG(insn,  0,  5)
  #define VFP_DREG_M(reg, insn) VFP_DREG(reg, insn,  0,  5)
  static void gen_neon_dup_low16(TCGv_i32 var)
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
      return 0;
  }
 -/* Advanced SIMD two registers and a scalar extension.
 - *  31             24   23  22   20   16   12  11   10   9    8        3     0
 - * +-----------------+----+---+----+----+----+---+----+---+----+---------+----+
 - * | 1 1 1 1 1 1 1 0 | o1 | D | o2 | Vn | Vd | 1 | o3 | 0 | o4 | N Q M U | Vm |
 - * +-----------------+----+---+----+----+----+---+----+---+----+---------+----+
 - *
 - */
 -
 -static int disas_neon_insn_2reg_scalar_ext(DisasContext *s, uint32_t insn)
 -{
 -    gen_helper_gvec_3 *fn_gvec = NULL;
 -    gen_helper_gvec_3_ptr *fn_gvec_ptr = NULL;
 -    int rd, rn, rm, opr_sz, data;
 -    int off_rn, off_rm;
 -    bool is_long = false, q = extract32(insn, 6, 1);
 -    bool ptr_is_env = false;
 -
 -    if ((insn & 0xffa00f10) == 0xfe000810) {
 -        /* VFM[AS]L -- 1111 1110 0.0S .... .... 1000 .Q.1 .... */
 -        int is_s = extract32(insn, 20, 1);
 -        int vm20 = extract32(insn, 0, 3);
 -        int vm3 = extract32(insn, 3, 1);
 -        int m = extract32(insn, 5, 1);
 -        int index;
 -
 -        if (!dc_isar_feature(aa32_fhm, s)) {
 -            return 1;
 -        }
 -        if (q) {
 -            rm = vm20;
 -            index = m * 2 + vm3;
 -        } else {
 -            rm = vm20 * 2 + m;
 -            index = vm3;
 -        }
 -        is_long = true;
 -        data = (index << 2) | is_s; /* is_2 == 0 */
 -        fn_gvec_ptr = gen_helper_gvec_fmlal_idx_a32;
 -        ptr_is_env = true;
 -    } else {
 -        return 1;
 -    }
 -
 -    VFP_DREG_D(rd, insn);
 -    if (rd & q) {
 -        return 1;
 -    }
 -    if (q || !is_long) {
 -        VFP_DREG_N(rn, insn);
 -        if (rn & q & !is_long) {
 -            return 1;
 -        }
 -        off_rn = vfp_reg_offset(1, rn);
 -        off_rm = vfp_reg_offset(1, rm);
 -    } else {
 -        rn = VFP_SREG_N(insn);
 -        off_rn = vfp_reg_offset(0, rn);
 -        off_rm = vfp_reg_offset(0, rm);
 -    }
 -    if (s->fp_excp_el) {
 -        gen_exception_insn(s, s->pc_curr, EXCP_UDEF,
 -                           syn_simd_access_trap(1, 0xe, false), s->fp_excp_el);
 -        return 0;
 -    }
 -    if (!s->vfp_enabled) {
 -        return 1;
 -    }
 -
 -    opr_sz = (1 + q) * 8;
 -    if (fn_gvec_ptr) {
 -        TCGv_ptr ptr;
 -        if (ptr_is_env) {
 -            ptr = cpu_env;
 -        } else {
 -            ptr = get_fpstatus_ptr(1);
 -        }
 -        tcg_gen_gvec_3_ptr(vfp_reg_offset(1, rd), off_rn, off_rm, ptr,
 -                           opr_sz, opr_sz, data, fn_gvec_ptr);
 -        if (!ptr_is_env) {
 -            tcg_temp_free_ptr(ptr);
 -        }
 -    } else {
 -        tcg_gen_gvec_3_ool(vfp_reg_offset(1, rd), off_rn, off_rm,
 -                           opr_sz, opr_sz, data, fn_gvec);
 -    }
 -    return 0;
 -}
 -
  static int disas_coproc_insn(DisasContext *s, uint32_t insn)
  {
      int cpnum, is64, crn, crm, opc1, opc2, isread, rt, rt2;
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
                      }
                  }
              }
 -        } else if ((insn & 0x0f000a00) == 0x0e000800
 -                   && arm_dc_feature(s, ARM_FEATURE_V8)) {
 -            if (disas_neon_insn_2reg_scalar_ext(s, insn)) {
 -                goto illegal_op;
 -            }
 -            return;
          }
          goto illegal_op;
      }
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
              }
              break;
          }
 -        if ((insn & 0xff000a00) == 0xfe000800
 -            && arm_dc_feature(s, ARM_FEATURE_V8)) {
 -            /* The Thumb2 and ARM encodings are identical.  */
 -            if (disas_neon_insn_2reg_scalar_ext(s, insn)) {
 -                goto illegal_op;
 -            }
 -        } else if (((insn >> 24) & 3) == 3) {
 +        if (((insn >> 24) & 3) == 3) {
              /* Translate into the equivalent ARM encoding.  */
              insn = (insn & 0xe2ffffff) | ((insn & (1 << 28)) >> 4) | (1 << 28);
              if (disas_neon_data_insn(s, insn)) {
 --
-.20.1
+.34.1

-[PULL 28/39] target/arm: Convert V[US]DOT (scalar) to decodetree
+[PULL 15/35] tests/qtest/npcm7xx_emc-test: Connect all NICs to a backend
-Convert the V[US]DOT (scalar) insns in the 2reg-scalar-ext group
+Currently QEMU will warn if there is a NIC on the board that
-to decodetree.
+is not connected to a backend. By default the '-nic user' will
 get used for all NICs, but if you manually connect a specific
 NIC to a specific backend, then the other NICs on the board
 have no backend and will be warned about:
 qemu-system-arm: warning: nic npcm7xx-emc.1 has no peer
 qemu-system-arm: warning: nic npcm-gmac.0 has no peer
 qemu-system-arm: warning: nic npcm-gmac.1 has no peer
 So suppress those warnings by manually connecting every NIC
 on the board to some backend.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: David Woodhouse <dwmw@amazon.co.uk>
-Message-id: 20200430181003.21682-10-peter.maydell@linaro.org
+Reviewed-by: Thomas Huth <thuth@redhat.com>
 Message-id: 20240206171231.396392-3-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |  3 +++
+ tests/qtest/npcm7xx_emc-test.c | 5 ++++-
- target/arm/translate-neon.inc.c | 35 +++++++++++++++++++++++++++++++++
+file changed, 4 insertions(+), 1 deletion(-)
  target/arm/translate.c          | 13 +-----------
 files changed, 39 insertions(+), 12 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/tests/qtest/npcm7xx_emc-test.c b/tests/qtest/npcm7xx_emc-test.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/tests/qtest/npcm7xx_emc-test.c
-+++ b/target/arm/neon-shared.decode
++++ b/tests/qtest/npcm7xx_emc-test.c
-@@ -XXX,XX +XXX,XX @@ VCMLA_scalar   1111 1110 0 . rot:2 .... .... 1000 . q:1 index:1 0 vm:4 \
+@@ -XXX,XX +XXX,XX @@ static int *packet_test_init(int module_num, GString *cmd_line)
-                vn=%vn_dp vd=%vd_dp size=0
+      * KISS and use -nic. The driver accepts 'emc0' and 'emc1' as aliases
- VCMLA_scalar   1111 1110 1 . rot:2 .... .... 1000 . q:1 . 0 .... \
+      * in the 'model' field to specify the device to match.
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp size=1 index=0
+      */
-+
+-    g_string_append_printf(cmd_line, " -nic socket,fd=%d,model=emc%d ",
-+VDOT_scalar    1111 1110 0 . 10 .... .... 1101 . q:1 index:1 u:1 rm:4 \
++    g_string_append_printf(cmd_line, " -nic socket,fd=%d,model=emc%d "
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp
++                           "-nic user,model=npcm7xx-emc "
-diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
++                           "-nic user,model=npcm-gmac "
-index XXXXXXX..XXXXXXX 100644
++                           "-nic user,model=npcm-gmac",
---- a/target/arm/translate-neon.inc.c
+                            test_sockets[1], module_num);
-+++ b/target/arm/translate-neon.inc.c
-@@ -XXX,XX +XXX,XX @@ static bool trans_VCMLA_scalar(DisasContext *s, arg_VCMLA_scalar *a)
+     g_test_queue_destroy(packet_test_clear, test_sockets);
      tcg_temp_free_ptr(fpst);
      return true;
  }
 +
 +static bool trans_VDOT_scalar(DisasContext *s, arg_VDOT_scalar *a)
 +{
 +    gen_helper_gvec_3 *fn_gvec;
 +    int opr_sz;
 +    TCGv_ptr fpst;
 +
 +    if (!dc_isar_feature(aa32_dp, s)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist. */
 +    if (!dc_isar_feature(aa32_simd_r32, s) &&
 +        ((a->vd | a->vn) & 0x10)) {
 +        return false;
 +    }
 +
 +    if ((a->vd | a->vn) & a->q) {
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    fn_gvec = a->u ? gen_helper_gvec_udot_idx_b : gen_helper_gvec_sdot_idx_b;
 +    opr_sz = (1 + a->q) * 8;
 +    fpst = get_fpstatus_ptr(1);
 +    tcg_gen_gvec_3_ool(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(1, a->vn),
 +                       vfp_reg_offset(1, a->rm),
 +                       opr_sz, opr_sz, a->index, fn_gvec);
 +    tcg_temp_free_ptr(fpst);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_insn_2reg_scalar_ext(DisasContext *s, uint32_t insn)
      bool is_long = false, q = extract32(insn, 6, 1);
      bool ptr_is_env = false;
 -    if ((insn & 0xffb00f00) == 0xfe200d00) {
 -        /* V[US]DOT -- 1111 1110 0.10 .... .... 1101 .Q.U .... */
 -        int u = extract32(insn, 4, 1);
 -
 -        if (!dc_isar_feature(aa32_dp, s)) {
 -            return 1;
 -        }
 -        fn_gvec = u ? gen_helper_gvec_udot_idx_b : gen_helper_gvec_sdot_idx_b;
 -        /* rm is just Vm, and index is M.  */
 -        data = extract32(insn, 5, 1); /* index */
 -        rm = extract32(insn, 0, 4);
 -    } else if ((insn & 0xffa00f10) == 0xfe000810) {
 +    if ((insn & 0xffa00f10) == 0xfe000810) {
          /* VFM[AS]L -- 1111 1110 0.0S .... .... 1000 .Q.1 .... */
          int is_s = extract32(insn, 20, 1);
          int vm20 = extract32(insn, 0, 3);
 --
-.20.1
+.34.1

-[PULL 05/39] target/arm: Add new 's1_is_el0' argument to get_phys_addr_lpae()
+[PULL 16/35] target/arm: Don't get MDCR_EL2 in pmu_counter_enabled() before checking ARM_FEATURE_PMU
-For ARMv8.2-TTS2UXN, the stage 2 page table walk wants to know
+It doesn't make sense to read the value of MDCR_EL2 on a non-A-profile
-whether the stage 1 access is for EL0 or not, because whether
+CPU, and in fact if you try to do it we will assert:
 exec permission is given can depend on whether this is an EL0
 or EL1 access. Add a new argument to get_phys_addr_lpae() so
 the call sites can pass this information in.
-Since get_phys_addr_lpae() doesn't already have a doc comment,
+#6  0x00007ffff4b95e96 in __GI___assert_fail
-add one so we have a place to put the documentation of the
+    (assertion=0x5555565a8c70 "!arm_feature(env, ARM_FEATURE_M)", file=0x5555565a6e5c "../../target/arm/helper.c", line=12600, function=0x5555565a9560 <__PRETTY_FUNCTION__.0> "arm_security_space_below_el3") at ./assert/assert.c:101
-semantics of the new s1_is_el0 argument.
+#7  0x0000555555ebf412 in arm_security_space_below_el3 (env=0x555557bc8190) at ../../target/arm/helper.c:12600
 #8  0x0000555555ea6f89 in arm_is_el2_enabled (env=0x555557bc8190) at ../../target/arm/cpu.h:2595
 #9  0x0000555555ea942f in arm_mdcr_el2_eff (env=0x555557bc8190) at ../../target/arm/internals.h:1512
+We might call pmu_counter_enabled() on an M-profile CPU (for example
+from the migration pre/post hooks in machine.c); this should always
+return false because these CPUs don't set ARM_FEATURE_PMU.
+Avoid the assertion by not calling arm_mdcr_el2_eff() before we
+have done the early return for "PMU not present".
+This fixes an assertion failure if you try to do a loadvm or
+savevm for an M-profile board.
+Cc: qemu-stable@nongnu.org
+Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2155
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200330210400.11724-4-peter.maydell@linaro.org
+Message-id: 20240208153346.970021-1-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 29 ++++++++++++++++++++++++++++-
+ target/arm/helper.c | 12 ++++++++++--
-file changed, 28 insertions(+), 1 deletion(-)
+file changed, 10 insertions(+), 2 deletions(-)
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static bool pmu_counter_enabled(CPUARMState *env, uint8_t counter)
+     bool enabled, prohibited = false, filtered;
- static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
+     bool secure = arm_is_secure(env);
-                                MMUAccessType access_type, ARMMMUIdx mmu_idx,
+     int el = arm_current_el(env);
-+                               bool s1_is_el0,
+-    uint64_t mdcr_el2 = arm_mdcr_el2_eff(env);
-                                hwaddr *phys_ptr, MemTxAttrs *txattrs, int *prot,
+-    uint8_t hpmn = mdcr_el2 & MDCR_HPMN;
-                                target_ulong *page_size_ptr,
++    uint64_t mdcr_el2;
-                                ARMMMUFaultInfo *fi, ARMCacheAttrs *cacheattrs);
++    uint8_t hpmn;
-@@ -XXX,XX +XXX,XX @@ static hwaddr S1_ptw_translate(CPUARMState *env, ARMMMUIdx mmu_idx,
-         }
++    /*
++     * We might be called for M-profile cores where MDCR_EL2 doesn't
-         ret = get_phys_addr_lpae(env, addr, MMU_DATA_LOAD, ARMMMUIdx_Stage2,
++     * exist and arm_mdcr_el2_eff() will assert, so this early-exit check
-+                                 false,
++     * must be before we read that value.
-                                  &s2pa, &txattrs, &s2prot, &s2size, fi,
++     */
-                                  pcacheattrs);
+     if (!arm_feature(env, ARM_FEATURE_PMU)) {
-         if (ret) {
+         return false;
@@ -XXX,XX +XXX,XX @@ static ARMVAParameters aa32_va_parameters(CPUARMState *env, uint32_t va,
      };
  }
 +/**
 + * get_phys_addr_lpae: perform one stage of page table walk, LPAE format
 + *
 + * Returns false if the translation was successful. Otherwise, phys_ptr, attrs,
 + * prot and page_size may not be filled in, and the populated fsr value provides
 + * information on why the translation aborted, in the format of a long-format
 + * DFSR/IFSR fault register, with the following caveats:
 + *  * the WnR bit is never set (the caller must do this).
 + *
 + * @env: CPUARMState
 + * @address: virtual address to get physical address for
 + * @access_type: MMU_DATA_LOAD, MMU_DATA_STORE or MMU_INST_FETCH
 + * @mmu_idx: MMU index indicating required translation regime
 + * @s1_is_el0: if @mmu_idx is ARMMMUIdx_Stage2 (so this is a stage 2 page table
 + *             walk), must be true if this is stage 2 of a stage 1+2 walk for an
 + *             EL0 access). If @mmu_idx is anything else, @s1_is_el0 is ignored.
 + * @phys_ptr: set to the physical address corresponding to the virtual address
 + * @attrs: set to the memory transaction attributes to use
 + * @prot: set to the permissions for the page containing phys_ptr
 + * @page_size_ptr: set to the size of the page containing phys_ptr
 + * @fi: set to fault info if the translation fails
 + * @cacheattrs: (if non-NULL) set to the cacheability/shareability attributes
 + */
  static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
                                 MMUAccessType access_type, ARMMMUIdx mmu_idx,
 +                               bool s1_is_el0,
                                 hwaddr *phys_ptr, MemTxAttrs *txattrs, int *prot,
                                 target_ulong *page_size_ptr,
                                 ARMMMUFaultInfo *fi, ARMCacheAttrs *cacheattrs)
@@ -XXX,XX +XXX,XX @@ bool get_phys_addr(CPUARMState *env, target_ulong address,
              /* S1 is done. Now do S2 translation.  */
              ret = get_phys_addr_lpae(env, ipa, access_type, ARMMMUIdx_Stage2,
 +                                     mmu_idx == ARMMMUIdx_E10_0,
                                       phys_ptr, attrs, &s2_prot,
                                       page_size, fi,
                                       cacheattrs != NULL ? &cacheattrs2 : NULL);
@@ -XXX,XX +XXX,XX @@ bool get_phys_addr(CPUARMState *env, target_ulong address,
      }
-     if (regime_using_lpae_format(env, mmu_idx)) {
++    mdcr_el2 = arm_mdcr_el2_eff(env);
--        return get_phys_addr_lpae(env, address, access_type, mmu_idx,
++    hpmn = mdcr_el2 & MDCR_HPMN;
-+        return get_phys_addr_lpae(env, address, access_type, mmu_idx, false,
++
-                                   phys_ptr, attrs, prot, page_size,
+     if (!arm_feature(env, ARM_FEATURE_EL2) ||
-                                   fi, cacheattrs);
+             (counter < hpmn || counter == 31)) {
-     } else if (regime_sctlr(env, mmu_idx) & SCTLR_XP) {
+         e = env->cp15.c9_pmcr & PMCRE;
 --
-.20.1
+.34.1

-[PULL 18/39] hw/arm: versal-virt: Add support for SD
+[PULL 17/35] tests/qtest: Fix GMAC test to run on a machine in upstream QEMU
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Nabih Estefan <nabihestefan@google.com>
-Add support for SD.
+Fix the nocm_gmac-test.c file to run on a nuvoton 7xx machine instead
 of 8xx. Also fix comments referencing this and values expecting 8xx.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Change-Id: Iabd0fba14910c3f1e883c4a9521350f3db9ffab8
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Signed-Off-By: Nabih Estefan <nabihestefan@google.com>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Reviewed-by: Tyrone Ting <kfting@nuvoton.com>
-Message-id: 20200427181649.26851-11-edgar.iglesias@gmail.com
+Message-id: 20240208194759.2858582-2-nabihestefan@google.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 [PMM: commit message tweaks]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/xlnx-versal-virt.c | 46 +++++++++++++++++++++++++++++++++++++++
+ tests/qtest/npcm_gmac-test.c | 84 +-----------------------------------
-file changed, 46 insertions(+)
+ tests/qtest/meson.build      |  3 +-
 files changed, 4 insertions(+), 83 deletions(-)
-diff --git a/hw/arm/xlnx-versal-virt.c b/hw/arm/xlnx-versal-virt.c
+diff --git a/tests/qtest/npcm_gmac-test.c b/tests/qtest/npcm_gmac-test.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal-virt.c
+--- a/tests/qtest/npcm_gmac-test.c
-+++ b/hw/arm/xlnx-versal-virt.c
++++ b/tests/qtest/npcm_gmac-test.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ typedef struct TestData {
- #include "hw/arm/sysbus-fdt.h"
+     const GMACModule *module;
- #include "hw/arm/fdt.h"
+ } TestData;
- #include "cpu.h"
-+#include "hw/qdev-properties.h"
+-/* Values extracted from hw/arm/npcm8xx.c */
- #include "hw/arm/xlnx-versal.h"
++/* Values extracted from hw/arm/npcm7xx.c */
+ static const GMACModule gmac_module_list[] = {
- #define TYPE_XLNX_VERSAL_VIRT_MACHINE MACHINE_TYPE_NAME("xlnx-versal-virt")
+     {
-@@ -XXX,XX +XXX,XX @@ static void fdt_add_zdma_nodes(VersalVirt *s)
+         .irq        = 14,
-     }
+@@ -XXX,XX +XXX,XX @@ static const GMACModule gmac_module_list[] = {
          .irq        = 15,
          .base_addr  = 0xf0804000
      },
 -    {
 -        .irq        = 16,
 -        .base_addr  = 0xf0806000
 -    },
 -    {
 -        .irq        = 17,
 -        .base_addr  = 0xf0808000
 -    }
  };
  /* Returns the index of the GMAC module. */
@@ -XXX,XX +XXX,XX @@ static uint32_t gmac_read(QTestState *qts, const GMACModule *mod,
      return qtest_readl(qts, mod->base_addr + regno);
  }
-+static void fdt_add_sd_nodes(VersalVirt *s)
+-static uint16_t pcs_read(QTestState *qts, const GMACModule *mod,
-+{
+-                          NPCMRegister regno)
-+    const char clocknames[] = "clk_xin\0clk_ahb";
+-{
-+    const char compat[] = "arasan,sdhci-8.9a";
+-    uint32_t write_value = (regno & 0x3ffe00) >> 9;
-+    int i;
+-    qtest_writel(qts, PCS_BASE_ADDRESS + NPCM_PCS_IND_AC_BA, write_value);
-+
+-    uint32_t read_offset = regno & 0x1ff;
-+    for (i = ARRAY_SIZE(s->soc.pmc.iou.sd) - 1; i >= 0; i--) {
+-    return qtest_readl(qts, PCS_BASE_ADDRESS + read_offset);
-+        uint64_t addr = MM_PMC_SD0 + MM_PMC_SD0_SIZE * i;
+-}
-+        char *name = g_strdup_printf("/sdhci@%" PRIx64, addr);
+-
-+
+ /* Check that GMAC registers are reset to default value */
-+        qemu_fdt_add_subnode(s->fdt, name);
+ static void test_init(gconstpointer test_data)
 +
 +        qemu_fdt_setprop_cells(s->fdt, name, "clocks",
 +                               s->phandle.clk_25Mhz, s->phandle.clk_25Mhz);
 +        qemu_fdt_setprop(s->fdt, name, "clock-names",
 +                         clocknames, sizeof(clocknames));
 +        qemu_fdt_setprop_cells(s->fdt, name, "interrupts",
 +                               GIC_FDT_IRQ_TYPE_SPI, VERSAL_SD0_IRQ_0 + i * 2,
 +                               GIC_FDT_IRQ_FLAGS_LEVEL_HI);
 +        qemu_fdt_setprop_sized_cells(s->fdt, name, "reg",
 +                                     2, addr, 2, MM_PMC_SD0_SIZE);
 +        qemu_fdt_setprop(s->fdt, name, "compatible", compat, sizeof(compat));
 +        g_free(name);
 +    }
 +}
 +
  static void fdt_nop_memory_nodes(void *fdt, Error **errp)
  {
-     Error *err = NULL;
+     const TestData *td = test_data;
-@@ -XXX,XX +XXX,XX @@ static void create_virtio_regions(VersalVirt *s)
+     const GMACModule *mod = td->module;
-     }
+-    QTestState *qts = qtest_init("-machine npcm845-evb");
 +    QTestState *qts = qtest_init("-machine npcm750-evb");
  #define CHECK_REG32(regno, value) \
      do { \
          g_assert_cmphex(gmac_read(qts, mod, (regno)), ==, (value)); \
      } while (0)
 -#define CHECK_REG_PCS(regno, value) \
 -    do { \
 -        g_assert_cmphex(pcs_read(qts, mod, (regno)), ==, (value)); \
 -    } while (0)
 -
      CHECK_REG32(NPCM_DMA_BUS_MODE, 0x00020100);
      CHECK_REG32(NPCM_DMA_XMT_POLL_DEMAND, 0);
      CHECK_REG32(NPCM_DMA_RCV_POLL_DEMAND, 0);
@@ -XXX,XX +XXX,XX @@ static void test_init(gconstpointer test_data)
      CHECK_REG32(NPCM_GMAC_PTP_TAR, 0);
      CHECK_REG32(NPCM_GMAC_PTP_TTSR, 0);
 -    /* TODO Add registers PCS */
 -    if (mod->base_addr == 0xf0802000) {
 -        CHECK_REG_PCS(NPCM_PCS_SR_CTL_ID1, 0x699e);
 -        CHECK_REG_PCS(NPCM_PCS_SR_CTL_ID2, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_CTL_STS, 0x8000);
 -
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_CTRL, 0x1140);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_STS, 0x0109);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_DEV_ID1, 0x699e);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_DEV_ID2, 0x0ced0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_AN_ADV, 0x0020);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_LP_BABL, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_AN_EXPN, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_MII_EXT_STS, 0xc000);
 -
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_ABL, 0x0003);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_TX_MAX_DLY_LWR, 0x0038);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_TX_MAX_DLY_UPR, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_TX_MIN_DLY_LWR, 0x0038);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_TX_MIN_DLY_UPR, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_RX_MAX_DLY_LWR, 0x0058);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_RX_MAX_DLY_UPR, 0);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_RX_MIN_DLY_LWR, 0x0048);
 -        CHECK_REG_PCS(NPCM_PCS_SR_TIM_SYNC_RX_MIN_DLY_UPR, 0);
 -
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MMD_DIG_CTRL1, 0x2400);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_AN_CTRL, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_AN_INTR_STS, 0x000a);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_TC, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_DBG_CTRL, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_EEE_MCTRL0, 0x899c);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_EEE_TXTIMER, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_EEE_RXTIMER, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_LINK_TIMER_CTRL, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_EEE_MCTRL1, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_DIG_STS, 0x0010);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_ICG_ERRCNT1, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MISC_STS, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_RX_LSTS, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_TX_BSTCTRL0, 0x00a);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_TX_LVLCTRL0, 0x007f);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_TX_GENCTRL0, 0x0001);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_TX_GENCTRL1, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_TX_STS, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_RX_GENCTRL0, 0x0100);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_RX_GENCTRL1, 0x1100);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_RX_LOS_CTRL0, 0x000e);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MPLL_CTRL0, 0x0100);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MPLL_CTRL1, 0x0032);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MPLL_STS, 0x0001);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MISC_CTRL2, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_LVL_CTRL, 0x0019);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MISC_CTRL0, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_MP_MISC_CTRL1, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_DIG_CTRL2, 0);
 -        CHECK_REG_PCS(NPCM_PCS_VR_MII_DIG_ERRCNT_SEL, 0);
 -    }
 -
      qtest_quit(qts);
  }
-+static void sd_plugin_card(SDHCIState *sd, DriveInfo *di)
+diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
-+{
+index XXXXXXX..XXXXXXX 100644
-+    BlockBackend *blk = di ? blk_by_legacy_dinfo(di) : NULL;
+--- a/tests/qtest/meson.build
-+    DeviceState *card;
++++ b/tests/qtest/meson.build
-+
+@@ -XXX,XX +XXX,XX @@ qtests_npcm7xx = \
-+    card = qdev_create(qdev_get_child_bus(DEVICE(sd), "sd-bus"), TYPE_SD_CARD);
+    'npcm7xx_sdhci-test',
-+    object_property_add_child(OBJECT(sd), "card[*]", OBJECT(card),
+    'npcm7xx_smbus-test',
-+                              &error_fatal);
+    'npcm7xx_timer-test',
-+    qdev_prop_set_drive(card, "drive", blk, &error_fatal);
+-   'npcm7xx_watchdog_timer-test'] + \
-+    object_property_set_bool(OBJECT(card), true, "realized", &error_fatal);
++   'npcm7xx_watchdog_timer-test',
-+}
++   'npcm_gmac-test'] + \
-+
+    (slirp.found() ? ['npcm7xx_emc-test'] : [])
- static void versal_virt_init(MachineState *machine)
+ qtests_aspeed = \
- {
+   ['aspeed_hace-test',
      VersalVirt *s = XLNX_VERSAL_VIRT_MACHINE(machine);
      int psci_conduit = QEMU_PSCI_CONDUIT_DISABLED;
 +    int i;
      /*
       * If the user provides an Operating System to be loaded, we expect them
@@ -XXX,XX +XXX,XX @@ static void versal_virt_init(MachineState *machine)
      fdt_add_gic_nodes(s);
      fdt_add_timer_nodes(s);
      fdt_add_zdma_nodes(s);
 +    fdt_add_sd_nodes(s);
      fdt_add_cpu_nodes(s, psci_conduit);
      fdt_add_clk_node(s, "/clk125", 125000000, s->phandle.clk_125Mhz);
      fdt_add_clk_node(s, "/clk25", 25000000, s->phandle.clk_25Mhz);
@@ -XXX,XX +XXX,XX @@ static void versal_virt_init(MachineState *machine)
      memory_region_add_subregion_overlap(get_system_memory(),
 , &s->soc.fpd.apu.mr, 0);
 +    /* Plugin SD cards.  */
 +    for (i = 0; i < ARRAY_SIZE(s->soc.pmc.iou.sd); i++) {
 +        sd_plugin_card(&s->soc.pmc.iou.sd[i], drive_get_next(IF_SD));
 +    }
 +
      s->binfo.ram_size = machine->ram_size;
      s->binfo.loader_start = 0x0;
      s->binfo.get_dtb = versal_virt_get_dtb;
 --
-.20.1
+.34.1

-[PULL 09/39] hw/arm: versal: Remove inclusion of arm_gicv3_common.h
+[PULL 18/35] hw/arm/smmuv3: add support for stage 1 access fault
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Luc Michel <luc.michel@amd.com>
-Remove inclusion of arm_gicv3_common.h, this already gets
+An access fault is raised when the Access Flag is not set in the
-included via xlnx-versal.h.
+looked-up PTE and the AFFD field is not set in the corresponding context
 descriptor. This was already implemented for stage 2. Implement it for
 stage 1 as well.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Signed-off-by: Luc Michel <luc.michel@amd.com>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Reviewed-by: Mostafa Saleh <smostafa@google.com>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 20200427181649.26851-2-edgar.iglesias@gmail.com
+Tested-by: Mostafa Saleh <smostafa@google.com>
 Message-id: 20240213082211.3330400-1-luc.michel@amd.com
 [PMM: tweaked comment text]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/xlnx-versal.c | 1 -
+ hw/arm/smmuv3-internal.h     |  1 +
-file changed, 1 deletion(-)
+ include/hw/arm/smmu-common.h |  1 +
  hw/arm/smmu-common.c         | 11 +++++++++++
  hw/arm/smmuv3.c              |  1 +
 files changed, 14 insertions(+)
-diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
+diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal.c
+--- a/hw/arm/smmuv3-internal.h
-+++ b/hw/arm/xlnx-versal.c
++++ b/hw/arm/smmuv3-internal.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static inline int pa_range(STE *ste)
- #include "hw/arm/boot.h"
+ #define CD_EPD(x, sel)   extract32((x)->word[0], (16 * (sel)) + 14, 1)
- #include "kvm_arm.h"
+ #define CD_ENDI(x)       extract32((x)->word[0], 15, 1)
- #include "hw/misc/unimp.h"
+ #define CD_IPS(x)        extract32((x)->word[1], 0 , 3)
--#include "hw/intc/arm_gicv3_common.h"
++#define CD_AFFD(x)       extract32((x)->word[1], 3 , 1)
- #include "hw/arm/xlnx-versal.h"
+ #define CD_TBI(x)        extract32((x)->word[1], 6 , 2)
- #include "hw/char/pl011.h"
+ #define CD_HD(x)         extract32((x)->word[1], 10 , 1)
  #define CD_HA(x)         extract32((x)->word[1], 11 , 1)
 diff --git a/include/hw/arm/smmu-common.h b/include/hw/arm/smmu-common.h
 index XXXXXXX..XXXXXXX 100644
 --- a/include/hw/arm/smmu-common.h
 +++ b/include/hw/arm/smmu-common.h
@@ -XXX,XX +XXX,XX @@ typedef struct SMMUTransCfg {
      bool disabled;             /* smmu is disabled */
      bool bypassed;             /* translation is bypassed */
      bool aborted;              /* translation is aborted */
 +    bool affd;                 /* AF fault disable */
      uint32_t iotlb_hits;       /* counts IOTLB hits */
      uint32_t iotlb_misses;     /* counts IOTLB misses*/
      /* Used by stage-1 only. */
 diff --git a/hw/arm/smmu-common.c b/hw/arm/smmu-common.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/smmu-common.c
 +++ b/hw/arm/smmu-common.c
@@ -XXX,XX +XXX,XX @@ static int smmu_ptw_64_s1(SMMUTransCfg *cfg,
                                       pte_addr, pte, iova, gpa,
                                       block_size >> 20);
          }
 +
 +        /*
 +         * QEMU does not currently implement HTTU, so if AFFD and PTE.AF
 +         * are 0 we take an Access flag fault. (5.4. Context Descriptor)
 +         * An Access flag fault takes priority over a Permission fault.
 +         */
 +        if (!PTE_AF(pte) && !cfg->affd) {
 +            info->type = SMMU_PTW_ERR_ACCESS;
 +            goto error;
 +        }
 +
          ap = PTE_AP(pte);
          if (is_permission_fault(ap, perm)) {
              info->type = SMMU_PTW_ERR_PERMISSION;
 diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/smmuv3.c
 +++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ static int decode_cd(SMMUTransCfg *cfg, CD *cd, SMMUEventInfo *event)
      cfg->oas = MIN(oas2bits(SMMU_IDR5_OAS), cfg->oas);
      cfg->tbi = CD_TBI(cd);
      cfg->asid = CD_ASID(cd);
 +    cfg->affd = CD_AFFD(cd);
      trace_smmuv3_decode_cd(cfg->oas);
 --
-.20.1
+.34.1

-[PULL 16/39] hw/arm: versal: Add support for SD
+[PULL 19/35] hw/arm/stellaris: Convert ADC controller to Resettable interface
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Add support for SD.
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Message-id: 20240213155214.13619-2-philmd@linaro.org
 Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Reviewed-by: Luc Michel <luc.michel@greensocs.com>
 Message-id: 20200427181649.26851-9-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h | 12 ++++++++++++
+ hw/arm/stellaris.c | 6 ++++--
- hw/arm/xlnx-versal.c         | 31 +++++++++++++++++++++++++++++++
+file changed, 4 insertions(+), 2 deletions(-)
 files changed, 43 insertions(+)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/hw/arm/stellaris.c
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void stellaris_adc_trigger(void *opaque, int irq, int level)
  #include "hw/sysbus.h"
  #include "hw/arm/boot.h"
 +#include "hw/sd/sdhci.h"
  #include "hw/intc/arm_gicv3.h"
  #include "hw/char/pl011.h"
  #include "hw/dma/xlnx-zdma.h"
@@ -XXX,XX +XXX,XX @@
  #define XLNX_VERSAL_NR_UARTS   2
  #define XLNX_VERSAL_NR_GEMS    2
  #define XLNX_VERSAL_NR_ADMAS   8
 +#define XLNX_VERSAL_NR_SDS     2
  #define XLNX_VERSAL_NR_IRQS    192
  typedef struct Versal {
@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
          } iou;
      } lpd;
 +    /* The Platform Management Controller subsystem.  */
 +    struct {
 +        struct {
 +            SDHCIState sd[XLNX_VERSAL_NR_SDS];
 +        } iou;
 +    } pmc;
 +
      struct {
          MemoryRegion *mr_ddr;
          uint32_t psci_conduit;
@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
  #define VERSAL_GEM1_IRQ_0          58
  #define VERSAL_GEM1_WAKE_IRQ_0     59
  #define VERSAL_ADMA_IRQ_0          60
 +#define VERSAL_SD0_IRQ_0           126
  /* Architecturally reserved IRQs suitable for virtualization.  */
  #define VERSAL_RSVD_IRQ_FIRST 111
@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
  #define MM_FPD_CRF                  0xfd1a0000U
  #define MM_FPD_CRF_SIZE             0x140000
 +#define MM_PMC_SD0                  0xf1040000U
 +#define MM_PMC_SD0_SIZE             0x10000
  #define MM_PMC_CRP                  0xf1260000U
  #define MM_PMC_CRP_SIZE             0x10000
  #endif
 diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/xlnx-versal.c
 +++ b/hw/arm/xlnx-versal.c
@@ -XXX,XX +XXX,XX @@ static void versal_create_admas(Versal *s, qemu_irq *pic)
      }
  }
-+#define SDHCI_CAPABILITIES  0x280737ec6481 /* Same as on ZynqMP.  */
+-static void stellaris_adc_reset(StellarisADCState *s)
-+static void versal_create_sds(Versal *s, qemu_irq *pic)
++static void stellaris_adc_reset_hold(Object *obj)
-+{
+ {
-+    int i;
++    StellarisADCState *s = STELLARIS_ADC(obj);
-+
+     int n;
-+    for (i = 0; i < ARRAY_SIZE(s->pmc.iou.sd); i++) {
-+        DeviceState *dev;
+     for (n = 0; n < 4; n++) {
-+        MemoryRegion *mr;
+@@ -XXX,XX +XXX,XX @@ static void stellaris_adc_init(Object *obj)
-+
+     memory_region_init_io(&s->iomem, obj, &stellaris_adc_ops, s,
-+        sysbus_init_child_obj(OBJECT(s), "sd[*]",
+                           "adc", 0x1000);
-+                              &s->pmc.iou.sd[i], sizeof(s->pmc.iou.sd[i]),
+     sysbus_init_mmio(sbd, &s->iomem);
-+                              TYPE_SYSBUS_SDHCI);
+-    stellaris_adc_reset(s);
-+        dev = DEVICE(&s->pmc.iou.sd[i]);
+     qdev_init_gpio_in(dev, stellaris_adc_trigger, 1);
-+
+ }
-+        object_property_set_uint(OBJECT(dev),
-+                                 3, "sd-spec-version", &error_fatal);
+@@ -XXX,XX +XXX,XX @@ static const TypeInfo stellaris_i2c_info = {
-+        object_property_set_uint(OBJECT(dev), SDHCI_CAPABILITIES, "capareg",
+ static void stellaris_adc_class_init(ObjectClass *klass, void *data)
-+                                 &error_fatal);
+ {
-+        object_property_set_uint(OBJECT(dev), UHS_I, "uhs", &error_fatal);
+     DeviceClass *dc = DEVICE_CLASS(klass);
-+        qdev_init_nofail(dev);
++    ResettableClass *rc = RESETTABLE_CLASS(klass);
-+
-+        mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(dev), 0);
++    rc->phases.hold = stellaris_adc_reset_hold;
-+        memory_region_add_subregion(&s->mr_ps,
+     dc->vmsd = &vmstate_stellaris_adc;
-+                                    MM_PMC_SD0 + i * MM_PMC_SD0_SIZE, mr);
+ }
 +
 +        sysbus_connect_irq(SYS_BUS_DEVICE(dev), 0,
 +                           pic[VERSAL_SD0_IRQ_0 + i * 2]);
 +    }
 +}
 +
  /* This takes the board allocated linear DDR memory and creates aliases
   * for each split DDR range/aperture on the Versal address map.
   */
@@ -XXX,XX +XXX,XX @@ static void versal_realize(DeviceState *dev, Error **errp)
      versal_create_uarts(s, pic);
      versal_create_gems(s, pic);
      versal_create_admas(s, pic);
 +    versal_create_sds(s, pic);
      versal_map_ddr(s);
      versal_unimp(s);
 --
-.20.1
+.34.1

-[PULL 03/39] target/arm: Don't use a TLB for ARMMMUIdx_Stage2
+[PULL 20/35] hw/arm/stellaris: Convert I2C controller to Resettable interface
-We define ARMMMUIdx_Stage2 as being an MMU index which uses a QEMU
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
 TLB.  However we never actually use the TLB -- all stage 2 lookups
 are done by direct calls to get_phys_addr_lpae() followed by a
 physical address load via address_space_ld*().
-Remove Stage2 from the list of ARM MMU indexes which correspond to
+Suggested-by: Peter Maydell <peter.maydell@linaro.org>
-real core MMU indexes, and instead put it in the set of "NOTLB" ARM
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-MMU indexes.
+Message-id: 20240213155214.13619-3-philmd@linaro.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  hw/arm/stellaris.c | 26 ++++++++++++++++++++++----
 file changed, 22 insertions(+), 4 deletions(-)
-This allows us to drop NB_MMU_MODES to 11.  It also means we can
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 safely add support for the ARMv8.3-TTS2UXN extension, which adds
 permission bits to the stage 2 descriptors which define execute
 permission separatel for EL0 and EL1; supporting that while keeping
 Stage2 in a QEMU TLB would require us to use separate TLBs for
 "Stage2 for an EL0 access" and "Stage2 for an EL1 access", which is a
 lot of extra complication given we aren't even using the QEMU TLB.
 In the process of updating the comment on our MMU index use,
 fix a couple of other minor errors:
  * NS EL2 EL2&0 was missing from the list in the comment
  * some text hadn't been updated from when we bumped NB_MMU_MODES
    above 8
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20200330210400.11724-2-peter.maydell@linaro.org
 ---
  target/arm/cpu-param.h |   2 +-
  target/arm/cpu.h       |  21 +++++---
  target/arm/helper.c    | 112 ++++-------------------------------------
 files changed, 27 insertions(+), 108 deletions(-)
 diff --git a/target/arm/cpu-param.h b/target/arm/cpu-param.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu-param.h
+--- a/hw/arm/stellaris.c
-+++ b/target/arm/cpu-param.h
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void stellaris_sys_instance_init(Object *obj)
- # define TARGET_PAGE_BITS_MIN  10
+     s->sysclk = qdev_init_clock_out(DEVICE(s), "SYSCLK");
  #endif
 -#define NB_MMU_MODES 12
 +#define NB_MMU_MODES 11
  #endif
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
   *     handling via the TLB. The only way to do a stage 1 translation without
   *     the immediate stage 2 translation is via the ATS or AT system insns,
   *     which can be slow-pathed and always do a page table walk.
 + *     The only use of stage 2 translations is either as part of an s1+2
 + *     lookup or when loading the descriptors during a stage 1 page table walk,
 + *     and in both those cases we don't use the TLB.
   *  4. we can also safely fold together the "32 bit EL3" and "64 bit EL3"
   *     translation regimes, because they map reasonably well to each other
   *     and they can't both be active at the same time.
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
   * NS EL1 EL1&0 stage 1+2 (aka NS PL1)
   * NS EL1 EL1&0 stage 1+2 +PAN
   * NS EL0 EL2&0
 + * NS EL2 EL2&0
   * NS EL2 EL2&0 +PAN
   * NS EL2 (aka NS PL2)
   * S EL0 EL1&0 (aka S PL0)
   * S EL1 EL1&0 (not used if EL3 is 32 bit)
   * S EL1 EL1&0 +PAN
   * S EL3 (aka S PL1)
 - * NS EL1&0 stage 2
   *
 - * for a total of 12 different mmu_idx.
 + * for a total of 11 different mmu_idx.
   *
   * R profile CPUs have an MPU, but can use the same set of MMU indexes
   * as A profile. They only need to distinguish NS EL0 and NS EL1 (and
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
   * are not quite the same -- different CPU types (most notably M profile
   * vs A/R profile) would like to use MMU indexes with different semantics,
   * but since we don't ever need to use all of those in a single CPU we
 - * can avoid setting NB_MMU_MODES to more than 8. The lower bits of
 + * can avoid having to set NB_MMU_MODES to "total number of A profile MMU
 + * modes + total number of M profile MMU modes". The lower bits of
   * ARMMMUIdx are the core TLB mmu index, and the higher bits are always
   * the same for any particular CPU.
   * Variables of type ARMMUIdx are always full values, and the core
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdx {
      ARMMMUIdx_SE10_1_PAN = 9 | ARM_MMU_IDX_A,
      ARMMMUIdx_SE3        = 10 | ARM_MMU_IDX_A,
 -    ARMMMUIdx_Stage2     = 11 | ARM_MMU_IDX_A,
 -
      /*
       * These are not allocated TLBs and are used only for AT system
       * instructions or for the first stage of an S12 page table walk.
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdx {
      ARMMMUIdx_Stage1_E0 = 0 | ARM_MMU_IDX_NOTLB,
      ARMMMUIdx_Stage1_E1 = 1 | ARM_MMU_IDX_NOTLB,
      ARMMMUIdx_Stage1_E1_PAN = 2 | ARM_MMU_IDX_NOTLB,
 +    /*
 +     * Not allocated a TLB: used only for second stage of an S12 page
 +     * table walk, or for descriptor loads during first stage of an S1
 +     * page table walk. Note that if we ever want to have a TLB for this
 +     * then various TLB flush insns which currently are no-ops or flush
 +     * only stage 1 MMU indexes will need to change to flush stage 2.
 +     */
 +    ARMMMUIdx_Stage2     = 3 | ARM_MMU_IDX_NOTLB,
      /*
       * M-profile.
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdxBit {
      TO_CORE_BIT(SE10_1),
      TO_CORE_BIT(SE10_1_PAN),
      TO_CORE_BIT(SE3),
 -    TO_CORE_BIT(Stage2),
      TO_CORE_BIT(MUser),
      TO_CORE_BIT(MPriv),
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static void tlbiall_nsnh_write(CPUARMState *env, const ARMCPRegInfo *ri,
      tlb_flush_by_mmuidx(cs,
                          ARMMMUIdxBit_E10_1 |
                          ARMMMUIdxBit_E10_1_PAN |
 -                        ARMMMUIdxBit_E10_0 |
 -                        ARMMMUIdxBit_Stage2);
 +                        ARMMMUIdxBit_E10_0);
  }
- static void tlbiall_nsnh_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
+-/* I2C controller.  */
-@@ -XXX,XX +XXX,XX @@ static void tlbiall_nsnh_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
++/*
-     tlb_flush_by_mmuidx_all_cpus_synced(cs,
++ * I2C controller.
-                                         ARMMMUIdxBit_E10_1 |
++ * ??? For now we only implement the master interface.
-                                         ARMMMUIdxBit_E10_1_PAN |
++ */
--                                        ARMMMUIdxBit_E10_0 |
--                                        ARMMMUIdxBit_Stage2);
+ #define TYPE_STELLARIS_I2C "stellaris-i2c"
-+                                        ARMMMUIdxBit_E10_0);
+ OBJECT_DECLARE_SIMPLE_TYPE(stellaris_i2c_state, STELLARIS_I2C)
@@ -XXX,XX +XXX,XX @@ static void stellaris_i2c_write(void *opaque, hwaddr offset,
      stellaris_i2c_update(s);
  }
--static void tlbiipas2_write(CPUARMState *env, const ARMCPRegInfo *ri,
+-static void stellaris_i2c_reset(stellaris_i2c_state *s)
--                            uint64_t value)
++static void stellaris_i2c_reset_enter(Object *obj, ResetType type)
--{
+ {
--    /* Invalidate by IPA. This has to invalidate any structures that
++    stellaris_i2c_state *s = STELLARIS_I2C(obj);
--     * contain only stage 2 translation information, but does not need
++
--     * to apply to structures that contain combined stage 1 and stage 2
+     if (s->mcs & STELLARIS_I2C_MCS_BUSBSY)
--     * translation information.
+         i2c_end_transfer(s->bus);
--     * This must NOP if EL2 isn't implemented or SCR_EL3.NS is zero.
++}
--     */
++
--    CPUState *cs = env_cpu(env);
++static void stellaris_i2c_reset_hold(Object *obj)
--    uint64_t pageaddr;
++{
--
++    stellaris_i2c_state *s = STELLARIS_I2C(obj);
--    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
--        return;
+     s->msa = 0;
--    }
+     s->mcs = 0;
--
+@@ -XXX,XX +XXX,XX @@ static void stellaris_i2c_reset(stellaris_i2c_state *s)
--    pageaddr = sextract64(value << 12, 0, 40);
+     s->mimr = 0;
--
+     s->mris = 0;
--    tlb_flush_page_by_mmuidx(cs, pageaddr, ARMMMUIdxBit_Stage2);
+     s->mcr = 0;
--}
++}
--
++
--static void tlbiipas2_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
++static void stellaris_i2c_reset_exit(Object *obj)
--                               uint64_t value)
++{
--{
++    stellaris_i2c_state *s = STELLARIS_I2C(obj);
--    CPUState *cs = env_cpu(env);
++
--    uint64_t pageaddr;
+     stellaris_i2c_update(s);
 -
 -    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
 -        return;
 -    }
 -
 -    pageaddr = sextract64(value << 12, 0, 40);
 -
 -    tlb_flush_page_by_mmuidx_all_cpus_synced(cs, pageaddr,
 -                                             ARMMMUIdxBit_Stage2);
 -}
  static void tlbiall_hyp_write(CPUARMState *env, const ARMCPRegInfo *ri,
                                uint64_t value)
@@ -XXX,XX +XXX,XX @@ static void vttbr_write(CPUARMState *env, const ARMCPRegInfo *ri,
          tlb_flush_by_mmuidx(cs,
                              ARMMMUIdxBit_E10_1 |
                              ARMMMUIdxBit_E10_1_PAN |
 -                            ARMMMUIdxBit_E10_0 |
 -                            ARMMMUIdxBit_Stage2);
 +                            ARMMMUIdxBit_E10_0);
          raw_write(env, ri, value);
      }
  }
-@@ -XXX,XX +XXX,XX @@ static int alle1_tlbmask(CPUARMState *env)
-         return ARMMMUIdxBit_SE10_1 |
+@@ -XXX,XX +XXX,XX @@ static void stellaris_i2c_init(Object *obj)
-                ARMMMUIdxBit_SE10_1_PAN |
+     memory_region_init_io(&s->iomem, obj, &stellaris_i2c_ops, s,
-                ARMMMUIdxBit_SE10_0;
+                           "i2c", 0x1000);
--    } else if (arm_feature(env, ARM_FEATURE_EL2)) {
+     sysbus_init_mmio(sbd, &s->iomem);
--        return ARMMMUIdxBit_E10_1 |
+-    /* ??? For now we only implement the master interface.  */
--               ARMMMUIdxBit_E10_1_PAN |
+-    stellaris_i2c_reset(s);
 -               ARMMMUIdxBit_E10_0 |
 -               ARMMMUIdxBit_Stage2;
      } else {
          return ARMMMUIdxBit_E10_1 |
                 ARMMMUIdxBit_E10_1_PAN |
@@ -XXX,XX +XXX,XX @@ static void tlbi_aa64_vae3is_write(CPUARMState *env, const ARMCPRegInfo *ri,
                                               ARMMMUIdxBit_SE3);
  }
--static void tlbi_aa64_ipas2e1_write(CPUARMState *env, const ARMCPRegInfo *ri,
+ /* Analogue to Digital Converter.  This is only partially implemented,
--                                    uint64_t value)
+@@ -XXX,XX +XXX,XX @@ type_init(stellaris_machine_init)
--{
+ static void stellaris_i2c_class_init(ObjectClass *klass, void *data)
 -    /* Invalidate by IPA. This has to invalidate any structures that
 -     * contain only stage 2 translation information, but does not need
 -     * to apply to structures that contain combined stage 1 and stage 2
 -     * translation information.
 -     * This must NOP if EL2 isn't implemented or SCR_EL3.NS is zero.
 -     */
 -    ARMCPU *cpu = env_archcpu(env);
 -    CPUState *cs = CPU(cpu);
 -    uint64_t pageaddr;
 -
 -    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
 -        return;
 -    }
 -
 -    pageaddr = sextract64(value << 12, 0, 48);
 -
 -    tlb_flush_page_by_mmuidx(cs, pageaddr, ARMMMUIdxBit_Stage2);
 -}
 -
 -static void tlbi_aa64_ipas2e1is_write(CPUARMState *env, const ARMCPRegInfo *ri,
 -                                      uint64_t value)
 -{
 -    CPUState *cs = env_cpu(env);
 -    uint64_t pageaddr;
 -
 -    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
 -        return;
 -    }
 -
 -    pageaddr = sextract64(value << 12, 0, 48);
 -
 -    tlb_flush_page_by_mmuidx_all_cpus_synced(cs, pageaddr,
 -                                             ARMMMUIdxBit_Stage2);
 -}
 -
  static CPAccessResult aa64_zva_access(CPUARMState *env, const ARMCPRegInfo *ri,
                                        bool isread)
  {
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
+     DeviceClass *dc = DEVICE_CLASS(klass);
-       .writefn = tlbi_aa64_vae1_write },
++    ResettableClass *rc = RESETTABLE_CLASS(klass);
-     { .name = "TLBI_IPAS2E1IS", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 1,
++    rc->phases.enter = stellaris_i2c_reset_enter;
--      .access = PL2_W, .type = ARM_CP_NO_RAW,
++    rc->phases.hold = stellaris_i2c_reset_hold;
--      .writefn = tlbi_aa64_ipas2e1is_write },
++    rc->phases.exit = stellaris_i2c_reset_exit;
-+      .access = PL2_W, .type = ARM_CP_NOP },
+     dc->vmsd = &vmstate_stellaris_i2c;
-     { .name = "TLBI_IPAS2LE1IS", .state = ARM_CP_STATE_AA64,
+ }
-       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 -      .writefn = tlbi_aa64_ipas2e1is_write },
 +      .access = PL2_W, .type = ARM_CP_NOP },
      { .name = "TLBI_ALLE1IS", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 3, .opc2 = 4,
        .access = PL2_W, .type = ARM_CP_NO_RAW,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
        .writefn = tlbi_aa64_alle1is_write },
      { .name = "TLBI_IPAS2E1", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 1,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 -      .writefn = tlbi_aa64_ipas2e1_write },
 +      .access = PL2_W, .type = ARM_CP_NOP },
      { .name = "TLBI_IPAS2LE1", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 5,
 -      .access = PL2_W, .type = ARM_CP_NO_RAW,
 -      .writefn = tlbi_aa64_ipas2e1_write },
 +      .access = PL2_W, .type = ARM_CP_NOP },
      { .name = "TLBI_ALLE1", .state = ARM_CP_STATE_AA64,
        .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 7, .opc2 = 4,
        .access = PL2_W, .type = ARM_CP_NO_RAW,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
        .writefn = tlbimva_hyp_is_write },
      { .name = "TLBIIPAS2",
        .cp = 15, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 1,
 -      .type = ARM_CP_NO_RAW, .access = PL2_W,
 -      .writefn = tlbiipas2_write },
 +      .type = ARM_CP_NOP, .access = PL2_W },
      { .name = "TLBIIPAS2IS",
        .cp = 15, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 1,
 -      .type = ARM_CP_NO_RAW, .access = PL2_W,
 -      .writefn = tlbiipas2_is_write },
 +      .type = ARM_CP_NOP, .access = PL2_W },
      { .name = "TLBIIPAS2L",
        .cp = 15, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 5,
 -      .type = ARM_CP_NO_RAW, .access = PL2_W,
 -      .writefn = tlbiipas2_write },
 +      .type = ARM_CP_NOP, .access = PL2_W },
      { .name = "TLBIIPAS2LIS",
        .cp = 15, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 5,
 -      .type = ARM_CP_NO_RAW, .access = PL2_W,
 -      .writefn = tlbiipas2_is_write },
 +      .type = ARM_CP_NOP, .access = PL2_W },
      /* 32 bit cache operations */
      { .name = "ICIALLUIS", .cp = 15, .opc1 = 0, .crn = 7, .crm = 1, .opc2 = 0,
        .type = ARM_CP_NOP, .access = PL1_W, .accessfn = aa64_cacheop_pou_access },
 --
-.20.1
+.34.1

-[PULL 14/39] hw/arm: versal: Embed the ADMAs into the SoC type
+[PULL 21/35] hw/arm/stellaris: Add missing QOM 'machine' parent
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Embed the ADMAs into the SoC type.
+QDev objects created with qdev_new() need to manually add
 their parent relationship with object_property_add_child().
-Suggested-by: Peter Maydell <peter.maydell@linaro.org>
+This commit plug the devices which aren't part of the SoC;
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+they will be plugged into a SoC container in the next one.
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Message-id: 20200427181649.26851-7-edgar.iglesias@gmail.com
+Message-id: 20240213155214.13619-4-philmd@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h |  3 ++-
+ hw/arm/stellaris.c | 4 ++++
- hw/arm/xlnx-versal.c         | 14 +++++++-------
+file changed, 4 insertions(+)
 files changed, 9 insertions(+), 8 deletions(-)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/hw/arm/stellaris.c
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
- #include "hw/arm/boot.h"
+                                    &error_fatal);
- #include "hw/intc/arm_gicv3.h"
- #include "hw/char/pl011.h"
+             ssddev = qdev_new("ssd0323");
-+#include "hw/dma/xlnx-zdma.h"
++            object_property_add_child(OBJECT(ms), "oled", OBJECT(ssddev));
- #include "hw/net/cadence_gem.h"
+             qdev_prop_set_uint8(ssddev, "cs", 1);
+             qdev_realize_and_unref(ssddev, bus, &error_fatal);
- #define TYPE_XLNX_VERSAL "xlnx-versal"
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+             gpio_d_splitter = qdev_new(TYPE_SPLIT_IRQ);
-         struct {
++            object_property_add_child(OBJECT(ms), "splitter",
-             PL011State uart[XLNX_VERSAL_NR_UARTS];
++                                      OBJECT(gpio_d_splitter));
-             CadenceGEMState gem[XLNX_VERSAL_NR_GEMS];
+             qdev_prop_set_uint32(gpio_d_splitter, "num-lines", 2);
--            SysBusDevice *adma[XLNX_VERSAL_NR_ADMAS];
+             qdev_realize_and_unref(gpio_d_splitter, NULL, &error_fatal);
-+            XlnxZDMA adma[XLNX_VERSAL_NR_ADMAS];
+             qdev_connect_gpio_out(
-         } iou;
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-     } lpd;
+         DeviceState *gpad;
-diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
+         gpad = qdev_new(TYPE_STELLARIS_GAMEPAD);
-index XXXXXXX..XXXXXXX 100644
++        object_property_add_child(OBJECT(ms), "gamepad", OBJECT(gpad));
---- a/hw/arm/xlnx-versal.c
+         for (i = 0; i < ARRAY_SIZE(gpad_keycode); i++) {
-+++ b/hw/arm/xlnx-versal.c
+             qlist_append_int(gpad_keycode_list, gpad_keycode[i]);
-@@ -XXX,XX +XXX,XX @@ static void versal_create_admas(Versal *s, qemu_irq *pic)
+         }
          DeviceState *dev;
          MemoryRegion *mr;
 -        dev = qdev_create(NULL, "xlnx.zdma");
 -        s->lpd.iou.adma[i] = SYS_BUS_DEVICE(dev);
 -        object_property_set_int(OBJECT(s->lpd.iou.adma[i]), 128, "bus-width",
 -                                &error_abort);
 -        object_property_add_child(OBJECT(s), name, OBJECT(dev), &error_fatal);
 +        sysbus_init_child_obj(OBJECT(s), name,
 +                              &s->lpd.iou.adma[i], sizeof(s->lpd.iou.adma[i]),
 +                              TYPE_XLNX_ZDMA);
 +        dev = DEVICE(&s->lpd.iou.adma[i]);
 +        object_property_set_int(OBJECT(dev), 128, "bus-width", &error_abort);
          qdev_init_nofail(dev);
 -        mr = sysbus_mmio_get_region(s->lpd.iou.adma[i], 0);
 +        mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(dev), 0);
          memory_region_add_subregion(&s->mr_ps,
                                      MM_ADMA_CH0 + i * MM_ADMA_CH0_SIZE, mr);
 -        sysbus_connect_irq(s->lpd.iou.adma[i], 0, pic[VERSAL_ADMA_IRQ_0 + i]);
 +        sysbus_connect_irq(SYS_BUS_DEVICE(dev), 0, pic[VERSAL_ADMA_IRQ_0 + i]);
          g_free(name);
      }
  }
 --
-.20.1
+.34.1

-[PULL 13/39] hw/arm: versal: Embed the GEMs into the SoC type
+[PULL 22/35] hw/arm/stellaris: Add missing QOM 'SoC' parent
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Embed the GEMs into the SoC type.
+QDev objects created with qdev_new() need to manually add
 their parent relationship with object_property_add_child().
-Suggested-by: Peter Maydell <peter.maydell@linaro.org>
+Since we don't model the SoC, just use a QOM container.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
+Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
+Message-id: 20240213155214.13619-5-philmd@linaro.org
 Message-id: 20200427181649.26851-6-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/hw/arm/xlnx-versal.h |  3 ++-
+ hw/arm/stellaris.c | 11 ++++++++++-
- hw/arm/xlnx-versal.c         | 15 ++++++++-------
+file changed, 10 insertions(+), 1 deletion(-)
 files changed, 10 insertions(+), 8 deletions(-)
-diff --git a/include/hw/arm/xlnx-versal.h b/include/hw/arm/xlnx-versal.h
+diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/arm/xlnx-versal.h
+--- a/hw/arm/stellaris.c
-+++ b/include/hw/arm/xlnx-versal.h
++++ b/hw/arm/stellaris.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
- #include "hw/arm/boot.h"
+      * 400fe000 system control
- #include "hw/intc/arm_gicv3.h"
+      */
- #include "hw/char/pl011.h"
-+#include "hw/net/cadence_gem.h"
++    Object *soc_container;
+     DeviceState *gpio_dev[7], *nvic;
- #define TYPE_XLNX_VERSAL "xlnx-versal"
+     qemu_irq gpio_in[7][8];
- #define XLNX_VERSAL(obj) OBJECT_CHECK(Versal, (obj), TYPE_XLNX_VERSAL)
+     qemu_irq gpio_out[7][8];
-@@ -XXX,XX +XXX,XX @@ typedef struct Versal {
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
+     flash_size = (((board->dc0 & 0xffff) + 1) << 1) * 1024;
-         struct {
+     sram_size = ((board->dc0 >> 18) + 1) * 1024;
-             PL011State uart[XLNX_VERSAL_NR_UARTS];
--            SysBusDevice *gem[XLNX_VERSAL_NR_GEMS];
++    soc_container = object_new("container");
-+            CadenceGEMState gem[XLNX_VERSAL_NR_GEMS];
++    object_property_add_child(OBJECT(ms), "soc", soc_container);
-             SysBusDevice *adma[XLNX_VERSAL_NR_ADMAS];
++
-         } iou;
+     /* Flash programming is done via the SCU, so pretend it is ROM.  */
-     } lpd;
+     memory_region_init_rom(flash, NULL, "stellaris.flash", flash_size,
-diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
+                            &error_fatal);
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
---- a/hw/arm/xlnx-versal.c
+      * need its sysclk output.
-+++ b/hw/arm/xlnx-versal.c
+      */
-@@ -XXX,XX +XXX,XX @@ static void versal_create_gems(Versal *s, qemu_irq *pic)
+     ssys_dev = qdev_new(TYPE_STELLARIS_SYS);
-         DeviceState *dev;
++    object_property_add_child(soc_container, "sys", OBJECT(ssys_dev));
-         MemoryRegion *mr;
+     /*
--        dev = qdev_create(NULL, "cadence_gem");
+      * Most devices come preprogrammed with a MAC address in the user data.
--        s->lpd.iou.gem[i] = SYS_BUS_DEVICE(dev);
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
--        object_property_add_child(OBJECT(s), name, OBJECT(dev), &error_fatal);
+     sysbus_realize_and_unref(SYS_BUS_DEVICE(ssys_dev), &error_fatal);
-+        sysbus_init_child_obj(OBJECT(s), name,
-+                              &s->lpd.iou.gem[i], sizeof(s->lpd.iou.gem[i]),
+     nvic = qdev_new(TYPE_ARMV7M);
-+                              TYPE_CADENCE_GEM);
++    object_property_add_child(soc_container, "v7m", OBJECT(nvic));
-+        dev = DEVICE(&s->lpd.iou.gem[i]);
+     qdev_prop_set_uint32(nvic, "num-irq", NUM_IRQ_LINES);
-         if (nd->used) {
+     qdev_prop_set_uint8(nvic, "num-prio-bits", NUM_PRIO_BITS);
-             qemu_check_nic_model(nd, "cadence_gem");
+     qdev_prop_set_string(nvic, "cpu-type", ms->cpu_type);
-             qdev_set_nic_properties(dev, nd);
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-         }
--        object_property_set_int(OBJECT(s->lpd.iou.gem[i]),
+             dev = qdev_new(TYPE_STELLARIS_GPTM);
-+        object_property_set_int(OBJECT(dev),
+             sbd = SYS_BUS_DEVICE(dev);
-, "num-priority-queues",
++            object_property_add_child(soc_container, "gptm[*]", OBJECT(dev));
-                                 &error_abort);
+             qdev_connect_clock_in(dev, "clk",
--        object_property_set_link(OBJECT(s->lpd.iou.gem[i]),
+                                   qdev_get_clock_out(ssys_dev, "SYSCLK"));
-+        object_property_set_link(OBJECT(dev),
+             sysbus_realize_and_unref(sbd, &error_fatal);
-                                  OBJECT(&s->mr_ps), "dma",
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-                                  &error_abort);
-         qdev_init_nofail(dev);
+     if (board->dc1 & (1 << 3)) { /* watchdog present */
+         dev = qdev_new(TYPE_LUMINARY_WATCHDOG);
--        mr = sysbus_mmio_get_region(s->lpd.iou.gem[i], 0);
+-
-+        mr = sysbus_mmio_get_region(SYS_BUS_DEVICE(dev), 0);
++        object_property_add_child(soc_container, "wdg", OBJECT(dev));
-         memory_region_add_subregion(&s->mr_ps, addrs[i], mr);
+         qdev_connect_clock_in(dev, "WDOGCLK",
+                               qdev_get_clock_out(ssys_dev, "SYSCLK"));
--        sysbus_connect_irq(s->lpd.iou.gem[i], 0, pic[irqs[i]]);
-+        sysbus_connect_irq(SYS_BUS_DEVICE(dev), 0, pic[irqs[i]]);
+@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
-         g_free(name);
+             SysBusDevice *sbd;
-     }
- }
+             dev = qdev_new("pl011_luminary");
 +            object_property_add_child(soc_container, "uart[*]", OBJECT(dev));
              sbd = SYS_BUS_DEVICE(dev);
              qdev_prop_set_chr(dev, "chardev", serial_hd(i));
              sysbus_realize_and_unref(sbd, &error_fatal);
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
          DeviceState *enet;
          enet = qdev_new("stellaris_enet");
 +        object_property_add_child(soc_container, "enet", OBJECT(enet));
          if (nd) {
              qdev_set_nic_properties(enet, nd);
          } else {
 --
-.20.1
+.34.1

-[PULL 04/39] target/arm: Use enum constant in get_phys_addr_lpae() call
+[PULL 23/35] target/arm: Use new CBAR encoding for all v8 CPUs, not all aarch64 CPUs
-The access_type argument to get_phys_addr_lpae() is an MMUAccessType;
+We support two different encodings for the AArch32 IMPDEF
-use the enum constant MMU_DATA_LOAD rather than a literal 0 when we
+CBAR register -- older cores like the Cortex A9, A7, A15
-call it in S1_ptw_translate().
+have this at 4, c15, c0, 0; newer cores like the
 Cortex A35, A53, A57 and A72 have it at 1 c15 c0 0.
 When we implemented this we picked which encoding to
 use based on whether the CPU set ARM_FEATURE_AARCH64.
 However this isn't right for three cases:
  * the qemu-system-arm 'max' CPU, which is supposed to be
    a variant on a Cortex-A57; it ought to use the same
    encoding the A57 does and which the AArch64 'max'
    exposes to AArch32 guest code
  * the Cortex-R52, which is AArch32-only but has the CBAR
    at the newer encoding (and where we incorrectly are
    not yet setting ARM_FEATURE_CBAR_RO anyway)
  * any possible future support for other v8 AArch32
    only CPUs, or for supporting "boot the CPU into
    AArch32 mode" on our existing cores like the A57 etc
 Make the decision of the encoding be based on whether
 the CPU implements the ARM_FEATURE_V8 flag instead.
 This changes the behaviour only for the qemu-system-arm
 '-cpu max'. We don't expect anybody to be relying on the
 old behaviour because:
  * it's not what the real hardware Cortex-A57 does
    (and that's what our ID register claims we are)
  * we don't implement the memory-mapped GICv3 support
    which is the only thing that exists at the peripheral
    base address pointed to by the register
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200330210400.11724-3-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-2-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 5 +++--
+ target/arm/helper.c | 2 +-
-file changed, 3 insertions(+), 2 deletions(-)
+file changed, 1 insertion(+), 1 deletion(-)
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static hwaddr S1_ptw_translate(CPUARMState *env, ARMMMUIdx mmu_idx,
+@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
-             pcacheattrs = &cacheattrs;
+          * AArch64 cores we might need to add a specific feature flag
-         }
+          * to indicate cores with "flavour 2" CBAR.
+          */
--        ret = get_phys_addr_lpae(env, addr, 0, ARMMMUIdx_Stage2, &s2pa,
+-        if (arm_feature(env, ARM_FEATURE_AARCH64)) {
--                                 &txattrs, &s2prot, &s2size, fi, pcacheattrs);
++        if (arm_feature(env, ARM_FEATURE_V8)) {
-+        ret = get_phys_addr_lpae(env, addr, MMU_DATA_LOAD, ARMMMUIdx_Stage2,
+             /* 32 bit view is [31:18] 0...0 [43:32]. */
-+                                 &s2pa, &txattrs, &s2prot, &s2size, fi,
+             uint32_t cbar32 = (extract64(cpu->reset_cbar, 18, 14) << 18)
-+                                 pcacheattrs);
+                 | extract64(cpu->reset_cbar, 32, 12);
          if (ret) {
              assert(fi->type != ARMFault_None);
              fi->s2addr = addr;
 --
-.20.1
+.34.1

-[PULL 25/39] target/arm: Convert V[US]DOT (vector) to decodetree
+[PULL 24/35] target/arm: The Cortex-R52 has a read-only CBAR
-Convert the V[US]DOT (vector) insns to decodetree.
+The Cortex-R52 implements the Configuration Base Address Register
 (CBAR), as a read-only register.  Add ARM_FEATURE_CBAR_RO to this CPU
 type, so that our implementation provides the register and the
 associated qdev property.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200430181003.21682-7-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-3-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |  4 ++++
+ target/arm/tcg/cpu32.c | 1 +
- target/arm/translate-neon.inc.c | 32 ++++++++++++++++++++++++++++++++
+file changed, 1 insertion(+)
  target/arm/translate.c          |  9 +--------
 files changed, 37 insertions(+), 8 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/target/arm/tcg/cpu32.c b/target/arm/tcg/cpu32.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/target/arm/tcg/cpu32.c
-+++ b/target/arm/neon-shared.decode
++++ b/target/arm/tcg/cpu32.c
-@@ -XXX,XX +XXX,XX @@ VCMLA          1111 110 rot:2 . 1 size:1 .... .... 1000 . q:1 . 0 .... \
+@@ -XXX,XX +XXX,XX @@ static void cortex_r52_initfn(Object *obj)
+     set_feature(&cpu->env, ARM_FEATURE_PMSA);
- VCADD          1111 110 rot:1 1 . 0 size:1 .... .... 1000 . q:1 . 0 .... \
+     set_feature(&cpu->env, ARM_FEATURE_NEON);
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp
+     set_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER);
-+
++    set_feature(&cpu->env, ARM_FEATURE_CBAR_RO);
-+# VUDOT and VSDOT
+     cpu->midr = 0x411fd133; /* r1p3 */
-+VDOT           1111 110 00 . 10 .... .... 1101 . q:1 . u:1 .... \
+     cpu->revidr = 0x00000000;
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp
+     cpu->reset_fpsid = 0x41034023;
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VCADD(DisasContext *s, arg_VCADD *a)
      tcg_temp_free_ptr(fpst);
      return true;
  }
 +
 +static bool trans_VDOT(DisasContext *s, arg_VDOT *a)
 +{
 +    int opr_sz;
 +    gen_helper_gvec_3 *fn_gvec;
 +
 +    if (!dc_isar_feature(aa32_dp, s)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist. */
 +    if (!dc_isar_feature(aa32_simd_r32, s) &&
 +        ((a->vd | a->vn | a->vm) & 0x10)) {
 +        return false;
 +    }
 +
 +    if ((a->vn | a->vm | a->vd) & a->q) {
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    opr_sz = (1 + a->q) * 8;
 +    fn_gvec = a->u ? gen_helper_gvec_udot_b : gen_helper_gvec_sdot_b;
 +    tcg_gen_gvec_3_ool(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(1, a->vn),
 +                       vfp_reg_offset(1, a->vm),
 +                       opr_sz, opr_sz, 0, fn_gvec);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_insn_3same_ext(DisasContext *s, uint32_t insn)
      bool is_long = false, q = extract32(insn, 6, 1);
      bool ptr_is_env = false;
 -    if ((insn & 0xfeb00f00) == 0xfc200d00) {
 -        /* V[US]DOT -- 1111 1100 0.10 .... .... 1101 .Q.U .... */
 -        bool u = extract32(insn, 4, 1);
 -        if (!dc_isar_feature(aa32_dp, s)) {
 -            return 1;
 -        }
 -        fn_gvec = u ? gen_helper_gvec_udot_b : gen_helper_gvec_sdot_b;
 -    } else if ((insn & 0xff300f10) == 0xfc200810) {
 +    if ((insn & 0xff300f10) == 0xfc200810) {
          /* VFM[AS]L -- 1111 1100 S.10 .... .... 1000 .Q.1 .... */
          int is_s = extract32(insn, 23, 1);
          if (!dc_isar_feature(aa32_fhm, s)) {
 --
-.20.1
+.34.1

-[PULL 30/39] target/arm: Convert Neon load/store multiple structures to decodetree
+[PULL 25/35] target/arm: Add Cortex-R52 IMPDEF sysregs
-Convert the Neon "load/store multiple structures" insns to decodetree.
+Add the Cortex-R52 IMPDEF sysregs, by defining them here and
 also by enabling the AUXCR feature which defines the ACTLR
 and HACTLR registers. As is our usual practice, we make these
 simple reads-as-zero stubs for now.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200430181003.21682-12-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-4-peter.maydell@linaro.org
 ---
- target/arm/neon-ls.decode       |   7 ++
+ target/arm/tcg/cpu32.c | 108 +++++++++++++++++++++++++++++++++++++++++
- target/arm/translate-neon.inc.c | 124 ++++++++++++++++++++++++++++++++
+file changed, 108 insertions(+)
  target/arm/translate.c          |  91 +----------------------
 files changed, 133 insertions(+), 89 deletions(-)
-diff --git a/target/arm/neon-ls.decode b/target/arm/neon-ls.decode
+diff --git a/target/arm/tcg/cpu32.c b/target/arm/tcg/cpu32.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-ls.decode
+--- a/target/arm/tcg/cpu32.c
-+++ b/target/arm/neon-ls.decode
++++ b/target/arm/tcg/cpu32.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void cortex_r5_initfn(Object *obj)
- #   0b1111_1001_xxx0_xxxx_xxxx_xxxx_xxxx_xxxx
+     define_arm_cp_regs(cpu, cortexr5_cp_reginfo);
  # This file works on the A32 encoding only; calling code for T32 has to
  # transform the insn into the A32 version first.
 +
 +%vd_dp  22:1 12:4
 +
 +# Neon load/store multiple structures
 +
 +VLDST_multiple 1111 0100 0 . l:1 0 rn:4 .... itype:4 size:2 align:2 rm:4 \
 +               vd=%vd_dp
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VFML_scalar(DisasContext *s, arg_VFML_scalar *a)
                         gen_helper_gvec_fmlal_idx_a32);
      return true;
  }
-+
-+static struct {
++static const ARMCPRegInfo cortex_r52_cp_reginfo[] = {
-+    int nregs;
++    { .name = "CPUACTLR", .cp = 15, .opc1 = 0, .crm = 15,
-+    int interleave;
++      .access = PL1_RW, .type = ARM_CP_CONST | ARM_CP_64BIT, .resetvalue = 0 },
-+    int spacing;
++    { .name = "IMP_ATCMREGIONR",
-+} const neon_ls_element_type[11] = {
++      .cp = 15, .opc1 = 0, .crn = 9, .crm = 1, .opc2 = 0,
-+    {1, 4, 1},
++      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
-+    {1, 4, 2},
++    { .name = "IMP_BTCMREGIONR",
-+    {4, 1, 1},
++      .cp = 15, .opc1 = 0, .crn = 9, .crm = 1, .opc2 = 1,
-+    {2, 2, 2},
++      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
-+    {1, 3, 1},
++    { .name = "IMP_CTCMREGIONR",
-+    {1, 3, 2},
++      .cp = 15, .opc1 = 0, .crn = 9, .crm = 1, .opc2 = 2,
-+    {3, 1, 1},
++      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
-+    {1, 1, 1},
++    { .name = "IMP_CSCTLR",
-+    {1, 2, 1},
++      .cp = 15, .opc1 = 1, .crn = 9, .crm = 1, .opc2 = 0,
-+    {1, 2, 2},
++      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
-+    {2, 1, 1}
++    { .name = "IMP_BPCTLR",
 +      .cp = 15, .opc1 = 1, .crn = 9, .crm = 1, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_MEMPROTCLR",
 +      .cp = 15, .opc1 = 1, .crn = 9, .crm = 1, .opc2 = 2,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_SLAVEPCTLR",
 +      .cp = 15, .opc1 = 0, .crn = 11, .crm = 0, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_PERIPHREGIONR",
 +      .cp = 15, .opc1 = 0, .crn = 15, .crm = 0, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_FLASHIFREGIONR",
 +      .cp = 15, .opc1 = 0, .crn = 15, .crm = 0, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_BUILDOPTR",
 +      .cp = 15, .opc1 = 0, .crn = 15, .crm = 2, .opc2 = 0,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_PINOPTR",
 +      .cp = 15, .opc1 = 0, .crn = 15, .crm = 2, .opc2 = 7,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_QOSR",
 +      .cp = 15, .opc1 = 1, .crn = 15, .crm = 3, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_BUSTIMEOUTR",
 +      .cp = 15, .opc1 = 1, .crn = 15, .crm = 3, .opc2 = 2,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_INTMONR",
 +      .cp = 15, .opc1 = 1, .crn = 15, .crm = 3, .opc2 = 4,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_ICERR0",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 0, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_ICERR1",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 0, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_DCERR0",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 1, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_DCERR1",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 1, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TCMERR0",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 2, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TCMERR1",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 2, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TCMSYNDR0",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 2, .opc2 = 2,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TCMSYNDR1",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 2, .opc2 = 3,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_FLASHERR0",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 3, .opc2 = 0,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_FLASHERR1",
 +      .cp = 15, .opc1 = 2, .crn = 15, .crm = 3, .opc2 = 1,
 +      .access = PL1_RW, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_CDBGDR0",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 0, .opc2 = 0,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_CBDGBR1",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 0, .opc2 = 1,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TESTR0",
 +      .cp = 15, .opc1 = 4, .crn = 15, .crm = 0, .opc2 = 0,
 +      .access = PL1_R, .type = ARM_CP_CONST, .resetvalue = 0 },
 +    { .name = "IMP_TESTR1",
 +      .cp = 15, .opc1 = 4, .crn = 15, .crm = 0, .opc2 = 1,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +    { .name = "IMP_CDBGDCI",
 +      .cp = 15, .opc1 = 0, .crn = 15, .crm = 15, .opc2 = 0,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +    { .name = "IMP_CDBGDCT",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 2, .opc2 = 0,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +    { .name = "IMP_CDBGICT",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 2, .opc2 = 1,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +    { .name = "IMP_CDBGDCD",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 4, .opc2 = 0,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +    { .name = "IMP_CDBGICD",
 +      .cp = 15, .opc1 = 3, .crn = 15, .crm = 4, .opc2 = 1,
 +      .access = PL1_W, .type = ARM_CP_NOP, .resetvalue = 0 },
 +};
 +
-+static void gen_neon_ldst_base_update(DisasContext *s, int rm, int rn,
-+                                      int stride)
-+{
-+    if (rm != 15) {
-+        TCGv_i32 base;
 +
-+        base = load_reg(s, rn);
+ static void cortex_r52_initfn(Object *obj)
-+        if (rm == 13) {
+ {
-+            tcg_gen_addi_i32(base, base, stride);
+     ARMCPU *cpu = ARM_CPU(obj);
-+        } else {
+@@ -XXX,XX +XXX,XX @@ static void cortex_r52_initfn(Object *obj)
-+            TCGv_i32 index;
+     set_feature(&cpu->env, ARM_FEATURE_NEON);
-+            index = load_reg(s, rm);
+     set_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER);
-+            tcg_gen_add_i32(base, base, index);
+     set_feature(&cpu->env, ARM_FEATURE_CBAR_RO);
-+            tcg_temp_free_i32(index);
++    set_feature(&cpu->env, ARM_FEATURE_AUXCR);
-+        }
+     cpu->midr = 0x411fd133; /* r1p3 */
-+        store_reg(s, rn, base);
+     cpu->revidr = 0x00000000;
-+    }
+     cpu->reset_fpsid = 0x41034023;
-+}
+@@ -XXX,XX +XXX,XX @@ static void cortex_r52_initfn(Object *obj)
      cpu->pmsav7_dregion = 16;
      cpu->pmsav8r_hdregion = 16;
 +
-+static bool trans_VLDST_multiple(DisasContext *s, arg_VLDST_multiple *a)
++    define_arm_cp_regs(cpu, cortex_r52_cp_reginfo);
 +{
 +    /* Neon load/store multiple structures */
 +    int nregs, interleave, spacing, reg, n;
 +    MemOp endian = s->be_data;
 +    int mmu_idx = get_mem_index(s);
 +    int size = a->size;
 +    TCGv_i64 tmp64;
 +    TCGv_i32 addr, tmp;
 +
 +    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist */
 +    if (!dc_isar_feature(aa32_simd_r32, s) && (a->vd & 0x10)) {
 +        return false;
 +    }
 +    if (a->itype > 10) {
 +        return false;
 +    }
 +    /* Catch UNDEF cases for bad values of align field */
 +    switch (a->itype & 0xc) {
 +    case 4:
 +        if (a->align >= 2) {
 +            return false;
 +        }
 +        break;
 +    case 8:
 +        if (a->align == 3) {
 +            return false;
 +        }
 +        break;
 +    default:
 +        break;
 +    }
 +    nregs = neon_ls_element_type[a->itype].nregs;
 +    interleave = neon_ls_element_type[a->itype].interleave;
 +    spacing = neon_ls_element_type[a->itype].spacing;
 +    if (size == 3 && (interleave | spacing) != 1) {
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    /* For our purposes, bytes are always little-endian.  */
 +    if (size == 0) {
 +        endian = MO_LE;
 +    }
 +    /*
 +     * Consecutive little-endian elements from a single register
 +     * can be promoted to a larger little-endian operation.
 +     */
 +    if (interleave == 1 && endian == MO_LE) {
 +        size = 3;
 +    }
 +    tmp64 = tcg_temp_new_i64();
 +    addr = tcg_temp_new_i32();
 +    tmp = tcg_const_i32(1 << size);
 +    load_reg_var(s, addr, a->rn);
 +    for (reg = 0; reg < nregs; reg++) {
 +        for (n = 0; n < 8 >> size; n++) {
 +            int xs;
 +            for (xs = 0; xs < interleave; xs++) {
 +                int tt = a->vd + reg + spacing * xs;
 +
 +                if (a->l) {
 +                    gen_aa32_ld_i64(s, tmp64, addr, mmu_idx, endian | size);
 +                    neon_store_element64(tt, n, size, tmp64);
 +                } else {
 +                    neon_load_element64(tmp64, tt, n, size);
 +                    gen_aa32_st_i64(s, tmp64, addr, mmu_idx, endian | size);
 +                }
 +                tcg_gen_add_i32(addr, addr, tmp);
 +            }
 +        }
 +    }
 +    tcg_temp_free_i32(addr);
 +    tcg_temp_free_i32(tmp);
 +    tcg_temp_free_i64(tmp64);
 +
 +    gen_neon_ldst_base_update(s, a->rm, a->rn, nregs * interleave * 8);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static void gen_neon_trn_u16(TCGv_i32 t0, TCGv_i32 t1)
  }
+ static void cortex_r5f_initfn(Object *obj)
 -static struct {
 -    int nregs;
 -    int interleave;
 -    int spacing;
 -} const neon_ls_element_type[11] = {
 -    {1, 4, 1},
 -    {1, 4, 2},
 -    {4, 1, 1},
 -    {2, 2, 2},
 -    {1, 3, 1},
 -    {1, 3, 2},
 -    {3, 1, 1},
 -    {1, 1, 1},
 -    {1, 2, 1},
 -    {1, 2, 2},
 -    {2, 1, 1}
 -};
 -
  /* Translate a NEON load/store element instruction.  Return nonzero if the
     instruction is invalid.  */
  static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
  {
      int rd, rn, rm;
 -    int op;
      int nregs;
 -    int interleave;
 -    int spacing;
      int stride;
      int size;
      int reg;
      int load;
 -    int n;
      int vec_size;
 -    int mmu_idx;
 -    MemOp endian;
      TCGv_i32 addr;
      TCGv_i32 tmp;
 -    TCGv_i32 tmp2;
 -    TCGv_i64 tmp64;
      if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
          return 1;
@@ -XXX,XX +XXX,XX @@ static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
      rn = (insn >> 16) & 0xf;
      rm = insn & 0xf;
      load = (insn & (1 << 21)) != 0;
 -    endian = s->be_data;
 -    mmu_idx = get_mem_index(s);
      if ((insn & (1 << 23)) == 0) {
 -        /* Load store all elements.  */
 -        op = (insn >> 8) & 0xf;
 -        size = (insn >> 6) & 3;
 -        if (op > 10)
 -            return 1;
 -        /* Catch UNDEF cases for bad values of align field */
 -        switch (op & 0xc) {
 -        case 4:
 -            if (((insn >> 5) & 1) == 1) {
 -                return 1;
 -            }
 -            break;
 -        case 8:
 -            if (((insn >> 4) & 3) == 3) {
 -                return 1;
 -            }
 -            break;
 -        default:
 -            break;
 -        }
 -        nregs = neon_ls_element_type[op].nregs;
 -        interleave = neon_ls_element_type[op].interleave;
 -        spacing = neon_ls_element_type[op].spacing;
 -        if (size == 3 && (interleave | spacing) != 1) {
 -            return 1;
 -        }
 -        /* For our purposes, bytes are always little-endian.  */
 -        if (size == 0) {
 -            endian = MO_LE;
 -        }
 -        /* Consecutive little-endian elements from a single register
 -         * can be promoted to a larger little-endian operation.
 -         */
 -        if (interleave == 1 && endian == MO_LE) {
 -            size = 3;
 -        }
 -        tmp64 = tcg_temp_new_i64();
 -        addr = tcg_temp_new_i32();
 -        tmp2 = tcg_const_i32(1 << size);
 -        load_reg_var(s, addr, rn);
 -        for (reg = 0; reg < nregs; reg++) {
 -            for (n = 0; n < 8 >> size; n++) {
 -                int xs;
 -                for (xs = 0; xs < interleave; xs++) {
 -                    int tt = rd + reg + spacing * xs;
 -
 -                    if (load) {
 -                        gen_aa32_ld_i64(s, tmp64, addr, mmu_idx, endian | size);
 -                        neon_store_element64(tt, n, size, tmp64);
 -                    } else {
 -                        neon_load_element64(tmp64, tt, n, size);
 -                        gen_aa32_st_i64(s, tmp64, addr, mmu_idx, endian | size);
 -                    }
 -                    tcg_gen_add_i32(addr, addr, tmp2);
 -                }
 -            }
 -        }
 -        tcg_temp_free_i32(addr);
 -        tcg_temp_free_i32(tmp2);
 -        tcg_temp_free_i64(tmp64);
 -        stride = nregs * interleave * 8;
 +        /* Load store all elements -- handled already by decodetree */
 +        return 1;
      } else {
          size = (insn >> 10) & 3;
          if (size == 3) {
 --
-.20.1
+.34.1

-[PULL 32/39] target/arm: Convert Neon 'load/store single structure' to decodetree
+[PULL 26/35] target/arm: Allow access to SPSR_hyp from hyp mode
-Convert the Neon "load/store single structure to one lane" insns to
+Architecturally, the AArch32 MSR/MRS to/from banked register
-decodetree.
+instructions are UNPREDICTABLE for attempts to access a banked
 register that the guest could access in a more direct way (e.g.
 using this insn to access r8_fiq when already in FIQ mode).  QEMU has
 chosen to UNDEF on all of these.
-As this is the last set of insns in the neon load/store group,
+However, for the case of accessing SPSR_hyp from hyp mode, it turns
-we can remove the whole disas_neon_ls_insn() function.
+out that real hardware permits this, with the same effect as if the
 guest had directly written to SPSR. Further, there is some
 guest code out there that assumes it can do this, because it
 happens to work on hardware: an example Cortex-R52 startup code
 fragment uses this, and it got copied into various other places,
 including Zephyr. Zephyr was fixed to not use this:
  https://github.com/zephyrproject-rtos/zephyr/issues/47330
 but other examples are still out there, like the selftest
 binary for the MPS3-AN536.
 For convenience of being able to run guest code, permit
 this UNPREDICTABLE access instead of UNDEFing it.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200430181003.21682-14-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-5-peter.maydell@linaro.org
 ---
- target/arm/neon-ls.decode       |  11 +++
+ target/arm/tcg/op_helper.c | 43 ++++++++++++++++++++++++++------------
- target/arm/translate-neon.inc.c |  89 +++++++++++++++++++
+ target/arm/tcg/translate.c | 19 +++++++++++------
- target/arm/translate.c          | 147 --------------------------------
+files changed, 43 insertions(+), 19 deletions(-)
 files changed, 100 insertions(+), 147 deletions(-)
-diff --git a/target/arm/neon-ls.decode b/target/arm/neon-ls.decode
+diff --git a/target/arm/tcg/op_helper.c b/target/arm/tcg/op_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-ls.decode
+--- a/target/arm/tcg/op_helper.c
-+++ b/target/arm/neon-ls.decode
++++ b/target/arm/tcg/op_helper.c
-@@ -XXX,XX +XXX,XX @@ VLDST_multiple 1111 0100 0 . l:1 0 rn:4 .... itype:4 size:2 align:2 rm:4 \
+@@ -XXX,XX +XXX,XX @@ static void msr_mrs_banked_exc_checks(CPUARMState *env, uint32_t tgtmode,
+      */
- VLD_all_lanes  1111 0100 1 . 1 0 rn:4 .... 11 n:2 size:2 t:1 a:1 rm:4 \
+     int curmode = env->uncached_cpsr & CPSR_M;
-                vd=%vd_dp
-+
+-    if (regno == 17) {
-+# Neon load/store single structure to one lane
+-        /* ELR_Hyp: a special case because access from tgtmode is OK */
-+%imm1_5_p1 5:1 !function=plus1
+-        if (curmode != ARM_CPU_MODE_HYP && curmode != ARM_CPU_MODE_MON) {
-+%imm1_6_p1 6:1 !function=plus1
+-            goto undef;
-+
++    if (tgtmode == ARM_CPU_MODE_HYP) {
 +VLDST_single   1111 0100 1 . l:1 0 rn:4 .... 00 n:2 reg_idx:3 align:1 rm:4 \
 +               vd=%vd_dp size=0 stride=1
 +VLDST_single   1111 0100 1 . l:1 0 rn:4 .... 01 n:2 reg_idx:2 align:2 rm:4 \
 +               vd=%vd_dp size=1 stride=%imm1_5_p1
 +VLDST_single   1111 0100 1 . l:1 0 rn:4 .... 10 n:2 reg_idx:1 align:3 rm:4 \
 +               vd=%vd_dp size=2 stride=%imm1_6_p1
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@
   * It might be possible to convert it to a standalone .c file eventually.
   */
 +static inline int plus1(DisasContext *s, int x)
 +{
 +    return x + 1;
 +}
 +
  /* Include the generated Neon decoder */
  #include "decode-neon-dp.inc.c"
  #include "decode-neon-ls.inc.c"
@@ -XXX,XX +XXX,XX @@ static bool trans_VLD_all_lanes(DisasContext *s, arg_VLD_all_lanes *a)
      return true;
  }
 +
 +static bool trans_VLDST_single(DisasContext *s, arg_VLDST_single *a)
 +{
 +    /* Neon load/store single structure to one lane */
 +    int reg;
 +    int nregs = a->n + 1;
 +    int vd = a->vd;
 +    TCGv_i32 addr, tmp;
 +
 +    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
 +        return false;
 +    }
 +
 +    /* UNDEF accesses to D16-D31 if they don't exist */
 +    if (!dc_isar_feature(aa32_simd_r32, s) && (a->vd & 0x10)) {
 +        return false;
 +    }
 +
 +    /* Catch the UNDEF cases. This is unavoidably a bit messy. */
 +    switch (nregs) {
 +    case 1:
 +        if (((a->align & (1 << a->size)) != 0) ||
 +            (a->size == 2 && ((a->align & 3) == 1 || (a->align & 3) == 2))) {
 +            return false;
 +        }
 +        break;
 +    case 3:
 +        if ((a->align & 1) != 0) {
 +            return false;
 +        }
 +        /* fall through */
 +    case 2:
 +        if (a->size == 2 && (a->align & 2) != 0) {
 +            return false;
 +        }
 +        break;
 +    case 4:
 +        if ((a->size == 2) && ((a->align & 3) == 3)) {
 +            return false;
 +        }
 +        break;
 +    default:
 +        abort();
 +    }
 +    if ((vd + a->stride * (nregs - 1)) > 31) {
 +        /*
-+         * Attempts to write off the end of the register file are
++         * Handle Hyp target regs first because some are special cases
-+         * UNPREDICTABLE; we choose to UNDEF because otherwise we would
++         * which don't want the usual "not accessible from tgtmode" check.
 +         * access off the end of the array that holds the register data.
 +         */
-+        return false;
++        switch (regno) {
-+    }
++        case 16 ... 17: /* ELR_Hyp, SPSR_Hyp */
-+
++            if (curmode != ARM_CPU_MODE_HYP && curmode != ARM_CPU_MODE_MON) {
-+    if (!vfp_access_check(s)) {
++                goto undef;
-+        return true;
++            }
-+    }
++            break;
-+
++        case 13:
-+    tmp = tcg_temp_new_i32();
++            if (curmode != ARM_CPU_MODE_MON) {
-+    addr = tcg_temp_new_i32();
++                goto undef;
-+    load_reg_var(s, addr, a->rn);
++            }
-+    /*
++            break;
-+     * TODO: if we implemented alignment exceptions, we should check
++        default:
-+     * addr against the alignment encoded in a->align here.
++            g_assert_not_reached();
-+     */
+         }
-+    for (reg = 0; reg < nregs; reg++) {
+         return;
-+        if (a->l) {
+     }
-+            gen_aa32_ld_i32(s, tmp, addr, get_mem_index(s),
+@@ -XXX,XX +XXX,XX @@ static void msr_mrs_banked_exc_checks(CPUARMState *env, uint32_t tgtmode,
-+                            s->be_data | a->size);
+         }
-+            neon_store_element(vd, a->reg_idx, a->size, tmp);
+     }
-+        } else { /* Store */
-+            neon_load_element(tmp, vd, a->reg_idx, a->size);
+-    if (tgtmode == ARM_CPU_MODE_HYP) {
-+            gen_aa32_st_i32(s, tmp, addr, get_mem_index(s),
+-        /* SPSR_Hyp, r13_hyp: accessible from Monitor mode only */
-+                            s->be_data | a->size);
+-        if (curmode != ARM_CPU_MODE_MON) {
-+        }
+-            goto undef;
-+        vd += a->stride;
+-        }
 +        tcg_gen_addi_i32(addr, addr, 1 << a->size);
 +    }
 +    tcg_temp_free_i32(addr);
 +    tcg_temp_free_i32(tmp);
 +
 +    gen_neon_ldst_base_update(s, a->rm, a->rn, (1 << a->size) * nregs);
 +
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static void gen_neon_trn_u16(TCGv_i32 t0, TCGv_i32 t1)
      tcg_temp_free_i32(rd);
  }
 -
 -/* Translate a NEON load/store element instruction.  Return nonzero if the
 -   instruction is invalid.  */
 -static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
 -{
 -    int rd, rn, rm;
 -    int nregs;
 -    int stride;
 -    int size;
 -    int reg;
 -    int load;
 -    TCGv_i32 addr;
 -    TCGv_i32 tmp;
 -
 -    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
 -        return 1;
 -    }
 -
--    /* FIXME: this access check should not take precedence over UNDEF
+     return;
--     * for invalid encodings; we will generate incorrect syndrome information
--     * for attempts to execute invalid vfp/neon encodings with FP disabled.
+ undef:
--     */
+@@ -XXX,XX +XXX,XX @@ void HELPER(msr_banked)(CPUARMState *env, uint32_t value, uint32_t tgtmode,
--    if (s->fp_excp_el) {
--        gen_exception_insn(s, s->pc_curr, EXCP_UDEF,
+     switch (regno) {
--                           syn_simd_access_trap(1, 0xe, false), s->fp_excp_el);
+     case 16: /* SPSRs */
--        return 0;
+-        env->banked_spsr[bank_number(tgtmode)] = value;
--    }
++        if (tgtmode == (env->uncached_cpsr & CPSR_M)) {
--
++            /* Only happens for SPSR_Hyp access in Hyp mode */
--    if (!s->vfp_enabled)
++            env->spsr = value;
--      return 1;
++        } else {
--    VFP_DREG_D(rd, insn);
++            env->banked_spsr[bank_number(tgtmode)] = value;
--    rn = (insn >> 16) & 0xf;
++        }
--    rm = insn & 0xf;
+         break;
--    load = (insn & (1 << 21)) != 0;
+     case 17: /* ELR_Hyp */
--    if ((insn & (1 << 23)) == 0) {
+         env->elr_el[2] = value;
--        /* Load store all elements -- handled already by decodetree */
+@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(mrs_banked)(CPUARMState *env, uint32_t tgtmode, uint32_t regno)
--        return 1;
--    } else {
+     switch (regno) {
--        size = (insn >> 10) & 3;
+     case 16: /* SPSRs */
--        if (size == 3) {
+-        return env->banked_spsr[bank_number(tgtmode)];
--            /* Load single element to all lanes -- handled by decodetree  */
++        if (tgtmode == (env->uncached_cpsr & CPSR_M)) {
--            return 1;
++            /* Only happens for SPSR_Hyp access in Hyp mode */
--        } else {
++            return env->spsr;
--            /* Single element.  */
++        } else {
--            int idx = (insn >> 4) & 0xf;
++            return env->banked_spsr[bank_number(tgtmode)];
--            int reg_idx;
++        }
--            switch (size) {
+     case 17: /* ELR_Hyp */
--            case 0:
+         return env->elr_el[2];
--                reg_idx = (insn >> 5) & 7;
+     case 13:
--                stride = 1;
+diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
--                break;
+index XXXXXXX..XXXXXXX 100644
--            case 1:
+--- a/target/arm/tcg/translate.c
--                reg_idx = (insn >> 6) & 3;
++++ b/target/arm/tcg/translate.c
--                stride = (insn & (1 << 5)) ? 2 : 1;
+@@ -XXX,XX +XXX,XX @@ static bool msr_banked_access_decode(DisasContext *s, int r, int sysm, int rn,
--                break;
+         break;
--            case 2:
+     case ARM_CPU_MODE_HYP:
--                reg_idx = (insn >> 7) & 1;
+         /*
--                stride = (insn & (1 << 6)) ? 2 : 1;
+-         * SPSR_hyp and r13_hyp can only be accessed from Monitor mode
--                break;
+-         * (and so we can forbid accesses from EL2 or below). elr_hyp
--            default:
+-         * can be accessed also from Hyp mode, so forbid accesses from
--                abort();
+-         * EL0 or EL1.
--            }
++         * r13_hyp can only be accessed from Monitor mode, and so we
--            nregs = ((insn >> 8) & 3) + 1;
++         * can forbid accesses from EL2 or below.
--            /* Catch the UNDEF cases. This is unavoidably a bit messy. */
++         * elr_hyp can be accessed also from Hyp mode, so forbid
--            switch (nregs) {
++         * accesses from EL0 or EL1.
--            case 1:
++         * SPSR_hyp is supposed to be in the same category as r13_hyp
--                if (((idx & (1 << size)) != 0) ||
++         * and UNPREDICTABLE if accessed from anything except Monitor
--                    (size == 2 && ((idx & 3) == 1 || (idx & 3) == 2))) {
++         * mode. However there is some real-world code that will do
--                    return 1;
++         * it because at least some hardware happens to permit the
--                }
++         * access. (Notably a standard Cortex-R52 startup code fragment
--                break;
++         * does this.) So we permit SPSR_hyp from Hyp mode also, to allow
--            case 3:
++         * this (incorrect) guest code to run.
--                if ((idx & 1) != 0) {
+          */
--                    return 1;
+-        if (!arm_dc_feature(s, ARM_FEATURE_EL2) || s->current_el < 2 ||
--                }
+-            (s->current_el < 3 && *regno != 17)) {
--                /* fall through */
++        if (!arm_dc_feature(s, ARM_FEATURE_EL2) || s->current_el < 2
--            case 2:
++            || (s->current_el < 3 && *regno != 16 && *regno != 17)) {
--                if (size == 2 && (idx & 2) != 0) {
+             goto undef;
 -                    return 1;
 -                }
 -                break;
 -            case 4:
 -                if ((size == 2) && ((idx & 3) == 3)) {
 -                    return 1;
 -                }
 -                break;
 -            default:
 -                abort();
 -            }
 -            if ((rd + stride * (nregs - 1)) > 31) {
 -                /* Attempts to write off the end of the register file
 -                 * are UNPREDICTABLE; we choose to UNDEF because otherwise
 -                 * the neon_load_reg() would write off the end of the array.
 -                 */
 -                return 1;
 -            }
 -            tmp = tcg_temp_new_i32();
 -            addr = tcg_temp_new_i32();
 -            load_reg_var(s, addr, rn);
 -            for (reg = 0; reg < nregs; reg++) {
 -                if (load) {
 -                    gen_aa32_ld_i32(s, tmp, addr, get_mem_index(s),
 -                                    s->be_data | size);
 -                    neon_store_element(rd, reg_idx, size, tmp);
 -                } else { /* Store */
 -                    neon_load_element(tmp, rd, reg_idx, size);
 -                    gen_aa32_st_i32(s, tmp, addr, get_mem_index(s),
 -                                    s->be_data | size);
 -                }
 -                rd += stride;
 -                tcg_gen_addi_i32(addr, addr, 1 << size);
 -            }
 -            tcg_temp_free_i32(addr);
 -            tcg_temp_free_i32(tmp);
 -            stride = nregs * (1 << size);
 -        }
 -    }
 -    if (rm != 15) {
 -        TCGv_i32 base;
 -
 -        base = load_reg(s, rn);
 -        if (rm == 13) {
 -            tcg_gen_addi_i32(base, base, stride);
 -        } else {
 -            TCGv_i32 index;
 -            index = load_reg(s, rm);
 -            tcg_gen_add_i32(base, base, index);
 -            tcg_temp_free_i32(index);
 -        }
 -        store_reg(s, rn, base);
 -    }
 -    return 0;
 -}
 -
  static inline void gen_neon_narrow(int size, TCGv_i32 dest, TCGv_i64 src)
  {
      switch (size) {
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
              }
              return;
          }
 -        if ((insn & 0x0f100000) == 0x04000000) {
 -            /* NEON load/store.  */
 -            if (disas_neon_ls_insn(s, insn)) {
 -                goto illegal_op;
 -            }
 -            return;
 -        }
          if ((insn & 0x0e000f00) == 0x0c000100) {
              if (arm_dc_feature(s, ARM_FEATURE_IWMMXT)) {
                  /* iWMMXt register transfer.  */
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
          }
          break;
-     case 12:
--        if ((insn & 0x01100000) == 0x01000000) {
--            if (disas_neon_ls_insn(s, insn)) {
--                goto illegal_op;
--            }
--            break;
--        }
-         goto illegal_op;
-     default:
-     illegal_op:
 --
-.20.1
+.34.1

-[PULL 10/39] hw/arm: versal: Move misplaced comment
+[PULL 27/35] hw/misc/mps2-scc: Fix condition for CFG3 register
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+We currently guard the CFG3 register read with
  (scc_partno(s) == 0x524 && scc_partno(s) == 0x547)
 which is clearly wrong as it is never true.
-Move misplaced comment.
+This register is present on all board types except AN524
 and AN527; correct the condition.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Fixes: 6ac80818941829c0 ("hw/misc/mps2-scc: Implement changes for AN547")
 Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Reviewed-by: Luc Michel <luc.michel@greensocs.com>
 Message-id: 20200427181649.26851-3-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20240206132931.38376-6-peter.maydell@linaro.org
 ---
- hw/arm/xlnx-versal.c | 2 +-
+ hw/misc/mps2-scc.c | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/hw/arm/xlnx-versal.c b/hw/arm/xlnx-versal.c
+diff --git a/hw/misc/mps2-scc.c b/hw/misc/mps2-scc.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal.c
+--- a/hw/misc/mps2-scc.c
-+++ b/hw/arm/xlnx-versal.c
++++ b/hw/misc/mps2-scc.c
-@@ -XXX,XX +XXX,XX @@ static void versal_create_apu_cpus(Versal *s)
+@@ -XXX,XX +XXX,XX @@ static uint64_t mps2_scc_read(void *opaque, hwaddr offset, unsigned size)
+         r = s->cfg2;
-         obj = object_new(XLNX_VERSAL_ACPU_TYPE);
+         break;
-         if (!obj) {
+     case A_CFG3:
--            /* Secondary CPUs start in PSCI powered-down state */
+-        if (scc_partno(s) == 0x524 && scc_partno(s) == 0x547) {
-             error_report("Unable to create apu.cpu[%d] of type %s",
++        if (scc_partno(s) == 0x524 || scc_partno(s) == 0x547) {
-                          i, XLNX_VERSAL_ACPU_TYPE);
+             /* CFG3 reserved on AN524 */
-             exit(EXIT_FAILURE);
+             goto bad_offset;
@@ -XXX,XX +XXX,XX @@ static void versal_create_apu_cpus(Versal *s)
          object_property_set_int(obj, s->cfg.psci_conduit,
                                  "psci-conduit", &error_abort);
          if (i) {
 +            /* Secondary CPUs start in PSCI powered-down state */
              object_property_set_bool(obj, true,
                                       "start-powered-off", &error_abort);
          }
 --
-.20.1
+.34.1

-[PULL 06/39] target/arm: Implement ARMv8.2-TTS2UXN
+[PULL 28/35] hw/misc/mps2-scc: Factor out which-board conditionals
-The ARMv8.2-TTS2UXN feature extends the XN field in stage 2
+The MPS SCC device has a lot of different flavours for the various
-translation table descriptors from just bit [54] to bits [54:53],
+different MPS FPGA images, which look mostly similar but have
-allowing stage 2 to control execution permissions separately for EL0
+differences in how particular registers are handled.  Currently we
-and EL1. Implement the new semantics of the XN field and enable
+deal with this with a lot of open-coded checks on scc_partno(), but
-the feature for our 'max' CPU.
+as we add more board types this is getting a bit hard to read.
 Factor out the conditions into some functions which we can
 give more descriptive names to.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200330210400.11724-5-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-7-peter.maydell@linaro.org
 ---
- target/arm/cpu.h    | 15 +++++++++++++++
+ hw/misc/mps2-scc.c | 45 +++++++++++++++++++++++++++++++--------------
- target/arm/cpu.c    |  1 +
+file changed, 31 insertions(+), 14 deletions(-)
  target/arm/cpu64.c  |  2 ++
  target/arm/helper.c | 37 +++++++++++++++++++++++++++++++------
 files changed, 49 insertions(+), 6 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
+diff --git a/hw/misc/mps2-scc.c b/hw/misc/mps2-scc.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/hw/misc/mps2-scc.c
-+++ b/target/arm/cpu.h
++++ b/hw/misc/mps2-scc.c
-@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa32_ccidx(const ARMISARegisters *id)
+@@ -XXX,XX +XXX,XX @@ static int scc_partno(MPS2SCC *s)
-     return FIELD_EX32(id->id_mmfr4, ID_MMFR4, CCIDX) != 0;
+     return extract32(s->id, 4, 8);
  }
-+static inline bool isar_feature_aa32_tts2uxn(const ARMISARegisters *id)
++/* Is CFG_REG2 present? */
 +static bool have_cfg2(MPS2SCC *s)
 +{
-+    return FIELD_EX32(id->id_mmfr4, ID_MMFR4, XNX) != 0;
++    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547;
 +}
 +
- /*
++/* Is CFG_REG3 present? */
-  * 64-bit feature tests via id registers.
++static bool have_cfg3(MPS2SCC *s)
   */
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa64_ccidx(const ARMISARegisters *id)
      return FIELD_EX64(id->id_aa64mmfr2, ID_AA64MMFR2, CCIDX) != 0;
  }
 +static inline bool isar_feature_aa64_tts2uxn(const ARMISARegisters *id)
 +{
-+    return FIELD_EX64(id->id_aa64mmfr1, ID_AA64MMFR1, XNX) != 0;
++    return scc_partno(s) != 0x524 && scc_partno(s) != 0x547;
 +}
 +
- /*
++/* Is CFG_REG5 present? */
-  * Feature tests for "does this exist in either 32-bit or 64-bit?"
++static bool have_cfg5(MPS2SCC *s)
   */
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_any_ccidx(const ARMISARegisters *id)
      return isar_feature_aa64_ccidx(id) || isar_feature_aa32_ccidx(id);
  }
 +static inline bool isar_feature_any_tts2uxn(const ARMISARegisters *id)
 +{
-+    return isar_feature_aa64_tts2uxn(id) || isar_feature_aa32_tts2uxn(id);
++    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547;
 +}
 +
- /*
++/* Is CFG_REG6 present? */
-  * Forward to the above feature tests given an ARMCPU pointer.
++static bool have_cfg6(MPS2SCC *s)
 +{
 +    return scc_partno(s) == 0x524;
 +}
 +
  /* Handle a write via the SYS_CFG channel to the specified function/device.
   * Return false on error (reported to guest via SYS_CFGCTRL ERROR bit).
   */
-diff --git a/target/arm/cpu.c b/target/arm/cpu.c
+@@ -XXX,XX +XXX,XX @@ static uint64_t mps2_scc_read(void *opaque, hwaddr offset, unsigned size)
-index XXXXXXX..XXXXXXX 100644
+         r = s->cfg1;
---- a/target/arm/cpu.c
+         break;
-+++ b/target/arm/cpu.c
+     case A_CFG2:
-@@ -XXX,XX +XXX,XX @@ static void arm_max_initfn(Object *obj)
+-        if (scc_partno(s) != 0x524 && scc_partno(s) != 0x547) {
-             t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
+-            /* CFG2 reserved on other boards */
-             t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
++        if (!have_cfg2(s)) {
-             t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
+             goto bad_offset;
 +            t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
              cpu->isar.id_mmfr4 = t;
          }
- #endif
+         r = s->cfg2;
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+         break;
-index XXXXXXX..XXXXXXX 100644
+     case A_CFG3:
---- a/target/arm/cpu64.c
+-        if (scc_partno(s) == 0x524 || scc_partno(s) == 0x547) {
-+++ b/target/arm/cpu64.c
+-            /* CFG3 reserved on AN524 */
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
++        if (!have_cfg3(s)) {
-         t = FIELD_DP64(t, ID_AA64MMFR1, VH, 1);
+             goto bad_offset;
          t = FIELD_DP64(t, ID_AA64MMFR1, PAN, 2); /* ATS1E1 */
          t = FIELD_DP64(t, ID_AA64MMFR1, VMIDBITS, 2); /* VMID16 */
 +        t = FIELD_DP64(t, ID_AA64MMFR1, XNX, 1); /* TTS2UXN */
          cpu->isar.id_aa64mmfr1 = t;
          t = cpu->isar.id_aa64mmfr2;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
          u = FIELD_DP32(u, ID_MMFR4, HPDS, 1); /* AA32HPD */
          u = FIELD_DP32(u, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
          u = FIELD_DP32(u, ID_MMFR4, CNP, 1); /* TTCNP */
 +        u = FIELD_DP32(u, ID_MMFR4, XNX, 1); /* TTS2UXN */
          cpu->isar.id_mmfr4 = u;
          u = cpu->isar.id_aa64dfr0;
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ simple_ap_to_rw_prot(CPUARMState *env, ARMMMUIdx mmu_idx, int ap)
   *
   * @env:     CPUARMState
   * @s2ap:    The 2-bit stage2 access permissions (S2AP)
 - * @xn:      XN (execute-never) bit
 + * @xn:      XN (execute-never) bits
 + * @s1_is_el0: true if this is S2 of an S1+2 walk for EL0
   */
 -static int get_S2prot(CPUARMState *env, int s2ap, int xn)
 +static int get_S2prot(CPUARMState *env, int s2ap, int xn, bool s1_is_el0)
  {
      int prot = 0;
@@ -XXX,XX +XXX,XX @@ static int get_S2prot(CPUARMState *env, int s2ap, int xn)
      if (s2ap & 2) {
          prot |= PAGE_WRITE;
      }
 -    if (!xn) {
 -        if (arm_el_is_aa64(env, 2) || prot & PAGE_READ) {
 +
 +    if (cpu_isar_feature(any_tts2uxn, env_archcpu(env))) {
 +        switch (xn) {
 +        case 0:
              prot |= PAGE_EXEC;
 +            break;
 +        case 1:
 +            if (s1_is_el0) {
 +                prot |= PAGE_EXEC;
 +            }
 +            break;
 +        case 2:
 +            break;
 +        case 3:
 +            if (!s1_is_el0) {
 +                prot |= PAGE_EXEC;
 +            }
 +            break;
 +        default:
 +            g_assert_not_reached();
 +        }
 +    } else {
 +        if (!extract32(xn, 1, 1)) {
 +            if (arm_el_is_aa64(env, 2) || prot & PAGE_READ) {
 +                prot |= PAGE_EXEC;
 +            }
          }
-     }
+         /* These are user-settable DIP switches on the board. We don't
-     return prot;
+@@ -XXX,XX +XXX,XX @@ static uint64_t mps2_scc_read(void *opaque, hwaddr offset, unsigned size)
-@@ -XXX,XX +XXX,XX @@ static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
+         r = s->cfg4;
-     }
+         break;
+     case A_CFG5:
-     ap = extract32(attrs, 4, 2);
+-        if (scc_partno(s) != 0x524 && scc_partno(s) != 0x547) {
--    xn = extract32(attrs, 12, 1);
+-            /* CFG5 reserved on other boards */
++        if (!have_cfg5(s)) {
-     if (mmu_idx == ARMMMUIdx_Stage2) {
+             goto bad_offset;
-         ns = true;
+         }
--        *prot = get_S2prot(env, ap, xn);
+         r = s->cfg5;
-+        xn = extract32(attrs, 11, 2);
+         break;
-+        *prot = get_S2prot(env, ap, xn, s1_is_el0);
+     case A_CFG6:
-     } else {
+-        if (scc_partno(s) != 0x524) {
-         ns = extract32(attrs, 3, 1);
+-            /* CFG6 reserved on other boards */
-+        xn = extract32(attrs, 12, 1);
++        if (!have_cfg6(s)) {
-         pxn = extract32(attrs, 11, 1);
+             goto bad_offset;
-         *prot = get_S1prot(env, mmu_idx, aarch64, ap, ns, xn, pxn);
+         }
-     }
+         r = s->cfg6;
@@ -XXX,XX +XXX,XX @@ static void mps2_scc_write(void *opaque, hwaddr offset, uint64_t value,
          }
          break;
      case A_CFG2:
 -        if (scc_partno(s) != 0x524 && scc_partno(s) != 0x547) {
 -            /* CFG2 reserved on other boards */
 +        if (!have_cfg2(s)) {
              goto bad_offset;
          }
          /* AN524: QSPI Select signal */
          s->cfg2 = value;
          break;
      case A_CFG5:
 -        if (scc_partno(s) != 0x524 && scc_partno(s) != 0x547) {
 -            /* CFG5 reserved on other boards */
 +        if (!have_cfg5(s)) {
              goto bad_offset;
          }
          /* AN524: ACLK frequency in Hz */
          s->cfg5 = value;
          break;
      case A_CFG6:
 -        if (scc_partno(s) != 0x524) {
 -            /* CFG6 reserved on other boards */
 +        if (!have_cfg6(s)) {
              goto bad_offset;
          }
          /* AN524: Clock divider for BRAM */
 --
-.20.1
+.34.1

-[PULL 07/39] target/arm: Use correct variable for setting 'max' cpu's ID_AA64DFR0
+Deleted patch
-In aarch64_max_initfn() we update both 32-bit and 64-bit ID
-registers.  The intended pattern is that for 64-bit ID registers we
-use FIELD_DP64 and the uint64_t 't' register, while 32-bit ID
-registers use FIELD_DP32 and the uint32_t 'u' register.  For
-ID_AA64DFR0 we accidentally used 'u', meaning that the top 32 bits of
-this 64-bit ID register would end up always zero.  Luckily at the
-moment that's what they should be anyway, so this bug has no visible
-effects.
-Use the right-sized variable.
-Fixes: 3bec78447a958d481991
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Message-id: 20200423110915.10527-1-peter.maydell@linaro.org
----
- target/arm/cpu64.c | 6 +++---
-file changed, 3 insertions(+), 3 deletions(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
-+++ b/target/arm/cpu64.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
-         u = FIELD_DP32(u, ID_MMFR4, XNX, 1); /* TTS2UXN */
-         cpu->isar.id_mmfr4 = u;
--        u = cpu->isar.id_aa64dfr0;
--        u = FIELD_DP64(u, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
--        cpu->isar.id_aa64dfr0 = u;
-+        t = cpu->isar.id_aa64dfr0;
-+        t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
-+        cpu->isar.id_aa64dfr0 = t;
-         u = cpu->isar.id_dfr0;
-         u = FIELD_DP32(u, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
---
-.20.1

-[PULL 27/39] target/arm: Convert VCMLA (scalar) to decodetree
+[PULL 29/35] hw/misc/mps2-scc: Make changes needed for AN536 FPGA image
-Convert VCMLA (scalar) in the 2reg-scalar-ext group to decodetree.
+The MPS2 SCC device is broadly the same for all FPGA images, but has
 minor differences in the behaviour of the CFG registers depending on
 the image. In many cases we don't really care about the functionality
 controlled by these registers and a reads-as-written or similar
 behaviour is sufficient for the moment.
 For the AN536 the required behaviour is:
  * A_CFG0 has CPU reset and halt bits
     - implement as reads-as-written for the moment
  * A_CFG1 has flash or ATCM address 0 remap handling
     - QEMU doesn't model this; implement as reads-as-written
  * A_CFG2 has QSPI select (like AN524)
     - implemented (no behaviour, as with AN524)
  * A_CFG3 is MCC_MSB_ADDR "additional MCC addressing bits"
     - QEMU doesn't care about these, so use the existing
       RAZ behaviour for convenience
  * A_CFG4 is board rev (like all other images)
     - no change needed
  * A_CFG5 is ACLK frq in hz (like AN524)
     - implemented as reads-as-written, as for other boards
  * A_CFG6 is core 0 vector table base address
     - implemented as reads-as-written for the moment
  * A_CFG7 is core 1 vector table base address
     - implemented as reads-as-written for the moment
 Make the changes necessary for this; leave TODO comments where
 appropriate to indicate where we might want to come back and
 implement things like CPU reset.
 The other aspects of the device specific to this FPGA image (like the
 values of the board ID and similar registers) will be set via the
 device's qdev properties.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20200430181003.21682-9-peter.maydell@linaro.org
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Message-id: 20240206132931.38376-8-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |  5 +++++
+ include/hw/misc/mps2-scc.h |   1 +
- target/arm/translate-neon.inc.c | 40 +++++++++++++++++++++++++++++++++
+ hw/misc/mps2-scc.c         | 101 +++++++++++++++++++++++++++++++++----
- target/arm/translate.c          | 26 +--------------------
+files changed, 92 insertions(+), 10 deletions(-)
-files changed, 46 insertions(+), 25 deletions(-)
+diff --git a/include/hw/misc/mps2-scc.h b/include/hw/misc/mps2-scc.h
 diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/include/hw/misc/mps2-scc.h
-+++ b/target/arm/neon-shared.decode
++++ b/include/hw/misc/mps2-scc.h
-@@ -XXX,XX +XXX,XX @@ VFML           1111 110 0 s:1 . 10 .... .... 1000 . 0 . 1 .... \
+@@ -XXX,XX +XXX,XX @@ struct MPS2SCC {
-                vm=%vm_sp vn=%vn_sp vd=%vd_dp q=0
+     uint32_t cfg4;
- VFML           1111 110 0 s:1 . 10 .... .... 1000 . 1 . 1 .... \
+     uint32_t cfg5;
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp q=1
+     uint32_t cfg6;
-+
++    uint32_t cfg7;
-+VCMLA_scalar   1111 1110 0 . rot:2 .... .... 1000 . q:1 index:1 0 vm:4 \
+     uint32_t cfgdata_rtn;
-+               vn=%vn_dp vd=%vd_dp size=0
+     uint32_t cfgdata_out;
-+VCMLA_scalar   1111 1110 1 . rot:2 .... .... 1000 . q:1 . 0 .... \
+     uint32_t cfgctrl;
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp size=1 index=0
+diff --git a/hw/misc/mps2-scc.c b/hw/misc/mps2-scc.c
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-neon.inc.c
+--- a/hw/misc/mps2-scc.c
-+++ b/target/arm/translate-neon.inc.c
++++ b/hw/misc/mps2-scc.c
-@@ -XXX,XX +XXX,XX @@ static bool trans_VFML(DisasContext *s, arg_VFML *a)
+@@ -XXX,XX +XXX,XX @@ REG32(CFG3, 0xc)
-                        gen_helper_gvec_fmlal_a32);
+ REG32(CFG4, 0x10)
-     return true;
+ REG32(CFG5, 0x14)
- }
+ REG32(CFG6, 0x18)
-+
++REG32(CFG7, 0x1c)
-+static bool trans_VCMLA_scalar(DisasContext *s, arg_VCMLA_scalar *a)
+ REG32(CFGDATA_RTN, 0xa0)
-+{
+ REG32(CFGDATA_OUT, 0xa4)
-+    gen_helper_gvec_3_ptr *fn_gvec_ptr;
+ REG32(CFGCTRL, 0xa8)
-+    int opr_sz;
+@@ -XXX,XX +XXX,XX @@ static int scc_partno(MPS2SCC *s)
-+    TCGv_ptr fpst;
+ /* Is CFG_REG2 present? */
-+
+ static bool have_cfg2(MPS2SCC *s)
-+    if (!dc_isar_feature(aa32_vcma, s)) {
+ {
-+        return false;
+-    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547;
 +    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547 ||
 +        scc_partno(s) == 0x536;
  }
  /* Is CFG_REG3 present? */
  static bool have_cfg3(MPS2SCC *s)
  {
 -    return scc_partno(s) != 0x524 && scc_partno(s) != 0x547;
 +    return scc_partno(s) != 0x524 && scc_partno(s) != 0x547 &&
 +        scc_partno(s) != 0x536;
  }
  /* Is CFG_REG5 present? */
  static bool have_cfg5(MPS2SCC *s)
  {
 -    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547;
 +    return scc_partno(s) == 0x524 || scc_partno(s) == 0x547 ||
 +        scc_partno(s) == 0x536;
  }
  /* Is CFG_REG6 present? */
  static bool have_cfg6(MPS2SCC *s)
  {
 -    return scc_partno(s) == 0x524;
 +    return scc_partno(s) == 0x524 || scc_partno(s) == 0x536;
 +}
 +
 +/* Is CFG_REG7 present? */
 +static bool have_cfg7(MPS2SCC *s)
 +{
 +    return scc_partno(s) == 0x536;
 +}
 +
 +/* Does CFG_REG0 drive the 'remap' GPIO output? */
 +static bool cfg0_is_remap(MPS2SCC *s)
 +{
 +    return scc_partno(s) != 0x536;
 +}
 +
 +/* Is CFG_REG1 driving a set of LEDs? */
 +static bool cfg1_is_leds(MPS2SCC *s)
 +{
 +    return scc_partno(s) != 0x536;
  }
  /* Handle a write via the SYS_CFG channel to the specified function/device.
@@ -XXX,XX +XXX,XX @@ static uint64_t mps2_scc_read(void *opaque, hwaddr offset, unsigned size)
          if (!have_cfg3(s)) {
              goto bad_offset;
          }
 -        /* These are user-settable DIP switches on the board. We don't
 +        /*
 +         * These are user-settable DIP switches on the board. We don't
           * model that, so just return zeroes.
 +         *
 +         * TODO: for AN536 this is MCC_MSB_ADDR "additional MCC addressing
 +         * bits". These change which part of the DDR4 the motherboard
 +         * configuration controller can see in its memory map (see the
 +         * appnote section 2.4). QEMU doesn't model the MCC at all, so these
 +         * bits are not interesting to us; read-as-zero is as good as anything
 +         * else.
           */
          r = 0;
          break;
@@ -XXX,XX +XXX,XX @@ static uint64_t mps2_scc_read(void *opaque, hwaddr offset, unsigned size)
          }
          r = s->cfg6;
          break;
 +    case A_CFG7:
 +        if (!have_cfg7(s)) {
 +            goto bad_offset;
 +        }
 +        r = s->cfg7;
 +        break;
      case A_CFGDATA_RTN:
          r = s->cfgdata_rtn;
          break;
@@ -XXX,XX +XXX,XX @@ static void mps2_scc_write(void *opaque, hwaddr offset, uint64_t value,
           * we always reflect bit 0 in the 'remap' GPIO output line,
           * and let the board wire it up or not as it chooses.
           * TODO on some boards bit 1 is CPU_WAIT.
 +         *
 +         * TODO: on the AN536 this register controls reset and halt
 +         * for both CPUs. For the moment we don't implement this, so the
 +         * register just reads as written.
           */
          s->cfg0 = value;
 -        qemu_set_irq(s->remap, s->cfg0 & 1);
 +        if (cfg0_is_remap(s)) {
 +            qemu_set_irq(s->remap, s->cfg0 & 1);
 +        }
          break;
      case A_CFG1:
          s->cfg1 = value;
 -        for (size_t i = 0; i < ARRAY_SIZE(s->led); i++) {
 -            led_set_state(s->led[i], extract32(value, i, 1));
 +        /*
 +         * On most boards this register drives LEDs.
 +         *
 +         * TODO: for AN536 this controls whether flash and ATCM are
 +         * enabled or disabled on reset. QEMU doesn't model this, and
 +         * always wires up RAM in the ATCM area and ROM in the flash area.
 +         */
 +        if (cfg1_is_leds(s)) {
 +            for (size_t i = 0; i < ARRAY_SIZE(s->led); i++) {
 +                led_set_state(s->led[i], extract32(value, i, 1));
 +            }
          }
          break;
      case A_CFG2:
          if (!have_cfg2(s)) {
              goto bad_offset;
          }
 -        /* AN524: QSPI Select signal */
 +        /* AN524, AN536: QSPI Select signal */
          s->cfg2 = value;
          break;
      case A_CFG5:
          if (!have_cfg5(s)) {
              goto bad_offset;
          }
 -        /* AN524: ACLK frequency in Hz */
 +        /* AN524, AN536: ACLK frequency in Hz */
          s->cfg5 = value;
          break;
      case A_CFG6:
@@ -XXX,XX +XXX,XX @@ static void mps2_scc_write(void *opaque, hwaddr offset, uint64_t value,
              goto bad_offset;
          }
          /* AN524: Clock divider for BRAM */
 +        /* AN536: Core 0 vector table base address */
 +        s->cfg6 = value;
 +        break;
 +    case A_CFG7:
 +        if (!have_cfg7(s)) {
 +            goto bad_offset;
 +        }
 +        /* AN536: Core 1 vector table base address */
          s->cfg6 = value;
          break;
      case A_CFGDATA_OUT:
@@ -XXX,XX +XXX,XX @@ static void mps2_scc_finalize(Object *obj)
      g_free(s->oscclk_reset);
  }
 +static bool cfg7_needed(void *opaque)
 +{
 +    MPS2SCC *s = opaque;
 +
 +    return have_cfg7(s);
 +}
 +
 +static const VMStateDescription vmstate_cfg7 = {
 +    .name = "mps2-scc/cfg7",
 +    .version_id = 1,
 +    .minimum_version_id = 1,
 +    .needed = cfg7_needed,
 +    .fields = (const VMStateField[]) {
 +        VMSTATE_UINT32(cfg7, MPS2SCC),
 +        VMSTATE_END_OF_LIST()
 +    }
-+    if (a->size == 0 && !dc_isar_feature(aa32_fp16_arith, s)) {
++};
-+        return false;
++
-+    }
+ static const VMStateDescription mps2_scc_vmstate = {
-+
+     .name = "mps2-scc",
-+    /* UNDEF accesses to D16-D31 if they don't exist. */
+     .version_id = 3,
-+    if (!dc_isar_feature(aa32_simd_r32, s) &&
+@@ -XXX,XX +XXX,XX @@ static const VMStateDescription mps2_scc_vmstate = {
-+        ((a->vd | a->vn | a->vm) & 0x10)) {
+         VMSTATE_VARRAY_UINT32(oscclk, MPS2SCC, num_oscclk,
-+        return false;
+, vmstate_info_uint32, uint32_t),
-+    }
+         VMSTATE_END_OF_LIST()
-+
++    },
-+    if ((a->vd | a->vn) & a->q) {
++    .subsections = (const VMStateDescription * const []) {
-+        return false;
++        &vmstate_cfg7,
-+    }
++        NULL
-+
+     }
-+    if (!vfp_access_check(s)) {
+ };
 +        return true;
 +    }
 +
 +    fn_gvec_ptr = (a->size ? gen_helper_gvec_fcmlas_idx
 +                   : gen_helper_gvec_fcmlah_idx);
 +    opr_sz = (1 + a->q) * 8;
 +    fpst = get_fpstatus_ptr(1);
 +    tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(1, a->vn),
 +                       vfp_reg_offset(1, a->vm),
 +                       fpst, opr_sz, opr_sz,
 +                       (a->index << 2) | a->rot, fn_gvec_ptr);
 +    tcg_temp_free_ptr(fpst);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_insn_2reg_scalar_ext(DisasContext *s, uint32_t insn)
      bool is_long = false, q = extract32(insn, 6, 1);
      bool ptr_is_env = false;
 -    if ((insn & 0xff000f10) == 0xfe000800) {
 -        /* VCMLA (indexed) -- 1111 1110 S.RR .... .... 1000 ...0 .... */
 -        int rot = extract32(insn, 20, 2);
 -        int size = extract32(insn, 23, 1);
 -        int index;
 -
 -        if (!dc_isar_feature(aa32_vcma, s)) {
 -            return 1;
 -        }
 -        if (size == 0) {
 -            if (!dc_isar_feature(aa32_fp16_arith, s)) {
 -                return 1;
 -            }
 -            /* For fp16, rm is just Vm, and index is M.  */
 -            rm = extract32(insn, 0, 4);
 -            index = extract32(insn, 5, 1);
 -        } else {
 -            /* For fp32, rm is the usual M:Vm, and index is 0.  */
 -            VFP_DREG_M(rm, insn);
 -            index = 0;
 -        }
 -        data = (index << 2) | rot;
 -        fn_gvec_ptr = (size ? gen_helper_gvec_fcmlas_idx
 -                       : gen_helper_gvec_fcmlah_idx);
 -    } else if ((insn & 0xffb00f00) == 0xfe200d00) {
 +    if ((insn & 0xffb00f00) == 0xfe200d00) {
          /* V[US]DOT -- 1111 1110 0.10 .... .... 1101 .Q.U .... */
          int u = extract32(insn, 4, 1);
 --
-.20.1
+.34.1

-[PULL 22/39] target/arm: Add stubs for AArch32 Neon decodetree
+[PULL 30/35] hw/arm/mps3r: Initial skeleton for mps3-an536 board
-Add the infrastructure for building and invoking a decodetree decoder
+The AN536 is another FPGA image for the MPS3 development board. Unlike
-for the AArch32 Neon encodings.  At the moment the new decoder covers
+the existing FPGA images we already model, this board uses a Cortex-R
-nothing, so we always fall back to the existing hand-written decode.
+family CPU, and it does not use any equivalent to the M-profile
+"Subsystem for Embedded" SoC-equivalent that we model in hw/arm/armsse.c.
-We follow the same pattern we did for the VFP decodetree conversion
+It's therefore more convenient for us to model it as a completely
-(commit 78e138bc1f672c145ef6ace74617d and following): code that deals
+separate C file.
-with Neon will be moving gradually out to translate-neon.vfp.inc,
-which we #include into translate.c.
+This commit adds the basic skeleton of the board model, and the
+code to create all the RAM and ROM. We assume that we're probably
-In order to share the decode files between A32 and T32, we
+going to want to add more images in future, so use the same
-split Neon into 3 parts:
+base class/subclass setup that mps2-tz.c uses, even though at
- * data-processing
+the moment there's only a single subclass.
- * load-store
- * 'shared' encodings
+Following commits will add the CPUs and the peripherals.
 The first two groups of instructions have similar but not identical
 A32 and T32 encodings, so we need to manually transform the T32
 encoding into the A32 one before calling the decoder; the third group
 covers the Neon instructions which are identical in A32 and T32.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-4-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-9-peter.maydell@linaro.org
 ---
- target/arm/neon-dp.decode       | 29 ++++++++++++++++++++++++++
+ MAINTAINERS                             |   3 +-
- target/arm/neon-ls.decode       | 29 ++++++++++++++++++++++++++
+ configs/devices/arm-softmmu/default.mak |   1 +
- target/arm/neon-shared.decode   | 27 +++++++++++++++++++++++++
+ hw/arm/mps3r.c                          | 239 ++++++++++++++++++++++++
- target/arm/translate-neon.inc.c | 32 +++++++++++++++++++++++++++++
+ hw/arm/Kconfig                          |   5 +
- target/arm/translate.c          | 36 +++++++++++++++++++++++++++++++--
+ hw/arm/meson.build                      |   1 +
- target/arm/Makefile.objs        | 18 +++++++++++++++++
+files changed, 248 insertions(+), 1 deletion(-)
-files changed, 169 insertions(+), 2 deletions(-)
+ create mode 100644 hw/arm/mps3r.c
- create mode 100644 target/arm/neon-dp.decode
- create mode 100644 target/arm/neon-ls.decode
+diff --git a/MAINTAINERS b/MAINTAINERS
- create mode 100644 target/arm/neon-shared.decode
+index XXXXXXX..XXXXXXX 100644
- create mode 100644 target/arm/translate-neon.inc.c
+--- a/MAINTAINERS
++++ b/MAINTAINERS
-diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
+@@ -XXX,XX +XXX,XX @@ F: include/hw/misc/imx7_*.h
  F: hw/pci-host/designware.c
  F: include/hw/pci-host/designware.h
 -MPS2
 +MPS2 / MPS3
  M: Peter Maydell <peter.maydell@linaro.org>
  L: qemu-arm@nongnu.org
  S: Maintained
  F: hw/arm/mps2.c
  F: hw/arm/mps2-tz.c
 +F: hw/arm/mps3r.c
  F: hw/misc/mps2-*.c
  F: include/hw/misc/mps2-*.h
  F: hw/arm/armsse.c
 diff --git a/configs/devices/arm-softmmu/default.mak b/configs/devices/arm-softmmu/default.mak
 index XXXXXXX..XXXXXXX 100644
 --- a/configs/devices/arm-softmmu/default.mak
 +++ b/configs/devices/arm-softmmu/default.mak
@@ -XXX,XX +XXX,XX @@ CONFIG_ARM_VIRT=y
  # CONFIG_INTEGRATOR=n
  # CONFIG_FSL_IMX31=n
  # CONFIG_MUSICPAL=n
 +# CONFIG_MPS3R=n
  # CONFIG_MUSCA=n
  # CONFIG_CHEETAH=n
  # CONFIG_SX1=n
 diff --git a/hw/arm/mps3r.c b/hw/arm/mps3r.c
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
-+++ b/target/arm/neon-dp.decode
++++ b/hw/arm/mps3r.c
@@ -XXX,XX +XXX,XX @@
 +# AArch32 Neon data-processing instruction descriptions
 +#
 +#  Copyright (c) 2020 Linaro, Ltd
 +#
 +# This library is free software; you can redistribute it and/or
 +# modify it under the terms of the GNU Lesser General Public
 +# License as published by the Free Software Foundation; either
 +# version 2 of the License, or (at your option) any later version.
 +#
 +# This library is distributed in the hope that it will be useful,
 +# but WITHOUT ANY WARRANTY; without even the implied warranty of
 +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
 +# Lesser General Public License for more details.
 +#
 +# You should have received a copy of the GNU Lesser General Public
 +# License along with this library; if not, see <http://www.gnu.org/licenses/>.
 +
 +#
 +# This file is processed by scripts/decodetree.py
 +#
 +
 +# Encodings for Neon data processing instructions where the T32 encoding
 +# is a simple transformation of the A32 encoding.
 +# More specifically, this file covers instructions where the A32 encoding is
 +#   0b1111_001p_qqqq_qqqq_qqqq_qqqq_qqqq_qqqq
 +# and the T32 encoding is
 +#   0b111p_1111_qqqq_qqqq_qqqq_qqqq_qqqq_qqqq
 +# This file works on the A32 encoding only; calling code for T32 has to
 +# transform the insn into the A32 version first.
 diff --git a/target/arm/neon-ls.decode b/target/arm/neon-ls.decode
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/target/arm/neon-ls.decode
@@ -XXX,XX +XXX,XX @@
 +# AArch32 Neon load/store instruction descriptions
 +#
 +#  Copyright (c) 2020 Linaro, Ltd
 +#
 +# This library is free software; you can redistribute it and/or
 +# modify it under the terms of the GNU Lesser General Public
 +# License as published by the Free Software Foundation; either
 +# version 2 of the License, or (at your option) any later version.
 +#
 +# This library is distributed in the hope that it will be useful,
 +# but WITHOUT ANY WARRANTY; without even the implied warranty of
 +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
 +# Lesser General Public License for more details.
 +#
 +# You should have received a copy of the GNU Lesser General Public
 +# License along with this library; if not, see <http://www.gnu.org/licenses/>.
 +
 +#
 +# This file is processed by scripts/decodetree.py
 +#
 +
 +# Encodings for Neon load/store instructions where the T32 encoding
 +# is a simple transformation of the A32 encoding.
 +# More specifically, this file covers instructions where the A32 encoding is
 +#   0b1111_0100_xxx0_xxxx_xxxx_xxxx_xxxx_xxxx
 +# and the T32 encoding is
 +#   0b1111_1001_xxx0_xxxx_xxxx_xxxx_xxxx_xxxx
 +# This file works on the A32 encoding only; calling code for T32 has to
 +# transform the insn into the A32 version first.
 diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/target/arm/neon-shared.decode
@@ -XXX,XX +XXX,XX @@
 +# AArch32 Neon instruction descriptions
 +#
 +#  Copyright (c) 2020 Linaro, Ltd
 +#
 +# This library is free software; you can redistribute it and/or
 +# modify it under the terms of the GNU Lesser General Public
 +# License as published by the Free Software Foundation; either
 +# version 2 of the License, or (at your option) any later version.
 +#
 +# This library is distributed in the hope that it will be useful,
 +# but WITHOUT ANY WARRANTY; without even the implied warranty of
 +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
 +# Lesser General Public License for more details.
 +#
 +# You should have received a copy of the GNU Lesser General Public
 +# License along with this library; if not, see <http://www.gnu.org/licenses/>.
 +
 +#
 +# This file is processed by scripts/decodetree.py
 +#
 +
 +# Encodings for Neon instructions whose encoding is the same for
 +# both A32 and T32.
 +
 +# More specifically, this covers:
 +# 2reg scalar ext: 0b1111_1110_xxxx_xxxx_xxxx_1x0x_xxxx_xxxx
 +# 3same ext:       0b1111_110x_xxxx_xxxx_xxxx_1x0x_xxxx_xxxx
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/target/arm/translate-neon.inc.c
 @@ -XXX,XX +XXX,XX @@
 +/*
-+ *  ARM translation: AArch32 Neon instructions
++ * Arm MPS3 board emulation for Cortex-R-based FPGA images.
 + * (For M-profile images see mps2.c and mps2tz.c.)
 + *
-+ *  Copyright (c) 2003 Fabrice Bellard
++ * Copyright (c) 2017 Linaro Limited
-+ *  Copyright (c) 2005-2007 CodeSourcery
++ * Written by Peter Maydell
 + *  Copyright (c) 2007 OpenedHand, Ltd.
 + *  Copyright (c) 2020 Linaro, Ltd.
 + *
-+ * This library is free software; you can redistribute it and/or
++ *  This program is free software; you can redistribute it and/or modify
-+ * modify it under the terms of the GNU Lesser General Public
++ *  it under the terms of the GNU General Public License version 2 or
-+ * License as published by the Free Software Foundation; either
++ *  (at your option) any later version.
-+ * version 2 of the License, or (at your option) any later version.
++ */
 +
 +/*
 + * The MPS3 is an FPGA based dev board. This file handles FPGA images
 + * which use the Cortex-R CPUs. We model these separately from the
 + * M-profile images, because on M-profile the FPGA image is based on
 + * a "Subsystem for Embedded" which is similar to an SoC, whereas
 + * the R-profile FPGA images don't have that abstraction layer.
 + *
-+ * This library is distributed in the hope that it will be useful,
++ * We model the following FPGA images here:
-+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
++ *  "mps3-an536" -- dual Cortex-R52 as documented in Arm Application Note AN536
 + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
 + * Lesser General Public License for more details.
 + *
-+ * You should have received a copy of the GNU Lesser General Public
++ * Application Note AN536:
-+ * License along with this library; if not, see <http://www.gnu.org/licenses/>.
++ * https://developer.arm.com/documentation/dai0536/latest/
 + */
 +
++#include "qemu/osdep.h"
++#include "qemu/units.h"
++#include "qapi/error.h"
++#include "exec/address-spaces.h"
++#include "cpu.h"
++#include "hw/boards.h"
++#include "hw/arm/boot.h"
++
++/* Define the layout of RAM and ROM in a board */
++typedef struct RAMInfo {
++    const char *name;
++    hwaddr base;
++    hwaddr size;
++    int mrindex; /* index into rams[]; -1 for the system RAM block */
++    int flags;
++} RAMInfo;
++
 +/*
-+ * This file is intended to be included from translate.c; it uses
++ * The MPS3 DDR is 3GiB, but on a 32-bit host QEMU doesn't permit
-+ * some macros and definitions provided by that file.
++ * emulation of that much guest RAM, so artificially make it smaller.
 + * It might be possible to convert it to a standalone .c file eventually.
 + */
-+
++#if HOST_LONG_BITS == 32
-+/* Include the generated Neon decoder */
++#define MPS3_DDR_SIZE (1 * GiB)
-+#include "decode-neon-dp.inc.c"
++#else
-+#include "decode-neon-ls.inc.c"
++#define MPS3_DDR_SIZE (3 * GiB)
-+#include "decode-neon-shared.inc.c"
++#endif
-diff --git a/target/arm/translate.c b/target/arm/translate.c
++
-index XXXXXXX..XXXXXXX 100644
++/*
---- a/target/arm/translate.c
++ * Flag values:
-+++ b/target/arm/translate.c
++ * IS_MAIN: this is the main machine RAM
-@@ -XXX,XX +XXX,XX @@ static TCGv_ptr vfp_reg_ptr(bool dp, int reg)
++ * IS_ROM: this area is read-only
++ */
- #define ARM_CP_RW_BIT   (1 << 20)
++#define IS_MAIN 1
++#define IS_ROM 2
--/* Include the VFP decoder */
++
-+/* Include the VFP and Neon decoders */
++#define MPS3R_RAM_MAX 9
- #include "translate-vfp.inc.c"
++
-+#include "translate-neon.inc.c"
++typedef enum MPS3RFPGAType {
++    FPGA_AN536,
- static inline void iwmmxt_load_reg(TCGv_i64 var, int reg)
++} MPS3RFPGAType;
- {
++
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
++struct MPS3RMachineClass {
-         /* Unconditional instructions.  */
++    MachineClass parent;
-         /* TODO: Perhaps merge these into one decodetree output file.  */
++    MPS3RFPGAType fpga_type;
-         if (disas_a32_uncond(s, insn) ||
++    const RAMInfo *raminfo;
--            disas_vfp_uncond(s, insn)) {
++};
-+            disas_vfp_uncond(s, insn) ||
++
-+            disas_neon_dp(s, insn) ||
++struct MPS3RMachineState {
-+            disas_neon_ls(s, insn) ||
++    MachineState parent;
-+            disas_neon_shared(s, insn)) {
++    MemoryRegion ram[MPS3R_RAM_MAX];
-             return;
++};
-         }
++
-         /* fall back to legacy decoder */
++#define TYPE_MPS3R_MACHINE "mps3r"
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
++#define TYPE_MPS3R_AN536_MACHINE MACHINE_TYPE_NAME("mps3-an536")
-         ARCH(6T2);
++
-     }
++OBJECT_DECLARE_TYPE(MPS3RMachineState, MPS3RMachineClass, MPS3R_MACHINE)
++
-+    if ((insn & 0xef000000) == 0xef000000) {
++static const RAMInfo an536_raminfo[] = {
-+        /*
++    {
-+         * T32 encodings 0b111p_1111_qqqq_qqqq_qqqq_qqqq_qqqq_qqqq
++        .name = "ATCM",
-+         * transform into
++        .base = 0x00000000,
-+         * A32 encodings 0b1111_001p_qqqq_qqqq_qqqq_qqqq_qqqq_qqqq
++        .size = 0x00008000,
-+         */
++        .mrindex = 0,
-+        uint32_t a32_insn = (insn & 0xe2ffffff) |
++    }, {
-+            ((insn & (1 << 28)) >> 4) | (1 << 28);
++        /* We model the QSPI flash as simple ROM for now */
-+
++        .name = "QSPI",
-+        if (disas_neon_dp(s, a32_insn)) {
++        .base = 0x08000000,
 +        .size = 0x00800000,
 +        .flags = IS_ROM,
 +        .mrindex = 1,
 +    }, {
 +        .name = "BRAM",
 +        .base = 0x10000000,
 +        .size = 0x00080000,
 +        .mrindex = 2,
 +    }, {
 +        .name = "DDR",
 +        .base = 0x20000000,
 +        .size = MPS3_DDR_SIZE,
 +        .mrindex = -1,
 +    }, {
 +        .name = "ATCM0",
 +        .base = 0xee000000,
 +        .size = 0x00008000,
 +        .mrindex = 3,
 +    }, {
 +        .name = "BTCM0",
 +        .base = 0xee100000,
 +        .size = 0x00008000,
 +        .mrindex = 4,
 +    }, {
 +        .name = "CTCM0",
 +        .base = 0xee200000,
 +        .size = 0x00008000,
 +        .mrindex = 5,
 +    }, {
 +        .name = "ATCM1",
 +        .base = 0xee400000,
 +        .size = 0x00008000,
 +        .mrindex = 6,
 +    }, {
 +        .name = "BTCM1",
 +        .base = 0xee500000,
 +        .size = 0x00008000,
 +        .mrindex = 7,
 +    }, {
 +        .name = "CTCM1",
 +        .base = 0xee600000,
 +        .size = 0x00008000,
 +        .mrindex = 8,
 +    }, {
 +        .name = NULL,
 +    }
 +};
 +
 +static MemoryRegion *mr_for_raminfo(MPS3RMachineState *mms,
 +                                    const RAMInfo *raminfo)
 +{
 +    /* Return an initialized MemoryRegion for the RAMInfo. */
 +    MemoryRegion *ram;
 +
 +    if (raminfo->mrindex < 0) {
 +        /* Means this RAMInfo is for QEMU's "system memory" */
 +        MachineState *machine = MACHINE(mms);
 +        assert(!(raminfo->flags & IS_ROM));
 +        return machine->ram;
 +    }
 +
 +    assert(raminfo->mrindex < MPS3R_RAM_MAX);
 +    ram = &mms->ram[raminfo->mrindex];
 +
 +    memory_region_init_ram(ram, NULL, raminfo->name,
 +                           raminfo->size, &error_fatal);
 +    if (raminfo->flags & IS_ROM) {
 +        memory_region_set_readonly(ram, true);
 +    }
 +    return ram;
 +}
 +
 +static void mps3r_common_init(MachineState *machine)
 +{
 +    MPS3RMachineState *mms = MPS3R_MACHINE(machine);
 +    MPS3RMachineClass *mmc = MPS3R_MACHINE_GET_CLASS(mms);
 +    MemoryRegion *sysmem = get_system_memory();
 +
 +    for (const RAMInfo *ri = mmc->raminfo; ri->name; ri++) {
 +        MemoryRegion *mr = mr_for_raminfo(mms, ri);
 +        memory_region_add_subregion(sysmem, ri->base, mr);
 +    }
 +}
 +
 +static void mps3r_set_default_ram_info(MPS3RMachineClass *mmc)
 +{
 +    /*
 +     * Set mc->default_ram_size and default_ram_id from the
 +     * information in mmc->raminfo.
 +     */
 +    MachineClass *mc = MACHINE_CLASS(mmc);
 +    const RAMInfo *p;
 +
 +    for (p = mmc->raminfo; p->name; p++) {
 +        if (p->mrindex < 0) {
 +            /* Found the entry for "system memory" */
 +            mc->default_ram_size = p->size;
 +            mc->default_ram_id = p->name;
 +            return;
 +        }
 +    }
-+
++    g_assert_not_reached();
-+    if ((insn & 0xff100000) == 0xf9000000) {
++}
-+        /*
++
-+         * T32 encodings 0b1111_1001_ppp0_qqqq_qqqq_qqqq_qqqq_qqqq
++static void mps3r_class_init(ObjectClass *oc, void *data)
-+         * transform into
++{
-+         * A32 encodings 0b1111_0100_ppp0_qqqq_qqqq_qqqq_qqqq_qqqq
++    MachineClass *mc = MACHINE_CLASS(oc);
-+         */
++
-+        uint32_t a32_insn = (insn & 0x00ffffff) | 0xf4000000;
++    mc->init = mps3r_common_init;
-+
++}
-+        if (disas_neon_ls(s, a32_insn)) {
++
-+            return;
++static void mps3r_an536_class_init(ObjectClass *oc, void *data)
-+        }
++{
-+    }
++    MachineClass *mc = MACHINE_CLASS(oc);
-+
++    MPS3RMachineClass *mmc = MPS3R_MACHINE_CLASS(oc);
-     /*
++    static const char * const valid_cpu_types[] = {
-      * TODO: Perhaps merge these into one decodetree output file.
++        ARM_CPU_TYPE_NAME("cortex-r52"),
-      * Note disas_vfp is written for a32 with cond field in the
++        NULL
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
++    };
-      */
++
-     if (disas_t32(s, insn) ||
++    mc->desc = "ARM MPS3 with AN536 FPGA image for Cortex-R52";
-         disas_vfp_uncond(s, insn) ||
++    mc->default_cpus = 2;
-+        disas_neon_shared(s, insn) ||
++    mc->min_cpus = mc->default_cpus;
-         ((insn >> 28) == 0xe && disas_vfp(s, insn))) {
++    mc->max_cpus = mc->default_cpus;
-         return;
++    mc->default_cpu_type = ARM_CPU_TYPE_NAME("cortex-r52");
-     }
++    mc->valid_cpu_types = valid_cpu_types;
-diff --git a/target/arm/Makefile.objs b/target/arm/Makefile.objs
++    mmc->raminfo = an536_raminfo;
 +    mps3r_set_default_ram_info(mmc);
 +}
 +
 +static const TypeInfo mps3r_machine_types[] = {
 +    {
 +        .name = TYPE_MPS3R_MACHINE,
 +        .parent = TYPE_MACHINE,
 +        .abstract = true,
 +        .instance_size = sizeof(MPS3RMachineState),
 +        .class_size = sizeof(MPS3RMachineClass),
 +        .class_init = mps3r_class_init,
 +    }, {
 +        .name = TYPE_MPS3R_AN536_MACHINE,
 +        .parent = TYPE_MPS3R_MACHINE,
 +        .class_init = mps3r_an536_class_init,
 +    },
 +};
 +
 +DEFINE_TYPES(mps3r_machine_types);
 diff --git a/hw/arm/Kconfig b/hw/arm/Kconfig
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/Makefile.objs
+--- a/hw/arm/Kconfig
-+++ b/target/arm/Makefile.objs
++++ b/hw/arm/Kconfig
-@@ -XXX,XX +XXX,XX @@ target/arm/decode-sve.inc.c: $(SRC_PATH)/target/arm/sve.decode $(DECODETREE)
+@@ -XXX,XX +XXX,XX @@ config MAINSTONE
-       $(PYTHON) $(DECODETREE) --decode disas_sve -o $@ $<,\
+     select PFLASH_CFI01
-       "GEN", $(TARGET_DIR)$@)
+     select SMC91C111
-+target/arm/decode-neon-shared.inc.c: $(SRC_PATH)/target/arm/neon-shared.decode $(DECODETREE)
++config MPS3R
-+    $(call quiet-command,\
++    bool
-+      $(PYTHON) $(DECODETREE) --static-decode disas_neon_shared -o $@ $<,\
++    default y
-+      "GEN", $(TARGET_DIR)$@)
++    depends on TCG && ARM
 +
-+target/arm/decode-neon-dp.inc.c: $(SRC_PATH)/target/arm/neon-dp.decode $(DECODETREE)
+ config MUSCA
-+    $(call quiet-command,\
+     bool
-+      $(PYTHON) $(DECODETREE) --static-decode disas_neon_dp -o $@ $<,\
+     default y
-+      "GEN", $(TARGET_DIR)$@)
+diff --git a/hw/arm/meson.build b/hw/arm/meson.build
-+
+index XXXXXXX..XXXXXXX 100644
-+target/arm/decode-neon-ls.inc.c: $(SRC_PATH)/target/arm/neon-ls.decode $(DECODETREE)
+--- a/hw/arm/meson.build
-+    $(call quiet-command,\
++++ b/hw/arm/meson.build
-+      $(PYTHON) $(DECODETREE) --static-decode disas_neon_ls -o $@ $<,\
+@@ -XXX,XX +XXX,XX @@ arm_ss.add(when: 'CONFIG_HIGHBANK', if_true: files('highbank.c'))
-+      "GEN", $(TARGET_DIR)$@)
+ arm_ss.add(when: 'CONFIG_INTEGRATOR', if_true: files('integratorcp.c'))
-+
+ arm_ss.add(when: 'CONFIG_MAINSTONE', if_true: files('mainstone.c'))
- target/arm/decode-vfp.inc.c: $(SRC_PATH)/target/arm/vfp.decode $(DECODETREE)
+ arm_ss.add(when: 'CONFIG_MICROBIT', if_true: files('microbit.c'))
-     $(call quiet-command,\
++arm_ss.add(when: 'CONFIG_MPS3R', if_true: files('mps3r.c'))
-       $(PYTHON) $(DECODETREE) --static-decode disas_vfp -o $@ $<,\
+ arm_ss.add(when: 'CONFIG_MUSICPAL', if_true: files('musicpal.c'))
-@@ -XXX,XX +XXX,XX @@ target/arm/decode-t16.inc.c: $(SRC_PATH)/target/arm/t16.decode $(DECODETREE)
+ arm_ss.add(when: 'CONFIG_NETDUINOPLUS2', if_true: files('netduinoplus2.c'))
-       "GEN", $(TARGET_DIR)$@)
+ arm_ss.add(when: 'CONFIG_OLIMEX_STM32_H405', if_true: files('olimex-stm32-h405.c'))
  target/arm/translate-sve.o: target/arm/decode-sve.inc.c
 +target/arm/translate.o: target/arm/decode-neon-shared.inc.c
 +target/arm/translate.o: target/arm/decode-neon-dp.inc.c
 +target/arm/translate.o: target/arm/decode-neon-ls.inc.c
  target/arm/translate.o: target/arm/decode-vfp.inc.c
  target/arm/translate.o: target/arm/decode-vfp-uncond.inc.c
  target/arm/translate.o: target/arm/decode-a32.inc.c
 --
-.20.1
+.34.1

-[PULL 26/39] target/arm: Convert VFM[AS]L (vector) to decodetree
+[PULL 31/35] hw/arm/mps3r: Add CPUs, GIC, and per-CPU RAM
-Convert the VFM[AS]L (vector) insns to decodetree.  This is the last
+Create the CPUs, the GIC, and the per-CPU RAM block for
-insn in the legacy decoder for the 3same_ext group, so we can
+the mps3-an536 board.
 delete the legacy decoder function for the group entirely.
 Note that in disas_thumb2_insn() the parts of this encoding space
 where the decodetree decoder returns false will correctly be directed
 to illegal_op by the "(insn & (1 << 28))" check so they won't fall
 into disas_coproc_insn() by mistake.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20240206132931.38376-10-peter.maydell@linaro.org
 Message-id: 20200430181003.21682-8-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |  6 +++
+ hw/arm/mps3r.c | 180 ++++++++++++++++++++++++++++++++++++++++++++++++-
- target/arm/translate-neon.inc.c | 31 +++++++++++
+file changed, 177 insertions(+), 3 deletions(-)
  target/arm/translate.c          | 92 +--------------------------------
 files changed, 38 insertions(+), 91 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/hw/arm/mps3r.c b/hw/arm/mps3r.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/hw/arm/mps3r.c
-+++ b/target/arm/neon-shared.decode
++++ b/hw/arm/mps3r.c
-@@ -XXX,XX +XXX,XX @@ VCADD          1111 110 rot:1 1 . 0 size:1 .... .... 1000 . q:1 . 0 .... \
+@@ -XXX,XX +XXX,XX @@
- # VUDOT and VSDOT
+ #include "qemu/osdep.h"
- VDOT           1111 110 00 . 10 .... .... 1101 . q:1 . u:1 .... \
+ #include "qemu/units.h"
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp
+ #include "qapi/error.h"
-+
++#include "qapi/qmp/qlist.h"
-+# VFM[AS]L
+ #include "exec/address-spaces.h"
-+VFML           1111 110 0 s:1 . 10 .... .... 1000 . 0 . 1 .... \
+ #include "cpu.h"
-+               vm=%vm_sp vn=%vn_sp vd=%vd_dp q=0
+ #include "hw/boards.h"
-+VFML           1111 110 0 s:1 . 10 .... .... 1000 . 1 . 1 .... \
++#include "hw/qdev-properties.h"
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp q=1
+ #include "hw/arm/boot.h"
-diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
++#include "hw/arm/bsa.h"
-index XXXXXXX..XXXXXXX 100644
++#include "hw/intc/arm_gicv3.h"
---- a/target/arm/translate-neon.inc.c
-+++ b/target/arm/translate-neon.inc.c
+ /* Define the layout of RAM and ROM in a board */
-@@ -XXX,XX +XXX,XX @@ static bool trans_VDOT(DisasContext *s, arg_VDOT *a)
+ typedef struct RAMInfo {
-                        opr_sz, opr_sz, 0, fn_gvec);
+@@ -XXX,XX +XXX,XX @@ typedef struct RAMInfo {
-     return true;
+ #define IS_ROM 2
  #define MPS3R_RAM_MAX 9
 +#define MPS3R_CPU_MAX 2
 +
 +#define PERIPHBASE 0xf0000000
 +#define NUM_SPIS 96
  typedef enum MPS3RFPGAType {
      FPGA_AN536,
@@ -XXX,XX +XXX,XX @@ struct MPS3RMachineClass {
      MachineClass parent;
      MPS3RFPGAType fpga_type;
      const RAMInfo *raminfo;
 +    hwaddr loader_start;
  };
  struct MPS3RMachineState {
      MachineState parent;
 +    struct arm_boot_info bootinfo;
      MemoryRegion ram[MPS3R_RAM_MAX];
 +    Object *cpu[MPS3R_CPU_MAX];
 +    MemoryRegion cpu_sysmem[MPS3R_CPU_MAX];
 +    MemoryRegion sysmem_alias[MPS3R_CPU_MAX];
 +    MemoryRegion cpu_ram[MPS3R_CPU_MAX];
 +    GICv3State gic;
  };
  #define TYPE_MPS3R_MACHINE "mps3r"
@@ -XXX,XX +XXX,XX @@ static MemoryRegion *mr_for_raminfo(MPS3RMachineState *mms,
      return ram;
  }
-+
-+static bool trans_VFML(DisasContext *s, arg_VFML *a)
++/*
 + * There is no defined secondary boot protocol for Linux for the AN536,
 + * because real hardware has a restriction that atomic operations between
 + * the two CPUs do not function correctly, and so true SMP is not
 + * possible. Therefore for cases where the user is directly booting
 + * a kernel, we treat the system as essentially uniprocessor, and
 + * put the secondary CPU into power-off state (as if the user on the
 + * real hardware had configured the secondary to be halted via the
 + * SCC config registers).
 + *
 + * Note that the default secondary boot code would not work here anyway
 + * as it assumes a GICv2, and we have a GICv3.
 + */
 +static void mps3r_write_secondary_boot(ARMCPU *cpu,
 +                                       const struct arm_boot_info *info)
 +{
-+    int opr_sz;
++    /*
-+
++     * Power the secondary CPU off. This means we don't need to write any
-+    if (!dc_isar_feature(aa32_fhm, s)) {
++     * boot code into guest memory. Note that the 'cpu' argument to this
-+        return false;
++     * function is the primary CPU we passed to arm_load_kernel(), not
 +     * the secondary. Loop around all the other CPUs, as the boot.c
 +     * code does for the "disable secondaries if PSCI is enabled" case.
 +     */
 +    for (CPUState *cs = first_cpu; cs; cs = CPU_NEXT(cs)) {
 +        if (cs != first_cpu) {
 +            object_property_set_bool(OBJECT(cs), "start-powered-off", true,
 +                                     &error_abort);
 +        }
 +    }
-+
++}
-+    /* UNDEF accesses to D16-D31 if they don't exist. */
++
-+    if (!dc_isar_feature(aa32_simd_r32, s) &&
++static void mps3r_secondary_cpu_reset(ARMCPU *cpu,
-+        (a->vd & 0x10)) {
++                                      const struct arm_boot_info *info)
-+        return false;
++{
 +    /* We don't need to do anything here because the CPU will be off */
 +}
 +
 +static void create_gic(MPS3RMachineState *mms, MemoryRegion *sysmem)
 +{
 +    MachineState *machine = MACHINE(mms);
 +    DeviceState *gicdev;
 +    QList *redist_region_count;
 +
 +    object_initialize_child(OBJECT(mms), "gic", &mms->gic, TYPE_ARM_GICV3);
 +    gicdev = DEVICE(&mms->gic);
 +    qdev_prop_set_uint32(gicdev, "num-cpu", machine->smp.cpus);
 +    qdev_prop_set_uint32(gicdev, "num-irq", NUM_SPIS + GIC_INTERNAL);
 +    redist_region_count = qlist_new();
 +    qlist_append_int(redist_region_count, machine->smp.cpus);
 +    qdev_prop_set_array(gicdev, "redist-region-count", redist_region_count);
 +    object_property_set_link(OBJECT(&mms->gic), "sysmem",
 +                             OBJECT(sysmem), &error_fatal);
 +    sysbus_realize(SYS_BUS_DEVICE(&mms->gic), &error_fatal);
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->gic), 0, PERIPHBASE);
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->gic), 1, PERIPHBASE + 0x100000);
 +    /*
 +     * Wire the outputs from each CPU's generic timer and the GICv3
 +     * maintenance interrupt signal to the appropriate GIC PPI inputs,
 +     * and the GIC's IRQ/FIQ/VIRQ/VFIQ interrupt outputs to the CPU's inputs.
 +     */
 +    for (int i = 0; i < machine->smp.cpus; i++) {
 +        DeviceState *cpudev = DEVICE(mms->cpu[i]);
 +        SysBusDevice *gicsbd = SYS_BUS_DEVICE(&mms->gic);
 +        int intidbase = NUM_SPIS + i * GIC_INTERNAL;
 +        int irq;
 +        /*
 +         * Mapping from the output timer irq lines from the CPU to the
 +         * GIC PPI inputs used for this board. This isn't a BSA board,
 +         * but it uses the standard convention for the PPI numbers.
 +         */
 +        const int timer_irq[] = {
 +            [GTIMER_PHYS] = ARCH_TIMER_NS_EL1_IRQ,
 +            [GTIMER_VIRT] = ARCH_TIMER_VIRT_IRQ,
 +            [GTIMER_HYP]  = ARCH_TIMER_NS_EL2_IRQ,
 +        };
 +
 +        for (irq = 0; irq < ARRAY_SIZE(timer_irq); irq++) {
 +            qdev_connect_gpio_out(cpudev, irq,
 +                                  qdev_get_gpio_in(gicdev,
 +                                                   intidbase + timer_irq[irq]));
 +        }
 +
 +        qdev_connect_gpio_out_named(cpudev, "gicv3-maintenance-interrupt", 0,
 +                                    qdev_get_gpio_in(gicdev,
 +                                                     intidbase + ARCH_GIC_MAINT_IRQ));
 +
 +        qdev_connect_gpio_out_named(cpudev, "pmu-interrupt", 0,
 +                                    qdev_get_gpio_in(gicdev,
 +                                                     intidbase + VIRTUAL_PMU_IRQ));
 +
 +        sysbus_connect_irq(gicsbd, i,
 +                           qdev_get_gpio_in(cpudev, ARM_CPU_IRQ));
 +        sysbus_connect_irq(gicsbd, i + machine->smp.cpus,
 +                           qdev_get_gpio_in(cpudev, ARM_CPU_FIQ));
 +        sysbus_connect_irq(gicsbd, i + 2 * machine->smp.cpus,
 +                           qdev_get_gpio_in(cpudev, ARM_CPU_VIRQ));
 +        sysbus_connect_irq(gicsbd, i + 3 * machine->smp.cpus,
 +                           qdev_get_gpio_in(cpudev, ARM_CPU_VFIQ));
 +    }
-+
++}
-+    if (a->vd & a->q) {
++
-+        return false;
+ static void mps3r_common_init(MachineState *machine)
  {
      MPS3RMachineState *mms = MPS3R_MACHINE(machine);
@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
          MemoryRegion *mr = mr_for_raminfo(mms, ri);
          memory_region_add_subregion(sysmem, ri->base, mr);
      }
 +
 +    assert(machine->smp.cpus <= MPS3R_CPU_MAX);
 +    for (int i = 0; i < machine->smp.cpus; i++) {
 +        g_autofree char *sysmem_name = g_strdup_printf("cpu-%d-memory", i);
 +        g_autofree char *ramname = g_strdup_printf("cpu-%d-memory", i);
 +        g_autofree char *alias_name = g_strdup_printf("sysmem-alias-%d", i);
 +
 +        /*
 +         * Each CPU has some private RAM/peripherals, so create the container
 +         * which will house those, with the whole-machine system memory being
 +         * used where there's no CPU-specific device. Note that we need the
 +         * sysmem_alias aliases because we can't put one MR (the original
 +         * 'sysmem') into more than one other MR.
 +         */
 +        memory_region_init(&mms->cpu_sysmem[i], OBJECT(machine),
 +                           sysmem_name, UINT64_MAX);
 +        memory_region_init_alias(&mms->sysmem_alias[i], OBJECT(machine),
 +                                 alias_name, sysmem, 0, UINT64_MAX);
 +        memory_region_add_subregion_overlap(&mms->cpu_sysmem[i], 0,
 +                                            &mms->sysmem_alias[i], -1);
 +
 +        mms->cpu[i] = object_new(machine->cpu_type);
 +        object_property_set_link(mms->cpu[i], "memory",
 +                                 OBJECT(&mms->cpu_sysmem[i]), &error_abort);
 +        object_property_set_int(mms->cpu[i], "reset-cbar",
 +                                PERIPHBASE, &error_abort);
 +        qdev_realize(DEVICE(mms->cpu[i]), NULL, &error_fatal);
 +        object_unref(mms->cpu[i]);
 +
 +        /* Per-CPU RAM */
 +        memory_region_init_ram(&mms->cpu_ram[i], NULL, ramname,
 +                               0x1000, &error_fatal);
 +        memory_region_add_subregion(&mms->cpu_sysmem[i], 0xe7c01000,
 +                                    &mms->cpu_ram[i]);
 +    }
 +
-+    if (!vfp_access_check(s)) {
++    create_gic(mms, sysmem);
-+        return true;
++
-+    }
++    mms->bootinfo.ram_size = machine->ram_size;
-+
++    mms->bootinfo.board_id = -1;
-+    opr_sz = (1 + a->q) * 8;
++    mms->bootinfo.loader_start = mmc->loader_start;
-+    tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
++    mms->bootinfo.write_secondary_boot = mps3r_write_secondary_boot;
-+                       vfp_reg_offset(a->q, a->vn),
++    mms->bootinfo.secondary_cpu_reset_hook = mps3r_secondary_cpu_reset;
-+                       vfp_reg_offset(a->q, a->vm),
++    arm_load_kernel(ARM_CPU(mms->cpu[0]), machine, &mms->bootinfo);
 +                       cpu_env, opr_sz, opr_sz, a->s, /* is_2 == 0 */
 +                       gen_helper_gvec_fmlal_a32);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
      return 0;
  }
--/* Advanced SIMD three registers of the same length extension.
+ static void mps3r_set_default_ram_info(MPS3RMachineClass *mmc)
-- *  31           25    23  22    20   16   12  11   10   9    8        3     0
+@@ -XXX,XX +XXX,XX @@ static void mps3r_set_default_ram_info(MPS3RMachineClass *mmc)
-- * +---------------+-----+---+-----+----+----+---+----+---+----+---------+----+
+             /* Found the entry for "system memory" */
-- * | 1 1 1 1 1 1 0 | op1 | D | op2 | Vn | Vd | 1 | o3 | 0 | o4 | N Q M U | Vm |
+             mc->default_ram_size = p->size;
-- * +---------------+-----+---+-----+----+----+---+----+---+----+---------+----+
+             mc->default_ram_id = p->name;
-- */
++            mmc->loader_start = p->base;
--static int disas_neon_insn_3same_ext(DisasContext *s, uint32_t insn)
+             return;
 -{
 -    gen_helper_gvec_3 *fn_gvec = NULL;
 -    gen_helper_gvec_3_ptr *fn_gvec_ptr = NULL;
 -    int rd, rn, rm, opr_sz;
 -    int data = 0;
 -    int off_rn, off_rm;
 -    bool is_long = false, q = extract32(insn, 6, 1);
 -    bool ptr_is_env = false;
 -
 -    if ((insn & 0xff300f10) == 0xfc200810) {
 -        /* VFM[AS]L -- 1111 1100 S.10 .... .... 1000 .Q.1 .... */
 -        int is_s = extract32(insn, 23, 1);
 -        if (!dc_isar_feature(aa32_fhm, s)) {
 -            return 1;
 -        }
 -        is_long = true;
 -        data = is_s; /* is_2 == 0 */
 -        fn_gvec_ptr = gen_helper_gvec_fmlal_a32;
 -        ptr_is_env = true;
 -    } else {
 -        return 1;
 -    }
 -
 -    VFP_DREG_D(rd, insn);
 -    if (rd & q) {
 -        return 1;
 -    }
 -    if (q || !is_long) {
 -        VFP_DREG_N(rn, insn);
 -        VFP_DREG_M(rm, insn);
 -        if ((rn | rm) & q & !is_long) {
 -            return 1;
 -        }
 -        off_rn = vfp_reg_offset(1, rn);
 -        off_rm = vfp_reg_offset(1, rm);
 -    } else {
 -        rn = VFP_SREG_N(insn);
 -        rm = VFP_SREG_M(insn);
 -        off_rn = vfp_reg_offset(0, rn);
 -        off_rm = vfp_reg_offset(0, rm);
 -    }
 -
 -    if (s->fp_excp_el) {
 -        gen_exception_insn(s, s->pc_curr, EXCP_UDEF,
 -                           syn_simd_access_trap(1, 0xe, false), s->fp_excp_el);
 -        return 0;
 -    }
 -    if (!s->vfp_enabled) {
 -        return 1;
 -    }
 -
 -    opr_sz = (1 + q) * 8;
 -    if (fn_gvec_ptr) {
 -        TCGv_ptr ptr;
 -        if (ptr_is_env) {
 -            ptr = cpu_env;
 -        } else {
 -            ptr = get_fpstatus_ptr(1);
 -        }
 -        tcg_gen_gvec_3_ptr(vfp_reg_offset(1, rd), off_rn, off_rm, ptr,
 -                           opr_sz, opr_sz, data, fn_gvec_ptr);
 -        if (!ptr_is_env) {
 -            tcg_temp_free_ptr(ptr);
 -        }
 -    } else {
 -        tcg_gen_gvec_3_ool(vfp_reg_offset(1, rd), off_rn, off_rm,
 -                           opr_sz, opr_sz, data, fn_gvec);
 -    }
 -    return 0;
 -}
 -
  /* Advanced SIMD two registers and a scalar extension.
   *  31             24   23  22   20   16   12  11   10   9    8        3     0
   * +-----------------+----+---+----+----+----+---+----+---+----+---------+----+
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
                      }
                  }
              }
 -        } else if ((insn & 0x0e000a00) == 0x0c000800
 -                   && arm_dc_feature(s, ARM_FEATURE_V8)) {
 -            if (disas_neon_insn_3same_ext(s, insn)) {
 -                goto illegal_op;
 -            }
 -            return;
          } else if ((insn & 0x0f000a00) == 0x0e000800
                     && arm_dc_feature(s, ARM_FEATURE_V8)) {
              if (disas_neon_insn_2reg_scalar_ext(s, insn)) {
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
              }
              break;
          }
--        if ((insn & 0xfe000a00) == 0xfc000800
+     }
-+        if ((insn & 0xff000a00) == 0xfe000800
+@@ -XXX,XX +XXX,XX @@ static void mps3r_an536_class_init(ObjectClass *oc, void *data)
-             && arm_dc_feature(s, ARM_FEATURE_V8)) {
+     };
-             /* The Thumb2 and ARM encodings are identical.  */
--            if (disas_neon_insn_3same_ext(s, insn)) {
+     mc->desc = "ARM MPS3 with AN536 FPGA image for Cortex-R52";
--                goto illegal_op;
+-    mc->default_cpus = 2;
--            }
+-    mc->min_cpus = mc->default_cpus;
--        } else if ((insn & 0xff000a00) == 0xfe000800
+-    mc->max_cpus = mc->default_cpus;
--                   && arm_dc_feature(s, ARM_FEATURE_V8)) {
++    /*
--            /* The Thumb2 and ARM encodings are identical.  */
++     * In the real FPGA image there are always two cores, but the standard
-             if (disas_neon_insn_2reg_scalar_ext(s, insn)) {
++     * initial setting for the SCC SYSCON 0x000 register is 0x21, meaning
-                 goto illegal_op;
++     * that the second core is held in reset and halted. Many images built for
-             }
++     * the board do not expect the second core to run at startup (especially
 +     * since on the real FPGA image it is not possible to use LDREX/STREX
 +     * in RAM between the two cores, so a true SMP setup isn't supported).
 +     *
 +     * As QEMU's equivalent of this, we support both -smp 1 and -smp 2,
 +     * with the default being -smp 1. This seems a more intuitive UI for
 +     * QEMU users than, for instance, having a machine property to allow
 +     * the user to set the initial value of the SYSCON 0x000 register.
 +     */
 +    mc->default_cpus = 1;
 +    mc->min_cpus = 1;
 +    mc->max_cpus = 2;
      mc->default_cpu_type = ARM_CPU_TYPE_NAME("cortex-r52");
      mc->valid_cpu_types = valid_cpu_types;
      mmc->raminfo = an536_raminfo;
 --
-.20.1
+.34.1

-[PULL 19/39] hw/arm: versal-virt: Add support for the RTC
+[PULL 32/35] hw/arm/mps3r: Add UARTs
-From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
+This board has a lot of UARTs: there is one UART per CPU in the
 per-CPU peripheral part of the address map, whose interrupts are
 connected as per-CPU interrupt lines.  Then there are 4 UARTs in the
 normal part of the peripheral space, whose interrupts are shared
 peripheral interrupts.
-Add support for the RTC.
+Connect and wire them all up; this involves some OR gates where
 multiple overflow interrupts are wired into one GIC input.
-Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-Reviewed-by: Luc Michel <luc.michel@greensocs.com>
-Message-id: 20200427181649.26851-12-edgar.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
+Message-id: 20240206132931.38376-11-peter.maydell@linaro.org
 ---
- hw/arm/xlnx-versal-virt.c | 22 ++++++++++++++++++++++
+ hw/arm/mps3r.c | 94 ++++++++++++++++++++++++++++++++++++++++++++++++++
-file changed, 22 insertions(+)
+file changed, 94 insertions(+)
-diff --git a/hw/arm/xlnx-versal-virt.c b/hw/arm/xlnx-versal-virt.c
+diff --git a/hw/arm/mps3r.c b/hw/arm/mps3r.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-versal-virt.c
+--- a/hw/arm/mps3r.c
-+++ b/hw/arm/xlnx-versal-virt.c
++++ b/hw/arm/mps3r.c
-@@ -XXX,XX +XXX,XX @@ static void fdt_add_sd_nodes(VersalVirt *s)
+@@ -XXX,XX +XXX,XX @@
  #include "qapi/qmp/qlist.h"
  #include "exec/address-spaces.h"
  #include "cpu.h"
 +#include "sysemu/sysemu.h"
  #include "hw/boards.h"
 +#include "hw/or-irq.h"
  #include "hw/qdev-properties.h"
  #include "hw/arm/boot.h"
  #include "hw/arm/bsa.h"
 +#include "hw/char/cmsdk-apb-uart.h"
  #include "hw/intc/arm_gicv3.h"
  /* Define the layout of RAM and ROM in a board */
@@ -XXX,XX +XXX,XX @@ typedef struct RAMInfo {
  #define MPS3R_RAM_MAX 9
  #define MPS3R_CPU_MAX 2
 +#define MPS3R_UART_MAX 4 /* shared UART count */
  #define PERIPHBASE 0xf0000000
  #define NUM_SPIS 96
@@ -XXX,XX +XXX,XX @@ struct MPS3RMachineState {
      MemoryRegion sysmem_alias[MPS3R_CPU_MAX];
      MemoryRegion cpu_ram[MPS3R_CPU_MAX];
      GICv3State gic;
 +    /* per-CPU UARTs followed by the shared UARTs */
 +    CMSDKAPBUART uart[MPS3R_CPU_MAX + MPS3R_UART_MAX];
 +    OrIRQState cpu_uart_oflow[MPS3R_CPU_MAX];
 +    OrIRQState uart_oflow;
  };
  #define TYPE_MPS3R_MACHINE "mps3r"
@@ -XXX,XX +XXX,XX @@ struct MPS3RMachineState {
  OBJECT_DECLARE_TYPE(MPS3RMachineState, MPS3RMachineClass, MPS3R_MACHINE)
 +/*
 + * Main clock frequency CLK in Hz (50MHz). In the image there are also
 + * ACLK, MCLK, GPUCLK and PERIPHCLK at the same frequency; for our
 + * model we just roll them all into one.
 + */
 +#define CLK_FRQ 50000000
 +
  static const RAMInfo an536_raminfo[] = {
      {
          .name = "ATCM",
@@ -XXX,XX +XXX,XX @@ static void create_gic(MPS3RMachineState *mms, MemoryRegion *sysmem)
      }
  }
-+static void fdt_add_rtc_node(VersalVirt *s)
++/*
 + * Create UART uartno, and map it into the MemoryRegion mem at address baseaddr.
 + * The qemu_irq arguments are where we connect the various IRQs from the UART.
 + */
 +static void create_uart(MPS3RMachineState *mms, int uartno, MemoryRegion *mem,
 +                        hwaddr baseaddr, qemu_irq txirq, qemu_irq rxirq,
 +                        qemu_irq txoverirq, qemu_irq rxoverirq,
 +                        qemu_irq combirq)
 +{
-+    const char compat[] = "xlnx,zynqmp-rtc";
++    g_autofree char *s = g_strdup_printf("uart%d", uartno);
-+    const char interrupt_names[] = "alarm\0sec";
++    SysBusDevice *sbd;
 +    char *name = g_strdup_printf("/rtc@%x", MM_PMC_RTC);
 +
-+    qemu_fdt_add_subnode(s->fdt, name);
++    assert(uartno < ARRAY_SIZE(mms->uart));
-+
++    object_initialize_child(OBJECT(mms), s, &mms->uart[uartno],
-+    qemu_fdt_setprop_cells(s->fdt, name, "interrupts",
++                            TYPE_CMSDK_APB_UART);
-+                           GIC_FDT_IRQ_TYPE_SPI, VERSAL_RTC_ALARM_IRQ,
++    qdev_prop_set_uint32(DEVICE(&mms->uart[uartno]), "pclk-frq", CLK_FRQ);
-+                           GIC_FDT_IRQ_FLAGS_LEVEL_HI,
++    qdev_prop_set_chr(DEVICE(&mms->uart[uartno]), "chardev", serial_hd(uartno));
-+                           GIC_FDT_IRQ_TYPE_SPI, VERSAL_RTC_SECONDS_IRQ,
++    sbd = SYS_BUS_DEVICE(&mms->uart[uartno]);
-+                           GIC_FDT_IRQ_FLAGS_LEVEL_HI);
++    sysbus_realize(sbd, &error_fatal);
-+    qemu_fdt_setprop(s->fdt, name, "interrupt-names",
++    memory_region_add_subregion(mem, baseaddr,
-+                     interrupt_names, sizeof(interrupt_names));
++                                sysbus_mmio_get_region(sbd, 0));
-+    qemu_fdt_setprop_sized_cells(s->fdt, name, "reg",
++    sysbus_connect_irq(sbd, 0, txirq);
-+                                 2, MM_PMC_RTC, 2, MM_PMC_RTC_SIZE);
++    sysbus_connect_irq(sbd, 1, rxirq);
-+    qemu_fdt_setprop(s->fdt, name, "compatible", compat, sizeof(compat));
++    sysbus_connect_irq(sbd, 2, txoverirq);
-+    g_free(name);
++    sysbus_connect_irq(sbd, 3, rxoverirq);
 +    sysbus_connect_irq(sbd, 4, combirq);
 +}
 +
- static void fdt_nop_memory_nodes(void *fdt, Error **errp)
+ static void mps3r_common_init(MachineState *machine)
  {
-     Error *err = NULL;
+     MPS3RMachineState *mms = MPS3R_MACHINE(machine);
-@@ -XXX,XX +XXX,XX @@ static void versal_virt_init(MachineState *machine)
+     MPS3RMachineClass *mmc = MPS3R_MACHINE_GET_CLASS(mms);
-     fdt_add_timer_nodes(s);
+     MemoryRegion *sysmem = get_system_memory();
-     fdt_add_zdma_nodes(s);
++    DeviceState *gicdev;
-     fdt_add_sd_nodes(s);
-+    fdt_add_rtc_node(s);
+     for (const RAMInfo *ri = mmc->raminfo; ri->name; ri++) {
-     fdt_add_cpu_nodes(s, psci_conduit);
+         MemoryRegion *mr = mr_for_raminfo(mms, ri);
-     fdt_add_clk_node(s, "/clk125", 125000000, s->phandle.clk_125Mhz);
+@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
-     fdt_add_clk_node(s, "/clk25", 25000000, s->phandle.clk_25Mhz);
+     }
      create_gic(mms, sysmem);
 +    gicdev = DEVICE(&mms->gic);
 +
 +    /*
 +     * UARTs 0 and 1 are per-CPU; their interrupts are wired to
 +     * the relevant CPU's PPI 0..3, aka INTID 16..19
 +     */
 +    for (int i = 0; i < machine->smp.cpus; i++) {
 +        int intidbase = NUM_SPIS + i * GIC_INTERNAL;
 +        g_autofree char *s = g_strdup_printf("cpu-uart-oflow-orgate%d", i);
 +        DeviceState *orgate;
 +
 +        /* The two overflow IRQs from the UART are ORed together into PPI 3 */
 +        object_initialize_child(OBJECT(mms), s, &mms->cpu_uart_oflow[i],
 +                                TYPE_OR_IRQ);
 +        orgate = DEVICE(&mms->cpu_uart_oflow[i]);
 +        qdev_prop_set_uint32(orgate, "num-lines", 2);
 +        qdev_realize(orgate, NULL, &error_fatal);
 +        qdev_connect_gpio_out(orgate, 0,
 +                              qdev_get_gpio_in(gicdev, intidbase + 19));
 +
 +        create_uart(mms, i, &mms->cpu_sysmem[i], 0xe7c00000,
 +                    qdev_get_gpio_in(gicdev, intidbase + 17), /* tx */
 +                    qdev_get_gpio_in(gicdev, intidbase + 16), /* rx */
 +                    qdev_get_gpio_in(orgate, 0), /* txover */
 +                    qdev_get_gpio_in(orgate, 1), /* rxover */
 +                    qdev_get_gpio_in(gicdev, intidbase + 18) /* combined */);
 +    }
 +    /*
 +     * UARTs 2 to 5 are whole-system; all overflow IRQs are ORed
 +     * together into IRQ 17
 +     */
 +    object_initialize_child(OBJECT(mms), "uart-oflow-orgate",
 +                            &mms->uart_oflow, TYPE_OR_IRQ);
 +    qdev_prop_set_uint32(DEVICE(&mms->uart_oflow), "num-lines",
 +                         MPS3R_UART_MAX * 2);
 +    qdev_realize(DEVICE(&mms->uart_oflow), NULL, &error_fatal);
 +    qdev_connect_gpio_out(DEVICE(&mms->uart_oflow), 0,
 +                          qdev_get_gpio_in(gicdev, 17));
 +
 +    for (int i = 0; i < MPS3R_UART_MAX; i++) {
 +        hwaddr baseaddr = 0xe0205000 + i * 0x1000;
 +        int rxirq = 5 + i * 2, txirq = 6 + i * 2, combirq = 13 + i;
 +
 +        create_uart(mms, i + MPS3R_CPU_MAX, sysmem, baseaddr,
 +                    qdev_get_gpio_in(gicdev, txirq),
 +                    qdev_get_gpio_in(gicdev, rxirq),
 +                    qdev_get_gpio_in(DEVICE(&mms->uart_oflow), i * 2),
 +                    qdev_get_gpio_in(DEVICE(&mms->uart_oflow), i * 2 + 1),
 +                    qdev_get_gpio_in(gicdev, combirq));
 +    }
      mms->bootinfo.ram_size = machine->ram_size;
      mms->bootinfo.board_id = -1;
 --
-.20.1
+.34.1

-[PULL 20/39] target/arm/translate-vfp.inc.c: Remove duplicate simd_r32 check
+Deleted patch
-Somewhere along theline we accidentally added a duplicate
-"using D16-D31 when they don't exist" check to do_vfm_dp()
-(probably an artifact of a patchseries rebase). Remove it.
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Message-id: 20200430181003.21682-2-peter.maydell@linaro.org
----
- target/arm/translate-vfp.inc.c | 6 ------
-file changed, 6 deletions(-)
-diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-vfp.inc.c
-+++ b/target/arm/translate-vfp.inc.c
-@@ -XXX,XX +XXX,XX @@ static bool do_vfm_dp(DisasContext *s, arg_VFMA_dp *a, bool neg_n, bool neg_d)
-         return false;
-     }
--    /* UNDEF accesses to D16-D31 if they don't exist. */
--    if (!dc_isar_feature(aa32_simd_r32, s) &&
--        ((a->vd | a->vn | a->vm) & 0x10)) {
--        return false;
--    }
--
-     if (!vfp_access_check(s)) {
-         return true;
-     }
---
-.20.1

-[PULL 21/39] target/arm: Don't allow Thumb Neon insns without FEATURE_NEON
+Deleted patch
-We were accidentally permitting decode of Thumb Neon insns even if
-the CPU didn't have the FEATURE_NEON bit set, because the feature
-check was being done before the call to disas_neon_data_insn() and
-disas_neon_ls_insn() in the Arm decoder but was omitted from the
-Thumb decoder.  Push the feature bit check down into the called
-functions so it is done for both Arm and Thumb encodings.
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Message-id: 20200430181003.21682-3-peter.maydell@linaro.org
----
- target/arm/translate.c | 16 ++++++++--------
-file changed, 8 insertions(+), 8 deletions(-)
-diff --git a/target/arm/translate.c b/target/arm/translate.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.c
-+++ b/target/arm/translate.c
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
-     TCGv_i32 tmp2;
-     TCGv_i64 tmp64;
-+    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
-+        return 1;
-+    }
-+
-     /* FIXME: this access check should not take precedence over UNDEF
-      * for invalid encodings; we will generate incorrect syndrome information
-      * for attempts to execute invalid vfp/neon encodings with FP disabled.
-@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
-     TCGv_ptr ptr1, ptr2, ptr3;
-     TCGv_i64 tmp64;
-+    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
-+        return 1;
-+    }
-+
-     /* FIXME: this access check should not take precedence over UNDEF
-      * for invalid encodings; we will generate incorrect syndrome information
-      * for attempts to execute invalid vfp/neon encodings with FP disabled.
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
-         if (((insn >> 25) & 7) == 1) {
-             /* NEON Data processing.  */
--            if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
--                goto illegal_op;
--            }
--
-             if (disas_neon_data_insn(s, insn)) {
-                 goto illegal_op;
-             }
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
-         }
-         if ((insn & 0x0f100000) == 0x04000000) {
-             /* NEON load/store.  */
--            if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
--                goto illegal_op;
--            }
--
-             if (disas_neon_ls_insn(s, insn)) {
-                 goto illegal_op;
-             }
---
-.20.1

-[PULL 24/39] target/arm: Convert VCADD (vector) to decodetree
+[PULL 33/35] hw/arm/mps3r: Add GPIO, watchdog, dual-timer, I2C devices
-Convert the VCADD (vector) insns to decodetree.
+Add the GPIO, watchdog, dual-timer and I2C devices to the mps3-an536
 board.  These are all simple devices that just need to be created and
 wired up.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-6-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-12-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   |  3 +++
+ hw/arm/mps3r.c | 59 ++++++++++++++++++++++++++++++++++++++++++++++++++
- target/arm/translate-neon.inc.c | 37 +++++++++++++++++++++++++++++++++
+file changed, 59 insertions(+)
  target/arm/translate.c          | 11 +---------
 files changed, 41 insertions(+), 10 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/hw/arm/mps3r.c b/hw/arm/mps3r.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/hw/arm/mps3r.c
-+++ b/target/arm/neon-shared.decode
++++ b/hw/arm/mps3r.c
 @@ -XXX,XX +XXX,XX @@
+ #include "sysemu/sysemu.h"
- VCMLA          1111 110 rot:2 . 1 size:1 .... .... 1000 . q:1 . 0 .... \
+ #include "hw/boards.h"
-                vm=%vm_dp vn=%vn_dp vd=%vd_dp
+ #include "hw/or-irq.h"
 +#include "hw/qdev-clock.h"
  #include "hw/qdev-properties.h"
  #include "hw/arm/boot.h"
  #include "hw/arm/bsa.h"
  #include "hw/char/cmsdk-apb-uart.h"
 +#include "hw/i2c/arm_sbcon_i2c.h"
  #include "hw/intc/arm_gicv3.h"
 +#include "hw/misc/unimp.h"
 +#include "hw/timer/cmsdk-apb-dualtimer.h"
 +#include "hw/watchdog/cmsdk-apb-watchdog.h"
  /* Define the layout of RAM and ROM in a board */
  typedef struct RAMInfo {
@@ -XXX,XX +XXX,XX @@ struct MPS3RMachineState {
      CMSDKAPBUART uart[MPS3R_CPU_MAX + MPS3R_UART_MAX];
      OrIRQState cpu_uart_oflow[MPS3R_CPU_MAX];
      OrIRQState uart_oflow;
 +    CMSDKAPBWatchdog watchdog;
 +    CMSDKAPBDualTimer dualtimer;
 +    ArmSbconI2CState i2c[5];
 +    Clock *clk;
  };
  #define TYPE_MPS3R_MACHINE "mps3r"
@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
      MemoryRegion *sysmem = get_system_memory();
      DeviceState *gicdev;
 +    mms->clk = clock_new(OBJECT(machine), "CLK");
 +    clock_set_hz(mms->clk, CLK_FRQ);
 +
-+VCADD          1111 110 rot:1 1 . 0 size:1 .... .... 1000 . q:1 . 0 .... \
+     for (const RAMInfo *ri = mmc->raminfo; ri->name; ri++) {
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp
+         MemoryRegion *mr = mr_for_raminfo(mms, ri);
-diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
+         memory_region_add_subregion(sysmem, ri->base, mr);
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
---- a/target/arm/translate-neon.inc.c
+                     qdev_get_gpio_in(gicdev, combirq));
-+++ b/target/arm/translate-neon.inc.c
+     }
-@@ -XXX,XX +XXX,XX @@ static bool trans_VCMLA(DisasContext *s, arg_VCMLA *a)
-     tcg_temp_free_ptr(fpst);
++    for (int i = 0; i < 4; i++) {
-     return true;
++        /* CMSDK GPIO controllers */
- }
++        g_autofree char *s = g_strdup_printf("gpio%d", i);
-+
++        create_unimplemented_device(s, 0xe0000000 + i * 0x1000, 0x1000);
 +static bool trans_VCADD(DisasContext *s, arg_VCADD *a)
 +{
 +    int opr_sz;
 +    TCGv_ptr fpst;
 +    gen_helper_gvec_3_ptr *fn_gvec_ptr;
 +
 +    if (!dc_isar_feature(aa32_vcma, s)
 +        || (!a->size && !dc_isar_feature(aa32_fp16_arith, s))) {
 +        return false;
 +    }
 +
-+    /* UNDEF accesses to D16-D31 if they don't exist. */
++    object_initialize_child(OBJECT(mms), "watchdog", &mms->watchdog,
-+    if (!dc_isar_feature(aa32_simd_r32, s) &&
++                            TYPE_CMSDK_APB_WATCHDOG);
-+        ((a->vd | a->vn | a->vm) & 0x10)) {
++    qdev_connect_clock_in(DEVICE(&mms->watchdog), "WDOGCLK", mms->clk);
-+        return false;
++    sysbus_realize(SYS_BUS_DEVICE(&mms->watchdog), &error_fatal);
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&mms->watchdog), 0,
 +                       qdev_get_gpio_in(gicdev, 0));
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->watchdog), 0, 0xe0100000);
 +
 +    object_initialize_child(OBJECT(mms), "dualtimer", &mms->dualtimer,
 +                            TYPE_CMSDK_APB_DUALTIMER);
 +    qdev_connect_clock_in(DEVICE(&mms->dualtimer), "TIMCLK", mms->clk);
 +    sysbus_realize(SYS_BUS_DEVICE(&mms->dualtimer), &error_fatal);
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&mms->dualtimer), 0,
 +                       qdev_get_gpio_in(gicdev, 3));
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&mms->dualtimer), 1,
 +                       qdev_get_gpio_in(gicdev, 1));
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&mms->dualtimer), 2,
 +                       qdev_get_gpio_in(gicdev, 2));
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->dualtimer), 0, 0xe0101000);
 +
 +    for (int i = 0; i < ARRAY_SIZE(mms->i2c); i++) {
 +        static const hwaddr i2cbase[] = {0xe0102000,    /* Touch */
 +                                         0xe0103000,    /* Audio */
 +                                         0xe0107000,    /* Shield0 */
 +                                         0xe0108000,    /* Shield1 */
 +                                         0xe0109000};   /* DDR4 EEPROM */
 +        g_autofree char *s = g_strdup_printf("i2c%d", i);
 +
 +        object_initialize_child(OBJECT(mms), s, &mms->i2c[i],
 +                                TYPE_ARM_SBCON_I2C);
 +        sysbus_realize(SYS_BUS_DEVICE(&mms->i2c[i]), &error_fatal);
 +        sysbus_mmio_map(SYS_BUS_DEVICE(&mms->i2c[i]), 0, i2cbase[i]);
 +        if (i != 2 && i != 3) {
 +            /*
 +             * internal-only bus: mark it full to avoid user-created
 +             * i2c devices being plugged into it.
 +             */
 +            qbus_mark_full(qdev_get_child_bus(DEVICE(&mms->i2c[i]), "i2c"));
 +        }
 +    }
 +
-+    if ((a->vn | a->vm | a->vd) & a->q) {
+     mms->bootinfo.ram_size = machine->ram_size;
-+        return false;
+     mms->bootinfo.board_id = -1;
-+    }
+     mms->bootinfo.loader_start = mmc->loader_start;
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    opr_sz = (1 + a->q) * 8;
 +    fpst = get_fpstatus_ptr(1);
 +    fn_gvec_ptr = a->size ? gen_helper_gvec_fcadds : gen_helper_gvec_fcaddh;
 +    tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(1, a->vn),
 +                       vfp_reg_offset(1, a->vm),
 +                       fpst, opr_sz, opr_sz, a->rot,
 +                       fn_gvec_ptr);
 +    tcg_temp_free_ptr(fpst);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_insn_3same_ext(DisasContext *s, uint32_t insn)
      bool is_long = false, q = extract32(insn, 6, 1);
      bool ptr_is_env = false;
 -    if ((insn & 0xfea00f10) == 0xfc800800) {
 -        /* VCADD -- 1111 110R 1.0S .... .... 1000 ...0 .... */
 -        int size = extract32(insn, 20, 1);
 -        data = extract32(insn, 24, 1); /* rot */
 -        if (!dc_isar_feature(aa32_vcma, s)
 -            || (!size && !dc_isar_feature(aa32_fp16_arith, s))) {
 -            return 1;
 -        }
 -        fn_gvec_ptr = size ? gen_helper_gvec_fcadds : gen_helper_gvec_fcaddh;
 -    } else if ((insn & 0xfeb00f00) == 0xfc200d00) {
 +    if ((insn & 0xfeb00f00) == 0xfc200d00) {
          /* V[US]DOT -- 1111 1100 0.10 .... .... 1101 .Q.U .... */
          bool u = extract32(insn, 4, 1);
          if (!dc_isar_feature(aa32_dp, s)) {
 --
-.20.1
+.34.1

-[PULL 31/39] target/arm: Convert Neon 'load single structure to all lanes' to decodetree
+[PULL 34/35] hw/arm/mps3r: Add remaining devices
-Convert the Neon "load single structure to all lanes" insns to
+Add the remaining devices (or unimplemented-device stubs) for
-decodetree.
+this board: SPI controllers, SCC, FPGAIO, I2S, RTC, the
 QSPI write-config block, and ethernet.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-13-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-13-peter.maydell@linaro.org
 ---
- target/arm/neon-ls.decode       |  5 +++
+ hw/arm/mps3r.c | 74 ++++++++++++++++++++++++++++++++++++++++++++++++++
- target/arm/translate-neon.inc.c | 73 +++++++++++++++++++++++++++++++++
+file changed, 74 insertions(+)
  target/arm/translate.c          | 55 +------------------------
 files changed, 80 insertions(+), 53 deletions(-)
-diff --git a/target/arm/neon-ls.decode b/target/arm/neon-ls.decode
+diff --git a/hw/arm/mps3r.c b/hw/arm/mps3r.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-ls.decode
+--- a/hw/arm/mps3r.c
-+++ b/target/arm/neon-ls.decode
++++ b/hw/arm/mps3r.c
 @@ -XXX,XX +XXX,XX @@
+ #include "hw/char/cmsdk-apb-uart.h"
- VLDST_multiple 1111 0100 0 . l:1 0 rn:4 .... itype:4 size:2 align:2 rm:4 \
+ #include "hw/i2c/arm_sbcon_i2c.h"
-                vd=%vd_dp
+ #include "hw/intc/arm_gicv3.h"
 +#include "hw/misc/mps2-scc.h"
 +#include "hw/misc/mps2-fpgaio.h"
  #include "hw/misc/unimp.h"
 +#include "hw/net/lan9118.h"
 +#include "hw/rtc/pl031.h"
 +#include "hw/ssi/pl022.h"
  #include "hw/timer/cmsdk-apb-dualtimer.h"
  #include "hw/watchdog/cmsdk-apb-watchdog.h"
@@ -XXX,XX +XXX,XX @@ struct MPS3RMachineState {
      CMSDKAPBWatchdog watchdog;
      CMSDKAPBDualTimer dualtimer;
      ArmSbconI2CState i2c[5];
 +    PL022State spi[3];
 +    MPS2SCC scc;
 +    MPS2FPGAIO fpgaio;
 +    UnimplementedDeviceState i2s_audio;
 +    PL031State rtc;
      Clock *clk;
  };
@@ -XXX,XX +XXX,XX @@ static const RAMInfo an536_raminfo[] = {
      }
  };
 +static const int an536_oscclk[] = {
 +    24000000, /* 24MHz reference for RTC and timers */
 +    50000000, /* 50MHz ACLK */
 +    50000000, /* 50MHz MCLK */
 +    50000000, /* 50MHz GPUCLK */
 +    24576000, /* 24.576MHz AUDCLK */
 +    23750000, /* 23.75MHz HDLCDCLK */
 +    100000000, /* 100MHz DDR4_REF_CLK */
 +};
 +
-+# Neon load single element to all lanes
+ static MemoryRegion *mr_for_raminfo(MPS3RMachineState *mms,
                                      const RAMInfo *raminfo)
  {
@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
      MPS3RMachineClass *mmc = MPS3R_MACHINE_GET_CLASS(mms);
      MemoryRegion *sysmem = get_system_memory();
      DeviceState *gicdev;
 +    QList *oscclk;
      mms->clk = clock_new(OBJECT(machine), "CLK");
      clock_set_hz(mms->clk, CLK_FRQ);
@@ -XXX,XX +XXX,XX @@ static void mps3r_common_init(MachineState *machine)
          }
      }
 +    for (int i = 0; i < ARRAY_SIZE(mms->spi); i++) {
 +        g_autofree char *s = g_strdup_printf("spi%d", i);
 +        hwaddr baseaddr = 0xe0104000 + i * 0x1000;
 +
-+VLD_all_lanes  1111 0100 1 . 1 0 rn:4 .... 11 n:2 size:2 t:1 a:1 rm:4 \
++        object_initialize_child(OBJECT(mms), s, &mms->spi[i], TYPE_PL022);
-+               vd=%vd_dp
++        sysbus_realize(SYS_BUS_DEVICE(&mms->spi[i]), &error_fatal);
-diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
++        sysbus_mmio_map(SYS_BUS_DEVICE(&mms->spi[i]), 0, baseaddr);
-index XXXXXXX..XXXXXXX 100644
++        sysbus_connect_irq(SYS_BUS_DEVICE(&mms->spi[i]), 0,
---- a/target/arm/translate-neon.inc.c
++                           qdev_get_gpio_in(gicdev, 22 + i));
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VLDST_multiple(DisasContext *s, arg_VLDST_multiple *a)
      gen_neon_ldst_base_update(s, a->rm, a->rn, nregs * interleave * 8);
      return true;
  }
 +
 +static bool trans_VLD_all_lanes(DisasContext *s, arg_VLD_all_lanes *a)
 +{
 +    /* Neon load single structure to all lanes */
 +    int reg, stride, vec_size;
 +    int vd = a->vd;
 +    int size = a->size;
 +    int nregs = a->n + 1;
 +    TCGv_i32 addr, tmp;
 +
 +    if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
 +        return false;
 +    }
 +
-+    /* UNDEF accesses to D16-D31 if they don't exist */
++    object_initialize_child(OBJECT(mms), "scc", &mms->scc, TYPE_MPS2_SCC);
-+    if (!dc_isar_feature(aa32_simd_r32, s) && (a->vd & 0x10)) {
++    qdev_prop_set_uint32(DEVICE(&mms->scc), "scc-cfg0", 0);
-+        return false;
++    qdev_prop_set_uint32(DEVICE(&mms->scc), "scc-cfg4", 0x2);
 +    qdev_prop_set_uint32(DEVICE(&mms->scc), "scc-aid", 0x00200008);
 +    qdev_prop_set_uint32(DEVICE(&mms->scc), "scc-id", 0x41055360);
 +    oscclk = qlist_new();
 +    for (int i = 0; i < ARRAY_SIZE(an536_oscclk); i++) {
 +        qlist_append_int(oscclk, an536_oscclk[i]);
 +    }
++    qdev_prop_set_array(DEVICE(&mms->scc), "oscclk", oscclk);
++    sysbus_realize(SYS_BUS_DEVICE(&mms->scc), &error_fatal);
++    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->scc), 0, 0xe0200000);
 +
-+    if (size == 3) {
++    create_unimplemented_device("i2s-audio", 0xe0201000, 0x1000);
 +        if (nregs != 4 || a->a == 0) {
 +            return false;
 +        }
 +        /* For VLD4 size == 3 a == 1 means 32 bits at 16 byte alignment */
 +        size = 2;
 +    }
 +    if (nregs == 1 && a->a == 1 && size == 0) {
 +        return false;
 +    }
 +    if (nregs == 3 && a->a == 1) {
 +        return false;
 +    }
 +
-+    if (!vfp_access_check(s)) {
++    object_initialize_child(OBJECT(mms), "fpgaio", &mms->fpgaio,
-+        return true;
++                            TYPE_MPS2_FPGAIO);
-+    }
++    qdev_prop_set_uint32(DEVICE(&mms->fpgaio), "prescale-clk", an536_oscclk[1]);
 +    qdev_prop_set_uint32(DEVICE(&mms->fpgaio), "num-leds", 10);
 +    qdev_prop_set_bit(DEVICE(&mms->fpgaio), "has-switches", true);
 +    qdev_prop_set_bit(DEVICE(&mms->fpgaio), "has-dbgctrl", false);
 +    sysbus_realize(SYS_BUS_DEVICE(&mms->fpgaio), &error_fatal);
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->fpgaio), 0, 0xe0202000);
 +
 +    create_unimplemented_device("clcd", 0xe0209000, 0x1000);
 +
 +    object_initialize_child(OBJECT(mms), "rtc", &mms->rtc, TYPE_PL031);
 +    sysbus_realize(SYS_BUS_DEVICE(&mms->rtc), &error_fatal);
 +    sysbus_mmio_map(SYS_BUS_DEVICE(&mms->rtc), 0, 0xe020a000);
 +    sysbus_connect_irq(SYS_BUS_DEVICE(&mms->rtc), 0,
 +                       qdev_get_gpio_in(gicdev, 4));
 +
 +    /*
-+     * VLD1 to all lanes: T bit indicates how many Dregs to write.
++     * In hardware this is a LAN9220; the LAN9118 is software compatible
-+     * VLD2/3/4 to all lanes: T bit indicates register stride.
++     * except that it doesn't support the checksum-offload feature.
 +     */
-+    stride = a->t ? 2 : 1;
++    lan9118_init(0xe0300000,
-+    vec_size = nregs == 1 ? stride * 8 : 8;
++                 qdev_get_gpio_in(gicdev, 18));
 +
-+    tmp = tcg_temp_new_i32();
++    create_unimplemented_device("usb", 0xe0301000, 0x1000);
-+    addr = tcg_temp_new_i32();
++    create_unimplemented_device("qspi-write-config", 0xe0600000, 0x1000);
 +    load_reg_var(s, addr, a->rn);
 +    for (reg = 0; reg < nregs; reg++) {
 +        gen_aa32_ld_i32(s, tmp, addr, get_mem_index(s),
 +                        s->be_data | size);
 +        if ((vd & 1) && vec_size == 16) {
 +            /*
 +             * We cannot write 16 bytes at once because the
 +             * destination is unaligned.
 +             */
 +            tcg_gen_gvec_dup_i32(size, neon_reg_offset(vd, 0),
 +                                 8, 8, tmp);
 +            tcg_gen_gvec_mov(0, neon_reg_offset(vd + 1, 0),
 +                             neon_reg_offset(vd, 0), 8, 8);
 +        } else {
 +            tcg_gen_gvec_dup_i32(size, neon_reg_offset(vd, 0),
 +                                 vec_size, vec_size, tmp);
 +        }
 +        tcg_gen_addi_i32(addr, addr, 1 << size);
 +        vd += stride;
 +    }
 +    tcg_temp_free_i32(tmp);
 +    tcg_temp_free_i32(addr);
 +
-+    gen_neon_ldst_base_update(s, a->rm, a->rn, (1 << size) * nregs);
+     mms->bootinfo.ram_size = machine->ram_size;
-+
+     mms->bootinfo.board_id = -1;
-+    return true;
+     mms->bootinfo.loader_start = mmc->loader_start;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
      int size;
      int reg;
      int load;
 -    int vec_size;
      TCGv_i32 addr;
      TCGv_i32 tmp;
@@ -XXX,XX +XXX,XX @@ static int disas_neon_ls_insn(DisasContext *s, uint32_t insn)
      } else {
          size = (insn >> 10) & 3;
          if (size == 3) {
 -            /* Load single element to all lanes.  */
 -            int a = (insn >> 4) & 1;
 -            if (!load) {
 -                return 1;
 -            }
 -            size = (insn >> 6) & 3;
 -            nregs = ((insn >> 8) & 3) + 1;
 -
 -            if (size == 3) {
 -                if (nregs != 4 || a == 0) {
 -                    return 1;
 -                }
 -                /* For VLD4 size==3 a == 1 means 32 bits at 16 byte alignment */
 -                size = 2;
 -            }
 -            if (nregs == 1 && a == 1 && size == 0) {
 -                return 1;
 -            }
 -            if (nregs == 3 && a == 1) {
 -                return 1;
 -            }
 -            addr = tcg_temp_new_i32();
 -            load_reg_var(s, addr, rn);
 -
 -            /* VLD1 to all lanes: bit 5 indicates how many Dregs to write.
 -             * VLD2/3/4 to all lanes: bit 5 indicates register stride.
 -             */
 -            stride = (insn & (1 << 5)) ? 2 : 1;
 -            vec_size = nregs == 1 ? stride * 8 : 8;
 -
 -            tmp = tcg_temp_new_i32();
 -            for (reg = 0; reg < nregs; reg++) {
 -                gen_aa32_ld_i32(s, tmp, addr, get_mem_index(s),
 -                                s->be_data | size);
 -                if ((rd & 1) && vec_size == 16) {
 -                    /* We cannot write 16 bytes at once because the
 -                     * destination is unaligned.
 -                     */
 -                    tcg_gen_gvec_dup_i32(size, neon_reg_offset(rd, 0),
 -                                         8, 8, tmp);
 -                    tcg_gen_gvec_mov(0, neon_reg_offset(rd + 1, 0),
 -                                     neon_reg_offset(rd, 0), 8, 8);
 -                } else {
 -                    tcg_gen_gvec_dup_i32(size, neon_reg_offset(rd, 0),
 -                                         vec_size, vec_size, tmp);
 -                }
 -                tcg_gen_addi_i32(addr, addr, 1 << size);
 -                rd += stride;
 -            }
 -            tcg_temp_free_i32(tmp);
 -            tcg_temp_free_i32(addr);
 -            stride = (1 << size) * nregs;
 +            /* Load single element to all lanes -- handled by decodetree  */
 +            return 1;
          } else {
              /* Single element.  */
              int idx = (insn >> 4) & 0xf;
 --
-.20.1
+.34.1

-[PULL 23/39] target/arm: Convert VCMLA (vector) to decodetree
+[PULL 35/35] docs: Add documentation for the mps3-an536 board
-Convert the VCMLA (vector) insns in the 3same extension group to
+Add documentation for the mps3-an536 board type.
 decodetree.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-id: 20200430181003.21682-5-peter.maydell@linaro.org
+Message-id: 20240206132931.38376-14-peter.maydell@linaro.org
 ---
- target/arm/neon-shared.decode   | 11 ++++++++++
+ docs/system/arm/mps2.rst | 37 ++++++++++++++++++++++++++++++++++---
- target/arm/translate-neon.inc.c | 37 +++++++++++++++++++++++++++++++++
+file changed, 34 insertions(+), 3 deletions(-)
  target/arm/translate.c          | 11 +---------
 files changed, 49 insertions(+), 10 deletions(-)
-diff --git a/target/arm/neon-shared.decode b/target/arm/neon-shared.decode
+diff --git a/docs/system/arm/mps2.rst b/docs/system/arm/mps2.rst
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/neon-shared.decode
+--- a/docs/system/arm/mps2.rst
-+++ b/target/arm/neon-shared.decode
++++ b/docs/system/arm/mps2.rst
 @@ -XXX,XX +XXX,XX @@
- # More specifically, this covers:
+-Arm MPS2 and MPS3 boards (``mps2-an385``, ``mps2-an386``, ``mps2-an500``, ``mps2-an505``, ``mps2-an511``, ``mps2-an521``, ``mps3-an524``, ``mps3-an547``)
- # 2reg scalar ext: 0b1111_1110_xxxx_xxxx_xxxx_1x0x_xxxx_xxxx
+-=========================================================================================================================================================
- # 3same ext:       0b1111_110x_xxxx_xxxx_xxxx_1x0x_xxxx_xxxx
++Arm MPS2 and MPS3 boards (``mps2-an385``, ``mps2-an386``, ``mps2-an500``, ``mps2-an505``, ``mps2-an511``, ``mps2-an521``, ``mps3-an524``, ``mps3-an536``, ``mps3-an547``)
 +=========================================================================================================================================================================
 -These board models all use Arm M-profile CPUs.
 +These board models use Arm M-profile or R-profile CPUs.
  The Arm MPS2, MPS2+ and MPS3 dev boards are FPGA based (the 2+ has a
  bigger FPGA but is otherwise the same as the 2; the 3 has a bigger
@@ -XXX,XX +XXX,XX @@ FPGA image.
  QEMU models the following FPGA images:
 +FPGA images using M-profile CPUs:
 +
-+# VFP/Neon register fields; same as vfp.decode
+ ``mps2-an385``
-+%vm_dp  5:1 0:4
+   Cortex-M3 as documented in Arm Application Note AN385
-+%vm_sp  0:4 5:1
+ ``mps2-an386``
-+%vn_dp  7:1 16:4
+@@ -XXX,XX +XXX,XX @@ QEMU models the following FPGA images:
-+%vn_sp  16:4 7:1
+ ``mps3-an547``
-+%vd_dp  22:1 12:4
+   Cortex-M55 on an MPS3, as documented in Arm Application Note AN547
-+%vd_sp  12:4 22:1
 +FPGA images using R-profile CPUs:
 +
-+VCMLA          1111 110 rot:2 . 1 size:1 .... .... 1000 . q:1 . 0 .... \
++``mps3-an536``
-+               vm=%vm_dp vn=%vn_dp vd=%vd_dp
++  Dual Cortex-R52 on an MPS3, as documented in Arm Application Note AN536
 diff --git a/target/arm/translate-neon.inc.c b/target/arm/translate-neon.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-neon.inc.c
 +++ b/target/arm/translate-neon.inc.c
@@ -XXX,XX +XXX,XX @@
  #include "decode-neon-dp.inc.c"
  #include "decode-neon-ls.inc.c"
  #include "decode-neon-shared.inc.c"
 +
-+static bool trans_VCMLA(DisasContext *s, arg_VCMLA *a)
+ Differences between QEMU and real hardware:
-+{
-+    int opr_sz;
+ - AN385/AN386 remapping of low 16K of memory to either ZBT SSRAM1 or to
-+    TCGv_ptr fpst;
+@@ -XXX,XX +XXX,XX @@ Differences between QEMU and real hardware:
-+    gen_helper_gvec_3_ptr *fn_gvec_ptr;
+   flash, but only as simple ROM, so attempting to rewrite the flash
    from the guest will fail
  - QEMU does not model the USB controller in MPS3 boards
 +- AN536 does not support runtime control of CPU reset and halt via
 +  the SCC CFG_REG0 register.
 +- AN536 does not support enabling or disabling the flash and ATCM
 +  interfaces via the SCC CFG_REG1 register.
 +- AN536 does not support setting of the initial vector table
 +  base address via the SCC CFG_REG6 and CFG_REG7 register config,
 +  and does not provide a mechanism for specifying these values at
 +  startup, so all guest images must be built to start from TCM
 +  (i.e. to expect the interrupt vector base at 0 from reset).
 +- AN536 defaults to only creating a single CPU; this is the equivalent
 +  of the way the real FPGA image usually runs with the second Cortex-R52
 +  held in halt via the initial SCC CFG_REG0 register setting. You can
 +  create the second CPU with ``-smp 2``; both CPUs will then start
 +  execution immediately on startup.
 +
-+    if (!dc_isar_feature(aa32_vcma, s)
++Note that for the AN536 the first UART is accessible only by
-+        || (!a->size && !dc_isar_feature(aa32_fp16_arith, s))) {
++CPU0, and the second UART is accessible only by CPU1. The
-+        return false;
++first UART accessible shared between both CPUs is the third
-+    }
++UART. Guest software might therefore be built to use either
-+
++the first UART or the third UART; if you don't see any output
-+    /* UNDEF accesses to D16-D31 if they don't exist. */
++from the UART you are looking at, try one of the others.
-+    if (!dc_isar_feature(aa32_simd_r32, s) &&
++(Even if the AN536 machine is started with a single CPU and so
-+        ((a->vd | a->vn | a->vm) & 0x10)) {
++no "CPU1-only UART", the UART numbering remains the same,
-+        return false;
++with the third UART being the first of the shared ones.)
-+    }
-+
+ Machine-specific options
-+    if ((a->vn | a->vm | a->vd) & a->q) {
+ """"""""""""""""""""""""
 +        return false;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    opr_sz = (1 + a->q) * 8;
 +    fpst = get_fpstatus_ptr(1);
 +    fn_gvec_ptr = a->size ? gen_helper_gvec_fcmlas : gen_helper_gvec_fcmlah;
 +    tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
 +                       vfp_reg_offset(1, a->vn),
 +                       vfp_reg_offset(1, a->vm),
 +                       fpst, opr_sz, opr_sz, a->rot,
 +                       fn_gvec_ptr);
 +    tcg_temp_free_ptr(fpst);
 +    return true;
 +}
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_insn_3same_ext(DisasContext *s, uint32_t insn)
      bool is_long = false, q = extract32(insn, 6, 1);
      bool ptr_is_env = false;
 -    if ((insn & 0xfe200f10) == 0xfc200800) {
 -        /* VCMLA -- 1111 110R R.1S .... .... 1000 ...0 .... */
 -        int size = extract32(insn, 20, 1);
 -        data = extract32(insn, 23, 2); /* rot */
 -        if (!dc_isar_feature(aa32_vcma, s)
 -            || (!size && !dc_isar_feature(aa32_fp16_arith, s))) {
 -            return 1;
 -        }
 -        fn_gvec_ptr = size ? gen_helper_gvec_fcmlas : gen_helper_gvec_fcmlah;
 -    } else if ((insn & 0xfea00f10) == 0xfc800800) {
 +    if ((insn & 0xfea00f10) == 0xfc800800) {
          /* VCADD -- 1111 110R 1.0S .... .... 1000 ...0 .... */
          int size = extract32(insn, 20, 1);
          data = extract32(insn, 24, 1); /* rot */
 --
-.20.1
+.34.1

Most of this is the Neon decodetree patches, followed by Edgar's versal cleanups.

thanks
-- PMM

The following changes since commit 2ef486e76d64436be90f7359a3071fb2a56ce835:

Merge remote-tracking branch 'remotes/marcel/tags/rdma-pull-request' into staging (2020-05-03 14:12:56 +0100)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20200504

for you to fetch changes up to 9aefc6cf9b73f66062d2f914a0136756e7a28211:

target/arm: Move gen_ function typedefs to translate.h (2020-05-04 12:59:26 +0100)

----------------------------------------------------------------
target-arm queue:
 * Start of conversion of Neon insns to decodetree
 * versal board: support SD and RTC
 * Implement ARMv8.2-TTS2UXN
 * Make VQDMULL undefined when U=1
 * Some minor code cleanups

----------------------------------------------------------------
Edgar E. Iglesias (11):
      hw/arm: versal: Remove inclusion of arm_gicv3_common.h
      hw/arm: versal: Move misplaced comment
      hw/arm: versal-virt: Fix typo xlnx-ve -> xlnx-versal
      hw/arm: versal: Embed the UARTs into the SoC type
      hw/arm: versal: Embed the GEMs into the SoC type
      hw/arm: versal: Embed the ADMAs into the SoC type
      hw/arm: versal: Embed the APUs into the SoC type
      hw/arm: versal: Add support for SD
      hw/arm: versal: Add support for the RTC
      hw/arm: versal-virt: Add support for SD
      hw/arm: versal-virt: Add support for the RTC

Fredrik Strupe (1):
      target/arm: Make VQDMULL undefined when U=1

Peter Maydell (25):
      target/arm: Don't use a TLB for ARMMMUIdx_Stage2
      target/arm: Use enum constant in get_phys_addr_lpae() call
      target/arm: Add new 's1_is_el0' argument to get_phys_addr_lpae()
      target/arm: Implement ARMv8.2-TTS2UXN
      target/arm: Use correct variable for setting 'max' cpu's ID_AA64DFR0
      target/arm/translate-vfp.inc.c: Remove duplicate simd_r32 check
      target/arm: Don't allow Thumb Neon insns without FEATURE_NEON
      target/arm: Add stubs for AArch32 Neon decodetree
      target/arm: Convert VCMLA (vector) to decodetree
      target/arm: Convert VCADD (vector) to decodetree
      target/arm: Convert V[US]DOT (vector) to decodetree
      target/arm: Convert VFM[AS]L (vector) to decodetree
      target/arm: Convert VCMLA (scalar) to decodetree
      target/arm: Convert V[US]DOT (scalar) to decodetree
      target/arm: Convert VFM[AS]L (scalar) to decodetree
      target/arm: Convert Neon load/store multiple structures to decodetree
      target/arm: Convert Neon 'load single structure to all lanes' to decodetree
      target/arm: Convert Neon 'load/store single structure' to decodetree
      target/arm: Convert Neon 3-reg-same VADD/VSUB to decodetree
      target/arm: Convert Neon 3-reg-same logic ops to decodetree
      target/arm: Convert Neon 3-reg-same VMAX/VMIN to decodetree
      target/arm: Convert Neon 3-reg-same comparisons to decodetree
      target/arm: Convert Neon 3-reg-same VQADD/VQSUB to decodetree
      target/arm: Convert Neon 3-reg-same VMUL, VMLA, VMLS, VSHL to decodetree
      target/arm: Move gen_ function typedefs to translate.h

Philippe Mathieu-Daudé (2):
      hw/arm/mps2-tz: Use TYPE_IOTKIT instead of hardcoded string
      target/arm: Use uint64_t for midr field in CPU state struct

include/hw/arm/xlnx-versal.h    |  31 +-
 target/arm/cpu-param.h          |   2 +-
 target/arm/cpu.h                |  38 ++-
 target/arm/translate-a64.h      |   9 -
 target/arm/translate.h          |  26 ++
 target/arm/neon-dp.decode       |  86 +++++
 target/arm/neon-ls.decode       |  52 +++
 target/arm/neon-shared.decode   |  66 ++++
 hw/arm/mps2-tz.c                |   2 +-
 hw/arm/xlnx-versal-virt.c       |  74 ++++-
 hw/arm/xlnx-versal.c            | 115 +++++--
 target/arm/cpu.c                |   3 +-
 target/arm/cpu64.c              |   8 +-
 target/arm/helper.c             | 183 ++++------
 target/arm/translate-a64.c      |  17 -
 target/arm/translate-neon.inc.c | 714 +++++++++++++++++++++++++++++++++++++++
 target/arm/translate-vfp.inc.c  |   6 -
 target/arm/translate.c          | 716 +++-------------------------------------
 target/arm/Makefile.objs        |  18 +
 19 files changed, 1302 insertions(+), 864 deletions(-)
 create mode 100644 target/arm/neon-dp.decode
 create mode 100644 target/arm/neon-ls.decode
 create mode 100644 target/arm/neon-shared.decode
 create mode 100644 target/arm/translate-neon.inc.c

From: Fredrik Strupe <fredrik@strupe.net>

According to Arm ARM, VQDMULL is only valid when U=0, while having
U=1 is unallocated.

Signed-off-by: Fredrik Strupe <fredrik@strupe.net>
Fixes: 695272dcb976 ("target-arm: Handle UNDEF cases for Neon 3-regs-different-widths")
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/translate.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_neon_data_insn(DisasContext *s, uint32_t insn)
                     {0, 0, 0, 0}, /* VMLSL */
                     {0, 0, 0, 9}, /* VQDMLSL */
                     {0, 0, 0, 0}, /* Integer VMULL */
-                    {0, 0, 0, 1}, /* VQDMULL */
+                    {0, 0, 0, 9}, /* VQDMULL */
                     {0, 0, 0, 0xa}, /* Polynomial VMULL */
                     {0, 0, 0, 7}, /* Reserved: always UNDEF */
                 };
-- 
2.20.1

From: Philippe Mathieu-Daudé <f4bug@amsat.org>

By using the TYPE_* definitions for devices, we can:
 - quickly find where devices are used with 'git-grep'
 - easily rename a device (one-line change).

Signed-off-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Message-id: 20200428154650.21991-1-f4bug@amsat.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/mps2-tz.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/mps2-tz.c
+++ b/hw/arm/mps2-tz.c
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
         exit(EXIT_FAILURE);
     }
 
-    sysbus_init_child_obj(OBJECT(machine), "iotkit", &mms->iotkit,
+    sysbus_init_child_obj(OBJECT(machine), TYPE_IOTKIT, &mms->iotkit,
                           sizeof(mms->iotkit), mmc->armsse_type);
     iotkitdev = DEVICE(&mms->iotkit);
     object_property_set_link(OBJECT(&mms->iotkit), OBJECT(system_memory),
-- 
2.20.1

We define ARMMMUIdx_Stage2 as being an MMU index which uses a QEMU
TLB.  However we never actually use the TLB -- all stage 2 lookups
are done by direct calls to get_phys_addr_lpae() followed by a
physical address load via address_space_ld*().

Remove Stage2 from the list of ARM MMU indexes which correspond to
real core MMU indexes, and instead put it in the set of "NOTLB" ARM
MMU indexes.

This allows us to drop NB_MMU_MODES to 11.  It also means we can
safely add support for the ARMv8.3-TTS2UXN extension, which adds
permission bits to the stage 2 descriptors which define execute
permission separatel for EL0 and EL1; supporting that while keeping
Stage2 in a QEMU TLB would require us to use separate TLBs for
"Stage2 for an EL0 access" and "Stage2 for an EL1 access", which is a
lot of extra complication given we aren't even using the QEMU TLB.

In the process of updating the comment on our MMU index use,
fix a couple of other minor errors:
 * NS EL2 EL2&0 was missing from the list in the comment
 * some text hadn't been updated from when we bumped NB_MMU_MODES
   above 8

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20200330210400.11724-2-peter.maydell@linaro.org
---
 target/arm/cpu-param.h |   2 +-
 target/arm/cpu.h       |  21 +++++---
 target/arm/helper.c    | 112 ++++-------------------------------------
 3 files changed, 27 insertions(+), 108 deletions(-)

diff --git a/target/arm/cpu-param.h b/target/arm/cpu-param.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu-param.h
+++ b/target/arm/cpu-param.h
@@ -XXX,XX +XXX,XX @@
 # define TARGET_PAGE_BITS_MIN  10
 #endif
 
-#define NB_MMU_MODES 12
+#define NB_MMU_MODES 11
 
 #endif
diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
  *     handling via the TLB. The only way to do a stage 1 translation without
  *     the immediate stage 2 translation is via the ATS or AT system insns,
  *     which can be slow-pathed and always do a page table walk.
+ *     The only use of stage 2 translations is either as part of an s1+2
+ *     lookup or when loading the descriptors during a stage 1 page table walk,
+ *     and in both those cases we don't use the TLB.
  *  4. we can also safely fold together the "32 bit EL3" and "64 bit EL3"
  *     translation regimes, because they map reasonably well to each other
  *     and they can't both be active at the same time.
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
  * NS EL1 EL1&0 stage 1+2 (aka NS PL1)
  * NS EL1 EL1&0 stage 1+2 +PAN
  * NS EL0 EL2&0
+ * NS EL2 EL2&0
  * NS EL2 EL2&0 +PAN
  * NS EL2 (aka NS PL2)
  * S EL0 EL1&0 (aka S PL0)
  * S EL1 EL1&0 (not used if EL3 is 32 bit)
  * S EL1 EL1&0 +PAN
  * S EL3 (aka S PL1)
- * NS EL1&0 stage 2
  *
- * for a total of 12 different mmu_idx.
+ * for a total of 11 different mmu_idx.
  *
  * R profile CPUs have an MPU, but can use the same set of MMU indexes
  * as A profile. They only need to distinguish NS EL0 and NS EL1 (and
@@ -XXX,XX +XXX,XX @@ bool write_cpustate_to_list(ARMCPU *cpu, bool kvm_sync);
  * are not quite the same -- different CPU types (most notably M profile
  * vs A/R profile) would like to use MMU indexes with different semantics,
  * but since we don't ever need to use all of those in a single CPU we
- * can avoid setting NB_MMU_MODES to more than 8. The lower bits of
+ * can avoid having to set NB_MMU_MODES to "total number of A profile MMU
+ * modes + total number of M profile MMU modes". The lower bits of
  * ARMMMUIdx are the core TLB mmu index, and the higher bits are always
  * the same for any particular CPU.
  * Variables of type ARMMUIdx are always full values, and the core
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdx {
     ARMMMUIdx_SE10_1_PAN = 9 | ARM_MMU_IDX_A,
     ARMMMUIdx_SE3        = 10 | ARM_MMU_IDX_A,
 
-    ARMMMUIdx_Stage2     = 11 | ARM_MMU_IDX_A,
-
     /*
      * These are not allocated TLBs and are used only for AT system
      * instructions or for the first stage of an S12 page table walk.
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdx {
     ARMMMUIdx_Stage1_E0 = 0 | ARM_MMU_IDX_NOTLB,
     ARMMMUIdx_Stage1_E1 = 1 | ARM_MMU_IDX_NOTLB,
     ARMMMUIdx_Stage1_E1_PAN = 2 | ARM_MMU_IDX_NOTLB,
+    /*
+     * Not allocated a TLB: used only for second stage of an S12 page
+     * table walk, or for descriptor loads during first stage of an S1
+     * page table walk. Note that if we ever want to have a TLB for this
+     * then various TLB flush insns which currently are no-ops or flush
+     * only stage 1 MMU indexes will need to change to flush stage 2.
+     */
+    ARMMMUIdx_Stage2     = 3 | ARM_MMU_IDX_NOTLB,
 
     /*
      * M-profile.
@@ -XXX,XX +XXX,XX @@ typedef enum ARMMMUIdxBit {
     TO_CORE_BIT(SE10_1),
     TO_CORE_BIT(SE10_1_PAN),
     TO_CORE_BIT(SE3),
-    TO_CORE_BIT(Stage2),
 
     TO_CORE_BIT(MUser),
     TO_CORE_BIT(MPriv),
diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static void tlbiall_nsnh_write(CPUARMState *env, const ARMCPRegInfo *ri,
     tlb_flush_by_mmuidx(cs,
                         ARMMMUIdxBit_E10_1 |
                         ARMMMUIdxBit_E10_1_PAN |
-                        ARMMMUIdxBit_E10_0 |
-                        ARMMMUIdxBit_Stage2);
+                        ARMMMUIdxBit_E10_0);
 }
 
 static void tlbiall_nsnh_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
@@ -XXX,XX +XXX,XX @@ static void tlbiall_nsnh_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
     tlb_flush_by_mmuidx_all_cpus_synced(cs,
                                         ARMMMUIdxBit_E10_1 |
                                         ARMMMUIdxBit_E10_1_PAN |
-                                        ARMMMUIdxBit_E10_0 |
-                                        ARMMMUIdxBit_Stage2);
+                                        ARMMMUIdxBit_E10_0);
 }
 
-static void tlbiipas2_write(CPUARMState *env, const ARMCPRegInfo *ri,
-                            uint64_t value)
-{
-    /* Invalidate by IPA. This has to invalidate any structures that
-     * contain only stage 2 translation information, but does not need
-     * to apply to structures that contain combined stage 1 and stage 2
-     * translation information.
-     * This must NOP if EL2 isn't implemented or SCR_EL3.NS is zero.
-     */
-    CPUState *cs = env_cpu(env);
-    uint64_t pageaddr;
-
-    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
-        return;
-    }
-
-    pageaddr = sextract64(value << 12, 0, 40);
-
-    tlb_flush_page_by_mmuidx(cs, pageaddr, ARMMMUIdxBit_Stage2);
-}
-
-static void tlbiipas2_is_write(CPUARMState *env, const ARMCPRegInfo *ri,
-                               uint64_t value)
-{
-    CPUState *cs = env_cpu(env);
-    uint64_t pageaddr;
-
-    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
-        return;
-    }
-
-    pageaddr = sextract64(value << 12, 0, 40);
-
-    tlb_flush_page_by_mmuidx_all_cpus_synced(cs, pageaddr,
-                                             ARMMMUIdxBit_Stage2);
-}
 
 static void tlbiall_hyp_write(CPUARMState *env, const ARMCPRegInfo *ri,
                               uint64_t value)
@@ -XXX,XX +XXX,XX @@ static void vttbr_write(CPUARMState *env, const ARMCPRegInfo *ri,
         tlb_flush_by_mmuidx(cs,
                             ARMMMUIdxBit_E10_1 |
                             ARMMMUIdxBit_E10_1_PAN |
-                            ARMMMUIdxBit_E10_0 |
-                            ARMMMUIdxBit_Stage2);
+                            ARMMMUIdxBit_E10_0);
         raw_write(env, ri, value);
     }
 }
@@ -XXX,XX +XXX,XX @@ static int alle1_tlbmask(CPUARMState *env)
         return ARMMMUIdxBit_SE10_1 |
                ARMMMUIdxBit_SE10_1_PAN |
                ARMMMUIdxBit_SE10_0;
-    } else if (arm_feature(env, ARM_FEATURE_EL2)) {
-        return ARMMMUIdxBit_E10_1 |
-               ARMMMUIdxBit_E10_1_PAN |
-               ARMMMUIdxBit_E10_0 |
-               ARMMMUIdxBit_Stage2;
     } else {
         return ARMMMUIdxBit_E10_1 |
                ARMMMUIdxBit_E10_1_PAN |
@@ -XXX,XX +XXX,XX @@ static void tlbi_aa64_vae3is_write(CPUARMState *env, const ARMCPRegInfo *ri,
                                              ARMMMUIdxBit_SE3);
 }
 
-static void tlbi_aa64_ipas2e1_write(CPUARMState *env, const ARMCPRegInfo *ri,
-                                    uint64_t value)
-{
-    /* Invalidate by IPA. This has to invalidate any structures that
-     * contain only stage 2 translation information, but does not need
-     * to apply to structures that contain combined stage 1 and stage 2
-     * translation information.
-     * This must NOP if EL2 isn't implemented or SCR_EL3.NS is zero.
-     */
-    ARMCPU *cpu = env_archcpu(env);
-    CPUState *cs = CPU(cpu);
-    uint64_t pageaddr;
-
-    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
-        return;
-    }
-
-    pageaddr = sextract64(value << 12, 0, 48);
-
-    tlb_flush_page_by_mmuidx(cs, pageaddr, ARMMMUIdxBit_Stage2);
-}
-
-static void tlbi_aa64_ipas2e1is_write(CPUARMState *env, const ARMCPRegInfo *ri,
-                                      uint64_t value)
-{
-    CPUState *cs = env_cpu(env);
-    uint64_t pageaddr;
-
-    if (!arm_feature(env, ARM_FEATURE_EL2) || !(env->cp15.scr_el3 & SCR_NS)) {
-        return;
-    }
-
-    pageaddr = sextract64(value << 12, 0, 48);
-
-    tlb_flush_page_by_mmuidx_all_cpus_synced(cs, pageaddr,
-                                             ARMMMUIdxBit_Stage2);
-}
-
 static CPAccessResult aa64_zva_access(CPUARMState *env, const ARMCPRegInfo *ri,
                                       bool isread)
 {
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
       .writefn = tlbi_aa64_vae1_write },
     { .name = "TLBI_IPAS2E1IS", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 1,
-      .access = PL2_W, .type = ARM_CP_NO_RAW,
-      .writefn = tlbi_aa64_ipas2e1is_write },
+      .access = PL2_W, .type = ARM_CP_NOP },
     { .name = "TLBI_IPAS2LE1IS", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 5,
-      .access = PL2_W, .type = ARM_CP_NO_RAW,
-      .writefn = tlbi_aa64_ipas2e1is_write },
+      .access = PL2_W, .type = ARM_CP_NOP },
     { .name = "TLBI_ALLE1IS", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 3, .opc2 = 4,
       .access = PL2_W, .type = ARM_CP_NO_RAW,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
       .writefn = tlbi_aa64_alle1is_write },
     { .name = "TLBI_IPAS2E1", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 1,
-      .access = PL2_W, .type = ARM_CP_NO_RAW,
-      .writefn = tlbi_aa64_ipas2e1_write },
+      .access = PL2_W, .type = ARM_CP_NOP },
     { .name = "TLBI_IPAS2LE1", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 5,
-      .access = PL2_W, .type = ARM_CP_NO_RAW,
-      .writefn = tlbi_aa64_ipas2e1_write },
+      .access = PL2_W, .type = ARM_CP_NOP },
     { .name = "TLBI_ALLE1", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 8, .crm = 7, .opc2 = 4,
       .access = PL2_W, .type = ARM_CP_NO_RAW,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
       .writefn = tlbimva_hyp_is_write },
     { .name = "TLBIIPAS2",
       .cp = 15, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 1,
-      .type = ARM_CP_NO_RAW, .access = PL2_W,
-      .writefn = tlbiipas2_write },
+      .type = ARM_CP_NOP, .access = PL2_W },
     { .name = "TLBIIPAS2IS",
       .cp = 15, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 1,
-      .type = ARM_CP_NO_RAW, .access = PL2_W,
-      .writefn = tlbiipas2_is_write },
+      .type = ARM_CP_NOP, .access = PL2_W },
     { .name = "TLBIIPAS2L",
       .cp = 15, .opc1 = 4, .crn = 8, .crm = 4, .opc2 = 5,
-      .type = ARM_CP_NO_RAW, .access = PL2_W,
-      .writefn = tlbiipas2_write },
+      .type = ARM_CP_NOP, .access = PL2_W },
     { .name = "TLBIIPAS2LIS",
       .cp = 15, .opc1 = 4, .crn = 8, .crm = 0, .opc2 = 5,
-      .type = ARM_CP_NO_RAW, .access = PL2_W,
-      .writefn = tlbiipas2_is_write },
+      .type = ARM_CP_NOP, .access = PL2_W },
     /* 32 bit cache operations */
     { .name = "ICIALLUIS", .cp = 15, .opc1 = 0, .crn = 7, .crm = 1, .opc2 = 0,
       .type = ARM_CP_NOP, .access = PL1_W, .accessfn = aa64_cacheop_pou_access },
-- 
2.20.1

The access_type argument to get_phys_addr_lpae() is an MMUAccessType;
use the enum constant MMU_DATA_LOAD rather than a literal 0 when we
call it in S1_ptw_translate().

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20200330210400.11724-3-peter.maydell@linaro.org
---
 target/arm/helper.c | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static hwaddr S1_ptw_translate(CPUARMState *env, ARMMMUIdx mmu_idx,
             pcacheattrs = &cacheattrs;
         }
 
-        ret = get_phys_addr_lpae(env, addr, 0, ARMMMUIdx_Stage2, &s2pa,
-                                 &txattrs, &s2prot, &s2size, fi, pcacheattrs);
+        ret = get_phys_addr_lpae(env, addr, MMU_DATA_LOAD, ARMMMUIdx_Stage2,
+                                 &s2pa, &txattrs, &s2prot, &s2size, fi,
+                                 pcacheattrs);
         if (ret) {
             assert(fi->type != ARMFault_None);
             fi->s2addr = addr;
-- 
2.20.1

For ARMv8.2-TTS2UXN, the stage 2 page table walk wants to know
whether the stage 1 access is for EL0 or not, because whether
exec permission is given can depend on whether this is an EL0
or EL1 access. Add a new argument to get_phys_addr_lpae() so
the call sites can pass this information in.

Since get_phys_addr_lpae() doesn't already have a doc comment,
add one so we have a place to put the documentation of the
semantics of the new s1_is_el0 argument.

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20200330210400.11724-4-peter.maydell@linaro.org
---
 target/arm/helper.c | 29 ++++++++++++++++++++++++++++-
 1 file changed, 28 insertions(+), 1 deletion(-)

diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@
 
 static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
                                MMUAccessType access_type, ARMMMUIdx mmu_idx,
+                               bool s1_is_el0,
                                hwaddr *phys_ptr, MemTxAttrs *txattrs, int *prot,
                                target_ulong *page_size_ptr,
                                ARMMMUFaultInfo *fi, ARMCacheAttrs *cacheattrs);
@@ -XXX,XX +XXX,XX @@ static hwaddr S1_ptw_translate(CPUARMState *env, ARMMMUIdx mmu_idx,
         }
 
         ret = get_phys_addr_lpae(env, addr, MMU_DATA_LOAD, ARMMMUIdx_Stage2,
+                                 false,
                                  &s2pa, &txattrs, &s2prot, &s2size, fi,
                                  pcacheattrs);
         if (ret) {
@@ -XXX,XX +XXX,XX @@ static ARMVAParameters aa32_va_parameters(CPUARMState *env, uint32_t va,
     };
 }
 
+/**
+ * get_phys_addr_lpae: perform one stage of page table walk, LPAE format
+ *
+ * Returns false if the translation was successful. Otherwise, phys_ptr, attrs,
+ * prot and page_size may not be filled in, and the populated fsr value provides
+ * information on why the translation aborted, in the format of a long-format
+ * DFSR/IFSR fault register, with the following caveats:
+ *  * the WnR bit is never set (the caller must do this).
+ *
+ * @env: CPUARMState
+ * @address: virtual address to get physical address for
+ * @access_type: MMU_DATA_LOAD, MMU_DATA_STORE or MMU_INST_FETCH
+ * @mmu_idx: MMU index indicating required translation regime
+ * @s1_is_el0: if @mmu_idx is ARMMMUIdx_Stage2 (so this is a stage 2 page table
+ *             walk), must be true if this is stage 2 of a stage 1+2 walk for an
+ *             EL0 access). If @mmu_idx is anything else, @s1_is_el0 is ignored.
+ * @phys_ptr: set to the physical address corresponding to the virtual address
+ * @attrs: set to the memory transaction attributes to use
+ * @prot: set to the permissions for the page containing phys_ptr
+ * @page_size_ptr: set to the size of the page containing phys_ptr
+ * @fi: set to fault info if the translation fails
+ * @cacheattrs: (if non-NULL) set to the cacheability/shareability attributes
+ */
 static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
                                MMUAccessType access_type, ARMMMUIdx mmu_idx,
+                               bool s1_is_el0,
                                hwaddr *phys_ptr, MemTxAttrs *txattrs, int *prot,
                                target_ulong *page_size_ptr,
                                ARMMMUFaultInfo *fi, ARMCacheAttrs *cacheattrs)
@@ -XXX,XX +XXX,XX @@ bool get_phys_addr(CPUARMState *env, target_ulong address,
 
             /* S1 is done. Now do S2 translation.  */
             ret = get_phys_addr_lpae(env, ipa, access_type, ARMMMUIdx_Stage2,
+                                     mmu_idx == ARMMMUIdx_E10_0,
                                      phys_ptr, attrs, &s2_prot,
                                      page_size, fi,
                                      cacheattrs != NULL ? &cacheattrs2 : NULL);
@@ -XXX,XX +XXX,XX @@ bool get_phys_addr(CPUARMState *env, target_ulong address,
     }
 
     if (regime_using_lpae_format(env, mmu_idx)) {
-        return get_phys_addr_lpae(env, address, access_type, mmu_idx,
+        return get_phys_addr_lpae(env, address, access_type, mmu_idx, false,
                                   phys_ptr, attrs, prot, page_size,
                                   fi, cacheattrs);
     } else if (regime_sctlr(env, mmu_idx) & SCTLR_XP) {
-- 
2.20.1

The ARMv8.2-TTS2UXN feature extends the XN field in stage 2
translation table descriptors from just bit [54] to bits [54:53],
allowing stage 2 to control execution permissions separately for EL0
and EL1. Implement the new semantics of the XN field and enable
the feature for our 'max' CPU.

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20200330210400.11724-5-peter.maydell@linaro.org
---
 target/arm/cpu.h    | 15 +++++++++++++++
 target/arm/cpu.c    |  1 +
 target/arm/cpu64.c  |  2 ++
 target/arm/helper.c | 37 +++++++++++++++++++++++++++++++------
 4 files changed, 49 insertions(+), 6 deletions(-)

diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa32_ccidx(const ARMISARegisters *id)
     return FIELD_EX32(id->id_mmfr4, ID_MMFR4, CCIDX) != 0;
 }
 
+static inline bool isar_feature_aa32_tts2uxn(const ARMISARegisters *id)
+{
+    return FIELD_EX32(id->id_mmfr4, ID_MMFR4, XNX) != 0;
+}
+
 /*
  * 64-bit feature tests via id registers.
  */
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa64_ccidx(const ARMISARegisters *id)
     return FIELD_EX64(id->id_aa64mmfr2, ID_AA64MMFR2, CCIDX) != 0;
 }
 
+static inline bool isar_feature_aa64_tts2uxn(const ARMISARegisters *id)
+{
+    return FIELD_EX64(id->id_aa64mmfr1, ID_AA64MMFR1, XNX) != 0;
+}
+
 /*
  * Feature tests for "does this exist in either 32-bit or 64-bit?"
  */
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_any_ccidx(const ARMISARegisters *id)
     return isar_feature_aa64_ccidx(id) || isar_feature_aa32_ccidx(id);
 }
 
+static inline bool isar_feature_any_tts2uxn(const ARMISARegisters *id)
+{
+    return isar_feature_aa64_tts2uxn(id) || isar_feature_aa32_tts2uxn(id);
+}
+
 /*
  * Forward to the above feature tests given an ARMCPU pointer.
  */
diff --git a/target/arm/cpu.c b/target/arm/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.c
+++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static void arm_max_initfn(Object *obj)
             t = FIELD_DP32(t, ID_MMFR4, HPDS, 1); /* AA32HPD */
             t = FIELD_DP32(t, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
             t = FIELD_DP32(t, ID_MMFR4, CNP, 1); /* TTCNP */
+            t = FIELD_DP32(t, ID_MMFR4, XNX, 1); /* TTS2UXN */
             cpu->isar.id_mmfr4 = t;
         }
 #endif
diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu64.c
+++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
         t = FIELD_DP64(t, ID_AA64MMFR1, VH, 1);
         t = FIELD_DP64(t, ID_AA64MMFR1, PAN, 2); /* ATS1E1 */
         t = FIELD_DP64(t, ID_AA64MMFR1, VMIDBITS, 2); /* VMID16 */
+        t = FIELD_DP64(t, ID_AA64MMFR1, XNX, 1); /* TTS2UXN */
         cpu->isar.id_aa64mmfr1 = t;
 
         t = cpu->isar.id_aa64mmfr2;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
         u = FIELD_DP32(u, ID_MMFR4, HPDS, 1); /* AA32HPD */
         u = FIELD_DP32(u, ID_MMFR4, AC2, 1); /* ACTLR2, HACTLR2 */
         u = FIELD_DP32(u, ID_MMFR4, CNP, 1); /* TTCNP */
+        u = FIELD_DP32(u, ID_MMFR4, XNX, 1); /* TTS2UXN */
         cpu->isar.id_mmfr4 = u;
 
         u = cpu->isar.id_aa64dfr0;
diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ simple_ap_to_rw_prot(CPUARMState *env, ARMMMUIdx mmu_idx, int ap)
  *
  * @env:     CPUARMState
  * @s2ap:    The 2-bit stage2 access permissions (S2AP)
- * @xn:      XN (execute-never) bit
+ * @xn:      XN (execute-never) bits
+ * @s1_is_el0: true if this is S2 of an S1+2 walk for EL0
  */
-static int get_S2prot(CPUARMState *env, int s2ap, int xn)
+static int get_S2prot(CPUARMState *env, int s2ap, int xn, bool s1_is_el0)
 {
     int prot = 0;
 
@@ -XXX,XX +XXX,XX @@ static int get_S2prot(CPUARMState *env, int s2ap, int xn)
     if (s2ap & 2) {
         prot |= PAGE_WRITE;
     }
-    if (!xn) {
-        if (arm_el_is_aa64(env, 2) || prot & PAGE_READ) {
+
+    if (cpu_isar_feature(any_tts2uxn, env_archcpu(env))) {
+        switch (xn) {
+        case 0:
             prot |= PAGE_EXEC;
+            break;
+        case 1:
+            if (s1_is_el0) {
+                prot |= PAGE_EXEC;
+            }
+            break;
+        case 2:
+            break;
+        case 3:
+            if (!s1_is_el0) {
+                prot |= PAGE_EXEC;
+            }
+            break;
+        default:
+            g_assert_not_reached();
+        }
+    } else {
+        if (!extract32(xn, 1, 1)) {
+            if (arm_el_is_aa64(env, 2) || prot & PAGE_READ) {
+                prot |= PAGE_EXEC;
+            }
         }
     }
     return prot;
@@ -XXX,XX +XXX,XX @@ static bool get_phys_addr_lpae(CPUARMState *env, target_ulong address,
     }
 
     ap = extract32(attrs, 4, 2);
-    xn = extract32(attrs, 12, 1);
 
     if (mmu_idx == ARMMMUIdx_Stage2) {
         ns = true;
-        *prot = get_S2prot(env, ap, xn);
+        xn = extract32(attrs, 11, 2);
+        *prot = get_S2prot(env, ap, xn, s1_is_el0);
     } else {
         ns = extract32(attrs, 3, 1);
+        xn = extract32(attrs, 12, 1);
         pxn = extract32(attrs, 11, 1);
         *prot = get_S1prot(env, mmu_idx, aarch64, ap, ns, xn, pxn);
     }
-- 
2.20.1

In aarch64_max_initfn() we update both 32-bit and 64-bit ID
registers.  The intended pattern is that for 64-bit ID registers we
use FIELD_DP64 and the uint64_t 't' register, while 32-bit ID
registers use FIELD_DP32 and the uint32_t 'u' register.  For
ID_AA64DFR0 we accidentally used 'u', meaning that the top 32 bits of
this 64-bit ID register would end up always zero.  Luckily at the
moment that's what they should be anyway, so this bug has no visible
effects.

Use the right-sized variable.

Fixes: 3bec78447a958d481991
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Message-id: 20200423110915.10527-1-peter.maydell@linaro.org
---
 target/arm/cpu64.c | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu64.c
+++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
         u = FIELD_DP32(u, ID_MMFR4, XNX, 1); /* TTS2UXN */
         cpu->isar.id_mmfr4 = u;
 
-        u = cpu->isar.id_aa64dfr0;
-        u = FIELD_DP64(u, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
-        cpu->isar.id_aa64dfr0 = u;
+        t = cpu->isar.id_aa64dfr0;
+        t = FIELD_DP64(t, ID_AA64DFR0, PMUVER, 5); /* v8.4-PMU */
+        cpu->isar.id_aa64dfr0 = t;
 
         u = cpu->isar.id_dfr0;
         u = FIELD_DP32(u, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
-- 
2.20.1

From: Philippe Mathieu-Daudé <f4bug@amsat.org>

MIDR_EL1 is a 64-bit system register with the top 32-bit being RES0.
Represent it in QEMU's ARMCPU struct with a uint64_t, not a
uint32_t.

This fixes an error when compiling with -Werror=conversion
because we were manipulating the register value using a
local uint64_t variable:

target/arm/cpu64.c: In function ‘aarch64_max_initfn’:
  target/arm/cpu64.c:628:21: error: conversion from ‘uint64_t’ {aka ‘long unsigned int’} to ‘uint32_t’ {aka ‘unsigned int’} may change value [-Werror=conversion]
    628 |         cpu->midr = t;
        |                     ^

and future-proofs us against a possible future architecture
change using some of the top 32 bits.

Suggested-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Suggested-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Message-id: 20200428172634.29707-1-f4bug@amsat.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/cpu.h | 2 +-
 target/arm/cpu.c | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
         uint64_t id_aa64dfr0;
         uint64_t id_aa64dfr1;
     } isar;
-    uint32_t midr;
+    uint64_t midr;
     uint32_t revidr;
     uint32_t reset_fpsid;
     uint32_t ctr;
diff --git a/target/arm/cpu.c b/target/arm/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.c
+++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static const ARMCPUInfo arm_cpus[] = {
 static Property arm_cpu_properties[] = {
     DEFINE_PROP_BOOL("start-powered-off", ARMCPU, start_powered_off, false),
     DEFINE_PROP_UINT32("psci-conduit", ARMCPU, psci_conduit, 0),
-    DEFINE_PROP_UINT32("midr", ARMCPU, midr, 0),
+    DEFINE_PROP_UINT64("midr", ARMCPU, midr, 0),
     DEFINE_PROP_UINT64("mp-affinity", ARMCPU,
                         mp_affinity, ARM64_AFFINITY_INVALID),
     DEFINE_PROP_INT32("node-id", ARMCPU, node_id, CPU_UNSET_NUMA_NODE_ID),
-- 
2.20.1