Series comparison

-[PULL 00/25] target-arm queue
+[PULL 00/21] target-arm queue
-target-arm queue, mostly SME preliminaries.
+Hi; here's a target-arm pullreq to go in before softfreeze.
 This is actually pretty much entirely bugfixes (since the
 SEL2 timers we implement here are a missing part of a feature
 we claim to already implement).
-In the unlikely event we don't land the rest of SME before freeze
+thanks
 for 7.1 we can revert the docs/property changes included here.
 -- PMM
-The following changes since commit 097ccbbbaf2681df1e65542e5b7d2b2d0c66e2bc:
+The following changes since commit 98c7362b1efe651327385a25874a73e008c6549e:
-  Merge tag 'qemu-sparc-20220626' of https://github.com/mcayland/qemu into staging (2022-06-27 05:21:05 +0530)
+  Merge tag 'accel-cpus-20250306' of https://github.com/philmd/qemu into staging (2025-03-07 07:39:49 +0800)
 are available in the Git repository at:
-  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20220627
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20250307
-for you to fetch changes up to 59e1b8a22ea9f947d038ccac784de1020f266e14:
+for you to fetch changes up to 0ce0739d46983e5e88fa9c149cb305689c9d8c6f:
-  target/arm: Check V7VE as well as LPAE in arm_pamax (2022-06-27 11:18:17 +0100)
+  target/rx: Remove TCG_CALL_NO_WG from helpers which write env (2025-03-07 15:03:20 +0000)
 ----------------------------------------------------------------
 target-arm queue:
- * sphinx: change default language to 'en'
+ * hw/arm/smmu-common: Remove the repeated ttb field
- * Diagnose attempts to emulate EL3 in hvf as well as kvm
+ * hw/gpio: npcm7xx: fixup out-of-bounds access
- * More SME groundwork patches
+ * tests/functional/test_arm_sx1: Check whether the serial console is working
- * virt: Fix calculation of physical address space size
+ * target/arm: Fix minor bugs in generic timer register handling
-   for v7VE CPUs (eg cortex-a15)
+ * target/arm: Implement SEL2 physical and virtual timers
  * target/arm: Correct STRD, LDRD atomicity and fault behaviour
  * target/arm: Make dummy debug registers RAZ, not NOP
  * util/qemu-timer.c: Don't warp timer from timerlist_rearm()
  * include/exec/memop.h: Expand comment for MO_ATOM_SUBALIGN
  * hw/arm/smmu: Introduce smmu_configs_inv_sid_range() helper
  * target/rx: Set exception vector base to 0xffffff80
  * target/rx: Remove TCG_CALL_NO_WG from helpers which write env
 ----------------------------------------------------------------
-Alexander Graf (2):
+Alex Bennée (4):
-      accel: Introduce current_accel_name()
+      target/arm: Implement SEL2 physical and virtual timers
-      target/arm: Catch invalid kvm state also for hvf
+      target/arm: Document the architectural names of our GTIMERs
       hw/arm: enable secure EL2 timers for virt machine
       hw/arm: enable secure EL2 timers for sbsa machine
-Martin Liška (1):
+JianChunfu (2):
-      sphinx: change default language to 'en'
+      hw/arm/smmu-common: Remove the repeated ttb field
       hw/arm/smmu: Introduce smmu_configs_inv_sid_range() helper
-Richard Henderson (22):
+Keith Packard (2):
-      target/arm: Implement TPIDR2_EL0
+      target/rx: Set exception vector base to 0xffffff80
-      target/arm: Add SMEEXC_EL to TB flags
+      target/rx: Remove TCG_CALL_NO_WG from helpers which write env
       target/arm: Add syn_smetrap
       target/arm: Add ARM_CP_SME
       target/arm: Add SVCR
       target/arm: Add SMCR_ELx
       target/arm: Add SMIDR_EL1, SMPRI_EL1, SMPRIMAP_EL2
       target/arm: Add PSTATE.{SM,ZA} to TB flags
       target/arm: Add the SME ZA storage to CPUARMState
       target/arm: Implement SMSTART, SMSTOP
       target/arm: Move error for sve%d property to arm_cpu_sve_finalize
       target/arm: Create ARMVQMap
       target/arm: Generalize cpu_arm_{get,set}_vq
       target/arm: Generalize cpu_arm_{get, set}_default_vec_len
       target/arm: Move arm_cpu_*_finalize to internals.h
       target/arm: Unexport aarch64_add_*_properties
       target/arm: Add cpu properties for SME
       target/arm: Introduce sve_vqm1_for_el_sm
       target/arm: Add SVL to TB flags
       target/arm: Move pred_{full, gvec}_reg_{offset, size} to translate-a64.h
       target/arm: Extend arm_pamax to more than aarch64
       target/arm: Check V7VE as well as LPAE in arm_pamax
- docs/conf.py                     |   2 +-
+Patrick Venture (1):
- docs/system/arm/cpu-features.rst |  56 ++++++++++
+      hw/gpio: npcm7xx: fixup out-of-bounds access
  include/qemu/accel.h             |   1 +
  target/arm/cpregs.h              |   5 +
  target/arm/cpu.h                 | 103 ++++++++++++++-----
  target/arm/helper-sme.h          |  21 ++++
  target/arm/helper.h              |   1 +
  target/arm/internals.h           |   4 +
  target/arm/syndrome.h            |  14 +++
  target/arm/translate-a64.h       |  38 +++++++
  target/arm/translate.h           |   6 ++
  accel/accel-common.c             |   8 ++
  hw/arm/virt.c                    |  10 +-
  softmmu/vl.c                     |   3 +-
  target/arm/cpu.c                 |  32 ++++--
  target/arm/cpu64.c               | 205 ++++++++++++++++++++++++++++---------
  target/arm/helper.c              | 213 +++++++++++++++++++++++++++++++++++++--
  target/arm/kvm64.c               |   2 +-
  target/arm/machine.c             |  34 +++++++
  target/arm/ptw.c                 |  26 +++--
  target/arm/sme_helper.c          |  61 +++++++++++
  target/arm/translate-a64.c       |  46 +++++++++
  target/arm/translate-sve.c       |  36 -------
  target/arm/meson.build           |   1 +
 files changed, 782 insertions(+), 146 deletions(-)
  create mode 100644 target/arm/helper-sme.h
  create mode 100644 target/arm/sme_helper.c
+Peter Maydell (11):
+      target/arm: Apply correct timer offset when calculating deadlines
+      target/arm: Don't apply CNTVOFF_EL2 for EL2_VIRT timer
+      target/arm: Make CNTPS_* UNDEF from Secure EL1 when Secure EL2 is enabled
+      target/arm: Always apply CNTVOFF_EL2 for CNTV_TVAL_EL02 accesses
+      target/arm: Refactor handling of timer offset for direct register accesses
+      target/arm: Correct LDRD atomicity and fault behaviour
+      target/arm: Correct STRD atomicity
+      target/arm: Drop unused address_offset from op_addr_{rr, ri}_post()
+      target/arm: Make dummy debug registers RAZ, not NOP
+      util/qemu-timer.c: Don't warp timer from timerlist_rearm()
+      include/exec/memop.h: Expand comment for MO_ATOM_SUBALIGN
+Thomas Huth (1):
+      tests/functional/test_arm_sx1: Check whether the serial console is working
+ MAINTAINERS                      |   1 +
+ hw/arm/smmu-internal.h           |   5 -
+ include/exec/memop.h             |   8 +-
+ include/hw/arm/bsa.h             |   2 +
+ include/hw/arm/smmu-common.h     |   7 +-
+ target/arm/cpu.h                 |   2 +
+ target/arm/gtimer.h              |  14 +-
+ target/arm/internals.h           |   5 +-
+ target/rx/helper.h               |  34 ++--
+ hw/arm/sbsa-ref.c                |   2 +
+ hw/arm/smmu-common.c             |  21 +++
+ hw/arm/smmuv3.c                  |  19 +--
+ hw/arm/virt.c                    |   2 +
+ hw/gpio/npcm7xx_gpio.c           |   3 +-
+ target/arm/cpu.c                 |   4 +
+ target/arm/debug_helper.c        |   7 +-
+ target/arm/helper.c              | 324 ++++++++++++++++++++++++++++++++-------
+ target/arm/tcg/op_helper.c       |   8 +-
+ target/arm/tcg/translate.c       | 147 +++++++++++-------
+ target/rx/helper.c               |   2 +-
+ util/qemu-timer.c                |   4 -
+ hw/arm/trace-events              |   3 +-
+ tests/functional/test_arm_sx1.py |   7 +-
+files changed, 455 insertions(+), 176 deletions(-)

-[PULL 23/25] target/arm: Move pred_{full, gvec}_reg_{offset, size} to translate-a64.h
+[PULL 01/21] hw/arm/smmu-common: Remove the repeated ttb field
-From: Richard Henderson <richard.henderson@linaro.org>
+From: JianChunfu <jansef.jian@hj-micro.com>
-We will need these functions in translate-sme.c.
+SMMUTransCfg->ttb is never used in QEMU, TT base address
 can be accessed by SMMUTransCfg->tt[i]->ttb.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Signed-off-by: JianChunfu <jansef.jian@hj-micro.com>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 20220620175235.60881-21-richard.henderson@linaro.org
+Message-id: 20250221031034.69822-1-jansef.jian@hj-micro.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.h | 38 ++++++++++++++++++++++++++++++++++++++
+ include/hw/arm/smmu-common.h | 1 -
- target/arm/translate-sve.c | 36 ------------------------------------
+file changed, 1 deletion(-)
 files changed, 38 insertions(+), 36 deletions(-)
-diff --git a/target/arm/translate-a64.h b/target/arm/translate-a64.h
+diff --git a/include/hw/arm/smmu-common.h b/include/hw/arm/smmu-common.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.h
+--- a/include/hw/arm/smmu-common.h
-+++ b/target/arm/translate-a64.h
++++ b/include/hw/arm/smmu-common.h
-@@ -XXX,XX +XXX,XX @@ static inline int vec_full_reg_size(DisasContext *s)
+@@ -XXX,XX +XXX,XX @@ typedef struct SMMUTransCfg {
-     return s->vl;
+     /* Used by stage-1 only. */
- }
+     bool aa64;                 /* arch64 or aarch32 translation table */
+     bool record_faults;        /* record fault events */
-+/*
+-    uint64_t ttb;              /* TT base address */
-+ * Return the offset info CPUARMState of the predicate vector register Pn.
+     uint8_t oas;               /* output address width */
-+ * Note for this purpose, FFR is P16.
+     uint8_t tbi;               /* Top Byte Ignore */
-+ */
+     int asid;
 +static inline int pred_full_reg_offset(DisasContext *s, int regno)
 +{
 +    return offsetof(CPUARMState, vfp.pregs[regno]);
 +}
 +
 +/* Return the byte size of the whole predicate register, VL / 64.  */
 +static inline int pred_full_reg_size(DisasContext *s)
 +{
 +    return s->vl >> 3;
 +}
 +
 +/*
 + * Round up the size of a register to a size allowed by
 + * the tcg vector infrastructure.  Any operation which uses this
 + * size may assume that the bits above pred_full_reg_size are zero,
 + * and must leave them the same way.
 + *
 + * Note that this is not needed for the vector registers as they
 + * are always properly sized for tcg vectors.
 + */
 +static inline int size_for_gvec(int size)
 +{
 +    if (size <= 8) {
 +        return 8;
 +    } else {
 +        return QEMU_ALIGN_UP(size, 16);
 +    }
 +}
 +
 +static inline int pred_gvec_reg_size(DisasContext *s)
 +{
 +    return size_for_gvec(pred_full_reg_size(s));
 +}
 +
  bool disas_sve(DisasContext *, uint32_t);
  void gen_gvec_rax1(unsigned vece, uint32_t rd_ofs, uint32_t rn_ofs,
 diff --git a/target/arm/translate-sve.c b/target/arm/translate-sve.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-sve.c
 +++ b/target/arm/translate-sve.c
@@ -XXX,XX +XXX,XX @@ static inline int msz_dtype(DisasContext *s, int msz)
   * Implement all of the translator functions referenced by the decoder.
   */
 -/* Return the offset info CPUARMState of the predicate vector register Pn.
 - * Note for this purpose, FFR is P16.
 - */
 -static inline int pred_full_reg_offset(DisasContext *s, int regno)
 -{
 -    return offsetof(CPUARMState, vfp.pregs[regno]);
 -}
 -
 -/* Return the byte size of the whole predicate register, VL / 64.  */
 -static inline int pred_full_reg_size(DisasContext *s)
 -{
 -    return s->vl >> 3;
 -}
 -
 -/* Round up the size of a register to a size allowed by
 - * the tcg vector infrastructure.  Any operation which uses this
 - * size may assume that the bits above pred_full_reg_size are zero,
 - * and must leave them the same way.
 - *
 - * Note that this is not needed for the vector registers as they
 - * are always properly sized for tcg vectors.
 - */
 -static int size_for_gvec(int size)
 -{
 -    if (size <= 8) {
 -        return 8;
 -    } else {
 -        return QEMU_ALIGN_UP(size, 16);
 -    }
 -}
 -
 -static int pred_gvec_reg_size(DisasContext *s)
 -{
 -    return size_for_gvec(pred_full_reg_size(s));
 -}
 -
  /* Invoke an out-of-line helper on 2 Zregs. */
  static bool gen_gvec_ool_zz(DisasContext *s, gen_helper_gvec_2 *fn,
                              int rd, int rn, int data)
 --
-.25.1
+.43.0

-[PULL 17/25] target/arm: Generalize cpu_arm_{get, set}_default_vec_len
+[PULL 02/21] hw/gpio: npcm7xx: fixup out-of-bounds access
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Patrick Venture <venture@google.com>
-Rename from cpu_arm_{get,set}_sve_default_vec_len,
+The reg isn't validated to be a possible register before
-and take the pointer to default_vq from opaque.
+it's dereferenced for one case.  The mmio space registered
 for the gpio device is 4KiB but there aren't that many
 registers in the struct.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Cc: qemu-stable@nongnu.org
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Fixes: 526dbbe0874 ("hw/gpio: Add GPIO model for Nuvoton NPCM7xx")
-Message-id: 20220620175235.60881-15-richard.henderson@linaro.org
+Signed-off-by: Patrick Venture <venture@google.com>
 Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Message-id: 20250226024603.493148-1-venture@google.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu64.c | 27 ++++++++++++++-------------
+ hw/gpio/npcm7xx_gpio.c | 3 +--
-file changed, 14 insertions(+), 13 deletions(-)
+file changed, 1 insertion(+), 2 deletions(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+diff --git a/hw/gpio/npcm7xx_gpio.c b/hw/gpio/npcm7xx_gpio.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/hw/gpio/npcm7xx_gpio.c
-+++ b/target/arm/cpu64.c
++++ b/hw/gpio/npcm7xx_gpio.c
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, bool value, Error **errp)
+@@ -XXX,XX +XXX,XX @@ static void npcm7xx_gpio_regs_write(void *opaque, hwaddr addr, uint64_t v,
  #ifdef CONFIG_USER_ONLY
  /* Mirror linux /proc/sys/abi/sve_default_vector_length. */
 -static void cpu_arm_set_sve_default_vec_len(Object *obj, Visitor *v,
 -                                            const char *name, void *opaque,
 -                                            Error **errp)
 +static void cpu_arm_set_default_vec_len(Object *obj, Visitor *v,
 +                                        const char *name, void *opaque,
 +                                        Error **errp)
  {
 -    ARMCPU *cpu = ARM_CPU(obj);
 +    uint32_t *ptr_default_vq = opaque;
      int32_t default_len, default_vq, remainder;
      if (!visit_type_int32(v, name, &default_len, errp)) {
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_default_vec_len(Object *obj, Visitor *v,
      /* Undocumented, but the kernel allows -1 to indicate "maximum". */
      if (default_len == -1) {
 -        cpu->sve_default_vq = ARM_MAX_VQ;
 +        *ptr_default_vq = ARM_MAX_VQ;
          return;
      }
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_default_vec_len(Object *obj, Visitor *v,
+-    diff = s->regs[reg] ^ value;
-         return;
+-
-     }
+     switch (reg) {
+     case NPCM7XX_GPIO_TLOCK1:
--    cpu->sve_default_vq = default_vq;
+     case NPCM7XX_GPIO_TLOCK2:
-+    *ptr_default_vq = default_vq;
+@@ -XXX,XX +XXX,XX @@ static void npcm7xx_gpio_regs_write(void *opaque, hwaddr addr, uint64_t v,
- }
+     case NPCM7XX_GPIO_PU:
+     case NPCM7XX_GPIO_PD:
--static void cpu_arm_get_sve_default_vec_len(Object *obj, Visitor *v,
+     case NPCM7XX_GPIO_IEM:
--                                            const char *name, void *opaque,
++        diff = s->regs[reg] ^ value;
--                                            Error **errp)
+         s->regs[reg] = value;
-+static void cpu_arm_get_default_vec_len(Object *obj, Visitor *v,
+         npcm7xx_gpio_update_pins(s, diff);
-+                                        const char *name, void *opaque,
+         break;
 +                                        Error **errp)
  {
 -    ARMCPU *cpu = ARM_CPU(obj);
 -    int32_t value = cpu->sve_default_vq * 16;
 +    uint32_t *ptr_default_vq = opaque;
 +    int32_t value = *ptr_default_vq * 16;
      visit_type_int32(v, name, &value, errp);
  }
@@ -XXX,XX +XXX,XX @@ void aarch64_add_sve_properties(Object *obj)
  #ifdef CONFIG_USER_ONLY
      /* Mirror linux /proc/sys/abi/sve_default_vector_length. */
      object_property_add(obj, "sve-default-vector-length", "int32",
 -                        cpu_arm_get_sve_default_vec_len,
 -                        cpu_arm_set_sve_default_vec_len, NULL, NULL);
 +                        cpu_arm_get_default_vec_len,
 +                        cpu_arm_set_default_vec_len, NULL,
 +                        &cpu->sve_default_vq);
  #endif
  }
 --
-.25.1
+.43.0

-[PULL 22/25] target/arm: Add SVL to TB flags
+[PULL 03/21] tests/functional/test_arm_sx1: Check whether the serial console is working
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Thomas Huth <thuth@redhat.com>
-We need SVL separate from VL for RDSVL et al, as well as
+The kernel that is used in the sx1 test prints the usual Linux log
-ZA storage loads and stores, which do not require PSTATE.SM.
+onto the serial console, but this test currently ignores it. To
 make sure that the serial device is working properly, let's check
 for some strings in the output here.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+While we're at it, also add the test to the corresponding section
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+in the MAINTAINERS file.
-Message-id: 20220620175235.60881-20-richard.henderson@linaro.org
 Signed-off-by: Thomas Huth <thuth@redhat.com>
 Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Message-id: 20250226104833.1176253-1-thuth@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h           | 12 ++++++++++++
+ MAINTAINERS                      | 1 +
- target/arm/translate.h     |  1 +
+ tests/functional/test_arm_sx1.py | 7 ++++---
- target/arm/helper.c        |  8 +++++++-
+files changed, 5 insertions(+), 3 deletions(-)
  target/arm/translate-a64.c |  1 +
 files changed, 21 insertions(+), 1 deletion(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
+diff --git a/MAINTAINERS b/MAINTAINERS
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/MAINTAINERS
-+++ b/target/arm/cpu.h
++++ b/MAINTAINERS
-@@ -XXX,XX +XXX,XX @@ FIELD(TBFLAG_A64, MTE0_ACTIVE, 19, 1)
+@@ -XXX,XX +XXX,XX @@ S: Maintained
- FIELD(TBFLAG_A64, SMEEXC_EL, 20, 2)
+ F: hw/*/omap*
- FIELD(TBFLAG_A64, PSTATE_SM, 22, 1)
+ F: include/hw/arm/omap.h
- FIELD(TBFLAG_A64, PSTATE_ZA, 23, 1)
+ F: docs/system/arm/sx1.rst
-+FIELD(TBFLAG_A64, SVL, 24, 4)
++F: tests/functional/test_arm_sx1.py
- /*
+ IPack
-  * Helpers for using the above.
+ M: Alberto Garcia <berto@igalia.com>
-@@ -XXX,XX +XXX,XX @@ static inline int sve_vq(CPUARMState *env)
+diff --git a/tests/functional/test_arm_sx1.py b/tests/functional/test_arm_sx1.py
-     return EX_TBFLAG_A64(env->hflags, VL) + 1;
+index XXXXXXX..XXXXXXX 100755
- }
+--- a/tests/functional/test_arm_sx1.py
++++ b/tests/functional/test_arm_sx1.py
-+/**
+@@ -XXX,XX +XXX,XX @@ def test_arm_sx1_initrd(self):
-+ * sme_vq
+         self.vm.add_args('-append', f'kunit.enable=0 rdinit=/sbin/init {self.CONSOLE_ARGS}')
-+ * @env: the cpu context
+         self.vm.add_args('-no-reboot')
-+ *
+         self.launch_kernel(zimage_path,
-+ * Return the SVL cached within env->hflags, in units of quadwords.
+-                           initrd=initrd_path)
-+ */
++                           initrd=initrd_path,
-+static inline int sme_vq(CPUARMState *env)
++                           wait_for='Boot successful')
-+{
+         self.vm.wait(timeout=120)
-+    return EX_TBFLAG_A64(env->hflags, SVL) + 1;
-+}
+     def test_arm_sx1_sd(self):
-+
+@@ -XXX,XX +XXX,XX @@ def test_arm_sx1_sd(self):
- static inline bool bswap_code(bool sctlr_b)
+         self.vm.add_args('-no-reboot')
- {
+         self.vm.add_args('-snapshot')
- #ifdef CONFIG_USER_ONLY
+         self.vm.add_args('-drive', f'format=raw,if=sd,file={sd_fs_path}')
-diff --git a/target/arm/translate.h b/target/arm/translate.h
+-        self.launch_kernel(zimage_path)
-index XXXXXXX..XXXXXXX 100644
++        self.launch_kernel(zimage_path, wait_for='Boot successful')
---- a/target/arm/translate.h
+         self.vm.wait(timeout=120)
-+++ b/target/arm/translate.h
-@@ -XXX,XX +XXX,XX @@ typedef struct DisasContext {
+     def test_arm_sx1_flash(self):
-     int sve_excp_el; /* SVE exception EL or 0 if enabled */
+@@ -XXX,XX +XXX,XX @@ def test_arm_sx1_flash(self):
-     int sme_excp_el; /* SME exception EL or 0 if enabled */
+         self.vm.add_args('-no-reboot')
-     int vl;          /* current vector length in bytes */
+         self.vm.add_args('-snapshot')
-+    int svl;         /* current streaming vector length in bytes */
+         self.vm.add_args('-drive', f'format=raw,if=pflash,file={flash_path}')
-     bool vfp_enabled; /* FP enabled via FPSCR.EN */
+-        self.launch_kernel(zimage_path)
-     int vec_len;
++        self.launch_kernel(zimage_path, wait_for='Boot successful')
-     int vec_stride;
+         self.vm.wait(timeout=120)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
+ if __name__ == '__main__':
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static CPUARMTBFlags rebuild_hflags_a64(CPUARMState *env, int el, int fp_el,
          DP_TBFLAG_A64(flags, SVEEXC_EL, sve_el);
      }
      if (cpu_isar_feature(aa64_sme, env_archcpu(env))) {
 -        DP_TBFLAG_A64(flags, SMEEXC_EL, sme_exception_el(env, el));
 +        int sme_el = sme_exception_el(env, el);
 +
 +        DP_TBFLAG_A64(flags, SMEEXC_EL, sme_el);
 +        if (sme_el == 0) {
 +            /* Similarly, do not compute SVL if SME is disabled. */
 +            DP_TBFLAG_A64(flags, SVL, sve_vqm1_for_el_sm(env, el, true));
 +        }
          if (FIELD_EX64(env->svcr, SVCR, SM)) {
              DP_TBFLAG_A64(flags, PSTATE_SM, 1);
          }
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_tr_init_disas_context(DisasContextBase *dcbase,
      dc->sve_excp_el = EX_TBFLAG_A64(tb_flags, SVEEXC_EL);
      dc->sme_excp_el = EX_TBFLAG_A64(tb_flags, SMEEXC_EL);
      dc->vl = (EX_TBFLAG_A64(tb_flags, VL) + 1) * 16;
 +    dc->svl = (EX_TBFLAG_A64(tb_flags, SVL) + 1) * 16;
      dc->pauth_active = EX_TBFLAG_A64(tb_flags, PAUTH_ACTIVE);
      dc->bt = EX_TBFLAG_A64(tb_flags, BT);
      dc->btype = EX_TBFLAG_A64(tb_flags, BTYPE);
 --
-.25.1
+.43.0

-[PULL 09/25] target/arm: Add SMCR_ELx
+[PULL 04/21] target/arm: Apply correct timer offset when calculating deadlines
-From: Richard Henderson <richard.henderson@linaro.org>
+When we are calculating timer deadlines, the correct definition of
 whether or not to apply an offset to the physical count is described
 in the Arm ARM DDI4087 rev L.a section D12.2.4.1.  This is different
 from when the offset should be applied for a direct read of the
 counter sysreg.
-These cpregs control the streaming vector length and whether the
+We got this right for the EL1 physical timer and for the EL1 virtual
-full a64 instruction set is allowed while in streaming mode.
+timer, but got all the rest wrong: they should be using a zero offset
 always.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Factor the offset calculation out into a function that has a comment
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+documenting exactly which offset it is calculating and which gets the
-Message-id: 20220620175235.60881-7-richard.henderson@linaro.org
+HYP, SEC, and HYPVIRT cases right.
 Cc: qemu-stable@nongnu.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 20250204125009.2281315-2-peter.maydell@linaro.org
 ---
- target/arm/cpu.h    |  8 ++++++--
+ target/arm/helper.c | 29 +++++++++++++++++++++++++++--
- target/arm/helper.c | 41 +++++++++++++++++++++++++++++++++++++++++
+file changed, 27 insertions(+), 2 deletions(-)
 files changed, 47 insertions(+), 2 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-         float_status standard_fp_status;
-         float_status standard_fp_status_f16;
--        /* ZCR_EL[1-3] */
--        uint64_t zcr_el[4];
-+        uint64_t zcr_el[4];   /* ZCR_EL[1-3] */
-+        uint64_t smcr_el[4];  /* SMCR_EL[1-3] */
-     } vfp;
-     uint64_t exclusive_addr;
-     uint64_t exclusive_val;
-@@ -XXX,XX +XXX,XX @@ FIELD(CPTR_EL3, TCPAC, 31, 1)
- FIELD(SVCR, SM, 0, 1)
- FIELD(SVCR, ZA, 1, 1)
-+/* Fields for SMCR_ELx. */
-+FIELD(SMCR, LEN, 0, 4)
-+FIELD(SMCR, FA64, 31, 1)
-+
- /* Write a new value to v7m.exception, thus transitioning into or out
-  * of Handler mode; this may result in a change of active stack pointer.
-  */
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static void define_arm_vh_e2h_redirects_aliases(ARMCPU *cpu)
+@@ -XXX,XX +XXX,XX @@ static uint64_t gt_phys_cnt_offset(CPUARMState *env)
-          */
+     return gt_phys_raw_cnt_offset(env);
          { K(3, 0,  1, 2, 0), K(3, 4,  1, 2, 0), K(3, 5, 1, 2, 0),
            "ZCR_EL1", "ZCR_EL2", "ZCR_EL12", isar_feature_aa64_sve },
 +        { K(3, 0,  1, 2, 6), K(3, 4,  1, 2, 6), K(3, 5, 1, 2, 6),
 +          "SMCR_EL1", "SMCR_EL2", "SMCR_EL12", isar_feature_aa64_sme },
          { K(3, 0,  5, 6, 0), K(3, 4,  5, 6, 0), K(3, 5, 5, 6, 0),
            "TFSR_EL1", "TFSR_EL2", "TFSR_EL12", isar_feature_aa64_mte },
@@ -XXX,XX +XXX,XX @@ static void svcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
      env->svcr = value;
  }
-+static void smcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
++static uint64_t gt_indirect_access_timer_offset(CPUARMState *env, int timeridx)
 +                       uint64_t value)
 +{
-+    int cur_el = arm_current_el(env);
-+    int old_len = sve_vqm1_for_el(env, cur_el);
-+    int new_len;
-+
-+    QEMU_BUILD_BUG_ON(ARM_MAX_VQ > R_SMCR_LEN_MASK + 1);
-+    value &= R_SMCR_LEN_MASK | R_SMCR_FA64_MASK;
-+    raw_write(env, ri, value);
-+
 +    /*
-+     * Note that it is CONSTRAINED UNPREDICTABLE what happens to ZA storage
++     * Return the timer offset to use for indirect accesses to the timer.
-+     * when SVL is widened (old values kept, or zeros).  Choose to keep the
++     * This is the Offset value as defined in D12.2.4.1 "Operation of the
-+     * current values for simplicity.  But for QEMU internals, we must still
++     * CompareValue views of the timers".
-+     * apply the narrower SVL to the Zregs and Pregs -- see the comment
++     *
-+     * above aarch64_sve_narrow_vq.
++     * The condition here is not always the same as the condition for
 +     * whether to apply an offset register when doing a direct read of
 +     * the counter sysreg; those conditions are described in the
 +     * access pseudocode for each counter register.
 +     */
-+    new_len = sve_vqm1_for_el(env, cur_el);
++    switch (timeridx) {
-+    if (new_len < old_len) {
++    case GTIMER_PHYS:
-+        aarch64_sve_narrow_vq(env, new_len + 1);
++        return gt_phys_raw_cnt_offset(env);
 +    case GTIMER_VIRT:
 +        return env->cp15.cntvoff_el2;
 +    case GTIMER_HYP:
 +    case GTIMER_SEC:
 +    case GTIMER_HYPVIRT:
 +        return 0;
 +    default:
 +        g_assert_not_reached();
 +    }
 +}
 +
- static const ARMCPRegInfo sme_reginfo[] = {
+ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-     { .name = "TPIDR2_EL0", .state = ARM_CP_STATE_AA64,
+ {
-       .opc0 = 3, .opc1 = 3, .crn = 13, .crm = 0, .opc2 = 5,
+     ARMGenericTimer *gt = &cpu->env.cp15.c14_timer[timeridx];
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo sme_reginfo[] = {
+@@ -XXX,XX +XXX,XX @@ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-       .access = PL0_RW, .type = ARM_CP_SME,
+          * Timer enabled: calculate and set current ISTATUS, irq, and
-       .fieldoffset = offsetof(CPUARMState, svcr),
+          * reset timer to when ISTATUS next has to change
-       .writefn = svcr_write, .raw_writefn = raw_write },
+          */
-+    { .name = "SMCR_EL1", .state = ARM_CP_STATE_AA64,
+-        uint64_t offset = timeridx == GTIMER_VIRT ?
-+      .opc0 = 3, .opc1 = 0, .crn = 1, .crm = 2, .opc2 = 6,
+-            cpu->env.cp15.cntvoff_el2 : gt_phys_raw_cnt_offset(&cpu->env);
-+      .access = PL1_RW, .type = ARM_CP_SME,
++        uint64_t offset = gt_indirect_access_timer_offset(&cpu->env, timeridx);
-+      .fieldoffset = offsetof(CPUARMState, vfp.smcr_el[1]),
+         uint64_t count = gt_get_countervalue(&cpu->env);
-+      .writefn = smcr_write, .raw_writefn = raw_write },
+         /* Note that this must be unsigned 64 bit arithmetic: */
-+    { .name = "SMCR_EL2", .state = ARM_CP_STATE_AA64,
+         int istatus = count - offset >= gt->cval;
 +      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 2, .opc2 = 6,
 +      .access = PL2_RW, .type = ARM_CP_SME,
 +      .fieldoffset = offsetof(CPUARMState, vfp.smcr_el[2]),
 +      .writefn = smcr_write, .raw_writefn = raw_write },
 +    { .name = "SMCR_EL3", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 6, .crn = 1, .crm = 2, .opc2 = 6,
 +      .access = PL3_RW, .type = ARM_CP_SME,
 +      .fieldoffset = offsetof(CPUARMState, vfp.smcr_el[3]),
 +      .writefn = smcr_write, .raw_writefn = raw_write },
  };
  #endif /* TARGET_AARCH64 */
 --
-.25.1
+.43.0

-[PULL 21/25] target/arm: Introduce sve_vqm1_for_el_sm
+[PULL 05/21] target/arm: Don't apply CNTVOFF_EL2 for EL2_VIRT timer
-From: Richard Henderson <richard.henderson@linaro.org>
+The CNTVOFF_EL2 offset register should only be applied for accessses
 to CNTVCT_EL0 and for the EL1 virtual timer (CNTV_*).  We were
 incorrectly applying it for the EL2 virtual timer (CNTHV_*).
-When Streaming SVE mode is enabled, the size is taken from
+Cc: qemu-stable@nongnu.org
-SMCR_ELx instead of ZCR_ELx.  The format is shared, but the
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-set of vector lengths is not.  Further, Streaming SVE does
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
-not require any particular length to be supported.
+Message-id: 20250204125009.2281315-3-peter.maydell@linaro.org
 ---
  target/arm/helper.c | 2 --
 file changed, 2 deletions(-)
-Adjust sve_vqm1_for_el to pass the current value of PSTATE.SM
-to the new function.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-19-richard.henderson@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/cpu.h    |  9 +++++++--
- target/arm/helper.c | 32 +++++++++++++++++++++++++-------
-files changed, 32 insertions(+), 9 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ int sve_exception_el(CPUARMState *env, int cur_el);
- int sme_exception_el(CPUARMState *env, int cur_el);
- /**
-- * sve_vqm1_for_el:
-+ * sve_vqm1_for_el_sm:
-  * @env: CPUARMState
-  * @el: exception level
-+ * @sm: streaming mode
-  *
-- * Compute the current SVE vector length for @el, in units of
-+ * Compute the current vector length for @el & @sm, in units of
-  * Quadwords Minus 1 -- the same scale used for ZCR_ELx.LEN.
-+ * If @sm, compute for SVL, otherwise NVL.
-  */
-+uint32_t sve_vqm1_for_el_sm(CPUARMState *env, int el, bool sm);
-+
-+/* Likewise, but using @sm = PSTATE.SM. */
- uint32_t sve_vqm1_for_el(CPUARMState *env, int el);
- static inline bool is_a64(CPUARMState *env)
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ int sme_exception_el(CPUARMState *env, int el)
+@@ -XXX,XX +XXX,XX @@ static uint64_t gt_tval_read(CPUARMState *env, const ARMCPRegInfo *ri,
- /*
-  * Given that SVE is enabled, return the vector length for EL.
+     switch (timeridx) {
-  */
+     case GTIMER_VIRT:
--uint32_t sve_vqm1_for_el(CPUARMState *env, int el)
+-    case GTIMER_HYPVIRT:
-+uint32_t sve_vqm1_for_el_sm(CPUARMState *env, int el, bool sm)
+         offset = gt_virt_cnt_offset(env);
- {
+         break;
-     ARMCPU *cpu = env_archcpu(env);
+     case GTIMER_PHYS:
--    uint32_t len = cpu->sve_max_vq - 1;
+@@ -XXX,XX +XXX,XX @@ static void gt_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+    uint64_t *cr = env->vfp.zcr_el;
-+    uint32_t map = cpu->sve_vq.map;
+     switch (timeridx) {
-+    uint32_t len = ARM_MAX_VQ - 1;
+     case GTIMER_VIRT:
-+
+-    case GTIMER_HYPVIRT:
-+    if (sm) {
+         offset = gt_virt_cnt_offset(env);
-+        cr = env->vfp.smcr_el;
+         break;
-+        map = cpu->sme_vq.map;
+     case GTIMER_PHYS:
 +    }
      if (el <= 1 && !el_is_in_host(env, el)) {
 -        len = MIN(len, 0xf & (uint32_t)env->vfp.zcr_el[1]);
 +        len = MIN(len, 0xf & (uint32_t)cr[1]);
      }
      if (el <= 2 && arm_feature(env, ARM_FEATURE_EL2)) {
 -        len = MIN(len, 0xf & (uint32_t)env->vfp.zcr_el[2]);
 +        len = MIN(len, 0xf & (uint32_t)cr[2]);
      }
      if (arm_feature(env, ARM_FEATURE_EL3)) {
 -        len = MIN(len, 0xf & (uint32_t)env->vfp.zcr_el[3]);
 +        len = MIN(len, 0xf & (uint32_t)cr[3]);
      }
 -    len = 31 - clz32(cpu->sve_vq.map & MAKE_64BIT_MASK(0, len + 1));
 -    return len;
 +    map &= MAKE_64BIT_MASK(0, len + 1);
 +    if (map != 0) {
 +        return 31 - clz32(map);
 +    }
 +
 +    /* Bit 0 is always set for Normal SVE -- not so for Streaming SVE. */
 +    assert(sm);
 +    return ctz32(cpu->sme_vq.map);
 +}
 +
 +uint32_t sve_vqm1_for_el(CPUARMState *env, int el)
 +{
 +    return sve_vqm1_for_el_sm(env, el, FIELD_EX64(env->svcr, SVCR, SM));
  }
  static void zcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
 --
-.25.1
+.43.0

-[PULL 11/25] target/arm: Add PSTATE.{SM,ZA} to TB flags
+[PULL 06/21] target/arm: Make CNTPS_* UNDEF from Secure EL1 when Secure EL2 is enabled
-From: Richard Henderson <richard.henderson@linaro.org>
+When we added Secure EL2 support, we missed that this needs an update
 to the access code for the EL3 physical timer registers.  These are
 supposed to UNDEF from Secure EL1 when Secure EL2 is enabled.
-These are required to determine if various insns
+(Note for stable backporting: for backports to branches where
-are allowed to issue.
+CP_ACCESS_UNDEFINED is not defined, the old name to use instead
 is CP_ACCESS_TRAP_UNCATEGORIZED.)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Cc: qemu-stable@nongnu.org
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220620175235.60881-9-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 20250204125009.2281315-4-peter.maydell@linaro.org
 ---
- target/arm/cpu.h           | 2 ++
+ target/arm/helper.c | 3 +++
- target/arm/translate.h     | 4 ++++
+file changed, 3 insertions(+)
  target/arm/helper.c        | 4 ++++
  target/arm/translate-a64.c | 2 ++
 files changed, 12 insertions(+)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ FIELD(TBFLAG_A64, TCMA, 16, 2)
- FIELD(TBFLAG_A64, MTE_ACTIVE, 18, 1)
- FIELD(TBFLAG_A64, MTE0_ACTIVE, 19, 1)
- FIELD(TBFLAG_A64, SMEEXC_EL, 20, 2)
-+FIELD(TBFLAG_A64, PSTATE_SM, 22, 1)
-+FIELD(TBFLAG_A64, PSTATE_ZA, 23, 1)
- /*
-  * Helpers for using the above.
-diff --git a/target/arm/translate.h b/target/arm/translate.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.h
-+++ b/target/arm/translate.h
-@@ -XXX,XX +XXX,XX @@ typedef struct DisasContext {
-     bool align_mem;
-     /* True if PSTATE.IL is set */
-     bool pstate_il;
-+    /* True if PSTATE.SM is set. */
-+    bool pstate_sm;
-+    /* True if PSTATE.ZA is set. */
-+    bool pstate_za;
-     /* True if MVE insns are definitely not predicated by VPR or LTPSIZE */
-     bool mve_no_pred;
-     /*
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static CPUARMTBFlags rebuild_hflags_a64(CPUARMState *env, int el, int fp_el,
+@@ -XXX,XX +XXX,XX @@ static CPAccessResult gt_stimer_access(CPUARMState *env,
-     }
+         if (!arm_is_secure(env)) {
-     if (cpu_isar_feature(aa64_sme, env_archcpu(env))) {
+             return CP_ACCESS_UNDEFINED;
-         DP_TBFLAG_A64(flags, SMEEXC_EL, sme_exception_el(env, el));
+         }
-+        if (FIELD_EX64(env->svcr, SVCR, SM)) {
++        if (arm_is_el2_enabled(env)) {
-+            DP_TBFLAG_A64(flags, PSTATE_SM, 1);
++            return CP_ACCESS_UNDEFINED;
 +        }
-+        DP_TBFLAG_A64(flags, PSTATE_ZA, FIELD_EX64(env->svcr, SVCR, ZA));
+         if (!(env->cp15.scr_el3 & SCR_ST)) {
-     }
+             return CP_ACCESS_TRAP_EL3;
+         }
      sctlr = regime_sctlr(env, stage1);
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_tr_init_disas_context(DisasContextBase *dcbase,
      dc->ata = EX_TBFLAG_A64(tb_flags, ATA);
      dc->mte_active[0] = EX_TBFLAG_A64(tb_flags, MTE_ACTIVE);
      dc->mte_active[1] = EX_TBFLAG_A64(tb_flags, MTE0_ACTIVE);
 +    dc->pstate_sm = EX_TBFLAG_A64(tb_flags, PSTATE_SM);
 +    dc->pstate_za = EX_TBFLAG_A64(tb_flags, PSTATE_ZA);
      dc->vec_len = 0;
      dc->vec_stride = 0;
      dc->cp_regs = arm_cpu->cp_regs;
 --
-.25.1
+.43.0

-[PULL 13/25] target/arm: Implement SMSTART, SMSTOP
+[PULL 07/21] target/arm: Always apply CNTVOFF_EL2 for CNTV_TVAL_EL02 accesses
-From: Richard Henderson <richard.henderson@linaro.org>
+Currently we handle CNTV_TVAL_EL02 by calling gt_tval_read() for the
 EL1 virt timer.  This is almost correct, but the underlying
 CNTV_TVAL_EL0 register behaves slightly differently.  CNTV_TVAL_EL02
 always applies the CNTVOFF_EL2 offset; CNTV_TVAL_EL0 doesn't do so if
 we're at EL2 and HCR_EL2.E2H is 1.
-These two instructions are aliases of MSR (immediate).
+We were getting this wrong, because we ended up in
-Use the two helpers to properly implement svcr_write.
+gt_virt_cnt_offset() and did the E2H check.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Factor out the tval read/write calculation from the selection of the
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+offset, so that we can special case gt_virt_tval_read() and
-Message-id: 20220620175235.60881-11-richard.henderson@linaro.org
+gt_virt_tval_write() to unconditionally pass CNTVOFF_EL2.
 Cc: qemu-stable@nongnu.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 20250204125009.2281315-5-peter.maydell@linaro.org
 ---
- target/arm/cpu.h           |  1 +
+ target/arm/helper.c | 36 +++++++++++++++++++++++++++---------
- target/arm/helper-sme.h    | 21 +++++++++++++
+file changed, 27 insertions(+), 9 deletions(-)
  target/arm/helper.h        |  1 +
  target/arm/helper.c        |  6 ++--
  target/arm/sme_helper.c    | 61 ++++++++++++++++++++++++++++++++++++++
  target/arm/translate-a64.c | 24 +++++++++++++++
  target/arm/meson.build     |  1 +
 files changed, 112 insertions(+), 3 deletions(-)
  create mode 100644 target/arm/helper-sme.h
  create mode 100644 target/arm/sme_helper.c
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ void aarch64_sve_change_el(CPUARMState *env, int old_el,
-                            int new_el, bool el0_a64);
- void aarch64_add_sve_properties(Object *obj);
- void aarch64_add_pauth_properties(Object *obj);
-+void arm_reset_sve_state(CPUARMState *env);
- /*
-  * SVE registers are encoded in KVM's memory in an endianness-invariant format.
-diff --git a/target/arm/helper-sme.h b/target/arm/helper-sme.h
-new file mode 100644
-index XXXXXXX..XXXXXXX
---- /dev/null
-+++ b/target/arm/helper-sme.h
-@@ -XXX,XX +XXX,XX @@
-+/*
-+ *  AArch64 SME specific helper definitions
-+ *
-+ *  Copyright (c) 2022 Linaro, Ltd
-+ *
-+ * This library is free software; you can redistribute it and/or
-+ * modify it under the terms of the GNU Lesser General Public
-+ * License as published by the Free Software Foundation; either
-+ * version 2.1 of the License, or (at your option) any later version.
-+ *
-+ * This library is distributed in the hope that it will be useful,
-+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
-+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
-+ * Lesser General Public License for more details.
-+ *
-+ * You should have received a copy of the GNU Lesser General Public
-+ * License along with this library; if not, see <http://www.gnu.org/licenses/>.
-+ */
-+
-+DEF_HELPER_FLAGS_2(set_pstate_sm, TCG_CALL_NO_RWG, void, env, i32)
-+DEF_HELPER_FLAGS_2(set_pstate_za, TCG_CALL_NO_RWG, void, env, i32)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
-+++ b/target/arm/helper.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_6(gvec_bfmlal_idx, TCG_CALL_NO_RWG,
- #ifdef TARGET_AARCH64
- #include "helper-a64.h"
- #include "helper-sve.h"
-+#include "helper-sme.h"
- #endif
- #include "helper-mve.h"
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static CPAccessResult access_esm(CPUARMState *env, const ARMCPRegInfo *ri,
+@@ -XXX,XX +XXX,XX @@ static void gt_cval_write(CPUARMState *env, const ARMCPRegInfo *ri,
- static void svcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
+     gt_recalc_timer(env_archcpu(env), timeridx);
                         uint64_t value)
  {
 -    value &= R_SVCR_SM_MASK | R_SVCR_ZA_MASK;
 -    /* TODO: Side effects. */
 -    env->svcr = value;
 +    helper_set_pstate_sm(env, FIELD_EX64(value, SVCR, SM));
 +    helper_set_pstate_za(env, FIELD_EX64(value, SVCR, ZA));
 +    arm_rebuild_hflags(env);
  }
- static void smcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
++static uint64_t do_tval_read(CPUARMState *env, int timeridx, uint64_t offset)
 diff --git a/target/arm/sme_helper.c b/target/arm/sme_helper.c
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/target/arm/sme_helper.c
@@ -XXX,XX +XXX,XX @@
 +/*
 + * ARM SME Operations
 + *
 + * Copyright (c) 2022 Linaro, Ltd.
 + *
 + * This library is free software; you can redistribute it and/or
 + * modify it under the terms of the GNU Lesser General Public
 + * License as published by the Free Software Foundation; either
 + * version 2.1 of the License, or (at your option) any later version.
 + *
 + * This library is distributed in the hope that it will be useful,
 + * but WITHOUT ANY WARRANTY; without even the implied warranty of
 + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
 + * Lesser General Public License for more details.
 + *
 + * You should have received a copy of the GNU Lesser General Public
 + * License along with this library; if not, see <http://www.gnu.org/licenses/>.
 + */
 +
 +#include "qemu/osdep.h"
 +#include "cpu.h"
 +#include "internals.h"
 +#include "exec/helper-proto.h"
 +
 +/* ResetSVEState */
 +void arm_reset_sve_state(CPUARMState *env)
 +{
-+    memset(env->vfp.zregs, 0, sizeof(env->vfp.zregs));
++    return (uint32_t)(env->cp15.c14_timer[timeridx].cval -
-+    /* Recall that FFR is stored as pregs[16]. */
++                      (gt_get_countervalue(env) - offset));
 +    memset(env->vfp.pregs, 0, sizeof(env->vfp.pregs));
 +    vfp_set_fpcr(env, 0x0800009f);
 +}
 +
-+void helper_set_pstate_sm(CPUARMState *env, uint32_t i)
+ static uint64_t gt_tval_read(CPUARMState *env, const ARMCPRegInfo *ri,
-+{
+                              int timeridx)
-+    if (i == FIELD_EX64(env->svcr, SVCR, SM)) {
+ {
-+        return;
+@@ -XXX,XX +XXX,XX @@ static uint64_t gt_tval_read(CPUARMState *env, const ARMCPRegInfo *ri,
-+    }
+         break;
-+    env->svcr ^= R_SVCR_SM_MASK;
+     }
-+    arm_reset_sve_state(env);
 -    return (uint32_t)(env->cp15.c14_timer[timeridx].cval -
 -                      (gt_get_countervalue(env) - offset));
 +    return do_tval_read(env, timeridx, offset);
 +}
 +
-+void helper_set_pstate_za(CPUARMState *env, uint32_t i)
++static void do_tval_write(CPUARMState *env, int timeridx, uint64_t value,
 +                          uint64_t offset)
 +{
-+    if (i == FIELD_EX64(env->svcr, SVCR, ZA)) {
++    trace_arm_gt_tval_write(timeridx, value);
-+        return;
++    env->cp15.c14_timer[timeridx].cval = gt_get_countervalue(env) - offset +
-+    }
++                                         sextract64(value, 0, 32);
-+    env->svcr ^= R_SVCR_ZA_MASK;
++    gt_recalc_timer(env_archcpu(env), timeridx);
-+
+ }
  static void gt_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
@@ -XXX,XX +XXX,XX @@ static void gt_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
          offset = gt_phys_cnt_offset(env);
          break;
      }
 -
 -    trace_arm_gt_tval_write(timeridx, value);
 -    env->cp15.c14_timer[timeridx].cval = gt_get_countervalue(env) - offset +
 -                                         sextract64(value, 0, 32);
 -    gt_recalc_timer(env_archcpu(env), timeridx);
 +    do_tval_write(env, timeridx, value, offset);
  }
  static void gt_ctl_write(CPUARMState *env, const ARMCPRegInfo *ri,
@@ -XXX,XX +XXX,XX @@ static void gt_virt_cval_write(CPUARMState *env, const ARMCPRegInfo *ri,
  static uint64_t gt_virt_tval_read(CPUARMState *env, const ARMCPRegInfo *ri)
  {
 -    return gt_tval_read(env, ri, GTIMER_VIRT);
 +    /*
-+     * ResetSMEState.
++     * This is CNTV_TVAL_EL02; unlike the underlying CNTV_TVAL_EL0
-+     *
++     * we always apply CNTVOFF_EL2. Special case that here rather
-+     * SetPSTATE_ZA zeros on enable and disable.  We can zero this only
++     * than going into the generic gt_tval_read() and then having
-+     * on enable: while disabled, the storage is inaccessible and the
++     * to re-detect that it's this register.
-+     * value does not matter.  We're not saving the storage in vmstate
++     * Note that the accessfn/perms mean we know we're at EL2 or EL3 here.
 +     * when disabled either.
 +     */
-+    if (i) {
++    return do_tval_read(env, GTIMER_VIRT, env->cp15.cntvoff_el2);
-+        memset(env->zarray, 0, sizeof(env->zarray));
+ }
-+    }
-+}
+ static void gt_virt_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+                                uint64_t value)
-index XXXXXXX..XXXXXXX 100644
+ {
---- a/target/arm/translate-a64.c
+-    gt_tval_write(env, ri, GTIMER_VIRT, value);
-+++ b/target/arm/translate-a64.c
++    /* Similarly for writes to CNTV_TVAL_EL02 */
-@@ -XXX,XX +XXX,XX @@ static void handle_msr_i(DisasContext *s, uint32_t insn,
++    do_tval_write(env, GTIMER_VIRT, value, env->cp15.cntvoff_el2);
-         }
+ }
-         break;
+ static void gt_virt_ctl_write(CPUARMState *env, const ARMCPRegInfo *ri,
 +    case 0x1b: /* SVCR* */
 +        if (!dc_isar_feature(aa64_sme, s) || crm < 2 || crm > 7) {
 +            goto do_unallocated;
 +        }
 +        if (sme_access_check(s)) {
 +            bool i = crm & 1;
 +            bool changed = false;
 +
 +            if ((crm & 2) && i != s->pstate_sm) {
 +                gen_helper_set_pstate_sm(cpu_env, tcg_constant_i32(i));
 +                changed = true;
 +            }
 +            if ((crm & 4) && i != s->pstate_za) {
 +                gen_helper_set_pstate_za(cpu_env, tcg_constant_i32(i));
 +                changed = true;
 +            }
 +            if (changed) {
 +                gen_rebuild_hflags(s);
 +            } else {
 +                s->base.is_jmp = DISAS_NEXT;
 +            }
 +        }
 +        break;
 +
      default:
      do_unallocated:
          unallocated_encoding(s);
 diff --git a/target/arm/meson.build b/target/arm/meson.build
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/meson.build
 +++ b/target/arm/meson.build
@@ -XXX,XX +XXX,XX @@ arm_ss.add(when: 'TARGET_AARCH64', if_true: files(
    'mte_helper.c',
    'pauth_helper.c',
    'sve_helper.c',
 +  'sme_helper.c',
    'translate-a64.c',
    'translate-sve.c',
  ))
 --
-.25.1
+.43.0

-[PULL 05/25] target/arm: Add SMEEXC_EL to TB flags
+[PULL 08/21] target/arm: Refactor handling of timer offset for direct register accesses
-From: Richard Henderson <richard.henderson@linaro.org>
+When reading or writing the timer registers, sometimes we need to
+apply one of the timer offsets.  Specifically, this happens for
-This is CheckSMEAccess, which is the basis for a set of
+direct reads of the counter registers CNTPCT_EL0 and CNTVCT_EL0 (and
-related tests for various SME cpregs and instructions.
+their self-synchronized variants CNTVCTSS_EL0 and CNTPCTSS_EL0).  It
+also applies for direct reads and writes of the CNT*_TVAL_EL*
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+registers that provide the 32-bit downcounting view of each timer.
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-3-richard.henderson@linaro.org
+We currently do this with duplicated code in gt_tval_read() and
 gt_tval_write() and a special-case in gt_virt_cnt_read() and
 gt_cnt_read().  Refactor this so that we handle it all in a single
 function gt_direct_access_timer_offset(), to parallel how we handle
 the offset for indirect accesses.
 The call in the WFIT helper previously to gt_virt_cnt_offset() is
 now to gt_direct_access_timer_offset(); this is the correct
 behaviour, but it's not immediately obvious that it shouldn't be
 considered an indirect access, so we add an explanatory comment.
 This commit should make no behavioural changes.
 (Cc to stable because the following bugfix commit will
 depend on this one.)
 Cc: qemu-stable@nongnu.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 20250204125009.2281315-6-peter.maydell@linaro.org
 ---
- target/arm/cpu.h           |  2 ++
+ target/arm/internals.h     |   5 +-
- target/arm/translate.h     |  1 +
+ target/arm/helper.c        | 103 +++++++++++++++++++------------------
- target/arm/helper.c        | 52 ++++++++++++++++++++++++++++++++++++++
+ target/arm/tcg/op_helper.c |   8 ++-
- target/arm/translate-a64.c |  1 +
+files changed, 62 insertions(+), 54 deletions(-)
-files changed, 56 insertions(+)
+diff --git a/target/arm/internals.h b/target/arm/internals.h
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/target/arm/internals.h
-+++ b/target/arm/cpu.h
++++ b/target/arm/internals.h
-@@ -XXX,XX +XXX,XX @@ void aarch64_sync_64_to_32(CPUARMState *env);
+@@ -XXX,XX +XXX,XX @@ int delete_hw_watchpoint(target_ulong addr, target_ulong len, int type);
+ uint64_t gt_get_countervalue(CPUARMState *env);
  int fp_exception_el(CPUARMState *env, int cur_el);
  int sve_exception_el(CPUARMState *env, int cur_el);
 +int sme_exception_el(CPUARMState *env, int cur_el);
  /**
   * sve_vqm1_for_el:
@@ -XXX,XX +XXX,XX @@ FIELD(TBFLAG_A64, ATA, 15, 1)
  FIELD(TBFLAG_A64, TCMA, 16, 2)
  FIELD(TBFLAG_A64, MTE_ACTIVE, 18, 1)
  FIELD(TBFLAG_A64, MTE0_ACTIVE, 19, 1)
 +FIELD(TBFLAG_A64, SMEEXC_EL, 20, 2)
  /*
-  * Helpers for using the above.
+  * Return the currently applicable offset between the system counter
-diff --git a/target/arm/translate.h b/target/arm/translate.h
+- * and CNTVCT_EL0 (this will be either 0 or the value of CNTVOFF_EL2).
-index XXXXXXX..XXXXXXX 100644
++ * and the counter for the specified timer, as used for direct register
---- a/target/arm/translate.h
++ * accesses.
-+++ b/target/arm/translate.h
+  */
-@@ -XXX,XX +XXX,XX @@ typedef struct DisasContext {
+-uint64_t gt_virt_cnt_offset(CPUARMState *env);
-     bool ns;        /* Use non-secure CPREG bank on access */
++uint64_t gt_direct_access_timer_offset(CPUARMState *env, int timeridx);
-     int fp_excp_el; /* FP exception EL or 0 if enabled */
-     int sve_excp_el; /* SVE exception EL or 0 if enabled */
+ /*
-+    int sme_excp_el; /* SME exception EL or 0 if enabled */
+  * Return mask of ARMMMUIdxBit values corresponding to an "invalidate
      int vl;          /* current vector length in bytes */
      bool vfp_enabled; /* FP enabled via FPSCR.EN */
      int vec_len;
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ int sve_exception_el(CPUARMState *env, int el)
+@@ -XXX,XX +XXX,XX @@ static uint64_t gt_phys_raw_cnt_offset(CPUARMState *env)
      return 0;
  }
-+/*
+-static uint64_t gt_phys_cnt_offset(CPUARMState *env)
-+ * Return the exception level to which exceptions should be taken for SME.
+-{
-+ * C.f. the ARM pseudocode function CheckSMEAccess.
+-    if (arm_current_el(env) >= 2) {
-+ */
+-        return 0;
-+int sme_exception_el(CPUARMState *env, int el)
+-    }
 -    return gt_phys_raw_cnt_offset(env);
 -}
 -
  static uint64_t gt_indirect_access_timer_offset(CPUARMState *env, int timeridx)
  {
      /*
@@ -XXX,XX +XXX,XX @@ static uint64_t gt_indirect_access_timer_offset(CPUARMState *env, int timeridx)
      }
  }
 +uint64_t gt_direct_access_timer_offset(CPUARMState *env, int timeridx)
 +{
-+#ifndef CONFIG_USER_ONLY
++    /*
-+    if (el <= 1 && !el_is_in_host(env, el)) {
++     * Return the timer offset to use for direct accesses to the
-+        switch (FIELD_EX64(env->cp15.cpacr_el1, CPACR_EL1, SMEN)) {
++     * counter registers CNTPCT and CNTVCT, and for direct accesses
-+        case 1:
++     * to the CNT*_TVAL registers.
-+            if (el != 0) {
++     *
-+                break;
++     * This isn't exactly the same as the indirect-access offset,
 +     * because here we also care about what EL the register access
 +     * is being made from.
 +     *
 +     * This corresponds to the access pseudocode for the registers.
 +     */
 +    uint64_t hcr;
 +
 +    switch (timeridx) {
 +    case GTIMER_PHYS:
 +        if (arm_current_el(env) >= 2) {
 +            return 0;
 +        }
 +        return gt_phys_raw_cnt_offset(env);
 +    case GTIMER_VIRT:
 +        switch (arm_current_el(env)) {
 +        case 2:
 +            hcr = arm_hcr_el2_eff(env);
 +            if (hcr & HCR_E2H) {
 +                return 0;
 +            }
-+            /* fall through */
++            break;
 +        case 0:
-+        case 2:
++            hcr = arm_hcr_el2_eff(env);
-+            return 1;
++            if ((hcr & (HCR_E2H | HCR_TGE)) == (HCR_E2H | HCR_TGE)) {
 +                return 0;
 +            }
 +            break;
 +        }
++        return env->cp15.cntvoff_el2;
++    case GTIMER_HYP:
++    case GTIMER_SEC:
++    case GTIMER_HYPVIRT:
++        return 0;
++    default:
++        g_assert_not_reached();
 +    }
-+
-+    if (el <= 2 && arm_is_el2_enabled(env)) {
-+        /* CPTR_EL2 changes format with HCR_EL2.E2H (regardless of TGE). */
-+        if (env->cp15.hcr_el2 & HCR_E2H) {
-+            switch (FIELD_EX64(env->cp15.cptr_el[2], CPTR_EL2, SMEN)) {
-+            case 1:
-+                if (el != 0 || !(env->cp15.hcr_el2 & HCR_TGE)) {
-+                    break;
-+                }
-+                /* fall through */
-+            case 0:
-+            case 2:
-+                return 2;
-+            }
-+        } else {
-+            if (FIELD_EX64(env->cp15.cptr_el[2], CPTR_EL2, TSM)) {
-+                return 2;
-+            }
-+        }
-+    }
-+
-+    /* CPTR_EL3.  Since ESM is negative we must check for EL3.  */
-+    if (arm_feature(env, ARM_FEATURE_EL3)
-+        && !FIELD_EX64(env->cp15.cptr_el[3], CPTR_EL3, ESM)) {
-+        return 3;
-+    }
-+#endif
-+    return 0;
 +}
 +
- /*
+ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-  * Given that SVE is enabled, return the vector length for EL.
+ {
-  */
+     ARMGenericTimer *gt = &cpu->env.cp15.c14_timer[timeridx];
-@@ -XXX,XX +XXX,XX @@ static CPUARMTBFlags rebuild_hflags_a64(CPUARMState *env, int el, int fp_el,
+@@ -XXX,XX +XXX,XX @@ static void gt_timer_reset(CPUARMState *env, const ARMCPRegInfo *ri,
-         }
-         DP_TBFLAG_A64(flags, SVEEXC_EL, sve_el);
+ static uint64_t gt_cnt_read(CPUARMState *env, const ARMCPRegInfo *ri)
-     }
+ {
-+    if (cpu_isar_feature(aa64_sme, env_archcpu(env))) {
+-    return gt_get_countervalue(env) - gt_phys_cnt_offset(env);
-+        DP_TBFLAG_A64(flags, SMEEXC_EL, sme_exception_el(env, el));
+-}
-+    }
+-
+-uint64_t gt_virt_cnt_offset(CPUARMState *env)
-     sctlr = regime_sctlr(env, stage1);
+-{
+-    uint64_t hcr;
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+-
 -    switch (arm_current_el(env)) {
 -    case 2:
 -        hcr = arm_hcr_el2_eff(env);
 -        if (hcr & HCR_E2H) {
 -            return 0;
 -        }
 -        break;
 -    case 0:
 -        hcr = arm_hcr_el2_eff(env);
 -        if ((hcr & (HCR_E2H | HCR_TGE)) == (HCR_E2H | HCR_TGE)) {
 -            return 0;
 -        }
 -        break;
 -    }
 -
 -    return env->cp15.cntvoff_el2;
 +    uint64_t offset = gt_direct_access_timer_offset(env, GTIMER_PHYS);
 +    return gt_get_countervalue(env) - offset;
  }
  static uint64_t gt_virt_cnt_read(CPUARMState *env, const ARMCPRegInfo *ri)
  {
 -    return gt_get_countervalue(env) - gt_virt_cnt_offset(env);
 +    uint64_t offset = gt_direct_access_timer_offset(env, GTIMER_VIRT);
 +    return gt_get_countervalue(env) - offset;
  }
  static void gt_cval_write(CPUARMState *env, const ARMCPRegInfo *ri,
@@ -XXX,XX +XXX,XX @@ static uint64_t do_tval_read(CPUARMState *env, int timeridx, uint64_t offset)
  static uint64_t gt_tval_read(CPUARMState *env, const ARMCPRegInfo *ri,
                               int timeridx)
  {
 -    uint64_t offset = 0;
 -
 -    switch (timeridx) {
 -    case GTIMER_VIRT:
 -        offset = gt_virt_cnt_offset(env);
 -        break;
 -    case GTIMER_PHYS:
 -        offset = gt_phys_cnt_offset(env);
 -        break;
 -    }
 +    uint64_t offset = gt_direct_access_timer_offset(env, timeridx);
      return do_tval_read(env, timeridx, offset);
  }
@@ -XXX,XX +XXX,XX @@ static void gt_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
                            int timeridx,
                            uint64_t value)
  {
 -    uint64_t offset = 0;
 +    uint64_t offset = gt_direct_access_timer_offset(env, timeridx);
 -    switch (timeridx) {
 -    case GTIMER_VIRT:
 -        offset = gt_virt_cnt_offset(env);
 -        break;
 -    case GTIMER_PHYS:
 -        offset = gt_phys_cnt_offset(env);
 -        break;
 -    }
      do_tval_write(env, timeridx, value, offset);
  }
 diff --git a/target/arm/tcg/op_helper.c b/target/arm/tcg/op_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/tcg/op_helper.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/tcg/op_helper.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_tr_init_disas_context(DisasContextBase *dcbase,
+@@ -XXX,XX +XXX,XX @@ void HELPER(wfit)(CPUARMState *env, uint64_t timeout)
-     dc->align_mem = EX_TBFLAG_ANY(tb_flags, ALIGN_MEM);
+     int target_el = check_wfx_trap(env, false, &excp);
-     dc->pstate_il = EX_TBFLAG_ANY(tb_flags, PSTATE__IL);
+     /* The WFIT should time out when CNTVCT_EL0 >= the specified value. */
-     dc->sve_excp_el = EX_TBFLAG_A64(tb_flags, SVEEXC_EL);
+     uint64_t cntval = gt_get_countervalue(env);
-+    dc->sme_excp_el = EX_TBFLAG_A64(tb_flags, SMEEXC_EL);
+-    uint64_t offset = gt_virt_cnt_offset(env);
-     dc->vl = (EX_TBFLAG_A64(tb_flags, VL) + 1) * 16;
++    /*
-     dc->pauth_active = EX_TBFLAG_A64(tb_flags, PAUTH_ACTIVE);
++     * We want the value that we would get if we read CNTVCT_EL0 from
-     dc->bt = EX_TBFLAG_A64(tb_flags, BT);
++     * the current exception level, so the direct_access offset, not
 +     * the indirect_access one. Compare the pseudocode LocalTimeoutEvent(),
 +     * which calls VirtualCounterTimer().
 +     */
 +    uint64_t offset = gt_direct_access_timer_offset(env, GTIMER_VIRT);
      uint64_t cntvct = cntval - offset;
      uint64_t nexttick;
 --
-.25.1
+.43.0

-[PULL 20/25] target/arm: Add cpu properties for SME
+[PULL 09/21] target/arm: Implement SEL2 physical and virtual timers
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Alex Bennée <alex.bennee@linaro.org>
-Mirror the properties for SVE.  The main difference is
+When FEAT_SEL2 was implemented the SEL2 timers were missed. This
-that any arbitrary set of powers of 2 may be supported,
+shows up when building the latest Hafnium with SPMC_AT_EL=2. The
-and not the stricter constraints that apply to SVE.
+actual implementation utilises the same logic as the rest of the
+timers so all we need to do is:
-Include a property to control FEAT_SME_FA64, as failing
-to restrict the runtime to the proper subset of insns
+  - define the timers and their access functions
-could be a major point for bugs.
+  - conditionally add the correct system registers
+  - create a new accessfn as the rules are subtly different to the
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+    existing secure timer
 Fixes: e9152ee91c (target/arm: add ARMv8.4-SEL2 system registers)
 Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Message-id: 20220620175235.60881-18-richard.henderson@linaro.org
+Message-id: 20250204125009.2281315-7-peter.maydell@linaro.org
 Cc: qemu-stable@nongnu.org
 Cc: Andrei Homescu <ahomescu@google.com>
 Cc: Arve Hjønnevåg <arve@google.com>
 Cc: Rémi Denis-Courmont <remi.denis.courmont@huawei.com>
 [PMM: CP_ACCESS_TRAP_UNCATEGORIZED -> CP_ACCESS_UNDEFINED;
  offset logic now in gt_{indirect,direct}_access_timer_offset() ]
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- docs/system/arm/cpu-features.rst |  56 +++++++++++++++
+ include/hw/arm/bsa.h |   2 +
- target/arm/cpu.h                 |   2 +
+ target/arm/cpu.h     |   2 +
- target/arm/internals.h           |   1 +
+ target/arm/gtimer.h  |   4 +-
- target/arm/cpu.c                 |  14 +++-
+ target/arm/cpu.c     |   4 ++
- target/arm/cpu64.c               | 114 +++++++++++++++++++++++++++++--
+ target/arm/helper.c  | 163 +++++++++++++++++++++++++++++++++++++++++++
-files changed, 180 insertions(+), 7 deletions(-)
+files changed, 174 insertions(+), 1 deletion(-)
-diff --git a/docs/system/arm/cpu-features.rst b/docs/system/arm/cpu-features.rst
+diff --git a/include/hw/arm/bsa.h b/include/hw/arm/bsa.h
 index XXXXXXX..XXXXXXX 100644
---- a/docs/system/arm/cpu-features.rst
+--- a/include/hw/arm/bsa.h
-+++ b/docs/system/arm/cpu-features.rst
++++ b/include/hw/arm/bsa.h
-@@ -XXX,XX +XXX,XX @@ verbose command lines.  However, the recommended way to select vector
+@@ -XXX,XX +XXX,XX @@
- lengths is to explicitly enable each desired length.  Therefore only
+ #define QEMU_ARM_BSA_H
- example's (1), (4), and (6) exhibit recommended uses of the properties.
+ /* These are architectural INTID values */
-+SME CPU Property Examples
++#define ARCH_TIMER_S_EL2_VIRT_IRQ  19
-+-------------------------
++#define ARCH_TIMER_S_EL2_IRQ       20
-+
+ #define VIRTUAL_PMU_IRQ            23
-+  1) Disable SME::
+ #define ARCH_GIC_MAINT_IRQ         25
-+
+ #define ARCH_TIMER_NS_EL2_IRQ      26
 +     $ qemu-system-aarch64 -M virt -cpu max,sme=off
 +
 +  2) Implicitly enable all vector lengths for the ``max`` CPU type::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max
 +
 +  3) Only enable the 256-bit vector length::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sme256=on
 +
 +  3) Enable the 256-bit and 1024-bit vector lengths::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sme256=on,sme1024=on
 +
 +  4) Disable the 512-bit vector length.  This results in all the other
 +     lengths supported by ``max`` defaulting to enabled
 +     (128, 256, 1024 and 2048)::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve512=off
 +
  SVE User-mode Default Vector Length Property
  --------------------------------------------
@@ -XXX,XX +XXX,XX @@ length supported by QEMU is 256.
  If this property is set to ``-1`` then the default vector length
  is set to the maximum possible length.
 +
 +SME CPU Properties
 +==================
 +
 +The SME CPU properties are much like the SVE properties: ``sme`` is
 +used to enable or disable the entire SME feature, and ``sme<N>`` is
 +used to enable or disable specific vector lengths.  Finally,
 +``sme_fa64`` is used to enable or disable ``FEAT_SME_FA64``, which
 +allows execution of the "full a64" instruction set while Streaming
 +SVE mode is enabled.
 +
 +SME is not supported by KVM at this time.
 +
 +At least one vector length must be enabled when ``sme`` is enabled,
 +and all vector lengths must be powers of 2.  The maximum vector
 +length supported by qemu is 2048 bits.  Otherwise, there are no
 +additional constraints on the set of vector lengths supported by SME.
 +
 +SME User-mode Default Vector Length Property
 +--------------------------------------------
 +
 +For qemu-aarch64, the cpu propery ``sme-default-vector-length=N`` is
 +defined to mirror the Linux kernel parameter file
 +``/proc/sys/abi/sme_default_vector_length``.  The default length, ``N``,
 +is in units of bytes and must be between 16 and 8192.
 +If not specified, the default vector length is 32.
 +
 +As with ``sve-default-vector-length``, if the default length is larger
 +than the maximum vector length enabled, the actual vector length will
 +be reduced.  If this property is set to ``-1`` then the default vector
 +length is set to the maximum possible length.
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ struct ArchCPU {
+@@ -XXX,XX +XXX,XX @@ void arm_gt_vtimer_cb(void *opaque);
- #ifdef CONFIG_USER_ONLY
+ void arm_gt_htimer_cb(void *opaque);
-     /* Used to set the default vector length at process start. */
+ void arm_gt_stimer_cb(void *opaque);
-     uint32_t sve_default_vq;
+ void arm_gt_hvtimer_cb(void *opaque);
-+    uint32_t sme_default_vq;
++void arm_gt_sel2timer_cb(void *opaque);
- #endif
++void arm_gt_sel2vtimer_cb(void *opaque);
-     ARMVQMap sve_vq;
+ unsigned int gt_cntfrq_period_ns(ARMCPU *cpu);
-+    ARMVQMap sme_vq;
+ void gt_rme_post_el_change(ARMCPU *cpu, void *opaque);
+diff --git a/target/arm/gtimer.h b/target/arm/gtimer.h
-     /* Generic timer counter frequency, in Hz */
+index XXXXXXX..XXXXXXX 100644
-     uint64_t gt_cntfrq_hz;
+--- a/target/arm/gtimer.h
-diff --git a/target/arm/internals.h b/target/arm/internals.h
++++ b/target/arm/gtimer.h
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ enum {
---- a/target/arm/internals.h
+     GTIMER_HYP      = 2,
-+++ b/target/arm/internals.h
+     GTIMER_SEC      = 3,
-@@ -XXX,XX +XXX,XX @@ int arm_gdb_set_svereg(CPUARMState *env, uint8_t *buf, int reg);
+     GTIMER_HYPVIRT  = 4,
- int aarch64_fpu_gdb_get_reg(CPUARMState *env, GByteArray *buf, int reg);
+-#define NUM_GTIMERS   5
- int aarch64_fpu_gdb_set_reg(CPUARMState *env, uint8_t *buf, int reg);
++    GTIMER_S_EL2_PHYS = 5, /* CNTHPS_* ; only if FEAT_SEL2 */
- void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp);
++    GTIMER_S_EL2_VIRT = 6, /* CNTHVS_* ; only if FEAT_SEL2 */
-+void arm_cpu_sme_finalize(ARMCPU *cpu, Error **errp);
++#define NUM_GTIMERS   7
- void arm_cpu_pauth_finalize(ARMCPU *cpu, Error **errp);
+ };
- void arm_cpu_lpa2_finalize(ARMCPU *cpu, Error **errp);
  #endif
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
- #ifdef CONFIG_USER_ONLY
+                                               arm_gt_stimer_cb, cpu);
- # ifdef TARGET_AARCH64
+         cpu->gt_timer[GTIMER_HYPVIRT] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
-     /*
+                                                   arm_gt_hvtimer_cb, cpu);
--     * The linux kernel defaults to 512-bit vectors, when sve is supported.
++        cpu->gt_timer[GTIMER_S_EL2_PHYS] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
--     * See documentation for /proc/sys/abi/sve_default_vector_length, and
++                                                     arm_gt_sel2timer_cb, cpu);
--     * our corresponding sve-default-vector-length cpu property.
++        cpu->gt_timer[GTIMER_S_EL2_VIRT] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
-+     * The linux kernel defaults to 512-bit for SVE, and 256-bit for SME.
++                                                     arm_gt_sel2vtimer_cb, cpu);
-+     * These values were chosen to fit within the default signal frame.
+     }
-+     * See documentation for /proc/sys/abi/{sve,sme}_default_vector_length,
+ #endif
-+     * and our corresponding cpu property.
-      */
+diff --git a/target/arm/helper.c b/target/arm/helper.c
-     cpu->sve_default_vq = 4;
+index XXXXXXX..XXXXXXX 100644
-+    cpu->sme_default_vq = 2;
+--- a/target/arm/helper.c
- # endif
++++ b/target/arm/helper.c
- #else
+@@ -XXX,XX +XXX,XX @@ static CPAccessResult gt_stimer_access(CPUARMState *env,
-     /* Our inbound IRQ and FIQ lines */
+     }
-@@ -XXX,XX +XXX,XX @@ void arm_cpu_finalize_features(ARMCPU *cpu, Error **errp)
+ }
-             return;
-         }
++static CPAccessResult gt_sel2timer_access(CPUARMState *env,
++                                          const ARMCPRegInfo *ri,
-+        arm_cpu_sme_finalize(cpu, &local_err);
++                                          bool isread)
-+        if (local_err != NULL) {
++{
-+            error_propagate(errp, local_err);
++    /*
-+            return;
++     * The AArch64 register view of the secure EL2 timers are mostly
 +     * accessible from EL3 and EL2 although can also be trapped to EL2
 +     * from EL1 depending on nested virt config.
 +     */
 +    switch (arm_current_el(env)) {
 +    case 0: /* UNDEFINED */
 +        return CP_ACCESS_UNDEFINED;
 +    case 1:
 +        if (!arm_is_secure(env)) {
 +            /* UNDEFINED */
 +            return CP_ACCESS_UNDEFINED;
 +        } else if (arm_hcr_el2_eff(env) & HCR_NV) {
 +            /* Aarch64.SystemAccessTrap(EL2, 0x18) */
 +            return CP_ACCESS_TRAP_EL2;
 +        }
-+
++        /* UNDEFINED */
-         arm_cpu_pauth_finalize(cpu, &local_err);
++        return CP_ACCESS_UNDEFINED;
-         if (local_err != NULL) {
++    case 2:
-             error_propagate(errp, local_err);
++        if (!arm_is_secure(env)) {
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
++            /* UNDEFINED */
-index XXXXXXX..XXXXXXX 100644
++            return CP_ACCESS_UNDEFINED;
---- a/target/arm/cpu64.c
++        }
-+++ b/target/arm/cpu64.c
++        return CP_ACCESS_OK;
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_get_vq(Object *obj, Visitor *v, const char *name,
++    case 3:
-     ARMCPU *cpu = ARM_CPU(obj);
++        if (env->cp15.scr_el3 & SCR_EEL2) {
-     ARMVQMap *vq_map = opaque;
++            return CP_ACCESS_OK;
-     uint32_t vq = atoi(&name[3]) / 128;
++        } else {
-+    bool sve = vq_map == &cpu->sve_vq;
++            return CP_ACCESS_UNDEFINED;
-     bool value;
++        }
++    default:
--    /* All vector lengths are disabled when SVE is off. */
++        g_assert_not_reached();
--    if (!cpu_isar_feature(aa64_sve, cpu)) {
++    }
-+    /* All vector lengths are disabled when feature is off. */
++}
-+    if (sve
++
-+        ? !cpu_isar_feature(aa64_sve, cpu)
+ uint64_t gt_get_countervalue(CPUARMState *env)
-+        : !cpu_isar_feature(aa64_sme, cpu)) {
+ {
-         value = false;
+     ARMCPU *cpu = env_archcpu(env);
-     } else {
+@@ -XXX,XX +XXX,XX @@ static uint64_t gt_indirect_access_timer_offset(CPUARMState *env, int timeridx)
-         value = extract32(vq_map->map, vq - 1, 1);
+     case GTIMER_HYP:
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, bool value, Error **errp)
+     case GTIMER_SEC:
-     cpu->isar.id_aa64pfr0 = t;
+     case GTIMER_HYPVIRT:
 +    case GTIMER_S_EL2_PHYS:
 +    case GTIMER_S_EL2_VIRT:
          return 0;
      default:
          g_assert_not_reached();
@@ -XXX,XX +XXX,XX @@ uint64_t gt_direct_access_timer_offset(CPUARMState *env, int timeridx)
      case GTIMER_HYP:
      case GTIMER_SEC:
      case GTIMER_HYPVIRT:
 +    case GTIMER_S_EL2_PHYS:
 +    case GTIMER_S_EL2_VIRT:
          return 0;
      default:
          g_assert_not_reached();
@@ -XXX,XX +XXX,XX @@ static void gt_sec_ctl_write(CPUARMState *env, const ARMCPRegInfo *ri,
      gt_ctl_write(env, ri, GTIMER_SEC, value);
  }
-+void arm_cpu_sme_finalize(ARMCPU *cpu, Error **errp)
++static void gt_sec_pel2_timer_reset(CPUARMState *env, const ARMCPRegInfo *ri)
 +{
-+    uint32_t vq_map = cpu->sme_vq.map;
++    gt_timer_reset(env, ri, GTIMER_S_EL2_PHYS);
-+    uint32_t vq_init = cpu->sme_vq.init;
++}
-+    uint32_t vq_supported = cpu->sme_vq.supported;
++
-+    uint32_t vq;
++static void gt_sec_pel2_cval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+
++                                   uint64_t value)
-+    if (vq_map == 0) {
++{
-+        if (!cpu_isar_feature(aa64_sme, cpu)) {
++    gt_cval_write(env, ri, GTIMER_S_EL2_PHYS, value);
-+            cpu->isar.id_aa64smfr0 = 0;
++}
-+            return;
++
-+        }
++static uint64_t gt_sec_pel2_tval_read(CPUARMState *env, const ARMCPRegInfo *ri)
-+
++{
-+        /* TODO: KVM will require limitations via SMCR_EL2. */
++    return gt_tval_read(env, ri, GTIMER_S_EL2_PHYS);
-+        vq_map = vq_supported & ~vq_init;
++}
 +
-+        if (vq_map == 0) {
++static void gt_sec_pel2_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+            vq = ctz32(vq_supported) + 1;
++                              uint64_t value)
-+            error_setg(errp, "cannot disable sme%d", vq * 128);
++{
-+            error_append_hint(errp, "All SME vector lengths are disabled.\n");
++    gt_tval_write(env, ri, GTIMER_S_EL2_PHYS, value);
-+            error_append_hint(errp, "With SME enabled, at least one "
++}
-+                              "vector length must be enabled.\n");
++
-+            return;
++static void gt_sec_pel2_ctl_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+        }
++                              uint64_t value)
-+    } else {
++{
-+        if (!cpu_isar_feature(aa64_sme, cpu)) {
++    gt_ctl_write(env, ri, GTIMER_S_EL2_PHYS, value);
-+            vq = 32 - clz32(vq_map);
++}
-+            error_setg(errp, "cannot enable sme%d", vq * 128);
++
-+            error_append_hint(errp, "SME must be enabled to enable "
++static void gt_sec_vel2_timer_reset(CPUARMState *env, const ARMCPRegInfo *ri)
-+                              "vector lengths.\n");
++{
-+            error_append_hint(errp, "Add sme=on to the CPU property list.\n");
++    gt_timer_reset(env, ri, GTIMER_S_EL2_VIRT);
-+            return;
++}
-+        }
++
-+        /* TODO: KVM will require limitations via SMCR_EL2. */
++static void gt_sec_vel2_cval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+    }
++                              uint64_t value)
-+
++{
-+    cpu->sme_vq.map = vq_map;
++    gt_cval_write(env, ri, GTIMER_S_EL2_VIRT, value);
 +}
 +
-+static bool cpu_arm_get_sme(Object *obj, Error **errp)
++static uint64_t gt_sec_vel2_tval_read(CPUARMState *env, const ARMCPRegInfo *ri)
 +{
-+    ARMCPU *cpu = ARM_CPU(obj);
++    return gt_tval_read(env, ri, GTIMER_S_EL2_VIRT);
-+    return cpu_isar_feature(aa64_sme, cpu);
++}
-+}
++
-+
++static void gt_sec_vel2_tval_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+static void cpu_arm_set_sme(Object *obj, bool value, Error **errp)
++                                   uint64_t value)
 +{
-+    ARMCPU *cpu = ARM_CPU(obj);
++    gt_tval_write(env, ri, GTIMER_S_EL2_VIRT, value);
-+    uint64_t t;
++}
 +
-+    t = cpu->isar.id_aa64pfr1;
++static void gt_sec_vel2_ctl_write(CPUARMState *env, const ARMCPRegInfo *ri,
-+    t = FIELD_DP64(t, ID_AA64PFR1, SME, value);
++                              uint64_t value)
-+    cpu->isar.id_aa64pfr1 = t;
++{
-+}
++    gt_ctl_write(env, ri, GTIMER_S_EL2_VIRT, value);
-+
++}
-+static bool cpu_arm_get_sme_fa64(Object *obj, Error **errp)
++
-+{
+ static void gt_hv_timer_reset(CPUARMState *env, const ARMCPRegInfo *ri)
-+    ARMCPU *cpu = ARM_CPU(obj);
+ {
-+    return cpu_isar_feature(aa64_sme, cpu) &&
+     gt_timer_reset(env, ri, GTIMER_HYPVIRT);
-+           cpu_isar_feature(aa64_sme_fa64, cpu);
+@@ -XXX,XX +XXX,XX @@ void arm_gt_stimer_cb(void *opaque)
-+}
+     gt_recalc_timer(cpu, GTIMER_SEC);
 +
 +static void cpu_arm_set_sme_fa64(Object *obj, bool value, Error **errp)
 +{
 +    ARMCPU *cpu = ARM_CPU(obj);
 +    uint64_t t;
 +
 +    t = cpu->isar.id_aa64smfr0;
 +    t = FIELD_DP64(t, ID_AA64SMFR0, FA64, value);
 +    cpu->isar.id_aa64smfr0 = t;
 +}
 +
  #ifdef CONFIG_USER_ONLY
 -/* Mirror linux /proc/sys/abi/sve_default_vector_length. */
 +/* Mirror linux /proc/sys/abi/{sve,sme}_default_vector_length. */
  static void cpu_arm_set_default_vec_len(Object *obj, Visitor *v,
                                          const char *name, void *opaque,
                                          Error **errp)
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_default_vec_len(Object *obj, Visitor *v,
       * and is the maximum architectural width of ZCR_ELx.LEN.
       */
      if (remainder || default_vq < 1 || default_vq > 512) {
 -        error_setg(errp, "cannot set sve-default-vector-length");
 +        ARMCPU *cpu = ARM_CPU(obj);
 +        const char *which =
 +            (ptr_default_vq == &cpu->sve_default_vq ? "sve" : "sme");
 +
 +        error_setg(errp, "cannot set %s-default-vector-length", which);
          if (remainder) {
              error_append_hint(errp, "Vector length not a multiple of 16\n");
          } else if (default_vq < 1) {
@@ -XXX,XX +XXX,XX @@ static void aarch64_add_sve_properties(Object *obj)
  #endif
  }
-+static void aarch64_add_sme_properties(Object *obj)
++void arm_gt_sel2timer_cb(void *opaque)
 +{
-+    ARMCPU *cpu = ARM_CPU(obj);
++    ARMCPU *cpu = opaque;
-+    uint32_t vq;
++
-+
++    gt_recalc_timer(cpu, GTIMER_S_EL2_PHYS);
-+    object_property_add_bool(obj, "sme", cpu_arm_get_sme, cpu_arm_set_sme);
++}
-+    object_property_add_bool(obj, "sme_fa64", cpu_arm_get_sme_fa64,
++
-+                             cpu_arm_set_sme_fa64);
++void arm_gt_sel2vtimer_cb(void *opaque)
-+
++{
-+    for (vq = 1; vq <= ARM_MAX_VQ; vq <<= 1) {
++    ARMCPU *cpu = opaque;
-+        char name[8];
++
-+        sprintf(name, "sme%d", vq * 128);
++    gt_recalc_timer(cpu, GTIMER_S_EL2_VIRT);
-+        object_property_add(obj, name, "bool", cpu_arm_get_vq,
++}
-+                            cpu_arm_set_vq, NULL, &cpu->sme_vq);
++
-+    }
+ void arm_gt_hvtimer_cb(void *opaque)
-+
+ {
-+#ifdef CONFIG_USER_ONLY
+     ARMCPU *cpu = opaque;
-+    /* Mirror linux /proc/sys/abi/sme_default_vector_length. */
+@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_sec_cp_reginfo[] = {
-+    object_property_add(obj, "sme-default-vector-length", "int32",
+       .access = PL2_RW, .accessfn = sel2_access,
-+                        cpu_arm_get_default_vec_len,
+       .nv2_redirect_offset = 0x48,
-+                        cpu_arm_set_default_vec_len, NULL,
+       .fieldoffset = offsetof(CPUARMState, cp15.vstcr_el2) },
-+                        &cpu->sme_default_vq);
++#ifndef CONFIG_USER_ONLY
 +    /* Secure EL2 Physical Timer */
 +    { .name = "CNTHPS_TVAL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 5, .opc2 = 0,
 +      .type = ARM_CP_NO_RAW | ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .readfn = gt_sec_pel2_tval_read,
 +      .writefn = gt_sec_pel2_tval_write,
 +      .resetfn = gt_sec_pel2_timer_reset,
 +    },
 +    { .name = "CNTHPS_CTL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 5, .opc2 = 1,
 +      .type = ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .fieldoffset = offsetof(CPUARMState, cp15.c14_timer[GTIMER_S_EL2_PHYS].ctl),
 +      .resetvalue = 0,
 +      .writefn = gt_sec_pel2_ctl_write, .raw_writefn = raw_write,
 +    },
 +    { .name = "CNTHPS_CVAL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 5, .opc2 = 2,
 +      .type = ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .fieldoffset = offsetof(CPUARMState, cp15.c14_timer[GTIMER_S_EL2_PHYS].cval),
 +      .writefn = gt_sec_pel2_cval_write, .raw_writefn = raw_write,
 +    },
 +    /* Secure EL2 Virtual Timer */
 +    { .name = "CNTHVS_TVAL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 4, .opc2 = 0,
 +      .type = ARM_CP_NO_RAW | ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .readfn = gt_sec_vel2_tval_read,
 +      .writefn = gt_sec_vel2_tval_write,
 +      .resetfn = gt_sec_vel2_timer_reset,
 +    },
 +    { .name = "CNTHVS_CTL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 4, .opc2 = 1,
 +      .type = ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .fieldoffset = offsetof(CPUARMState, cp15.c14_timer[GTIMER_S_EL2_VIRT].ctl),
 +      .resetvalue = 0,
 +      .writefn = gt_sec_vel2_ctl_write, .raw_writefn = raw_write,
 +    },
 +    { .name = "CNTHVS_CVAL_EL2", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 4, .opc2 = 2,
 +      .type = ARM_CP_IO, .access = PL2_RW,
 +      .accessfn = gt_sel2timer_access,
 +      .fieldoffset = offsetof(CPUARMState, cp15.c14_timer[GTIMER_S_EL2_VIRT].cval),
 +      .writefn = gt_sec_vel2_cval_write, .raw_writefn = raw_write,
 +    },
 +#endif
-+}
+ };
-+
- void arm_cpu_pauth_finalize(ARMCPU *cpu, Error **errp)
+ static CPAccessResult nsacr_access(CPUARMState *env, const ARMCPRegInfo *ri,
  {
      int arch_val = 0, impdef_val = 0;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
  #endif
      cpu->sve_vq.supported = MAKE_64BIT_MASK(0, ARM_MAX_VQ);
 +    cpu->sme_vq.supported = SVE_VQ_POW2_MAP;
      aarch64_add_pauth_properties(obj);
      aarch64_add_sve_properties(obj);
 +    aarch64_add_sme_properties(obj);
      object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
                          cpu_max_set_sve_max_vq, NULL, NULL);
      qdev_property_add_static(DEVICE(obj), &arm_cpu_lpa2_property);
 --
-.25.1
+.43.0

-[PULL 01/25] sphinx: change default language to 'en'
+[PULL 10/21] target/arm: Document the architectural names of our GTIMERs
-From: Martin Liška <mliska@suse.cz>
+From: Alex Bennée <alex.bennee@linaro.org>
-Fixes the following Sphinx warning (treated as error) starting
+As we are about to add more physical and virtual timers let's make it
-with 5.0 release:
+clear what each timer does.
-Warning, treated as error:
+Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Invalid configuration value found: 'language = None'. Update your configuration to a valid langauge code. Falling back to 'en' (English).
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Martin Liska <mliska@suse.cz>
+Message-id: 20250204125009.2281315-8-peter.maydell@linaro.org
-Message-id: e91e51ee-48ac-437e-6467-98b56ee40042@suse.cz
+[PMM: Add timer register name prefix to each comment]
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- docs/conf.py | 2 +-
+ target/arm/gtimer.h | 10 +++++-----
-file changed, 1 insertion(+), 1 deletion(-)
+file changed, 5 insertions(+), 5 deletions(-)
-diff --git a/docs/conf.py b/docs/conf.py
+diff --git a/target/arm/gtimer.h b/target/arm/gtimer.h
 index XXXXXXX..XXXXXXX 100644
---- a/docs/conf.py
+--- a/target/arm/gtimer.h
-+++ b/docs/conf.py
++++ b/target/arm/gtimer.h
 @@ -XXX,XX +XXX,XX @@
- #
+ #define TARGET_ARM_GTIMER_H
- # This is also used if you do content translation via gettext catalogs.
- # Usually you set "language" from the command line for these cases.
+ enum {
--language = None
+-    GTIMER_PHYS     = 0,
-+language = 'en'
+-    GTIMER_VIRT     = 1,
+-    GTIMER_HYP      = 2,
- # List of patterns, relative to source directory, that match files and
+-    GTIMER_SEC      = 3,
- # directories to ignore when looking for source files.
+-    GTIMER_HYPVIRT  = 4,
 +    GTIMER_PHYS     = 0, /* CNTP_* ; EL1 physical timer */
 +    GTIMER_VIRT     = 1, /* CNTV_* ; EL1 virtual timer */
 +    GTIMER_HYP      = 2, /* CNTHP_* ; EL2 physical timer */
 +    GTIMER_SEC      = 3, /* CNTPS_* ; EL3 physical timer */
 +    GTIMER_HYPVIRT  = 4, /* CNTHV_* ; EL2 virtual timer ; only if FEAT_VHE */
      GTIMER_S_EL2_PHYS = 5, /* CNTHPS_* ; only if FEAT_SEL2 */
      GTIMER_S_EL2_VIRT = 6, /* CNTHVS_* ; only if FEAT_SEL2 */
  #define NUM_GTIMERS   7
 --
-.25.1
+.43.0

-[PULL 02/25] accel: Introduce current_accel_name()
+Deleted patch
-From: Alexander Graf <agraf@csgraf.de>
-We need to fetch the name of the current accelerator in flexible error
-messages more going forward. Let's create a helper that gives it to us
-without casting in the target code.
-Signed-off-by: Alexander Graf <agraf@csgraf.de>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620192242.70573-1-agraf@csgraf.de
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- include/qemu/accel.h | 1 +
- accel/accel-common.c | 8 ++++++++
- softmmu/vl.c         | 3 +--
-files changed, 10 insertions(+), 2 deletions(-)
-diff --git a/include/qemu/accel.h b/include/qemu/accel.h
-index XXXXXXX..XXXXXXX 100644
---- a/include/qemu/accel.h
-+++ b/include/qemu/accel.h
-@@ -XXX,XX +XXX,XX @@ typedef struct AccelClass {
- AccelClass *accel_find(const char *opt_name);
- AccelState *current_accel(void);
-+const char *current_accel_name(void);
- void accel_init_interfaces(AccelClass *ac);
-diff --git a/accel/accel-common.c b/accel/accel-common.c
-index XXXXXXX..XXXXXXX 100644
---- a/accel/accel-common.c
-+++ b/accel/accel-common.c
-@@ -XXX,XX +XXX,XX @@ AccelClass *accel_find(const char *opt_name)
-     return ac;
- }
-+/* Return the name of the current accelerator */
-+const char *current_accel_name(void)
-+{
-+    AccelClass *ac = ACCEL_GET_CLASS(current_accel());
-+
-+    return ac->name;
-+}
-+
- static void accel_init_cpu_int_aux(ObjectClass *klass, void *opaque)
- {
-     CPUClass *cc = CPU_CLASS(klass);
-diff --git a/softmmu/vl.c b/softmmu/vl.c
-index XXXXXXX..XXXXXXX 100644
---- a/softmmu/vl.c
-+++ b/softmmu/vl.c
-@@ -XXX,XX +XXX,XX @@ static void configure_accelerators(const char *progname)
-     }
-     if (init_failed && !qtest_chrdev) {
--        AccelClass *ac = ACCEL_GET_CLASS(current_accel());
--        error_report("falling back to %s", ac->name);
-+        error_report("falling back to %s", current_accel_name());
-     }
-     if (icount_enabled() && !tcg_enabled()) {
---
-.25.1

-[PULL 03/25] target/arm: Catch invalid kvm state also for hvf
+Deleted patch
-From: Alexander Graf <agraf@csgraf.de>
-Some features such as running in EL3 or running M profile code are
-incompatible with virtualization as QEMU implements it today. To prevent
-users from picking invalid configurations on other virt solutions like
-Hvf, let's run the same checks there too.
-Resolves: https://gitlab.com/qemu-project/qemu/-/issues/1073
-Signed-off-by: Alexander Graf <agraf@csgraf.de>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620192242.70573-2-agraf@csgraf.de
-[PMM: Allow qtest accelerator too; tweak comment]
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/cpu.c | 16 ++++++++++++----
-file changed, 12 insertions(+), 4 deletions(-)
-diff --git a/target/arm/cpu.c b/target/arm/cpu.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.c
-+++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@
- #include "hw/boards.h"
- #endif
- #include "sysemu/tcg.h"
-+#include "sysemu/qtest.h"
- #include "sysemu/hw_accel.h"
- #include "kvm_arm.h"
- #include "disas/capstone.h"
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
-         }
-     }
--    if (kvm_enabled()) {
-+    if (!tcg_enabled() && !qtest_enabled()) {
-         /*
-+         * We assume that no accelerator except TCG (and the "not really an
-+         * accelerator" qtest) can handle these features, because Arm hardware
-+         * virtualization can't virtualize them.
-+         *
-          * Catch all the cases which might cause us to create more than one
-          * address space for the CPU (otherwise we will assert() later in
-          * cpu_address_space_init()).
-          */
-         if (arm_feature(env, ARM_FEATURE_M)) {
-             error_setg(errp,
--                       "Cannot enable KVM when using an M-profile guest CPU");
-+                       "Cannot enable %s when using an M-profile guest CPU",
-+                       current_accel_name());
-             return;
-         }
-         if (cpu->has_el3) {
-             error_setg(errp,
--                       "Cannot enable KVM when guest CPU has EL3 enabled");
-+                       "Cannot enable %s when guest CPU has EL3 enabled",
-+                       current_accel_name());
-             return;
-         }
-         if (cpu->tag_memory) {
-             error_setg(errp,
--                       "Cannot enable KVM when guest CPUs has MTE enabled");
-+                       "Cannot enable %s when guest CPUs has MTE enabled",
-+                       current_accel_name());
-             return;
-         }
-     }
---
-.25.1

-[PULL 04/25] target/arm: Implement TPIDR2_EL0
+Deleted patch
-From: Richard Henderson <richard.henderson@linaro.org>
-This register is part of SME, but isn't closely related to the
-rest of the extension.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-2-richard.henderson@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/cpu.h    |  1 +
- target/arm/helper.c | 32 ++++++++++++++++++++++++++++++++
-files changed, 33 insertions(+)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-             };
-             uint64_t tpidr_el[4];
-         };
-+        uint64_t tpidr2_el0;
-         /* The secure banks of these registers don't map anywhere */
-         uint64_t tpidrurw_s;
-         uint64_t tpidrprw_s;
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo zcr_reginfo[] = {
-       .writefn = zcr_write, .raw_writefn = raw_write },
- };
-+#ifdef TARGET_AARCH64
-+static CPAccessResult access_tpidr2(CPUARMState *env, const ARMCPRegInfo *ri,
-+                                    bool isread)
-+{
-+    int el = arm_current_el(env);
-+
-+    if (el == 0) {
-+        uint64_t sctlr = arm_sctlr(env, el);
-+        if (!(sctlr & SCTLR_EnTP2)) {
-+            return CP_ACCESS_TRAP;
-+        }
-+    }
-+    /* TODO: FEAT_FGT */
-+    if (el < 3
-+        && arm_feature(env, ARM_FEATURE_EL3)
-+        && !(env->cp15.scr_el3 & SCR_ENTP2)) {
-+        return CP_ACCESS_TRAP_EL3;
-+    }
-+    return CP_ACCESS_OK;
-+}
-+
-+static const ARMCPRegInfo sme_reginfo[] = {
-+    { .name = "TPIDR2_EL0", .state = ARM_CP_STATE_AA64,
-+      .opc0 = 3, .opc1 = 3, .crn = 13, .crm = 0, .opc2 = 5,
-+      .access = PL0_RW, .accessfn = access_tpidr2,
-+      .fieldoffset = offsetof(CPUARMState, cp15.tpidr2_el0) },
-+};
-+#endif /* TARGET_AARCH64 */
-+
- void hw_watchpoint_update(ARMCPU *cpu, int n)
- {
-     CPUARMState *env = &cpu->env;
-@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
-     }
- #ifdef TARGET_AARCH64
-+    if (cpu_isar_feature(aa64_sme, cpu)) {
-+        define_arm_cp_regs(cpu, sme_reginfo);
-+    }
-     if (cpu_isar_feature(aa64_pauth, cpu)) {
-         define_arm_cp_regs(cpu, pauth_reginfo);
-     }
---
-.25.1

-[PULL 24/25] target/arm: Extend arm_pamax to more than aarch64
+[PULL 11/21] hw/arm: enable secure EL2 timers for virt machine
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Alex Bennée <alex.bennee@linaro.org>
-Move the code from hw/arm/virt.c that is supposed
+Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-to handle v7 into the one function.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250204125009.2281315-9-peter.maydell@linaro.org
-Reported-by: He Zhe <zhe.he@windriver.com>
+Cc: qemu-stable@nongnu.org
 Message-id: 20220619001541.131672-2-richard.henderson@linaro.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/virt.c    | 10 +---------
+ hw/arm/virt.c | 2 ++
- target/arm/ptw.c | 24 ++++++++++++++++--------
+file changed, 2 insertions(+)
 files changed, 17 insertions(+), 17 deletions(-)
 diff --git a/hw/arm/virt.c b/hw/arm/virt.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/virt.c
 +++ b/hw/arm/virt.c
-@@ -XXX,XX +XXX,XX @@ static void machvirt_init(MachineState *machine)
+@@ -XXX,XX +XXX,XX @@ static void create_gic(VirtMachineState *vms, MemoryRegion *mem)
-         cpuobj = object_new(possible_cpus->cpus[0].type);
+             [GTIMER_HYP]  = ARCH_TIMER_NS_EL2_IRQ,
-         armcpu = ARM_CPU(cpuobj);
+             [GTIMER_SEC]  = ARCH_TIMER_S_EL1_IRQ,
+             [GTIMER_HYPVIRT] = ARCH_TIMER_NS_EL2_VIRT_IRQ,
--        if (object_property_get_bool(cpuobj, "aarch64", NULL)) {
++            [GTIMER_S_EL2_PHYS] = ARCH_TIMER_S_EL2_IRQ,
--            pa_bits = arm_pamax(armcpu);
++            [GTIMER_S_EL2_VIRT] = ARCH_TIMER_S_EL2_VIRT_IRQ,
--        } else if (arm_feature(&armcpu->env, ARM_FEATURE_LPAE)) {
+         };
--            /* v7 with LPAE */
--            pa_bits = 40;
+         for (unsigned irq = 0; irq < ARRAY_SIZE(timer_irq); irq++) {
 -        } else {
 -            /* Anything else */
 -            pa_bits = 32;
 -        }
 +        pa_bits = arm_pamax(armcpu);
          object_unref(cpuobj);
 diff --git a/target/arm/ptw.c b/target/arm/ptw.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/ptw.c
 +++ b/target/arm/ptw.c
@@ -XXX,XX +XXX,XX @@ static const uint8_t pamax_map[] = {
  /* The cpu-specific constant value of PAMax; also used by hw/arm/virt. */
  unsigned int arm_pamax(ARMCPU *cpu)
  {
 -    unsigned int parange =
 -        FIELD_EX64(cpu->isar.id_aa64mmfr0, ID_AA64MMFR0, PARANGE);
 +    if (arm_feature(&cpu->env, ARM_FEATURE_AARCH64)) {
 +        unsigned int parange =
 +            FIELD_EX64(cpu->isar.id_aa64mmfr0, ID_AA64MMFR0, PARANGE);
 -    /*
 -     * id_aa64mmfr0 is a read-only register so values outside of the
 -     * supported mappings can be considered an implementation error.
 -     */
 -    assert(parange < ARRAY_SIZE(pamax_map));
 -    return pamax_map[parange];
 +        /*
 +         * id_aa64mmfr0 is a read-only register so values outside of the
 +         * supported mappings can be considered an implementation error.
 +         */
 +        assert(parange < ARRAY_SIZE(pamax_map));
 +        return pamax_map[parange];
 +    }
 +    if (arm_feature(&cpu->env, ARM_FEATURE_LPAE)) {
 +        /* v7 with LPAE */
 +        return 40;
 +    }
 +    /* Anything else */
 +    return 32;
  }
  /*
 --
-.25.1
+.43.0

-[PULL 08/25] target/arm: Add SVCR
+[PULL 12/21] hw/arm: enable secure EL2 timers for sbsa machine
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Alex Bennée <alex.bennee@linaro.org>
-This cpreg is used to access two new bits of PSTATE
+Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
 that are not visible via any other mechanism.
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Message-id: 20220620175235.60881-6-richard.henderson@linaro.org
+Message-id: 20250204125009.2281315-10-peter.maydell@linaro.org
 Cc: qemu-stable@nongnu.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h    |  6 ++++++
+ hw/arm/sbsa-ref.c | 2 ++
- target/arm/helper.c | 13 +++++++++++++
+file changed, 2 insertions(+)
 files changed, 19 insertions(+)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
+diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/hw/arm/sbsa-ref.c
-+++ b/target/arm/cpu.h
++++ b/hw/arm/sbsa-ref.c
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
+@@ -XXX,XX +XXX,XX @@ static void create_gic(SBSAMachineState *sms, MemoryRegion *mem)
-      *  nRW (also known as M[4]) is kept, inverted, in env->aarch64
+             [GTIMER_HYP]  = ARCH_TIMER_NS_EL2_IRQ,
-      *  DAIF (exception masks) are kept in env->daif
+             [GTIMER_SEC]  = ARCH_TIMER_S_EL1_IRQ,
-      *  BTYPE is kept in env->btype
+             [GTIMER_HYPVIRT] = ARCH_TIMER_NS_EL2_VIRT_IRQ,
-+     *  SM and ZA are kept in env->svcr
++            [GTIMER_S_EL2_PHYS] = ARCH_TIMER_S_EL2_IRQ,
-      *  all other bits are stored in their correct places in env->pstate
++            [GTIMER_S_EL2_VIRT] = ARCH_TIMER_S_EL2_VIRT_IRQ,
-      */
+         };
-     uint32_t pstate;
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
+         for (irq = 0; irq < ARRAY_SIZE(timer_irq); irq++) {
      uint32_t condexec_bits; /* IT bits.  cpsr[15:10,26:25].  */
      uint32_t btype;  /* BTI branch type.  spsr[11:10].  */
      uint64_t daif; /* exception masks, in the bits they are in PSTATE */
 +    uint64_t svcr; /* PSTATE.{SM,ZA} in the bits they are in SVCR */
      uint64_t elr_el[4]; /* AArch64 exception link regs  */
      uint64_t sp_el[4]; /* AArch64 banked stack pointers */
@@ -XXX,XX +XXX,XX @@ FIELD(CPTR_EL3, TCPAC, 31, 1)
  #define PSTATE_MODE_EL1t 4
  #define PSTATE_MODE_EL0t 0
 +/* PSTATE bits that are accessed via SVCR and not stored in SPSR_ELx. */
 +FIELD(SVCR, SM, 0, 1)
 +FIELD(SVCR, ZA, 1, 1)
 +
  /* Write a new value to v7m.exception, thus transitioning into or out
   * of Handler mode; this may result in a change of active stack pointer.
   */
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static CPAccessResult access_tpidr2(CPUARMState *env, const ARMCPRegInfo *ri,
      return CP_ACCESS_OK;
  }
 +static void svcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
 +                       uint64_t value)
 +{
 +    value &= R_SVCR_SM_MASK | R_SVCR_ZA_MASK;
 +    /* TODO: Side effects. */
 +    env->svcr = value;
 +}
 +
  static const ARMCPRegInfo sme_reginfo[] = {
      { .name = "TPIDR2_EL0", .state = ARM_CP_STATE_AA64,
        .opc0 = 3, .opc1 = 3, .crn = 13, .crm = 0, .opc2 = 5,
        .access = PL0_RW, .accessfn = access_tpidr2,
        .fieldoffset = offsetof(CPUARMState, cp15.tpidr2_el0) },
 +    { .name = "SVCR", .state = ARM_CP_STATE_AA64,
 +      .opc0 = 3, .opc1 = 3, .crn = 4, .crm = 2, .opc2 = 2,
 +      .access = PL0_RW, .type = ARM_CP_SME,
 +      .fieldoffset = offsetof(CPUARMState, svcr),
 +      .writefn = svcr_write, .raw_writefn = raw_write },
  };
  #endif /* TARGET_AARCH64 */
 --
-.25.1
+.43.0

-[PULL 10/25] target/arm: Add SMIDR_EL1, SMPRI_EL1, SMPRIMAP_EL2
+[PULL 13/21] target/arm: Correct LDRD atomicity and fault behaviour
-From: Richard Henderson <richard.henderson@linaro.org>
+Our LDRD implementation is wrong in two respects:
-Implement the streaming mode identification register, and the
+ * if the address is 4-aligned and the load crosses a page boundary
-two streaming priority registers.  For QEMU, they are all RES0.
+   and the second load faults and the first load was to the
    base register (as in cases like "ldrd r2, r3, [r2]", then we
    must not update the base register before taking the fault
  * if the address is 8-aligned the access must be a 64-bit
    single-copy atomic access, not two 32-bit accesses
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Rewrite the handling of the loads in LDRD to use a single
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+tcg_gen_qemu_ld_i64() and split the result into the destination
-Message-id: 20220620175235.60881-8-richard.henderson@linaro.org
+registers. This allows us to get the atomicity requirements
 right, and also implicitly means that we won't update the
 base register too early for the page-crossing case.
 Note that because we no longer increment 'addr' by 4 in the course of
 performing the LDRD we must change the adjustment value we pass to
 op_addr_ri_post() and op_addr_rr_post(): it no longer needs to
 subtract 4 to get the correct value to use if doing base register
 writeback.
 STRD has the same problem with not getting the atomicity right;
 we will deal with that in the following commit.
 Cc: qemu-stable@nongnu.org
 Reported-by: Stu Grossman <stu.grossman@gmail.com>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250227142746.1698904-2-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 33 +++++++++++++++++++++++++++++++++
+ target/arm/tcg/translate.c | 70 +++++++++++++++++++++++++-------------
-file changed, 33 insertions(+)
+file changed, 46 insertions(+), 24 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/target/arm/tcg/translate.c
-+++ b/target/arm/helper.c
++++ b/target/arm/tcg/translate.c
-@@ -XXX,XX +XXX,XX @@ static CPAccessResult access_tpidr2(CPUARMState *env, const ARMCPRegInfo *ri,
+@@ -XXX,XX +XXX,XX @@ static bool op_store_rr(DisasContext *s, arg_ldst_rr *a,
-     return CP_ACCESS_OK;
+     return true;
  }
-+static CPAccessResult access_esm(CPUARMState *env, const ARMCPRegInfo *ri,
++static void do_ldrd_load(DisasContext *s, TCGv_i32 addr, int rt, int rt2)
 +                                 bool isread)
 +{
-+    /* TODO: FEAT_FGT for SMPRI_EL1 but not SMPRIMAP_EL2 */
++    /*
-+    if (arm_current_el(env) < 3
++     * LDRD is required to be an atomic 64-bit access if the
-+        && arm_feature(env, ARM_FEATURE_EL3)
++     * address is 8-aligned, two atomic 32-bit accesses if
-+        && !FIELD_EX64(env->cp15.cptr_el[3], CPTR_EL3, ESM)) {
++     * it's only 4-aligned, and to give an alignment fault
-+        return CP_ACCESS_TRAP_EL3;
++     * if it's not 4-aligned. This is MO_ALIGN_4 | MO_ATOM_SUBALIGN.
 +     * Rt is always the word from the lower address, and Rt2 the
 +     * data from the higher address, regardless of endianness.
 +     * So (like gen_load_exclusive) we avoid gen_aa32_ld_i64()
 +     * so we don't get its SCTLR_B check, and instead do a 64-bit access
 +     * using MO_BE if appropriate and then split the two halves.
 +     *
 +     * For M-profile, and for A-profile before LPAE, the 64-bit
 +     * atomicity is not required. We could model that using
 +     * the looser MO_ATOM_IFALIGN_PAIR, but providing a higher
 +     * level of atomicity than required is harmless (we would not
 +     * currently generate better code for IFALIGN_PAIR here).
 +     *
 +     * This also gives us the correct behaviour of not updating
 +     * rt if the load of rt2 faults; this is required for cases
 +     * like "ldrd r2, r3, [r2]" where rt is also the base register.
 +     */
 +    int mem_idx = get_mem_index(s);
 +    MemOp opc = MO_64 | MO_ALIGN_4 | MO_ATOM_SUBALIGN | s->be_data;
 +    TCGv taddr = gen_aa32_addr(s, addr, opc);
 +    TCGv_i64 t64 = tcg_temp_new_i64();
 +    TCGv_i32 tmp = tcg_temp_new_i32();
 +    TCGv_i32 tmp2 = tcg_temp_new_i32();
 +
 +    tcg_gen_qemu_ld_i64(t64, taddr, mem_idx, opc);
 +    if (s->be_data == MO_BE) {
 +        tcg_gen_extr_i64_i32(tmp2, tmp, t64);
 +    } else {
 +        tcg_gen_extr_i64_i32(tmp, tmp2, t64);
 +    }
-+    return CP_ACCESS_OK;
++    store_reg(s, rt, tmp);
 +    store_reg(s, rt2, tmp2);
 +}
 +
- static void svcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
+ static bool trans_LDRD_rr(DisasContext *s, arg_ldst_rr *a)
                         uint64_t value)
  {
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo sme_reginfo[] = {
+-    int mem_idx = get_mem_index(s);
-       .access = PL3_RW, .type = ARM_CP_SME,
+-    TCGv_i32 addr, tmp;
-       .fieldoffset = offsetof(CPUARMState, vfp.smcr_el[3]),
++    TCGv_i32 addr;
-       .writefn = smcr_write, .raw_writefn = raw_write },
-+    { .name = "SMIDR_EL1", .state = ARM_CP_STATE_AA64,
+     if (!ENABLE_ARCH_5TE) {
-+      .opc0 = 3, .opc1 = 1, .crn = 0, .crm = 0, .opc2 = 6,
+         return false;
-+      .access = PL1_R, .accessfn = access_aa64_tid1,
+@@ -XXX,XX +XXX,XX @@ static bool trans_LDRD_rr(DisasContext *s, arg_ldst_rr *a)
-+      /*
+     }
-+       * IMPLEMENTOR = 0 (software)
+     addr = op_addr_rr_pre(s, a);
-+       * REVISION    = 0 (implementation defined)
-+       * SMPS        = 0 (no streaming execution priority in QEMU)
+-    tmp = tcg_temp_new_i32();
-+       * AFFINITY    = 0 (streaming sve mode not shared with other PEs)
+-    gen_aa32_ld_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
-+       */
+-    store_reg(s, a->rt, tmp);
-+      .type = ARM_CP_CONST, .resetvalue = 0, },
+-
-+    /*
+-    tcg_gen_addi_i32(addr, addr, 4);
-+     * Because SMIDR_EL1.SMPS is 0, SMPRI_EL1 and SMPRIMAP_EL2 are RES 0.
+-
-+     */
+-    tmp = tcg_temp_new_i32();
-+    { .name = "SMPRI_EL1", .state = ARM_CP_STATE_AA64,
+-    gen_aa32_ld_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
-+      .opc0 = 3, .opc1 = 0, .crn = 1, .crm = 2, .opc2 = 4,
+-    store_reg(s, a->rt + 1, tmp);
-+      .access = PL1_RW, .accessfn = access_esm,
++    do_ldrd_load(s, addr, a->rt, a->rt + 1);
-+      .type = ARM_CP_CONST, .resetvalue = 0 },
-+    { .name = "SMPRIMAP_EL2", .state = ARM_CP_STATE_AA64,
+     /* LDRD w/ base writeback is undefined if the registers overlap.  */
-+      .opc0 = 3, .opc1 = 4, .crn = 1, .crm = 2, .opc2 = 5,
+-    op_addr_rr_post(s, a, addr, -4);
-+      .access = PL2_RW, .accessfn = access_esm,
++    op_addr_rr_post(s, a, addr, 0);
-+      .type = ARM_CP_CONST, .resetvalue = 0 },
+     return true;
- };
+ }
- #endif /* TARGET_AARCH64 */
@@ -XXX,XX +XXX,XX @@ static bool op_store_ri(DisasContext *s, arg_ldst_ri *a,
  static bool op_ldrd_ri(DisasContext *s, arg_ldst_ri *a, int rt2)
  {
 -    int mem_idx = get_mem_index(s);
 -    TCGv_i32 addr, tmp;
 +    TCGv_i32 addr;
      addr = op_addr_ri_pre(s, a);
 -    tmp = tcg_temp_new_i32();
 -    gen_aa32_ld_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
 -    store_reg(s, a->rt, tmp);
 -
 -    tcg_gen_addi_i32(addr, addr, 4);
 -
 -    tmp = tcg_temp_new_i32();
 -    gen_aa32_ld_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
 -    store_reg(s, rt2, tmp);
 +    do_ldrd_load(s, addr, a->rt, rt2);
      /* LDRD w/ base writeback is undefined if the registers overlap.  */
 -    op_addr_ri_post(s, a, addr, -4);
 +    op_addr_ri_post(s, a, addr, 0);
      return true;
  }
 --
-.25.1
+.43.0

-[PULL 16/25] target/arm: Generalize cpu_arm_{get,set}_vq
+[PULL 14/21] target/arm: Correct STRD atomicity
-From: Richard Henderson <richard.henderson@linaro.org>
+Our STRD implementation doesn't correctly implement the requirement:
  * if the address is 8-aligned the access must be a 64-bit
    single-copy atomic access, not two 32-bit accesses
-Rename from cpu_arm_{get,set}_sve_vq, and take the
+Rewrite the handling of STRD to use a single tcg_gen_qemu_st_i64()
-ARMVQMap as the opaque parameter.
+of a value produced by concatenating the two 32 bit source registers.
 This allows us to get the atomicity right.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+As with the LDRD change, now that we don't update 'addr' in the
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+course of performing the store we need to adjust the offset
-Message-id: 20220620175235.60881-14-richard.henderson@linaro.org
+we pass to op_addr_ri_post() and op_addr_rr_post().
 Cc: qemu-stable@nongnu.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250227142746.1698904-3-peter.maydell@linaro.org
 ---
- target/arm/cpu64.c | 29 +++++++++++++++--------------
+ target/arm/tcg/translate.c | 59 +++++++++++++++++++++++++-------------
-file changed, 15 insertions(+), 14 deletions(-)
+file changed, 39 insertions(+), 20 deletions(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/arm/tcg/translate.c
-+++ b/target/arm/cpu64.c
++++ b/target/arm/tcg/translate.c
-@@ -XXX,XX +XXX,XX @@ static void cpu_max_set_sve_max_vq(Object *obj, Visitor *v, const char *name,
+@@ -XXX,XX +XXX,XX @@ static bool trans_LDRD_rr(DisasContext *s, arg_ldst_rr *a)
      return true;
  }
- /*
++static void do_strd_store(DisasContext *s, TCGv_i32 addr, int rt, int rt2)
-- * Note that cpu_arm_get/set_sve_vq cannot use the simpler
++{
-- * object_property_add_bool interface because they make use
++    /*
-- * of the contents of "name" to determine which bit on which
++     * STRD is required to be an atomic 64-bit access if the
-- * to operate.
++     * address is 8-aligned, two atomic 32-bit accesses if
-+ * Note that cpu_arm_{get,set}_vq cannot use the simpler
++     * it's only 4-aligned, and to give an alignment fault
-+ * object_property_add_bool interface because they make use of the
++     * if it's not 4-aligned.
-+ * contents of "name" to determine which bit on which to operate.
++     * Rt is always the word from the lower address, and Rt2 the
-  */
++     * data from the higher address, regardless of endianness.
--static void cpu_arm_get_sve_vq(Object *obj, Visitor *v, const char *name,
++     * So (like gen_store_exclusive) we avoid gen_aa32_ld_i64()
--                               void *opaque, Error **errp)
++     * so we don't get its SCTLR_B check, and instead do a 64-bit access
-+static void cpu_arm_get_vq(Object *obj, Visitor *v, const char *name,
++     * using MO_BE if appropriate, using a value constructed
-+                           void *opaque, Error **errp)
++     * by putting the two halves together in the right order.
 +     *
 +     * As with LDRD, the 64-bit atomicity is not required for
 +     * M-profile, or for A-profile before LPAE, and we provide
 +     * the higher guarantee always for simplicity.
 +     */
 +    int mem_idx = get_mem_index(s);
 +    MemOp opc = MO_64 | MO_ALIGN_4 | MO_ATOM_SUBALIGN | s->be_data;
 +    TCGv taddr = gen_aa32_addr(s, addr, opc);
 +    TCGv_i32 t1 = load_reg(s, rt);
 +    TCGv_i32 t2 = load_reg(s, rt2);
 +    TCGv_i64 t64 = tcg_temp_new_i64();
 +
 +    if (s->be_data == MO_BE) {
 +        tcg_gen_concat_i32_i64(t64, t2, t1);
 +    } else {
 +        tcg_gen_concat_i32_i64(t64, t1, t2);
 +    }
 +    tcg_gen_qemu_st_i64(t64, taddr, mem_idx, opc);
 +}
 +
  static bool trans_STRD_rr(DisasContext *s, arg_ldst_rr *a)
  {
-     ARMCPU *cpu = ARM_CPU(obj);
+-    int mem_idx = get_mem_index(s);
-+    ARMVQMap *vq_map = opaque;
+-    TCGv_i32 addr, tmp;
-     uint32_t vq = atoi(&name[3]) / 128;
++    TCGv_i32 addr;
-     bool value;
+     if (!ENABLE_ARCH_5TE) {
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_get_sve_vq(Object *obj, Visitor *v, const char *name,
+         return false;
-     if (!cpu_isar_feature(aa64_sve, cpu)) {
+@@ -XXX,XX +XXX,XX @@ static bool trans_STRD_rr(DisasContext *s, arg_ldst_rr *a)
          value = false;
      } else {
 -        value = extract32(cpu->sve_vq.map, vq - 1, 1);
 +        value = extract32(vq_map->map, vq - 1, 1);
      }
-     visit_type_bool(v, name, &value, errp);
+     addr = op_addr_rr_pre(s, a);
 -    tmp = load_reg(s, a->rt);
 -    gen_aa32_st_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
 +    do_strd_store(s, addr, a->rt, a->rt + 1);
 -    tcg_gen_addi_i32(addr, addr, 4);
 -
 -    tmp = load_reg(s, a->rt + 1);
 -    gen_aa32_st_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
 -
 -    op_addr_rr_post(s, a, addr, -4);
 +    op_addr_rr_post(s, a, addr, 0);
      return true;
  }
--static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
+@@ -XXX,XX +XXX,XX @@ static bool trans_LDRD_ri_t32(DisasContext *s, arg_ldst_ri2 *a)
--                               void *opaque, Error **errp)
-+static void cpu_arm_set_vq(Object *obj, Visitor *v, const char *name,
+ static bool op_strd_ri(DisasContext *s, arg_ldst_ri *a, int rt2)
 +                           void *opaque, Error **errp)
  {
--    ARMCPU *cpu = ARM_CPU(obj);
+-    int mem_idx = get_mem_index(s);
-+    ARMVQMap *vq_map = opaque;
+-    TCGv_i32 addr, tmp;
-     uint32_t vq = atoi(&name[3]) / 128;
++    TCGv_i32 addr;
-     bool value;
+     addr = op_addr_ri_pre(s, a);
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
-         return;
+-    tmp = load_reg(s, a->rt);
-     }
+-    gen_aa32_st_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
++    do_strd_store(s, addr, a->rt, rt2);
--    cpu->sve_vq.map = deposit32(cpu->sve_vq.map, vq - 1, 1, value);
--    cpu->sve_vq.init |= 1 << (vq - 1);
+-    tcg_gen_addi_i32(addr, addr, 4);
-+    vq_map->map = deposit32(vq_map->map, vq - 1, 1, value);
+-
-+    vq_map->init |= 1 << (vq - 1);
+-    tmp = load_reg(s, rt2);
 -    gen_aa32_st_i32(s, tmp, addr, mem_idx, MO_UL | MO_ALIGN);
 -
 -    op_addr_ri_post(s, a, addr, -4);
 +    op_addr_ri_post(s, a, addr, 0);
      return true;
  }
- static bool cpu_arm_get_sve(Object *obj, Error **errp)
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_get_sve_default_vec_len(Object *obj, Visitor *v,
- void aarch64_add_sve_properties(Object *obj)
- {
-+    ARMCPU *cpu = ARM_CPU(obj);
-     uint32_t vq;
-     object_property_add_bool(obj, "sve", cpu_arm_get_sve, cpu_arm_set_sve);
-@@ -XXX,XX +XXX,XX @@ void aarch64_add_sve_properties(Object *obj)
-     for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
-         char name[8];
-         sprintf(name, "sve%d", vq * 128);
--        object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
--                            cpu_arm_set_sve_vq, NULL, NULL);
-+        object_property_add(obj, name, "bool", cpu_arm_get_vq,
-+                            cpu_arm_set_vq, NULL, &cpu->sve_vq);
-     }
- #ifdef CONFIG_USER_ONLY
 --
-.25.1
+.43.0

-[PULL 15/25] target/arm: Create ARMVQMap
+[PULL 15/21] target/arm: Drop unused address_offset from op_addr_{rr, ri}_post()
-From: Richard Henderson <richard.henderson@linaro.org>
+All the callers of op_addr_rr_post() and op_addr_ri_post() now pass in
 zero for the address_offset, so we can remove that argument.
-Pull the three sve_vq_* values into a structure.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-This will be reused for SME.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Message-id: 20250227142746.1698904-4-peter.maydell@linaro.org
 ---
  target/arm/tcg/translate.c | 26 +++++++++++++-------------
 file changed, 13 insertions(+), 13 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220620175235.60881-13-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/cpu.h    | 29 ++++++++++++++---------------
  target/arm/cpu64.c  | 22 +++++++++++-----------
  target/arm/helper.c |  2 +-
  target/arm/kvm64.c  |  2 +-
 files changed, 27 insertions(+), 28 deletions(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/target/arm/tcg/translate.c
-+++ b/target/arm/cpu.h
++++ b/target/arm/tcg/translate.c
-@@ -XXX,XX +XXX,XX @@ typedef enum ARMPSCIState {
+@@ -XXX,XX +XXX,XX @@ static TCGv_i32 op_addr_rr_pre(DisasContext *s, arg_ldst_rr *a)
  typedef struct ARMISARegisters ARMISARegisters;
 +/*
 + * In map, each set bit is a supported vector length of (bit-number + 1) * 16
 + * bytes, i.e. each bit number + 1 is the vector length in quadwords.
 + *
 + * While processing properties during initialization, corresponding init bits
 + * are set for bits in sve_vq_map that have been set by properties.
 + *
 + * Bits set in supported represent valid vector lengths for the CPU type.
 + */
 +typedef struct {
 +    uint32_t map, init, supported;
 +} ARMVQMap;
 +
  /**
   * ARMCPU:
   * @env: #CPUARMState
@@ -XXX,XX +XXX,XX @@ struct ArchCPU {
      uint32_t sve_default_vq;
  #endif
 -    /*
 -     * In sve_vq_map each set bit is a supported vector length of
 -     * (bit-number + 1) * 16 bytes, i.e. each bit number + 1 is the vector
 -     * length in quadwords.
 -     *
 -     * While processing properties during initialization, corresponding
 -     * sve_vq_init bits are set for bits in sve_vq_map that have been
 -     * set by properties.
 -     *
 -     * Bits set in sve_vq_supported represent valid vector lengths for
 -     * the CPU type.
 -     */
 -    uint32_t sve_vq_map;
 -    uint32_t sve_vq_init;
 -    uint32_t sve_vq_supported;
 +    ARMVQMap sve_vq;
      /* Generic timer counter frequency, in Hz */
      uint64_t gt_cntfrq_hz;
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
       * any of the above.  Finally, if SVE is not disabled, then at least one
       * vector length must be enabled.
       */
 -    uint32_t vq_map = cpu->sve_vq_map;
 -    uint32_t vq_init = cpu->sve_vq_init;
 +    uint32_t vq_map = cpu->sve_vq.map;
 +    uint32_t vq_init = cpu->sve_vq.init;
      uint32_t vq_supported;
      uint32_t vq_mask = 0;
      uint32_t tmp, vq, max_vq = 0;
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
       */
      if (kvm_enabled()) {
          if (kvm_arm_sve_supported()) {
 -            cpu->sve_vq_supported = kvm_arm_sve_get_vls(CPU(cpu));
 -            vq_supported = cpu->sve_vq_supported;
 +            cpu->sve_vq.supported = kvm_arm_sve_get_vls(CPU(cpu));
 +            vq_supported = cpu->sve_vq.supported;
          } else {
              assert(!cpu_isar_feature(aa64_sve, cpu));
              vq_supported = 0;
          }
      } else {
 -        vq_supported = cpu->sve_vq_supported;
 +        vq_supported = cpu->sve_vq.supported;
      }
      /*
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
      /* From now on sve_max_vq is the actual maximum supported length. */
      cpu->sve_max_vq = max_vq;
 -    cpu->sve_vq_map = vq_map;
 +    cpu->sve_vq.map = vq_map;
  }
- static void cpu_max_get_sve_max_vq(Object *obj, Visitor *v, const char *name,
+ static void op_addr_rr_post(DisasContext *s, arg_ldst_rr *a,
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_get_sve_vq(Object *obj, Visitor *v, const char *name,
+-                            TCGv_i32 addr, int address_offset)
-     if (!cpu_isar_feature(aa64_sve, cpu)) {
++                            TCGv_i32 addr)
-         value = false;
+ {
-     } else {
+     if (!a->p) {
--        value = extract32(cpu->sve_vq_map, vq - 1, 1);
+         TCGv_i32 ofs = load_reg(s, a->rm);
-+        value = extract32(cpu->sve_vq.map, vq - 1, 1);
+@@ -XXX,XX +XXX,XX @@ static void op_addr_rr_post(DisasContext *s, arg_ldst_rr *a,
-     }
+     } else if (!a->w) {
      visit_type_bool(v, name, &value, errp);
  }
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
          return;
      }
+-    tcg_gen_addi_i32(addr, addr, address_offset);
--    cpu->sve_vq_map = deposit32(cpu->sve_vq_map, vq - 1, 1, value);
+     store_reg(s, a->rn, addr);
 -    cpu->sve_vq_init |= 1 << (vq - 1);
 +    cpu->sve_vq.map = deposit32(cpu->sve_vq.map, vq - 1, 1, value);
 +    cpu->sve_vq.init |= 1 << (vq - 1);
  }
- static bool cpu_arm_get_sve(Object *obj, Error **errp)
+@@ -XXX,XX +XXX,XX @@ static bool op_load_rr(DisasContext *s, arg_ldst_rr *a,
-@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
+      * Perform base writeback before the loaded value to
-     cpu->dcz_blocksize = 7; /*  512 bytes */
+      * ensure correct behavior with overlapping index registers.
- #endif
+      */
+-    op_addr_rr_post(s, a, addr, 0);
--    cpu->sve_vq_supported = MAKE_64BIT_MASK(0, ARM_MAX_VQ);
++    op_addr_rr_post(s, a, addr);
-+    cpu->sve_vq.supported = MAKE_64BIT_MASK(0, ARM_MAX_VQ);
+     store_reg_from_load(s, a->rt, tmp);
+     return true;
      aarch64_add_pauth_properties(obj);
      aarch64_add_sve_properties(obj);
@@ -XXX,XX +XXX,XX @@ static void aarch64_a64fx_initfn(Object *obj)
      /* The A64FX supports only 128, 256 and 512 bit vector lengths */
      aarch64_add_sve_properties(obj);
 -    cpu->sve_vq_supported = (1 << 0)  /* 128bit */
 +    cpu->sve_vq.supported = (1 << 0)  /* 128bit */
                            | (1 << 1)  /* 256bit */
                            | (1 << 3); /* 512bit */
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ uint32_t sve_vqm1_for_el(CPUARMState *env, int el)
          len = MIN(len, 0xf & (uint32_t)env->vfp.zcr_el[3]);
      }
 -    len = 31 - clz32(cpu->sve_vq_map & MAKE_64BIT_MASK(0, len + 1));
 +    len = 31 - clz32(cpu->sve_vq.map & MAKE_64BIT_MASK(0, len + 1));
      return len;
  }
+@@ -XXX,XX +XXX,XX @@ static bool op_store_rr(DisasContext *s, arg_ldst_rr *a,
-diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
+     gen_aa32_st_i32(s, tmp, addr, mem_idx, mop);
-index XXXXXXX..XXXXXXX 100644
+     disas_set_da_iss(s, mop, issinfo);
---- a/target/arm/kvm64.c
-+++ b/target/arm/kvm64.c
+-    op_addr_rr_post(s, a, addr, 0);
-@@ -XXX,XX +XXX,XX @@ uint32_t kvm_arm_sve_get_vls(CPUState *cs)
++    op_addr_rr_post(s, a, addr);
- static int kvm_arm_sve_set_vls(CPUState *cs)
+     return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool trans_LDRD_rr(DisasContext *s, arg_ldst_rr *a)
      do_ldrd_load(s, addr, a->rt, a->rt + 1);
      /* LDRD w/ base writeback is undefined if the registers overlap.  */
 -    op_addr_rr_post(s, a, addr, 0);
 +    op_addr_rr_post(s, a, addr);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool trans_STRD_rr(DisasContext *s, arg_ldst_rr *a)
      do_strd_store(s, addr, a->rt, a->rt + 1);
 -    op_addr_rr_post(s, a, addr, 0);
 +    op_addr_rr_post(s, a, addr);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static TCGv_i32 op_addr_ri_pre(DisasContext *s, arg_ldst_ri *a)
  }
  static void op_addr_ri_post(DisasContext *s, arg_ldst_ri *a,
 -                            TCGv_i32 addr, int address_offset)
 +                            TCGv_i32 addr)
  {
-     ARMCPU *cpu = ARM_CPU(cs);
++    int address_offset = 0;
--    uint64_t vls[KVM_ARM64_SVE_VLS_WORDS] = { cpu->sve_vq_map };
+     if (!a->p) {
-+    uint64_t vls[KVM_ARM64_SVE_VLS_WORDS] = { cpu->sve_vq.map };
+         if (a->u) {
-     struct kvm_one_reg reg = {
+-            address_offset += a->imm;
-         .id = KVM_REG_ARM64_SVE_VLS,
++            address_offset = a->imm;
-         .addr = (uint64_t)&vls[0],
+         } else {
 -            address_offset -= a->imm;
 +            address_offset = -a->imm;
          }
      } else if (!a->w) {
          return;
@@ -XXX,XX +XXX,XX @@ static bool op_load_ri(DisasContext *s, arg_ldst_ri *a,
       * Perform base writeback before the loaded value to
       * ensure correct behavior with overlapping index registers.
       */
 -    op_addr_ri_post(s, a, addr, 0);
 +    op_addr_ri_post(s, a, addr);
      store_reg_from_load(s, a->rt, tmp);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool op_store_ri(DisasContext *s, arg_ldst_ri *a,
      gen_aa32_st_i32(s, tmp, addr, mem_idx, mop);
      disas_set_da_iss(s, mop, issinfo);
 -    op_addr_ri_post(s, a, addr, 0);
 +    op_addr_ri_post(s, a, addr);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool op_ldrd_ri(DisasContext *s, arg_ldst_ri *a, int rt2)
      do_ldrd_load(s, addr, a->rt, rt2);
      /* LDRD w/ base writeback is undefined if the registers overlap.  */
 -    op_addr_ri_post(s, a, addr, 0);
 +    op_addr_ri_post(s, a, addr);
      return true;
  }
@@ -XXX,XX +XXX,XX @@ static bool op_strd_ri(DisasContext *s, arg_ldst_ri *a, int rt2)
      do_strd_store(s, addr, a->rt, rt2);
 -    op_addr_ri_post(s, a, addr, 0);
 +    op_addr_ri_post(s, a, addr);
      return true;
  }
 --
-.25.1
+.43.0

-[PULL 06/25] target/arm: Add syn_smetrap
+[PULL 16/21] target/arm: Make dummy debug registers RAZ, not NOP
-From: Richard Henderson <richard.henderson@linaro.org>
+In debug_helper.c we provide a few dummy versions of
 debug registers:
  * DBGVCR (AArch32 only): enable bits for vector-catch
    debug events
  * MDCCINT_EL1: interrupt enable bits for the DCC
    debug communications channel
  * DBGVCR32_EL2: the AArch64 accessor for the state in
    DBGVCR
-This will be used for raising various traps for SME.
+We implemented these only to stop Linux crashing on startup,
 but we chose to implement them as ARM_CP_NOP. This worked
 for Linux where it only cares about trying to write to these
 registers, but is very confusing behaviour for anything that
 wants to read the registers (perhaps for context state switches),
 because the destination register will be left with whatever
 random value it happened to have before the read.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+Model these registers instead as RAZ.
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-4-richard.henderson@linaro.org
+Fixes: 5e8b12ffbb8c68 ("target-arm: Implement minimal DBGVCR, OSDLR_EL1, MDCCSR_EL0")
 Fixes: 5dbdc4342f479d ("target-arm: Implement dummy MDCCINT_EL1")
 Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2708
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20250228162424.1917269-1-peter.maydell@linaro.org
 ---
- target/arm/syndrome.h | 14 ++++++++++++++
+ target/arm/debug_helper.c | 7 ++++---
-file changed, 14 insertions(+)
+file changed, 4 insertions(+), 3 deletions(-)
-diff --git a/target/arm/syndrome.h b/target/arm/syndrome.h
+diff --git a/target/arm/debug_helper.c b/target/arm/debug_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/syndrome.h
+--- a/target/arm/debug_helper.c
-+++ b/target/arm/syndrome.h
++++ b/target/arm/debug_helper.c
-@@ -XXX,XX +XXX,XX @@ enum arm_exception_class {
+@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo debug_cp_reginfo[] = {
-     EC_AA64_SMC               = 0x17,
+     { .name = "DBGVCR",
-     EC_SYSTEMREGISTERTRAP     = 0x18,
+       .cp = 14, .opc1 = 0, .crn = 0, .crm = 7, .opc2 = 0,
-     EC_SVEACCESSTRAP          = 0x19,
+       .access = PL1_RW, .accessfn = access_tda,
-+    EC_SMETRAP                = 0x1d,
+-      .type = ARM_CP_NOP },
-     EC_INSNABORT              = 0x20,
++      .type = ARM_CP_CONST, .resetvalue = 0 },
-     EC_INSNABORT_SAME_EL      = 0x21,
+     /*
-     EC_PCALIGNMENT            = 0x22,
+      * Dummy MDCCINT_EL1, since we don't implement the Debug Communications
-@@ -XXX,XX +XXX,XX @@ enum arm_exception_class {
+      * Channel but Linux may try to access this register. The 32-bit
-     EC_AA64_BKPT              = 0x3c,
+@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo debug_cp_reginfo[] = {
      { .name = "MDCCINT_EL1", .state = ARM_CP_STATE_BOTH,
        .cp = 14, .opc0 = 2, .opc1 = 0, .crn = 0, .crm = 2, .opc2 = 0,
        .access = PL1_RW, .accessfn = access_tdcc,
 -      .type = ARM_CP_NOP },
 +      .type = ARM_CP_CONST, .resetvalue = 0 },
      /*
       * Dummy DBGCLAIM registers.
       * "The architecture does not define any functionality for the CLAIM tag bits.",
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo debug_aa32_el1_reginfo[] = {
      { .name = "DBGVCR32_EL2", .state = ARM_CP_STATE_AA64,
        .opc0 = 2, .opc1 = 4, .crn = 0, .crm = 7, .opc2 = 0,
        .access = PL2_RW, .accessfn = access_dbgvcr32,
 -      .type = ARM_CP_NOP | ARM_CP_EL3_NO_EL2_KEEP },
 +      .type = ARM_CP_CONST | ARM_CP_EL3_NO_EL2_KEEP,
 +      .resetvalue = 0 },
  };
-+typedef enum {
+ static const ARMCPRegInfo debug_lpae_cp_reginfo[] = {
 +    SME_ET_AccessTrap,
 +    SME_ET_Streaming,
 +    SME_ET_NotStreaming,
 +    SME_ET_InactiveZA,
 +} SMEExceptionType;
 +
  #define ARM_EL_EC_SHIFT 26
  #define ARM_EL_IL_SHIFT 25
  #define ARM_EL_ISV_SHIFT 24
@@ -XXX,XX +XXX,XX @@ static inline uint32_t syn_sve_access_trap(void)
      return EC_SVEACCESSTRAP << ARM_EL_EC_SHIFT;
  }
 +static inline uint32_t syn_smetrap(SMEExceptionType etype, bool is_16bit)
 +{
 +    return (EC_SMETRAP << ARM_EL_EC_SHIFT)
 +        | (is_16bit ? 0 : ARM_EL_IL) | etype;
 +}
 +
  static inline uint32_t syn_pactrap(void)
  {
      return EC_PACTRAP << ARM_EL_EC_SHIFT;
 --
-.25.1
+.43.0

-[PULL 19/25] target/arm: Unexport aarch64_add_*_properties
+[PULL 17/21] util/qemu-timer.c: Don't warp timer from timerlist_rearm()
-From: Richard Henderson <richard.henderson@linaro.org>
+Currently we call icount_start_warp_timer() from timerlist_rearm().
 This produces incorrect behaviour, because timerlist_rearm() is
 called, for instance, when a timer callback modifies its timer.  We
 cannot decide here to warp the timer forwards to the next timer
 deadline merely because all_cpu_threads_idle() is true, because the
 timer callback we were called from (or some other callback later in
 the list of callbacks being invoked) may be about to raise a CPU
 interrupt and move a CPU from idle to ready.
-These functions are not used outside cpu64.c,
+The only valid place to choose to warp the timer forward is from the
-so make them static.
+main loop, when we know we have no outstanding IO or timer callbacks
 that might be about to wake up a CPU.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+For Arm guests, this bug was mostly latent until the refactoring
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+commit f6fc36deef6abc ("target/arm/helper: Implement
-Message-id: 20220620175235.60881-17-richard.henderson@linaro.org
+CNTHCTL_EL2.CNT[VP]MASK"), which exposed it because it refactored a
 timer callback so that it happened to call timer_mod() first and
 raise the interrupt second, when it had previously raised the
 interrupt first and called timer_mod() afterwards.
 This call seems to have originally derived from the
 pre-record-and-replay icount code, which (as of e.g.  commit
 db1a49726c3c in 2010) in this location did a call to
 qemu_notify_event(), necessary to get the icount code in the vCPU
 round-robin thread to stop and recalculate the icount deadline when a
 timer was reprogrammed from the IO thread.  In current QEMU,
 everything is done on the vCPU thread when we are in icount mode, so
 there's no need to try to notify another thread here.
 I suspect that the other reason why this call was doing icount timer
 warping is that it pre-dates commit efab87cf79077a from 2015, which
 added a call to icount_start_warp_timer() to main_loop_wait().  Once
 the call in timerlist_rearm() has been removed, if the timer
 callbacks don't cause any CPU to be woken up then we will end up
 calling icount_start_warp_timer() from main_loop_wait() when the rr
 main loop code calls rr_wait_io_event().
 Remove the incorrect call from timerlist_rearm().
 Cc: qemu-stable@nongnu.org
 Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2703
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Tested-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 20250210135804.3526943-1-peter.maydell@linaro.org
 ---
- target/arm/cpu.h   | 3 ---
+ util/qemu-timer.c | 4 ----
- target/arm/cpu64.c | 4 ++--
+file changed, 4 deletions(-)
 files changed, 2 insertions(+), 5 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
+diff --git a/util/qemu-timer.c b/util/qemu-timer.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/util/qemu-timer.c
-+++ b/target/arm/cpu.h
++++ b/util/qemu-timer.c
-@@ -XXX,XX +XXX,XX @@ int aarch64_cpu_gdb_write_register(CPUState *cpu, uint8_t *buf, int reg);
+@@ -XXX,XX +XXX,XX @@ static bool timer_mod_ns_locked(QEMUTimerList *timer_list,
- void aarch64_sve_narrow_vq(CPUARMState *env, unsigned vq);
- void aarch64_sve_change_el(CPUARMState *env, int old_el,
+ static void timerlist_rearm(QEMUTimerList *timer_list)
-                            int new_el, bool el0_a64);
+ {
--void aarch64_add_sve_properties(Object *obj);
+-    /* Interrupt execution to force deadline recalculation.  */
--void aarch64_add_pauth_properties(Object *obj);
+-    if (icount_enabled() && timer_list->clock->type == QEMU_CLOCK_VIRTUAL) {
- void arm_reset_sve_state(CPUARMState *env);
+-        icount_start_warp_timer();
+-    }
- /*
+     timerlist_notify(timer_list);
@@ -XXX,XX +XXX,XX @@ static inline void aarch64_sve_narrow_vq(CPUARMState *env, unsigned vq) { }
  static inline void aarch64_sve_change_el(CPUARMState *env, int o,
                                           int n, bool a)
  { }
 -static inline void aarch64_add_sve_properties(Object *obj) { }
  #endif
  void aarch64_sync_32_to_64(CPUARMState *env);
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_get_default_vec_len(Object *obj, Visitor *v,
  }
- #endif
--void aarch64_add_sve_properties(Object *obj)
-+static void aarch64_add_sve_properties(Object *obj)
- {
-     ARMCPU *cpu = ARM_CPU(obj);
-     uint32_t vq;
-@@ -XXX,XX +XXX,XX @@ static Property arm_cpu_pauth_property =
- static Property arm_cpu_pauth_impdef_property =
-     DEFINE_PROP_BOOL("pauth-impdef", ARMCPU, prop_pauth_impdef, false);
--void aarch64_add_pauth_properties(Object *obj)
-+static void aarch64_add_pauth_properties(Object *obj)
- {
-     ARMCPU *cpu = ARM_CPU(obj);
 --
-.25.1
+.43.0

-[PULL 18/25] target/arm: Move arm_cpu_*_finalize to internals.h
+[PULL 18/21] include/exec/memop.h: Expand comment for MO_ATOM_SUBALIGN
-From: Richard Henderson <richard.henderson@linaro.org>
+Expand the example in the comment documenting MO_ATOM_SUBALIGN,
 to be clearer about the atomicity guarantees it represents.
-Drop the aa32-only inline fallbacks,
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-and just use a couple of ifdefs.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20250228103222.1838913-1-peter.maydell@linaro.org
 ---
  include/exec/memop.h | 8 ++++++--
 file changed, 6 insertions(+), 2 deletions(-)
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+diff --git a/include/exec/memop.h b/include/exec/memop.h
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20220620175235.60881-16-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/cpu.h       | 6 ------
  target/arm/internals.h | 3 +++
  target/arm/cpu.c       | 2 ++
 files changed, 5 insertions(+), 6 deletions(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
+--- a/include/exec/memop.h
-+++ b/target/arm/cpu.h
++++ b/include/exec/memop.h
-@@ -XXX,XX +XXX,XX @@ typedef struct {
+@@ -XXX,XX +XXX,XX @@ typedef enum MemOp {
+      *    Depending on alignment, one or both will be single-copy atomic.
- #ifdef TARGET_AARCH64
+      *    This is the atomicity e.g. of Arm FEAT_LSE2 LDP.
- # define ARM_MAX_VQ    16
+      * MO_ATOM_SUBALIGN: the operation is single-copy atomic by parts
--void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp);
+-     *    by the alignment.  E.g. if the address is 0 mod 4, then each
--void arm_cpu_pauth_finalize(ARMCPU *cpu, Error **errp);
+-     *    4-byte subobject is single-copy atomic.
--void arm_cpu_lpa2_finalize(ARMCPU *cpu, Error **errp);
++     *    by the alignment.  E.g. if an 8-byte value is accessed at an
- #else
++     *    address which is 0 mod 8, then the whole 8-byte access is
- # define ARM_MAX_VQ    1
++     *    single-copy atomic; otherwise, if it is accessed at 0 mod 4
--static inline void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp) { }
++     *    then each 4-byte subobject is single-copy atomic; otherwise
--static inline void arm_cpu_pauth_finalize(ARMCPU *cpu, Error **errp) { }
++     *    if it is accessed at 0 mod 2 then the four 2-byte subobjects
--static inline void arm_cpu_lpa2_finalize(ARMCPU *cpu, Error **errp) { }
++     *    are single-copy atomic.
- #endif
+      *    This is the atomicity e.g. of IBM Power.
+      * MO_ATOM_NONE: the operation has no atomicity requirements.
- typedef struct ARMVectorReg {
+      *
 diff --git a/target/arm/internals.h b/target/arm/internals.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/internals.h
 +++ b/target/arm/internals.h
@@ -XXX,XX +XXX,XX @@ int arm_gdb_get_svereg(CPUARMState *env, GByteArray *buf, int reg);
  int arm_gdb_set_svereg(CPUARMState *env, uint8_t *buf, int reg);
  int aarch64_fpu_gdb_get_reg(CPUARMState *env, GByteArray *buf, int reg);
  int aarch64_fpu_gdb_set_reg(CPUARMState *env, uint8_t *buf, int reg);
 +void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp);
 +void arm_cpu_pauth_finalize(ARMCPU *cpu, Error **errp);
 +void arm_cpu_lpa2_finalize(ARMCPU *cpu, Error **errp);
  #endif
  #ifdef CONFIG_USER_ONLY
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ void arm_cpu_finalize_features(ARMCPU *cpu, Error **errp)
  {
      Error *local_err = NULL;
 +#ifdef TARGET_AARCH64
      if (arm_feature(&cpu->env, ARM_FEATURE_AARCH64)) {
          arm_cpu_sve_finalize(cpu, &local_err);
          if (local_err != NULL) {
@@ -XXX,XX +XXX,XX @@ void arm_cpu_finalize_features(ARMCPU *cpu, Error **errp)
              return;
          }
      }
 +#endif
      if (kvm_enabled()) {
          kvm_arm_steal_time_finalize(cpu, &local_err);
 --
-.25.1
+.43.0

-[PULL 07/25] target/arm: Add ARM_CP_SME
+[PULL 19/21] hw/arm/smmu: Introduce smmu_configs_inv_sid_range() helper
-From: Richard Henderson <richard.henderson@linaro.org>
+From: JianChunfu <jansef.jian@hj-micro.com>
-This will be used for controlling access to SME cpregs.
+Use a similar terminology smmu_hash_remove_by_sid_range() as the one
 being used for other hash table matching functions since
 smmuv3_invalidate_ste() name is not self explanatory, and introduce a
 helper that invokes the g_hash_table_foreach_remove.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+No functional change intended.
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-5-richard.henderson@linaro.org
+Signed-off-by: JianChunfu <jansef.jian@hj-micro.com>
 Reviewed-by: Eric Auger <eric.auger@redhat.com>
 Message-id: 20250228031438.3916-1-jansef.jian@hj-micro.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpregs.h        |  5 +++++
+ hw/arm/smmu-internal.h       |  5 -----
- target/arm/translate-a64.c | 18 ++++++++++++++++++
+ include/hw/arm/smmu-common.h |  6 ++++++
-files changed, 23 insertions(+)
+ hw/arm/smmu-common.c         | 21 +++++++++++++++++++++
  hw/arm/smmuv3.c              | 19 ++-----------------
  hw/arm/trace-events          |  3 ++-
 files changed, 31 insertions(+), 23 deletions(-)
-diff --git a/target/arm/cpregs.h b/target/arm/cpregs.h
+diff --git a/hw/arm/smmu-internal.h b/hw/arm/smmu-internal.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpregs.h
+--- a/hw/arm/smmu-internal.h
-+++ b/target/arm/cpregs.h
++++ b/hw/arm/smmu-internal.h
-@@ -XXX,XX +XXX,XX @@ enum {
+@@ -XXX,XX +XXX,XX @@ typedef struct SMMUIOTLBPageInvInfo {
-     ARM_CP_EL3_NO_EL2_UNDEF      = 1 << 16,
+     uint64_t mask;
-     ARM_CP_EL3_NO_EL2_KEEP       = 1 << 17,
+ } SMMUIOTLBPageInvInfo;
-     ARM_CP_EL3_NO_EL2_C_NZ       = 1 << 18,
-+    /*
+-typedef struct SMMUSIDRange {
-+     * Flag: Access check for this sysreg is constrained by the
+-    uint32_t start;
-+     * ARM pseudocode function CheckSMEAccess().
+-    uint32_t end;
-+     */
+-} SMMUSIDRange;
-+    ARM_CP_SME                   = 1 << 19,
+-
- };
+ #endif
+diff --git a/include/hw/arm/smmu-common.h b/include/hw/arm/smmu-common.h
  /*
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/include/hw/arm/smmu-common.h
-+++ b/target/arm/translate-a64.c
++++ b/include/hw/arm/smmu-common.h
-@@ -XXX,XX +XXX,XX @@ bool sve_access_check(DisasContext *s)
+@@ -XXX,XX +XXX,XX @@ typedef struct SMMUIOTLBKey {
-     return fp_access_check(s);
+     uint8_t level;
  } SMMUIOTLBKey;
 +typedef struct SMMUSIDRange {
 +    uint32_t start;
 +    uint32_t end;
 +} SMMUSIDRange;
 +
  struct SMMUState {
      /* <private> */
      SysBusDevice  dev;
@@ -XXX,XX +XXX,XX @@ void smmu_iotlb_inv_iova(SMMUState *s, int asid, int vmid, dma_addr_t iova,
                           uint8_t tg, uint64_t num_pages, uint8_t ttl);
  void smmu_iotlb_inv_ipa(SMMUState *s, int vmid, dma_addr_t ipa, uint8_t tg,
                          uint64_t num_pages, uint8_t ttl);
 +void smmu_configs_inv_sid_range(SMMUState *s, SMMUSIDRange sid_range);
  /* Unmap the range of all the notifiers registered to any IOMMU mr */
  void smmu_inv_notifiers_all(SMMUState *s);
 diff --git a/hw/arm/smmu-common.c b/hw/arm/smmu-common.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/smmu-common.c
 +++ b/hw/arm/smmu-common.c
@@ -XXX,XX +XXX,XX @@ static gboolean smmu_hash_remove_by_vmid_ipa(gpointer key, gpointer value,
             ((entry->iova & ~info->mask) == info->iova);
  }
-+/*
++static gboolean
-+ * Check that SME access is enabled, raise an exception if not.
++smmu_hash_remove_by_sid_range(gpointer key, gpointer value, gpointer user_data)
 + * Note that this function corresponds to CheckSMEAccess and is
 + * only used directly for cpregs.
 + */
 +static bool sme_access_check(DisasContext *s)
 +{
-+    if (s->sme_excp_el) {
++    SMMUDevice *sdev = (SMMUDevice *)key;
-+        gen_exception_insn_el(s, s->pc_curr, EXCP_UDEF,
++    uint32_t sid = smmu_get_sid(sdev);
-+                              syn_smetrap(SME_ET_AccessTrap, false),
++    SMMUSIDRange *sid_range = (SMMUSIDRange *)user_data;
-+                              s->sme_excp_el);
++
 +    if (sid < sid_range->start || sid > sid_range->end) {
 +        return false;
 +    }
++    trace_smmu_config_cache_inv(sid);
 +    return true;
 +}
 +
- /*
++void smmu_configs_inv_sid_range(SMMUState *s, SMMUSIDRange sid_range)
-  * This utility function is for doing register extension with an
++{
-  * optional shift. You will likely want to pass a temporary for the
++    trace_smmu_configs_inv_sid_range(sid_range.start, sid_range.end);
-@@ -XXX,XX +XXX,XX @@ static void handle_sys(DisasContext *s, uint32_t insn, bool isread,
++    g_hash_table_foreach_remove(s->configs, smmu_hash_remove_by_sid_range,
-         return;
++                                &sid_range);
-     } else if ((ri->type & ARM_CP_SVE) && !sve_access_check(s)) {
++}
-         return;
++
-+    } else if ((ri->type & ARM_CP_SME) && !sme_access_check(s)) {
+ void smmu_iotlb_inv_iova(SMMUState *s, int asid, int vmid, dma_addr_t iova,
-+        return;
+                          uint8_t tg, uint64_t num_pages, uint8_t ttl)
  {
 diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/smmuv3.c
 +++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ static void smmuv3_flush_config(SMMUDevice *sdev)
      SMMUv3State *s = sdev->smmu;
      SMMUState *bc = &s->smmu_state;
 -    trace_smmuv3_config_cache_inv(smmu_get_sid(sdev));
 +    trace_smmu_config_cache_inv(smmu_get_sid(sdev));
      g_hash_table_remove(bc->configs, sdev);
  }
@@ -XXX,XX +XXX,XX @@ static void smmuv3_range_inval(SMMUState *s, Cmd *cmd, SMMUStage stage)
      }
+ }
-     if ((tb_cflags(s->base.tb) & CF_USE_ICOUNT) && (ri->type & ARM_CP_IO)) {
 -static gboolean
 -smmuv3_invalidate_ste(gpointer key, gpointer value, gpointer user_data)
 -{
 -    SMMUDevice *sdev = (SMMUDevice *)key;
 -    uint32_t sid = smmu_get_sid(sdev);
 -    SMMUSIDRange *sid_range = (SMMUSIDRange *)user_data;
 -
 -    if (sid < sid_range->start || sid > sid_range->end) {
 -        return false;
 -    }
 -    trace_smmuv3_config_cache_inv(sid);
 -    return true;
 -}
 -
  static int smmuv3_cmdq_consume(SMMUv3State *s)
  {
      SMMUState *bs = ARM_SMMU(s);
@@ -XXX,XX +XXX,XX @@ static int smmuv3_cmdq_consume(SMMUv3State *s)
              sid_range.end = sid_range.start + mask;
              trace_smmuv3_cmdq_cfgi_ste_range(sid_range.start, sid_range.end);
 -            g_hash_table_foreach_remove(bs->configs, smmuv3_invalidate_ste,
 -                                        &sid_range);
 +            smmu_configs_inv_sid_range(bs, sid_range);
              break;
          }
          case SMMU_CMD_CFGI_CD:
 diff --git a/hw/arm/trace-events b/hw/arm/trace-events
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/trace-events
 +++ b/hw/arm/trace-events
@@ -XXX,XX +XXX,XX @@ smmu_iotlb_inv_asid_vmid(int asid, int vmid) "IOTLB invalidate asid=%d vmid=%d"
  smmu_iotlb_inv_vmid(int vmid) "IOTLB invalidate vmid=%d"
  smmu_iotlb_inv_vmid_s1(int vmid) "IOTLB invalidate vmid=%d"
  smmu_iotlb_inv_iova(int asid, uint64_t addr) "IOTLB invalidate asid=%d addr=0x%"PRIx64
 +smmu_configs_inv_sid_range(uint32_t start, uint32_t end) "Config cache INV SID range from 0x%x to 0x%x"
 +smmu_config_cache_inv(uint32_t sid) "Config cache INV for sid=0x%x"
  smmu_inv_notifiers_mr(const char *name) "iommu mr=%s"
  smmu_iotlb_lookup_hit(int asid, int vmid, uint64_t addr, uint32_t hit, uint32_t miss, uint32_t p) "IOTLB cache HIT asid=%d vmid=%d addr=0x%"PRIx64" hit=%d miss=%d hit rate=%d"
  smmu_iotlb_lookup_miss(int asid, int vmid, uint64_t addr, uint32_t hit, uint32_t miss, uint32_t p) "IOTLB cache MISS asid=%d vmid=%d addr=0x%"PRIx64" hit=%d miss=%d hit rate=%d"
@@ -XXX,XX +XXX,XX @@ smmuv3_cmdq_tlbi_nh(int vmid) "vmid=%d"
  smmuv3_cmdq_tlbi_nsnh(void) ""
  smmuv3_cmdq_tlbi_nh_asid(int asid) "asid=%d"
  smmuv3_cmdq_tlbi_s12_vmid(int vmid) "vmid=%d"
 -smmuv3_config_cache_inv(uint32_t sid) "Config cache INV for sid=0x%x"
  smmuv3_notify_flag_add(const char *iommu) "ADD SMMUNotifier node for iommu mr=%s"
  smmuv3_notify_flag_del(const char *iommu) "DEL SMMUNotifier node for iommu mr=%s"
  smmuv3_inv_notifiers_iova(const char *name, int asid, int vmid, uint64_t iova, uint8_t tg, uint64_t num_pages, int stage) "iommu mr=%s asid=%d vmid=%d iova=0x%"PRIx64" tg=%d num_pages=0x%"PRIx64" stage=%d"
 --
-.25.1
+.43.0

-[PULL 12/25] target/arm: Add the SME ZA storage to CPUARMState
+Deleted patch
-From: Richard Henderson <richard.henderson@linaro.org>
-Place this late in the resettable section of the structure,
-to keep the most common element offsets from being > 64k.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-10-richard.henderson@linaro.org
-[PMM: expanded comment on zarray[] format]
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/cpu.h     | 22 ++++++++++++++++++++++
- target/arm/machine.c | 34 ++++++++++++++++++++++++++++++++++
-files changed, 56 insertions(+)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUArchState {
-     } keys;
-     uint64_t scxtnum_el[4];
-+
-+    /*
-+     * SME ZA storage -- 256 x 256 byte array, with bytes in host word order,
-+     * as we do with vfp.zregs[].  This corresponds to the architectural ZA
-+     * array, where ZA[N] is in the least-significant bytes of env->zarray[N].
-+     * When SVL is less than the architectural maximum, the accessible
-+     * storage is restricted, such that if the SVL is X bytes the guest can
-+     * see only the bottom X elements of zarray[], and only the least
-+     * significant X bytes of each element of the array. (In other words,
-+     * the observable part is always square.)
-+     *
-+     * The ZA storage can also be considered as a set of square tiles of
-+     * elements of different sizes. The mapping from tiles to the ZA array
-+     * is architecturally defined, such that for tiles of elements of esz
-+     * bytes, the Nth row (or "horizontal slice") of tile T is in
-+     * ZA[T + N * esz]. Note that this means that each tile is not contiguous
-+     * in the ZA storage, because its rows are striped through the ZA array.
-+     *
-+     * Because this is so large, keep this toward the end of the reset area,
-+     * to keep the offsets into the rest of the structure smaller.
-+     */
-+    ARMVectorReg zarray[ARM_MAX_VQ * 16];
- #endif
- #if defined(CONFIG_USER_ONLY)
-diff --git a/target/arm/machine.c b/target/arm/machine.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/machine.c
-+++ b/target/arm/machine.c
-@@ -XXX,XX +XXX,XX @@ static const VMStateDescription vmstate_sve = {
-         VMSTATE_END_OF_LIST()
-     }
- };
-+
-+static const VMStateDescription vmstate_vreg = {
-+    .name = "vreg",
-+    .version_id = 1,
-+    .minimum_version_id = 1,
-+    .fields = (VMStateField[]) {
-+        VMSTATE_UINT64_ARRAY(d, ARMVectorReg, ARM_MAX_VQ * 2),
-+        VMSTATE_END_OF_LIST()
-+    }
-+};
-+
-+static bool za_needed(void *opaque)
-+{
-+    ARMCPU *cpu = opaque;
-+
-+    /*
-+     * When ZA storage is disabled, its contents are discarded.
-+     * It will be zeroed when ZA storage is re-enabled.
-+     */
-+    return FIELD_EX64(cpu->env.svcr, SVCR, ZA);
-+}
-+
-+static const VMStateDescription vmstate_za = {
-+    .name = "cpu/sme",
-+    .version_id = 1,
-+    .minimum_version_id = 1,
-+    .needed = za_needed,
-+    .fields = (VMStateField[]) {
-+        VMSTATE_STRUCT_ARRAY(env.zarray, ARMCPU, ARM_MAX_VQ * 16, 0,
-+                             vmstate_vreg, ARMVectorReg),
-+        VMSTATE_END_OF_LIST()
-+    }
-+};
- #endif /* AARCH64 */
- static bool serror_needed(void *opaque)
-@@ -XXX,XX +XXX,XX @@ const VMStateDescription vmstate_arm_cpu = {
-         &vmstate_m_security,
- #ifdef TARGET_AARCH64
-         &vmstate_sve,
-+        &vmstate_za,
- #endif
-         &vmstate_serror,
-         &vmstate_irq_line_state,
---
-.25.1

-[PULL 14/25] target/arm: Move error for sve%d property to arm_cpu_sve_finalize
+[PULL 20/21] target/rx: Set exception vector base to 0xffffff80
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Keith Packard <keithp@keithp.com>
-Keep all of the error messages together.  This does mean that
+The documentation says the vector is at 0xffffff80, instead of the
-when setting many sve length properties we'll only generate
+previous value of 0xffffffc0. That value must have been a bug because
-one error, but we only really need one.
+the standard vector values (20, 21, 23, 25, 30) were all
 past the end of the array.
+Signed-off-by: Keith Packard <keithp@keithp.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20220620175235.60881-12-richard.henderson@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu64.c | 15 +++++++--------
+ target/rx/helper.c | 2 +-
-file changed, 7 insertions(+), 8 deletions(-)
+file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+diff --git a/target/rx/helper.c b/target/rx/helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/rx/helper.c
-+++ b/target/arm/cpu64.c
++++ b/target/rx/helper.c
-@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
+@@ -XXX,XX +XXX,XX @@ void rx_cpu_do_interrupt(CPUState *cs)
-                                   "using only sve<N> properties.\n");
+         cpu_stl_data(env, env->isp, env->pc);
-             } else {
-                 error_setg(errp, "cannot enable sve%d", vq * 128);
+         if (vec < 0x100) {
--                error_append_hint(errp, "This CPU does not support "
+-            env->pc = cpu_ldl_data(env, 0xffffffc0 + vec * 4);
--                                  "the vector length %d-bits.\n", vq * 128);
++            env->pc = cpu_ldl_data(env, 0xffffff80 + vec * 4);
 +                if (vq_supported) {
 +                    error_append_hint(errp, "This CPU does not support "
 +                                      "the vector length %d-bits.\n", vq * 128);
 +                } else {
 +                    error_append_hint(errp, "SVE not supported by KVM "
 +                                      "on this host\n");
 +                }
              }
              return;
          } else {
-@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
+             env->pc = cpu_ldl_data(env, env->intb + (vec & 0xff) * 4);
-         return;
+         }
      }
 -    if (value && kvm_enabled() && !kvm_arm_sve_supported()) {
 -        error_setg(errp, "cannot enable %s", name);
 -        error_append_hint(errp, "SVE not supported by KVM on this host\n");
 -        return;
 -    }
 -
      cpu->sve_vq_map = deposit32(cpu->sve_vq_map, vq - 1, 1, value);
      cpu->sve_vq_init |= 1 << (vq - 1);
  }
 --
-.25.1
+.43.0

-[PULL 25/25] target/arm: Check V7VE as well as LPAE in arm_pamax
+[PULL 21/21] target/rx: Remove TCG_CALL_NO_WG from helpers which write env
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Keith Packard <keithp@keithp.com>
-In machvirt_init we create a cpu but do not fully initialize it.
+Functions which modify TCG globals must not be marked TCG_CALL_NO_WG,
-Thus the propagation of V7VE to LPAE has not been done, and we
+as that tells the optimizer that TCG global values already loaded in
-compute the wrong value for some v7 cpus, e.g. cortex-a15.
+machine registers are still valid, and so any changes which these
 helpers make to the CPU state may be ignored.
-Resolves: https://gitlab.com/qemu-project/qemu/-/issues/1078
+The target/rx code chooses to put (among other things) all the PSW
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+bits and also ACC into globals, so the NO_WG flag on various
-Reported-by: He Zhe <zhe.he@windriver.com>
+functions that touch the PSW or ACC is incorrect and must be removed.
-Message-id: 20220619001541.131672-3-richard.henderson@linaro.org
+This includes all the floating point helper functions, because
 update_fpsw() will update PSW Z and S.
 Signed-off-by: Keith Packard <keithp@keithp.com>
 [PMM: Clarified commit message]
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/ptw.c | 8 +++++++-
+ target/rx/helper.h | 34 +++++++++++++++++-----------------
-file changed, 7 insertions(+), 1 deletion(-)
+file changed, 17 insertions(+), 17 deletions(-)
-diff --git a/target/arm/ptw.c b/target/arm/ptw.c
+diff --git a/target/rx/helper.h b/target/rx/helper.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/ptw.c
+--- a/target/rx/helper.h
-+++ b/target/arm/ptw.c
++++ b/target/rx/helper.h
-@@ -XXX,XX +XXX,XX @@ unsigned int arm_pamax(ARMCPU *cpu)
+@@ -XXX,XX +XXX,XX @@ DEF_HELPER_1(raise_privilege_violation, noreturn, env)
-         assert(parange < ARRAY_SIZE(pamax_map));
+ DEF_HELPER_1(wait, noreturn, env)
-         return pamax_map[parange];
+ DEF_HELPER_2(rxint, noreturn, env, i32)
-     }
+ DEF_HELPER_1(rxbrk, noreturn, env)
--    if (arm_feature(&cpu->env, ARM_FEATURE_LPAE)) {
+-DEF_HELPER_FLAGS_3(fadd, TCG_CALL_NO_WG, f32, env, f32, f32)
-+
+-DEF_HELPER_FLAGS_3(fsub, TCG_CALL_NO_WG, f32, env, f32, f32)
-+    /*
+-DEF_HELPER_FLAGS_3(fmul, TCG_CALL_NO_WG, f32, env, f32, f32)
-+     * In machvirt_init, we call arm_pamax on a cpu that is not fully
+-DEF_HELPER_FLAGS_3(fdiv, TCG_CALL_NO_WG, f32, env, f32, f32)
-+     * initialized, so we can't rely on the propagation done in realize.
+-DEF_HELPER_FLAGS_3(fcmp, TCG_CALL_NO_WG, void, env, f32, f32)
-+     */
+-DEF_HELPER_FLAGS_2(ftoi, TCG_CALL_NO_WG, i32, env, f32)
-+    if (arm_feature(&cpu->env, ARM_FEATURE_LPAE) ||
+-DEF_HELPER_FLAGS_2(round, TCG_CALL_NO_WG, i32, env, f32)
-+        arm_feature(&cpu->env, ARM_FEATURE_V7VE)) {
+-DEF_HELPER_FLAGS_2(itof, TCG_CALL_NO_WG, f32, env, i32)
-         /* v7 with LPAE */
++DEF_HELPER_3(fadd, f32, env, f32, f32)
-         return 40;
++DEF_HELPER_3(fsub, f32, env, f32, f32)
-     }
++DEF_HELPER_3(fmul, f32, env, f32, f32)
 +DEF_HELPER_3(fdiv, f32, env, f32, f32)
 +DEF_HELPER_3(fcmp, void, env, f32, f32)
 +DEF_HELPER_2(ftoi, i32, env, f32)
 +DEF_HELPER_2(round, i32, env, f32)
 +DEF_HELPER_2(itof, f32, env, i32)
  DEF_HELPER_2(set_fpsw, void, env, i32)
 -DEF_HELPER_FLAGS_2(racw, TCG_CALL_NO_WG, void, env, i32)
 -DEF_HELPER_FLAGS_2(set_psw_rte, TCG_CALL_NO_WG, void, env, i32)
 -DEF_HELPER_FLAGS_2(set_psw, TCG_CALL_NO_WG, void, env, i32)
 +DEF_HELPER_2(racw, void, env, i32)
 +DEF_HELPER_2(set_psw_rte, void, env, i32)
 +DEF_HELPER_2(set_psw, void, env, i32)
  DEF_HELPER_1(pack_psw, i32, env)
 -DEF_HELPER_FLAGS_3(div, TCG_CALL_NO_WG, i32, env, i32, i32)
 -DEF_HELPER_FLAGS_3(divu, TCG_CALL_NO_WG, i32, env, i32, i32)
 -DEF_HELPER_FLAGS_1(scmpu, TCG_CALL_NO_WG, void, env)
 +DEF_HELPER_3(div, i32, env, i32, i32)
 +DEF_HELPER_3(divu, i32, env, i32, i32)
 +DEF_HELPER_1(scmpu, void, env)
  DEF_HELPER_1(smovu, void, env)
  DEF_HELPER_1(smovf, void, env)
  DEF_HELPER_1(smovb, void, env)
  DEF_HELPER_2(sstr, void, env, i32)
 -DEF_HELPER_FLAGS_2(swhile, TCG_CALL_NO_WG, void, env, i32)
 -DEF_HELPER_FLAGS_2(suntil, TCG_CALL_NO_WG, void, env, i32)
 -DEF_HELPER_FLAGS_2(rmpa, TCG_CALL_NO_WG, void, env, i32)
 +DEF_HELPER_2(swhile, void, env, i32)
 +DEF_HELPER_2(suntil, void, env, i32)
 +DEF_HELPER_2(rmpa, void, env, i32)
  DEF_HELPER_1(satr, void, env)
 --
-.25.1
+.43.0

target-arm queue, mostly SME preliminaries.

In the unlikely event we don't land the rest of SME before freeze
for 7.1 we can revert the docs/property changes included here.

-- PMM

The following changes since commit 097ccbbbaf2681df1e65542e5b7d2b2d0c66e2bc:

Merge tag 'qemu-sparc-20220626' of https://github.com/mcayland/qemu into staging (2022-06-27 05:21:05 +0530)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20220627

for you to fetch changes up to 59e1b8a22ea9f947d038ccac784de1020f266e14:

target/arm: Check V7VE as well as LPAE in arm_pamax (2022-06-27 11:18:17 +0100)

----------------------------------------------------------------
target-arm queue:
 * sphinx: change default language to 'en'
 * Diagnose attempts to emulate EL3 in hvf as well as kvm
 * More SME groundwork patches
 * virt: Fix calculation of physical address space size
   for v7VE CPUs (eg cortex-a15)

----------------------------------------------------------------
Alexander Graf (2):
      accel: Introduce current_accel_name()
      target/arm: Catch invalid kvm state also for hvf

Martin Liška (1):
      sphinx: change default language to 'en'

Richard Henderson (22):
      target/arm: Implement TPIDR2_EL0
      target/arm: Add SMEEXC_EL to TB flags
      target/arm: Add syn_smetrap
      target/arm: Add ARM_CP_SME
      target/arm: Add SVCR
      target/arm: Add SMCR_ELx
      target/arm: Add SMIDR_EL1, SMPRI_EL1, SMPRIMAP_EL2
      target/arm: Add PSTATE.{SM,ZA} to TB flags
      target/arm: Add the SME ZA storage to CPUARMState
      target/arm: Implement SMSTART, SMSTOP
      target/arm: Move error for sve%d property to arm_cpu_sve_finalize
      target/arm: Create ARMVQMap
      target/arm: Generalize cpu_arm_{get,set}_vq
      target/arm: Generalize cpu_arm_{get, set}_default_vec_len
      target/arm: Move arm_cpu_*_finalize to internals.h
      target/arm: Unexport aarch64_add_*_properties
      target/arm: Add cpu properties for SME
      target/arm: Introduce sve_vqm1_for_el_sm
      target/arm: Add SVL to TB flags
      target/arm: Move pred_{full, gvec}_reg_{offset, size} to translate-a64.h
      target/arm: Extend arm_pamax to more than aarch64
      target/arm: Check V7VE as well as LPAE in arm_pamax

From: Martin Liška <mliska@suse.cz>

Fixes the following Sphinx warning (treated as error) starting
with 5.0 release:

Warning, treated as error:
Invalid configuration value found: 'language = None'. Update your configuration to a valid langauge code. Falling back to 'en' (English).

Signed-off-by: Martin Liska <mliska@suse.cz>
Message-id: e91e51ee-48ac-437e-6467-98b56ee40042@suse.cz
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 docs/conf.py | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/conf.py b/docs/conf.py
index XXXXXXX..XXXXXXX 100644
--- a/docs/conf.py
+++ b/docs/conf.py
@@ -XXX,XX +XXX,XX @@
 #
 # This is also used if you do content translation via gettext catalogs.
 # Usually you set "language" from the command line for these cases.
-language = None
+language = 'en'
 
 # List of patterns, relative to source directory, that match files and
 # directories to ignore when looking for source files.
-- 
2.25.1

From: Alexander Graf <agraf@csgraf.de>

We need to fetch the name of the current accelerator in flexible error
messages more going forward. Let's create a helper that gives it to us
without casting in the target code.

Signed-off-by: Alexander Graf <agraf@csgraf.de>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20220620192242.70573-1-agraf@csgraf.de
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/qemu/accel.h | 1 +
 accel/accel-common.c | 8 ++++++++
 softmmu/vl.c         | 3 +--
 3 files changed, 10 insertions(+), 2 deletions(-)

diff --git a/include/qemu/accel.h b/include/qemu/accel.h
index XXXXXXX..XXXXXXX 100644
--- a/include/qemu/accel.h
+++ b/include/qemu/accel.h
@@ -XXX,XX +XXX,XX @@ typedef struct AccelClass {
 
 AccelClass *accel_find(const char *opt_name);
 AccelState *current_accel(void);
+const char *current_accel_name(void);
 
 void accel_init_interfaces(AccelClass *ac);
 
diff --git a/accel/accel-common.c b/accel/accel-common.c
index XXXXXXX..XXXXXXX 100644
--- a/accel/accel-common.c
+++ b/accel/accel-common.c
@@ -XXX,XX +XXX,XX @@ AccelClass *accel_find(const char *opt_name)
     return ac;
 }
 
+/* Return the name of the current accelerator */
+const char *current_accel_name(void)
+{
+    AccelClass *ac = ACCEL_GET_CLASS(current_accel());
+
+    return ac->name;
+}
+
 static void accel_init_cpu_int_aux(ObjectClass *klass, void *opaque)
 {
     CPUClass *cc = CPU_CLASS(klass);
diff --git a/softmmu/vl.c b/softmmu/vl.c
index XXXXXXX..XXXXXXX 100644
--- a/softmmu/vl.c
+++ b/softmmu/vl.c
@@ -XXX,XX +XXX,XX @@ static void configure_accelerators(const char *progname)
     }
 
     if (init_failed && !qtest_chrdev) {
-        AccelClass *ac = ACCEL_GET_CLASS(current_accel());
-        error_report("falling back to %s", ac->name);
+        error_report("falling back to %s", current_accel_name());
     }
 
     if (icount_enabled() && !tcg_enabled()) {
-- 
2.25.1

From: Alexander Graf <agraf@csgraf.de>

Some features such as running in EL3 or running M profile code are
incompatible with virtualization as QEMU implements it today. To prevent
users from picking invalid configurations on other virt solutions like
Hvf, let's run the same checks there too.

Resolves: https://gitlab.com/qemu-project/qemu/-/issues/1073
Signed-off-by: Alexander Graf <agraf@csgraf.de>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20220620192242.70573-2-agraf@csgraf.de
[PMM: Allow qtest accelerator too; tweak comment]
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/cpu.c | 16 ++++++++++++----
 1 file changed, 12 insertions(+), 4 deletions(-)

diff --git a/target/arm/cpu.c b/target/arm/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.c
+++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@
 #include "hw/boards.h"
 #endif
 #include "sysemu/tcg.h"
+#include "sysemu/qtest.h"
 #include "sysemu/hw_accel.h"
 #include "kvm_arm.h"
 #include "disas/capstone.h"
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
         }
     }
 
-    if (kvm_enabled()) {
+    if (!tcg_enabled() && !qtest_enabled()) {
         /*
+         * We assume that no accelerator except TCG (and the "not really an
+         * accelerator" qtest) can handle these features, because Arm hardware
+         * virtualization can't virtualize them.
+         *
          * Catch all the cases which might cause us to create more than one
          * address space for the CPU (otherwise we will assert() later in
          * cpu_address_space_init()).
          */
         if (arm_feature(env, ARM_FEATURE_M)) {
             error_setg(errp,
-                       "Cannot enable KVM when using an M-profile guest CPU");
+                       "Cannot enable %s when using an M-profile guest CPU",
+                       current_accel_name());
             return;
         }
         if (cpu->has_el3) {
             error_setg(errp,
-                       "Cannot enable KVM when guest CPU has EL3 enabled");
+                       "Cannot enable %s when guest CPU has EL3 enabled",
+                       current_accel_name());
             return;
         }
         if (cpu->tag_memory) {
             error_setg(errp,
-                       "Cannot enable KVM when guest CPUs has MTE enabled");
+                       "Cannot enable %s when guest CPUs has MTE enabled",
+                       current_accel_name());
             return;
         }
     }
-- 
2.25.1