Series comparison

-[PULL 00/12] target-arm queue
+[PULL 0/6] target-arm queue
-One last arm pullreq before I stop work for the end of the year...
+Hi; here's a target-arm pull for rc2. Four arm-related fixes,
 and a couple of bug fixes for other areas of the codebase
 that seemed like they'd fallen through the cracks.
+thanks
 -- PMM
-The following changes since commit 8e5943260a8f765216674ee87ce8588cc4e7463e:
+The following changes since commit ccb86f079a9e4d94918086a9df18c1844347aff8:
-  Merge remote-tracking branch 'remotes/vivier2/tags/trivial-branch-pull-request' into staging (2019-12-20 12:46:10 +0000)
+  Merge tag 'pull-nbd-2023-07-28' of https://repo.or.cz/qemu/ericb into staging (2023-07-28 09:56:57 -0700)
 are available in the Git repository at:
-  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20191220
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20230731
-for you to fetch changes up to c8fa6079eb35888587f1be27c1590da4edcc5098:
+for you to fetch changes up to 108e8180c6b0c315711aa54e914030a313505c17:
-  arm/arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on() (2019-12-20 14:03:00 +0000)
+  gdbstub: Fix client Ctrl-C handling (2023-07-31 14:57:32 +0100)
 ----------------------------------------------------------------
 target-arm queue:
- * Support emulating the generic timers at frequencies other than 62.5MHz
+ * Don't build AArch64 decodetree files for qemu-system-arm
- * Various fixes for SMMUv3 emulation bugs
+ * Fix TCG assert in v8.1M CSEL etc
- * Improve assert error message for hflags mismatches
+ * Fix MemOp for STGP
- * arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on()
+ * gdbstub: Fix client Ctrl-C handling
  * kvm: Fix crash due to access uninitialized kvm_state
  * elf2dmp: Don't abandon when Prcb is set to 0
 ----------------------------------------------------------------
-Andrew Jeffery (4):
+Akihiko Odaki (1):
-      target/arm: Remove redundant scaling of nexttick
+      elf2dmp: Don't abandon when Prcb is set to 0
       target/arm: Abstract the generic timer frequency
       target/arm: Prepare generic timer for per-platform CNTFRQ
       ast2600: Configure CNTFRQ at 1125MHz
-Niek Linnenbank (1):
+Gavin Shan (1):
-      arm/arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on()
+      kvm: Fix crash due to access uninitialized kvm_state
-Philippe Mathieu-Daudé (1):
+Nicholas Piggin (1):
-      target/arm: Display helpful message when hflags mismatch
+      gdbstub: Fix client Ctrl-C handling
-Simon Veith (6):
+Peter Maydell (2):
-      hw/arm/smmuv3: Apply address mask to linear strtab base address
+      target/arm: Avoid writing to constant TCGv in trans_CSEL()
-      hw/arm/smmuv3: Correct SMMU_BASE_ADDR_MASK value
+      target/arm/tcg: Don't build AArch64 decodetree files for qemu-system-arm
       hw/arm/smmuv3: Check stream IDs against actual table LOG2SIZE
       hw/arm/smmuv3: Align stream table base address to table size
       hw/arm/smmuv3: Use correct bit positions in EVT_SET_ADDR2 macro
       hw/arm/smmuv3: Report F_STE_FETCH fault address in correct word position
- hw/arm/smmuv3-internal.h  |  6 ++---
+Richard Henderson (1):
- target/arm/cpu.h          |  5 ++++
+      target/arm: Fix MemOp for STGP
  hw/arm/aspeed_ast2600.c   |  3 +++
  hw/arm/smmuv3.c           | 28 +++++++++++++++-----
  target/arm/arm-powerctl.c |  3 +++
  target/arm/cpu.c          | 65 +++++++++++++++++++++++++++++++++++++++++------
  target/arm/helper.c       | 42 +++++++++++++++++++++++-------
 files changed, 125 insertions(+), 27 deletions(-)
+ accel/kvm/kvm-all.c            |  2 +-
+ contrib/elf2dmp/main.c         |  5 +++++
+ gdbstub/gdbstub.c              | 13 +++++++++++--
+ target/arm/tcg/translate-a64.c | 21 ++++++++++++++++++---
+ target/arm/tcg/translate.c     | 15 ++++++++-------
+ target/arm/tcg/meson.build     | 10 +++++++---
+files changed, 50 insertions(+), 16 deletions(-)

-[PULL 01/12] target/arm: Remove redundant scaling of nexttick
+Deleted patch
-From: Andrew Jeffery <andrew@aj.id.au>
-The corner-case codepath was adjusting nexttick such that overflow
-wouldn't occur when timer_mod() scaled the value back up. Remove a use
-of GTIMER_SCALE and avoid unnecessary operations by calling
-timer_mod_ns() directly.
-Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Cédric Le Goater <clg@kaod.org>
-Message-id: f8c680720e3abe55476e6d9cb604ad27fdbeb2e0.1576215453.git-series.andrew@aj.id.au
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/helper.c | 5 +++--
-file changed, 3 insertions(+), 2 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-          * timer expires we will reset the timer for any remaining period.
-          */
-         if (nexttick > INT64_MAX / GTIMER_SCALE) {
--            nexttick = INT64_MAX / GTIMER_SCALE;
-+            timer_mod_ns(cpu->gt_timer[timeridx], INT64_MAX);
-+        } else {
-+            timer_mod(cpu->gt_timer[timeridx], nexttick);
-         }
--        timer_mod(cpu->gt_timer[timeridx], nexttick);
-         trace_arm_gt_recalc(timeridx, irqstate, nexttick);
-     } else {
-         /* Timer disabled: ISTATUS and timer output always clear */
---
-.20.1

-[PULL 02/12] target/arm: Abstract the generic timer frequency
+Deleted patch
-From: Andrew Jeffery <andrew@aj.id.au>
-Prepare for SoCs such as the ASPEED AST2600 whose firmware configures
-CNTFRQ to values significantly larger than the static 62.5MHz value
-currently derived from GTIMER_SCALE. As the OS potentially derives its
-timer periods from the CNTFRQ value the lack of support for running
-QEMUTimers at the appropriate rate leads to sticky behaviour in the
-guest.
-Substitute the GTIMER_SCALE constant with use of a helper to derive the
-period from gt_cntfrq_hz stored in struct ARMCPU. Initially set
-gt_cntfrq_hz to the frequency associated with GTIMER_SCALE so current
-behaviour is maintained.
-Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Message-id: 40bd8df043f66e1ccfb3e9482999d099ac72bb2e.1576215453.git-series.andrew@aj.id.au
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/cpu.h    |  5 +++++
- target/arm/cpu.c    |  8 ++++++++
- target/arm/helper.c | 10 +++++++---
-files changed, 20 insertions(+), 3 deletions(-)
-diff --git a/target/arm/cpu.h b/target/arm/cpu.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.h
-+++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
-      */
-     DECLARE_BITMAP(sve_vq_map, ARM_MAX_VQ);
-     DECLARE_BITMAP(sve_vq_init, ARM_MAX_VQ);
-+
-+    /* Generic timer counter frequency, in Hz */
-+    uint64_t gt_cntfrq_hz;
- };
-+unsigned int gt_cntfrq_period_ns(ARMCPU *cpu);
-+
- void arm_cpu_post_init(Object *obj);
- uint64_t arm_cpu_mp_affinity(int idx, uint8_t clustersz);
-diff --git a/target/arm/cpu.c b/target/arm/cpu.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.c
-+++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_initfn(Object *obj)
-     if (tcg_enabled()) {
-         cpu->psci_version = 2; /* TCG implements PSCI 0.2 */
-     }
-+
-+    cpu->gt_cntfrq_hz = NANOSECONDS_PER_SECOND / GTIMER_SCALE;
- }
- static Property arm_cpu_reset_cbar_property =
-@@ -XXX,XX +XXX,XX @@ static void arm_set_init_svtor(Object *obj, Visitor *v, const char *name,
-     visit_type_uint32(v, name, &cpu->init_svtor, errp);
- }
-+unsigned int gt_cntfrq_period_ns(ARMCPU *cpu)
-+{
-+    return NANOSECONDS_PER_SECOND > cpu->gt_cntfrq_hz ?
-+      NANOSECONDS_PER_SECOND / cpu->gt_cntfrq_hz : 1;
-+}
-+
- void arm_cpu_post_init(Object *obj)
- {
-     ARMCPU *cpu = ARM_CPU(obj);
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static CPAccessResult gt_stimer_access(CPUARMState *env,
- static uint64_t gt_get_countervalue(CPUARMState *env)
- {
--    return qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) / GTIMER_SCALE;
-+    ARMCPU *cpu = env_archcpu(env);
-+
-+    return qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) / gt_cntfrq_period_ns(cpu);
- }
- static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-@@ -XXX,XX +XXX,XX @@ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
-          * set the timer for as far in the future as possible. When the
-          * timer expires we will reset the timer for any remaining period.
-          */
--        if (nexttick > INT64_MAX / GTIMER_SCALE) {
-+        if (nexttick > INT64_MAX / gt_cntfrq_period_ns(cpu)) {
-             timer_mod_ns(cpu->gt_timer[timeridx], INT64_MAX);
-         } else {
-             timer_mod(cpu->gt_timer[timeridx], nexttick);
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
- static uint64_t gt_virt_cnt_read(CPUARMState *env, const ARMCPRegInfo *ri)
- {
-+    ARMCPU *cpu = env_archcpu(env);
-+
-     /* Currently we have no support for QEMUTimer in linux-user so we
-      * can't call gt_get_countervalue(env), instead we directly
-      * call the lower level functions.
-      */
--    return cpu_get_clock() / GTIMER_SCALE;
-+    return cpu_get_clock() / gt_cntfrq_period_ns(cpu);
- }
- static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
---
-.20.1

-[PULL 08/12] hw/arm/smmuv3: Align stream table base address to table size
+[PULL 1/6] target/arm: Fix MemOp for STGP
-From: Simon Veith <sveith@amazon.de>
+From: Richard Henderson <richard.henderson@linaro.org>
-Per the specification, and as observed in hardware, the SMMUv3 aligns
+When converting to decodetree, the code to rebuild mop for the pair
-the SMMU_STRTAB_BASE address to the size of the table by masking out the
+only made it into trans_STP and not into trans_STGP.
 respective least significant bits in the ADDR field.
-Apply this masking logic to our smmu_find_ste() lookup function per the
+Resolves: https://gitlab.com/qemu-project/qemu/-/issues/1790
-specification.
+Fixes: 8c212eb6594 ("target/arm: Convert load/store-pair to decodetree")
+Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-ref. ARM IHI 0070C, section 6.3.23.
+Message-id: 20230726165416.309624-1-richard.henderson@linaro.org
 Signed-off-by: Simon Veith <sveith@amazon.de>
 Acked-by: Eric Auger <eric.auger@redhat.com>
 Tested-by: Eric Auger <eric.auger@redhat.com>
 Message-id: 1576509312-13083-5-git-send-email-sveith@amazon.de
 Cc: Eric Auger <eric.auger@redhat.com>
 Cc: qemu-devel@nongnu.org
 Cc: qemu-arm@nongnu.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/smmuv3.c | 18 ++++++++++++++----
+ target/arm/tcg/translate-a64.c | 21 ++++++++++++++++++---
-file changed, 14 insertions(+), 4 deletions(-)
+file changed, 18 insertions(+), 3 deletions(-)
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
+diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
+--- a/target/arm/tcg/translate-a64.c
-+++ b/hw/arm/smmuv3.c
++++ b/target/arm/tcg/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ bad_ste:
+@@ -XXX,XX +XXX,XX @@ static bool trans_STGP(DisasContext *s, arg_ldstpair *a)
- static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
+     MemOp mop;
-                          SMMUEventInfo *event)
+     TCGv_i128 tmp;
- {
--    dma_addr_t addr;
++    /* STGP only comes in one size. */
-+    dma_addr_t addr, strtab_base;
++    tcg_debug_assert(a->sz == MO_64);
-     uint32_t log2size;
++
-+    int strtab_size_shift;
+     if (!dc_isar_feature(aa64_mte_insn_reg, s)) {
-     int ret;
+         return false;
      trace_smmuv3_find_ste(sid, s->features, s->sid_split);
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
      }
-     if (s->features & SMMU_FEATURE_2LVL_STE) {
+@@ -XXX,XX +XXX,XX @@ static bool trans_STGP(DisasContext *s, arg_ldstpair *a)
-         int l1_ste_offset, l2_ste_offset, max_l2_ste, span;
+         gen_helper_stg(cpu_env, dirty_addr, dirty_addr);
 -        dma_addr_t strtab_base, l1ptr, l2ptr;
 +        dma_addr_t l1ptr, l2ptr;
          STEDesc l1std;
 -        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK;
 +        /*
 +         * Align strtab base address to table size. For this purpose, assume it
 +         * is not bounded by SMMU_IDR1_SIDSIZE.
 +         */
 +        strtab_size_shift = MAX(5, (int)log2size - s->sid_split - 1 + 3);
 +        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK &
 +                      ~MAKE_64BIT_MASK(0, strtab_size_shift);
          l1_ste_offset = sid >> s->sid_split;
          l2_ste_offset = sid & ((1 << s->sid_split) - 1);
          l1ptr = (dma_addr_t)(strtab_base + l1_ste_offset * sizeof(l1std));
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
          }
          addr = l2ptr + l2_ste_offset * sizeof(*ste);
      } else {
 -        addr = (s->strtab_base & SMMU_BASE_ADDR_MASK) + sid * sizeof(*ste);
 +        strtab_size_shift = log2size + 5;
 +        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK &
 +                      ~MAKE_64BIT_MASK(0, strtab_size_shift);
 +        addr = strtab_base + sid * sizeof(*ste);
      }
-     if (smmu_get_ste(s, addr, ste, event)) {
+-    mop = finalize_memop(s, a->sz);
 -    clean_addr = gen_mte_checkN(s, dirty_addr, true, false, 2 << a->sz, mop);
 +    mop = finalize_memop(s, MO_64);
 +    clean_addr = gen_mte_checkN(s, dirty_addr, true, false, 2 << MO_64, mop);
      tcg_rt = cpu_reg(s, a->rt);
      tcg_rt2 = cpu_reg(s, a->rt2);
 -    assert(a->sz == 3);
 +    /*
 +     * STGP is defined as two 8-byte memory operations and one tag operation.
 +     * We implement it as one single 16-byte memory operation for convenience.
 +     * Rebuild mop as for STP.
 +     * TODO: The atomicity with LSE2 is stronger than required.
 +     * Need a form of MO_ATOM_WITHIN16_PAIR that never requires
 +     * 16-byte atomicity.
 +     */
 +    mop = MO_128;
 +    if (s->align_mem) {
 +        mop |= MO_ALIGN_8;
 +    }
 +    mop = finalize_memop_pair(s, mop);
      tmp = tcg_temp_new_i128();
      if (s->be_data == MO_LE) {
 --
-.20.1
+.34.1

-[PULL 03/12] target/arm: Prepare generic timer for per-platform CNTFRQ
+[PULL 2/6] elf2dmp: Don't abandon when Prcb is set to 0
-From: Andrew Jeffery <andrew@aj.id.au>
+From: Akihiko Odaki <akihiko.odaki@daynix.com>
-The ASPEED AST2600 clocks the generic timer at the rate of HPLL. On
+Prcb may be set to 0 for some CPUs if the dump was taken before they
-recent firmwares this is at 1125MHz, which is considerably quicker than
+start. The dump may still contain valuable information for started CPUs
-the assumed 62.5MHz of the current generic timer implementation. The
+so don't abandon conversion in such a case.
 delta between the value as read from CNTFRQ and the true rate of the
 underlying QEMUTimer leads to sticky behaviour in AST2600 guests.
-Add a feature-gated property exposing CNTFRQ for ARM CPUs providing the
+Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
-generic timer. This allows platforms to configure CNTFRQ (and the
+Reviewed-by: Viktor Prutyanov <viktor.prutyanov@phystech.edu>
-associated QEMUTimer) to the appropriate frequency prior to starting the
+Message-id: 20230611033434.14659-1-akihiko.odaki@daynix.com
 guest.
 As the platform can now determine the rate of CNTFRQ we're exposed to
 limitations of QEMUTimer that didn't previously materialise: In the
 course of emulation we need to arbitrarily and accurately convert
 between guest ticks and time, but we're constrained by QEMUTimer's use
 of an integer scaling factor. The effect is QEMUTimer cannot exactly
 capture the period of frequencies that do not cleanly divide
 NANOSECONDS_PER_SECOND for scaling ticks to time. As such, provide an
 equally inaccurate scaling factor for scaling time to ticks so at least
 a self-consistent inverse relationship holds.
 Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: a22db9325f96e39f76e3c2baddcb712149f46bf2.1576215453.git-series.andrew@aj.id.au
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.c    | 61 +++++++++++++++++++++++++++++++++++++--------
+ contrib/elf2dmp/main.c | 5 +++++
- target/arm/helper.c |  9 ++++++-
+file changed, 5 insertions(+)
 files changed, 59 insertions(+), 11 deletions(-)
-diff --git a/target/arm/cpu.c b/target/arm/cpu.c
+diff --git a/contrib/elf2dmp/main.c b/contrib/elf2dmp/main.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu.c
+--- a/contrib/elf2dmp/main.c
-+++ b/target/arm/cpu.c
++++ b/contrib/elf2dmp/main.c
-@@ -XXX,XX +XXX,XX @@ static void arm_cpu_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ static int fill_context(KDDEBUGGER_DATA64 *kdbg,
-     if (tcg_enabled()) {
+             return 1;
          cpu->psci_version = 2; /* TCG implements PSCI 0.2 */
      }
 -
 -    cpu->gt_cntfrq_hz = NANOSECONDS_PER_SECOND / GTIMER_SCALE;
  }
 +static Property arm_cpu_gt_cntfrq_property =
 +            DEFINE_PROP_UINT64("cntfrq", ARMCPU, gt_cntfrq_hz,
 +                               NANOSECONDS_PER_SECOND / GTIMER_SCALE);
 +
  static Property arm_cpu_reset_cbar_property =
              DEFINE_PROP_UINT64("reset-cbar", ARMCPU, reset_cbar, 0);
@@ -XXX,XX +XXX,XX @@ static void arm_set_init_svtor(Object *obj, Visitor *v, const char *name,
  unsigned int gt_cntfrq_period_ns(ARMCPU *cpu)
  {
 +    /*
 +     * The exact approach to calculating guest ticks is:
 +     *
 +     *     muldiv64(qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL), cpu->gt_cntfrq_hz,
 +     *              NANOSECONDS_PER_SECOND);
 +     *
 +     * We don't do that. Rather we intentionally use integer division
 +     * truncation below and in the caller for the conversion of host monotonic
 +     * time to guest ticks to provide the exact inverse for the semantics of
 +     * the QEMUTimer scale factor. QEMUTimer's scale facter is an integer, so
 +     * it loses precision when representing frequencies where
 +     * `(NANOSECONDS_PER_SECOND % cpu->gt_cntfrq) > 0` holds. Failing to
 +     * provide an exact inverse leads to scheduling timers with negative
 +     * periods, which in turn leads to sticky behaviour in the guest.
 +     *
 +     * Finally, CNTFRQ is effectively capped at 1GHz to ensure our scale factor
 +     * cannot become zero.
 +     */
      return NANOSECONDS_PER_SECOND > cpu->gt_cntfrq_hz ?
        NANOSECONDS_PER_SECOND / cpu->gt_cntfrq_hz : 1;
  }
@@ -XXX,XX +XXX,XX @@ void arm_cpu_post_init(Object *obj)
      qdev_property_add_static(DEVICE(obj), &arm_cpu_cfgend_property,
                               &error_abort);
 +
 +    if (arm_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER)) {
 +        qdev_property_add_static(DEVICE(cpu), &arm_cpu_gt_cntfrq_property,
 +                                 &error_abort);
 +    }
  }
  static void arm_cpu_finalizefn(Object *obj)
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
          }
-     }
++        if (!Prcb) {
--    cpu->gt_timer[GTIMER_PHYS] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
++            eprintf("Context for CPU #%d is missing\n", i);
--                                           arm_gt_ptimer_cb, cpu);
++            continue;
 -    cpu->gt_timer[GTIMER_VIRT] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
 -                                           arm_gt_vtimer_cb, cpu);
 -    cpu->gt_timer[GTIMER_HYP] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
 -                                          arm_gt_htimer_cb, cpu);
 -    cpu->gt_timer[GTIMER_SEC] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
 -                                          arm_gt_stimer_cb, cpu);
 +
 +    {
 +        uint64_t scale;
 +
 +        if (arm_feature(env, ARM_FEATURE_GENERIC_TIMER)) {
 +            if (!cpu->gt_cntfrq_hz) {
 +                error_setg(errp, "Invalid CNTFRQ: %"PRId64"Hz",
 +                           cpu->gt_cntfrq_hz);
 +                return;
 +            }
 +            scale = gt_cntfrq_period_ns(cpu);
 +        } else {
 +            scale = GTIMER_SCALE;
 +        }
 +
-+        cpu->gt_timer[GTIMER_PHYS] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+         if (va_space_rw(vs, Prcb + kdbg->OffsetPrcbContext,
-+                                               arm_gt_ptimer_cb, cpu);
+                     &Context, sizeof(Context), 0)) {
-+        cpu->gt_timer[GTIMER_VIRT] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+             eprintf("Failed to read CPU #%d ContextFrame location\n", i);
 +                                               arm_gt_vtimer_cb, cpu);
 +        cpu->gt_timer[GTIMER_HYP] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
 +                                              arm_gt_htimer_cb, cpu);
 +        cpu->gt_timer[GTIMER_SEC] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
 +                                              arm_gt_stimer_cb, cpu);
 +    }
  #endif
      cpu_exec_realizefn(cs, &local_err);
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ void arm_gt_stimer_cb(void *opaque)
      gt_recalc_timer(cpu, GTIMER_SEC);
  }
 +static void arm_gt_cntfrq_reset(CPUARMState *env, const ARMCPRegInfo *opaque)
 +{
 +    ARMCPU *cpu = env_archcpu(env);
 +
 +    cpu->env.cp15.c14_cntfrq = cpu->gt_cntfrq_hz;
 +}
 +
  static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
      /* Note that CNTFRQ is purely reads-as-written for the benefit
       * of software; writing it doesn't actually change the timer frequency.
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
        .opc0 = 3, .opc1 = 3, .crn = 14, .crm = 0, .opc2 = 0,
        .access = PL1_RW | PL0_R, .accessfn = gt_cntfrq_access,
        .fieldoffset = offsetof(CPUARMState, cp15.c14_cntfrq),
 -      .resetvalue = (1000 * 1000 * 1000) / GTIMER_SCALE,
 +      .resetfn = arm_gt_cntfrq_reset,
      },
      /* overall control: mostly access permissions */
      { .name = "CNTKCTL", .state = ARM_CP_STATE_BOTH,
 --
-.20.1
+.34.1

-[PULL 04/12] ast2600: Configure CNTFRQ at 1125MHz
+Deleted patch
-From: Andrew Jeffery <andrew@aj.id.au>
-This matches the configuration set by u-boot on the AST2600.
-Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Cédric Le Goater <clg@kaod.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Message-id: 080ca1267a09381c43cf3c50d434fb6c186f2b6e.1576215453.git-series.andrew@aj.id.au
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/aspeed_ast2600.c | 3 +++
-file changed, 3 insertions(+)
-diff --git a/hw/arm/aspeed_ast2600.c b/hw/arm/aspeed_ast2600.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/aspeed_ast2600.c
-+++ b/hw/arm/aspeed_ast2600.c
-@@ -XXX,XX +XXX,XX @@ static void aspeed_soc_ast2600_realize(DeviceState *dev, Error **errp)
-         object_property_set_int(OBJECT(&s->cpu[i]), aspeed_calc_affinity(i),
-                                 "mp-affinity", &error_abort);
-+        object_property_set_int(OBJECT(&s->cpu[i]), 1125000000, "cntfrq",
-+                                &error_abort);
-+
-         /*
-          * TODO: the secondary CPUs are started and a boot helper
-          * is needed when using -kernel
---
-.20.1

-[PULL 12/12] arm/arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on()
+[PULL 3/6] target/arm: Avoid writing to constant TCGv in trans_CSEL()
-From: Niek Linnenbank <nieklinnenbank@gmail.com>
+In commit 0b188ea05acb5 we changed the implementation of
 trans_CSEL() to use tcg_constant_i32(). However, this change
 was incorrect, because the implementation of the function
 sets up the TCGv_i32 rn and rm to be either zero or else
 a TCG temp created in load_reg(), and these TCG temps are
 then in both cases written to by the emitted TCG ops.
 The result is that we hit a TCG assertion:
-After setting CP15 bits in arm_set_cpu_on() the cached hflags must
+qemu-system-arm: ../../tcg/tcg.c:4455: tcg_reg_alloc_mov: Assertion `!temp_readonly(ots)' failed.
 be rebuild to reflect the changed processor state. Without rebuilding,
 the cached hflags would be inconsistent until the next call to
 arm_rebuild_hflags(). When QEMU is compiled with debugging enabled
 (--enable-debug), this problem is captured shortly after the first
 call to arm_set_cpu_on() for CPUs running in ARM 32-bit non-secure mode:
-  qemu-system-arm: target/arm/helper.c:11359: cpu_get_tb_cpu_state:
+(or on a non-debug build, just produce a garbage result)
   Assertion `flags == rebuild_hflags_internal(env)' failed.
   Aborted (core dumped)
-Fixes: 0c7f8c43daf65
+Adjust the code so that rn and rm are always writeable
 temporaries whether the instruction is using the special
 case "0" or a normal register as input.
 Cc: qemu-stable@nongnu.org
-Signed-off-by: Niek Linnenbank <nieklinnenbank@gmail.com>
+Fixes: 0b188ea05acb5 ("target/arm: Use tcg_constant in trans_CSEL")
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Message-id: 20230727103906.2641264-1-peter.maydell@linaro.org
 ---
- target/arm/arm-powerctl.c | 3 +++
+ target/arm/tcg/translate.c | 15 ++++++++-------
-file changed, 3 insertions(+)
+file changed, 8 insertions(+), 7 deletions(-)
-diff --git a/target/arm/arm-powerctl.c b/target/arm/arm-powerctl.c
+diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/arm-powerctl.c
+--- a/target/arm/tcg/translate.c
-+++ b/target/arm/arm-powerctl.c
++++ b/target/arm/tcg/translate.c
-@@ -XXX,XX +XXX,XX @@ static void arm_set_cpu_on_async_work(CPUState *target_cpu_state,
+@@ -XXX,XX +XXX,XX @@ static bool trans_IT(DisasContext *s, arg_IT *a)
-         target_cpu->env.regs[0] = info->context_id;
+ /* v8.1M CSEL/CSINC/CSNEG/CSINV */
  static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
  {
 -    TCGv_i32 rn, rm, zero;
 +    TCGv_i32 rn, rm;
      DisasCompare c;
      if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
@@ -XXX,XX +XXX,XX @@ static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
      }
-+    /* CP15 update requires rebuilding hflags */
+     /* In this insn input reg fields of 0b1111 mean "zero", not "PC" */
-+    arm_rebuild_hflags(&target_cpu->env);
+-    zero = tcg_constant_i32(0);
-+
++    rn = tcg_temp_new_i32();
-     /* Start the new CPU at the requested address */
++    rm = tcg_temp_new_i32();
-     cpu_set_pc(target_cpu_state, info->entry);
+     if (a->rn == 15) {
+-        rn = zero;
 +        tcg_gen_movi_i32(rn, 0);
      } else {
 -        rn = load_reg(s, a->rn);
 +        load_reg_var(s, rn, a->rn);
      }
      if (a->rm == 15) {
 -        rm = zero;
 +        tcg_gen_movi_i32(rm, 0);
      } else {
 -        rm = load_reg(s, a->rm);
 +        load_reg_var(s, rm, a->rm);
      }
      switch (a->op) {
@@ -XXX,XX +XXX,XX @@ static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
      }
      arm_test_cc(&c, a->fcond);
 -    tcg_gen_movcond_i32(c.cond, rn, c.value, zero, rn, rm);
 +    tcg_gen_movcond_i32(c.cond, rn, c.value, tcg_constant_i32(0), rn, rm);
      store_reg(s, a->rd, rn);
      return true;
 --
-.20.1
+.34.1

-[PULL 11/12] target/arm: Display helpful message when hflags mismatch
+[PULL 4/6] target/arm/tcg: Don't build AArch64 decodetree files for qemu-system-arm
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
+Currently we list all the Arm decodetree files together and add them
 unconditionally to arm_ss.  This means we build them for both
 qemu-system-aarch64 and qemu-system-arm.  However, some of them are
 AArch64-specific, so there is no need to build them for
 qemu-system-arm.  (Meson is smart enough to notice that the generated
 .c.inc file is not used by any objects that go into qemu-system-arm,
 so we only unnecessarily run decodetree, not anything more
 heavyweight like a recompile or relink, but it's still unnecessary
 work.)
-Instead of crashing in a confuse way, give some hint to the user
+Split gen into gen_a32 and gen_a64, and only add gen_a64 for
-about why we aborted. He might report the issue without having
+TARGET_AARCH64 compiles.
 to use a debugger.
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Message-id: 20191209134552.27733-1-philmd@redhat.com
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Tested-by: Niek Linnenbank <nieklinnenbank@gmail.com>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
+Message-id: 20230718104628.1137734-1-peter.maydell@linaro.org
 ---
- target/arm/helper.c | 18 +++++++++++++++---
+ target/arm/tcg/meson.build | 10 +++++++---
-file changed, 15 insertions(+), 3 deletions(-)
+file changed, 7 insertions(+), 3 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+diff --git a/target/arm/tcg/meson.build b/target/arm/tcg/meson.build
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/target/arm/tcg/meson.build
-+++ b/target/arm/helper.c
++++ b/target/arm/tcg/meson.build
-@@ -XXX,XX +XXX,XX @@ void HELPER(rebuild_hflags_a64)(CPUARMState *env, int el)
+@@ -XXX,XX +XXX,XX @@
-     env->hflags = rebuild_hflags_a64(env, el, fp_el, mmu_idx);
+-gen = [
- }
++gen_a64 = [
++  decodetree.process('a64.decode', extra_args: ['--static-decode=disas_a64']),
-+static inline void assert_hflags_rebuild_correctly(CPUARMState *env)
+   decodetree.process('sve.decode', extra_args: '--decode=disas_sve'),
-+{
+   decodetree.process('sme.decode', extra_args: '--decode=disas_sme'),
-+#ifdef CONFIG_DEBUG_TCG
+   decodetree.process('sme-fa64.decode', extra_args: '--static-decode=disas_sme_fa64'),
-+    uint32_t env_flags_current = env->hflags;
++]
 +    uint32_t env_flags_rebuilt = rebuild_hflags_internal(env);
 +
-+    if (unlikely(env_flags_current != env_flags_rebuilt)) {
++gen_a32 = [
-+        fprintf(stderr, "TCG hflags mismatch (current:0x%08x rebuilt:0x%08x)\n",
+   decodetree.process('neon-shared.decode', extra_args: '--decode=disas_neon_shared'),
-+                env_flags_current, env_flags_rebuilt);
+   decodetree.process('neon-dp.decode', extra_args: '--decode=disas_neon_dp'),
-+        abort();
+   decodetree.process('neon-ls.decode', extra_args: '--decode=disas_neon_ls'),
-+    }
+@@ -XXX,XX +XXX,XX @@ gen = [
-+#endif
+   decodetree.process('a32-uncond.decode', extra_args: '--static-decode=disas_a32_uncond'),
-+}
+   decodetree.process('t32.decode', extra_args: '--static-decode=disas_t32'),
-+
+   decodetree.process('t16.decode', extra_args: ['-w', '16', '--static-decode=disas_t16']),
- void cpu_get_tb_cpu_state(CPUARMState *env, target_ulong *pc,
+-  decodetree.process('a64.decode', extra_args: ['--static-decode=disas_a64']),
-                           target_ulong *cs_base, uint32_t *pflags)
+ ]
- {
-@@ -XXX,XX +XXX,XX @@ void cpu_get_tb_cpu_state(CPUARMState *env, target_ulong *pc,
+-arm_ss.add(gen)
-     uint32_t pstate_for_ss;
++arm_ss.add(gen_a32)
++arm_ss.add(when: 'TARGET_AARCH64', if_true: gen_a64)
-     *cs_base = 0;
--#ifdef CONFIG_DEBUG_TCG
+ arm_ss.add(files(
--    assert(flags == rebuild_hflags_internal(env));
+   'cpu32.c',
 -#endif
 +    assert_hflags_rebuild_correctly(env);
      if (FIELD_EX32(flags, TBFLAG_ANY, AARCH64_STATE)) {
          *pc = env->pc;
 --
-.20.1
+.34.1

-[PULL 05/12] hw/arm/smmuv3: Apply address mask to linear strtab base address
+[PULL 5/6] kvm: Fix crash due to access uninitialized kvm_state
-From: Simon Veith <sveith@amazon.de>
+From: Gavin Shan <gshan@redhat.com>
-In the SMMU_STRTAB_BASE register, the stream table base address only
+Runs into core dump on arm64 and the backtrace extracted from the
-occupies bits [51:6]. Other bits, such as RA (bit [62]), must be masked
+core dump is shown as below. It's caused by accessing uninitialized
-out to obtain the base address.
+@kvm_state in kvm_flush_coalesced_mmio_buffer() due to commit 176d073029
 ("hw/arm/virt: Use machine_memory_devices_init()"), where the machine's
 memory region is added earlier than before.
-The branch for 2-level stream tables correctly applies this mask by way
+    main
-of SMMU_BASE_ADDR_MASK, but the one for linear stream tables does not.
+    qemu_init
     configure_accelerators
     qemu_opts_foreach
     do_configure_accelerator
     accel_init_machine
     kvm_init
     virt_kvm_type
     virt_set_memmap
     machine_memory_devices_init
     memory_region_add_subregion
     memory_region_add_subregion_common
     memory_region_update_container_subregions
     memory_region_transaction_begin
     qemu_flush_coalesced_mmio_buffer
     kvm_flush_coalesced_mmio_buffer
-Apply the missing mask in that case as well so that the correct stream
+Fix it by bailing early in kvm_flush_coalesced_mmio_buffer() on the
-base address is used by guests which configure a linear stream table.
+uninitialized @kvm_state. With this applied, no crash is observed on
 arm64.
-Linux guests are unaffected by this change because they choose a 2-level
+Fixes: 176d073029 ("hw/arm/virt: Use machine_memory_devices_init()")
-stream table layout for the QEMU SMMUv3, based on the size of its stream
+Signed-off-by: Gavin Shan <gshan@redhat.com>
-ID space.
+Reviewed-by: David Hildenbrand <david@redhat.com>
+Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-ref. ARM IHI 0070C, section 6.3.23.
+Message-id: 20230731125946.2038742-1-gshan@redhat.com
 Signed-off-by: Simon Veith <sveith@amazon.de>
 Acked-by: Eric Auger <eric.auger@redhat.com>
 Tested-by: Eric Auger <eric.auger@redhat.com>
 Message-id: 1576509312-13083-2-git-send-email-sveith@amazon.de
 Cc: Eric Auger <eric.auger@redhat.com>
 Cc: qemu-devel@nongnu.org
 Cc: qemu-arm@nongnu.org
 Acked-by: Eric Auger <eric.auger@redhat.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/smmuv3.c | 2 +-
+ accel/kvm/kvm-all.c | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
+diff --git a/accel/kvm/kvm-all.c b/accel/kvm/kvm-all.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
+--- a/accel/kvm/kvm-all.c
-+++ b/hw/arm/smmuv3.c
++++ b/accel/kvm/kvm-all.c
-@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
+@@ -XXX,XX +XXX,XX @@ void kvm_flush_coalesced_mmio_buffer(void)
-         }
+ {
-         addr = l2ptr + l2_ste_offset * sizeof(*ste);
+     KVMState *s = kvm_state;
-     } else {
--        addr = s->strtab_base + sid * sizeof(*ste);
+-    if (s->coalesced_flush_in_progress) {
-+        addr = (s->strtab_base & SMMU_BASE_ADDR_MASK) + sid * sizeof(*ste);
++    if (!s || s->coalesced_flush_in_progress) {
          return;
      }
-     if (smmu_get_ste(s, addr, ste, event)) {
 --
-.20.1
+.34.1

-[PULL 06/12] hw/arm/smmuv3: Correct SMMU_BASE_ADDR_MASK value
+Deleted patch
-From: Simon Veith <sveith@amazon.de>
-There are two issues with the current value of SMMU_BASE_ADDR_MASK:
-- At the lower end, we are clearing bits [4:0]. Per the SMMUv3 spec,
-  we should also be treating bit 5 as zero in the base address.
-- At the upper end, we are clearing bits [63:48]. Per the SMMUv3 spec,
-  only bits [63:52] must be explicitly treated as zero.
-Update the SMMU_BASE_ADDR_MASK value to mask out bits [63:52] and [5:0].
-ref. ARM IHI 0070C, section 6.3.23.
-Signed-off-by: Simon Veith <sveith@amazon.de>
-Acked-by: Eric Auger <eric.auger@redhat.com>
-Tested-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 1576509312-13083-3-git-send-email-sveith@amazon.de
-Cc: Eric Auger <eric.auger@redhat.com>
-Cc: qemu-devel@nongnu.org
-Cc: qemu-arm@nongnu.org
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/smmuv3-internal.h | 2 +-
-file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3-internal.h
-+++ b/hw/arm/smmuv3-internal.h
-@@ -XXX,XX +XXX,XX @@ REG32(GERROR_IRQ_CFG2, 0x74)
- #define A_STRTAB_BASE      0x80 /* 64b */
--#define SMMU_BASE_ADDR_MASK 0xffffffffffe0
-+#define SMMU_BASE_ADDR_MASK 0xfffffffffffc0
- REG32(STRTAB_BASE_CFG,     0x88)
-     FIELD(STRTAB_BASE_CFG, FMT,      16, 2)
---
-.20.1

-[PULL 07/12] hw/arm/smmuv3: Check stream IDs against actual table LOG2SIZE
+Deleted patch
-From: Simon Veith <sveith@amazon.de>
-When checking whether a stream ID is in range of the stream table, we
-have so far been only checking it against our implementation limit
-(SMMU_IDR1_SIDSIZE). However, the guest can program the
-STRTAB_BASE_CFG.LOG2SIZE field to a size that is smaller than this
-limit.
-Check the stream ID against this limit as well to match the hardware
-behavior of raising C_BAD_STREAMID events in case the limit is exceeded.
-Also, ensure that we do not go one entry beyond the end of the table by
-checking that its index is strictly smaller than the table size.
-ref. ARM IHI 0070C, section 6.3.24.
-Signed-off-by: Simon Veith <sveith@amazon.de>
-Acked-by: Eric Auger <eric.auger@redhat.com>
-Tested-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 1576509312-13083-4-git-send-email-sveith@amazon.de
-Cc: Eric Auger <eric.auger@redhat.com>
-Cc: qemu-devel@nongnu.org
-Cc: qemu-arm@nongnu.org
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/smmuv3.c | 8 ++++++--
-file changed, 6 insertions(+), 2 deletions(-)
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
-+++ b/hw/arm/smmuv3.c
-@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
-                          SMMUEventInfo *event)
- {
-     dma_addr_t addr;
-+    uint32_t log2size;
-     int ret;
-     trace_smmuv3_find_ste(sid, s->features, s->sid_split);
--    /* Check SID range */
--    if (sid > (1 << SMMU_IDR1_SIDSIZE)) {
-+    log2size = FIELD_EX32(s->strtab_base_cfg, STRTAB_BASE_CFG, LOG2SIZE);
-+    /*
-+     * Check SID range against both guest-configured and implementation limits
-+     */
-+    if (sid >= (1 << MIN(log2size, SMMU_IDR1_SIDSIZE))) {
-         event->type = SMMU_EVT_C_BAD_STREAMID;
-         return -EINVAL;
-     }
---
-.20.1

-[PULL 09/12] hw/arm/smmuv3: Use correct bit positions in EVT_SET_ADDR2 macro
+Deleted patch
-From: Simon Veith <sveith@amazon.de>
-The bit offsets in the EVT_SET_ADDR2 macro do not match those specified
-in the ARM SMMUv3 Architecture Specification. In all events that use
-this macro, e.g. F_WALK_EABT, the faulting fetch address or IPA actually
-occupies the 32-bit words 6 and 7 in the event record contiguously, with
-the upper and lower unused bits clear due to alignment or maximum
-supported address bits. How many bits are clear depends on the
-individual event type.
-Update the macro to write to the correct words in the event record so
-that guest drivers can obtain accurate address information on events.
-ref. ARM IHI 0070C, sections 7.3.12 through 7.3.16.
-Signed-off-by: Simon Veith <sveith@amazon.de>
-Acked-by: Eric Auger <eric.auger@redhat.com>
-Tested-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 1576509312-13083-6-git-send-email-sveith@amazon.de
-Cc: Eric Auger <eric.auger@redhat.com>
-Cc: qemu-devel@nongnu.org
-Cc: qemu-arm@nongnu.org
-Acked-by: Eric Auger <eric.auger@redhat.com>
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/smmuv3-internal.h | 4 ++--
-file changed, 2 insertions(+), 2 deletions(-)
-diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3-internal.h
-+++ b/hw/arm/smmuv3-internal.h
-@@ -XXX,XX +XXX,XX @@ typedef struct SMMUEventInfo {
-     } while (0)
- #define EVT_SET_ADDR2(x, addr)                            \
-     do {                                                  \
--            (x)->word[7] = deposit32((x)->word[7], 3, 29, addr >> 16);   \
--            (x)->word[7] = deposit32((x)->word[7], 0, 16, addr & 0xffff);\
-+            (x)->word[7] = (uint32_t)(addr >> 32);        \
-+            (x)->word[6] = (uint32_t)(addr & 0xffffffff); \
-     } while (0)
- void smmuv3_record_event(SMMUv3State *s, SMMUEventInfo *event);
---
-.20.1

-[PULL 10/12] hw/arm/smmuv3: Report F_STE_FETCH fault address in correct word position
+[PULL 6/6] gdbstub: Fix client Ctrl-C handling
-From: Simon Veith <sveith@amazon.de>
+From: Nicholas Piggin <npiggin@gmail.com>
-The smmuv3_record_event() function that generates the F_STE_FETCH error
+The gdb remote protocol has a special interrupt character (0x03) that is
-uses the EVT_SET_ADDR macro to record the fetch address, placing it in
+transmitted outside the regular packet processing, and represents a
--bit words 4 and 5.
+Ctrl-C pressed in the client. Despite not being a regular packet, it
 does expect a regular stop response if the stub successfully stops the
 running program.
-The correct position for this address is in words 6 and 7, per the
+See: https://sourceware.org/gdb/onlinedocs/gdb/Interrupts.html
 SMMUv3 Architecture Specification.
-Update the function to use the EVT_SET_ADDR2 macro instead, which is the
+Inhibiting the stop reply packet can lead to gdb client hang. So permit
-macro intended for writing to these words.
+a stop response when receiving a character from gdb that stops the vm.
 Additionally, add a warning if that was not a 0x03 character, because
 the gdb session is likely to end up getting confused if this happens.
-ref. ARM IHI 0070C, section 7.3.4.
+Cc: qemu-stable@nongnu.org
+Fixes: 758370052fb ("gdbstub: only send stop-reply packets when allowed to")
-Signed-off-by: Simon Veith <sveith@amazon.de>
+Reported-by: Frederic Barrat <fbarrat@linux.ibm.com>
-Acked-by: Eric Auger <eric.auger@redhat.com>
+Signed-off-by: Nicholas Piggin <npiggin@gmail.com>
-Tested-by: Eric Auger <eric.auger@redhat.com>
+Tested-by: Joel Stanley <joel@jms.id.au>
-Message-id: 1576509312-13083-7-git-send-email-sveith@amazon.de
+Message-id: 20230711085903.304496-1-npiggin@gmail.com
 Cc: Eric Auger <eric.auger@redhat.com>
 Cc: qemu-devel@nongnu.org
 Cc: qemu-arm@nongnu.org
 Acked-by: Eric Auger <eric.auger@redhat.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/smmuv3.c | 2 +-
+ gdbstub/gdbstub.c | 13 +++++++++++--
-file changed, 1 insertion(+), 1 deletion(-)
+file changed, 11 insertions(+), 2 deletions(-)
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
+diff --git a/gdbstub/gdbstub.c b/gdbstub/gdbstub.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
+--- a/gdbstub/gdbstub.c
-+++ b/hw/arm/smmuv3.c
++++ b/gdbstub/gdbstub.c
-@@ -XXX,XX +XXX,XX @@ void smmuv3_record_event(SMMUv3State *s, SMMUEventInfo *info)
+@@ -XXX,XX +XXX,XX @@ void gdb_read_byte(uint8_t ch)
-     case SMMU_EVT_F_STE_FETCH:
+             return;
-         EVT_SET_SSID(&evt, info->u.f_ste_fetch.ssid);
+     }
-         EVT_SET_SSV(&evt,  info->u.f_ste_fetch.ssv);
+     if (runstate_is_running()) {
--        EVT_SET_ADDR(&evt, info->u.f_ste_fetch.addr);
+-        /* when the CPU is running, we cannot do anything except stop
-+        EVT_SET_ADDR2(&evt, info->u.f_ste_fetch.addr);
+-           it when receiving a char */
-         break;
++        /*
-     case SMMU_EVT_C_BAD_STE:
++         * When the CPU is running, we cannot do anything except stop
-         EVT_SET_SSID(&evt, info->u.c_bad_ste.ssid);
++         * it when receiving a char. This is expected on a Ctrl-C in the
 +         * gdb client. Because we are in all-stop mode, gdb sends a
 +         * 0x03 byte which is not a usual packet, so we handle it specially
 +         * here, but it does expect a stop reply.
 +         */
 +        if (ch != 0x03) {
 +            warn_report("gdbstub: client sent packet while target running\n");
 +        }
 +        gdbserver_state.allow_stop_reply = true;
          vm_stop(RUN_STATE_PAUSED);
      } else
  #endif
 --
-.20.1
+.34.1

One last arm pullreq before I stop work for the end of the year...

-- PMM

The following changes since commit 8e5943260a8f765216674ee87ce8588cc4e7463e:

Merge remote-tracking branch 'remotes/vivier2/tags/trivial-branch-pull-request' into staging (2019-12-20 12:46:10 +0000)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20191220

for you to fetch changes up to c8fa6079eb35888587f1be27c1590da4edcc5098:

arm/arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on() (2019-12-20 14:03:00 +0000)

----------------------------------------------------------------
target-arm queue:
 * Support emulating the generic timers at frequencies other than 62.5MHz
 * Various fixes for SMMUv3 emulation bugs
 * Improve assert error message for hflags mismatches
 * arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on()

----------------------------------------------------------------
Andrew Jeffery (4):
      target/arm: Remove redundant scaling of nexttick
      target/arm: Abstract the generic timer frequency
      target/arm: Prepare generic timer for per-platform CNTFRQ
      ast2600: Configure CNTFRQ at 1125MHz

Niek Linnenbank (1):
      arm/arm-powerctl: rebuild hflags after setting CP15 bits in arm_set_cpu_on()

Philippe Mathieu-Daudé (1):
      target/arm: Display helpful message when hflags mismatch

Simon Veith (6):
      hw/arm/smmuv3: Apply address mask to linear strtab base address
      hw/arm/smmuv3: Correct SMMU_BASE_ADDR_MASK value
      hw/arm/smmuv3: Check stream IDs against actual table LOG2SIZE
      hw/arm/smmuv3: Align stream table base address to table size
      hw/arm/smmuv3: Use correct bit positions in EVT_SET_ADDR2 macro
      hw/arm/smmuv3: Report F_STE_FETCH fault address in correct word position

hw/arm/smmuv3-internal.h  |  6 ++---
 target/arm/cpu.h          |  5 ++++
 hw/arm/aspeed_ast2600.c   |  3 +++
 hw/arm/smmuv3.c           | 28 +++++++++++++++-----
 target/arm/arm-powerctl.c |  3 +++
 target/arm/cpu.c          | 65 +++++++++++++++++++++++++++++++++++++++++------
 target/arm/helper.c       | 42 +++++++++++++++++++++++-------
 7 files changed, 125 insertions(+), 27 deletions(-)

From: Andrew Jeffery <andrew@aj.id.au>

The corner-case codepath was adjusting nexttick such that overflow
wouldn't occur when timer_mod() scaled the value back up. Remove a use
of GTIMER_SCALE and avoid unnecessary operations by calling
timer_mod_ns() directly.

Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Cédric Le Goater <clg@kaod.org>
Message-id: f8c680720e3abe55476e6d9cb604ad27fdbeb2e0.1576215453.git-series.andrew@aj.id.au
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/helper.c | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
          * timer expires we will reset the timer for any remaining period.
          */
         if (nexttick > INT64_MAX / GTIMER_SCALE) {
-            nexttick = INT64_MAX / GTIMER_SCALE;
+            timer_mod_ns(cpu->gt_timer[timeridx], INT64_MAX);
+        } else {
+            timer_mod(cpu->gt_timer[timeridx], nexttick);
         }
-        timer_mod(cpu->gt_timer[timeridx], nexttick);
         trace_arm_gt_recalc(timeridx, irqstate, nexttick);
     } else {
         /* Timer disabled: ISTATUS and timer output always clear */
-- 
2.20.1

From: Andrew Jeffery <andrew@aj.id.au>

Prepare for SoCs such as the ASPEED AST2600 whose firmware configures
CNTFRQ to values significantly larger than the static 62.5MHz value
currently derived from GTIMER_SCALE. As the OS potentially derives its
timer periods from the CNTFRQ value the lack of support for running
QEMUTimers at the appropriate rate leads to sticky behaviour in the
guest.

Substitute the GTIMER_SCALE constant with use of a helper to derive the
period from gt_cntfrq_hz stored in struct ARMCPU. Initially set
gt_cntfrq_hz to the frequency associated with GTIMER_SCALE so current
behaviour is maintained.

Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 40bd8df043f66e1ccfb3e9482999d099ac72bb2e.1576215453.git-series.andrew@aj.id.au
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/cpu.h    |  5 +++++
 target/arm/cpu.c    |  8 ++++++++
 target/arm/helper.c | 10 +++++++---
 3 files changed, 20 insertions(+), 3 deletions(-)

diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
      */
     DECLARE_BITMAP(sve_vq_map, ARM_MAX_VQ);
     DECLARE_BITMAP(sve_vq_init, ARM_MAX_VQ);
+
+    /* Generic timer counter frequency, in Hz */
+    uint64_t gt_cntfrq_hz;
 };
 
+unsigned int gt_cntfrq_period_ns(ARMCPU *cpu);
+
 void arm_cpu_post_init(Object *obj);
 
 uint64_t arm_cpu_mp_affinity(int idx, uint8_t clustersz);
diff --git a/target/arm/cpu.c b/target/arm/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.c
+++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_initfn(Object *obj)
     if (tcg_enabled()) {
         cpu->psci_version = 2; /* TCG implements PSCI 0.2 */
     }
+
+    cpu->gt_cntfrq_hz = NANOSECONDS_PER_SECOND / GTIMER_SCALE;
 }
 
 static Property arm_cpu_reset_cbar_property =
@@ -XXX,XX +XXX,XX @@ static void arm_set_init_svtor(Object *obj, Visitor *v, const char *name,
     visit_type_uint32(v, name, &cpu->init_svtor, errp);
 }
 
+unsigned int gt_cntfrq_period_ns(ARMCPU *cpu)
+{
+    return NANOSECONDS_PER_SECOND > cpu->gt_cntfrq_hz ?
+      NANOSECONDS_PER_SECOND / cpu->gt_cntfrq_hz : 1;
+}
+
 void arm_cpu_post_init(Object *obj)
 {
     ARMCPU *cpu = ARM_CPU(obj);
diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static CPAccessResult gt_stimer_access(CPUARMState *env,
 
 static uint64_t gt_get_countervalue(CPUARMState *env)
 {
-    return qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) / GTIMER_SCALE;
+    ARMCPU *cpu = env_archcpu(env);
+
+    return qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) / gt_cntfrq_period_ns(cpu);
 }
 
 static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
@@ -XXX,XX +XXX,XX @@ static void gt_recalc_timer(ARMCPU *cpu, int timeridx)
          * set the timer for as far in the future as possible. When the
          * timer expires we will reset the timer for any remaining period.
          */
-        if (nexttick > INT64_MAX / GTIMER_SCALE) {
+        if (nexttick > INT64_MAX / gt_cntfrq_period_ns(cpu)) {
             timer_mod_ns(cpu->gt_timer[timeridx], INT64_MAX);
         } else {
             timer_mod(cpu->gt_timer[timeridx], nexttick);
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
 
 static uint64_t gt_virt_cnt_read(CPUARMState *env, const ARMCPRegInfo *ri)
 {
+    ARMCPU *cpu = env_archcpu(env);
+
     /* Currently we have no support for QEMUTimer in linux-user so we
      * can't call gt_get_countervalue(env), instead we directly
      * call the lower level functions.
      */
-    return cpu_get_clock() / GTIMER_SCALE;
+    return cpu_get_clock() / gt_cntfrq_period_ns(cpu);
 }
 
 static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
-- 
2.20.1

From: Andrew Jeffery <andrew@aj.id.au>

The ASPEED AST2600 clocks the generic timer at the rate of HPLL. On
recent firmwares this is at 1125MHz, which is considerably quicker than
the assumed 62.5MHz of the current generic timer implementation. The
delta between the value as read from CNTFRQ and the true rate of the
underlying QEMUTimer leads to sticky behaviour in AST2600 guests.

Add a feature-gated property exposing CNTFRQ for ARM CPUs providing the
generic timer. This allows platforms to configure CNTFRQ (and the
associated QEMUTimer) to the appropriate frequency prior to starting the
guest.

As the platform can now determine the rate of CNTFRQ we're exposed to
limitations of QEMUTimer that didn't previously materialise: In the
course of emulation we need to arbitrarily and accurately convert
between guest ticks and time, but we're constrained by QEMUTimer's use
of an integer scaling factor. The effect is QEMUTimer cannot exactly
capture the period of frequencies that do not cleanly divide
NANOSECONDS_PER_SECOND for scaling ticks to time. As such, provide an
equally inaccurate scaling factor for scaling time to ticks so at least
a self-consistent inverse relationship holds.

Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: a22db9325f96e39f76e3c2baddcb712149f46bf2.1576215453.git-series.andrew@aj.id.au
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/cpu.c    | 61 +++++++++++++++++++++++++++++++++++++--------
 target/arm/helper.c |  9 ++++++-
 2 files changed, 59 insertions(+), 11 deletions(-)

diff --git a/target/arm/cpu.c b/target/arm/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.c
+++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_initfn(Object *obj)
     if (tcg_enabled()) {
         cpu->psci_version = 2; /* TCG implements PSCI 0.2 */
     }
-
-    cpu->gt_cntfrq_hz = NANOSECONDS_PER_SECOND / GTIMER_SCALE;
 }
 
+static Property arm_cpu_gt_cntfrq_property =
+            DEFINE_PROP_UINT64("cntfrq", ARMCPU, gt_cntfrq_hz,
+                               NANOSECONDS_PER_SECOND / GTIMER_SCALE);
+
 static Property arm_cpu_reset_cbar_property =
             DEFINE_PROP_UINT64("reset-cbar", ARMCPU, reset_cbar, 0);
 
@@ -XXX,XX +XXX,XX @@ static void arm_set_init_svtor(Object *obj, Visitor *v, const char *name,
 
 unsigned int gt_cntfrq_period_ns(ARMCPU *cpu)
 {
+    /*
+     * The exact approach to calculating guest ticks is:
+     *
+     *     muldiv64(qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL), cpu->gt_cntfrq_hz,
+     *              NANOSECONDS_PER_SECOND);
+     *
+     * We don't do that. Rather we intentionally use integer division
+     * truncation below and in the caller for the conversion of host monotonic
+     * time to guest ticks to provide the exact inverse for the semantics of
+     * the QEMUTimer scale factor. QEMUTimer's scale facter is an integer, so
+     * it loses precision when representing frequencies where
+     * `(NANOSECONDS_PER_SECOND % cpu->gt_cntfrq) > 0` holds. Failing to
+     * provide an exact inverse leads to scheduling timers with negative
+     * periods, which in turn leads to sticky behaviour in the guest.
+     *
+     * Finally, CNTFRQ is effectively capped at 1GHz to ensure our scale factor
+     * cannot become zero.
+     */
     return NANOSECONDS_PER_SECOND > cpu->gt_cntfrq_hz ?
       NANOSECONDS_PER_SECOND / cpu->gt_cntfrq_hz : 1;
 }
@@ -XXX,XX +XXX,XX @@ void arm_cpu_post_init(Object *obj)
 
     qdev_property_add_static(DEVICE(obj), &arm_cpu_cfgend_property,
                              &error_abort);
+
+    if (arm_feature(&cpu->env, ARM_FEATURE_GENERIC_TIMER)) {
+        qdev_property_add_static(DEVICE(cpu), &arm_cpu_gt_cntfrq_property,
+                                 &error_abort);
+    }
 }
 
 static void arm_cpu_finalizefn(Object *obj)
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
         }
     }
 
-    cpu->gt_timer[GTIMER_PHYS] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
-                                           arm_gt_ptimer_cb, cpu);
-    cpu->gt_timer[GTIMER_VIRT] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
-                                           arm_gt_vtimer_cb, cpu);
-    cpu->gt_timer[GTIMER_HYP] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
-                                          arm_gt_htimer_cb, cpu);
-    cpu->gt_timer[GTIMER_SEC] = timer_new(QEMU_CLOCK_VIRTUAL, GTIMER_SCALE,
-                                          arm_gt_stimer_cb, cpu);
+
+    {
+        uint64_t scale;
+
+        if (arm_feature(env, ARM_FEATURE_GENERIC_TIMER)) {
+            if (!cpu->gt_cntfrq_hz) {
+                error_setg(errp, "Invalid CNTFRQ: %"PRId64"Hz",
+                           cpu->gt_cntfrq_hz);
+                return;
+            }
+            scale = gt_cntfrq_period_ns(cpu);
+        } else {
+            scale = GTIMER_SCALE;
+        }
+
+        cpu->gt_timer[GTIMER_PHYS] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+                                               arm_gt_ptimer_cb, cpu);
+        cpu->gt_timer[GTIMER_VIRT] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+                                               arm_gt_vtimer_cb, cpu);
+        cpu->gt_timer[GTIMER_HYP] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+                                              arm_gt_htimer_cb, cpu);
+        cpu->gt_timer[GTIMER_SEC] = timer_new(QEMU_CLOCK_VIRTUAL, scale,
+                                              arm_gt_stimer_cb, cpu);
+    }
 #endif
 
     cpu_exec_realizefn(cs, &local_err);
diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ void arm_gt_stimer_cb(void *opaque)
     gt_recalc_timer(cpu, GTIMER_SEC);
 }
 
+static void arm_gt_cntfrq_reset(CPUARMState *env, const ARMCPRegInfo *opaque)
+{
+    ARMCPU *cpu = env_archcpu(env);
+
+    cpu->env.cp15.c14_cntfrq = cpu->gt_cntfrq_hz;
+}
+
 static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
     /* Note that CNTFRQ is purely reads-as-written for the benefit
      * of software; writing it doesn't actually change the timer frequency.
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo generic_timer_cp_reginfo[] = {
       .opc0 = 3, .opc1 = 3, .crn = 14, .crm = 0, .opc2 = 0,
       .access = PL1_RW | PL0_R, .accessfn = gt_cntfrq_access,
       .fieldoffset = offsetof(CPUARMState, cp15.c14_cntfrq),
-      .resetvalue = (1000 * 1000 * 1000) / GTIMER_SCALE,
+      .resetfn = arm_gt_cntfrq_reset,
     },
     /* overall control: mostly access permissions */
     { .name = "CNTKCTL", .state = ARM_CP_STATE_BOTH,
-- 
2.20.1

From: Andrew Jeffery <andrew@aj.id.au>

This matches the configuration set by u-boot on the AST2600.

Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Cédric Le Goater <clg@kaod.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 080ca1267a09381c43cf3c50d434fb6c186f2b6e.1576215453.git-series.andrew@aj.id.au
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/aspeed_ast2600.c | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/hw/arm/aspeed_ast2600.c b/hw/arm/aspeed_ast2600.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/aspeed_ast2600.c
+++ b/hw/arm/aspeed_ast2600.c
@@ -XXX,XX +XXX,XX @@ static void aspeed_soc_ast2600_realize(DeviceState *dev, Error **errp)
         object_property_set_int(OBJECT(&s->cpu[i]), aspeed_calc_affinity(i),
                                 "mp-affinity", &error_abort);
 
+        object_property_set_int(OBJECT(&s->cpu[i]), 1125000000, "cntfrq",
+                                &error_abort);
+
         /*
          * TODO: the secondary CPUs are started and a boot helper
          * is needed when using -kernel
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

In the SMMU_STRTAB_BASE register, the stream table base address only
occupies bits [51:6]. Other bits, such as RA (bit [62]), must be masked
out to obtain the base address.

The branch for 2-level stream tables correctly applies this mask by way
of SMMU_BASE_ADDR_MASK, but the one for linear stream tables does not.

Apply the missing mask in that case as well so that the correct stream
base address is used by guests which configure a linear stream table.

Linux guests are unaffected by this change because they choose a 2-level
stream table layout for the QEMU SMMUv3, based on the size of its stream
ID space.

ref. ARM IHI 0070C, section 6.3.23.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-2-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Acked-by: Eric Auger <eric.auger@redhat.com>
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3.c
+++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
         }
         addr = l2ptr + l2_ste_offset * sizeof(*ste);
     } else {
-        addr = s->strtab_base + sid * sizeof(*ste);
+        addr = (s->strtab_base & SMMU_BASE_ADDR_MASK) + sid * sizeof(*ste);
     }
 
     if (smmu_get_ste(s, addr, ste, event)) {
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

There are two issues with the current value of SMMU_BASE_ADDR_MASK:

- At the lower end, we are clearing bits [4:0]. Per the SMMUv3 spec,
  we should also be treating bit 5 as zero in the base address.
- At the upper end, we are clearing bits [63:48]. Per the SMMUv3 spec,
  only bits [63:52] must be explicitly treated as zero.

Update the SMMU_BASE_ADDR_MASK value to mask out bits [63:52] and [5:0].

ref. ARM IHI 0070C, section 6.3.23.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-3-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3-internal.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3-internal.h
+++ b/hw/arm/smmuv3-internal.h
@@ -XXX,XX +XXX,XX @@ REG32(GERROR_IRQ_CFG2, 0x74)
 
 #define A_STRTAB_BASE      0x80 /* 64b */
 
-#define SMMU_BASE_ADDR_MASK 0xffffffffffe0
+#define SMMU_BASE_ADDR_MASK 0xfffffffffffc0
 
 REG32(STRTAB_BASE_CFG,     0x88)
     FIELD(STRTAB_BASE_CFG, FMT,      16, 2)
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

When checking whether a stream ID is in range of the stream table, we
have so far been only checking it against our implementation limit
(SMMU_IDR1_SIDSIZE). However, the guest can program the
STRTAB_BASE_CFG.LOG2SIZE field to a size that is smaller than this
limit.

Check the stream ID against this limit as well to match the hardware
behavior of raising C_BAD_STREAMID events in case the limit is exceeded.
Also, ensure that we do not go one entry beyond the end of the table by
checking that its index is strictly smaller than the table size.

ref. ARM IHI 0070C, section 6.3.24.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-4-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3.c | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3.c
+++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
                          SMMUEventInfo *event)
 {
     dma_addr_t addr;
+    uint32_t log2size;
     int ret;
 
     trace_smmuv3_find_ste(sid, s->features, s->sid_split);
-    /* Check SID range */
-    if (sid > (1 << SMMU_IDR1_SIDSIZE)) {
+    log2size = FIELD_EX32(s->strtab_base_cfg, STRTAB_BASE_CFG, LOG2SIZE);
+    /*
+     * Check SID range against both guest-configured and implementation limits
+     */
+    if (sid >= (1 << MIN(log2size, SMMU_IDR1_SIDSIZE))) {
         event->type = SMMU_EVT_C_BAD_STREAMID;
         return -EINVAL;
     }
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

Per the specification, and as observed in hardware, the SMMUv3 aligns
the SMMU_STRTAB_BASE address to the size of the table by masking out the
respective least significant bits in the ADDR field.

Apply this masking logic to our smmu_find_ste() lookup function per the
specification.

ref. ARM IHI 0070C, section 6.3.23.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-5-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3.c | 18 ++++++++++++++----
 1 file changed, 14 insertions(+), 4 deletions(-)

diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3.c
+++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ bad_ste:
 static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
                          SMMUEventInfo *event)
 {
-    dma_addr_t addr;
+    dma_addr_t addr, strtab_base;
     uint32_t log2size;
+    int strtab_size_shift;
     int ret;
 
     trace_smmuv3_find_ste(sid, s->features, s->sid_split);
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
     }
     if (s->features & SMMU_FEATURE_2LVL_STE) {
         int l1_ste_offset, l2_ste_offset, max_l2_ste, span;
-        dma_addr_t strtab_base, l1ptr, l2ptr;
+        dma_addr_t l1ptr, l2ptr;
         STEDesc l1std;
 
-        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK;
+        /*
+         * Align strtab base address to table size. For this purpose, assume it
+         * is not bounded by SMMU_IDR1_SIDSIZE.
+         */
+        strtab_size_shift = MAX(5, (int)log2size - s->sid_split - 1 + 3);
+        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK &
+                      ~MAKE_64BIT_MASK(0, strtab_size_shift);
         l1_ste_offset = sid >> s->sid_split;
         l2_ste_offset = sid & ((1 << s->sid_split) - 1);
         l1ptr = (dma_addr_t)(strtab_base + l1_ste_offset * sizeof(l1std));
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
         }
         addr = l2ptr + l2_ste_offset * sizeof(*ste);
     } else {
-        addr = (s->strtab_base & SMMU_BASE_ADDR_MASK) + sid * sizeof(*ste);
+        strtab_size_shift = log2size + 5;
+        strtab_base = s->strtab_base & SMMU_BASE_ADDR_MASK &
+                      ~MAKE_64BIT_MASK(0, strtab_size_shift);
+        addr = strtab_base + sid * sizeof(*ste);
     }
 
     if (smmu_get_ste(s, addr, ste, event)) {
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

The bit offsets in the EVT_SET_ADDR2 macro do not match those specified
in the ARM SMMUv3 Architecture Specification. In all events that use
this macro, e.g. F_WALK_EABT, the faulting fetch address or IPA actually
occupies the 32-bit words 6 and 7 in the event record contiguously, with
the upper and lower unused bits clear due to alignment or maximum
supported address bits. How many bits are clear depends on the
individual event type.

Update the macro to write to the correct words in the event record so
that guest drivers can obtain accurate address information on events.

ref. ARM IHI 0070C, sections 7.3.12 through 7.3.16.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-6-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Acked-by: Eric Auger <eric.auger@redhat.com>
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3-internal.h | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3-internal.h
+++ b/hw/arm/smmuv3-internal.h
@@ -XXX,XX +XXX,XX @@ typedef struct SMMUEventInfo {
     } while (0)
 #define EVT_SET_ADDR2(x, addr)                            \
     do {                                                  \
-            (x)->word[7] = deposit32((x)->word[7], 3, 29, addr >> 16);   \
-            (x)->word[7] = deposit32((x)->word[7], 0, 16, addr & 0xffff);\
+            (x)->word[7] = (uint32_t)(addr >> 32);        \
+            (x)->word[6] = (uint32_t)(addr & 0xffffffff); \
     } while (0)
 
 void smmuv3_record_event(SMMUv3State *s, SMMUEventInfo *event);
-- 
2.20.1

From: Simon Veith <sveith@amazon.de>

The smmuv3_record_event() function that generates the F_STE_FETCH error
uses the EVT_SET_ADDR macro to record the fetch address, placing it in
32-bit words 4 and 5.

The correct position for this address is in words 6 and 7, per the
SMMUv3 Architecture Specification.

Update the function to use the EVT_SET_ADDR2 macro instead, which is the
macro intended for writing to these words.

ref. ARM IHI 0070C, section 7.3.4.

Signed-off-by: Simon Veith <sveith@amazon.de>
Acked-by: Eric Auger <eric.auger@redhat.com>
Tested-by: Eric Auger <eric.auger@redhat.com>
Message-id: 1576509312-13083-7-git-send-email-sveith@amazon.de
Cc: Eric Auger <eric.auger@redhat.com>
Cc: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Acked-by: Eric Auger <eric.auger@redhat.com>
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3.c
+++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ void smmuv3_record_event(SMMUv3State *s, SMMUEventInfo *info)
     case SMMU_EVT_F_STE_FETCH:
         EVT_SET_SSID(&evt, info->u.f_ste_fetch.ssid);
         EVT_SET_SSV(&evt,  info->u.f_ste_fetch.ssv);
-        EVT_SET_ADDR(&evt, info->u.f_ste_fetch.addr);
+        EVT_SET_ADDR2(&evt, info->u.f_ste_fetch.addr);
         break;
     case SMMU_EVT_C_BAD_STE:
         EVT_SET_SSID(&evt, info->u.c_bad_ste.ssid);
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

Instead of crashing in a confuse way, give some hint to the user
about why we aborted. He might report the issue without having
to use a debugger.

Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 20191209134552.27733-1-philmd@redhat.com
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Tested-by: Niek Linnenbank <nieklinnenbank@gmail.com>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/helper.c | 18 +++++++++++++++---
 1 file changed, 15 insertions(+), 3 deletions(-)

diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ void HELPER(rebuild_hflags_a64)(CPUARMState *env, int el)
     env->hflags = rebuild_hflags_a64(env, el, fp_el, mmu_idx);
 }
 
+static inline void assert_hflags_rebuild_correctly(CPUARMState *env)
+{
+#ifdef CONFIG_DEBUG_TCG
+    uint32_t env_flags_current = env->hflags;
+    uint32_t env_flags_rebuilt = rebuild_hflags_internal(env);
+
+    if (unlikely(env_flags_current != env_flags_rebuilt)) {
+        fprintf(stderr, "TCG hflags mismatch (current:0x%08x rebuilt:0x%08x)\n",
+                env_flags_current, env_flags_rebuilt);
+        abort();
+    }
+#endif
+}
+
 void cpu_get_tb_cpu_state(CPUARMState *env, target_ulong *pc,
                           target_ulong *cs_base, uint32_t *pflags)
 {
@@ -XXX,XX +XXX,XX @@ void cpu_get_tb_cpu_state(CPUARMState *env, target_ulong *pc,
     uint32_t pstate_for_ss;
 
     *cs_base = 0;
-#ifdef CONFIG_DEBUG_TCG
-    assert(flags == rebuild_hflags_internal(env));
-#endif
+    assert_hflags_rebuild_correctly(env);
 
     if (FIELD_EX32(flags, TBFLAG_ANY, AARCH64_STATE)) {
         *pc = env->pc;
-- 
2.20.1

From: Niek Linnenbank <nieklinnenbank@gmail.com>

After setting CP15 bits in arm_set_cpu_on() the cached hflags must
be rebuild to reflect the changed processor state. Without rebuilding,
the cached hflags would be inconsistent until the next call to
arm_rebuild_hflags(). When QEMU is compiled with debugging enabled
(--enable-debug), this problem is captured shortly after the first
call to arm_set_cpu_on() for CPUs running in ARM 32-bit non-secure mode:

qemu-system-arm: target/arm/helper.c:11359: cpu_get_tb_cpu_state:
  Assertion `flags == rebuild_hflags_internal(env)' failed.
  Aborted (core dumped)

Fixes: 0c7f8c43daf65
Cc: qemu-stable@nongnu.org
Signed-off-by: Niek Linnenbank <nieklinnenbank@gmail.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/arm-powerctl.c | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/target/arm/arm-powerctl.c b/target/arm/arm-powerctl.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/arm-powerctl.c
+++ b/target/arm/arm-powerctl.c
@@ -XXX,XX +XXX,XX @@ static void arm_set_cpu_on_async_work(CPUState *target_cpu_state,
         target_cpu->env.regs[0] = info->context_id;
     }
 
+    /* CP15 update requires rebuilding hflags */
+    arm_rebuild_hflags(&target_cpu->env);
+
     /* Start the new CPU at the requested address */
     cpu_set_pc(target_cpu_state, info->entry);
 
-- 
2.20.1

Hi; here's a target-arm pull for rc2. Four arm-related fixes,
and a couple of bug fixes for other areas of the codebase
that seemed like they'd fallen through the cracks.

thanks
-- PMM

The following changes since commit ccb86f079a9e4d94918086a9df18c1844347aff8:

Merge tag 'pull-nbd-2023-07-28' of https://repo.or.cz/qemu/ericb into staging (2023-07-28 09:56:57 -0700)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20230731

for you to fetch changes up to 108e8180c6b0c315711aa54e914030a313505c17:

gdbstub: Fix client Ctrl-C handling (2023-07-31 14:57:32 +0100)

----------------------------------------------------------------
target-arm queue:
 * Don't build AArch64 decodetree files for qemu-system-arm
 * Fix TCG assert in v8.1M CSEL etc
 * Fix MemOp for STGP
 * gdbstub: Fix client Ctrl-C handling
 * kvm: Fix crash due to access uninitialized kvm_state
 * elf2dmp: Don't abandon when Prcb is set to 0

----------------------------------------------------------------
Akihiko Odaki (1):
      elf2dmp: Don't abandon when Prcb is set to 0

Gavin Shan (1):
      kvm: Fix crash due to access uninitialized kvm_state

Nicholas Piggin (1):
      gdbstub: Fix client Ctrl-C handling

Peter Maydell (2):
      target/arm: Avoid writing to constant TCGv in trans_CSEL()
      target/arm/tcg: Don't build AArch64 decodetree files for qemu-system-arm

Richard Henderson (1):
      target/arm: Fix MemOp for STGP

From: Richard Henderson <richard.henderson@linaro.org>

When converting to decodetree, the code to rebuild mop for the pair
only made it into trans_STP and not into trans_STGP.

Resolves: https://gitlab.com/qemu-project/qemu/-/issues/1790
Fixes: 8c212eb6594 ("target/arm: Convert load/store-pair to decodetree")
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20230726165416.309624-1-richard.henderson@linaro.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/tcg/translate-a64.c | 21 ++++++++++++++++++---
 1 file changed, 18 insertions(+), 3 deletions(-)

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static bool trans_STGP(DisasContext *s, arg_ldstpair *a)
     MemOp mop;
     TCGv_i128 tmp;
 
+    /* STGP only comes in one size. */
+    tcg_debug_assert(a->sz == MO_64);
+
     if (!dc_isar_feature(aa64_mte_insn_reg, s)) {
         return false;
     }
@@ -XXX,XX +XXX,XX @@ static bool trans_STGP(DisasContext *s, arg_ldstpair *a)
         gen_helper_stg(cpu_env, dirty_addr, dirty_addr);
     }
 
-    mop = finalize_memop(s, a->sz);
-    clean_addr = gen_mte_checkN(s, dirty_addr, true, false, 2 << a->sz, mop);
+    mop = finalize_memop(s, MO_64);
+    clean_addr = gen_mte_checkN(s, dirty_addr, true, false, 2 << MO_64, mop);
 
     tcg_rt = cpu_reg(s, a->rt);
     tcg_rt2 = cpu_reg(s, a->rt2);
 
-    assert(a->sz == 3);
+    /*
+     * STGP is defined as two 8-byte memory operations and one tag operation.
+     * We implement it as one single 16-byte memory operation for convenience.
+     * Rebuild mop as for STP.
+     * TODO: The atomicity with LSE2 is stronger than required.
+     * Need a form of MO_ATOM_WITHIN16_PAIR that never requires
+     * 16-byte atomicity.
+     */
+    mop = MO_128;
+    if (s->align_mem) {
+        mop |= MO_ALIGN_8;
+    }
+    mop = finalize_memop_pair(s, mop);
 
     tmp = tcg_temp_new_i128();
     if (s->be_data == MO_LE) {
-- 
2.34.1

From: Akihiko Odaki <akihiko.odaki@daynix.com>

Prcb may be set to 0 for some CPUs if the dump was taken before they
start. The dump may still contain valuable information for started CPUs
so don't abandon conversion in such a case.

Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com>
Reviewed-by: Viktor Prutyanov <viktor.prutyanov@phystech.edu>
Message-id: 20230611033434.14659-1-akihiko.odaki@daynix.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 contrib/elf2dmp/main.c | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/contrib/elf2dmp/main.c b/contrib/elf2dmp/main.c
index XXXXXXX..XXXXXXX 100644
--- a/contrib/elf2dmp/main.c
+++ b/contrib/elf2dmp/main.c
@@ -XXX,XX +XXX,XX @@ static int fill_context(KDDEBUGGER_DATA64 *kdbg,
             return 1;
         }
 
+        if (!Prcb) {
+            eprintf("Context for CPU #%d is missing\n", i);
+            continue;
+        }
+
         if (va_space_rw(vs, Prcb + kdbg->OffsetPrcbContext,
                     &Context, sizeof(Context), 0)) {
             eprintf("Failed to read CPU #%d ContextFrame location\n", i);
-- 
2.34.1

In commit 0b188ea05acb5 we changed the implementation of
trans_CSEL() to use tcg_constant_i32(). However, this change
was incorrect, because the implementation of the function
sets up the TCGv_i32 rn and rm to be either zero or else
a TCG temp created in load_reg(), and these TCG temps are
then in both cases written to by the emitted TCG ops.
The result is that we hit a TCG assertion:

qemu-system-arm: ../../tcg/tcg.c:4455: tcg_reg_alloc_mov: Assertion `!temp_readonly(ots)' failed.

(or on a non-debug build, just produce a garbage result)

Adjust the code so that rn and rm are always writeable
temporaries whether the instruction is using the special
case "0" or a normal register as input.

Cc: qemu-stable@nongnu.org
Fixes: 0b188ea05acb5 ("target/arm: Use tcg_constant in trans_CSEL")
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20230727103906.2641264-1-peter.maydell@linaro.org
---
 target/arm/tcg/translate.c | 15 ++++++++-------
 1 file changed, 8 insertions(+), 7 deletions(-)

diff --git a/target/arm/tcg/translate.c b/target/arm/tcg/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/tcg/translate.c
+++ b/target/arm/tcg/translate.c
@@ -XXX,XX +XXX,XX @@ static bool trans_IT(DisasContext *s, arg_IT *a)
 /* v8.1M CSEL/CSINC/CSNEG/CSINV */
 static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
 {
-    TCGv_i32 rn, rm, zero;
+    TCGv_i32 rn, rm;
     DisasCompare c;
 
     if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
@@ -XXX,XX +XXX,XX @@ static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
     }
 
     /* In this insn input reg fields of 0b1111 mean "zero", not "PC" */
-    zero = tcg_constant_i32(0);
+    rn = tcg_temp_new_i32();
+    rm = tcg_temp_new_i32();
     if (a->rn == 15) {
-        rn = zero;
+        tcg_gen_movi_i32(rn, 0);
     } else {
-        rn = load_reg(s, a->rn);
+        load_reg_var(s, rn, a->rn);
     }
     if (a->rm == 15) {
-        rm = zero;
+        tcg_gen_movi_i32(rm, 0);
     } else {
-        rm = load_reg(s, a->rm);
+        load_reg_var(s, rm, a->rm);
     }
 
     switch (a->op) {
@@ -XXX,XX +XXX,XX @@ static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
     }
 
     arm_test_cc(&c, a->fcond);
-    tcg_gen_movcond_i32(c.cond, rn, c.value, zero, rn, rm);
+    tcg_gen_movcond_i32(c.cond, rn, c.value, tcg_constant_i32(0), rn, rm);
 
     store_reg(s, a->rd, rn);
     return true;
-- 
2.34.1

Currently we list all the Arm decodetree files together and add them
unconditionally to arm_ss.  This means we build them for both
qemu-system-aarch64 and qemu-system-arm.  However, some of them are
AArch64-specific, so there is no need to build them for
qemu-system-arm.  (Meson is smart enough to notice that the generated
.c.inc file is not used by any objects that go into qemu-system-arm,
so we only unnecessarily run decodetree, not anything more
heavyweight like a recompile or relink, but it's still unnecessary
work.)

Split gen into gen_a32 and gen_a64, and only add gen_a64 for
TARGET_AARCH64 compiles.

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Message-id: 20230718104628.1137734-1-peter.maydell@linaro.org
---
 target/arm/tcg/meson.build | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/target/arm/tcg/meson.build b/target/arm/tcg/meson.build
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/tcg/meson.build
+++ b/target/arm/tcg/meson.build
@@ -XXX,XX +XXX,XX @@
-gen = [
+gen_a64 = [
+  decodetree.process('a64.decode', extra_args: ['--static-decode=disas_a64']),
   decodetree.process('sve.decode', extra_args: '--decode=disas_sve'),
   decodetree.process('sme.decode', extra_args: '--decode=disas_sme'),
   decodetree.process('sme-fa64.decode', extra_args: '--static-decode=disas_sme_fa64'),
+]
+
+gen_a32 = [
   decodetree.process('neon-shared.decode', extra_args: '--decode=disas_neon_shared'),
   decodetree.process('neon-dp.decode', extra_args: '--decode=disas_neon_dp'),
   decodetree.process('neon-ls.decode', extra_args: '--decode=disas_neon_ls'),
@@ -XXX,XX +XXX,XX @@ gen = [
   decodetree.process('a32-uncond.decode', extra_args: '--static-decode=disas_a32_uncond'),
   decodetree.process('t32.decode', extra_args: '--static-decode=disas_t32'),
   decodetree.process('t16.decode', extra_args: ['-w', '16', '--static-decode=disas_t16']),
-  decodetree.process('a64.decode', extra_args: ['--static-decode=disas_a64']),
 ]
 
-arm_ss.add(gen)
+arm_ss.add(gen_a32)
+arm_ss.add(when: 'TARGET_AARCH64', if_true: gen_a64)
 
 arm_ss.add(files(
   'cpu32.c',
-- 
2.34.1

From: Gavin Shan <gshan@redhat.com>

Runs into core dump on arm64 and the backtrace extracted from the
core dump is shown as below. It's caused by accessing uninitialized
@kvm_state in kvm_flush_coalesced_mmio_buffer() due to commit 176d073029
("hw/arm/virt: Use machine_memory_devices_init()"), where the machine's
memory region is added earlier than before.

main
    qemu_init
    configure_accelerators
    qemu_opts_foreach
    do_configure_accelerator
    accel_init_machine
    kvm_init
    virt_kvm_type
    virt_set_memmap
    machine_memory_devices_init
    memory_region_add_subregion
    memory_region_add_subregion_common
    memory_region_update_container_subregions
    memory_region_transaction_begin
    qemu_flush_coalesced_mmio_buffer
    kvm_flush_coalesced_mmio_buffer

Fix it by bailing early in kvm_flush_coalesced_mmio_buffer() on the
uninitialized @kvm_state. With this applied, no crash is observed on
arm64.

Fixes: 176d073029 ("hw/arm/virt: Use machine_memory_devices_init()")
Signed-off-by: Gavin Shan <gshan@redhat.com>
Reviewed-by: David Hildenbrand <david@redhat.com>
Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Message-id: 20230731125946.2038742-1-gshan@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 accel/kvm/kvm-all.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/accel/kvm/kvm-all.c b/accel/kvm/kvm-all.c
index XXXXXXX..XXXXXXX 100644
--- a/accel/kvm/kvm-all.c
+++ b/accel/kvm/kvm-all.c
@@ -XXX,XX +XXX,XX @@ void kvm_flush_coalesced_mmio_buffer(void)
 {
     KVMState *s = kvm_state;
 
-    if (s->coalesced_flush_in_progress) {
+    if (!s || s->coalesced_flush_in_progress) {
         return;
     }
 
-- 
2.34.1

From: Nicholas Piggin <npiggin@gmail.com>

The gdb remote protocol has a special interrupt character (0x03) that is
transmitted outside the regular packet processing, and represents a
Ctrl-C pressed in the client. Despite not being a regular packet, it
does expect a regular stop response if the stub successfully stops the
running program.

See: https://sourceware.org/gdb/onlinedocs/gdb/Interrupts.html

Inhibiting the stop reply packet can lead to gdb client hang. So permit
a stop response when receiving a character from gdb that stops the vm.
Additionally, add a warning if that was not a 0x03 character, because
the gdb session is likely to end up getting confused if this happens.

Cc: qemu-stable@nongnu.org
Fixes: 758370052fb ("gdbstub: only send stop-reply packets when allowed to")
Reported-by: Frederic Barrat <fbarrat@linux.ibm.com>
Signed-off-by: Nicholas Piggin <npiggin@gmail.com>
Tested-by: Joel Stanley <joel@jms.id.au>
Message-id: 20230711085903.304496-1-npiggin@gmail.com
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 gdbstub/gdbstub.c | 13 +++++++++++--
 1 file changed, 11 insertions(+), 2 deletions(-)

diff --git a/gdbstub/gdbstub.c b/gdbstub/gdbstub.c
index XXXXXXX..XXXXXXX 100644
--- a/gdbstub/gdbstub.c
+++ b/gdbstub/gdbstub.c
@@ -XXX,XX +XXX,XX @@ void gdb_read_byte(uint8_t ch)
             return;
     }
     if (runstate_is_running()) {
-        /* when the CPU is running, we cannot do anything except stop
-           it when receiving a char */
+        /*
+         * When the CPU is running, we cannot do anything except stop
+         * it when receiving a char. This is expected on a Ctrl-C in the
+         * gdb client. Because we are in all-stop mode, gdb sends a
+         * 0x03 byte which is not a usual packet, so we handle it specially
+         * here, but it does expect a stop reply.
+         */
+        if (ch != 0x03) {
+            warn_report("gdbstub: client sent packet while target running\n");
+        }
+        gdbserver_state.allow_stop_reply = true;
         vm_stop(RUN_STATE_PAUSED);
     } else
 #endif
-- 
2.34.1