Series comparison

-[PULL 0/4] tcg patch queue
+[PULL v2 00/58] tcg patch queue
-Pretty small still, but there are two patches that ought
+v2: Rebase and resolve target/loongarch conflicts.
-to get backported to stable, so no point in delaying.
+    Include linux-user/aarch64 vdso fix.
 r~
-The following changes since commit a5ba0a7e4e150d1350a041f0d0ef9ca6c8d7c307:
+The following changes since commit 29b008927ef6e3fbb70e6607b25d3fcae26a5190:
-  Merge tag 'pull-aspeed-20241211' of https://github.com/legoater/qemu into staging (2024-12-11 15:16:47 +0000)
+  Merge tag 'pull-nic-config-2-20240202' of git://git.infradead.org/users/dwmw2/qemu into staging (2024-02-02 16:47:36 +0000)
 are available in the Git repository at:
-  https://gitlab.com/rth7680/qemu.git tags/pull-tcg-20241212
+  https://gitlab.com/rth7680/qemu.git tags/pull-tcg-20240202-2
-for you to fetch changes up to 7ac87b14a92234b6a89b701b4043ad6cf8bdcccf:
+for you to fetch changes up to 6400be014f80e4c2c246eb8be709ea3a96428233:
-  target/sparc: Use memcpy() and remove memcpy32() (2024-12-12 14:28:38 -0600)
+  linux-user/aarch64: Add padding before __kernel_rt_sigreturn (2024-02-03 16:46:10 +1000)
 ----------------------------------------------------------------
-tcg: Reset free_temps before tcg_optimize
+tests/tcg: Fix multiarch/gdbstub/prot-none.py
-tcg/riscv: Fix StoreStore barrier generation
+hw/core: Convert cpu_mmu_index to a CPUClass hook
-include/exec: Introduce fpst alias in helper-head.h.inc
+tcg/loongarch64: Set vector registers call clobbered
-target/sparc: Use memcpy() and remove memcpy32()
+target/sparc: floating-point cleanup
 linux-user/aarch64: Add padding before __kernel_rt_sigreturn
 ----------------------------------------------------------------
-Philippe Mathieu-Daudé (1):
+Ilya Leoshkevich (1):
-      target/sparc: Use memcpy() and remove memcpy32()
+      tests/tcg: Fix the /proc/self/mem probing in the PROT_NONE gdbstub test
-Richard Henderson (2):
+Richard Henderson (57):
-      tcg: Reset free_temps before tcg_optimize
+      include/hw/core: Add mmu_index to CPUClass
-      include/exec: Introduce fpst alias in helper-head.h.inc
+      target/alpha: Split out alpha_env_mmu_index
       target/alpha: Populate CPUClass.mmu_index
       target/arm: Split out arm_env_mmu_index
       target/arm: Populate CPUClass.mmu_index
       target/avr: Populate CPUClass.mmu_index
       target/cris: Cache mem_index in DisasContext
       target/cris: Populate CPUClass.mmu_index
       target/hppa: Populate CPUClass.mmu_index
       target/i386: Populate CPUClass.mmu_index
       target/loongarch: Populate CPUClass.mmu_index
       target/loongarch: Rename MMU_IDX_*
       target/m68k: Populate CPUClass.mmu_index
       target/microblaze: Populate CPUClass.mmu_index
       target/mips: Pass ptw_mmu_idx down from mips_cpu_tlb_fill
       target/mips: Split out mips_env_mmu_index
       target/mips: Populate CPUClass.mmu_index
       target/nios2: Populate CPUClass.mmu_index
       target/openrisc: Populate CPUClass.mmu_index
       target/ppc: Split out ppc_env_mmu_index
       target/ppc: Populate CPUClass.mmu_index
       target/riscv: Rename riscv_cpu_mmu_index to riscv_env_mmu_index
       target/riscv: Replace cpu_mmu_index with riscv_env_mmu_index
       target/riscv: Populate CPUClass.mmu_index
       target/rx: Populate CPUClass.mmu_index
       target/s390x: Split out s390x_env_mmu_index
       target/s390x: Populate CPUClass.mmu_index
       target/sh4: Populate CPUClass.mmu_index
       target/sparc: Populate CPUClass.mmu_index
       target/tricore: Populate CPUClass.mmu_index
       target/xtensa: Populate CPUClass.mmu_index
       include/exec: Implement cpu_mmu_index generically
       include/exec: Change cpu_mmu_index argument to CPUState
       tcg/loongarch64: Set vector registers call clobbered
       target/sparc: Use tcg_gen_qemu_{ld, st}_i128 for ASI_M_BCOPY
       target/sparc: Use tcg_gen_qemu_{ld, st}_i128 for ASI_M_BFILL
       target/sparc: Remove gen_dest_fpr_F
       target/sparc: Introduce gen_{load,store}_fpr_Q
       target/sparc: Inline FNEG, FABS
       target/sparc: Use i128 for FSQRTq
       target/sparc: Use i128 for FADDq, FSUBq, FMULq, FDIVq
       target/sparc: Use i128 for FqTOs, FqTOi
       target/sparc: Use i128 for FqTOd, FqTOx
       target/sparc: Use i128 for FCMPq, FCMPEq
       target/sparc: Use i128 for FsTOq, FiTOq
       target/sparc: Use i128 for FdTOq, FxTOq
       target/sparc: Use i128 for Fdmulq
       target/sparc: Remove qt0, qt1 temporaries
       target/sparc: Introduce cpu_get_fsr, cpu_put_fsr
       target/sparc: Split ver from env->fsr
       target/sparc: Clear cexc and ftt in do_check_ieee_exceptions
       target/sparc: Merge check_ieee_exceptions with FPop helpers
       target/sparc: Split cexc and ftt from env->fsr
       target/sparc: Remove cpu_fsr
       target/sparc: Split fcc out of env->fsr
       target/sparc: Remove FSR_FTT_NMASK, FSR_FTT_CEXC_NMASK
       linux-user/aarch64: Add padding before __kernel_rt_sigreturn
-Roman Artemev (1):
+ include/exec/cpu-all.h                             |   4 +
-      tcg/riscv: Fix StoreStore barrier generation
+ include/exec/cpu-common.h                          |  21 +
+ include/hw/core/cpu.h                              |   3 +
- include/tcg/tcg-temp-internal.h |  6 ++++++
+ target/alpha/cpu.h                                 |   2 +-
- accel/tcg/plugin-gen.c          |  2 +-
+ target/arm/cpu.h                                   |  13 -
- target/sparc/win_helper.c       | 26 ++++++++------------------
+ target/arm/internals.h                             |   5 +
- tcg/tcg.c                       |  5 ++++-
+ target/avr/cpu.h                                   |   7 -
- include/exec/helper-head.h.inc  |  3 +++
+ target/cris/cpu.h                                  |   4 -
- tcg/riscv/tcg-target.c.inc      |  2 +-
+ target/hexagon/cpu.h                               |   9 -
-files changed, 23 insertions(+), 21 deletions(-)
+ target/hppa/cpu.h                                  |  13 -
+ target/i386/cpu.h                                  |   7 -
  target/loongarch/cpu.h                             |  18 +-
  target/m68k/cpu.h                                  |   4 -
  target/microblaze/cpu.h                            |  15 -
  target/mips/cpu.h                                  |   6 +-
  target/nios2/cpu.h                                 |   6 -
  target/openrisc/cpu.h                              |  12 -
  target/ppc/cpu.h                                   |   2 +-
  target/riscv/cpu.h                                 |   4 +-
  target/rx/cpu.h                                    |   5 -
  target/s390x/cpu.h                                 |   2 +-
  target/sh4/cpu.h                                   |  10 -
  target/sparc/cpu.h                                 |  69 +-
  target/sparc/helper.h                              | 116 ++-
  target/tricore/cpu.h                               |   5 -
  target/xtensa/cpu.h                                |   5 -
  accel/tcg/cputlb.c                                 |  22 +-
  linux-user/sparc/cpu_loop.c                        |   2 +-
  linux-user/sparc/signal.c                          |  14 +-
  semihosting/uaccess.c                              |   2 +-
  target/alpha/cpu.c                                 |   6 +
  target/alpha/translate.c                           |   2 +-
  target/arm/cpu.c                                   |   6 +
  target/arm/helper.c                                |   2 +-
  target/arm/tcg/helper-a64.c                        |   4 +-
  target/arm/tcg/mte_helper.c                        |  18 +-
  target/arm/tcg/sve_helper.c                        |   8 +-
  target/arm/tcg/tlb_helper.c                        |   2 +-
  target/avr/cpu.c                                   |   6 +
  target/cris/cpu.c                                  |   6 +
  target/cris/translate.c                            |  14 +-
  target/hppa/cpu.c                                  |  12 +
  target/hppa/mem_helper.c                           |   2 +-
  target/hppa/op_helper.c                            |   8 +-
  target/i386/cpu.c                                  |  10 +
  target/i386/tcg/translate.c                        |   2 +-
  target/loongarch/cpu.c                             |  11 +
  target/loongarch/cpu_helper.c                      |   6 +-
  target/loongarch/tcg/tlb_helper.c                  |   2 +-
  target/loongarch/tcg/translate.c                   |   2 +-
  target/m68k/cpu.c                                  |   6 +
  target/m68k/op_helper.c                            |   2 +-
  target/microblaze/cpu.c                            |  18 +-
  target/microblaze/helper.c                         |   3 +-
  target/microblaze/mmu.c                            |   2 +-
  target/microblaze/translate.c                      |   2 +-
  target/mips/cpu.c                                  |   6 +
  target/mips/sysemu/physaddr.c                      |   2 +-
  target/mips/tcg/msa_helper.c                       |  10 +-
  target/mips/tcg/sysemu/cp0_helper.c                |   2 +-
  target/mips/tcg/sysemu/special_helper.c            |   2 +-
  target/mips/tcg/sysemu/tlb_helper.c                |  34 +-
  target/nios2/cpu.c                                 |   7 +
  target/nios2/translate.c                           |   2 +-
  target/openrisc/cpu.c                              |  13 +
  target/openrisc/translate.c                        |   2 +-
  target/ppc/cpu_init.c                              |   8 +-
  target/ppc/mem_helper.c                            |  10 +-
  target/ppc/mmu_common.c                            |   4 +-
  target/riscv/cpu.c                                 |   6 +
  target/riscv/cpu_helper.c                          |   6 +-
  target/riscv/op_helper.c                           |   4 +-
  target/riscv/vector_helper.c                       |   9 +-
  target/rx/cpu.c                                    |   6 +
  target/s390x/cpu.c                                 |   6 +
  target/s390x/tcg/mem_helper.c                      |  34 +-
  target/sh4/cpu.c                                   |  16 +
  target/sparc/cpu.c                                 |  61 +-
  target/sparc/fop_helper.c                          | 510 +++++++------
  target/sparc/gdbstub.c                             |   8 +-
  target/sparc/ldst_helper.c                         |   5 +-
  target/sparc/machine.c                             |  36 +-
  target/sparc/mmu_helper.c                          |   2 +-
  target/sparc/translate.c                           | 799 +++++++--------------
  target/tricore/cpu.c                               |   6 +
  target/tricore/helper.c                            |   2 +-
  target/tricore/translate.c                         |   2 +-
  target/xtensa/cpu.c                                |   6 +
  target/xtensa/mmu_helper.c                         |   2 +-
  accel/tcg/ldst_common.c.inc                        |  42 +-
  target/cris/translate_v10.c.inc                    |   6 +-
  .../tcg/insn_trans/trans_privileged.c.inc          |   2 +-
  tcg/loongarch64/tcg-target.c.inc                   |   2 +-
  linux-user/aarch64/vdso-be.so                      | Bin 3216 -> 3224 bytes
  linux-user/aarch64/vdso-le.so                      | Bin 3216 -> 3224 bytes
  linux-user/aarch64/vdso.S                          |   4 +
  tests/tcg/multiarch/gdbstub/prot-none.py           |   2 +-
 files changed, 1064 insertions(+), 1191 deletions(-)

-[PULL 3/4] include/exec: Introduce fpst alias in helper-head.h.inc
+[PULL v2 12/58] target/loongarch: Rename MMU_IDX_*
-This allows targets to declare that the helper requires a
+The expected form is MMU_FOO_IDX, not MMU_IDX_FOO.
-float_status pointer and instead of a generic void pointer.
+Rename to match generic code.
 Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
 ---
- include/exec/helper-head.h.inc | 3 +++
+ target/loongarch/cpu.h                                 | 8 ++++----
-file changed, 3 insertions(+)
+ target/loongarch/cpu.c                                 | 2 +-
  target/loongarch/cpu_helper.c                          | 4 ++--
  target/loongarch/tcg/translate.c                       | 2 +-
  target/loongarch/tcg/insn_trans/trans_privileged.c.inc | 2 +-
 files changed, 9 insertions(+), 9 deletions(-)
-diff --git a/include/exec/helper-head.h.inc b/include/exec/helper-head.h.inc
+diff --git a/target/loongarch/cpu.h b/target/loongarch/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/exec/helper-head.h.inc
+--- a/target/loongarch/cpu.h
-+++ b/include/exec/helper-head.h.inc
++++ b/target/loongarch/cpu.h
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ struct LoongArchCPUClass {
- #define dh_alias_ptr ptr
+  */
- #define dh_alias_cptr ptr
+ #define MMU_PLV_KERNEL   0
- #define dh_alias_env ptr
+ #define MMU_PLV_USER     3
-+#define dh_alias_fpst ptr
+-#define MMU_IDX_KERNEL   MMU_PLV_KERNEL
- #define dh_alias_void void
+-#define MMU_IDX_USER     MMU_PLV_USER
- #define dh_alias_noreturn noreturn
+-#define MMU_IDX_DA       4
- #define dh_alias(t) glue(dh_alias_, t)
++#define MMU_KERNEL_IDX   MMU_PLV_KERNEL
-@@ -XXX,XX +XXX,XX @@
++#define MMU_USER_IDX     MMU_PLV_USER
- #define dh_ctype_ptr void *
++#define MMU_DA_IDX       4
- #define dh_ctype_cptr const void *
- #define dh_ctype_env CPUArchState *
+ int loongarch_cpu_mmu_index(CPUState *cs, bool ifetch);
-+#define dh_ctype_fpst float_status *
+ static inline int cpu_mmu_index(CPULoongArchState *env, bool ifetch)
- #define dh_ctype_void void
+ {
- #define dh_ctype_noreturn G_NORETURN void
+ #ifdef CONFIG_USER_ONLY
- #define dh_ctype(t) dh_ctype_##t
+-    return MMU_IDX_USER;
-@@ -XXX,XX +XXX,XX @@
++    return MMU_USER_IDX;
- #define dh_typecode_f64 dh_typecode_i64
+ #else
- #define dh_typecode_cptr dh_typecode_ptr
+     return loongarch_cpu_mmu_index(env_cpu(env), ifetch);
- #define dh_typecode_env dh_typecode_ptr
+ #endif
-+#define dh_typecode_fpst dh_typecode_ptr
+diff --git a/target/loongarch/cpu.c b/target/loongarch/cpu.c
- #define dh_typecode(t) dh_typecode_##t
+index XXXXXXX..XXXXXXX 100644
+--- a/target/loongarch/cpu.c
- #define dh_callflag_i32  0
++++ b/target/loongarch/cpu.c
@@ -XXX,XX +XXX,XX @@ int loongarch_cpu_mmu_index(CPUState *cs, bool ifetch)
      if (FIELD_EX64(env->CSR_CRMD, CSR_CRMD, PG)) {
          return FIELD_EX64(env->CSR_CRMD, CSR_CRMD, PLV);
      }
 -    return MMU_IDX_DA;
 +    return MMU_DA_IDX;
  }
  static void loongarch_la464_initfn(Object *obj)
 diff --git a/target/loongarch/cpu_helper.c b/target/loongarch/cpu_helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/loongarch/cpu_helper.c
 +++ b/target/loongarch/cpu_helper.c
@@ -XXX,XX +XXX,XX @@ int get_physical_address(CPULoongArchState *env, hwaddr *physical,
                           int *prot, target_ulong address,
                           MMUAccessType access_type, int mmu_idx)
  {
 -    int user_mode = mmu_idx == MMU_IDX_USER;
 -    int kernel_mode = mmu_idx == MMU_IDX_KERNEL;
 +    int user_mode = mmu_idx == MMU_USER_IDX;
 +    int kernel_mode = mmu_idx == MMU_KERNEL_IDX;
      uint32_t plv, base_c, base_v;
      int64_t addr_high;
      uint8_t da = FIELD_EX64(env->CSR_CRMD, CSR_CRMD, DA);
 diff --git a/target/loongarch/tcg/translate.c b/target/loongarch/tcg/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/loongarch/tcg/translate.c
 +++ b/target/loongarch/tcg/translate.c
@@ -XXX,XX +XXX,XX @@ static void loongarch_tr_init_disas_context(DisasContextBase *dcbase,
      if (ctx->base.tb->flags & HW_FLAGS_CRMD_PG) {
          ctx->mem_idx = ctx->plv;
      } else {
 -        ctx->mem_idx = MMU_IDX_DA;
 +        ctx->mem_idx = MMU_DA_IDX;
      }
      /* Bound the number of insns to execute to those left on the page.  */
 diff --git a/target/loongarch/tcg/insn_trans/trans_privileged.c.inc b/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
 index XXXXXXX..XXXXXXX 100644
 --- a/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
 +++ b/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
@@ -XXX,XX +XXX,XX @@ TRANS(iocsrwr_d, IOCSR, gen_iocsrwr, gen_helper_iocsrwr_d)
  static void check_mmu_idx(DisasContext *ctx)
  {
 -    if (ctx->mem_idx != MMU_IDX_DA) {
 +    if (ctx->mem_idx != MMU_DA_IDX) {
          tcg_gen_movi_tl(cpu_pc, ctx->base.pc_next + 4);
          ctx->base.is_jmp = DISAS_EXIT;
      }
 --
-.43.0
+.34.1

-[PULL 2/4] tcg/riscv: Fix StoreStore barrier generation
+[PULL v2 35/58] tcg/loongarch64: Set vector registers call clobbered
-From: Roman Artemev <roman.artemev@syntacore.com>
+Because there are more call clobbered registers than
 call saved registers, we begin with all registers as
 call clobbered and then reset those that are saved.
-On RISC-V to StoreStore barrier corresponds
+This was missed when we introduced the LSX support.
 `fence w, w` not `fence r, r`
 Cc: qemu-stable@nongnu.org
-Fixes: efbea94c76b ("tcg/riscv: Add slowpath load and store instructions")
+Fixes: 16288ded944 ("tcg/loongarch64: Lower basic tcg vec ops to LSX")
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2136
 Signed-off-by: Denis Tomashev <denis.tomashev@syntacore.com>
 Signed-off-by: Roman Artemev <roman.artemev@syntacore.com>
 Message-ID: <e2f2131e294a49e79959d4fa9ec02cf4@syntacore.com>
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Song Gao <gaosong@loongson.cn>
+Message-Id: <20240201233414.500588-1-richard.henderson@linaro.org>
 ---
- tcg/riscv/tcg-target.c.inc | 2 +-
+ tcg/loongarch64/tcg-target.c.inc | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/tcg/riscv/tcg-target.c.inc b/tcg/riscv/tcg-target.c.inc
+diff --git a/tcg/loongarch64/tcg-target.c.inc b/tcg/loongarch64/tcg-target.c.inc
 index XXXXXXX..XXXXXXX 100644
---- a/tcg/riscv/tcg-target.c.inc
+--- a/tcg/loongarch64/tcg-target.c.inc
-+++ b/tcg/riscv/tcg-target.c.inc
++++ b/tcg/loongarch64/tcg-target.c.inc
-@@ -XXX,XX +XXX,XX @@ static void tcg_out_mb(TCGContext *s, TCGArg a0)
+@@ -XXX,XX +XXX,XX @@ static void tcg_target_init(TCGContext *s)
-         insn |= 0x02100000;
+     tcg_target_available_regs[TCG_TYPE_I32] = ALL_GENERAL_REGS;
-     }
+     tcg_target_available_regs[TCG_TYPE_I64] = ALL_GENERAL_REGS;
-     if (a0 & TCG_MO_ST_ST) {
--        insn |= 0x02200000;
+-    tcg_target_call_clobber_regs = ALL_GENERAL_REGS;
-+        insn |= 0x01100000;
++    tcg_target_call_clobber_regs = ALL_GENERAL_REGS | ALL_VECTOR_REGS;
-     }
+     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S0);
-     tcg_out32(s, insn);
+     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S1);
- }
+     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S2);
 --
-.43.0
+.34.1

-[PULL 1/4] tcg: Reset free_temps before tcg_optimize
+[PULL v2 58/58] linux-user/aarch64: Add padding before __kernel_rt_sigreturn
-When allocating new temps during tcg_optmize, do not re-use
+Without this padding, an unwind through the signal handler
-any EBB temps that were used within the TB.  We do not have
+will pick up the unwind info for the preceding syscall.
 any idea what span of the TB in which the temp was live.
-Introduce tcg_temp_ebb_reset_freed and use before tcg_optimize,
+This fixes gcc's 30_threads/thread/native_handle/cancel.cc.
 as well as replacing the equivalent in plugin_gen_inject and
 tcg_func_start.
 Cc: qemu-stable@nongnu.org
-Fixes: fb04ab7ddd8 ("tcg/optimize: Lower TCG_COND_TST{EQ,NE} if unsupported")
+Fixes: ee95fae075c6 ("linux-user/aarch64: Add vdso")
-Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2711
+Resolves: https://linaro.atlassian.net/browse/GNU-974
 Reported-by: wannacu <wannacu2049@gmail.com>
 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
+Message-Id: <20240202034427.504686-1-richard.henderson@linaro.org>
 ---
- include/tcg/tcg-temp-internal.h | 6 ++++++
+ linux-user/aarch64/vdso-be.so | Bin 3216 -> 3224 bytes
- accel/tcg/plugin-gen.c          | 2 +-
+ linux-user/aarch64/vdso-le.so | Bin 3216 -> 3224 bytes
- tcg/tcg.c                       | 5 ++++-
+ linux-user/aarch64/vdso.S     |   4 ++++
-files changed, 11 insertions(+), 2 deletions(-)
+files changed, 4 insertions(+)
-diff --git a/include/tcg/tcg-temp-internal.h b/include/tcg/tcg-temp-internal.h
+diff --git a/linux-user/aarch64/vdso-be.so b/linux-user/aarch64/vdso-be.so
 index XXXXXXX..XXXXXXX 100755
 GIT binary patch
 delta 121
 zcmbOrIYV-SKI4pu2Kk&{7{Gw#%fuBAMC1c?^>~k}v|avdxNjSSLfftVb3bgJ!|2S&
 z_-6A1CJrVZc?IUH8G;R$7#SF@Om<{a*v!K!&BXX-vIe^~TWO|cva$K*Om;sOMw`hy
 ZxXl@VO#Z-a&zLdUfXALuXmSCM0s#EKC)of1
 delta 116
 zcmbOsIYDxQKI4Rm2Kk&H7{Gw#!^9O2L>8U?-5V_M@!kH(Sx4vJn|*ujLPgija~Pc&
 z8DDIEz{J5c`3;N8W)W6tCdL<&4cM*OEF8_<v%@zRviq?xT1-B`ZO-^%@(*r%#)Qch
 RJocPi5ThAdCO2?N002V6C;<Qf
 diff --git a/linux-user/aarch64/vdso-le.so b/linux-user/aarch64/vdso-le.so
 index XXXXXXX..XXXXXXX 100755
 GIT binary patch
 delta 129
 zcmbOrIYV-S2IGv0n)#exSQx<I%fyAxMZTVBQ(04AP_*V|Vxp|@=@;x8zb9;-!)U|E
 z_-6A>CVnO!c?IUH8G;R$7#SF@Om<{a*v!K!!o>JyvLd?^n`3BUW_royOm=q`Mw`hS
 dxy>1WOn%92&zLb;lgFM@hy!9z%j7~Xc>tTxDQW-!
 delta 108
 zcmbOsIYDxQ2IGW@n)#d`SQx<I!^DNpMK&+G&+g_}w9WI@dn@@euKVesZ-h6`VYFdn
 ze6jf^6F<}BH!LcfMOa0c7+*}*WOrgKEO1Fl%G+GX?#{w!F?lDqIpc@PAGz%r6DAw-
 M*fVlXF62=M06owo?*IS*
 diff --git a/linux-user/aarch64/vdso.S b/linux-user/aarch64/vdso.S
 index XXXXXXX..XXXXXXX 100644
---- a/include/tcg/tcg-temp-internal.h
+--- a/linux-user/aarch64/vdso.S
-+++ b/include/tcg/tcg-temp-internal.h
++++ b/linux-user/aarch64/vdso.S
-@@ -XXX,XX +XXX,XX @@ TCGv_i64 tcg_temp_ebb_new_i64(void);
+@@ -XXX,XX +XXX,XX @@ vdso_syscall __kernel_clock_getres, __NR_clock_getres
- TCGv_ptr tcg_temp_ebb_new_ptr(void);
+  * For now, elide the unwind info for __kernel_rt_sigreturn and rely on
- TCGv_i128 tcg_temp_ebb_new_i128(void);
+  * the libgcc fallback routine as we have always done.  This requires
+  * that the code sequence used be exact.
-+/* Forget all freed EBB temps, so that new allocations produce new temps. */
++ *
-+static inline void tcg_temp_ebb_reset_freed(TCGContext *s)
++ * Add a nop as a spacer to ensure that unwind does not pick up the
-+{
++ * unwind info from the preceding syscall.
-+    memset(s->free_temps, 0, sizeof(s->free_temps));
+  */
-+}
++    nop
-+
+ __kernel_rt_sigreturn:
- #endif /* TCG_TEMP_FREE_H */
+     /* No BTI C insn here -- we arrive via RET. */
-diff --git a/accel/tcg/plugin-gen.c b/accel/tcg/plugin-gen.c
+     mov    x8, #__NR_rt_sigreturn
 index XXXXXXX..XXXXXXX 100644
 --- a/accel/tcg/plugin-gen.c
 +++ b/accel/tcg/plugin-gen.c
@@ -XXX,XX +XXX,XX @@ static void plugin_gen_inject(struct qemu_plugin_tb *plugin_tb)
       * that might be live within the existing opcode stream.
       * The simplest solution is to release them all and create new.
       */
 -    memset(tcg_ctx->free_temps, 0, sizeof(tcg_ctx->free_temps));
 +    tcg_temp_ebb_reset_freed(tcg_ctx);
      QTAILQ_FOREACH_SAFE(op, &tcg_ctx->ops, link, next) {
          switch (op->opc) {
 diff --git a/tcg/tcg.c b/tcg/tcg.c
 index XXXXXXX..XXXXXXX 100644
 --- a/tcg/tcg.c
 +++ b/tcg/tcg.c
@@ -XXX,XX +XXX,XX @@ void tcg_func_start(TCGContext *s)
      s->nb_temps = s->nb_globals;
      /* No temps have been previously allocated for size or locality.  */
 -    memset(s->free_temps, 0, sizeof(s->free_temps));
 +    tcg_temp_ebb_reset_freed(s);
      /* No constant temps have been previously allocated. */
      for (int i = 0; i < TCG_TYPE_COUNT; ++i) {
@@ -XXX,XX +XXX,XX @@ int tcg_gen_code(TCGContext *s, TranslationBlock *tb, uint64_t pc_start)
      }
  #endif
 +    /* Do not reuse any EBB that may be allocated within the TB. */
 +    tcg_temp_ebb_reset_freed(s);
 +
      tcg_optimize(s);
      reachable_code_pass(s);
 --
-.43.0
+.34.1

-[PULL 4/4] target/sparc: Use memcpy() and remove memcpy32()
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@linaro.org>
-Rather than manually copying each register, use
-the libc memcpy(), which is well optimized nowadays.
-Suggested-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
-Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
-Message-ID: <20241205205418.67613-1-philmd@linaro.org>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
----
- target/sparc/win_helper.c | 26 ++++++++------------------
-file changed, 8 insertions(+), 18 deletions(-)
-diff --git a/target/sparc/win_helper.c b/target/sparc/win_helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/sparc/win_helper.c
-+++ b/target/sparc/win_helper.c
-@@ -XXX,XX +XXX,XX @@
- #include "exec/helper-proto.h"
- #include "trace.h"
--static inline void memcpy32(target_ulong *dst, const target_ulong *src)
--{
--    dst[0] = src[0];
--    dst[1] = src[1];
--    dst[2] = src[2];
--    dst[3] = src[3];
--    dst[4] = src[4];
--    dst[5] = src[5];
--    dst[6] = src[6];
--    dst[7] = src[7];
--}
--
- void cpu_set_cwp(CPUSPARCState *env, int new_cwp)
- {
-     /* put the modified wrap registers at their proper location */
-     if (env->cwp == env->nwindows - 1) {
--        memcpy32(env->regbase, env->regbase + env->nwindows * 16);
-+        memcpy(env->regbase, env->regbase + env->nwindows * 16,
-+               sizeof(env->gregs));
-     }
-     env->cwp = new_cwp;
-     /* put the wrap registers at their temporary location */
-     if (new_cwp == env->nwindows - 1) {
--        memcpy32(env->regbase + env->nwindows * 16, env->regbase);
-+        memcpy(env->regbase + env->nwindows * 16, env->regbase,
-+               sizeof(env->gregs));
-     }
-     env->regwptr = env->regbase + (new_cwp * 16);
- }
-@@ -XXX,XX +XXX,XX @@ void cpu_gl_switch_gregs(CPUSPARCState *env, uint32_t new_gl)
-     dst = get_gl_gregset(env, env->gl);
-     if (src != dst) {
--        memcpy32(dst, env->gregs);
--        memcpy32(env->gregs, src);
-+        memcpy(dst, env->gregs, sizeof(env->gregs));
-+        memcpy(env->gregs, src, sizeof(env->gregs));
-     }
- }
-@@ -XXX,XX +XXX,XX @@ void cpu_change_pstate(CPUSPARCState *env, uint32_t new_pstate)
-         /* Switch global register bank */
-         src = get_gregset(env, new_pstate_regs);
-         dst = get_gregset(env, pstate_regs);
--        memcpy32(dst, env->gregs);
--        memcpy32(env->gregs, src);
-+        memcpy(dst, env->gregs, sizeof(env->gregs));
-+        memcpy(env->gregs, src, sizeof(env->gregs));
-     } else {
-         trace_win_helper_no_switch_pstate(new_pstate_regs);
-     }
---
-.43.0

Pretty small still, but there are two patches that ought
to get backported to stable, so no point in delaying.

The following changes since commit a5ba0a7e4e150d1350a041f0d0ef9ca6c8d7c307:

Merge tag 'pull-aspeed-20241211' of https://github.com/legoater/qemu into staging (2024-12-11 15:16:47 +0000)

are available in the Git repository at:

https://gitlab.com/rth7680/qemu.git tags/pull-tcg-20241212

for you to fetch changes up to 7ac87b14a92234b6a89b701b4043ad6cf8bdcccf:

target/sparc: Use memcpy() and remove memcpy32() (2024-12-12 14:28:38 -0600)

----------------------------------------------------------------
tcg: Reset free_temps before tcg_optimize
tcg/riscv: Fix StoreStore barrier generation
include/exec: Introduce fpst alias in helper-head.h.inc
target/sparc: Use memcpy() and remove memcpy32()

----------------------------------------------------------------
Philippe Mathieu-Daudé (1):
      target/sparc: Use memcpy() and remove memcpy32()

Richard Henderson (2):
      tcg: Reset free_temps before tcg_optimize
      include/exec: Introduce fpst alias in helper-head.h.inc

Roman Artemev (1):
      tcg/riscv: Fix StoreStore barrier generation

When allocating new temps during tcg_optmize, do not re-use
any EBB temps that were used within the TB.  We do not have
any idea what span of the TB in which the temp was live.

Introduce tcg_temp_ebb_reset_freed and use before tcg_optimize,
as well as replacing the equivalent in plugin_gen_inject and
tcg_func_start.

Cc: qemu-stable@nongnu.org
Fixes: fb04ab7ddd8 ("tcg/optimize: Lower TCG_COND_TST{EQ,NE} if unsupported")
Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2711
Reported-by: wannacu <wannacu2049@gmail.com>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
---
 include/tcg/tcg-temp-internal.h | 6 ++++++
 accel/tcg/plugin-gen.c          | 2 +-
 tcg/tcg.c                       | 5 ++++-
 3 files changed, 11 insertions(+), 2 deletions(-)

diff --git a/include/tcg/tcg-temp-internal.h b/include/tcg/tcg-temp-internal.h
index XXXXXXX..XXXXXXX 100644
--- a/include/tcg/tcg-temp-internal.h
+++ b/include/tcg/tcg-temp-internal.h
@@ -XXX,XX +XXX,XX @@ TCGv_i64 tcg_temp_ebb_new_i64(void);
 TCGv_ptr tcg_temp_ebb_new_ptr(void);
 TCGv_i128 tcg_temp_ebb_new_i128(void);
 
+/* Forget all freed EBB temps, so that new allocations produce new temps. */
+static inline void tcg_temp_ebb_reset_freed(TCGContext *s)
+{
+    memset(s->free_temps, 0, sizeof(s->free_temps));
+}
+
 #endif /* TCG_TEMP_FREE_H */
diff --git a/accel/tcg/plugin-gen.c b/accel/tcg/plugin-gen.c
index XXXXXXX..XXXXXXX 100644
--- a/accel/tcg/plugin-gen.c
+++ b/accel/tcg/plugin-gen.c
@@ -XXX,XX +XXX,XX @@ static void plugin_gen_inject(struct qemu_plugin_tb *plugin_tb)
      * that might be live within the existing opcode stream.
      * The simplest solution is to release them all and create new.
      */
-    memset(tcg_ctx->free_temps, 0, sizeof(tcg_ctx->free_temps));
+    tcg_temp_ebb_reset_freed(tcg_ctx);
 
     QTAILQ_FOREACH_SAFE(op, &tcg_ctx->ops, link, next) {
         switch (op->opc) {
diff --git a/tcg/tcg.c b/tcg/tcg.c
index XXXXXXX..XXXXXXX 100644
--- a/tcg/tcg.c
+++ b/tcg/tcg.c
@@ -XXX,XX +XXX,XX @@ void tcg_func_start(TCGContext *s)
     s->nb_temps = s->nb_globals;
 
     /* No temps have been previously allocated for size or locality.  */
-    memset(s->free_temps, 0, sizeof(s->free_temps));
+    tcg_temp_ebb_reset_freed(s);
 
     /* No constant temps have been previously allocated. */
     for (int i = 0; i < TCG_TYPE_COUNT; ++i) {
@@ -XXX,XX +XXX,XX @@ int tcg_gen_code(TCGContext *s, TranslationBlock *tb, uint64_t pc_start)
     }
 #endif
 
+    /* Do not reuse any EBB that may be allocated within the TB. */
+    tcg_temp_ebb_reset_freed(s);
+
     tcg_optimize(s);
 
     reachable_code_pass(s);
-- 
2.43.0

From: Roman Artemev <roman.artemev@syntacore.com>

On RISC-V to StoreStore barrier corresponds
`fence w, w` not `fence r, r`

Cc: qemu-stable@nongnu.org
Fixes: efbea94c76b ("tcg/riscv: Add slowpath load and store instructions")
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Denis Tomashev <denis.tomashev@syntacore.com>
Signed-off-by: Roman Artemev <roman.artemev@syntacore.com>
Message-ID: <e2f2131e294a49e79959d4fa9ec02cf4@syntacore.com>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 tcg/riscv/tcg-target.c.inc | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tcg/riscv/tcg-target.c.inc b/tcg/riscv/tcg-target.c.inc
index XXXXXXX..XXXXXXX 100644
--- a/tcg/riscv/tcg-target.c.inc
+++ b/tcg/riscv/tcg-target.c.inc
@@ -XXX,XX +XXX,XX @@ static void tcg_out_mb(TCGContext *s, TCGArg a0)
         insn |= 0x02100000;
     }
     if (a0 & TCG_MO_ST_ST) {
-        insn |= 0x02200000;
+        insn |= 0x01100000;
     }
     tcg_out32(s, insn);
 }
-- 
2.43.0

This allows targets to declare that the helper requires a
float_status pointer and instead of a generic void pointer.

Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 include/exec/helper-head.h.inc | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/include/exec/helper-head.h.inc b/include/exec/helper-head.h.inc
index XXXXXXX..XXXXXXX 100644
--- a/include/exec/helper-head.h.inc
+++ b/include/exec/helper-head.h.inc
@@ -XXX,XX +XXX,XX @@
 #define dh_alias_ptr ptr
 #define dh_alias_cptr ptr
 #define dh_alias_env ptr
+#define dh_alias_fpst ptr
 #define dh_alias_void void
 #define dh_alias_noreturn noreturn
 #define dh_alias(t) glue(dh_alias_, t)
@@ -XXX,XX +XXX,XX @@
 #define dh_ctype_ptr void *
 #define dh_ctype_cptr const void *
 #define dh_ctype_env CPUArchState *
+#define dh_ctype_fpst float_status *
 #define dh_ctype_void void
 #define dh_ctype_noreturn G_NORETURN void
 #define dh_ctype(t) dh_ctype_##t
@@ -XXX,XX +XXX,XX @@
 #define dh_typecode_f64 dh_typecode_i64
 #define dh_typecode_cptr dh_typecode_ptr
 #define dh_typecode_env dh_typecode_ptr
+#define dh_typecode_fpst dh_typecode_ptr
 #define dh_typecode(t) dh_typecode_##t
 
 #define dh_callflag_i32  0
-- 
2.43.0

From: Philippe Mathieu-Daudé <philmd@linaro.org>

Rather than manually copying each register, use
the libc memcpy(), which is well optimized nowadays.

Suggested-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Message-ID: <20241205205418.67613-1-philmd@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/sparc/win_helper.c | 26 ++++++++------------------
 1 file changed, 8 insertions(+), 18 deletions(-)

diff --git a/target/sparc/win_helper.c b/target/sparc/win_helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/sparc/win_helper.c
+++ b/target/sparc/win_helper.c
@@ -XXX,XX +XXX,XX @@
 #include "exec/helper-proto.h"
 #include "trace.h"
 
-static inline void memcpy32(target_ulong *dst, const target_ulong *src)
-{
-    dst[0] = src[0];
-    dst[1] = src[1];
-    dst[2] = src[2];
-    dst[3] = src[3];
-    dst[4] = src[4];
-    dst[5] = src[5];
-    dst[6] = src[6];
-    dst[7] = src[7];
-}
-
 void cpu_set_cwp(CPUSPARCState *env, int new_cwp)
 {
     /* put the modified wrap registers at their proper location */
     if (env->cwp == env->nwindows - 1) {
-        memcpy32(env->regbase, env->regbase + env->nwindows * 16);
+        memcpy(env->regbase, env->regbase + env->nwindows * 16,
+               sizeof(env->gregs));
     }
     env->cwp = new_cwp;
 
     /* put the wrap registers at their temporary location */
     if (new_cwp == env->nwindows - 1) {
-        memcpy32(env->regbase + env->nwindows * 16, env->regbase);
+        memcpy(env->regbase + env->nwindows * 16, env->regbase,
+               sizeof(env->gregs));
     }
     env->regwptr = env->regbase + (new_cwp * 16);
 }
@@ -XXX,XX +XXX,XX @@ void cpu_gl_switch_gregs(CPUSPARCState *env, uint32_t new_gl)
     dst = get_gl_gregset(env, env->gl);
 
     if (src != dst) {
-        memcpy32(dst, env->gregs);
-        memcpy32(env->gregs, src);
+        memcpy(dst, env->gregs, sizeof(env->gregs));
+        memcpy(env->gregs, src, sizeof(env->gregs));
     }
 }
 
@@ -XXX,XX +XXX,XX @@ void cpu_change_pstate(CPUSPARCState *env, uint32_t new_pstate)
         /* Switch global register bank */
         src = get_gregset(env, new_pstate_regs);
         dst = get_gregset(env, pstate_regs);
-        memcpy32(dst, env->gregs);
-        memcpy32(env->gregs, src);
+        memcpy(dst, env->gregs, sizeof(env->gregs));
+        memcpy(env->gregs, src, sizeof(env->gregs));
     } else {
         trace_win_helper_no_switch_pstate(new_pstate_regs);
     }
-- 
2.43.0

v2: Rebase and resolve target/loongarch conflicts.
    Include linux-user/aarch64 vdso fix.

The following changes since commit 29b008927ef6e3fbb70e6607b25d3fcae26a5190:

Merge tag 'pull-nic-config-2-20240202' of git://git.infradead.org/users/dwmw2/qemu into staging (2024-02-02 16:47:36 +0000)

are available in the Git repository at:

https://gitlab.com/rth7680/qemu.git tags/pull-tcg-20240202-2

for you to fetch changes up to 6400be014f80e4c2c246eb8be709ea3a96428233:

linux-user/aarch64: Add padding before __kernel_rt_sigreturn (2024-02-03 16:46:10 +1000)

----------------------------------------------------------------
tests/tcg: Fix multiarch/gdbstub/prot-none.py
hw/core: Convert cpu_mmu_index to a CPUClass hook
tcg/loongarch64: Set vector registers call clobbered
target/sparc: floating-point cleanup
linux-user/aarch64: Add padding before __kernel_rt_sigreturn

----------------------------------------------------------------
Ilya Leoshkevich (1):
      tests/tcg: Fix the /proc/self/mem probing in the PROT_NONE gdbstub test

Richard Henderson (57):
      include/hw/core: Add mmu_index to CPUClass
      target/alpha: Split out alpha_env_mmu_index
      target/alpha: Populate CPUClass.mmu_index
      target/arm: Split out arm_env_mmu_index
      target/arm: Populate CPUClass.mmu_index
      target/avr: Populate CPUClass.mmu_index
      target/cris: Cache mem_index in DisasContext
      target/cris: Populate CPUClass.mmu_index
      target/hppa: Populate CPUClass.mmu_index
      target/i386: Populate CPUClass.mmu_index
      target/loongarch: Populate CPUClass.mmu_index
      target/loongarch: Rename MMU_IDX_*
      target/m68k: Populate CPUClass.mmu_index
      target/microblaze: Populate CPUClass.mmu_index
      target/mips: Pass ptw_mmu_idx down from mips_cpu_tlb_fill
      target/mips: Split out mips_env_mmu_index
      target/mips: Populate CPUClass.mmu_index
      target/nios2: Populate CPUClass.mmu_index
      target/openrisc: Populate CPUClass.mmu_index
      target/ppc: Split out ppc_env_mmu_index
      target/ppc: Populate CPUClass.mmu_index
      target/riscv: Rename riscv_cpu_mmu_index to riscv_env_mmu_index
      target/riscv: Replace cpu_mmu_index with riscv_env_mmu_index
      target/riscv: Populate CPUClass.mmu_index
      target/rx: Populate CPUClass.mmu_index
      target/s390x: Split out s390x_env_mmu_index
      target/s390x: Populate CPUClass.mmu_index
      target/sh4: Populate CPUClass.mmu_index
      target/sparc: Populate CPUClass.mmu_index
      target/tricore: Populate CPUClass.mmu_index
      target/xtensa: Populate CPUClass.mmu_index
      include/exec: Implement cpu_mmu_index generically
      include/exec: Change cpu_mmu_index argument to CPUState
      tcg/loongarch64: Set vector registers call clobbered
      target/sparc: Use tcg_gen_qemu_{ld, st}_i128 for ASI_M_BCOPY
      target/sparc: Use tcg_gen_qemu_{ld, st}_i128 for ASI_M_BFILL
      target/sparc: Remove gen_dest_fpr_F
      target/sparc: Introduce gen_{load,store}_fpr_Q
      target/sparc: Inline FNEG, FABS
      target/sparc: Use i128 for FSQRTq
      target/sparc: Use i128 for FADDq, FSUBq, FMULq, FDIVq
      target/sparc: Use i128 for FqTOs, FqTOi
      target/sparc: Use i128 for FqTOd, FqTOx
      target/sparc: Use i128 for FCMPq, FCMPEq
      target/sparc: Use i128 for FsTOq, FiTOq
      target/sparc: Use i128 for FdTOq, FxTOq
      target/sparc: Use i128 for Fdmulq
      target/sparc: Remove qt0, qt1 temporaries
      target/sparc: Introduce cpu_get_fsr, cpu_put_fsr
      target/sparc: Split ver from env->fsr
      target/sparc: Clear cexc and ftt in do_check_ieee_exceptions
      target/sparc: Merge check_ieee_exceptions with FPop helpers
      target/sparc: Split cexc and ftt from env->fsr
      target/sparc: Remove cpu_fsr
      target/sparc: Split fcc out of env->fsr
      target/sparc: Remove FSR_FTT_NMASK, FSR_FTT_CEXC_NMASK
      linux-user/aarch64: Add padding before __kernel_rt_sigreturn

The expected form is MMU_FOO_IDX, not MMU_IDX_FOO.
Rename to match generic code.

Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/loongarch/cpu.h                                 | 8 ++++----
 target/loongarch/cpu.c                                 | 2 +-
 target/loongarch/cpu_helper.c                          | 4 ++--
 target/loongarch/tcg/translate.c                       | 2 +-
 target/loongarch/tcg/insn_trans/trans_privileged.c.inc | 2 +-
 5 files changed, 9 insertions(+), 9 deletions(-)

diff --git a/target/loongarch/cpu.h b/target/loongarch/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/loongarch/cpu.h
+++ b/target/loongarch/cpu.h
@@ -XXX,XX +XXX,XX @@ struct LoongArchCPUClass {
  */
 #define MMU_PLV_KERNEL   0
 #define MMU_PLV_USER     3
-#define MMU_IDX_KERNEL   MMU_PLV_KERNEL
-#define MMU_IDX_USER     MMU_PLV_USER
-#define MMU_IDX_DA       4
+#define MMU_KERNEL_IDX   MMU_PLV_KERNEL
+#define MMU_USER_IDX     MMU_PLV_USER
+#define MMU_DA_IDX       4
 
 int loongarch_cpu_mmu_index(CPUState *cs, bool ifetch);
 static inline int cpu_mmu_index(CPULoongArchState *env, bool ifetch)
 {
 #ifdef CONFIG_USER_ONLY
-    return MMU_IDX_USER;
+    return MMU_USER_IDX;
 #else
     return loongarch_cpu_mmu_index(env_cpu(env), ifetch);
 #endif
diff --git a/target/loongarch/cpu.c b/target/loongarch/cpu.c
index XXXXXXX..XXXXXXX 100644
--- a/target/loongarch/cpu.c
+++ b/target/loongarch/cpu.c
@@ -XXX,XX +XXX,XX @@ int loongarch_cpu_mmu_index(CPUState *cs, bool ifetch)
     if (FIELD_EX64(env->CSR_CRMD, CSR_CRMD, PG)) {
         return FIELD_EX64(env->CSR_CRMD, CSR_CRMD, PLV);
     }
-    return MMU_IDX_DA;
+    return MMU_DA_IDX;
 }
 
 static void loongarch_la464_initfn(Object *obj)
diff --git a/target/loongarch/cpu_helper.c b/target/loongarch/cpu_helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/loongarch/cpu_helper.c
+++ b/target/loongarch/cpu_helper.c
@@ -XXX,XX +XXX,XX @@ int get_physical_address(CPULoongArchState *env, hwaddr *physical,
                          int *prot, target_ulong address,
                          MMUAccessType access_type, int mmu_idx)
 {
-    int user_mode = mmu_idx == MMU_IDX_USER;
-    int kernel_mode = mmu_idx == MMU_IDX_KERNEL;
+    int user_mode = mmu_idx == MMU_USER_IDX;
+    int kernel_mode = mmu_idx == MMU_KERNEL_IDX;
     uint32_t plv, base_c, base_v;
     int64_t addr_high;
     uint8_t da = FIELD_EX64(env->CSR_CRMD, CSR_CRMD, DA);
diff --git a/target/loongarch/tcg/translate.c b/target/loongarch/tcg/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/loongarch/tcg/translate.c
+++ b/target/loongarch/tcg/translate.c
@@ -XXX,XX +XXX,XX @@ static void loongarch_tr_init_disas_context(DisasContextBase *dcbase,
     if (ctx->base.tb->flags & HW_FLAGS_CRMD_PG) {
         ctx->mem_idx = ctx->plv;
     } else {
-        ctx->mem_idx = MMU_IDX_DA;
+        ctx->mem_idx = MMU_DA_IDX;
     }
 
     /* Bound the number of insns to execute to those left on the page.  */
diff --git a/target/loongarch/tcg/insn_trans/trans_privileged.c.inc b/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
index XXXXXXX..XXXXXXX 100644
--- a/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
+++ b/target/loongarch/tcg/insn_trans/trans_privileged.c.inc
@@ -XXX,XX +XXX,XX @@ TRANS(iocsrwr_d, IOCSR, gen_iocsrwr, gen_helper_iocsrwr_d)
 
 static void check_mmu_idx(DisasContext *ctx)
 {
-    if (ctx->mem_idx != MMU_IDX_DA) {
+    if (ctx->mem_idx != MMU_DA_IDX) {
         tcg_gen_movi_tl(cpu_pc, ctx->base.pc_next + 4);
         ctx->base.is_jmp = DISAS_EXIT;
     }
-- 
2.34.1

Because there are more call clobbered registers than
call saved registers, we begin with all registers as
call clobbered and then reset those that are saved.

This was missed when we introduced the LSX support.

Cc: qemu-stable@nongnu.org
Fixes: 16288ded944 ("tcg/loongarch64: Lower basic tcg vec ops to LSX")
Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2136
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Song Gao <gaosong@loongson.cn>
Message-Id: <20240201233414.500588-1-richard.henderson@linaro.org>
---
 tcg/loongarch64/tcg-target.c.inc | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tcg/loongarch64/tcg-target.c.inc b/tcg/loongarch64/tcg-target.c.inc
index XXXXXXX..XXXXXXX 100644
--- a/tcg/loongarch64/tcg-target.c.inc
+++ b/tcg/loongarch64/tcg-target.c.inc
@@ -XXX,XX +XXX,XX @@ static void tcg_target_init(TCGContext *s)
     tcg_target_available_regs[TCG_TYPE_I32] = ALL_GENERAL_REGS;
     tcg_target_available_regs[TCG_TYPE_I64] = ALL_GENERAL_REGS;
 
-    tcg_target_call_clobber_regs = ALL_GENERAL_REGS;
+    tcg_target_call_clobber_regs = ALL_GENERAL_REGS | ALL_VECTOR_REGS;
     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S0);
     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S1);
     tcg_regset_reset_reg(tcg_target_call_clobber_regs, TCG_REG_S2);
-- 
2.34.1

Without this padding, an unwind through the signal handler
will pick up the unwind info for the preceding syscall.

This fixes gcc's 30_threads/thread/native_handle/cancel.cc.

Cc: qemu-stable@nongnu.org
Fixes: ee95fae075c6 ("linux-user/aarch64: Add vdso")
Resolves: https://linaro.atlassian.net/browse/GNU-974
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
Message-Id: <20240202034427.504686-1-richard.henderson@linaro.org>
---
 linux-user/aarch64/vdso-be.so | Bin 3216 -> 3224 bytes
 linux-user/aarch64/vdso-le.so | Bin 3216 -> 3224 bytes
 linux-user/aarch64/vdso.S     |   4 ++++
 3 files changed, 4 insertions(+)

diff --git a/linux-user/aarch64/vdso-be.so b/linux-user/aarch64/vdso-be.so
index XXXXXXX..XXXXXXX 100755
GIT binary patch
delta 121
zcmbOrIYV-SKI4pu2Kk&{7{Gw#%fuBAMC1c?^>~k}v|avdxNjSSLfftVb3bgJ!|2S&
z_-6A1CJrVZc?IUH8G;R$7#SF@Om<{a*v!K!&BXX-vIe^~TWO|cva$K*Om;sOMw`hy
ZxXl@VO#Z-a&zLdUfXALuXmSCM0s#EKC)of1

delta 116
zcmbOsIYDxQKI4Rm2Kk&H7{Gw#!^9O2L>8U?-5V_M@!kH(Sx4vJn|*ujLPgija~Pc&
z8DDIEz{J5c`3;N8W)W6tCdL<&4cM*OEF8_<v%@zRviq?xT1-B`ZO-^%@(*r%#)Qch
RJocPi5ThAdCO2?N002V6C;<Qf

diff --git a/linux-user/aarch64/vdso-le.so b/linux-user/aarch64/vdso-le.so
index XXXXXXX..XXXXXXX 100755
GIT binary patch
delta 129
zcmbOrIYV-S2IGv0n)#exSQx<I%fyAxMZTVBQ(04AP_*V|Vxp|@=@;x8zb9;-!)U|E
z_-6A>CVnO!c?IUH8G;R$7#SF@Om<{a*v!K!!o>JyvLd?^n`3BUW_royOm=q`Mw`hS
dxy>1WOn%92&zLb;lgFM@hy!9z%j7~Xc>tTxDQW-!

delta 108
zcmbOsIYDxQ2IGW@n)#d`SQx<I!^DNpMK&+G&+g_}w9WI@dn@@euKVesZ-h6`VYFdn
ze6jf^6F<}BH!LcfMOa0c7+*}*WOrgKEO1Fl%G+GX?#{w!F?lDqIpc@PAGz%r6DAw-
M*fVlXF62=M06owo?*IS*

diff --git a/linux-user/aarch64/vdso.S b/linux-user/aarch64/vdso.S
index XXXXXXX..XXXXXXX 100644
--- a/linux-user/aarch64/vdso.S
+++ b/linux-user/aarch64/vdso.S
@@ -XXX,XX +XXX,XX @@ vdso_syscall __kernel_clock_getres, __NR_clock_getres
  * For now, elide the unwind info for __kernel_rt_sigreturn and rely on
  * the libgcc fallback routine as we have always done.  This requires
  * that the code sequence used be exact.
+ *
+ * Add a nop as a spacer to ensure that unwind does not pick up the
+ * unwind info from the preceding syscall.
  */
+	nop
 __kernel_rt_sigreturn:
 	/* No BTI C insn here -- we arrive via RET. */
 	mov	x8, #__NR_rt_sigreturn
-- 
2.34.1