Series comparison

-[Qemu-devel] [PULL 00/42] target-arm queue
+[PULL 00/36] target-arm queue
-Arm queue -- I have more stuff pending but I prefer to push
+First pullreq for 6.0: mostly my v8.1M work, plus some other
-this first lot out and keep the pull below 50 patches.
+bits and pieces. (I still have a lot of stuff in my to-review
-Most of this is Alex's FP16 support work.
+folder, which I may or may not get to before the Christmas break...)
+thanks
 -- PMM
+The following changes since commit 5e7b204dbfae9a562fc73684986f936b97f63877:
-The following changes since commit 6697439794f72b3501ee16bb95d16854f9981421:
+  Merge remote-tracking branch 'remotes/mst/tags/for_upstream' into staging (2020-12-09 20:08:54 +0000)
   Merge remote-tracking branch 'remotes/kraxel/tags/usb-20180227-pull-request' into staging (2018-02-27 17:50:46 +0000)
 are available in the Git repository at:
-  git://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20180301
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20201210
-for you to fetch changes up to c22e580c2ad1cccef582e1490e732f254d4ac064:
+for you to fetch changes up to 71f916be1c7e9ede0e37d9cabc781b5a9e8638ff:
-  MAINTAINERS: Update my email address (2018-03-01 11:13:59 +0000)
+  hw/arm/armv7m: Correct typo in QOM object name (2020-12-10 11:44:56 +0000)
 ----------------------------------------------------------------
 target-arm queue:
- * update MAINTAINERS for Alistair's new email address
+ * hw/arm/smmuv3: Fix up L1STD_SPAN decoding
- * add Arm v8.2 FP16 arithmetic extension for linux-user
+ * xlnx-zynqmp: Support Xilinx ZynqMP CAN controllers
- * implement display connector emulation for vexpress board
+ * sbsa-ref: allow to use Cortex-A53/57/72 cpus
- * xilinx_spips: Enable only two slaves when reading/writing with stripe
+ * Various minor code cleanups
- * xilinx_spips: Use 8 dummy cycles with the QIOR/QIOR4 commands
+ * hw/intc/armv7m_nvic: Make all of system PPB range be RAZWI/BusFault
- * hw: register: Run post_write hook on reset
+ * Implement more pieces of ARMv8.1M support
 ----------------------------------------------------------------
-Alex Bennée (31):
+Alex Chen (4):
-      include/exec/helper-head.h: support f16 in helper calls
+      i.MX25: Fix bad printf format specifiers
-      target/arm/cpu64: introduce ARM_V8_FP16 feature bit
+      i.MX31: Fix bad printf format specifiers
-      target/arm/cpu.h: update comment for half-precision values
+      i.MX6: Fix bad printf format specifiers
-      target/arm/cpu.h: add additional float_status flags
+      i.MX6ul: Fix bad printf format specifiers
       target/arm/helper: pass explicit fpst to set_rmode
       arm/translate-a64: implement half-precision F(MIN|MAX)(V|NMV)
       arm/translate-a64: handle_3same_64 comment fix
       arm/translate-a64: initial decode for simd_three_reg_same_fp16
       arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16
       arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16
       arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16
       arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16
       arm/translate-a64: add FP16 pairwise ops simd_three_reg_same_fp16
       arm/translate-a64: add FP16 FMULX/MLS/FMLA to simd_indexed
       arm/translate-a64: add FP16 x2 ops for simd_indexed
       arm/translate-a64: initial decode for simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16
       arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 FCMxx (zero) to simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 FNEG/FABS to simd_two_reg_misc_fp16
       arm/helper.c: re-factor recpe and add recepe_f16
       arm/translate-a64: add FP16 FRECPE
       arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16
       arm/helper.c: re-factor rsqrte and add rsqrte_f16
       arm/translate-a64: add FP16 FRSQRTE to simd_two_reg_misc_fp16
       arm/translate-a64: add FP16 FMOV to simd_mod_imm
       arm/translate-a64: add all FP16 ops in simd_scalar_pairwise
       arm/translate-a64: implement simd_scalar_three_reg_same_fp16
       arm/translate-a64: add all single op FP16 to handle_fp_1src_half
-Alistair Francis (2):
+Havard Skinnemoen (1):
-      hw: register: Run post_write hook on reset
+      tests/qtest/npcm7xx_rng-test: dump random data on failure
       MAINTAINERS: Update my email address
-Corey Minyard (2):
+Kunkun Jiang (1):
-      i2c: Fix some brace style issues
+      hw/arm/smmuv3: Fix up L1STD_SPAN decoding
       i2c: Move the bus class to i2c.h
-Francisco Iglesias (2):
+Marcin Juszkiewicz (1):
-      xilinx_spips: Enable only two slaves when reading/writing with stripe
+      sbsa-ref: allow to use Cortex-A53/57/72 cpus
       xilinx_spips: Use 8 dummy cycles with the QIOR/QIOR4 commands
-Linus Walleij (3):
+Peter Maydell (25):
-      hw/i2c-ddc: Do not fail writes
+      hw/intc/armv7m_nvic: Make all of system PPB range be RAZWI/BusFault
-      hw/sii9022: Add support for Silicon Image SII9022
+      target/arm: Implement v8.1M PXN extension
-      arm/vexpress: Add proper display connector emulation
+      target/arm: Don't clobber ID_PFR1.Security on M-profile cores
       target/arm: Implement VSCCLRM insn
       target/arm: Implement CLRM instruction
       target/arm: Enforce M-profile VMRS/VMSR register restrictions
       target/arm: Refactor M-profile VMSR/VMRS handling
       target/arm: Move general-use constant expanders up in translate.c
       target/arm: Implement VLDR/VSTR system register
       target/arm: Implement M-profile FPSCR_nzcvqc
       target/arm: Use new FPCR_NZCV_MASK constant
       target/arm: Factor out preserve-fp-state from full_vfp_access_check()
       target/arm: Implement FPCXT_S fp system register
       hw/intc/armv7m_nvic: Update FPDSCR masking for v8.1M
       target/arm: For v8.1M, always clear R0-R3, R12, APSR, EPSR on exception entry
       target/arm: In v8.1M, don't set HFSR.FORCED on vector table fetch failures
       target/arm: Implement v8.1M REVIDR register
       target/arm: Implement new v8.1M NOCP check for exception return
       target/arm: Implement new v8.1M VLLDM and VLSTM encodings
       hw/intc/armv7m_nvic: Support v8.1M CCR.TRD bit
       target/arm: Implement CCR_S.TRD behaviour for SG insns
       hw/intc/armv7m_nvic: Fix "return from inactive handler" check
       target/arm: Implement M-profile "minimal RAS implementation"
       hw/intc/armv7m_nvic: Implement read/write for RAS register block
       hw/arm/armv7m: Correct typo in QOM object name
-Peter Maydell (2):
+Vikram Garhwal (4):
-      target/arm: Enable ARM_V8_FP16 feature bit for the AArch64 "any" CPU
+      hw/net/can: Introduce Xilinx ZynqMP CAN controller
-      linux-user: Report AArch64 FP16 support via hwcap bits
+      xlnx-zynqmp: Connect Xilinx ZynqMP CAN controllers
       tests/qtest: Introduce tests for Xilinx ZynqMP CAN controller
       MAINTAINERS: Add maintainer entry for Xilinx ZynqMP CAN controller
- hw/display/Makefile.objs        |    1 +
+ meson.build                      |    1 +
- include/exec/helper-head.h      |    3 +
+ hw/arm/smmuv3-internal.h         |    2 +-
- include/fpu/softfloat.h         |   18 +-
+ hw/net/can/trace.h               |    1 +
- include/hw/i2c/i2c.h            |   23 +-
+ include/hw/arm/xlnx-zynqmp.h     |    8 +
- include/hw/register.h           |    6 +-
+ include/hw/intc/armv7m_nvic.h    |    2 +
- target/arm/cpu.h                |   34 +-
+ include/hw/net/xlnx-zynqmp-can.h |   78 +++
- target/arm/helper-a64.h         |   33 +
+ target/arm/cpu.h                 |   46 ++
- target/arm/helper.h             |   14 +-
+ target/arm/m-nocp.decode         |   10 +-
- hw/arm/vexpress.c               |    6 +-
+ target/arm/t32.decode            |   10 +-
- hw/core/register.c              |    8 +
+ target/arm/vfp.decode            |   14 +
- hw/display/sii9022.c            |  191 ++++++
+ hw/arm/armv7m.c                  |    4 +-
- hw/i2c/core.c                   |   18 -
+ hw/arm/sbsa-ref.c                |   23 +-
- hw/i2c/i2c-ddc.c                |    4 +-
+ hw/arm/xlnx-zcu102.c             |   20 +
- hw/ssi/xilinx_spips.c           |   43 +-
+ hw/arm/xlnx-zynqmp.c             |   34 ++
- linux-user/elfload.c            |    2 +
+ hw/intc/armv7m_nvic.c            |  246 ++++++--
- target/arm/cpu64.c              |    1 +
+ hw/misc/imx25_ccm.c              |   12 +-
- target/arm/helper-a64.c         |  269 +++++++++
+ hw/misc/imx31_ccm.c              |   14 +-
- target/arm/helper.c             |  481 ++++++++-------
+ hw/misc/imx6_ccm.c               |   20 +-
- target/arm/translate-a64.c      | 1266 +++++++++++++++++++++++++++++++++------
+ hw/misc/imx6_src.c               |    2 +-
- target/arm/translate.c          |   12 +-
+ hw/misc/imx6ul_ccm.c             |    4 +-
- MAINTAINERS                     |   12 +-
+ hw/misc/imx_ccm.c                |    4 +-
- default-configs/arm-softmmu.mak |    2 +
+ hw/net/can/xlnx-zynqmp-can.c     | 1161 ++++++++++++++++++++++++++++++++++++++
- hw/display/trace-events         |    5 +
+ target/arm/cpu.c                 |    5 +-
-files changed, 1981 insertions(+), 471 deletions(-)
+ target/arm/helper.c              |    7 +-
- create mode 100644 hw/display/sii9022.c
+ target/arm/m_helper.c            |  130 ++++-
  target/arm/translate.c           |  105 +++-
  tests/qtest/npcm7xx_rng-test.c   |   12 +
  tests/qtest/xlnx-can-test.c      |  360 ++++++++++++
  MAINTAINERS                      |    8 +
  hw/Kconfig                       |    1 +
  hw/net/can/meson.build           |    1 +
  hw/net/can/trace-events          |    9 +
  target/arm/translate-vfp.c.inc   |  511 ++++++++++++++++-
  tests/qtest/meson.build          |    1 +
 files changed, 2713 insertions(+), 153 deletions(-)
  create mode 100644 hw/net/can/trace.h
  create mode 100644 include/hw/net/xlnx-zynqmp-can.h
  create mode 100644 hw/net/can/xlnx-zynqmp-can.c
  create mode 100644 tests/qtest/xlnx-can-test.c
  create mode 100644 hw/net/can/trace-events

-[Qemu-devel] [PULL 01/42] hw: register: Run post_write hook on reset
+Deleted patch
-From: Alistair Francis <alistair.francis@xilinx.com>
-Ensure that the post write hook is called during reset. This allows us
-to rely on the post write functions instead of having to call them from
-the reset() function.
-Signed-off-by: Alistair Francis <alistair.francis@xilinx.com>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Message-id: d131e24b911653a945e46ca2d8f90f572469e1dd.1517856214.git.alistair.francis@xilinx.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- include/hw/register.h | 6 +++---
- hw/core/register.c    | 8 ++++++++
-files changed, 11 insertions(+), 3 deletions(-)
-diff --git a/include/hw/register.h b/include/hw/register.h
-index XXXXXXX..XXXXXXX 100644
---- a/include/hw/register.h
-+++ b/include/hw/register.h
-@@ -XXX,XX +XXX,XX @@ typedef struct RegisterInfoArray RegisterInfoArray;
-  * immediately before the actual write. The returned value is what is written,
-  * giving the handler a chance to modify the written value.
-  * @post_write: Post write callback. Passed the written value. Most write side
-- * effects should be implemented here.
-+ * effects should be implemented here. This is called during device reset.
-  *
-  * @post_read: Post read callback. Passes the value that is about to be returned
-  * for a read. The return value from this function is what is ultimately read,
-@@ -XXX,XX +XXX,XX @@ uint64_t register_read(RegisterInfo *reg, uint64_t re, const char* prefix,
-                        bool debug);
- /**
-- * reset a register
-- * @reg: register to reset
-+ * Resets a register. This will also call the post_write hook if it exists.
-+ * @reg: The register to reset.
-  */
- void register_reset(RegisterInfo *reg);
-diff --git a/hw/core/register.c b/hw/core/register.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/core/register.c
-+++ b/hw/core/register.c
-@@ -XXX,XX +XXX,XX @@ uint64_t register_read(RegisterInfo *reg, uint64_t re, const char* prefix,
- void register_reset(RegisterInfo *reg)
- {
-+    const RegisterAccessInfo *ac;
-+
-     g_assert(reg);
-     if (!reg->data || !reg->access) {
-         return;
-     }
-+    ac = reg->access;
-+
-     register_write_val(reg, reg->access->reset);
-+
-+    if (ac->post_write) {
-+        ac->post_write(reg, reg->access->reset);
-+    }
- }
- void register_init(RegisterInfo *reg)
---
-.16.2

-[Qemu-devel] [PULL 02/42] xilinx_spips: Enable only two slaves when reading/writing with stripe
+[PULL 01/36] hw/arm/smmuv3: Fix up L1STD_SPAN decoding
-From: Francisco Iglesias <frasse.iglesias@gmail.com>
+From: Kunkun Jiang <jiangkunkun@huawei.com>
-Assert only the lower cs on bus 0 and upper cs on bus 1 when both buses and
+Accroding to the SMMUv3 spec, the SPAN field of Level1 Stream Table
-chip selects are enabled (e.g reading/writing with stripe).
+Descriptor is 5 bits([4:0]).
-Signed-off-by: Francisco Iglesias <frasse.iglesias@gmail.com>
+Fixes: 9bde7f0674f(hw/arm/smmuv3: Implement translate callback)
-Reviewed-by: Alistair Francis <alistair.francis@xilinx.com>
+Signed-off-by: Kunkun Jiang <jiangkunkun@huawei.com>
-Tested-by: Alistair Francis <alistair.francis@xilinx.com>
+Message-id: 20201124023711.1184-1-jiangkunkun@huawei.com
-Message-id: 20180223232233.31482-2-frasse.iglesias@gmail.com
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Acked-by: Eric Auger <eric.auger@redhat.com>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/ssi/xilinx_spips.c | 41 +++++++++++++++++++++++++++++++++++++----
+ hw/arm/smmuv3-internal.h | 2 +-
-file changed, 37 insertions(+), 4 deletions(-)
+file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/hw/ssi/xilinx_spips.c b/hw/ssi/xilinx_spips.c
+diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
 index XXXXXXX..XXXXXXX 100644
---- a/hw/ssi/xilinx_spips.c
+--- a/hw/arm/smmuv3-internal.h
-+++ b/hw/ssi/xilinx_spips.c
++++ b/hw/arm/smmuv3-internal.h
-@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs(XilinxSPIPS *s, int field)
+@@ -XXX,XX +XXX,XX @@ static inline uint64_t l1std_l2ptr(STEDesc *desc)
- {
+     return hi << 32 | lo;
      int i;
 -    for (i = 0; i < s->num_cs; i++) {
 +    for (i = 0; i < s->num_cs * s->num_busses; i++) {
          bool old_state = s->cs_lines_state[i];
          bool new_state = field & (1 << i);
@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs(XilinxSPIPS *s, int field)
          }
          qemu_set_irq(s->cs_lines[i], !new_state);
      }
 -    if (!(field & ((1 << s->num_cs) - 1))) {
 +    if (!(field & ((1 << (s->num_cs * s->num_busses)) - 1))) {
          s->snoop_state = SNOOP_CHECKING;
          s->cmd_dummies = 0;
          s->link_state = 1;
@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_qspips_update_cs_lines(XlnxZynqMPQSPIPS *s)
  {
      if (s->regs[R_GQSPI_GF_SNAPSHOT]) {
          int field = ARRAY_FIELD_EX32(s->regs, GQSPI_GF_SNAPSHOT, CHIP_SELECT);
 -        xilinx_spips_update_cs(XILINX_SPIPS(s), field);
 +        bool upper_cs_sel = field & (1 << 1);
 +        bool lower_cs_sel = field & 1;
 +        bool bus0_enabled;
 +        bool bus1_enabled;
 +        uint8_t buses;
 +        int cs = 0;
 +
 +        buses = ARRAY_FIELD_EX32(s->regs, GQSPI_GF_SNAPSHOT, DATA_BUS_SELECT);
 +        bus0_enabled = buses & 1;
 +        bus1_enabled = buses & (1 << 1);
 +
 +        if (bus0_enabled && bus1_enabled) {
 +            if (lower_cs_sel) {
 +                cs |= 1;
 +            }
 +            if (upper_cs_sel) {
 +                cs |= 1 << 3;
 +            }
 +        } else if (bus0_enabled) {
 +            if (lower_cs_sel) {
 +                cs |= 1;
 +            }
 +            if (upper_cs_sel) {
 +                cs |= 1 << 1;
 +            }
 +        } else if (bus1_enabled) {
 +            if (lower_cs_sel) {
 +                cs |= 1 << 2;
 +            }
 +            if (upper_cs_sel) {
 +                cs |= 1 << 3;
 +            }
 +        }
 +        xilinx_spips_update_cs(XILINX_SPIPS(s), cs);
      }
  }
-@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs_lines(XilinxSPIPS *s)
+-#define L1STD_SPAN(stm) (extract32((stm)->word[0], 0, 4))
-     if (num_effective_busses(s) == 2) {
++#define L1STD_SPAN(stm) (extract32((stm)->word[0], 0, 5))
-         /* Single bit chip-select for qspi */
-         field &= 0x1;
+ #endif
 -        field |= field << 1;
 +        field |= field << 3;
      /* Dual stack U-Page */
      } else if (s->regs[R_LQSPI_CFG] & LQSPI_CFG_TWO_MEM &&
                 s->regs[R_LQSPI_STS] & LQSPI_CFG_U_PAGE) {
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 07/42] hw/sii9022: Add support for Silicon Image SII9022
+[PULL 02/36] hw/net/can: Introduce Xilinx ZynqMP CAN controller
-From: Linus Walleij <linus.walleij@linaro.org>
+From: Vikram Garhwal <fnu.vikram@xilinx.com>
-This adds support for emulating the Silicon Image SII9022 DVI/HDMI
+The Xilinx ZynqMP CAN controller is developed based on SocketCAN, QEMU CAN bus
-bridge. It's not very clever right now, it just acknowledges
+implementation. Bus connection and socketCAN connection for each CAN module
-the switch into DDC I2C mode and back. Combining this with the
+can be set through command lines.
 existing DDC I2C emulation gives the right behavior on the Versatile
 Express emulation passing through the QEMU EDID to the emulated
 platform.
-Cc: Peter Maydell <peter.maydell@linaro.org>
+Example for using single CAN:
-Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
+    -object can-bus,id=canbus0 \
-Message-id: 20180227104903.21353-5-linus.walleij@linaro.org
+    -machine xlnx-zcu102.canbus0=canbus0 \
     -object can-host-socketcan,id=socketcan0,if=vcan0,canbus=canbus0
 Example for connecting both CAN to same virtual CAN on host machine:
     -object can-bus,id=canbus0 -object can-bus,id=canbus1 \
     -machine xlnx-zcu102.canbus0=canbus0 \
     -machine xlnx-zcu102.canbus1=canbus1 \
     -object can-host-socketcan,id=socketcan0,if=vcan0,canbus=canbus0 \
     -object can-host-socketcan,id=socketcan1,if=vcan0,canbus=canbus1
 To create virtual CAN on the host machine, please check the QEMU CAN docs:
 https://github.com/qemu/qemu/blob/master/docs/can.txt
 Signed-off-by: Vikram Garhwal <fnu.vikram@xilinx.com>
 Message-id: 1605728926-352690-2-git-send-email-fnu.vikram@xilinx.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-[PMM: explictly reset ddc_req/ddc_skip_finish/ddc]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/display/Makefile.objs |   1 +
+ meson.build                      |    1 +
- hw/display/sii9022.c     | 191 +++++++++++++++++++++++++++++++++++++++++++++++
+ hw/net/can/trace.h               |    1 +
- hw/display/trace-events  |   5 ++
+ include/hw/net/xlnx-zynqmp-can.h |   78 ++
-files changed, 197 insertions(+)
+ hw/net/can/xlnx-zynqmp-can.c     | 1161 ++++++++++++++++++++++++++++++
- create mode 100644 hw/display/sii9022.c
+ hw/Kconfig                       |    1 +
  hw/net/can/meson.build           |    1 +
  hw/net/can/trace-events          |    9 +
 files changed, 1252 insertions(+)
  create mode 100644 hw/net/can/trace.h
  create mode 100644 include/hw/net/xlnx-zynqmp-can.h
  create mode 100644 hw/net/can/xlnx-zynqmp-can.c
  create mode 100644 hw/net/can/trace-events
-diff --git a/hw/display/Makefile.objs b/hw/display/Makefile.objs
+diff --git a/meson.build b/meson.build
 index XXXXXXX..XXXXXXX 100644
---- a/hw/display/Makefile.objs
+--- a/meson.build
-+++ b/hw/display/Makefile.objs
++++ b/meson.build
-@@ -XXX,XX +XXX,XX @@ common-obj-$(CONFIG_VGA_CIRRUS) += cirrus_vga.o
+@@ -XXX,XX +XXX,XX @@ if have_system
- common-obj-$(CONFIG_G364FB) += g364fb.o
+     'hw/misc',
- common-obj-$(CONFIG_JAZZ_LED) += jazz_led.o
+     'hw/misc/macio',
- common-obj-$(CONFIG_PL110) += pl110.o
+     'hw/net',
-+common-obj-$(CONFIG_SII9022) += sii9022.o
++    'hw/net/can',
- common-obj-$(CONFIG_SSD0303) += ssd0303.o
+     'hw/nvram',
- common-obj-$(CONFIG_SSD0323) += ssd0323.o
+     'hw/pci',
- common-obj-$(CONFIG_XEN) += xenfb.o
+     'hw/pci-host',
-diff --git a/hw/display/sii9022.c b/hw/display/sii9022.c
+diff --git a/hw/net/can/trace.h b/hw/net/can/trace.h
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
-+++ b/hw/display/sii9022.c
++++ b/hw/net/can/trace.h
@@ -0,0 +1 @@
 +#include "trace/trace-hw_net_can.h"
 diff --git a/include/hw/net/xlnx-zynqmp-can.h b/include/hw/net/xlnx-zynqmp-can.h
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/include/hw/net/xlnx-zynqmp-can.h
 @@ -XXX,XX +XXX,XX @@
 +/*
-+ * Silicon Image SiI9022
++ * QEMU model of the Xilinx ZynqMP CAN controller.
 + *
-+ * This is a pretty hollow emulation: all we do is acknowledge that we
++ * Copyright (c) 2020 Xilinx Inc.
 + * exist (chip ID) and confirm that we get switched over into DDC mode
 + * so the emulated host can proceed to read out EDID data. All subsequent
 + * set-up of connectors etc will be acknowledged and ignored.
 + *
-+ * Copyright (C) 2018 Linus Walleij
++ * Written-by: Vikram Garhwal<fnu.vikram@xilinx.com>
 + *
-+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
++ * Based on QEMU CAN Device emulation implemented by Jin Yang, Deniz Eren and
-+ * See the COPYING file in the top-level directory.
++ * Pavel Pisa.
-+ * SPDX-License-Identifier: GPL-2.0-or-later
++ *
 + * Permission is hereby granted, free of charge, to any person obtaining a copy
 + * of this software and associated documentation files (the "Software"), to deal
 + * in the Software without restriction, including without limitation the rights
 + * to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 + * copies of the Software, and to permit persons to whom the Software is
 + * furnished to do so, subject to the following conditions:
 + *
 + * The above copyright notice and this permission notice shall be included in
 + * all copies or substantial portions of the Software.
 + *
 + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
 + * THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 + * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 + * OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
 + * THE SOFTWARE.
 + */
 +
++#ifndef XLNX_ZYNQMP_CAN_H
++#define XLNX_ZYNQMP_CAN_H
++
++#include "hw/register.h"
++#include "net/can_emu.h"
++#include "net/can_host.h"
++#include "qemu/fifo32.h"
++#include "hw/ptimer.h"
++#include "hw/qdev-clock.h"
++
++#define TYPE_XLNX_ZYNQMP_CAN "xlnx.zynqmp-can"
++
++#define XLNX_ZYNQMP_CAN(obj) \
++     OBJECT_CHECK(XlnxZynqMPCANState, (obj), TYPE_XLNX_ZYNQMP_CAN)
++
++#define MAX_CAN_CTRLS      2
++#define XLNX_ZYNQMP_CAN_R_MAX     (0x84 / 4)
++#define MAILBOX_CAPACITY   64
++#define CAN_TIMER_MAX  0XFFFFUL
++#define CAN_DEFAULT_CLOCK (24 * 1000 * 1000)
++
++/* Each CAN_FRAME will have 4 * 32bit size. */
++#define CAN_FRAME_SIZE     4
++#define RXFIFO_SIZE        (MAILBOX_CAPACITY * CAN_FRAME_SIZE)
++
++typedef struct XlnxZynqMPCANState {
++    SysBusDevice        parent_obj;
++    MemoryRegion        iomem;
++
++    qemu_irq            irq;
++
++    CanBusClientState   bus_client;
++    CanBusState         *canbus;
++
++    struct {
++        uint32_t        ext_clk_freq;
++    } cfg;
++
++    RegisterInfo        reg_info[XLNX_ZYNQMP_CAN_R_MAX];
++    uint32_t            regs[XLNX_ZYNQMP_CAN_R_MAX];
++
++    Fifo32              rx_fifo;
++    Fifo32              tx_fifo;
++    Fifo32              txhpb_fifo;
++
++    ptimer_state        *can_timer;
++} XlnxZynqMPCANState;
++
++#endif
+diff --git a/hw/net/can/xlnx-zynqmp-can.c b/hw/net/can/xlnx-zynqmp-can.c
+new file mode 100644
+index XXXXXXX..XXXXXXX
+--- /dev/null
++++ b/hw/net/can/xlnx-zynqmp-can.c
+@@ -XXX,XX +XXX,XX @@
++/*
++ * QEMU model of the Xilinx ZynqMP CAN controller.
++ * This implementation is based on the following datasheet:
++ * https://www.xilinx.com/support/documentation/user_guides/ug1085-zynq-ultrascale-trm.pdf
++ *
++ * Copyright (c) 2020 Xilinx Inc.
++ *
++ * Written-by: Vikram Garhwal<fnu.vikram@xilinx.com>
++ *
++ * Based on QEMU CAN Device emulation implemented by Jin Yang, Deniz Eren and
++ * Pavel Pisa
++ *
++ * Permission is hereby granted, free of charge, to any person obtaining a copy
++ * of this software and associated documentation files (the "Software"), to deal
++ * in the Software without restriction, including without limitation the rights
++ * to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
++ * copies of the Software, and to permit persons to whom the Software is
++ * furnished to do so, subject to the following conditions:
++ *
++ * The above copyright notice and this permission notice shall be included in
++ * all copies or substantial portions of the Software.
++ *
++ * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
++ * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
++ * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
++ * THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
++ * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
++ * OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
++ * THE SOFTWARE.
++ */
++
 +#include "qemu/osdep.h"
-+#include "qemu-common.h"
++#include "hw/sysbus.h"
-+#include "hw/i2c/i2c.h"
++#include "hw/register.h"
-+#include "hw/i2c/i2c-ddc.h"
++#include "hw/irq.h"
 +#include "qapi/error.h"
 +#include "qemu/bitops.h"
 +#include "qemu/log.h"
 +#include "qemu/cutils.h"
 +#include "sysemu/sysemu.h"
 +#include "migration/vmstate.h"
 +#include "hw/qdev-properties.h"
 +#include "net/can_emu.h"
 +#include "net/can_host.h"
 +#include "qemu/event_notifier.h"
 +#include "qom/object_interfaces.h"
 +#include "hw/net/xlnx-zynqmp-can.h"
 +#include "trace.h"
 +
-+#define SII9022_SYS_CTRL_DATA 0x1a
++#ifndef XLNX_ZYNQMP_CAN_ERR_DEBUG
-+#define SII9022_SYS_CTRL_PWR_DWN 0x10
++#define XLNX_ZYNQMP_CAN_ERR_DEBUG 0
-+#define SII9022_SYS_CTRL_AV_MUTE 0x08
++#endif
-+#define SII9022_SYS_CTRL_DDC_BUS_REQ 0x04
++
-+#define SII9022_SYS_CTRL_DDC_BUS_GRTD 0x02
++#define MAX_DLC            8
-+#define SII9022_SYS_CTRL_OUTPUT_MODE 0x01
++#undef ERROR
-+#define SII9022_SYS_CTRL_OUTPUT_HDMI 1
++
-+#define SII9022_SYS_CTRL_OUTPUT_DVI 0
++REG32(SOFTWARE_RESET_REGISTER, 0x0)
-+#define SII9022_REG_CHIPID 0x1b
++    FIELD(SOFTWARE_RESET_REGISTER, CEN, 1, 1)
-+#define SII9022_INT_ENABLE 0x3c
++    FIELD(SOFTWARE_RESET_REGISTER, SRST, 0, 1)
-+#define SII9022_INT_STATUS 0x3d
++REG32(MODE_SELECT_REGISTER, 0x4)
-+#define SII9022_INT_STATUS_HOTPLUG 0x01;
++    FIELD(MODE_SELECT_REGISTER, SNOOP, 2, 1)
-+#define SII9022_INT_STATUS_PLUGGED 0x04;
++    FIELD(MODE_SELECT_REGISTER, LBACK, 1, 1)
-+
++    FIELD(MODE_SELECT_REGISTER, SLEEP, 0, 1)
-+#define TYPE_SII9022 "sii9022"
++REG32(ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER, 0x8)
-+#define SII9022(obj) OBJECT_CHECK(sii9022_state, (obj), TYPE_SII9022)
++    FIELD(ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER, BRP, 0, 8)
-+
++REG32(ARBITRATION_PHASE_BIT_TIMING_REGISTER, 0xc)
-+typedef struct sii9022_state {
++    FIELD(ARBITRATION_PHASE_BIT_TIMING_REGISTER, SJW, 7, 2)
-+    I2CSlave parent_obj;
++    FIELD(ARBITRATION_PHASE_BIT_TIMING_REGISTER, TS2, 4, 3)
-+    uint8_t ptr;
++    FIELD(ARBITRATION_PHASE_BIT_TIMING_REGISTER, TS1, 0, 4)
-+    bool addr_byte;
++REG32(ERROR_COUNTER_REGISTER, 0x10)
-+    bool ddc_req;
++    FIELD(ERROR_COUNTER_REGISTER, REC, 8, 8)
-+    bool ddc_skip_finish;
++    FIELD(ERROR_COUNTER_REGISTER, TEC, 0, 8)
-+    bool ddc;
++REG32(ERROR_STATUS_REGISTER, 0x14)
-+} sii9022_state;
++    FIELD(ERROR_STATUS_REGISTER, ACKER, 4, 1)
-+
++    FIELD(ERROR_STATUS_REGISTER, BERR, 3, 1)
-+static const VMStateDescription vmstate_sii9022 = {
++    FIELD(ERROR_STATUS_REGISTER, STER, 2, 1)
-+    .name = "sii9022",
++    FIELD(ERROR_STATUS_REGISTER, FMER, 1, 1)
 +    FIELD(ERROR_STATUS_REGISTER, CRCER, 0, 1)
 +REG32(STATUS_REGISTER, 0x18)
 +    FIELD(STATUS_REGISTER, SNOOP, 12, 1)
 +    FIELD(STATUS_REGISTER, ACFBSY, 11, 1)
 +    FIELD(STATUS_REGISTER, TXFLL, 10, 1)
 +    FIELD(STATUS_REGISTER, TXBFLL, 9, 1)
 +    FIELD(STATUS_REGISTER, ESTAT, 7, 2)
 +    FIELD(STATUS_REGISTER, ERRWRN, 6, 1)
 +    FIELD(STATUS_REGISTER, BBSY, 5, 1)
 +    FIELD(STATUS_REGISTER, BIDLE, 4, 1)
 +    FIELD(STATUS_REGISTER, NORMAL, 3, 1)
 +    FIELD(STATUS_REGISTER, SLEEP, 2, 1)
 +    FIELD(STATUS_REGISTER, LBACK, 1, 1)
 +    FIELD(STATUS_REGISTER, CONFIG, 0, 1)
 +REG32(INTERRUPT_STATUS_REGISTER, 0x1c)
 +    FIELD(INTERRUPT_STATUS_REGISTER, TXFEMP, 14, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, TXFWMEMP, 13, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, RXFWMFLL, 12, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, WKUP, 11, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, SLP, 10, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, BSOFF, 9, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, ERROR, 8, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, RXNEMP, 7, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, RXOFLW, 6, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, RXUFLW, 5, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, RXOK, 4, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, TXBFLL, 3, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, TXFLL, 2, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, TXOK, 1, 1)
 +    FIELD(INTERRUPT_STATUS_REGISTER, ARBLST, 0, 1)
 +REG32(INTERRUPT_ENABLE_REGISTER, 0x20)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ETXFEMP, 14, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ETXFWMEMP, 13, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ERXFWMFLL, 12, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, EWKUP, 11, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ESLP, 10, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, EBSOFF, 9, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, EERROR, 8, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ERXNEMP, 7, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ERXOFLW, 6, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ERXUFLW, 5, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ERXOK, 4, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ETXBFLL, 3, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ETXFLL, 2, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, ETXOK, 1, 1)
 +    FIELD(INTERRUPT_ENABLE_REGISTER, EARBLST, 0, 1)
 +REG32(INTERRUPT_CLEAR_REGISTER, 0x24)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CTXFEMP, 14, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CTXFWMEMP, 13, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CRXFWMFLL, 12, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CWKUP, 11, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CSLP, 10, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CBSOFF, 9, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CERROR, 8, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CRXNEMP, 7, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CRXOFLW, 6, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CRXUFLW, 5, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CRXOK, 4, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CTXBFLL, 3, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CTXFLL, 2, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CTXOK, 1, 1)
 +    FIELD(INTERRUPT_CLEAR_REGISTER, CARBLST, 0, 1)
 +REG32(TIMESTAMP_REGISTER, 0x28)
 +    FIELD(TIMESTAMP_REGISTER, CTS, 0, 1)
 +REG32(WIR, 0x2c)
 +    FIELD(WIR, EW, 8, 8)
 +    FIELD(WIR, FW, 0, 8)
 +REG32(TXFIFO_ID, 0x30)
 +    FIELD(TXFIFO_ID, IDH, 21, 11)
 +    FIELD(TXFIFO_ID, SRRRTR, 20, 1)
 +    FIELD(TXFIFO_ID, IDE, 19, 1)
 +    FIELD(TXFIFO_ID, IDL, 1, 18)
 +    FIELD(TXFIFO_ID, RTR, 0, 1)
 +REG32(TXFIFO_DLC, 0x34)
 +    FIELD(TXFIFO_DLC, DLC, 28, 4)
 +REG32(TXFIFO_DATA1, 0x38)
 +    FIELD(TXFIFO_DATA1, DB0, 24, 8)
 +    FIELD(TXFIFO_DATA1, DB1, 16, 8)
 +    FIELD(TXFIFO_DATA1, DB2, 8, 8)
 +    FIELD(TXFIFO_DATA1, DB3, 0, 8)
 +REG32(TXFIFO_DATA2, 0x3c)
 +    FIELD(TXFIFO_DATA2, DB4, 24, 8)
 +    FIELD(TXFIFO_DATA2, DB5, 16, 8)
 +    FIELD(TXFIFO_DATA2, DB6, 8, 8)
 +    FIELD(TXFIFO_DATA2, DB7, 0, 8)
 +REG32(TXHPB_ID, 0x40)
 +    FIELD(TXHPB_ID, IDH, 21, 11)
 +    FIELD(TXHPB_ID, SRRRTR, 20, 1)
 +    FIELD(TXHPB_ID, IDE, 19, 1)
 +    FIELD(TXHPB_ID, IDL, 1, 18)
 +    FIELD(TXHPB_ID, RTR, 0, 1)
 +REG32(TXHPB_DLC, 0x44)
 +    FIELD(TXHPB_DLC, DLC, 28, 4)
 +REG32(TXHPB_DATA1, 0x48)
 +    FIELD(TXHPB_DATA1, DB0, 24, 8)
 +    FIELD(TXHPB_DATA1, DB1, 16, 8)
 +    FIELD(TXHPB_DATA1, DB2, 8, 8)
 +    FIELD(TXHPB_DATA1, DB3, 0, 8)
 +REG32(TXHPB_DATA2, 0x4c)
 +    FIELD(TXHPB_DATA2, DB4, 24, 8)
 +    FIELD(TXHPB_DATA2, DB5, 16, 8)
 +    FIELD(TXHPB_DATA2, DB6, 8, 8)
 +    FIELD(TXHPB_DATA2, DB7, 0, 8)
 +REG32(RXFIFO_ID, 0x50)
 +    FIELD(RXFIFO_ID, IDH, 21, 11)
 +    FIELD(RXFIFO_ID, SRRRTR, 20, 1)
 +    FIELD(RXFIFO_ID, IDE, 19, 1)
 +    FIELD(RXFIFO_ID, IDL, 1, 18)
 +    FIELD(RXFIFO_ID, RTR, 0, 1)
 +REG32(RXFIFO_DLC, 0x54)
 +    FIELD(RXFIFO_DLC, DLC, 28, 4)
 +    FIELD(RXFIFO_DLC, RXT, 0, 16)
 +REG32(RXFIFO_DATA1, 0x58)
 +    FIELD(RXFIFO_DATA1, DB0, 24, 8)
 +    FIELD(RXFIFO_DATA1, DB1, 16, 8)
 +    FIELD(RXFIFO_DATA1, DB2, 8, 8)
 +    FIELD(RXFIFO_DATA1, DB3, 0, 8)
 +REG32(RXFIFO_DATA2, 0x5c)
 +    FIELD(RXFIFO_DATA2, DB4, 24, 8)
 +    FIELD(RXFIFO_DATA2, DB5, 16, 8)
 +    FIELD(RXFIFO_DATA2, DB6, 8, 8)
 +    FIELD(RXFIFO_DATA2, DB7, 0, 8)
 +REG32(AFR, 0x60)
 +    FIELD(AFR, UAF4, 3, 1)
 +    FIELD(AFR, UAF3, 2, 1)
 +    FIELD(AFR, UAF2, 1, 1)
 +    FIELD(AFR, UAF1, 0, 1)
 +REG32(AFMR1, 0x64)
 +    FIELD(AFMR1, AMIDH, 21, 11)
 +    FIELD(AFMR1, AMSRR, 20, 1)
 +    FIELD(AFMR1, AMIDE, 19, 1)
 +    FIELD(AFMR1, AMIDL, 1, 18)
 +    FIELD(AFMR1, AMRTR, 0, 1)
 +REG32(AFIR1, 0x68)
 +    FIELD(AFIR1, AIIDH, 21, 11)
 +    FIELD(AFIR1, AISRR, 20, 1)
 +    FIELD(AFIR1, AIIDE, 19, 1)
 +    FIELD(AFIR1, AIIDL, 1, 18)
 +    FIELD(AFIR1, AIRTR, 0, 1)
 +REG32(AFMR2, 0x6c)
 +    FIELD(AFMR2, AMIDH, 21, 11)
 +    FIELD(AFMR2, AMSRR, 20, 1)
 +    FIELD(AFMR2, AMIDE, 19, 1)
 +    FIELD(AFMR2, AMIDL, 1, 18)
 +    FIELD(AFMR2, AMRTR, 0, 1)
 +REG32(AFIR2, 0x70)
 +    FIELD(AFIR2, AIIDH, 21, 11)
 +    FIELD(AFIR2, AISRR, 20, 1)
 +    FIELD(AFIR2, AIIDE, 19, 1)
 +    FIELD(AFIR2, AIIDL, 1, 18)
 +    FIELD(AFIR2, AIRTR, 0, 1)
 +REG32(AFMR3, 0x74)
 +    FIELD(AFMR3, AMIDH, 21, 11)
 +    FIELD(AFMR3, AMSRR, 20, 1)
 +    FIELD(AFMR3, AMIDE, 19, 1)
 +    FIELD(AFMR3, AMIDL, 1, 18)
 +    FIELD(AFMR3, AMRTR, 0, 1)
 +REG32(AFIR3, 0x78)
 +    FIELD(AFIR3, AIIDH, 21, 11)
 +    FIELD(AFIR3, AISRR, 20, 1)
 +    FIELD(AFIR3, AIIDE, 19, 1)
 +    FIELD(AFIR3, AIIDL, 1, 18)
 +    FIELD(AFIR3, AIRTR, 0, 1)
 +REG32(AFMR4, 0x7c)
 +    FIELD(AFMR4, AMIDH, 21, 11)
 +    FIELD(AFMR4, AMSRR, 20, 1)
 +    FIELD(AFMR4, AMIDE, 19, 1)
 +    FIELD(AFMR4, AMIDL, 1, 18)
 +    FIELD(AFMR4, AMRTR, 0, 1)
 +REG32(AFIR4, 0x80)
 +    FIELD(AFIR4, AIIDH, 21, 11)
 +    FIELD(AFIR4, AISRR, 20, 1)
 +    FIELD(AFIR4, AIIDE, 19, 1)
 +    FIELD(AFIR4, AIIDL, 1, 18)
 +    FIELD(AFIR4, AIRTR, 0, 1)
 +
 +static void can_update_irq(XlnxZynqMPCANState *s)
 +{
 +    uint32_t irq;
 +
 +    /* Watermark register interrupts. */
 +    if ((fifo32_num_free(&s->tx_fifo) / CAN_FRAME_SIZE) >
 +            ARRAY_FIELD_EX32(s->regs, WIR, EW)) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXFWMEMP, 1);
 +    }
 +
 +    if ((fifo32_num_used(&s->rx_fifo) / CAN_FRAME_SIZE) >
 +            ARRAY_FIELD_EX32(s->regs, WIR, FW)) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXFWMFLL, 1);
 +    }
 +
 +    /* RX Interrupts. */
 +    if (fifo32_num_used(&s->rx_fifo) >= CAN_FRAME_SIZE) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXNEMP, 1);
 +    }
 +
 +    /* TX interrupts. */
 +    if (fifo32_is_empty(&s->tx_fifo)) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXFEMP, 1);
 +    }
 +
 +    if (fifo32_is_full(&s->tx_fifo)) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXFLL, 1);
 +    }
 +
 +    if (fifo32_is_full(&s->txhpb_fifo)) {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXBFLL, 1);
 +    }
 +
 +    irq = s->regs[R_INTERRUPT_STATUS_REGISTER];
 +    irq &= s->regs[R_INTERRUPT_ENABLE_REGISTER];
 +
 +    trace_xlnx_can_update_irq(s->regs[R_INTERRUPT_STATUS_REGISTER],
 +                              s->regs[R_INTERRUPT_ENABLE_REGISTER], irq);
 +    qemu_set_irq(s->irq, irq);
 +}
 +
 +static void can_ier_post_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    can_update_irq(s);
 +}
 +
 +static uint64_t can_icr_pre_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    s->regs[R_INTERRUPT_STATUS_REGISTER] &= ~val;
 +    can_update_irq(s);
 +
 +    return 0;
 +}
 +
 +static void can_config_reset(XlnxZynqMPCANState *s)
 +{
 +    /* Reset all the configuration registers. */
 +    register_reset(&s->reg_info[R_SOFTWARE_RESET_REGISTER]);
 +    register_reset(&s->reg_info[R_MODE_SELECT_REGISTER]);
 +    register_reset(
 +              &s->reg_info[R_ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER]);
 +    register_reset(&s->reg_info[R_ARBITRATION_PHASE_BIT_TIMING_REGISTER]);
 +    register_reset(&s->reg_info[R_STATUS_REGISTER]);
 +    register_reset(&s->reg_info[R_INTERRUPT_STATUS_REGISTER]);
 +    register_reset(&s->reg_info[R_INTERRUPT_ENABLE_REGISTER]);
 +    register_reset(&s->reg_info[R_INTERRUPT_CLEAR_REGISTER]);
 +    register_reset(&s->reg_info[R_WIR]);
 +}
 +
 +static void can_config_mode(XlnxZynqMPCANState *s)
 +{
 +    register_reset(&s->reg_info[R_ERROR_COUNTER_REGISTER]);
 +    register_reset(&s->reg_info[R_ERROR_STATUS_REGISTER]);
 +
 +    /* Put XlnxZynqMPCAN in configuration mode. */
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, CONFIG, 1);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, WKUP, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, SLP, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, BSOFF, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, ERROR, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOFLW, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOK, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXOK, 0);
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, ARBLST, 0);
 +
 +    can_update_irq(s);
 +}
 +
 +static void update_status_register_mode_bits(XlnxZynqMPCANState *s)
 +{
 +    bool sleep_status = ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SLEEP);
 +    bool sleep_mode = ARRAY_FIELD_EX32(s->regs, MODE_SELECT_REGISTER, SLEEP);
 +    /* Wake up interrupt bit. */
 +    bool wakeup_irq_val = sleep_status && (sleep_mode == 0);
 +    /* Sleep interrupt bit. */
 +    bool sleep_irq_val = sleep_mode && (sleep_status == 0);
 +
 +    /* Clear previous core mode status bits. */
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, LBACK, 0);
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, SLEEP, 0);
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, SNOOP, 0);
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, NORMAL, 0);
 +
 +    /* set current mode bit and generate irqs accordingly. */
 +    if (ARRAY_FIELD_EX32(s->regs, MODE_SELECT_REGISTER, LBACK)) {
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, LBACK, 1);
 +    } else if (ARRAY_FIELD_EX32(s->regs, MODE_SELECT_REGISTER, SLEEP)) {
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, SLEEP, 1);
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, SLP,
 +                         sleep_irq_val);
 +    } else if (ARRAY_FIELD_EX32(s->regs, MODE_SELECT_REGISTER, SNOOP)) {
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, SNOOP, 1);
 +    } else {
 +        /*
 +         * If all bits are zero then XlnxZynqMPCAN is set in normal mode.
 +         */
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, NORMAL, 1);
 +        /* Set wakeup interrupt bit. */
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, WKUP,
 +                         wakeup_irq_val);
 +    }
 +
 +    can_update_irq(s);
 +}
 +
 +static void can_exit_sleep_mode(XlnxZynqMPCANState *s)
 +{
 +    ARRAY_FIELD_DP32(s->regs, MODE_SELECT_REGISTER, SLEEP, 0);
 +    update_status_register_mode_bits(s);
 +}
 +
 +static void generate_frame(qemu_can_frame *frame, uint32_t *data)
 +{
 +    frame->can_id = data[0];
 +    frame->can_dlc = FIELD_EX32(data[1], TXFIFO_DLC, DLC);
 +
 +    frame->data[0] = FIELD_EX32(data[2], TXFIFO_DATA1, DB3);
 +    frame->data[1] = FIELD_EX32(data[2], TXFIFO_DATA1, DB2);
 +    frame->data[2] = FIELD_EX32(data[2], TXFIFO_DATA1, DB1);
 +    frame->data[3] = FIELD_EX32(data[2], TXFIFO_DATA1, DB0);
 +
 +    frame->data[4] = FIELD_EX32(data[3], TXFIFO_DATA2, DB7);
 +    frame->data[5] = FIELD_EX32(data[3], TXFIFO_DATA2, DB6);
 +    frame->data[6] = FIELD_EX32(data[3], TXFIFO_DATA2, DB5);
 +    frame->data[7] = FIELD_EX32(data[3], TXFIFO_DATA2, DB4);
 +}
 +
 +static bool tx_ready_check(XlnxZynqMPCANState *s)
 +{
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, SRST)) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to transfer data while"
 +                      " data while controller is in reset mode.\n",
 +                      path);
 +        return false;
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN) == 0) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to transfer"
 +                      " data while controller is in configuration mode. Reset"
 +                      " the core so operations can start fresh.\n",
 +                      path);
 +        return false;
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SNOOP)) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to transfer"
 +                      " data while controller is in SNOOP MODE.\n",
 +                      path);
 +        return false;
 +    }
 +
 +    return true;
 +}
 +
 +static void transfer_fifo(XlnxZynqMPCANState *s, Fifo32 *fifo)
 +{
 +    qemu_can_frame frame;
 +    uint32_t data[CAN_FRAME_SIZE];
 +    int i;
 +    bool can_tx = tx_ready_check(s);
 +
 +    if (!can_tx) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Controller is not enabled for data"
 +                      " transfer.\n", path);
 +        can_update_irq(s);
 +        return;
 +    }
 +
 +    while (!fifo32_is_empty(fifo)) {
 +        for (i = 0; i < CAN_FRAME_SIZE; i++) {
 +            data[i] = fifo32_pop(fifo);
 +        }
 +
 +        if (ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, LBACK)) {
 +            /*
 +             * Controller is in loopback. In Loopback mode, the CAN core
 +             * transmits a recessive bitstream on to the XlnxZynqMPCAN Bus.
 +             * Any message transmitted is looped back to the RX line and
 +             * acknowledged. The XlnxZynqMPCAN core receives any message
 +             * that it transmits.
 +             */
 +            if (fifo32_is_full(&s->rx_fifo)) {
 +                ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOFLW, 1);
 +            } else {
 +                for (i = 0; i < CAN_FRAME_SIZE; i++) {
 +                    fifo32_push(&s->rx_fifo, data[i]);
 +                }
 +
 +                ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOK, 1);
 +            }
 +        } else {
 +            /* Normal mode Tx. */
 +            generate_frame(&frame, data);
 +
 +            trace_xlnx_can_tx_data(frame.can_id, frame.can_dlc,
 +                                   frame.data[0], frame.data[1],
 +                                   frame.data[2], frame.data[3],
 +                                   frame.data[4], frame.data[5],
 +                                   frame.data[6], frame.data[7]);
 +            can_bus_client_send(&s->bus_client, &frame, 1);
 +        }
 +    }
 +
 +    ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, TXOK, 1);
 +    ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, TXBFLL, 0);
 +
 +    if (ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SLEEP)) {
 +        can_exit_sleep_mode(s);
 +    }
 +
 +    can_update_irq(s);
 +}
 +
 +static uint64_t can_srr_pre_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    ARRAY_FIELD_DP32(s->regs, SOFTWARE_RESET_REGISTER, CEN,
 +                     FIELD_EX32(val, SOFTWARE_RESET_REGISTER, CEN));
 +
 +    if (FIELD_EX32(val, SOFTWARE_RESET_REGISTER, SRST)) {
 +        trace_xlnx_can_reset(val);
 +
 +        /* First, core will do software reset then will enter in config mode. */
 +        can_config_reset(s);
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN) == 0) {
 +        can_config_mode(s);
 +    } else {
 +        /*
 +         * Leave config mode. Now XlnxZynqMPCAN core will enter normal,
 +         * sleep, snoop or loopback mode depending upon LBACK, SLEEP, SNOOP
 +         * register states.
 +         */
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, CONFIG, 0);
 +
 +        ptimer_transaction_begin(s->can_timer);
 +        ptimer_set_count(s->can_timer, 0);
 +        ptimer_transaction_commit(s->can_timer);
 +
 +        /* XlnxZynqMPCAN is out of config mode. It will send pending data. */
 +        transfer_fifo(s, &s->txhpb_fifo);
 +        transfer_fifo(s, &s->tx_fifo);
 +    }
 +
 +    update_status_register_mode_bits(s);
 +
 +    return s->regs[R_SOFTWARE_RESET_REGISTER];
 +}
 +
 +static uint64_t can_msr_pre_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +    uint8_t multi_mode;
 +
 +    /*
 +     * Multiple mode set check. This is done to make sure user doesn't set
 +     * multiple modes.
 +     */
 +    multi_mode = FIELD_EX32(val, MODE_SELECT_REGISTER, LBACK) +
 +                 FIELD_EX32(val, MODE_SELECT_REGISTER, SLEEP) +
 +                 FIELD_EX32(val, MODE_SELECT_REGISTER, SNOOP);
 +
 +    if (multi_mode > 1) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to config"
 +                      " several modes simultaneously. One mode will be selected"
 +                      " according to their priority: LBACK > SLEEP > SNOOP.\n",
 +                      path);
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN) == 0) {
 +        /* We are in configuration mode, any mode can be selected. */
 +        s->regs[R_MODE_SELECT_REGISTER] = val;
 +    } else {
 +        bool sleep_mode_bit = FIELD_EX32(val, MODE_SELECT_REGISTER, SLEEP);
 +
 +        ARRAY_FIELD_DP32(s->regs, MODE_SELECT_REGISTER, SLEEP, sleep_mode_bit);
 +
 +        if (FIELD_EX32(val, MODE_SELECT_REGISTER, LBACK)) {
 +            g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +            qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to set"
 +                          " LBACK mode without setting CEN bit as 0.\n",
 +                          path);
 +        } else if (FIELD_EX32(val, MODE_SELECT_REGISTER, SNOOP)) {
 +            g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +            qemu_log_mask(LOG_GUEST_ERROR, "%s: Attempting to set"
 +                          " SNOOP mode without setting CEN bit as 0.\n",
 +                          path);
 +        }
 +
 +        update_status_register_mode_bits(s);
 +    }
 +
 +    return s->regs[R_MODE_SELECT_REGISTER];
 +}
 +
 +static uint64_t can_brpr_pre_write(RegisterInfo  *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    /* Only allow writes when in config mode. */
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN)) {
 +        return s->regs[R_ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER];
 +    }
 +
 +    return val;
 +}
 +
 +static uint64_t can_btr_pre_write(RegisterInfo  *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    /* Only allow writes when in config mode. */
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN)) {
 +        return s->regs[R_ARBITRATION_PHASE_BIT_TIMING_REGISTER];
 +    }
 +
 +    return val;
 +}
 +
 +static uint64_t can_tcr_pre_write(RegisterInfo  *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    if (FIELD_EX32(val, TIMESTAMP_REGISTER, CTS)) {
 +        ptimer_transaction_begin(s->can_timer);
 +        ptimer_set_count(s->can_timer, 0);
 +        ptimer_transaction_commit(s->can_timer);
 +    }
 +
 +    return 0;
 +}
 +
 +static void update_rx_fifo(XlnxZynqMPCANState *s, const qemu_can_frame *frame)
 +{
 +    bool filter_pass = false;
 +    uint16_t timestamp = 0;
 +
 +    /* If no filter is enabled. Message will be stored in FIFO. */
 +    if (!((ARRAY_FIELD_EX32(s->regs, AFR, UAF1)) |
 +       (ARRAY_FIELD_EX32(s->regs, AFR, UAF2)) |
 +       (ARRAY_FIELD_EX32(s->regs, AFR, UAF3)) |
 +       (ARRAY_FIELD_EX32(s->regs, AFR, UAF4)))) {
 +        filter_pass = true;
 +    }
 +
 +    /*
 +     * Messages that pass any of the acceptance filters will be stored in
 +     * the RX FIFO.
 +     */
 +    if (ARRAY_FIELD_EX32(s->regs, AFR, UAF1)) {
 +        uint32_t id_masked = s->regs[R_AFMR1] & frame->can_id;
 +        uint32_t filter_id_masked = s->regs[R_AFMR1] & s->regs[R_AFIR1];
 +
 +        if (filter_id_masked == id_masked) {
 +            filter_pass = true;
 +        }
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, AFR, UAF2)) {
 +        uint32_t id_masked = s->regs[R_AFMR2] & frame->can_id;
 +        uint32_t filter_id_masked = s->regs[R_AFMR2] & s->regs[R_AFIR2];
 +
 +        if (filter_id_masked == id_masked) {
 +            filter_pass = true;
 +        }
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, AFR, UAF3)) {
 +        uint32_t id_masked = s->regs[R_AFMR3] & frame->can_id;
 +        uint32_t filter_id_masked = s->regs[R_AFMR3] & s->regs[R_AFIR3];
 +
 +        if (filter_id_masked == id_masked) {
 +            filter_pass = true;
 +        }
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, AFR, UAF4)) {
 +        uint32_t id_masked = s->regs[R_AFMR4] & frame->can_id;
 +        uint32_t filter_id_masked = s->regs[R_AFMR4] & s->regs[R_AFIR4];
 +
 +        if (filter_id_masked == id_masked) {
 +            filter_pass = true;
 +        }
 +    }
 +
 +    if (!filter_pass) {
 +        trace_xlnx_can_rx_fifo_filter_reject(frame->can_id, frame->can_dlc);
 +        return;
 +    }
 +
 +    /* Store the message in fifo if it passed through any of the filters. */
 +    if (filter_pass && frame->can_dlc <= MAX_DLC) {
 +
 +        if (fifo32_is_full(&s->rx_fifo)) {
 +            ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOFLW, 1);
 +        } else {
 +            timestamp = CAN_TIMER_MAX - ptimer_get_count(s->can_timer);
 +
 +            fifo32_push(&s->rx_fifo, frame->can_id);
 +
 +            fifo32_push(&s->rx_fifo, deposit32(0, R_RXFIFO_DLC_DLC_SHIFT,
 +                                               R_RXFIFO_DLC_DLC_LENGTH,
 +                                               frame->can_dlc) |
 +                                     deposit32(0, R_RXFIFO_DLC_RXT_SHIFT,
 +                                               R_RXFIFO_DLC_RXT_LENGTH,
 +                                               timestamp));
 +
 +            /* First 32 bit of the data. */
 +            fifo32_push(&s->rx_fifo, deposit32(0, R_TXFIFO_DATA1_DB3_SHIFT,
 +                                               R_TXFIFO_DATA1_DB3_LENGTH,
 +                                               frame->data[0]) |
 +                                     deposit32(0, R_TXFIFO_DATA1_DB2_SHIFT,
 +                                               R_TXFIFO_DATA1_DB2_LENGTH,
 +                                               frame->data[1]) |
 +                                     deposit32(0, R_TXFIFO_DATA1_DB1_SHIFT,
 +                                               R_TXFIFO_DATA1_DB1_LENGTH,
 +                                               frame->data[2]) |
 +                                     deposit32(0, R_TXFIFO_DATA1_DB0_SHIFT,
 +                                               R_TXFIFO_DATA1_DB0_LENGTH,
 +                                               frame->data[3]));
 +            /* Last 32 bit of the data. */
 +            fifo32_push(&s->rx_fifo, deposit32(0, R_TXFIFO_DATA2_DB7_SHIFT,
 +                                               R_TXFIFO_DATA2_DB7_LENGTH,
 +                                               frame->data[4]) |
 +                                     deposit32(0, R_TXFIFO_DATA2_DB6_SHIFT,
 +                                               R_TXFIFO_DATA2_DB6_LENGTH,
 +                                               frame->data[5]) |
 +                                     deposit32(0, R_TXFIFO_DATA2_DB5_SHIFT,
 +                                               R_TXFIFO_DATA2_DB5_LENGTH,
 +                                               frame->data[6]) |
 +                                     deposit32(0, R_TXFIFO_DATA2_DB4_SHIFT,
 +                                               R_TXFIFO_DATA2_DB4_LENGTH,
 +                                               frame->data[7]));
 +
 +            ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXOK, 1);
 +            trace_xlnx_can_rx_data(frame->can_id, frame->can_dlc,
 +                                   frame->data[0], frame->data[1],
 +                                   frame->data[2], frame->data[3],
 +                                   frame->data[4], frame->data[5],
 +                                   frame->data[6], frame->data[7]);
 +        }
 +
 +        can_update_irq(s);
 +    }
 +}
 +
 +static uint64_t can_rxfifo_pre_read(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    if (!fifo32_is_empty(&s->rx_fifo)) {
 +        val = fifo32_pop(&s->rx_fifo);
 +    } else {
 +        ARRAY_FIELD_DP32(s->regs, INTERRUPT_STATUS_REGISTER, RXUFLW, 1);
 +    }
 +
 +    can_update_irq(s);
 +    return val;
 +}
 +
 +static void can_filter_enable_post_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    if (ARRAY_FIELD_EX32(s->regs, AFR, UAF1) &&
 +        ARRAY_FIELD_EX32(s->regs, AFR, UAF2) &&
 +        ARRAY_FIELD_EX32(s->regs, AFR, UAF3) &&
 +        ARRAY_FIELD_EX32(s->regs, AFR, UAF4)) {
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, ACFBSY, 1);
 +    } else {
 +        ARRAY_FIELD_DP32(s->regs, STATUS_REGISTER, ACFBSY, 0);
 +    }
 +}
 +
 +static uint64_t can_filter_mask_pre_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +    uint32_t reg_idx = (reg->access->addr) / 4;
 +    uint32_t filter_number = (reg_idx - R_AFMR1) / 2;
 +
 +    /* modify an acceptance filter, the corresponding UAF bit should be '0'. */
 +    if (!(s->regs[R_AFR] & (1 << filter_number))) {
 +        s->regs[reg_idx] = val;
 +
 +        trace_xlnx_can_filter_mask_pre_write(filter_number, s->regs[reg_idx]);
 +    } else {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Acceptance filter %d"
 +                      " mask is not set as corresponding UAF bit is not 0.\n",
 +                      path, filter_number + 1);
 +    }
 +
 +    return s->regs[reg_idx];
 +}
 +
 +static uint64_t can_filter_id_pre_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +    uint32_t reg_idx = (reg->access->addr) / 4;
 +    uint32_t filter_number = (reg_idx - R_AFIR1) / 2;
 +
 +    if (!(s->regs[R_AFR] & (1 << filter_number))) {
 +        s->regs[reg_idx] = val;
 +
 +        trace_xlnx_can_filter_id_pre_write(filter_number, s->regs[reg_idx]);
 +    } else {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Acceptance filter %d"
 +                      " id is not set as corresponding UAF bit is not 0.\n",
 +                      path, filter_number + 1);
 +    }
 +
 +    return s->regs[reg_idx];
 +}
 +
 +static void can_tx_post_write(RegisterInfo *reg, uint64_t val)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(reg->opaque);
 +
 +    bool is_txhpb = reg->access->addr > A_TXFIFO_DATA2;
 +
 +    bool initiate_transfer = (reg->access->addr == A_TXFIFO_DATA2) ||
 +                             (reg->access->addr == A_TXHPB_DATA2);
 +
 +    Fifo32 *f = is_txhpb ? &s->txhpb_fifo : &s->tx_fifo;
 +
 +    if (!fifo32_is_full(f)) {
 +        fifo32_push(f, val);
 +    } else {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: TX FIFO is full.\n", path);
 +    }
 +
 +    /* Initiate the message send if TX register is written. */
 +    if (initiate_transfer &&
 +        ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN)) {
 +        transfer_fifo(s, f);
 +    }
 +
 +    can_update_irq(s);
 +}
 +
 +static const RegisterAccessInfo can_regs_info[] = {
 +    {   .name = "SOFTWARE_RESET_REGISTER",
 +        .addr = A_SOFTWARE_RESET_REGISTER,
 +        .rsvd = 0xfffffffc,
 +        .pre_write = can_srr_pre_write,
 +    },{ .name = "MODE_SELECT_REGISTER",
 +        .addr = A_MODE_SELECT_REGISTER,
 +        .rsvd = 0xfffffff8,
 +        .pre_write = can_msr_pre_write,
 +    },{ .name = "ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER",
 +        .addr = A_ARBITRATION_PHASE_BAUD_RATE_PRESCALER_REGISTER,
 +        .rsvd = 0xffffff00,
 +        .pre_write = can_brpr_pre_write,
 +    },{ .name = "ARBITRATION_PHASE_BIT_TIMING_REGISTER",
 +        .addr = A_ARBITRATION_PHASE_BIT_TIMING_REGISTER,
 +        .rsvd = 0xfffffe00,
 +        .pre_write = can_btr_pre_write,
 +    },{ .name = "ERROR_COUNTER_REGISTER",
 +        .addr = A_ERROR_COUNTER_REGISTER,
 +        .rsvd = 0xffff0000,
 +        .ro = 0xffffffff,
 +    },{ .name = "ERROR_STATUS_REGISTER",
 +        .addr = A_ERROR_STATUS_REGISTER,
 +        .rsvd = 0xffffffe0,
 +        .w1c = 0x1f,
 +    },{ .name = "STATUS_REGISTER",  .addr = A_STATUS_REGISTER,
 +        .reset = 0x1,
 +        .rsvd = 0xffffe000,
 +        .ro = 0x1fff,
 +    },{ .name = "INTERRUPT_STATUS_REGISTER",
 +        .addr = A_INTERRUPT_STATUS_REGISTER,
 +        .reset = 0x6000,
 +        .rsvd = 0xffff8000,
 +        .ro = 0x7fff,
 +    },{ .name = "INTERRUPT_ENABLE_REGISTER",
 +        .addr = A_INTERRUPT_ENABLE_REGISTER,
 +        .rsvd = 0xffff8000,
 +        .post_write = can_ier_post_write,
 +    },{ .name = "INTERRUPT_CLEAR_REGISTER",
 +        .addr = A_INTERRUPT_CLEAR_REGISTER,
 +        .rsvd = 0xffff8000,
 +        .pre_write = can_icr_pre_write,
 +    },{ .name = "TIMESTAMP_REGISTER",
 +        .addr = A_TIMESTAMP_REGISTER,
 +        .rsvd = 0xfffffffe,
 +        .pre_write = can_tcr_pre_write,
 +    },{ .name = "WIR",  .addr = A_WIR,
 +        .reset = 0x3f3f,
 +        .rsvd = 0xffff0000,
 +    },{ .name = "TXFIFO_ID",  .addr = A_TXFIFO_ID,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXFIFO_DLC",  .addr = A_TXFIFO_DLC,
 +        .rsvd = 0xfffffff,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXFIFO_DATA1",  .addr = A_TXFIFO_DATA1,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXFIFO_DATA2",  .addr = A_TXFIFO_DATA2,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXHPB_ID",  .addr = A_TXHPB_ID,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXHPB_DLC",  .addr = A_TXHPB_DLC,
 +        .rsvd = 0xfffffff,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXHPB_DATA1",  .addr = A_TXHPB_DATA1,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "TXHPB_DATA2",  .addr = A_TXHPB_DATA2,
 +        .post_write = can_tx_post_write,
 +    },{ .name = "RXFIFO_ID",  .addr = A_RXFIFO_ID,
 +        .ro = 0xffffffff,
 +        .post_read = can_rxfifo_pre_read,
 +    },{ .name = "RXFIFO_DLC",  .addr = A_RXFIFO_DLC,
 +        .rsvd = 0xfff0000,
 +        .post_read = can_rxfifo_pre_read,
 +    },{ .name = "RXFIFO_DATA1",  .addr = A_RXFIFO_DATA1,
 +        .post_read = can_rxfifo_pre_read,
 +    },{ .name = "RXFIFO_DATA2",  .addr = A_RXFIFO_DATA2,
 +        .post_read = can_rxfifo_pre_read,
 +    },{ .name = "AFR",  .addr = A_AFR,
 +        .rsvd = 0xfffffff0,
 +        .post_write = can_filter_enable_post_write,
 +    },{ .name = "AFMR1",  .addr = A_AFMR1,
 +        .pre_write = can_filter_mask_pre_write,
 +    },{ .name = "AFIR1",  .addr = A_AFIR1,
 +        .pre_write = can_filter_id_pre_write,
 +    },{ .name = "AFMR2",  .addr = A_AFMR2,
 +        .pre_write = can_filter_mask_pre_write,
 +    },{ .name = "AFIR2",  .addr = A_AFIR2,
 +        .pre_write = can_filter_id_pre_write,
 +    },{ .name = "AFMR3",  .addr = A_AFMR3,
 +        .pre_write = can_filter_mask_pre_write,
 +    },{ .name = "AFIR3",  .addr = A_AFIR3,
 +        .pre_write = can_filter_id_pre_write,
 +    },{ .name = "AFMR4",  .addr = A_AFMR4,
 +        .pre_write = can_filter_mask_pre_write,
 +    },{ .name = "AFIR4",  .addr = A_AFIR4,
 +        .pre_write = can_filter_id_pre_write,
 +    }
 +};
 +
 +static void xlnx_zynqmp_can_ptimer_cb(void *opaque)
 +{
 +    /* No action required on the timer rollover. */
 +}
 +
 +static const MemoryRegionOps can_ops = {
 +    .read = register_read_memory,
 +    .write = register_write_memory,
 +    .endianness = DEVICE_LITTLE_ENDIAN,
 +    .valid = {
 +        .min_access_size = 4,
 +        .max_access_size = 4,
 +    },
 +};
 +
 +static void xlnx_zynqmp_can_reset_init(Object *obj, ResetType type)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(obj);
 +    unsigned int i;
 +
 +    for (i = R_RXFIFO_ID; i < ARRAY_SIZE(s->reg_info); ++i) {
 +        register_reset(&s->reg_info[i]);
 +    }
 +
 +    ptimer_transaction_begin(s->can_timer);
 +    ptimer_set_count(s->can_timer, 0);
 +    ptimer_transaction_commit(s->can_timer);
 +}
 +
 +static void xlnx_zynqmp_can_reset_hold(Object *obj)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(obj);
 +    unsigned int i;
 +
 +    for (i = 0; i < R_RXFIFO_ID; ++i) {
 +        register_reset(&s->reg_info[i]);
 +    }
 +
 +    /*
 +     * Reset FIFOs when CAN model is reset. This will clear the fifo writes
 +     * done by post_write which gets called from register_reset function,
 +     * post_write handle will not be able to trigger tx because CAN will be
 +     * disabled when software_reset_register is cleared first.
 +     */
 +    fifo32_reset(&s->rx_fifo);
 +    fifo32_reset(&s->tx_fifo);
 +    fifo32_reset(&s->txhpb_fifo);
 +}
 +
 +static bool xlnx_zynqmp_can_can_receive(CanBusClientState *client)
 +{
 +    XlnxZynqMPCANState *s = container_of(client, XlnxZynqMPCANState,
 +                                         bus_client);
 +
 +    if (ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, SRST)) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Controller is in reset state.\n",
 +                      path);
 +        return false;
 +    }
 +
 +    if ((ARRAY_FIELD_EX32(s->regs, SOFTWARE_RESET_REGISTER, CEN)) == 0) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Controller is disabled. Incoming"
 +                      " messages will be discarded.\n", path);
 +        return false;
 +    }
 +
 +    return true;
 +}
 +
 +static ssize_t xlnx_zynqmp_can_receive(CanBusClientState *client,
 +                               const qemu_can_frame *buf, size_t buf_size) {
 +    XlnxZynqMPCANState *s = container_of(client, XlnxZynqMPCANState,
 +                                         bus_client);
 +    const qemu_can_frame *frame = buf;
 +
 +    if (buf_size <= 0) {
 +        g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +        qemu_log_mask(LOG_GUEST_ERROR, "%s: Error in the data received.\n",
 +                      path);
 +        return 0;
 +    }
 +
 +    if (ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SNOOP)) {
 +        /* Snoop Mode: Just keep the data. no response back. */
 +        update_rx_fifo(s, frame);
 +    } else if ((ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SLEEP))) {
 +        /*
 +         * XlnxZynqMPCAN is in sleep mode. Any data on bus will bring it to wake
 +         * up state.
 +         */
 +        can_exit_sleep_mode(s);
 +        update_rx_fifo(s, frame);
 +    } else if ((ARRAY_FIELD_EX32(s->regs, STATUS_REGISTER, SLEEP)) == 0) {
 +        update_rx_fifo(s, frame);
 +    } else {
 +        /*
 +         * XlnxZynqMPCAN will not participate in normal bus communication
 +         * and will not receive any messages transmitted by other CAN nodes.
 +         */
 +        trace_xlnx_can_rx_discard(s->regs[R_STATUS_REGISTER]);
 +    }
 +
 +    return 1;
 +}
 +
 +static CanBusClientInfo can_xilinx_bus_client_info = {
 +    .can_receive = xlnx_zynqmp_can_can_receive,
 +    .receive = xlnx_zynqmp_can_receive,
 +};
 +
 +static int xlnx_zynqmp_can_connect_to_bus(XlnxZynqMPCANState *s,
 +                                          CanBusState *bus)
 +{
 +    s->bus_client.info = &can_xilinx_bus_client_info;
 +
 +    if (can_bus_insert_client(bus, &s->bus_client) < 0) {
 +        return -1;
 +    }
 +    return 0;
 +}
 +
 +static void xlnx_zynqmp_can_realize(DeviceState *dev, Error **errp)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(dev);
 +
 +    if (s->canbus) {
 +        if (xlnx_zynqmp_can_connect_to_bus(s, s->canbus) < 0) {
 +            g_autofree char *path = object_get_canonical_path(OBJECT(s));
 +
 +            error_setg(errp, "%s: xlnx_zynqmp_can_connect_to_bus"
 +                       " failed.", path);
 +            return;
 +        }
 +    }
 +
 +    /* Create RX FIFO, TXFIFO, TXHPB storage. */
 +    fifo32_create(&s->rx_fifo, RXFIFO_SIZE);
 +    fifo32_create(&s->tx_fifo, RXFIFO_SIZE);
 +    fifo32_create(&s->txhpb_fifo, CAN_FRAME_SIZE);
 +
 +    /* Allocate a new timer. */
 +    s->can_timer = ptimer_init(xlnx_zynqmp_can_ptimer_cb, s,
 +                               PTIMER_POLICY_DEFAULT);
 +
 +    ptimer_transaction_begin(s->can_timer);
 +
 +    ptimer_set_freq(s->can_timer, s->cfg.ext_clk_freq);
 +    ptimer_set_limit(s->can_timer, CAN_TIMER_MAX, 1);
 +    ptimer_run(s->can_timer, 0);
 +    ptimer_transaction_commit(s->can_timer);
 +}
 +
 +static void xlnx_zynqmp_can_init(Object *obj)
 +{
 +    XlnxZynqMPCANState *s = XLNX_ZYNQMP_CAN(obj);
 +    SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
 +
 +    RegisterInfoArray *reg_array;
 +
 +    memory_region_init(&s->iomem, obj, TYPE_XLNX_ZYNQMP_CAN,
 +                        XLNX_ZYNQMP_CAN_R_MAX * 4);
 +    reg_array = register_init_block32(DEVICE(obj), can_regs_info,
 +                               ARRAY_SIZE(can_regs_info),
 +                               s->reg_info, s->regs,
 +                               &can_ops,
 +                               XLNX_ZYNQMP_CAN_ERR_DEBUG,
 +                               XLNX_ZYNQMP_CAN_R_MAX * 4);
 +
 +    memory_region_add_subregion(&s->iomem, 0x00, &reg_array->mem);
 +    sysbus_init_mmio(sbd, &s->iomem);
 +    sysbus_init_irq(SYS_BUS_DEVICE(obj), &s->irq);
 +}
 +
 +static const VMStateDescription vmstate_can = {
 +    .name = TYPE_XLNX_ZYNQMP_CAN,
 +    .version_id = 1,
 +    .minimum_version_id = 1,
 +    .fields = (VMStateField[]) {
-+        VMSTATE_I2C_SLAVE(parent_obj, sii9022_state),
++        VMSTATE_FIFO32(rx_fifo, XlnxZynqMPCANState),
-+        VMSTATE_UINT8(ptr, sii9022_state),
++        VMSTATE_FIFO32(tx_fifo, XlnxZynqMPCANState),
-+        VMSTATE_BOOL(addr_byte, sii9022_state),
++        VMSTATE_FIFO32(txhpb_fifo, XlnxZynqMPCANState),
-+        VMSTATE_BOOL(ddc_req, sii9022_state),
++        VMSTATE_UINT32_ARRAY(regs, XlnxZynqMPCANState, XLNX_ZYNQMP_CAN_R_MAX),
-+        VMSTATE_BOOL(ddc_skip_finish, sii9022_state),
++        VMSTATE_PTIMER(can_timer, XlnxZynqMPCANState),
-+        VMSTATE_BOOL(ddc, sii9022_state),
++        VMSTATE_END_OF_LIST(),
 +        VMSTATE_END_OF_LIST()
 +    }
 +};
 +
-+static int sii9022_event(I2CSlave *i2c, enum i2c_event event)
++static Property xlnx_zynqmp_can_properties[] = {
-+{
++    DEFINE_PROP_UINT32("ext_clk_freq", XlnxZynqMPCANState, cfg.ext_clk_freq,
-+    sii9022_state *s = SII9022(i2c);
++                       CAN_DEFAULT_CLOCK),
-+
++    DEFINE_PROP_LINK("canbus", XlnxZynqMPCANState, canbus, TYPE_CAN_BUS,
-+    switch (event) {
++                     CanBusState *),
-+    case I2C_START_SEND:
++    DEFINE_PROP_END_OF_LIST(),
-+        s->addr_byte = true;
++};
-+        break;
++
-+    case I2C_START_RECV:
++static void xlnx_zynqmp_can_class_init(ObjectClass *klass, void *data)
 +        break;
 +    case I2C_FINISH:
 +        break;
 +    case I2C_NACK:
 +        break;
 +    }
 +
 +    return 0;
 +}
 +
 +static int sii9022_rx(I2CSlave *i2c)
 +{
 +    sii9022_state *s = SII9022(i2c);
 +    uint8_t res = 0x00;
 +
 +    switch (s->ptr) {
 +    case SII9022_SYS_CTRL_DATA:
 +        if (s->ddc_req) {
 +            /* Acknowledge DDC bus request */
 +            res = SII9022_SYS_CTRL_DDC_BUS_GRTD | SII9022_SYS_CTRL_DDC_BUS_REQ;
 +        }
 +        break;
 +    case SII9022_REG_CHIPID:
 +        res = 0xb0;
 +        break;
 +    case SII9022_INT_STATUS:
 +        /* Something is cold-plugged in, no interrupts */
 +        res = SII9022_INT_STATUS_PLUGGED;
 +        break;
 +    default:
 +        break;
 +    }
 +
 +    trace_sii9022_read_reg(s->ptr, res);
 +    s->ptr++;
 +
 +    return res;
 +}
 +
 +static int sii9022_tx(I2CSlave *i2c, uint8_t data)
 +{
 +    sii9022_state *s = SII9022(i2c);
 +
 +    if (s->addr_byte) {
 +        s->ptr = data;
 +        s->addr_byte = false;
 +        return 0;
 +    }
 +
 +    switch (s->ptr) {
 +    case SII9022_SYS_CTRL_DATA:
 +        if (data & SII9022_SYS_CTRL_DDC_BUS_REQ) {
 +            s->ddc_req = true;
 +            if (data & SII9022_SYS_CTRL_DDC_BUS_GRTD) {
 +                s->ddc = true;
 +                /* Skip this finish since we just switched to DDC */
 +                s->ddc_skip_finish = true;
 +                trace_sii9022_switch_mode("DDC");
 +            }
 +        } else {
 +            s->ddc_req = false;
 +            s->ddc = false;
 +            trace_sii9022_switch_mode("normal");
 +        }
 +        break;
 +    default:
 +        break;
 +    }
 +
 +    trace_sii9022_write_reg(s->ptr, data);
 +    s->ptr++;
 +
 +    return 0;
 +}
 +
 +static void sii9022_reset(DeviceState *dev)
 +{
 +    sii9022_state *s = SII9022(dev);
 +
 +    s->ptr = 0;
 +    s->addr_byte = false;
 +    s->ddc_req = false;
 +    s->ddc_skip_finish = false;
 +    s->ddc = false;
 +}
 +
 +static void sii9022_realize(DeviceState *dev, Error **errp)
 +{
 +    I2CBus *bus;
 +
 +    bus = I2C_BUS(qdev_get_parent_bus(dev));
 +    i2c_create_slave(bus, TYPE_I2CDDC, 0x50);
 +}
 +
 +static void sii9022_class_init(ObjectClass *klass, void *data)
 +{
 +    DeviceClass *dc = DEVICE_CLASS(klass);
-+    I2CSlaveClass *k = I2C_SLAVE_CLASS(klass);
++    ResettableClass *rc = RESETTABLE_CLASS(klass);
 +
-+    k->event = sii9022_event;
++    rc->phases.enter = xlnx_zynqmp_can_reset_init;
-+    k->recv = sii9022_rx;
++    rc->phases.hold = xlnx_zynqmp_can_reset_hold;
-+    k->send = sii9022_tx;
++    dc->realize = xlnx_zynqmp_can_realize;
-+    dc->reset = sii9022_reset;
++    device_class_set_props(dc, xlnx_zynqmp_can_properties);
-+    dc->realize = sii9022_realize;
++    dc->vmsd = &vmstate_can;
-+    dc->vmsd = &vmstate_sii9022;
++}
-+}
++
-+
++static const TypeInfo can_info = {
-+static const TypeInfo sii9022_info = {
++    .name          = TYPE_XLNX_ZYNQMP_CAN,
-+    .name          = TYPE_SII9022,
++    .parent        = TYPE_SYS_BUS_DEVICE,
-+    .parent        = TYPE_I2C_SLAVE,
++    .instance_size = sizeof(XlnxZynqMPCANState),
-+    .instance_size = sizeof(sii9022_state),
++    .class_init    = xlnx_zynqmp_can_class_init,
-+    .class_init    = sii9022_class_init,
++    .instance_init = xlnx_zynqmp_can_init,
 +};
 +
-+static void sii9022_register_types(void)
++static void can_register_types(void)
 +{
-+    type_register_static(&sii9022_info);
++    type_register_static(&can_info);
 +}
 +
-+type_init(sii9022_register_types)
++type_init(can_register_types)
-diff --git a/hw/display/trace-events b/hw/display/trace-events
+diff --git a/hw/Kconfig b/hw/Kconfig
 index XXXXXXX..XXXXXXX 100644
---- a/hw/display/trace-events
+--- a/hw/Kconfig
-+++ b/hw/display/trace-events
++++ b/hw/Kconfig
-@@ -XXX,XX +XXX,XX @@ vga_cirrus_read_io(uint32_t addr, uint32_t val) "addr 0x%x, val 0x%x"
+@@ -XXX,XX +XXX,XX @@ config XILINX_AXI
- vga_cirrus_write_io(uint32_t addr, uint32_t val) "addr 0x%x, val 0x%x"
+ config XLNX_ZYNQMP
- vga_cirrus_read_blt(uint32_t offset, uint32_t val) "offset 0x%x, val 0x%x"
+     bool
- vga_cirrus_write_blt(uint32_t offset, uint32_t val) "offset 0x%x, val 0x%x"
+     select REGISTER
-+
++    select CAN_BUS
-+# hw/display/sii9022.c
+diff --git a/hw/net/can/meson.build b/hw/net/can/meson.build
-+sii9022_read_reg(uint8_t addr, uint8_t val) "addr 0x%02x, val 0x%02x"
+index XXXXXXX..XXXXXXX 100644
-+sii9022_write_reg(uint8_t addr, uint8_t val) "addr 0x%02x, val 0x%02x"
+--- a/hw/net/can/meson.build
-+sii9022_switch_mode(const char *mode) "mode: %s"
++++ b/hw/net/can/meson.build
@@ -XXX,XX +XXX,XX @@ softmmu_ss.add(when: 'CONFIG_CAN_PCI', if_true: files('can_pcm3680_pci.c'))
  softmmu_ss.add(when: 'CONFIG_CAN_PCI', if_true: files('can_mioe3680_pci.c'))
  softmmu_ss.add(when: 'CONFIG_CAN_CTUCANFD', if_true: files('ctucan_core.c'))
  softmmu_ss.add(when: 'CONFIG_CAN_CTUCANFD_PCI', if_true: files('ctucan_pci.c'))
 +softmmu_ss.add(when: 'CONFIG_XLNX_ZYNQMP', if_true: files('xlnx-zynqmp-can.c'))
 diff --git a/hw/net/can/trace-events b/hw/net/can/trace-events
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/hw/net/can/trace-events
@@ -XXX,XX +XXX,XX @@
 +# xlnx-zynqmp-can.c
 +xlnx_can_update_irq(uint32_t isr, uint32_t ier, uint32_t irq) "ISR: 0x%08x IER: 0x%08x IRQ: 0x%08x"
 +xlnx_can_reset(uint32_t val) "Resetting controller with value = 0x%08x"
 +xlnx_can_rx_fifo_filter_reject(uint32_t id, uint8_t dlc) "Frame: ID: 0x%08x DLC: 0x%02x"
 +xlnx_can_filter_id_pre_write(uint8_t filter_num, uint32_t value) "Filter%d ID: 0x%08x"
 +xlnx_can_filter_mask_pre_write(uint8_t filter_num, uint32_t value) "Filter%d MASK: 0x%08x"
 +xlnx_can_tx_data(uint32_t id, uint8_t dlc, uint8_t db0, uint8_t db1, uint8_t db2, uint8_t db3, uint8_t db4, uint8_t db5, uint8_t db6, uint8_t db7) "Frame: ID: 0x%08x DLC: 0x%02x DATA: 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x"
 +xlnx_can_rx_data(uint32_t id, uint32_t dlc, uint8_t db0, uint8_t db1, uint8_t db2, uint8_t db3, uint8_t db4, uint8_t db5, uint8_t db6, uint8_t db7) "Frame: ID: 0x%08x DLC: 0x%02x DATA: 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x 0x%02x"
 +xlnx_can_rx_discard(uint32_t status) "Controller is not enabled for bus communication. Status Register: 0x%08x"
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 16/42] arm/translate-a64: initial decode for simd_three_reg_same_fp16
+[PULL 03/36] xlnx-zynqmp: Connect Xilinx ZynqMP CAN controllers
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Vikram Garhwal <fnu.vikram@xilinx.com>
-This is the initial decode skeleton for the Advanced SIMD three same
+Connect CAN0 and CAN1 on the ZynqMP.
 instruction group.
-The fprintf is purely to aid debugging as the additional instructions
+Reviewed-by: Francisco Iglesias <francisco.iglesias@xilinx.com>
-are added. It will be removed once the group is complete.
+Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Signed-off-by: Vikram Garhwal <fnu.vikram@xilinx.com>
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Message-id: 1605728926-352690-3-git-send-email-fnu.vikram@xilinx.com
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-9-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 73 ++++++++++++++++++++++++++++++++++++++++++++++
+ include/hw/arm/xlnx-zynqmp.h |  8 ++++++++
-file changed, 73 insertions(+)
+ hw/arm/xlnx-zcu102.c         | 20 ++++++++++++++++++++
  hw/arm/xlnx-zynqmp.c         | 34 ++++++++++++++++++++++++++++++++++
 files changed, 62 insertions(+)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/include/hw/arm/xlnx-zynqmp.h b/include/hw/arm/xlnx-zynqmp.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/include/hw/arm/xlnx-zynqmp.h
-+++ b/target/arm/translate-a64.c
++++ b/include/hw/arm/xlnx-zynqmp.h
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@
-     }
+ #include "hw/intc/arm_gic.h"
- }
+ #include "hw/net/cadence_gem.h"
+ #include "hw/char/cadence_uart.h"
-+/*
++#include "hw/net/xlnx-zynqmp-can.h"
-+ * Advanced SIMD three same (ARMv8.2 FP16 variants)
+ #include "hw/ide/ahci.h"
-+ *
+ #include "hw/sd/sdhci.h"
-+ *  31  30  29  28       24 23  22 21 20  16 15 14 13    11 10  9    5 4    0
+ #include "hw/ssi/xilinx_spips.h"
-+ * +---+---+---+-----------+---------+------+-----+--------+---+------+------+
+@@ -XXX,XX +XXX,XX @@
-+ * | 0 | Q | U | 0 1 1 1 0 | a | 1 0 |  Rm  | 0 0 | opcode | 1 |  Rn  |  Rd  |
+ #include "hw/cpu/cluster.h"
-+ * +---+---+---+-----------+---------+------+-----+--------+---+------+------+
+ #include "target/arm/cpu.h"
-+ *
+ #include "qom/object.h"
-+ * This includes FMULX, FCMEQ (register), FRECPS, FRSQRTS, FCMGE
++#include "net/can_emu.h"
-+ * (register), FACGE, FABD, FCMGT (register) and FACGT.
-+ *
+ #define TYPE_XLNX_ZYNQMP "xlnx,zynqmp"
-+ */
+ OBJECT_DECLARE_SIMPLE_TYPE(XlnxZynqMPState, XLNX_ZYNQMP)
-+static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ OBJECT_DECLARE_SIMPLE_TYPE(XlnxZynqMPState, XLNX_ZYNQMP)
-+{
+ #define XLNX_ZYNQMP_NUM_RPU_CPUS 2
-+    int opcode, fpopcode;
+ #define XLNX_ZYNQMP_NUM_GEMS 4
-+    int is_q, u, a, rm, rn, rd;
+ #define XLNX_ZYNQMP_NUM_UARTS 2
-+    int datasize, elements;
++#define XLNX_ZYNQMP_NUM_CAN 2
-+    int pass;
++#define XLNX_ZYNQMP_CAN_REF_CLK (24 * 1000 * 1000)
-+    TCGv_ptr fpst;
+ #define XLNX_ZYNQMP_NUM_SDHCI 2
  #define XLNX_ZYNQMP_NUM_SPIS 2
  #define XLNX_ZYNQMP_NUM_GDMA_CH 8
@@ -XXX,XX +XXX,XX @@ struct XlnxZynqMPState {
      CadenceGEMState gem[XLNX_ZYNQMP_NUM_GEMS];
      CadenceUARTState uart[XLNX_ZYNQMP_NUM_UARTS];
 +    XlnxZynqMPCANState can[XLNX_ZYNQMP_NUM_CAN];
      SysbusAHCIState sata;
      SDHCIState sdhci[XLNX_ZYNQMP_NUM_SDHCI];
      XilinxSPIPS spi[XLNX_ZYNQMP_NUM_SPIS];
@@ -XXX,XX +XXX,XX @@ struct XlnxZynqMPState {
      bool virt;
      /* Has the RPU subsystem?  */
      bool has_rpu;
 +
-+    if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
++    /* CAN bus. */
-+        unallocated_encoding(s);
++    CanBusState *canbus[XLNX_ZYNQMP_NUM_CAN];
-+        return;
+ };
  #endif
 diff --git a/hw/arm/xlnx-zcu102.c b/hw/arm/xlnx-zcu102.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/xlnx-zcu102.c
 +++ b/hw/arm/xlnx-zcu102.c
@@ -XXX,XX +XXX,XX @@
  #include "sysemu/qtest.h"
  #include "sysemu/device_tree.h"
  #include "qom/object.h"
 +#include "net/can_emu.h"
  struct XlnxZCU102 {
      MachineState parent_obj;
@@ -XXX,XX +XXX,XX @@ struct XlnxZCU102 {
      bool secure;
      bool virt;
 +    CanBusState *canbus[XLNX_ZYNQMP_NUM_CAN];
 +
      struct arm_boot_info binfo;
  };
@@ -XXX,XX +XXX,XX @@ static void xlnx_zcu102_init(MachineState *machine)
      object_property_set_bool(OBJECT(&s->soc), "virtualization", s->virt,
                               &error_fatal);
 +    for (i = 0; i < XLNX_ZYNQMP_NUM_CAN; i++) {
 +        gchar *bus_name = g_strdup_printf("canbus%d", i);
 +
 +        object_property_set_link(OBJECT(&s->soc), bus_name,
 +                                 OBJECT(s->canbus[i]), &error_fatal);
 +        g_free(bus_name);
 +    }
 +
-+    if (!fp_access_check(s)) {
+     qdev_realize(DEVICE(&s->soc), NULL, &error_fatal);
-+        return;
      /* Create and plug in the SD cards */
@@ -XXX,XX +XXX,XX @@ static void xlnx_zcu102_machine_instance_init(Object *obj)
      s->secure = false;
      /* Default to virt (EL2) being disabled */
      s->virt = false;
 +    object_property_add_link(obj, "xlnx-zcu102.canbus0", TYPE_CAN_BUS,
 +                             (Object **)&s->canbus[0],
 +                             object_property_allow_set_link,
 +                             0);
 +
 +    object_property_add_link(obj, "xlnx-zcu102.canbus1", TYPE_CAN_BUS,
 +                             (Object **)&s->canbus[1],
 +                             object_property_allow_set_link,
 +                             0);
  }
  static void xlnx_zcu102_machine_class_init(ObjectClass *oc, void *data)
 diff --git a/hw/arm/xlnx-zynqmp.c b/hw/arm/xlnx-zynqmp.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/xlnx-zynqmp.c
 +++ b/hw/arm/xlnx-zynqmp.c
@@ -XXX,XX +XXX,XX @@ static const int uart_intr[XLNX_ZYNQMP_NUM_UARTS] = {
 , 22,
  };
 +static const uint64_t can_addr[XLNX_ZYNQMP_NUM_CAN] = {
 +    0xFF060000, 0xFF070000,
 +};
 +
 +static const int can_intr[XLNX_ZYNQMP_NUM_CAN] = {
 +    23, 24,
 +};
 +
  static const uint64_t sdhci_addr[XLNX_ZYNQMP_NUM_SDHCI] = {
 xFF160000, 0xFF170000,
  };
@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_init(Object *obj)
                                  TYPE_CADENCE_UART);
      }
 +    for (i = 0; i < XLNX_ZYNQMP_NUM_CAN; i++) {
 +        object_initialize_child(obj, "can[*]", &s->can[i],
 +                                TYPE_XLNX_ZYNQMP_CAN);
 +    }
 +
-+    /* For these floating point ops, the U, a and opcode bits
+     object_initialize_child(obj, "sata", &s->sata, TYPE_SYSBUS_AHCI);
-+     * together indicate the operation.
-+     */
+     for (i = 0; i < XLNX_ZYNQMP_NUM_SDHCI; i++) {
-+    opcode = extract32(insn, 11, 3);
+@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_realize(DeviceState *dev, Error **errp)
-+    u = extract32(insn, 29, 1);
+                            gic_spi[uart_intr[i]]);
-+    a = extract32(insn, 23, 1);
+     }
-+    is_q = extract32(insn, 30, 1);
-+    rm = extract32(insn, 16, 5);
++    for (i = 0; i < XLNX_ZYNQMP_NUM_CAN; i++) {
-+    rn = extract32(insn, 5, 5);
++        object_property_set_int(OBJECT(&s->can[i]), "ext_clk_freq",
-+    rd = extract32(insn, 0, 5);
++                                XLNX_ZYNQMP_CAN_REF_CLK, &error_abort);
 +
-+    fpopcode = opcode | (a << 3) |  (u << 4);
++        object_property_set_link(OBJECT(&s->can[i]), "canbus",
-+    datasize = is_q ? 128 : 64;
++                                 OBJECT(s->canbus[i]), &error_fatal);
 +    elements = datasize / 16;
 +
-+    fpst = get_fpstatus_ptr(true);
++        sysbus_realize(SYS_BUS_DEVICE(&s->can[i]), &err);
-+
++        if (err) {
-+    for (pass = 0; pass < elements; pass++) {
++            error_propagate(errp, err);
-+        TCGv_i32 tcg_op1 = tcg_temp_new_i32();
++            return;
 +        TCGv_i32 tcg_op2 = tcg_temp_new_i32();
 +        TCGv_i32 tcg_res = tcg_temp_new_i32();
 +
 +        read_vec_element_i32(s, tcg_op1, rn, pass, MO_16);
 +        read_vec_element_i32(s, tcg_op2, rm, pass, MO_16);
 +
 +        switch (fpopcode) {
 +        default:
 +            fprintf(stderr, "%s: insn %#04x, fpop %#2x @ %#" PRIx64 "\n",
 +                    __func__, insn, fpopcode, s->pc);
 +            g_assert_not_reached();
 +        }
-+
++        sysbus_mmio_map(SYS_BUS_DEVICE(&s->can[i]), 0, can_addr[i]);
-+        write_vec_element_i32(s, tcg_res, rd, pass, MO_16);
++        sysbus_connect_irq(SYS_BUS_DEVICE(&s->can[i]), 0,
-+        tcg_temp_free_i32(tcg_res);
++                           gic_spi[can_intr[i]]);
 +        tcg_temp_free_i32(tcg_op1);
 +        tcg_temp_free_i32(tcg_op2);
 +    }
 +
-+    tcg_temp_free_ptr(fpst);
+     object_property_set_int(OBJECT(&s->sata), "num-ports", SATA_NUM_PORTS,
-+
+                             &error_abort);
-+    clear_vec_high(s, is_q, rd);
+     if (!sysbus_realize(SYS_BUS_DEVICE(&s->sata), errp)) {
-+}
+@@ -XXX,XX +XXX,XX @@ static Property xlnx_zynqmp_props[] = {
-+
+     DEFINE_PROP_BOOL("has_rpu", XlnxZynqMPState, has_rpu, false),
- static void handle_2misc_widening(DisasContext *s, int opcode, bool is_q,
+     DEFINE_PROP_LINK("ddr-ram", XlnxZynqMPState, ddr_ram, TYPE_MEMORY_REGION,
-                                   int size, int rn, int rd)
+                      MemoryRegion *),
- {
++    DEFINE_PROP_LINK("canbus0", XlnxZynqMPState, canbus[0], TYPE_CAN_BUS,
-@@ -XXX,XX +XXX,XX @@ static const AArch64DecodeTable data_proc_simd[] = {
++                     CanBusState *),
-     { 0xce000000, 0xff808000, disas_crypto_four_reg },
++    DEFINE_PROP_LINK("canbus1", XlnxZynqMPState, canbus[1], TYPE_CAN_BUS,
-     { 0xce800000, 0xffe00000, disas_crypto_xar },
++                     CanBusState *),
-     { 0xce408000, 0xffe0c000, disas_crypto_three_reg_imm2 },
+     DEFINE_PROP_END_OF_LIST()
 +    { 0x0e400400, 0x9f60c400, disas_simd_three_reg_same_fp16 },
      { 0x00000000, 0x00000000, NULL }
  };
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 37/42] arm/translate-a64: add all FP16 ops in simd_scalar_pairwise
+[PULL 04/36] tests/qtest: Introduce tests for Xilinx ZynqMP CAN controller
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Vikram Garhwal <fnu.vikram@xilinx.com>
-I only needed to do a little light re-factoring to support the
+The QTests perform five tests on the Xilinx ZynqMP CAN controller:
-half-precision helpers.
+    Tests the CAN controller in loopback, sleep and snoop mode.
+    Tests filtering of incoming CAN messages.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Message-id: 20180227143852.11175-30-alex.bennee@linaro.org
+Reviewed-by: Francisco Iglesias <francisco.iglesias@xilinx.com>
 Signed-off-by: Vikram Garhwal <fnu.vikram@xilinx.com>
 Message-id: 1605728926-352690-4-git-send-email-fnu.vikram@xilinx.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 80 +++++++++++++++++++++++++++++++---------------
+ tests/qtest/xlnx-can-test.c | 360 ++++++++++++++++++++++++++++++++++++
-file changed, 54 insertions(+), 26 deletions(-)
+ tests/qtest/meson.build     |   1 +
+files changed, 361 insertions(+)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+ create mode 100644 tests/qtest/xlnx-can-test.c
 diff --git a/tests/qtest/xlnx-can-test.c b/tests/qtest/xlnx-can-test.c
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/tests/qtest/xlnx-can-test.c
@@ -XXX,XX +XXX,XX @@
 +/*
 + * QTests for the Xilinx ZynqMP CAN controller.
 + *
 + * Copyright (c) 2020 Xilinx Inc.
 + *
 + * Written-by: Vikram Garhwal<fnu.vikram@xilinx.com>
 + *
 + * Permission is hereby granted, free of charge, to any person obtaining a copy
 + * of this software and associated documentation files (the "Software"), to deal
 + * in the Software without restriction, including without limitation the rights
 + * to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 + * copies of the Software, and to permit persons to whom the Software is
 + * furnished to do so, subject to the following conditions:
 + *
 + * The above copyright notice and this permission notice shall be included in
 + * all copies or substantial portions of the Software.
 + *
 + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
 + * THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 + * LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 + * OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
 + * THE SOFTWARE.
 + */
 +
 +#include "qemu/osdep.h"
 +#include "libqos/libqtest.h"
 +
 +/* Base address. */
 +#define CAN0_BASE_ADDR          0xFF060000
 +#define CAN1_BASE_ADDR          0xFF070000
 +
 +/* Register addresses. */
 +#define R_SRR_OFFSET            0x00
 +#define R_MSR_OFFSET            0x04
 +#define R_SR_OFFSET             0x18
 +#define R_ISR_OFFSET            0x1C
 +#define R_ICR_OFFSET            0x24
 +#define R_TXID_OFFSET           0x30
 +#define R_TXDLC_OFFSET          0x34
 +#define R_TXDATA1_OFFSET        0x38
 +#define R_TXDATA2_OFFSET        0x3C
 +#define R_RXID_OFFSET           0x50
 +#define R_RXDLC_OFFSET          0x54
 +#define R_RXDATA1_OFFSET        0x58
 +#define R_RXDATA2_OFFSET        0x5C
 +#define R_AFR                   0x60
 +#define R_AFMR1                 0x64
 +#define R_AFIR1                 0x68
 +#define R_AFMR2                 0x6C
 +#define R_AFIR2                 0x70
 +#define R_AFMR3                 0x74
 +#define R_AFIR3                 0x78
 +#define R_AFMR4                 0x7C
 +#define R_AFIR4                 0x80
 +
 +/* CAN modes. */
 +#define CONFIG_MODE             0x00
 +#define NORMAL_MODE             0x00
 +#define LOOPBACK_MODE           0x02
 +#define SNOOP_MODE              0x04
 +#define SLEEP_MODE              0x01
 +#define ENABLE_CAN              (1 << 1)
 +#define STATUS_NORMAL_MODE      (1 << 3)
 +#define STATUS_LOOPBACK_MODE    (1 << 1)
 +#define STATUS_SNOOP_MODE       (1 << 12)
 +#define STATUS_SLEEP_MODE       (1 << 2)
 +#define ISR_TXOK                (1 << 1)
 +#define ISR_RXOK                (1 << 4)
 +
 +static void match_rx_tx_data(const uint32_t *buf_tx, const uint32_t *buf_rx,
 +                             uint8_t can_timestamp)
 +{
 +    uint16_t size = 0;
 +    uint8_t len = 4;
 +
 +    while (size < len) {
 +        if (R_RXID_OFFSET + 4 * size == R_RXDLC_OFFSET)  {
 +            g_assert_cmpint(buf_rx[size], ==, buf_tx[size] + can_timestamp);
 +        } else {
 +            g_assert_cmpint(buf_rx[size], ==, buf_tx[size]);
 +        }
 +
 +        size++;
 +    }
 +}
 +
 +static void read_data(QTestState *qts, uint64_t can_base_addr, uint32_t *buf_rx)
 +{
 +    uint32_t int_status;
 +
 +    /* Read the interrupt on CAN rx. */
 +    int_status = qtest_readl(qts, can_base_addr + R_ISR_OFFSET) & ISR_RXOK;
 +
 +    g_assert_cmpint(int_status, ==, ISR_RXOK);
 +
 +    /* Read the RX register data for CAN. */
 +    buf_rx[0] = qtest_readl(qts, can_base_addr + R_RXID_OFFSET);
 +    buf_rx[1] = qtest_readl(qts, can_base_addr + R_RXDLC_OFFSET);
 +    buf_rx[2] = qtest_readl(qts, can_base_addr + R_RXDATA1_OFFSET);
 +    buf_rx[3] = qtest_readl(qts, can_base_addr + R_RXDATA2_OFFSET);
 +
 +    /* Clear the RX interrupt. */
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_ICR_OFFSET, ISR_RXOK);
 +}
 +
 +static void send_data(QTestState *qts, uint64_t can_base_addr,
 +                      const uint32_t *buf_tx)
 +{
 +    uint32_t int_status;
 +
 +    /* Write the TX register data for CAN. */
 +    qtest_writel(qts, can_base_addr + R_TXID_OFFSET, buf_tx[0]);
 +    qtest_writel(qts, can_base_addr + R_TXDLC_OFFSET, buf_tx[1]);
 +    qtest_writel(qts, can_base_addr + R_TXDATA1_OFFSET, buf_tx[2]);
 +    qtest_writel(qts, can_base_addr + R_TXDATA2_OFFSET, buf_tx[3]);
 +
 +    /* Read the interrupt on CAN for tx. */
 +    int_status = qtest_readl(qts, can_base_addr + R_ISR_OFFSET) & ISR_TXOK;
 +
 +    g_assert_cmpint(int_status, ==, ISR_TXOK);
 +
 +    /* Clear the interrupt for tx. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_ICR_OFFSET, ISR_TXOK);
 +}
 +
 +/*
 + * This test will be transferring data from CAN0 and CAN1 through canbus. CAN0
 + * initiate the data transfer to can-bus, CAN1 receives the data. Test compares
 + * the data sent from CAN0 with received on CAN1.
 + */
 +static void test_can_bus(void)
 +{
 +    const uint32_t buf_tx[4] = { 0xFF, 0x80000000, 0x12345678, 0x87654321 };
 +    uint32_t buf_rx[4] = { 0x00, 0x00, 0x00, 0x00 };
 +    uint32_t status = 0;
 +    uint8_t can_timestamp = 1;
 +
 +    QTestState *qts = qtest_init("-machine xlnx-zcu102"
 +                " -object can-bus,id=canbus0"
 +                " -machine xlnx-zcu102.canbus0=canbus0"
 +                " -machine xlnx-zcu102.canbus1=canbus0"
 +                );
 +
 +    /* Configure the CAN0 and CAN1. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +
 +    /* Check here if CAN0 and CAN1 are in normal mode. */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    status = qtest_readl(qts, CAN1_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    send_data(qts, CAN0_BASE_ADDR, buf_tx);
 +
 +    read_data(qts, CAN1_BASE_ADDR, buf_rx);
 +    match_rx_tx_data(buf_tx, buf_rx, can_timestamp);
 +
 +    qtest_quit(qts);
 +}
 +
 +/*
 + * This test is performing loopback mode on CAN0 and CAN1. Data sent from TX of
 + * each CAN0 and CAN1 are compared with RX register data for respective CAN.
 + */
 +static void test_can_loopback(void)
 +{
 +    uint32_t buf_tx[4] = { 0xFF, 0x80000000, 0x12345678, 0x87654321 };
 +    uint32_t buf_rx[4] = { 0x00, 0x00, 0x00, 0x00 };
 +    uint32_t status = 0;
 +
 +    QTestState *qts = qtest_init("-machine xlnx-zcu102"
 +                " -object can-bus,id=canbus0"
 +                " -machine xlnx-zcu102.canbus0=canbus0"
 +                " -machine xlnx-zcu102.canbus1=canbus0"
 +                );
 +
 +    /* Configure the CAN0 in loopback mode. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, CONFIG_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_MSR_OFFSET, LOOPBACK_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +
 +    /* Check here if CAN0 is set in loopback mode. */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +
 +    g_assert_cmpint(status, ==, STATUS_LOOPBACK_MODE);
 +
 +    send_data(qts, CAN0_BASE_ADDR, buf_tx);
 +    read_data(qts, CAN0_BASE_ADDR, buf_rx);
 +    match_rx_tx_data(buf_tx, buf_rx, 0);
 +
 +    /* Configure the CAN1 in loopback mode. */
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, CONFIG_MODE);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_MSR_OFFSET, LOOPBACK_MODE);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +
 +    /* Check here if CAN1 is set in loopback mode. */
 +    status = qtest_readl(qts, CAN1_BASE_ADDR + R_SR_OFFSET);
 +
 +    g_assert_cmpint(status, ==, STATUS_LOOPBACK_MODE);
 +
 +    send_data(qts, CAN1_BASE_ADDR, buf_tx);
 +    read_data(qts, CAN1_BASE_ADDR, buf_rx);
 +    match_rx_tx_data(buf_tx, buf_rx, 0);
 +
 +    qtest_quit(qts);
 +}
 +
 +/*
 + * Enable filters for CAN1. This will filter incoming messages with ID. In this
 + * test message will pass through filter 2.
 + */
 +static void test_can_filter(void)
 +{
 +    uint32_t buf_tx[4] = { 0x14, 0x80000000, 0x12345678, 0x87654321 };
 +    uint32_t buf_rx[4] = { 0x00, 0x00, 0x00, 0x00 };
 +    uint32_t status = 0;
 +    uint8_t can_timestamp = 1;
 +
 +    QTestState *qts = qtest_init("-machine xlnx-zcu102"
 +                " -object can-bus,id=canbus0"
 +                " -machine xlnx-zcu102.canbus0=canbus0"
 +                " -machine xlnx-zcu102.canbus1=canbus0"
 +                );
 +
 +    /* Configure the CAN0 and CAN1. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +
 +    /* Check here if CAN0 and CAN1 are in normal mode. */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    status = qtest_readl(qts, CAN1_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    /* Set filter for CAN1 for incoming messages. */
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFR, 0x0);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFMR1, 0xF7);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFIR1, 0x121F);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFMR2, 0x5431);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFIR2, 0x14);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFMR3, 0x1234);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFIR3, 0x5431);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFMR4, 0xFFF);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFIR4, 0x1234);
 +
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_AFR, 0xF);
 +
 +    send_data(qts, CAN0_BASE_ADDR, buf_tx);
 +
 +    read_data(qts, CAN1_BASE_ADDR, buf_rx);
 +    match_rx_tx_data(buf_tx, buf_rx, can_timestamp);
 +
 +    qtest_quit(qts);
 +}
 +
 +/* Testing sleep mode on CAN0 while CAN1 is in normal mode. */
 +static void test_can_sleepmode(void)
 +{
 +    uint32_t buf_tx[4] = { 0x14, 0x80000000, 0x12345678, 0x87654321 };
 +    uint32_t buf_rx[4] = { 0x00, 0x00, 0x00, 0x00 };
 +    uint32_t status = 0;
 +    uint8_t can_timestamp = 1;
 +
 +    QTestState *qts = qtest_init("-machine xlnx-zcu102"
 +                " -object can-bus,id=canbus0"
 +                " -machine xlnx-zcu102.canbus0=canbus0"
 +                " -machine xlnx-zcu102.canbus1=canbus0"
 +                );
 +
 +    /* Configure the CAN0. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, CONFIG_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_MSR_OFFSET, SLEEP_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +
 +    /* Check here if CAN0 is in SLEEP mode and CAN1 in normal mode. */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_SLEEP_MODE);
 +
 +    status = qtest_readl(qts, CAN1_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    send_data(qts, CAN1_BASE_ADDR, buf_tx);
 +
 +    /*
 +     * Once CAN1 sends data on can-bus. CAN0 should exit sleep mode.
 +     * Check the CAN0 status now. It should exit the sleep mode and receive the
 +     * incoming data.
 +     */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    read_data(qts, CAN0_BASE_ADDR, buf_rx);
 +
 +    match_rx_tx_data(buf_tx, buf_rx, can_timestamp);
 +
 +    qtest_quit(qts);
 +}
 +
 +/* Testing Snoop mode on CAN0 while CAN1 is in normal mode. */
 +static void test_can_snoopmode(void)
 +{
 +    uint32_t buf_tx[4] = { 0x14, 0x80000000, 0x12345678, 0x87654321 };
 +    uint32_t buf_rx[4] = { 0x00, 0x00, 0x00, 0x00 };
 +    uint32_t status = 0;
 +    uint8_t can_timestamp = 1;
 +
 +    QTestState *qts = qtest_init("-machine xlnx-zcu102"
 +                " -object can-bus,id=canbus0"
 +                " -machine xlnx-zcu102.canbus0=canbus0"
 +                " -machine xlnx-zcu102.canbus1=canbus0"
 +                );
 +
 +    /* Configure the CAN0. */
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, CONFIG_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_MSR_OFFSET, SNOOP_MODE);
 +    qtest_writel(qts, CAN0_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_SRR_OFFSET, ENABLE_CAN);
 +    qtest_writel(qts, CAN1_BASE_ADDR + R_MSR_OFFSET, NORMAL_MODE);
 +
 +    /* Check here if CAN0 is in SNOOP mode and CAN1 in normal mode. */
 +    status = qtest_readl(qts, CAN0_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_SNOOP_MODE);
 +
 +    status = qtest_readl(qts, CAN1_BASE_ADDR + R_SR_OFFSET);
 +    g_assert_cmpint(status, ==, STATUS_NORMAL_MODE);
 +
 +    send_data(qts, CAN1_BASE_ADDR, buf_tx);
 +
 +    read_data(qts, CAN0_BASE_ADDR, buf_rx);
 +
 +    match_rx_tx_data(buf_tx, buf_rx, can_timestamp);
 +
 +    qtest_quit(qts);
 +}
 +
 +int main(int argc, char **argv)
 +{
 +    g_test_init(&argc, &argv, NULL);
 +
 +    qtest_add_func("/net/can/can_bus", test_can_bus);
 +    qtest_add_func("/net/can/can_loopback", test_can_loopback);
 +    qtest_add_func("/net/can/can_filter", test_can_filter);
 +    qtest_add_func("/net/can/can_test_snoopmode", test_can_snoopmode);
 +    qtest_add_func("/net/can/can_test_sleepmode", test_can_sleepmode);
 +
 +    return g_test_run();
 +}
 diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/tests/qtest/meson.build
-+++ b/target/arm/translate-a64.c
++++ b/tests/qtest/meson.build
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_pairwise(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ qtests_aarch64 = \
-     case 0xf: /* FMAXP */
+   ['arm-cpu-features',
-     case 0x2c: /* FMINNMP */
+    'numa-test',
-     case 0x2f: /* FMINP */
+    'boot-serial-test',
--        /* FP op, size[0] is 32 or 64 bit */
++   'xlnx-can-test',
-+        /* FP op, size[0] is 32 or 64 bit*/
+    'migration-test']
-         if (!u) {
--            unallocated_encoding(s);
+ qtests_s390x = \
 -            return;
 +            if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
 +                unallocated_encoding(s);
 +                return;
 +            } else {
 +                size = MO_16;
 +            }
 +        } else {
 +            size = extract32(size, 0, 1) ? MO_64 : MO_32;
          }
 +
          if (!fp_access_check(s)) {
              return;
          }
 -        size = extract32(size, 0, 1) ? 3 : 2;
 -        fpst = get_fpstatus_ptr(false);
 +        fpst = get_fpstatus_ptr(size == MO_16);
          break;
      default:
          unallocated_encoding(s);
          return;
      }
 -    if (size == 3) {
 +    if (size == MO_64) {
          TCGv_i64 tcg_op1 = tcg_temp_new_i64();
          TCGv_i64 tcg_op2 = tcg_temp_new_i64();
          TCGv_i64 tcg_res = tcg_temp_new_i64();
@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_pairwise(DisasContext *s, uint32_t insn)
          TCGv_i32 tcg_op2 = tcg_temp_new_i32();
          TCGv_i32 tcg_res = tcg_temp_new_i32();
 -        read_vec_element_i32(s, tcg_op1, rn, 0, MO_32);
 -        read_vec_element_i32(s, tcg_op2, rn, 1, MO_32);
 +        read_vec_element_i32(s, tcg_op1, rn, 0, size);
 +        read_vec_element_i32(s, tcg_op2, rn, 1, size);
 -        switch (opcode) {
 -        case 0xc: /* FMAXNMP */
 -            gen_helper_vfp_maxnums(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0xd: /* FADDP */
 -            gen_helper_vfp_adds(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0xf: /* FMAXP */
 -            gen_helper_vfp_maxs(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x2c: /* FMINNMP */
 -            gen_helper_vfp_minnums(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x2f: /* FMINP */
 -            gen_helper_vfp_mins(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        default:
 -            g_assert_not_reached();
 +        if (size == MO_16) {
 +            switch (opcode) {
 +            case 0xc: /* FMAXNMP */
 +                gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xd: /* FADDP */
 +                gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xf: /* FMAXP */
 +                gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x2c: /* FMINNMP */
 +                gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x2f: /* FMINP */
 +                gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
 +        } else {
 +            switch (opcode) {
 +            case 0xc: /* FMAXNMP */
 +                gen_helper_vfp_maxnums(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xd: /* FADDP */
 +                gen_helper_vfp_adds(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xf: /* FMAXP */
 +                gen_helper_vfp_maxs(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x2c: /* FMINNMP */
 +                gen_helper_vfp_minnums(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x2f: /* FMINP */
 +                gen_helper_vfp_mins(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
          }
          write_fp_sreg(s, rd, tcg_res);
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 42/42] MAINTAINERS: Update my email address
+[PULL 05/36] MAINTAINERS: Add maintainer entry for Xilinx ZynqMP CAN controller
-From: Alistair Francis <alistair.francis@xilinx.com>
+From: Vikram Garhwal <fnu.vikram@xilinx.com>
-I am leaving Xilinx, so to avoid having an email address that bounces
+Reviewed-by: Francisco Iglesias <francisco.iglesias@xilinx.com>
-update my maintainer address to point to my personal email address.
+Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
+Signed-off-by: Vikram Garhwal <fnu.vikram@xilinx.com>
-Signed-off-by: Alistair Francis <alistair.francis@xilinx.com>
+Message-id: 1605728926-352690-5-git-send-email-fnu.vikram@xilinx.com
 Signed-off-by: Alistair Francis <alistair@alistair23.me>
 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Message-id: 7bb690382e3370aa1c1e047a84e36603c787ec0e.1519749987.git.alistair.francis@xilinx.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- MAINTAINERS | 12 ++++++------
+ MAINTAINERS | 8 ++++++++
-file changed, 6 insertions(+), 6 deletions(-)
+file changed, 8 insertions(+)
 diff --git a/MAINTAINERS b/MAINTAINERS
 index XXXXXXX..XXXXXXX 100644
 --- a/MAINTAINERS
 +++ b/MAINTAINERS
-@@ -XXX,XX +XXX,XX @@ F: hw/misc/arm_sysctl.c
+@@ -XXX,XX +XXX,XX @@ F: hw/net/opencores_eth.c
- Xilinx Zynq
+ Devices
- M: Edgar E. Iglesias <edgar.iglesias@gmail.com>
+ -------
--M: Alistair Francis <alistair.francis@xilinx.com>
++Xilinx CAN
-+M: Alistair Francis <alistair@alistair23.me>
++M: Vikram Garhwal <fnu.vikram@xilinx.com>
- L: qemu-arm@nongnu.org
++M: Francisco Iglesias <francisco.iglesias@xilinx.com>
 +S: Maintained
 +F: hw/net/can/xlnx-*
 +F: include/hw/net/xlnx-*
 +F: tests/qtest/xlnx-can-test*
 +
  EDU
  M: Jiri Slaby <jslaby@suse.cz>
  S: Maintained
- F: hw/*/xilinx_*
-@@ -XXX,XX +XXX,XX @@ F: include/hw/misc/zynq*
- X: hw/ssi/xilinx_*
- Xilinx ZynqMP
--M: Alistair Francis <alistair.francis@xilinx.com>
-+M: Alistair Francis <alistair@alistair23.me>
- M: Edgar E. Iglesias <edgar.iglesias@gmail.com>
- L: qemu-arm@nongnu.org
- S: Maintained
-@@ -XXX,XX +XXX,XX @@ T: git git://github.com/bonzini/qemu.git scsi-next
- SSI
- M: Peter Crosthwaite <crosthwaite.peter@gmail.com>
--M: Alistair Francis <alistair.francis@xilinx.com>
-+M: Alistair Francis <alistair@alistair23.me>
- S: Maintained
- F: hw/ssi/*
- F: hw/block/m25p80.c
-@@ -XXX,XX +XXX,XX @@ X: hw/ssi/xilinx_*
- F: tests/m25p80-test.c
- Xilinx SPI
--M: Alistair Francis <alistair.francis@xilinx.com>
-+M: Alistair Francis <alistair@alistair23.me>
- M: Peter Crosthwaite <crosthwaite.peter@gmail.com>
- S: Maintained
- F: hw/ssi/xilinx_*
-@@ -XXX,XX +XXX,XX @@ S: Maintained
- F: hw/net/eepro100.c
- Generic Loader
--M: Alistair Francis <alistair.francis@xilinx.com>
-+M: Alistair Francis <alistair@alistair23.me>
- S: Maintained
- F: hw/core/generic-loader.c
- F: include/hw/core/generic-loader.h
-@@ -XXX,XX +XXX,XX @@ F: tests/qmp-test.c
- T: git git://repo.or.cz/qemu/armbru.git qapi-next
- Register API
--M: Alistair Francis <alistair.francis@xilinx.com>
-+M: Alistair Francis <alistair@alistair23.me>
- S: Maintained
- F: hw/core/register.c
- F: include/hw/register.h
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 26/42] arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16
+[PULL 06/36] sbsa-ref: allow to use Cortex-A53/57/72 cpus
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Marcin Juszkiewicz <marcin.juszkiewicz@linaro.org>
-This covers all the floating point convert operations.
+Trusted Firmware now supports A72 on sbsa-ref by default [1] so enable
 it for QEMU as well. A53 was already enabled there.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+. https://review.trustedfirmware.org/c/TF-A/trusted-firmware-a/+/7117
 Signed-off-by: Marcin Juszkiewicz <marcin.juszkiewicz@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-19-alex.bennee@linaro.org
+Message-id: 20201120141705.246690-1-marcin.juszkiewicz@linaro.org
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper-a64.h    |  2 ++
+ hw/arm/sbsa-ref.c | 23 ++++++++++++++++++++---
- target/arm/helper-a64.c    | 32 +++++++++++++++++
+file changed, 20 insertions(+), 3 deletions(-)
  target/arm/translate-a64.c | 85 +++++++++++++++++++++++++++++++++++++++++++++-
 files changed, 118 insertions(+), 1 deletion(-)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
+diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/hw/arm/sbsa-ref.c
-+++ b/target/arm/helper-a64.h
++++ b/hw/arm/sbsa-ref.c
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(advsimd_mulx2h, i32, i32, i32, ptr)
+@@ -XXX,XX +XXX,XX @@ static const int sbsa_ref_irqmap[] = {
- DEF_HELPER_4(advsimd_muladd2h, i32, i32, i32, i32, ptr)
+     [SBSA_GWDT] = 16,
- DEF_HELPER_2(advsimd_rinth_exact, f16, f16, ptr)
+ };
- DEF_HELPER_2(advsimd_rinth, f16, f16, ptr)
-+DEF_HELPER_2(advsimd_f16tosinth, i32, f16, ptr)
++static const char * const valid_cpus[] = {
-+DEF_HELPER_2(advsimd_f16touinth, i32, f16, ptr)
++    ARM_CPU_TYPE_NAME("cortex-a53"),
-diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
++    ARM_CPU_TYPE_NAME("cortex-a57"),
-index XXXXXXX..XXXXXXX 100644
++    ARM_CPU_TYPE_NAME("cortex-a72"),
---- a/target/arm/helper-a64.c
++};
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ float16 HELPER(advsimd_rinth)(float16 x, void *fp_status)
      return ret;
  }
 +
-+/*
++static bool cpu_type_valid(const char *cpu)
-+ * Half-precision floating point conversion functions
++{
-+ *
++    int i;
 + * There are a multitude of conversion functions with various
 + * different rounding modes. This is dealt with by the calling code
 + * setting the mode appropriately before calling the helper.
 + */
 +
-+uint32_t HELPER(advsimd_f16tosinth)(float16 a, void *fpstp)
++    for (i = 0; i < ARRAY_SIZE(valid_cpus); i++) {
-+{
++        if (strcmp(cpu, valid_cpus[i]) == 0) {
-+    float_status *fpst = fpstp;
++            return true;
-+
++        }
 +    /* Invalid if we are passed a NaN */
 +    if (float16_is_any_nan(a)) {
 +        float_raise(float_flag_invalid, fpst);
 +        return 0;
 +    }
-+    return float16_to_int16(a, fpst);
++    return false;
 +}
 +
-+uint32_t HELPER(advsimd_f16touinth)(float16 a, void *fpstp)
+ static uint64_t sbsa_ref_cpu_mp_affinity(SBSAMachineState *sms, int idx)
-+{
+ {
-+    float_status *fpst = fpstp;
+     uint8_t clustersz = ARM_DEFAULT_CPUS_PER_CLUSTER;
-+
+@@ -XXX,XX +XXX,XX @@ static void sbsa_ref_init(MachineState *machine)
-+    /* Invalid if we are passed a NaN */
+     const CPUArchIdList *possible_cpus;
-+    if (float16_is_any_nan(a)) {
+     int n, sbsa_max_cpus;
-+        float_raise(float_flag_invalid, fpst);
-+        return 0;
+-    if (strcmp(machine->cpu_type, ARM_CPU_TYPE_NAME("cortex-a57"))) {
-+    }
+-        error_report("sbsa-ref: CPU type other than the built-in "
-+    return float16_to_uint16(a, fpst);
+-                     "cortex-a57 not supported");
-+}
++    if (!cpu_type_valid(machine->cpu_type)) {
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
++        error_report("mach-virt: CPU type %s not supported", machine->cpu_type);
-index XXXXXXX..XXXXXXX 100644
+         exit(1);
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
          only_in_vector = true;
          /* current rounding mode */
          break;
 +    case 0x1a: /* FCVTNS */
 +        need_rmode = true;
 +        rmode = FPROUNDING_TIEEVEN;
 +        break;
 +    case 0x1b: /* FCVTMS */
 +        need_rmode = true;
 +        rmode = FPROUNDING_NEGINF;
 +        break;
 +    case 0x1c: /* FCVTAS */
 +        need_rmode = true;
 +        rmode = FPROUNDING_TIEAWAY;
 +        break;
 +    case 0x3a: /* FCVTPS */
 +        need_rmode = true;
 +        rmode = FPROUNDING_POSINF;
 +        break;
 +    case 0x3b: /* FCVTZS */
 +        need_rmode = true;
 +        rmode = FPROUNDING_ZERO;
 +        break;
 +    case 0x5a: /* FCVTNU */
 +        need_rmode = true;
 +        rmode = FPROUNDING_TIEEVEN;
 +        break;
 +    case 0x5b: /* FCVTMU */
 +        need_rmode = true;
 +        rmode = FPROUNDING_NEGINF;
 +        break;
 +    case 0x5c: /* FCVTAU */
 +        need_rmode = true;
 +        rmode = FPROUNDING_TIEAWAY;
 +        break;
 +    case 0x7a: /* FCVTPU */
 +        need_rmode = true;
 +        rmode = FPROUNDING_POSINF;
 +        break;
 +    case 0x7b: /* FCVTZU */
 +        need_rmode = true;
 +        rmode = FPROUNDING_ZERO;
 +        break;
      default:
          fprintf(stderr, "%s: insn %#04x fpop %#2x\n", __func__, insn, fpop);
          g_assert_not_reached();
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
      }
-     if (is_scalar) {
--        /* no operations yet */
-+        TCGv_i32 tcg_op = tcg_temp_new_i32();
-+        TCGv_i32 tcg_res = tcg_temp_new_i32();
-+
-+        read_vec_element_i32(s, tcg_op, rn, 0, MO_16);
-+
-+        switch (fpop) {
-+        case 0x1a: /* FCVTNS */
-+        case 0x1b: /* FCVTMS */
-+        case 0x1c: /* FCVTAS */
-+        case 0x3a: /* FCVTPS */
-+        case 0x3b: /* FCVTZS */
-+            gen_helper_advsimd_f16tosinth(tcg_res, tcg_op, tcg_fpstatus);
-+            break;
-+        case 0x5a: /* FCVTNU */
-+        case 0x5b: /* FCVTMU */
-+        case 0x5c: /* FCVTAU */
-+        case 0x7a: /* FCVTPU */
-+        case 0x7b: /* FCVTZU */
-+            gen_helper_advsimd_f16touinth(tcg_res, tcg_op, tcg_fpstatus);
-+            break;
-+        default:
-+            g_assert_not_reached();
-+        }
-+
-+        /* limit any sign extension going on */
-+        tcg_gen_andi_i32(tcg_res, tcg_res, 0xffff);
-+        write_fp_sreg(s, rd, tcg_res);
-+
-+        tcg_temp_free_i32(tcg_res);
-+        tcg_temp_free_i32(tcg_op);
-     } else {
-         for (pass = 0; pass < (is_q ? 8 : 4); pass++) {
-             TCGv_i32 tcg_op = tcg_temp_new_i32();
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
-             read_vec_element_i32(s, tcg_op, rn, pass, MO_16);
-             switch (fpop) {
-+            case 0x1a: /* FCVTNS */
-+            case 0x1b: /* FCVTMS */
-+            case 0x1c: /* FCVTAS */
-+            case 0x3a: /* FCVTPS */
-+            case 0x3b: /* FCVTZS */
-+                gen_helper_advsimd_f16tosinth(tcg_res, tcg_op, tcg_fpstatus);
-+                break;
-+            case 0x5a: /* FCVTNU */
-+            case 0x5b: /* FCVTMU */
-+            case 0x5c: /* FCVTAU */
-+            case 0x7a: /* FCVTPU */
-+            case 0x7b: /* FCVTZU */
-+                gen_helper_advsimd_f16touinth(tcg_res, tcg_op, tcg_fpstatus);
-+                break;
-             case 0x18: /* FRINTN */
-             case 0x19: /* FRINTM */
-             case 0x38: /* FRINTP */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 30/42] arm/helper.c: re-factor recpe and add recepe_f16
+[PULL 07/36] tests/qtest/npcm7xx_rng-test: dump random data on failure
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Havard Skinnemoen <hskinnemoen@google.com>
-It looks like the ARM ARM has simplified the pseudo code for the
+Dump the collected random data after a randomness test failure.
 calculation which is done on a fixed point 9 bit integer maths. So
 while adding f16 we can also clean this up to be a little less heavy
 on the floating point and just return the fractional part and leave
 the calle's to do the final packing of the result.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Note that this relies on the test having called
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+g_test_set_nonfatal_assertions() so we don't abort immediately on the
-Message-id: 20180227143852.11175-23-alex.bennee@linaro.org
+assertion failure.
 Signed-off-by: Havard Skinnemoen <hskinnemoen@google.com>
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 [PMM: minor commit message tweak]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper.h |   1 +
+ tests/qtest/npcm7xx_rng-test.c | 12 ++++++++++++
- target/arm/helper.c | 226 +++++++++++++++++++++++++++++-----------------------
+file changed, 12 insertions(+)
 files changed, 129 insertions(+), 98 deletions(-)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
+diff --git a/tests/qtest/npcm7xx_rng-test.c b/tests/qtest/npcm7xx_rng-test.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
+--- a/tests/qtest/npcm7xx_rng-test.c
-+++ b/target/arm/helper.h
++++ b/tests/qtest/npcm7xx_rng-test.c
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_4(vfp_muladds, f32, f32, f32, f32, ptr)
+@@ -XXX,XX +XXX,XX @@
- DEF_HELPER_3(recps_f32, f32, f32, f32, env)
+ #include "libqtest-single.h"
- DEF_HELPER_3(rsqrts_f32, f32, f32, f32, env)
+ #include "qemu/bitops.h"
-+DEF_HELPER_FLAGS_2(recpe_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
++#include "qemu-common.h"
- DEF_HELPER_FLAGS_2(recpe_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
- DEF_HELPER_FLAGS_2(recpe_f64, TCG_CALL_NO_RWG, f64, f64, ptr)
+ #define RNG_BASE_ADDR   0xf000b000
- DEF_HELPER_FLAGS_2(rsqrte_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
+@@ -XXX,XX +XXX,XX @@
-index XXXXXXX..XXXXXXX 100644
+ /* Number of bits to collect for randomness tests. */
---- a/target/arm/helper.c
+ #define TEST_INPUT_BITS  (128)
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ float32 HELPER(rsqrts_f32)(float32 a, float32 b, CPUARMState *env)
++static void dump_buf_if_failed(const uint8_t *buf, size_t size)
   * int->float conversions at run-time.  */
  #define float64_256 make_float64(0x4070000000000000LL)
  #define float64_512 make_float64(0x4080000000000000LL)
 +#define float16_maxnorm make_float16(0x7bff)
  #define float32_maxnorm make_float32(0x7f7fffff)
  #define float64_maxnorm make_float64(0x7fefffffffffffffLL)
  /* Reciprocal functions
   *
   * The algorithm that must be used to calculate the estimate
 - * is specified by the ARM ARM, see FPRecipEstimate()
 + * is specified by the ARM ARM, see FPRecipEstimate()/RecipEstimate
   */
 -static float64 recip_estimate(float64 a, float_status *real_fp_status)
 +/* See RecipEstimate()
 + *
 + * input is a 9 bit fixed point number
 + * input range 256 .. 511 for a number from 0.5 <= x < 1.0.
 + * result range 256 .. 511 for a number from 1.0 to 511/256.
 + */
 +
 +static int recip_estimate(int input)
  {
 -    /* These calculations mustn't set any fp exception flags,
 -     * so we use a local copy of the fp_status.
 -     */
 -    float_status dummy_status = *real_fp_status;
 -    float_status *s = &dummy_status;
 -    /* q = (int)(a * 512.0) */
 -    float64 q = float64_mul(float64_512, a, s);
 -    int64_t q_int = float64_to_int64_round_to_zero(q, s);
 -
 -    /* r = 1.0 / (((double)q + 0.5) / 512.0) */
 -    q = int64_to_float64(q_int, s);
 -    q = float64_add(q, float64_half, s);
 -    q = float64_div(q, float64_512, s);
 -    q = float64_div(float64_one, q, s);
 -
 -    /* s = (int)(256.0 * r + 0.5) */
 -    q = float64_mul(q, float64_256, s);
 -    q = float64_add(q, float64_half, s);
 -    q_int = float64_to_int64_round_to_zero(q, s);
 -
 -    /* return (double)s / 256.0 */
 -    return float64_div(int64_to_float64(q_int, s), float64_256, s);
 +    int a, b, r;
 +    assert(256 <= input && input < 512);
 +    a = (input * 2) + 1;
 +    b = (1 << 19) / a;
 +    r = (b + 1) >> 1;
 +    assert(256 <= r && r < 512);
 +    return r;
  }
 -/* Common wrapper to call recip_estimate */
 -static float64 call_recip_estimate(float64 num, int off, float_status *fpst)
 -{
 -    uint64_t val64 = float64_val(num);
 -    uint64_t frac = extract64(val64, 0, 52);
 -    int64_t exp = extract64(val64, 52, 11);
 -    uint64_t sbit;
 -    float64 scaled, estimate;
 +/*
 + * Common wrapper to call recip_estimate
 + *
 + * The parameters are exponent and 64 bit fraction (without implicit
 + * bit) where the binary point is nominally at bit 52. Returns a
 + * float64 which can then be rounded to the appropriate size by the
 + * callee.
 + */
 -    /* Generate the scaled number for the estimate function */
 -    if (exp == 0) {
 +static uint64_t call_recip_estimate(int *exp, int exp_off, uint64_t frac)
 +{
-+    uint32_t scaled, estimate;
++    if (g_test_failed()) {
-+    uint64_t result_frac;
++        qemu_hexdump(stderr, "", buf, size);
 +    int result_exp;
 +
 +    /* Handle sub-normals */
 +    if (*exp == 0) {
          if (extract64(frac, 51, 1) == 0) {
 -            exp = -1;
 -            frac = extract64(frac, 0, 50) << 2;
 +            *exp = -1;
 +            frac <<= 2;
          } else {
 -            frac = extract64(frac, 0, 51) << 1;
 +            frac <<= 1;
          }
      }
 -    /* scaled = '0' : '01111111110' : fraction<51:44> : Zeros(44); */
 -    scaled = make_float64((0x3feULL << 52)
 -                          | extract64(frac, 44, 8) << 44);
 +    /* scaled = UInt('1':fraction<51:44>) */
 +    scaled = deposit32(1 << 8, 0, 8, extract64(frac, 44, 8));
 +    estimate = recip_estimate(scaled);
 -    estimate = recip_estimate(scaled, fpst);
 -
 -    /* Build new result */
 -    val64 = float64_val(estimate);
 -    sbit = 0x8000000000000000ULL & val64;
 -    exp = off - exp;
 -    frac = extract64(val64, 0, 52);
 -
 -    if (exp == 0) {
 -        frac = 1ULL << 51 | extract64(frac, 1, 51);
 -    } else if (exp == -1) {
 -        frac = 1ULL << 50 | extract64(frac, 2, 50);
 -        exp = 0;
 +    result_exp = exp_off - *exp;
 +    result_frac = deposit64(0, 44, 8, estimate);
 +    if (result_exp == 0) {
 +        result_frac = deposit64(result_frac >> 1, 51, 1, 1);
 +    } else if (result_exp == -1) {
 +        result_frac = deposit64(result_frac >> 2, 50, 2, 1);
 +        result_exp = 0;
      }
 -    return make_float64(sbit | (exp << 52) | frac);
 +    *exp = result_exp;
 +
 +    return result_frac;
  }
  static bool round_to_inf(float_status *fpst, bool sign_bit)
@@ -XXX,XX +XXX,XX @@ static bool round_to_inf(float_status *fpst, bool sign_bit)
      g_assert_not_reached();
  }
 +float16 HELPER(recpe_f16)(float16 input, void *fpstp)
 +{
 +    float_status *fpst = fpstp;
 +    float16 f16 = float16_squash_input_denormal(input, fpst);
 +    uint32_t f16_val = float16_val(f16);
 +    uint32_t f16_sign = float16_is_neg(f16);
 +    int f16_exp = extract32(f16_val, 10, 5);
 +    uint32_t f16_frac = extract32(f16_val, 0, 10);
 +    uint64_t f64_frac;
 +
 +    if (float16_is_any_nan(f16)) {
 +        float16 nan = f16;
 +        if (float16_is_signaling_nan(f16, fpst)) {
 +            float_raise(float_flag_invalid, fpst);
 +            nan = float16_maybe_silence_nan(f16, fpst);
 +        }
 +        if (fpst->default_nan_mode) {
 +            nan =  float16_default_nan(fpst);
 +        }
 +        return nan;
 +    } else if (float16_is_infinity(f16)) {
 +        return float16_set_sign(float16_zero, float16_is_neg(f16));
 +    } else if (float16_is_zero(f16)) {
 +        float_raise(float_flag_divbyzero, fpst);
 +        return float16_set_sign(float16_infinity, float16_is_neg(f16));
 +    } else if (float16_abs(f16) < (1 << 8)) {
 +        /* Abs(value) < 2.0^-16 */
 +        float_raise(float_flag_overflow | float_flag_inexact, fpst);
 +        if (round_to_inf(fpst, f16_sign)) {
 +            return float16_set_sign(float16_infinity, f16_sign);
 +        } else {
 +            return float16_set_sign(float16_maxnorm, f16_sign);
 +        }
 +    } else if (f16_exp >= 29 && fpst->flush_to_zero) {
 +        float_raise(float_flag_underflow, fpst);
 +        return float16_set_sign(float16_zero, float16_is_neg(f16));
 +    }
-+
-+    f64_frac = call_recip_estimate(&f16_exp, 29,
-+                                   ((uint64_t) f16_frac) << (52 - 10));
-+
-+    /* result = sign : result_exp<4:0> : fraction<51:42> */
-+    f16_val = deposit32(0, 15, 1, f16_sign);
-+    f16_val = deposit32(f16_val, 10, 5, f16_exp);
-+    f16_val = deposit32(f16_val, 0, 10, extract64(f64_frac, 52 - 10, 10));
-+    return make_float16(f16_val);
 +}
 +
- float32 HELPER(recpe_f32)(float32 input, void *fpstp)
+ static void rng_writeb(unsigned int offset, uint8_t value)
  {
-     float_status *fpst = fpstp;
+     writeb(RNG_BASE_ADDR + offset, value);
-     float32 f32 = float32_squash_input_denormal(input, fpst);
+@@ -XXX,XX +XXX,XX @@ static void test_continuous_monobit(void)
      uint32_t f32_val = float32_val(f32);
 -    uint32_t f32_sbit = 0x80000000ULL & f32_val;
 -    int32_t f32_exp = extract32(f32_val, 23, 8);
 +    bool f32_sign = float32_is_neg(f32);
 +    int f32_exp = extract32(f32_val, 23, 8);
      uint32_t f32_frac = extract32(f32_val, 0, 23);
 -    float64 f64, r64;
 -    uint64_t r64_val;
 -    int64_t r64_exp;
 -    uint64_t r64_frac;
 +    uint64_t f64_frac;
      if (float32_is_any_nan(f32)) {
          float32 nan = f32;
@@ -XXX,XX +XXX,XX @@ float32 HELPER(recpe_f32)(float32 input, void *fpstp)
      } else if (float32_is_zero(f32)) {
          float_raise(float_flag_divbyzero, fpst);
          return float32_set_sign(float32_infinity, float32_is_neg(f32));
 -    } else if ((f32_val & ~(1ULL << 31)) < (1ULL << 21)) {
 +    } else if (float32_abs(f32) < (1ULL << 21)) {
          /* Abs(value) < 2.0^-128 */
          float_raise(float_flag_overflow | float_flag_inexact, fpst);
 -        if (round_to_inf(fpst, f32_sbit)) {
 -            return float32_set_sign(float32_infinity, float32_is_neg(f32));
 +        if (round_to_inf(fpst, f32_sign)) {
 +            return float32_set_sign(float32_infinity, f32_sign);
          } else {
 -            return float32_set_sign(float32_maxnorm, float32_is_neg(f32));
 +            return float32_set_sign(float32_maxnorm, f32_sign);
          }
      } else if (f32_exp >= 253 && fpst->flush_to_zero) {
          float_raise(float_flag_underflow, fpst);
          return float32_set_sign(float32_zero, float32_is_neg(f32));
      }
-+    f64_frac = call_recip_estimate(&f32_exp, 253,
+     g_assert_cmpfloat(calc_monobit_p(buf, sizeof(buf)), >, 0.01);
-+                                   ((uint64_t) f32_frac) << (52 - 23));
++    dump_buf_if_failed(buf, sizeof(buf));
 -    f64 = make_float64(((int64_t)(f32_exp) << 52) | (int64_t)(f32_frac) << 29);
 -    r64 = call_recip_estimate(f64, 253, fpst);
 -    r64_val = float64_val(r64);
 -    r64_exp = extract64(r64_val, 52, 11);
 -    r64_frac = extract64(r64_val, 0, 52);
 -
 -    /* result = sign : result_exp<7:0> : fraction<51:29>; */
 -    return make_float32(f32_sbit |
 -                        (r64_exp & 0xff) << 23 |
 -                        extract64(r64_frac, 29, 24));
 +    /* result = sign : result_exp<7:0> : fraction<51:29> */
 +    f32_val = deposit32(0, 31, 1, f32_sign);
 +    f32_val = deposit32(f32_val, 23, 8, f32_exp);
 +    f32_val = deposit32(f32_val, 0, 23, extract64(f64_frac, 52 - 23, 23));
 +    return make_float32(f32_val);
  }
- float64 HELPER(recpe_f64)(float64 input, void *fpstp)
+ /*
-@@ -XXX,XX +XXX,XX @@ float64 HELPER(recpe_f64)(float64 input, void *fpstp)
+@@ -XXX,XX +XXX,XX @@ static void test_continuous_runs(void)
      float_status *fpst = fpstp;
      float64 f64 = float64_squash_input_denormal(input, fpst);
      uint64_t f64_val = float64_val(f64);
 -    uint64_t f64_sbit = 0x8000000000000000ULL & f64_val;
 -    int64_t f64_exp = extract64(f64_val, 52, 11);
 -    float64 r64;
 -    uint64_t r64_val;
 -    int64_t r64_exp;
 -    uint64_t r64_frac;
 +    bool f64_sign = float64_is_neg(f64);
 +    int f64_exp = extract64(f64_val, 52, 11);
 +    uint64_t f64_frac = extract64(f64_val, 0, 52);
      /* Deal with any special cases */
      if (float64_is_any_nan(f64)) {
@@ -XXX,XX +XXX,XX @@ float64 HELPER(recpe_f64)(float64 input, void *fpstp)
      } else if ((f64_val & ~(1ULL << 63)) < (1ULL << 50)) {
          /* Abs(value) < 2.0^-1024 */
          float_raise(float_flag_overflow | float_flag_inexact, fpst);
 -        if (round_to_inf(fpst, f64_sbit)) {
 -            return float64_set_sign(float64_infinity, float64_is_neg(f64));
 +        if (round_to_inf(fpst, f64_sign)) {
 +            return float64_set_sign(float64_infinity, f64_sign);
          } else {
 -            return float64_set_sign(float64_maxnorm, float64_is_neg(f64));
 +            return float64_set_sign(float64_maxnorm, f64_sign);
          }
      } else if (f64_exp >= 2045 && fpst->flush_to_zero) {
          float_raise(float_flag_underflow, fpst);
          return float64_set_sign(float64_zero, float64_is_neg(f64));
      }
--    r64 = call_recip_estimate(f64, 2045, fpst);
+     g_assert_cmpfloat(calc_runs_p(buf.l, sizeof(buf) * BITS_PER_BYTE), >, 0.01);
--    r64_val = float64_val(r64);
++    dump_buf_if_failed(buf.c, sizeof(buf));
 -    r64_exp = extract64(r64_val, 52, 11);
 -    r64_frac = extract64(r64_val, 0, 52);
 +    f64_frac = call_recip_estimate(&f64_exp, 2045, f64_frac);
 -    /* result = sign : result_exp<10:0> : fraction<51:0> */
 -    return make_float64(f64_sbit |
 -                        ((r64_exp & 0x7ff) << 52) |
 -                        r64_frac);
 +    /* result = sign : result_exp<10:0> : fraction<51:0>; */
 +    f64_val = deposit64(0, 63, 1, f64_sign);
 +    f64_val = deposit64(f64_val, 52, 11, f64_exp);
 +    f64_val = deposit64(f64_val, 0, 52, f64_frac);
 +    return make_float64(f64_val);
  }
- /* The algorithm that must be used to calculate the estimate
+ /*
-@@ -XXX,XX +XXX,XX @@ float64 HELPER(rsqrte_f64)(float64 input, void *fpstp)
+@@ -XXX,XX +XXX,XX @@ static void test_first_byte_monobit(void)
  uint32_t HELPER(recpe_u32)(uint32_t a, void *fpstp)
  {
 -    float_status *s = fpstp;
 -    float64 f64;
 +    /* float_status *s = fpstp; */
 +    int input, estimate;
      if ((a & 0x80000000) == 0) {
          return 0xffffffff;
      }
--    f64 = make_float64((0x3feULL << 52)
+     g_assert_cmpfloat(calc_monobit_p(buf, sizeof(buf)), >, 0.01);
--                       | ((int64_t)(a & 0x7fffffff) << 21));
++    dump_buf_if_failed(buf, sizeof(buf));
 +    input = extract32(a, 23, 9);
 +    estimate = recip_estimate(input);
 -    f64 = recip_estimate(f64, s);
 -
 -    return 0x80000000 | ((float64_val(f64) >> 21) & 0x7fffffff);
 +    return deposit32(0, (32 - 9), 9, estimate);
  }
- uint32_t HELPER(rsqrte_u32)(uint32_t a, void *fpstp)
+ /*
@@ -XXX,XX +XXX,XX @@ static void test_first_byte_runs(void)
      }
      g_assert_cmpfloat(calc_runs_p(buf.l, sizeof(buf) * BITS_PER_BYTE), >, 0.01);
 +    dump_buf_if_failed(buf.c, sizeof(buf));
  }
  int main(int argc, char **argv)
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 14/42] arm/translate-a64: implement half-precision F(MIN|MAX)(V|NMV)
+[PULL 08/36] i.MX25: Fix bad printf format specifiers
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Alex Chen <alex.chen@huawei.com>
-This implements the half-precision variants of the across vector
+We should use printf format specifier "%u" instead of "%d" for
-reduction operations. This involves a re-factor of the reduction code
+argument of type "unsigned int".
 which more closely matches the ARM ARM order (and handles 8 element
 reductions).
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Reported-by: Euler Robot <euler.robot@huawei.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Signed-off-by: Alex Chen <alex.chen@huawei.com>
-Message-id: 20180227143852.11175-7-alex.bennee@linaro.org
+Message-id: 20201126111109.112238-2-alex.chen@huawei.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper-a64.h    |   4 ++
+ hw/misc/imx25_ccm.c | 12 ++++++------
- target/arm/helper-a64.c    |  18 ++++++
+file changed, 6 insertions(+), 6 deletions(-)
  target/arm/translate-a64.c | 140 ++++++++++++++++++++++++++++-----------------
 files changed, 109 insertions(+), 53 deletions(-)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
+diff --git a/hw/misc/imx25_ccm.c b/hw/misc/imx25_ccm.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/hw/misc/imx25_ccm.c
-+++ b/target/arm/helper-a64.h
++++ b/hw/misc/imx25_ccm.c
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(paired_cmpxchg64_le_parallel, TCG_CALL_NO_WG,
+@@ -XXX,XX +XXX,XX @@ static const char *imx25_ccm_reg_name(uint32_t reg)
- DEF_HELPER_FLAGS_4(paired_cmpxchg64_be, TCG_CALL_NO_WG, i64, env, i64, i64, i64)
+     case IMX25_CCM_LPIMR1_REG:
- DEF_HELPER_FLAGS_4(paired_cmpxchg64_be_parallel, TCG_CALL_NO_WG,
+         return "lpimr1";
-                    i64, env, i64, i64, i64)
+     default:
-+DEF_HELPER_FLAGS_3(advsimd_maxh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
+-        sprintf(unknown, "[%d ?]", reg);
-+DEF_HELPER_FLAGS_3(advsimd_minh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
++        sprintf(unknown, "[%u ?]", reg);
-+DEF_HELPER_FLAGS_3(advsimd_maxnumh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
+         return unknown;
 +DEF_HELPER_FLAGS_3(advsimd_minnumh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
 diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.c
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(paired_cmpxchg64_be_parallel)(CPUARMState *env, uint64_t addr,
  {
      return do_paired_cmpxchg64_be(env, addr, new_lo, new_hi, true, GETPC());
  }
 +
 +/*
 + * AdvSIMD half-precision
 + */
 +
 +#define ADVSIMD_HELPER(name, suffix) HELPER(glue(glue(advsimd_, name), suffix))
 +
 +#define ADVSIMD_HALFOP(name) \
 +float16 ADVSIMD_HELPER(name, h)(float16 a, float16 b, void *fpstp) \
 +{ \
 +    float_status *fpst = fpstp; \
 +    return float16_ ## name(a, b, fpst);    \
 +}
 +
 +ADVSIMD_HALFOP(min)
 +ADVSIMD_HALFOP(max)
 +ADVSIMD_HALFOP(minnum)
 +ADVSIMD_HALFOP(maxnum)
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void disas_simd_zip_trn(DisasContext *s, uint32_t insn)
      tcg_temp_free_i64(tcg_resh);
  }
 -static void do_minmaxop(DisasContext *s, TCGv_i32 tcg_elt1, TCGv_i32 tcg_elt2,
 -                        int opc, bool is_min, TCGv_ptr fpst)
 +/*
 + * do_reduction_op helper
 + *
 + * This mirrors the Reduce() pseudocode in the ARM ARM. It is
 + * important for correct NaN propagation that we do these
 + * operations in exactly the order specified by the pseudocode.
 + *
 + * This is a recursive function, TCG temps should be freed by the
 + * calling function once it is done with the values.
 + */
 +static TCGv_i32 do_reduction_op(DisasContext *s, int fpopcode, int rn,
 +                                int esize, int size, int vmap, TCGv_ptr fpst)
  {
 -    /* Helper function for disas_simd_across_lanes: do a single precision
 -     * min/max operation on the specified two inputs,
 -     * and return the result in tcg_elt1.
 -     */
 -    if (opc == 0xc) {
 -        if (is_min) {
 -            gen_helper_vfp_minnums(tcg_elt1, tcg_elt1, tcg_elt2, fpst);
 -        } else {
 -            gen_helper_vfp_maxnums(tcg_elt1, tcg_elt1, tcg_elt2, fpst);
 -        }
 +    if (esize == size) {
 +        int element;
 +        TCGMemOp msize = esize == 16 ? MO_16 : MO_32;
 +        TCGv_i32 tcg_elem;
 +
 +        /* We should have one register left here */
 +        assert(ctpop8(vmap) == 1);
 +        element = ctz32(vmap);
 +        assert(element < 8);
 +
 +        tcg_elem = tcg_temp_new_i32();
 +        read_vec_element_i32(s, tcg_elem, rn, element, msize);
 +        return tcg_elem;
      } else {
 -        assert(opc == 0xf);
 -        if (is_min) {
 -            gen_helper_vfp_mins(tcg_elt1, tcg_elt1, tcg_elt2, fpst);
 -        } else {
 -            gen_helper_vfp_maxs(tcg_elt1, tcg_elt1, tcg_elt2, fpst);
 +        int bits = size / 2;
 +        int shift = ctpop8(vmap) / 2;
 +        int vmap_lo = (vmap >> shift) & vmap;
 +        int vmap_hi = (vmap & ~vmap_lo);
 +        TCGv_i32 tcg_hi, tcg_lo, tcg_res;
 +
 +        tcg_hi = do_reduction_op(s, fpopcode, rn, esize, bits, vmap_hi, fpst);
 +        tcg_lo = do_reduction_op(s, fpopcode, rn, esize, bits, vmap_lo, fpst);
 +        tcg_res = tcg_temp_new_i32();
 +
 +        switch (fpopcode) {
 +        case 0x0c: /* fmaxnmv half-precision */
 +            gen_helper_advsimd_maxnumh(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x0f: /* fmaxv half-precision */
 +            gen_helper_advsimd_maxh(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x1c: /* fminnmv half-precision */
 +            gen_helper_advsimd_minnumh(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x1f: /* fminv half-precision */
 +            gen_helper_advsimd_minh(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x2c: /* fmaxnmv */
 +            gen_helper_vfp_maxnums(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x2f: /* fmaxv */
 +            gen_helper_vfp_maxs(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x3c: /* fminnmv */
 +            gen_helper_vfp_minnums(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        case 0x3f: /* fminv */
 +            gen_helper_vfp_mins(tcg_res, tcg_lo, tcg_hi, fpst);
 +            break;
 +        default:
 +            g_assert_not_reached();
          }
 +
 +        tcg_temp_free_i32(tcg_hi);
 +        tcg_temp_free_i32(tcg_lo);
 +        return tcg_res;
      }
  }
+@@ -XXX,XX +XXX,XX @@ static uint32_t imx25_ccm_get_mpll_clk(IMXCCMState *dev)
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_across_lanes(DisasContext *s, uint32_t insn)
+         freq = imx_ccm_calc_pll(s->reg[IMX25_CCM_MPCTL_REG], CKIH_FREQ);
      }
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx25_ccm_get_mcu_clk(IMXCCMState *dev)
      freq = freq / (1 + EXTRACT(s->reg[IMX25_CCM_CCTL_REG], ARM_CLK_DIV));
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx25_ccm_get_ahb_clk(IMXCCMState *dev)
      freq = imx25_ccm_get_mcu_clk(dev)
             / (1 + EXTRACT(s->reg[IMX25_CCM_CCTL_REG], AHB_CLK_DIV));
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx25_ccm_get_ipg_clk(IMXCCMState *dev)
      freq = imx25_ccm_get_ahb_clk(dev) / 2;
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx25_ccm_get_clock_frequency(IMXCCMState *dev, IMXClk clock)
          break;
-     case 0xc: /* FMAXNMV, FMINNMV */
-     case 0xf: /* FMAXV, FMINV */
--        if (!is_u || !is_q || extract32(size, 0, 1)) {
--            unallocated_encoding(s);
--            return;
--        }
--        /* Bit 1 of size field encodes min vs max, and actual size is always
--         * 32 bits: adjust the size variable so following code can rely on it
-+        /* Bit 1 of size field encodes min vs max and the actual size
-+         * depends on the encoding of the U bit. If not set (and FP16
-+         * enabled) then we do half-precision float instead of single
-+         * precision.
-          */
-         is_min = extract32(size, 1, 1);
-         is_fp = true;
--        size = 2;
-+        if (!is_u && arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
-+            size = 1;
-+        } else if (!is_u || !is_q || extract32(size, 0, 1)) {
-+            unallocated_encoding(s);
-+            return;
-+        } else {
-+            size = 2;
-+        }
-         break;
-     default:
-         unallocated_encoding(s);
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_across_lanes(DisasContext *s, uint32_t insn)
-         }
-     } else {
--        /* Floating point ops which work on 32 bit (single) intermediates.
-+        /* Floating point vector reduction ops which work across 32
-+         * bit (single) or 16 bit (half-precision) intermediates.
-          * Note that correct NaN propagation requires that we do these
-          * operations in exactly the order specified by the pseudocode.
-          */
--        TCGv_i32 tcg_elt1 = tcg_temp_new_i32();
--        TCGv_i32 tcg_elt2 = tcg_temp_new_i32();
--        TCGv_i32 tcg_elt3 = tcg_temp_new_i32();
--        TCGv_ptr fpst = get_fpstatus_ptr(false);
--
--        assert(esize == 32);
--        assert(elements == 4);
--
--        read_vec_element(s, tcg_elt, rn, 0, MO_32);
--        tcg_gen_extrl_i64_i32(tcg_elt1, tcg_elt);
--        read_vec_element(s, tcg_elt, rn, 1, MO_32);
--        tcg_gen_extrl_i64_i32(tcg_elt2, tcg_elt);
--
--        do_minmaxop(s, tcg_elt1, tcg_elt2, opcode, is_min, fpst);
--
--        read_vec_element(s, tcg_elt, rn, 2, MO_32);
--        tcg_gen_extrl_i64_i32(tcg_elt2, tcg_elt);
--        read_vec_element(s, tcg_elt, rn, 3, MO_32);
--        tcg_gen_extrl_i64_i32(tcg_elt3, tcg_elt);
--
--        do_minmaxop(s, tcg_elt2, tcg_elt3, opcode, is_min, fpst);
--
--        do_minmaxop(s, tcg_elt1, tcg_elt2, opcode, is_min, fpst);
--
--        tcg_gen_extu_i32_i64(tcg_res, tcg_elt1);
--        tcg_temp_free_i32(tcg_elt1);
--        tcg_temp_free_i32(tcg_elt2);
--        tcg_temp_free_i32(tcg_elt3);
-+        TCGv_ptr fpst = get_fpstatus_ptr(size == MO_16);
-+        int fpopcode = opcode | is_min << 4 | is_u << 5;
-+        int vmap = (1 << elements) - 1;
-+        TCGv_i32 tcg_res32 = do_reduction_op(s, fpopcode, rn, esize,
-+                                             (is_q ? 128 : 64), vmap, fpst);
-+        tcg_gen_extu_i32_i64(tcg_res, tcg_res32);
-+        tcg_temp_free_i32(tcg_res32);
-         tcg_temp_free_ptr(fpst);
      }
+-    DPRINTF("Clock = %d) = %d\n", clock, freq);
++    DPRINTF("Clock = %d) = %u\n", clock, freq);
+     return freq;
+ }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 29/42] arm/translate-a64: add FP16 FNEG/FABS to simd_two_reg_misc_fp16
+[PULL 09/36] i.MX31: Fix bad printf format specifiers
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Alex Chen <alex.chen@huawei.com>
-Neither of these operations alter the floating point status registers
+We should use printf format specifier "%u" instead of "%d" for
-so we can do a pure bitwise operation, either squashing any sign
+argument of type "unsigned int".
 bit (ABS) or inverting it (NEG).
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Reported-by: Euler Robot <euler.robot@huawei.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Signed-off-by: Alex Chen <alex.chen@huawei.com>
-Message-id: 20180227143852.11175-22-alex.bennee@linaro.org
+Message-id: 20201126111109.112238-3-alex.chen@huawei.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 16 +++++++++++++++-
+ hw/misc/imx31_ccm.c | 14 +++++++-------
-file changed, 15 insertions(+), 1 deletion(-)
+ hw/misc/imx_ccm.c   |  4 ++--
 files changed, 9 insertions(+), 9 deletions(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/hw/misc/imx31_ccm.c b/hw/misc/imx31_ccm.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/hw/misc/imx31_ccm.c
-+++ b/target/arm/translate-a64.c
++++ b/hw/misc/imx31_ccm.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static const char *imx31_ccm_reg_name(uint32_t reg)
-     TCGv_i32 tcg_rmode = NULL;
+     case IMX31_CCM_PDR2_REG:
-     TCGv_ptr tcg_fpstatus = NULL;
+         return "PDR2";
-     bool need_rmode = false;
+     default:
-+    bool need_fpst = true;
+-        sprintf(unknown, "[%d ?]", reg);
-     int rmode;
++        sprintf(unknown, "[%u ?]", reg);
+         return unknown;
-     if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
+     }
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+ }
-         need_rmode = true;
+@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_pll_ref_clk(IMXCCMState *dev)
-         rmode = FPROUNDING_ZERO;
+         freq = CKIH_FREQ;
      }
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_mpll_clk(IMXCCMState *dev)
      freq = imx_ccm_calc_pll(s->reg[IMX31_CCM_MPCTL_REG],
                              imx31_ccm_get_pll_ref_clk(dev));
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_mcu_main_clk(IMXCCMState *dev)
          freq = imx31_ccm_get_mpll_clk(dev);
      }
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_hclk_clk(IMXCCMState *dev)
      freq = imx31_ccm_get_mcu_main_clk(dev)
             / (1 + EXTRACT(s->reg[IMX31_CCM_PDR0_REG], MAX));
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_ipg_clk(IMXCCMState *dev)
      freq = imx31_ccm_get_hclk_clk(dev)
             / (1 + EXTRACT(s->reg[IMX31_CCM_PDR0_REG], IPG));
 -    DPRINTF("freq = %d\n", freq);
 +    DPRINTF("freq = %u\n", freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx31_ccm_get_clock_frequency(IMXCCMState *dev, IMXClk clock)
          break;
-+    case 0x2f: /* FABS */
-+    case 0x6f: /* FNEG */
-+        need_fpst = false;
-+        break;
-     default:
-         fprintf(stderr, "%s: insn %#04x fpop %#2x\n", __func__, insn, fpop);
-         g_assert_not_reached();
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
-         return;
      }
--    if (need_rmode) {
+-    DPRINTF("Clock = %d) = %d\n", clock, freq);
-+    if (need_rmode || need_fpst) {
++    DPRINTF("Clock = %d) = %u\n", clock, freq);
-         tcg_fpstatus = get_fpstatus_ptr(true);
      return freq;
  }
 diff --git a/hw/misc/imx_ccm.c b/hw/misc/imx_ccm.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/misc/imx_ccm.c
 +++ b/hw/misc/imx_ccm.c
@@ -XXX,XX +XXX,XX @@ uint32_t imx_ccm_get_clock_frequency(IMXCCMState *dev, IMXClk clock)
          freq = klass->get_clock_frequency(dev, clock);
      }
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+-    DPRINTF("(clock = %d) = %d\n", clock, freq);
-         case 0x7b: /* FCVTZU */
++    DPRINTF("(clock = %d) = %u\n", clock, freq);
-             gen_helper_advsimd_f16touinth(tcg_res, tcg_op, tcg_fpstatus);
-             break;
+     return freq;
-+        case 0x6f: /* FNEG */
+ }
-+            tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
+@@ -XXX,XX +XXX,XX @@ uint32_t imx_ccm_calc_pll(uint32_t pllreg, uint32_t base_freq)
-+            break;
+     freq = ((2 * (base_freq >> 10) * (mfi * mfd + mfn)) /
-         default:
+             (mfd * pd)) << 10;
-             g_assert_not_reached();
-         }
+-    DPRINTF("(pllreg = 0x%08x, base_freq = %d) = %d\n", pllreg, base_freq,
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
++    DPRINTF("(pllreg = 0x%08x, base_freq = %u) = %d\n", pllreg, base_freq,
-             case 0x59: /* FRINTX */
+             freq);
-                 gen_helper_advsimd_rinth_exact(tcg_res, tcg_op, tcg_fpstatus);
-                 break;
+     return freq;
 +            case 0x2f: /* FABS */
 +                tcg_gen_andi_i32(tcg_res, tcg_op, 0x7fff);
 +                break;
 +            case 0x6f: /* FNEG */
 +                tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
 +                break;
              default:
                  g_assert_not_reached();
              }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 36/42] arm/translate-a64: add FP16 FMOV to simd_mod_imm
+[PULL 10/36] i.MX6: Fix bad printf format specifiers
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Alex Chen <alex.chen@huawei.com>
-Only one half-precision instruction has been added to this group.
+We should use printf format specifier "%u" instead of "%d" for
 argument of type "unsigned int".
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Reported-by: Euler Robot <euler.robot@huawei.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Signed-off-by: Alex Chen <alex.chen@huawei.com>
-Message-id: 20180227143852.11175-29-alex.bennee@linaro.org
+Message-id: 20201126111109.112238-4-alex.chen@huawei.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 35 +++++++++++++++++++++++++----------
+ hw/misc/imx6_ccm.c | 20 ++++++++++----------
-file changed, 25 insertions(+), 10 deletions(-)
+ hw/misc/imx6_src.c |  2 +-
 files changed, 11 insertions(+), 11 deletions(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/hw/misc/imx6_ccm.c b/hw/misc/imx6_ccm.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/hw/misc/imx6_ccm.c
-+++ b/target/arm/translate-a64.c
++++ b/hw/misc/imx6_ccm.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_copy(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static const char *imx6_ccm_reg_name(uint32_t reg)
-  *   MVNI - move inverted (shifted) imm into register
+     case CCM_CMEOR:
-  *   ORR  - bitwise OR of (shifted) imm with register
+         return "CMEOR";
-  *   BIC  - bitwise clear of (shifted) imm with register
+     default:
-+ * With ARMv8.2 we also have:
+-        sprintf(unknown, "%d ?", reg);
-+ *   FMOV half-precision
++        sprintf(unknown, "%u ?", reg);
-  */
+         return unknown;
  static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
  {
@@ -XXX,XX +XXX,XX @@ static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
      uint64_t imm = 0;
      if (o2 != 0 || ((cmode == 0xf) && is_neg && !is_q)) {
 -        unallocated_encoding(s);
 -        return;
 +        /* Check for FMOV (vector, immediate) - half-precision */
 +        if (!(arm_dc_feature(s, ARM_FEATURE_V8_FP16) && o2 && cmode == 0xf)) {
 +            unallocated_encoding(s);
 +            return;
 +        }
      }
+ }
-     if (!fp_access_check(s)) {
+@@ -XXX,XX +XXX,XX @@ static const char *imx6_analog_reg_name(uint32_t reg)
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_mod_imm(DisasContext *s, uint32_t insn)
+     case USB_ANALOG_DIGPROG:
-                     imm |= 0x4000000000000000ULL;
+         return "USB_ANALOG_DIGPROG";
-                 }
+     default:
-             } else {
+-        sprintf(unknown, "%d ?", reg);
--                imm = (abcdefgh & 0x3f) << 19;
++        sprintf(unknown, "%u ?", reg);
--                if (abcdefgh & 0x80) {
+         return unknown;
--                    imm |= 0x80000000;
+     }
--                }
+ }
--                if (abcdefgh & 0x40) {
+@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_analog_get_pll2_clk(IMX6CCMState *dev)
--                    imm |= 0x3e000000;
+         freq *= 20;
-+                if (o2) {
+     }
-+                    /* FMOV (vector, immediate) - half-precision */
-+                    imm = vfp_expand_imm(MO_16, abcdefgh);
+-    DPRINTF("freq = %d\n", (uint32_t)freq);
-+                    /* now duplicate across the lanes */
++    DPRINTF("freq = %u\n", (uint32_t)freq);
-+                    imm = bitfield_replicate(imm, 16);
-                 } else {
+     return freq;
--                    imm |= 0x40000000;
+ }
-+                    imm = (abcdefgh & 0x3f) << 19;
+@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_analog_get_pll2_pfd0_clk(IMX6CCMState *dev)
-+                    if (abcdefgh & 0x80) {
+     freq = imx6_analog_get_pll2_clk(dev) * 18
-+                        imm |= 0x80000000;
+            / EXTRACT(dev->analog[CCM_ANALOG_PFD_528], PFD0_FRAC);
-+                    }
-+                    if (abcdefgh & 0x40) {
+-    DPRINTF("freq = %d\n", (uint32_t)freq);
-+                        imm |= 0x3e000000;
++    DPRINTF("freq = %u\n", (uint32_t)freq);
-+                    } else {
-+                        imm |= 0x40000000;
+     return freq;
-+                    }
+ }
-+                    imm |= (imm << 32);
+@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_analog_get_pll2_pfd2_clk(IMX6CCMState *dev)
-                 }
+     freq = imx6_analog_get_pll2_clk(dev) * 18
--                imm |= (imm << 32);
+            / EXTRACT(dev->analog[CCM_ANALOG_PFD_528], PFD2_FRAC);
-             }
-         }
+-    DPRINTF("freq = %d\n", (uint32_t)freq);
 +    DPRINTF("freq = %u\n", (uint32_t)freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_analog_get_periph_clk(IMX6CCMState *dev)
          break;
-+    default:
-+        fprintf(stderr, "%s: cmode_3_1: %x\n", __func__, cmode_3_1);
-+        g_assert_not_reached();
      }
-     if (cmode_3_1 != 7 && is_neg) {
+-    DPRINTF("freq = %d\n", (uint32_t)freq);
 +    DPRINTF("freq = %u\n", (uint32_t)freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_ccm_get_ahb_clk(IMX6CCMState *dev)
      freq = imx6_analog_get_periph_clk(dev)
             / (1 + EXTRACT(dev->ccm[CCM_CBCDR], AHB_PODF));
 -    DPRINTF("freq = %d\n", (uint32_t)freq);
 +    DPRINTF("freq = %u\n", (uint32_t)freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_ccm_get_ipg_clk(IMX6CCMState *dev)
      freq = imx6_ccm_get_ahb_clk(dev)
             / (1 + EXTRACT(dev->ccm[CCM_CBCDR], IPG_PODF));
 -    DPRINTF("freq = %d\n", (uint32_t)freq);
 +    DPRINTF("freq = %u\n", (uint32_t)freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint64_t imx6_ccm_get_per_clk(IMX6CCMState *dev)
      freq = imx6_ccm_get_ipg_clk(dev)
             / (1 + EXTRACT(dev->ccm[CCM_CSCMR1], PERCLK_PODF));
 -    DPRINTF("freq = %d\n", (uint32_t)freq);
 +    DPRINTF("freq = %u\n", (uint32_t)freq);
      return freq;
  }
@@ -XXX,XX +XXX,XX @@ static uint32_t imx6_ccm_get_clock_frequency(IMXCCMState *dev, IMXClk clock)
          break;
      }
 -    DPRINTF("Clock = %d) = %d\n", clock, freq);
 +    DPRINTF("Clock = %d) = %u\n", clock, freq);
      return freq;
  }
 diff --git a/hw/misc/imx6_src.c b/hw/misc/imx6_src.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/misc/imx6_src.c
 +++ b/hw/misc/imx6_src.c
@@ -XXX,XX +XXX,XX @@ static const char *imx6_src_reg_name(uint32_t reg)
      case SRC_GPR10:
          return "SRC_GPR10";
      default:
 -        sprintf(unknown, "%d ?", reg);
 +        sprintf(unknown, "%u ?", reg);
          return unknown;
      }
  }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 06/42] hw/i2c-ddc: Do not fail writes
+[PULL 11/36] i.MX6ul: Fix bad printf format specifiers
-From: Linus Walleij <linus.walleij@linaro.org>
+From: Alex Chen <alex.chen@huawei.com>
-The tx function of the DDC I2C slave emulation was returning 1
+We should use printf format specifier "%u" instead of "%d" for
-on all writes resulting in NACK in the I2C bus. Changing it to
+argument of type "unsigned int".
 makes the DDC I2C work fine with bit-banged I2C such as the
 versatile I2C.
-I guess it was not affecting whatever I2C controller this was
+Reported-by: Euler Robot <euler.robot@huawei.com>
-used with until now, but with the Versatile I2C it surely
+Signed-off-by: Alex Chen <alex.chen@huawei.com>
-does not work.
+Message-id: 20201126111109.112238-5-alex.chen@huawei.com
 Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
-Message-id: 20180227104903.21353-4-linus.walleij@linaro.org
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/i2c/i2c-ddc.c | 4 ++--
+ hw/misc/imx6ul_ccm.c | 4 ++--
 file changed, 2 insertions(+), 2 deletions(-)
-diff --git a/hw/i2c/i2c-ddc.c b/hw/i2c/i2c-ddc.c
+diff --git a/hw/misc/imx6ul_ccm.c b/hw/misc/imx6ul_ccm.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/i2c/i2c-ddc.c
+--- a/hw/misc/imx6ul_ccm.c
-+++ b/hw/i2c/i2c-ddc.c
++++ b/hw/misc/imx6ul_ccm.c
-@@ -XXX,XX +XXX,XX @@ static int i2c_ddc_tx(I2CSlave *i2c, uint8_t data)
+@@ -XXX,XX +XXX,XX @@ static const char *imx6ul_ccm_reg_name(uint32_t reg)
-         s->reg = data;
+     case CCM_CMEOR:
-         s->firstbyte = false;
+         return "CMEOR";
-         DPRINTF("[EDID] Written new pointer: %u\n", data);
+     default:
--        return 1;
+-        sprintf(unknown, "%d ?", reg);
-+        return 0;
++        sprintf(unknown, "%u ?", reg);
          return unknown;
      }
-     /* Ignore all writes */
-     s->reg++;
--    return 1;
-+    return 0;
  }
+@@ -XXX,XX +XXX,XX @@ static const char *imx6ul_analog_reg_name(uint32_t reg)
- static void i2c_ddc_init(Object *obj)
+     case USB_ANALOG_DIGPROG:
          return "USB_ANALOG_DIGPROG";
      default:
 -        sprintf(unknown, "%d ?", reg);
 +        sprintf(unknown, "%u ?", reg);
          return unknown;
      }
  }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 05/42] i2c: Move the bus class to i2c.h
+[PULL 12/36] hw/intc/armv7m_nvic: Make all of system PPB range be RAZWI/BusFault
-From: Corey Minyard <cminyard@mvista.com>
+For M-profile CPUs, the range from 0xe0000000 to 0xe00fffff is the
 Private Peripheral Bus range, which includes all of the memory mapped
 devices and registers that are part of the CPU itself, including the
 NVIC, systick timer, and debug and trace components like the Data
 Watchpoint and Trace unit (DWT).  Within this large region, the range
 xe000e000 to 0xe000efff is the System Control Space (NVIC, system
 registers, systick) and 0xe002e000 to 0exe002efff is its Non-secure
 alias.
-Some devices need access to it.
+The architecture is clear that within the SCS unimplemented registers
 should be RES0 for privileged accesses and generate BusFault for
 unprivileged accesses, and we currently implement this.
-Signed-off-by: Corey Minyard <cminyard@mvista.com>
+It is less clear about how to handle accesses to unimplemented
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
+regions of the wider PPB.  Unprivileged accesses should definitely
-Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
+cause BusFaults (R_DQQS), but the behaviour of privileged accesses is
-Message-id: 20180227104903.21353-3-linus.walleij@linaro.org
+not given as a general rule.  However, the register definitions of
 individual registers for components like the DWT all state that they
 are RES0 if the relevant component is not implemented, so the
 simplest way to provide that is to provide RAZ/WI for the whole range
 for privileged accesses.  (The v7M Arm ARM does say that reserved
 registers should be UNK/SBZP.)
 Expand the container MemoryRegion that the NVIC exposes so that
 it covers the whole PPB space. This means:
  * moving the address that the ARMV7M device maps it to down by
 xe000 bytes
  * moving the off and the offsets within the container of all the
    subregions forward by 0xe000 bytes
  * adding a new default MemoryRegion that covers the whole container
    at a lower priority than anything else and which provides the
    RAZWI/BusFault behaviour
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20201119215617.29887-2-peter.maydell@linaro.org
 ---
- include/hw/i2c/i2c.h | 17 +++++++++++++++++
+ include/hw/intc/armv7m_nvic.h |  1 +
- hw/i2c/core.c        | 17 -----------------
+ hw/arm/armv7m.c               |  2 +-
-files changed, 17 insertions(+), 17 deletions(-)
+ hw/intc/armv7m_nvic.c         | 78 ++++++++++++++++++++++++++++++-----
 files changed, 69 insertions(+), 12 deletions(-)
-diff --git a/include/hw/i2c/i2c.h b/include/hw/i2c/i2c.h
+diff --git a/include/hw/intc/armv7m_nvic.h b/include/hw/intc/armv7m_nvic.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/hw/i2c/i2c.h
+--- a/include/hw/intc/armv7m_nvic.h
-+++ b/include/hw/i2c/i2c.h
++++ b/include/hw/intc/armv7m_nvic.h
-@@ -XXX,XX +XXX,XX @@ struct I2CSlave {
+@@ -XXX,XX +XXX,XX @@ struct NVICState {
-     uint8_t address;
+     MemoryRegion systickmem;
      MemoryRegion systick_ns_mem;
      MemoryRegion container;
 +    MemoryRegion defaultmem;
      uint32_t num_irq;
      qemu_irq excpout;
 diff --git a/hw/arm/armv7m.c b/hw/arm/armv7m.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/arm/armv7m.c
 +++ b/hw/arm/armv7m.c
@@ -XXX,XX +XXX,XX @@ static void armv7m_realize(DeviceState *dev, Error **errp)
      sysbus_connect_irq(sbd, 0,
                         qdev_get_gpio_in(DEVICE(s->cpu), ARM_CPU_IRQ));
 -    memory_region_add_subregion(&s->container, 0xe000e000,
 +    memory_region_add_subregion(&s->container, 0xe0000000,
                                  sysbus_mmio_get_region(sbd, 0));
      for (i = 0; i < ARRAY_SIZE(s->bitband); i++) {
 diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/intc/armv7m_nvic.c
 +++ b/hw/intc/armv7m_nvic.c
@@ -XXX,XX +XXX,XX @@ static const MemoryRegionOps nvic_systick_ops = {
      .endianness = DEVICE_NATIVE_ENDIAN,
  };
-+#define TYPE_I2C_BUS "i2c-bus"
++/*
-+#define I2C_BUS(obj) OBJECT_CHECK(I2CBus, (obj), TYPE_I2C_BUS)
++ * Unassigned portions of the PPB space are RAZ/WI for privileged
 + * accesses, and fault for non-privileged accesses.
 + */
 +static MemTxResult ppb_default_read(void *opaque, hwaddr addr,
 +                                    uint64_t *data, unsigned size,
 +                                    MemTxAttrs attrs)
 +{
 +    qemu_log_mask(LOG_UNIMP, "Read of unassigned area of PPB: offset 0x%x\n",
 +                  (uint32_t)addr);
 +    if (attrs.user) {
 +        return MEMTX_ERROR;
 +    }
 +    *data = 0;
 +    return MEMTX_OK;
 +}
 +
-+typedef struct I2CNode I2CNode;
++static MemTxResult ppb_default_write(void *opaque, hwaddr addr,
 +                                     uint64_t value, unsigned size,
 +                                     MemTxAttrs attrs)
 +{
 +    qemu_log_mask(LOG_UNIMP, "Write of unassigned area of PPB: offset 0x%x\n",
 +                  (uint32_t)addr);
 +    if (attrs.user) {
 +        return MEMTX_ERROR;
 +    }
 +    return MEMTX_OK;
 +}
 +
-+struct I2CNode {
++static const MemoryRegionOps ppb_default_ops = {
-+    I2CSlave *elt;
++    .read_with_attrs = ppb_default_read,
-+    QLIST_ENTRY(I2CNode) next;
++    .write_with_attrs = ppb_default_write,
 +    .endianness = DEVICE_NATIVE_ENDIAN,
 +    .valid.min_access_size = 1,
 +    .valid.max_access_size = 8,
 +};
 +
-+struct I2CBus {
+ static int nvic_post_load(void *opaque, int version_id)
-+    BusState qbus;
+ {
-+    QLIST_HEAD(, I2CNode) current_devs;
+     NVICState *s = opaque;
-+    uint8_t saved_address;
+@@ -XXX,XX +XXX,XX @@ static void nvic_systick_trigger(void *opaque, int n, int level)
-+    bool broadcast;
+ static void armv7m_nvic_realize(DeviceState *dev, Error **errp)
-+};
+ {
-+
+     NVICState *s = NVIC(dev);
- I2CBus *i2c_init_bus(DeviceState *parent, const char *name);
+-    int regionlen;
- void i2c_set_slave_address(I2CSlave *dev, uint8_t address);
- int i2c_bus_busy(I2CBus *bus);
+     /* The armv7m container object will have set our CPU pointer */
-diff --git a/hw/i2c/core.c b/hw/i2c/core.c
+     if (!s->cpu || !arm_feature(&s->cpu->env, ARM_FEATURE_M)) {
-index XXXXXXX..XXXXXXX 100644
+@@ -XXX,XX +XXX,XX @@ static void armv7m_nvic_realize(DeviceState *dev, Error **errp)
---- a/hw/i2c/core.c
+                                                   M_REG_S));
-+++ b/hw/i2c/core.c
+     }
-@@ -XXX,XX +XXX,XX @@
- #include "qemu/osdep.h"
+-    /* The NVIC and System Control Space (SCS) starts at 0xe000e000
- #include "hw/i2c/i2c.h"
++    /*
++     * This device provides a single sysbus memory region which
--typedef struct I2CNode I2CNode;
++     * represents the whole of the "System PPB" space. This is the
--
++     * range from 0xe0000000 to 0xe00fffff and includes the NVIC,
--struct I2CNode {
++     * the System Control Space (system registers), the systick timer,
--    I2CSlave *elt;
++     * and for CPUs with the Security extension an NS banked version
--    QLIST_ENTRY(I2CNode) next;
++     * of all of these.
--};
++     *
--
++     * The default behaviour for unimplemented registers/ranges
- #define I2C_BROADCAST 0x00
++     * (for instance the Data Watchpoint and Trace unit at 0xe0001000)
++     * is to RAZ/WI for privileged access and BusFault for non-privileged
--struct I2CBus {
++     * access.
--    BusState qbus;
++     *
--    QLIST_HEAD(, I2CNode) current_devs;
++     * The NVIC and System Control Space (SCS) starts at 0xe000e000
--    uint8_t saved_address;
+      * and looks like this:
--    bool broadcast;
+      *  0x004 - ICTR
--};
+      *  0x010 - 0xff - systick
--
+@@ -XXX,XX +XXX,XX @@ static void armv7m_nvic_realize(DeviceState *dev, Error **errp)
- static Property i2c_props[] = {
+      * generally code determining which banked register to use should
-     DEFINE_PROP_UINT8("address", struct I2CSlave, address, 0),
+      * use attrs.secure; code determining actual behaviour of the system
-     DEFINE_PROP_END_OF_LIST(),
+      * should use env->v7m.secure.
- };
++     *
++     * The container covers the whole PPB space. Within it the priority
--#define TYPE_I2C_BUS "i2c-bus"
++     * of overlapping regions is:
--#define I2C_BUS(obj) OBJECT_CHECK(I2CBus, (obj), TYPE_I2C_BUS)
++     *  - default region (for RAZ/WI and BusFault) : -1
--
++     *  - system register regions : 0
- static const TypeInfo i2c_bus_info = {
++     *  - systick : 1
-     .name = TYPE_I2C_BUS,
++     * This is because the systick device is a small block of registers
-     .parent = TYPE_BUS,
++     * in the middle of the other system control registers.
       */
 -    regionlen = arm_feature(&s->cpu->env, ARM_FEATURE_V8) ? 0x21000 : 0x1000;
 -    memory_region_init(&s->container, OBJECT(s), "nvic", regionlen);
 -    /* The system register region goes at the bottom of the priority
 -     * stack as it covers the whole page.
 -     */
 +    memory_region_init(&s->container, OBJECT(s), "nvic", 0x100000);
 +    memory_region_init_io(&s->defaultmem, OBJECT(s), &ppb_default_ops, s,
 +                          "nvic-default", 0x100000);
 +    memory_region_add_subregion_overlap(&s->container, 0, &s->defaultmem, -1);
      memory_region_init_io(&s->sysregmem, OBJECT(s), &nvic_sysreg_ops, s,
                            "nvic_sysregs", 0x1000);
 -    memory_region_add_subregion(&s->container, 0, &s->sysregmem);
 +    memory_region_add_subregion(&s->container, 0xe000, &s->sysregmem);
      memory_region_init_io(&s->systickmem, OBJECT(s),
                            &nvic_systick_ops, s,
                            "nvic_systick", 0xe0);
 -    memory_region_add_subregion_overlap(&s->container, 0x10,
 +    memory_region_add_subregion_overlap(&s->container, 0xe010,
                                          &s->systickmem, 1);
      if (arm_feature(&s->cpu->env, ARM_FEATURE_V8)) {
          memory_region_init_io(&s->sysreg_ns_mem, OBJECT(s),
                                &nvic_sysreg_ns_ops, &s->sysregmem,
                                "nvic_sysregs_ns", 0x1000);
 -        memory_region_add_subregion(&s->container, 0x20000, &s->sysreg_ns_mem);
 +        memory_region_add_subregion(&s->container, 0x2e000, &s->sysreg_ns_mem);
          memory_region_init_io(&s->systick_ns_mem, OBJECT(s),
                                &nvic_sysreg_ns_ops, &s->systickmem,
                                "nvic_systick_ns", 0xe0);
 -        memory_region_add_subregion_overlap(&s->container, 0x20010,
 +        memory_region_add_subregion_overlap(&s->container, 0x2e010,
                                              &s->systick_ns_mem, 1);
      }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 28/42] arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16
+[PULL 13/36] target/arm: Implement v8.1M PXN extension
-From: Alex Bennée <alex.bennee@linaro.org>
+In v8.1M the PXN architecture extension adds a new PXN bit to the
 MPU_RLAR registers, which forbids execution of code in the region
 from a privileged mode.
-I've re-factored the handle_simd_intfp_conv helper to properly handle
+This is another feature which is just in the generic "in v8.1M" set
-half-precision as well as call plain conversion helpers when we are
+and has no ID register field indicating its presence.
 not doing fixed point conversion.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-21-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-3-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper.h        |  10 ++++
+ target/arm/helper.c | 7 ++++++-
- target/arm/helper.c        |   4 ++
+file changed, 6 insertions(+), 1 deletion(-)
  target/arm/translate-a64.c | 122 ++++++++++++++++++++++++++++++++++-----------
 files changed, 108 insertions(+), 28 deletions(-)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
-+++ b/target/arm/helper.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_cmped, void, f64, f64, env)
- DEF_HELPER_2(vfp_fcvtds, f64, f32, env)
- DEF_HELPER_2(vfp_fcvtsd, f32, f64, env)
-+DEF_HELPER_2(vfp_uitoh, f16, i32, ptr)
- DEF_HELPER_2(vfp_uitos, f32, i32, ptr)
- DEF_HELPER_2(vfp_uitod, f64, i32, ptr)
-+DEF_HELPER_2(vfp_sitoh, f16, i32, ptr)
- DEF_HELPER_2(vfp_sitos, f32, i32, ptr)
- DEF_HELPER_2(vfp_sitod, f64, i32, ptr)
-+DEF_HELPER_2(vfp_touih, i32, f16, ptr)
- DEF_HELPER_2(vfp_touis, i32, f32, ptr)
- DEF_HELPER_2(vfp_touid, i32, f64, ptr)
-+DEF_HELPER_2(vfp_touizh, i32, f16, ptr)
- DEF_HELPER_2(vfp_touizs, i32, f32, ptr)
- DEF_HELPER_2(vfp_touizd, i32, f64, ptr)
-+DEF_HELPER_2(vfp_tosih, i32, f16, ptr)
- DEF_HELPER_2(vfp_tosis, i32, f32, ptr)
- DEF_HELPER_2(vfp_tosid, i32, f64, ptr)
-+DEF_HELPER_2(vfp_tosizh, i32, f16, ptr)
- DEF_HELPER_2(vfp_tosizs, i32, f32, ptr)
- DEF_HELPER_2(vfp_tosizd, i32, f64, ptr)
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_toshd_round_to_zero, i64, f64, i32, ptr)
- DEF_HELPER_3(vfp_tosld_round_to_zero, i64, f64, i32, ptr)
- DEF_HELPER_3(vfp_touhd_round_to_zero, i64, f64, i32, ptr)
- DEF_HELPER_3(vfp_tould_round_to_zero, i64, f64, i32, ptr)
-+DEF_HELPER_3(vfp_toulh, i32, f16, i32, ptr)
-+DEF_HELPER_3(vfp_toslh, i32, f16, i32, ptr)
- DEF_HELPER_3(vfp_toshs, i32, f32, i32, ptr)
- DEF_HELPER_3(vfp_tosls, i32, f32, i32, ptr)
- DEF_HELPER_3(vfp_tosqs, i64, f32, i32, ptr)
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_sqtod, f64, i64, i32, ptr)
- DEF_HELPER_3(vfp_uhtod, f64, i64, i32, ptr)
- DEF_HELPER_3(vfp_ultod, f64, i64, i32, ptr)
- DEF_HELPER_3(vfp_uqtod, f64, i64, i32, ptr)
-+DEF_HELPER_3(vfp_sltoh, f16, i32, i32, ptr)
-+DEF_HELPER_3(vfp_ultoh, f16, i32, i32, ptr)
- DEF_HELPER_FLAGS_2(set_rmode, TCG_CALL_NO_RWG, i32, i32, ptr)
- DEF_HELPER_FLAGS_2(set_neon_rmode, TCG_CALL_NO_RWG, i32, i32, env)
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ CONV_ITOF(vfp_##name##to##p, fsz, sign) \
+@@ -XXX,XX +XXX,XX @@ bool pmsav8_mpu_lookup(CPUARMState *env, uint32_t address,
- CONV_FTOI(vfp_to##name##p, fsz, sign, ) \
+     } else {
- CONV_FTOI(vfp_to##name##z##p, fsz, sign, _round_to_zero)
+         uint32_t ap = extract32(env->pmsav8.rbar[secure][matchregion], 1, 2);
+         uint32_t xn = extract32(env->pmsav8.rbar[secure][matchregion], 0, 1);
-+FLOAT_CONVS(si, h, 16, )
++        bool pxn = false;
  FLOAT_CONVS(si, s, 32, )
  FLOAT_CONVS(si, d, 64, )
 +FLOAT_CONVS(ui, h, 16, u)
  FLOAT_CONVS(ui, s, 32, u)
  FLOAT_CONVS(ui, d, 64, u)
@@ -XXX,XX +XXX,XX @@ VFP_CONV_FIX_A64(sq, s, 32, 64, int64)
  VFP_CONV_FIX(uh, s, 32, 32, uint16)
  VFP_CONV_FIX(ul, s, 32, 32, uint32)
  VFP_CONV_FIX_A64(uq, s, 32, 64, uint64)
 +VFP_CONV_FIX_A64(sl, h, 16, 32, int32)
 +VFP_CONV_FIX_A64(ul, h, 16, 32, uint32)
  #undef VFP_CONV_FIX
  #undef VFP_CONV_FIX_FLOAT
  #undef VFP_CONV_FLOAT_FIX_ROUND
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void handle_simd_intfp_conv(DisasContext *s, int rd, int rn,
                                     int elements, int is_signed,
                                     int fracbits, int size)
  {
 -    bool is_double = size == 3 ? true : false;
 -    TCGv_ptr tcg_fpst = get_fpstatus_ptr(false);
 -    TCGv_i32 tcg_shift = tcg_const_i32(fracbits);
 -    TCGv_i64 tcg_int = tcg_temp_new_i64();
 +    TCGv_ptr tcg_fpst = get_fpstatus_ptr(size == MO_16);
 +    TCGv_i32 tcg_shift = NULL;
 +
-     TCGMemOp mop = size | (is_signed ? MO_SIGN : 0);
++        if (arm_feature(env, ARM_FEATURE_V8_1M)) {
-     int pass;
++            pxn = extract32(env->pmsav8.rlar[secure][matchregion], 4, 1);
++        }
--    for (pass = 0; pass < elements; pass++) {
--        read_vec_element(s, tcg_int, rn, pass, mop);
+         if (m_is_system_region(env, address)) {
-+    if (fracbits || size == MO_64) {
+             /* System space is always execute never */
-+        tcg_shift = tcg_const_i32(fracbits);
+@@ -XXX,XX +XXX,XX @@ bool pmsav8_mpu_lookup(CPUARMState *env, uint32_t address,
 +    }
 +
 +    if (size == MO_64) {
 +        TCGv_i64 tcg_int64 = tcg_temp_new_i64();
 +        TCGv_i64 tcg_double = tcg_temp_new_i64();
 +
 +        for (pass = 0; pass < elements; pass++) {
 +            read_vec_element(s, tcg_int64, rn, pass, mop);
 -        if (is_double) {
 -            TCGv_i64 tcg_double = tcg_temp_new_i64();
              if (is_signed) {
 -                gen_helper_vfp_sqtod(tcg_double, tcg_int,
 +                gen_helper_vfp_sqtod(tcg_double, tcg_int64,
                                       tcg_shift, tcg_fpst);
              } else {
 -                gen_helper_vfp_uqtod(tcg_double, tcg_int,
 +                gen_helper_vfp_uqtod(tcg_double, tcg_int64,
                                       tcg_shift, tcg_fpst);
              }
              if (elements == 1) {
@@ -XXX,XX +XXX,XX @@ static void handle_simd_intfp_conv(DisasContext *s, int rd, int rn,
              } else {
                  write_vec_element(s, tcg_double, rd, pass, MO_64);
              }
 -            tcg_temp_free_i64(tcg_double);
 -        } else {
 -            TCGv_i32 tcg_single = tcg_temp_new_i32();
 -            if (is_signed) {
 -                gen_helper_vfp_sqtos(tcg_single, tcg_int,
 -                                     tcg_shift, tcg_fpst);
 -            } else {
 -                gen_helper_vfp_uqtos(tcg_single, tcg_int,
 -                                     tcg_shift, tcg_fpst);
 -            }
 -            if (elements == 1) {
 -                write_fp_sreg(s, rd, tcg_single);
 -            } else {
 -                write_vec_element_i32(s, tcg_single, rd, pass, MO_32);
 -            }
 -            tcg_temp_free_i32(tcg_single);
          }
-+
-+        tcg_temp_free_i64(tcg_int64);
+         *prot = simple_ap_to_rw_prot(env, mmu_idx, ap);
-+        tcg_temp_free_i64(tcg_double);
+-        if (*prot && !xn) {
-+
++        if (*prot && !xn && !(pxn && !is_user)) {
-+    } else {
+             *prot |= PAGE_EXEC;
-+        TCGv_i32 tcg_int32 = tcg_temp_new_i32();
+         }
-+        TCGv_i32 tcg_float = tcg_temp_new_i32();
+         /* We don't need to look the attribute up in the MAIR0/MAIR1
 +
 +        for (pass = 0; pass < elements; pass++) {
 +            read_vec_element_i32(s, tcg_int32, rn, pass, mop);
 +
 +            switch (size) {
 +            case MO_32:
 +                if (fracbits) {
 +                    if (is_signed) {
 +                        gen_helper_vfp_sltos(tcg_float, tcg_int32,
 +                                             tcg_shift, tcg_fpst);
 +                    } else {
 +                        gen_helper_vfp_ultos(tcg_float, tcg_int32,
 +                                             tcg_shift, tcg_fpst);
 +                    }
 +                } else {
 +                    if (is_signed) {
 +                        gen_helper_vfp_sitos(tcg_float, tcg_int32, tcg_fpst);
 +                    } else {
 +                        gen_helper_vfp_uitos(tcg_float, tcg_int32, tcg_fpst);
 +                    }
 +                }
 +                break;
 +            case MO_16:
 +                if (fracbits) {
 +                    if (is_signed) {
 +                        gen_helper_vfp_sltoh(tcg_float, tcg_int32,
 +                                             tcg_shift, tcg_fpst);
 +                    } else {
 +                        gen_helper_vfp_ultoh(tcg_float, tcg_int32,
 +                                             tcg_shift, tcg_fpst);
 +                    }
 +                } else {
 +                    if (is_signed) {
 +                        gen_helper_vfp_sitoh(tcg_float, tcg_int32, tcg_fpst);
 +                    } else {
 +                        gen_helper_vfp_uitoh(tcg_float, tcg_int32, tcg_fpst);
 +                    }
 +                }
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
 +
 +            if (elements == 1) {
 +                write_fp_sreg(s, rd, tcg_float);
 +            } else {
 +                write_vec_element_i32(s, tcg_float, rd, pass, size);
 +            }
 +        }
 +
 +        tcg_temp_free_i32(tcg_int32);
 +        tcg_temp_free_i32(tcg_float);
      }
 -    tcg_temp_free_i64(tcg_int);
      tcg_temp_free_ptr(tcg_fpst);
 -    tcg_temp_free_i32(tcg_shift);
 +    if (tcg_shift) {
 +        tcg_temp_free_i32(tcg_shift);
 +    }
      clear_vec_high(s, elements << size == 16, rd);
  }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
      rn = extract32(insn, 5, 5);
      switch (fpop) {
 +    case 0x1d: /* SCVTF */
 +    case 0x5d: /* UCVTF */
 +    {
 +        int elements;
 +
 +        if (is_scalar) {
 +            elements = 1;
 +        } else {
 +            elements = (is_q ? 8 : 4);
 +        }
 +
 +        if (!fp_access_check(s)) {
 +            return;
 +        }
 +        handle_simd_intfp_conv(s, rd, rn, elements, !u, 0, MO_16);
 +        return;
 +    }
      break;
      case 0x2c: /* FCMGT (zero) */
      case 0x2d: /* FCMEQ (zero) */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 27/42] arm/translate-a64: add FP16 FCMxx (zero) to simd_two_reg_misc_fp16
+[PULL 14/36] target/arm: Don't clobber ID_PFR1.Security on M-profile cores
-From: Alex Bennée <alex.bennee@linaro.org>
+In arm_cpu_realizefn() we check whether the board code disabled EL3
 via the has_el3 CPU object property, which we create if the CPU
 starts with the ARM_FEATURE_EL3 feature bit.  If it is disabled, then
 we turn off ARM_FEATURE_EL3 and also zero out the relevant fields in
 the ID_PFR1 and ID_AA64PFR0 registers.
-I re-use the existing handle_2misc_fcmp_zero handler and tweak it
+This codepath was incorrectly being taken for M-profile CPUs, which
-slightly to deal with the half-precision case.
+do not have an EL3 and don't set ARM_FEATURE_EL3, but which may have
 the M-profile Security extension and so should have non-zero values
 in the ID_PFR1.Security field.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Restrict the handling of the feature flag to A/R-profile cores.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-20-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-4-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 80 +++++++++++++++++++++++++++++++++-------------
+ target/arm/cpu.c | 2 +-
-file changed, 57 insertions(+), 23 deletions(-)
+file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/cpu.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void handle_2misc_fcmp_zero(DisasContext *s, int opcode,
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
-                                    bool is_scalar, bool is_u, bool is_q,
+         }
                                     int size, int rn, int rd)
  {
 -    bool is_double = (size == 3);
 +    bool is_double = (size == MO_64);
      TCGv_ptr fpst;
      if (!fp_access_check(s)) {
          return;
      }
--    fpst = get_fpstatus_ptr(false);
+-    if (!cpu->has_el3) {
-+    fpst = get_fpstatus_ptr(size == MO_16);
++    if (!arm_feature(env, ARM_FEATURE_M) && !cpu->has_el3) {
+         /* If the has_el3 CPU property is disabled then we need to disable the
-     if (is_double) {
+          * feature.
-         TCGv_i64 tcg_op = tcg_temp_new_i64();
+          */
@@ -XXX,XX +XXX,XX @@ static void handle_2misc_fcmp_zero(DisasContext *s, int opcode,
          bool swap = false;
          int pass, maxpasses;
 -        switch (opcode) {
 -        case 0x2e: /* FCMLT (zero) */
 -            swap = true;
 -            /* fall through */
 -        case 0x2c: /* FCMGT (zero) */
 -            genfn = gen_helper_neon_cgt_f32;
 -            break;
 -        case 0x2d: /* FCMEQ (zero) */
 -            genfn = gen_helper_neon_ceq_f32;
 -            break;
 -        case 0x6d: /* FCMLE (zero) */
 -            swap = true;
 -            /* fall through */
 -        case 0x6c: /* FCMGE (zero) */
 -            genfn = gen_helper_neon_cge_f32;
 -            break;
 -        default:
 -            g_assert_not_reached();
 +        if (size == MO_16) {
 +            switch (opcode) {
 +            case 0x2e: /* FCMLT (zero) */
 +                swap = true;
 +                /* fall through */
 +            case 0x2c: /* FCMGT (zero) */
 +                genfn = gen_helper_advsimd_cgt_f16;
 +                break;
 +            case 0x2d: /* FCMEQ (zero) */
 +                genfn = gen_helper_advsimd_ceq_f16;
 +                break;
 +            case 0x6d: /* FCMLE (zero) */
 +                swap = true;
 +                /* fall through */
 +            case 0x6c: /* FCMGE (zero) */
 +                genfn = gen_helper_advsimd_cge_f16;
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
 +        } else {
 +            switch (opcode) {
 +            case 0x2e: /* FCMLT (zero) */
 +                swap = true;
 +                /* fall through */
 +            case 0x2c: /* FCMGT (zero) */
 +                genfn = gen_helper_neon_cgt_f32;
 +                break;
 +            case 0x2d: /* FCMEQ (zero) */
 +                genfn = gen_helper_neon_ceq_f32;
 +                break;
 +            case 0x6d: /* FCMLE (zero) */
 +                swap = true;
 +                /* fall through */
 +            case 0x6c: /* FCMGE (zero) */
 +                genfn = gen_helper_neon_cge_f32;
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
          }
          if (is_scalar) {
              maxpasses = 1;
          } else {
 -            maxpasses = is_q ? 4 : 2;
 +            int vector_size = 8 << is_q;
 +            maxpasses = vector_size >> size;
          }
          for (pass = 0; pass < maxpasses; pass++) {
 -            read_vec_element_i32(s, tcg_op, rn, pass, MO_32);
 +            read_vec_element_i32(s, tcg_op, rn, pass, size);
              if (swap) {
                  genfn(tcg_res, tcg_zero, tcg_op, fpst);
              } else {
@@ -XXX,XX +XXX,XX @@ static void handle_2misc_fcmp_zero(DisasContext *s, int opcode,
              if (is_scalar) {
                  write_fp_sreg(s, rd, tcg_res);
              } else {
 -                write_vec_element_i32(s, tcg_res, rd, pass, MO_32);
 +                write_vec_element_i32(s, tcg_res, rd, pass, size);
              }
          }
          tcg_temp_free_i32(tcg_res);
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
      fpop = deposit32(opcode, 5, 1, a);
      fpop = deposit32(fpop, 6, 1, u);
 +    rd = extract32(insn, 0, 5);
 +    rn = extract32(insn, 5, 5);
 +
      switch (fpop) {
 +    break;
 +    case 0x2c: /* FCMGT (zero) */
 +    case 0x2d: /* FCMEQ (zero) */
 +    case 0x2e: /* FCMLT (zero) */
 +    case 0x6c: /* FCMGE (zero) */
 +    case 0x6d: /* FCMLE (zero) */
 +        handle_2misc_fcmp_zero(s, fpop, is_scalar, 0, is_q, MO_16, rn, rd);
 +        return;
      case 0x18: /* FRINTN */
          need_rmode = true;
          only_in_vector = true;
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 13/42] target/arm/helper: pass explicit fpst to set_rmode
+[PULL 15/36] target/arm: Implement VSCCLRM insn
-From: Alex Bennée <alex.bennee@linaro.org>
+Implement the v8.1M VSCCLRM insn, which zeros floating point
+registers if there is an active floating point context.
-As the rounding mode is now split between FP16 and the rest of
+This requires support in write_neon_element32() for the MO_32
-floating point we need to be explicit when tweaking it. Instead of
+element size, so add it.
-passing the CPU env we now pass the appropriate fpst pointer directly.
+Because we want to use arm_gen_condlabel(), we need to move
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+the definition of that function up in translate.c so it is
 before the #include of translate-vfp.c.inc.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-6-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-5-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper.h        |  2 +-
+ target/arm/cpu.h               |  9 ++++
- target/arm/helper.c        |  4 ++--
+ target/arm/m-nocp.decode       |  8 +++-
- target/arm/translate-a64.c | 26 +++++++++++++-------------
+ target/arm/translate.c         | 21 +++++----
- target/arm/translate.c     | 12 ++++++------
+ target/arm/translate-vfp.c.inc | 84 ++++++++++++++++++++++++++++++++++
-files changed, 22 insertions(+), 22 deletions(-)
+files changed, 111 insertions(+), 11 deletions(-)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
+diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
+--- a/target/arm/cpu.h
-+++ b/target/arm/helper.h
++++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_uhtod, f64, i64, i32, ptr)
+@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa32_mprofile(const ARMISARegisters *id)
- DEF_HELPER_3(vfp_ultod, f64, i64, i32, ptr)
+     return FIELD_EX32(id->id_pfr1, ID_PFR1, MPROGMOD) != 0;
- DEF_HELPER_3(vfp_uqtod, f64, i64, i32, ptr)
+ }
--DEF_HELPER_FLAGS_2(set_rmode, TCG_CALL_NO_RWG, i32, i32, env)
++static inline bool isar_feature_aa32_m_sec_state(const ARMISARegisters *id)
-+DEF_HELPER_FLAGS_2(set_rmode, TCG_CALL_NO_RWG, i32, i32, ptr)
++{
- DEF_HELPER_FLAGS_2(set_neon_rmode, TCG_CALL_NO_RWG, i32, i32, env)
++    /*
++     * Return true if M-profile state handling insns
- DEF_HELPER_2(vfp_fcvt_f16_to_f32, f32, i32, env)
++     * (VSCCLRM, CLRM, FPCTX access insns) are implemented
-diff --git a/target/arm/helper.c b/target/arm/helper.c
++     */
-index XXXXXXX..XXXXXXX 100644
++    return FIELD_EX32(id->id_pfr1, ID_PFR1, SECURITY) >= 3;
---- a/target/arm/helper.c
++}
-+++ b/target/arm/helper.c
++
-@@ -XXX,XX +XXX,XX @@ VFP_CONV_FIX_A64(uq, s, 32, 64, uint64)
+ static inline bool isar_feature_aa32_fp16_arith(const ARMISARegisters *id)
- /* Set the current fp rounding mode and return the old one.
+ {
-  * The argument is a softfloat float_round_ value.
+     /* Sadly this is encoded differently for A-profile and M-profile */
-  */
+diff --git a/target/arm/m-nocp.decode b/target/arm/m-nocp.decode
--uint32_t HELPER(set_rmode)(uint32_t rmode, CPUARMState *env)
+index XXXXXXX..XXXXXXX 100644
-+uint32_t HELPER(set_rmode)(uint32_t rmode, void *fpstp)
+--- a/target/arm/m-nocp.decode
- {
++++ b/target/arm/m-nocp.decode
--    float_status *fp_status = &env->vfp.fp_status;
+@@ -XXX,XX +XXX,XX @@
-+    float_status *fp_status = fpstp;
+ # If the coprocessor is not present or disabled then we will generate
+ # the NOCP exception; otherwise we let the insn through to the main decode.
-     uint32_t prev_rmode = get_float_rounding_mode(fp_status);
-     set_float_rounding_mode(rmode, fp_status);
++%vd_dp  22:1 12:4
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
++%vd_sp  12:4 22:1
-index XXXXXXX..XXXXXXX 100644
++
---- a/target/arm/translate-a64.c
+ &nocp cp
-+++ b/target/arm/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
+ {
-     {
+   # Special cases which do not take an early NOCP: VLLDM and VLSTM
-         TCGv_i32 tcg_rmode = tcg_const_i32(arm_rmode_to_sf(opcode & 7));
+   VLLDM_VLSTM  1110 1100 001 l:1 rn:4 0000 1010 0000 0000
+-  # TODO: VSCCLRM (new in v8.1M) is similar:
--        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
+-  #VSCCLRM      1110 1100 1-01 1111 ---- 1011 ---- ---0
-+        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
++  # VSCCLRM (new in v8.1M) is similar:
-         gen_helper_rints(tcg_res, tcg_op, fpst);
++  VSCCLRM      1110 1100 1.01 1111 .... 1011 imm:7 0   vd=%vd_dp size=3
++  VSCCLRM      1110 1100 1.01 1111 .... 1010 imm:8     vd=%vd_sp size=2
--        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
-+        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+   NOCP         111- 1110 ---- ---- ---- cp:4 ---- ---- &nocp
-         tcg_temp_free_i32(tcg_rmode);
+   NOCP         111- 110- ---- ---- ---- cp:4 ---- ---- &nocp
          break;
      }
@@ -XXX,XX +XXX,XX @@ static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
      {
          TCGv_i32 tcg_rmode = tcg_const_i32(arm_rmode_to_sf(opcode & 7));
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
          gen_helper_rintd(tcg_res, tcg_op, fpst);
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
          tcg_temp_free_i32(tcg_rmode);
          break;
      }
@@ -XXX,XX +XXX,XX @@ static void handle_fpfpcvt(DisasContext *s, int rd, int rn, int opcode,
          tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
          if (is_double) {
              TCGv_i64 tcg_double = read_fp_dreg(s, rn);
@@ -XXX,XX +XXX,XX @@ static void handle_fpfpcvt(DisasContext *s, int rd, int rn, int opcode,
              tcg_temp_free_i32(tcg_single);
          }
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
          tcg_temp_free_i32(tcg_rmode);
          if (!sf) {
@@ -XXX,XX +XXX,XX @@ static void handle_simd_shift_fpint_conv(DisasContext *s, bool is_scalar,
      assert(!(is_scalar && is_q));
      tcg_rmode = tcg_const_i32(arm_rmode_to_sf(FPROUNDING_ZERO));
 -    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
      tcg_fpstatus = get_fpstatus_ptr(false);
 +    gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
      tcg_shift = tcg_const_i32(fracbits);
      if (is_double) {
@@ -XXX,XX +XXX,XX @@ static void handle_simd_shift_fpint_conv(DisasContext *s, bool is_scalar,
      tcg_temp_free_ptr(tcg_fpstatus);
      tcg_temp_free_i32(tcg_shift);
 -    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +    gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
      tcg_temp_free_i32(tcg_rmode);
  }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_two_reg_misc(DisasContext *s, uint32_t insn)
      if (is_fcvt) {
          tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
          tcg_fpstatus = get_fpstatus_ptr(false);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
      } else {
          tcg_rmode = NULL;
          tcg_fpstatus = NULL;
@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_two_reg_misc(DisasContext *s, uint32_t insn)
      }
      if (is_fcvt) {
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
          tcg_temp_free_i32(tcg_rmode);
          tcg_temp_free_ptr(tcg_fpstatus);
      }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
          return;
      }
 -    if (need_fpstatus) {
 +    if (need_fpstatus || need_rmode) {
          tcg_fpstatus = get_fpstatus_ptr(false);
      } else {
          tcg_fpstatus = NULL;
      }
      if (need_rmode) {
          tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
      } else {
          tcg_rmode = NULL;
      }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
      clear_vec_high(s, is_q, rd);
      if (need_rmode) {
 -        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
          tcg_temp_free_i32(tcg_rmode);
      }
      if (need_fpstatus) {
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
-@@ -XXX,XX +XXX,XX @@ static int handle_vrint(uint32_t insn, uint32_t rd, uint32_t rm, uint32_t dp,
+@@ -XXX,XX +XXX,XX @@ void arm_translate_init(void)
-     TCGv_i32 tcg_rmode;
+     a64_translate_init();
+ }
-     tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rounding));
--    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
++/* Generate a label used for skipping this instruction */
-+    gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
++static void arm_gen_condlabel(DisasContext *s)
++{
-     if (dp) {
++    if (!s->condjmp) {
-         TCGv_i64 tcg_op;
++        s->condlabel = gen_new_label();
-@@ -XXX,XX +XXX,XX @@ static int handle_vrint(uint32_t insn, uint32_t rd, uint32_t rm, uint32_t dp,
++        s->condjmp = 1;
-         tcg_temp_free_i32(tcg_res);
++    }
-     }
++}
++
--    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
+ /* Flags for the disas_set_da_iss info argument:
-+    gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+  * lower bits hold the Rt register number, higher bits are flags.
-     tcg_temp_free_i32(tcg_rmode);
+  */
+@@ -XXX,XX +XXX,XX @@ static void write_neon_element64(TCGv_i64 src, int reg, int ele, MemOp memop)
-     tcg_temp_free_ptr(fpst);
+     long off = neon_element_offset(reg, ele, memop);
-@@ -XXX,XX +XXX,XX @@ static int handle_vcvt(uint32_t insn, uint32_t rd, uint32_t rm, uint32_t dp,
-     tcg_shift = tcg_const_i32(0);
+     switch (memop) {
++    case MO_32:
-     tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rounding));
++        tcg_gen_st32_i64(src, cpu_env, off);
--    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
++        break;
-+    gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+     case MO_64:
+         tcg_gen_st_i64(src, cpu_env, off);
-     if (dp) {
+         break;
-         TCGv_i64 tcg_double, tcg_res;
+@@ -XXX,XX +XXX,XX @@ static void gen_srs(DisasContext *s,
-@@ -XXX,XX +XXX,XX @@ static int handle_vcvt(uint32_t insn, uint32_t rd, uint32_t rm, uint32_t dp,
+     s->base.is_jmp = DISAS_UPDATE_EXIT;
-         tcg_temp_free_i32(tcg_single);
+ }
-     }
+-/* Generate a label used for skipping this instruction */
--    gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
+-static void arm_gen_condlabel(DisasContext *s)
-+    gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+-{
-     tcg_temp_free_i32(tcg_rmode);
+-    if (!s->condjmp) {
+-        s->condlabel = gen_new_label();
-     tcg_temp_free_i32(tcg_shift);
+-        s->condjmp = 1;
-@@ -XXX,XX +XXX,XX @@ static int disas_vfp_insn(DisasContext *s, uint32_t insn)
+-    }
-                         TCGv_ptr fpst = get_fpstatus_ptr(0);
+-}
-                         TCGv_i32 tcg_rmode;
+-
-                         tcg_rmode = tcg_const_i32(float_round_to_zero);
+ /* Skip this instruction if the ARM condition is false */
--                        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
+ static void arm_skip_unless(DisasContext *s, uint32_t cond)
-+                        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+ {
-                         if (dp) {
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
-                             gen_helper_rintd(cpu_F0d, cpu_F0d, fpst);
+index XXXXXXX..XXXXXXX 100644
-                         } else {
+--- a/target/arm/translate-vfp.c.inc
-                             gen_helper_rints(cpu_F0s, cpu_F0s, fpst);
++++ b/target/arm/translate-vfp.c.inc
-                         }
+@@ -XXX,XX +XXX,XX @@ static bool trans_VLLDM_VLSTM(DisasContext *s, arg_VLLDM_VLSTM *a)
--                        gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
+     return true;
-+                        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
+ }
-                         tcg_temp_free_i32(tcg_rmode);
-                         tcg_temp_free_ptr(fpst);
++static bool trans_VSCCLRM(DisasContext *s, arg_VSCCLRM *a)
-                         break;
++{
 +    int btmreg, topreg;
 +    TCGv_i64 zero;
 +    TCGv_i32 aspen, sfpa;
 +
 +    if (!dc_isar_feature(aa32_m_sec_state, s)) {
 +        /* Before v8.1M, fall through in decode to NOCP check */
 +        return false;
 +    }
 +
 +    /* Explicitly UNDEF because this takes precedence over NOCP */
 +    if (!arm_dc_feature(s, ARM_FEATURE_M_MAIN) || !s->v8m_secure) {
 +        unallocated_encoding(s);
 +        return true;
 +    }
 +
 +    if (!dc_isar_feature(aa32_vfp_simd, s)) {
 +        /* NOP if we have neither FP nor MVE */
 +        return true;
 +    }
 +
 +    /*
 +     * If FPCCR.ASPEN != 0 && CONTROL_S.SFPA == 0 then there is no
 +     * active floating point context so we must NOP (without doing
 +     * any lazy state preservation or the NOCP check).
 +     */
 +    aspen = load_cpu_field(v7m.fpccr[M_REG_S]);
 +    sfpa = load_cpu_field(v7m.control[M_REG_S]);
 +    tcg_gen_andi_i32(aspen, aspen, R_V7M_FPCCR_ASPEN_MASK);
 +    tcg_gen_xori_i32(aspen, aspen, R_V7M_FPCCR_ASPEN_MASK);
 +    tcg_gen_andi_i32(sfpa, sfpa, R_V7M_CONTROL_SFPA_MASK);
 +    tcg_gen_or_i32(sfpa, sfpa, aspen);
 +    arm_gen_condlabel(s);
 +    tcg_gen_brcondi_i32(TCG_COND_EQ, sfpa, 0, s->condlabel);
 +
 +    if (s->fp_excp_el != 0) {
 +        gen_exception_insn(s, s->pc_curr, EXCP_NOCP,
 +                           syn_uncategorized(), s->fp_excp_el);
 +        return true;
 +    }
 +
 +    topreg = a->vd + a->imm - 1;
 +    btmreg = a->vd;
 +
 +    /* Convert to Sreg numbers if the insn specified in Dregs */
 +    if (a->size == 3) {
 +        topreg = topreg * 2 + 1;
 +        btmreg *= 2;
 +    }
 +
 +    if (topreg > 63 || (topreg > 31 && !(topreg & 1))) {
 +        /* UNPREDICTABLE: we choose to undef */
 +        unallocated_encoding(s);
 +        return true;
 +    }
 +
 +    /* Silently ignore requests to clear D16-D31 if they don't exist */
 +    if (topreg > 31 && !dc_isar_feature(aa32_simd_r32, s)) {
 +        topreg = 31;
 +    }
 +
 +    if (!vfp_access_check(s)) {
 +        return true;
 +    }
 +
 +    /* Zero the Sregs from btmreg to topreg inclusive. */
 +    zero = tcg_const_i64(0);
 +    if (btmreg & 1) {
 +        write_neon_element64(zero, btmreg >> 1, 1, MO_32);
 +        btmreg++;
 +    }
 +    for (; btmreg + 1 <= topreg; btmreg += 2) {
 +        write_neon_element64(zero, btmreg >> 1, 0, MO_64);
 +    }
 +    if (btmreg == topreg) {
 +        write_neon_element64(zero, btmreg >> 1, 0, MO_32);
 +        btmreg++;
 +    }
 +    assert(btmreg == topreg + 1);
 +    /* TODO: when MVE is implemented, zero VPR here */
 +    return true;
 +}
 +
  static bool trans_NOCP(DisasContext *s, arg_nocp *a)
  {
      /*
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 24/42] arm/translate-a64: initial decode for simd_two_reg_misc_fp16
+[PULL 16/36] target/arm: Implement CLRM instruction
-From: Alex Bennée <alex.bennee@linaro.org>
+In v8.1M the new CLRM instruction allows zeroing an arbitrary set of
 the general-purpose registers and APSR.  Implement this.
-This actually covers two different sections of the encoding table:
+The encoding is a subset of the LDMIA T2 encoding, using what would
 be Rn=0b1111 (which UNDEFs for LDMIA).
-   Advanced SIMD scalar two-register miscellaneous FP16
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-   Advanced SIMD two-register miscellaneous (FP16)
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-6-peter.maydell@linaro.org
 ---
  target/arm/t32.decode  |  6 +++++-
  target/arm/translate.c | 38 ++++++++++++++++++++++++++++++++++++++
 files changed, 43 insertions(+), 1 deletion(-)
-The difference between the two is covered by a combination of Q (bit
+diff --git a/target/arm/t32.decode b/target/arm/t32.decode
 ) and S (bit 28). Notably the FRINTx instructions are only
 available in the vector form.
 This is just the decode skeleton which will be filled out by later
 patches.
 Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-17-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/translate-a64.c | 40 ++++++++++++++++++++++++++++++++++++++++
 file changed, 40 insertions(+)
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/t32.decode
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/t32.decode
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ UXTAB            1111 1010 0101 .... 1111 .... 10.. ....      @rrr_rot
-     }
  STM_t32          1110 1000 10.0 .... ................         @ldstm i=1 b=0
  STM_t32          1110 1001 00.0 .... ................         @ldstm i=0 b=1
 -LDM_t32          1110 1000 10.1 .... ................         @ldstm i=1 b=0
 +{
 +  # Rn=15 UNDEFs for LDM; M-profile CLRM uses that encoding
 +  CLRM           1110 1000 1001 1111 list:16
 +  LDM_t32        1110 1000 10.1 .... ................         @ldstm i=1 b=0
 +}
  LDM_t32          1110 1001 00.1 .... ................         @ldstm i=0 b=1
  &rfe             !extern rn w pu
 diff --git a/target/arm/translate.c b/target/arm/translate.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate.c
 +++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static bool trans_LDM_t16(DisasContext *s, arg_ldst_block *a)
      return do_ldm(s, a, 1);
  }
-+/* AdvSIMD [scalar] two register miscellaneous (FP16)
++static bool trans_CLRM(DisasContext *s, arg_CLRM *a)
 + *
 + *   31  30  29 28  27     24  23 22 21       17 16    12 11 10 9    5 4    0
 + * +---+---+---+---+---------+---+-------------+--------+-----+------+------+
 + * | 0 | Q | U | S | 1 1 1 0 | a | 1 1 1 1 0 0 | opcode | 1 0 |  Rn  |  Rd  |
 + * +---+---+---+---+---------+---+-------------+--------+-----+------+------+
 + *   mask: 1000 1111 0111 1110 0000 1100 0000 0000 0x8f7e 0c00
 + *   val:  0000 1110 0111 1000 0000 1000 0000 0000 0x0e78 0800
 + *
 + * This actually covers two groups where scalar access is governed by
 + * bit 28. A bunch of the instructions (float to integral) only exist
 + * in the vector form and are un-allocated for the scalar decode. Also
 + * in the scalar decode Q is always 1.
 + */
 +static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
 +{
-+    int fpop, opcode, a;
++    int i;
 +    TCGv_i32 zero;
 +
-+    if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
++    if (!dc_isar_feature(aa32_m_sec_state, s)) {
-+        unallocated_encoding(s);
++        return false;
 +        return;
 +    }
 +
-+    if (!fp_access_check(s)) {
++    if (extract32(a->list, 13, 1)) {
-+        return;
++        return false;
 +    }
 +
-+    opcode = extract32(insn, 12, 4);
++    if (!a->list) {
-+    a = extract32(insn, 23, 1);
++        /* UNPREDICTABLE; we choose to UNDEF */
-+    fpop = deposit32(opcode, 5, 1, a);
++        return false;
 +
 +    switch (fpop) {
 +    default:
 +        fprintf(stderr, "%s: insn %#04x fpop %#2x\n", __func__, insn, fpop);
 +        g_assert_not_reached();
 +    }
 +
++    zero = tcg_const_i32(0);
++    for (i = 0; i < 15; i++) {
++        if (extract32(a->list, i, 1)) {
++            /* Clear R[i] */
++            tcg_gen_mov_i32(cpu_R[i], zero);
++        }
++    }
++    if (extract32(a->list, 15, 1)) {
++        /*
++         * Clear APSR (by calling the MSR helper with the same argument
++         * as for "MSR APSR_nzcvqg, Rn": mask = 0b1100, SYSM=0)
++         */
++        TCGv_i32 maskreg = tcg_const_i32(0xc << 8);
++        gen_helper_v7m_msr(cpu_env, maskreg, zero);
++        tcg_temp_free_i32(maskreg);
++    }
++    tcg_temp_free_i32(zero);
++    return true;
 +}
 +
- /* AdvSIMD scalar x indexed element
+ /*
-  *  31 30  29 28       24 23  22 21  20  19  16 15 12  11  10 9    5 4    0
+  * Branch, branch with link
-  * +-----+---+-----------+------+---+---+------+-----+---+---+------+------+
+  */
@@ -XXX,XX +XXX,XX @@ static const AArch64DecodeTable data_proc_simd[] = {
      { 0xce800000, 0xffe00000, disas_crypto_xar },
      { 0xce408000, 0xffe0c000, disas_crypto_three_reg_imm2 },
      { 0x0e400400, 0x9f60c400, disas_simd_three_reg_same_fp16 },
 +    { 0x0e780800, 0x8f7e0c00, disas_simd_two_reg_misc_fp16 },
      { 0x00000000, 0x00000000, NULL }
  };
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 03/42] xilinx_spips: Use 8 dummy cycles with the QIOR/QIOR4 commands
+[PULL 17/36] target/arm: Enforce M-profile VMRS/VMSR register restrictions
-From: Francisco Iglesias <frasse.iglesias@gmail.com>
+For M-profile before v8.1M, the only valid register for VMSR/VMRS is
 the FPSCR.  We have a comment that states this, but the actual logic
 to forbid accesses for any other register value is missing, so we
 would end up with A-profile style behaviour.  Add the missing check.
-Use 8 dummy cycles (4 dummy bytes) with the QIOR/QIOR4 commands in legacy mode
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-for matching what is expected by Micron (Numonyx) flashes (the default target
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-flash type of the QSPI).
+Message-id: 20201119215617.29887-7-peter.maydell@linaro.org
 ---
  target/arm/translate-vfp.c.inc | 5 ++++-
 file changed, 4 insertions(+), 1 deletion(-)
-Signed-off-by: Francisco Iglesias <frasse.iglesias@gmail.com>
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 Tested-by: Alistair Francis <alistair.francis@xilinx.com>
 Reviewed-by: Alistair Francis <alistair.francis@xilinx.com>
 Message-id: 20180223232233.31482-3-frasse.iglesias@gmail.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  hw/ssi/xilinx_spips.c | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
 diff --git a/hw/ssi/xilinx_spips.c b/hw/ssi/xilinx_spips.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/ssi/xilinx_spips.c
+--- a/target/arm/translate-vfp.c.inc
-+++ b/hw/ssi/xilinx_spips.c
++++ b/target/arm/translate-vfp.c.inc
-@@ -XXX,XX +XXX,XX @@ static int xilinx_spips_num_dummies(XilinxQSPIPS *qs, uint8_t command)
+@@ -XXX,XX +XXX,XX @@ static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
-         return 2;
+          * Accesses to R15 are UNPREDICTABLE; we choose to undef.
-     case QIOR:
+          * (FPSCR -> r15 is a special case which writes to the PSR flags.)
-     case QIOR_4:
+          */
--        return 5;
+-        if (a->rt == 15 && (!a->l || a->reg != ARM_VFP_FPSCR)) {
-+        return 4;
++        if (a->reg != ARM_VFP_FPSCR) {
-     default:
++            return false;
-         return -1;
++        }
 +        if (a->rt == 15 && !a->l) {
              return false;
          }
      }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 04/42] i2c: Fix some brace style issues
+Deleted patch
-From: Corey Minyard <cminyard@mvista.com>
-Signed-off-by: Corey Minyard <cminyard@mvista.com>
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
-Message-id: 20180227104903.21353-2-linus.walleij@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- include/hw/i2c/i2c.h | 6 ++----
- hw/i2c/core.c        | 3 +--
-files changed, 3 insertions(+), 6 deletions(-)
-diff --git a/include/hw/i2c/i2c.h b/include/hw/i2c/i2c.h
-index XXXXXXX..XXXXXXX 100644
---- a/include/hw/i2c/i2c.h
-+++ b/include/hw/i2c/i2c.h
-@@ -XXX,XX +XXX,XX @@ typedef struct I2CSlave I2CSlave;
- #define I2C_SLAVE_GET_CLASS(obj) \
-      OBJECT_GET_CLASS(I2CSlaveClass, (obj), TYPE_I2C_SLAVE)
--typedef struct I2CSlaveClass
--{
-+typedef struct I2CSlaveClass {
-     DeviceClass parent_class;
-     /* Callbacks provided by the device.  */
-@@ -XXX,XX +XXX,XX @@ typedef struct I2CSlaveClass
-     int (*event)(I2CSlave *s, enum i2c_event event);
- } I2CSlaveClass;
--struct I2CSlave
--{
-+struct I2CSlave {
-     DeviceState qdev;
-     /* Remaining fields for internal use by the I2C code.  */
-diff --git a/hw/i2c/core.c b/hw/i2c/core.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/i2c/core.c
-+++ b/hw/i2c/core.c
-@@ -XXX,XX +XXX,XX @@ struct I2CNode {
- #define I2C_BROADCAST 0x00
--struct I2CBus
--{
-+struct I2CBus {
-     BusState qbus;
-     QLIST_HEAD(, I2CNode) current_devs;
-     uint8_t saved_address;
---
-.16.2

-[Qemu-devel] [PULL 08/42] arm/vexpress: Add proper display connector emulation
+Deleted patch
-From: Linus Walleij <linus.walleij@linaro.org>
-This adds the SiI9022 (and implicitly EDID I2C) device to the ARM
-Versatile Express machine, and selects the two I2C devices necessary
-in the arm-softmmu.mak configuration so everything will build
-smoothly.
-I am implementing proper handling of the graphics in the Linux
-kernel and adding proper emulation of SiI9022 and EDID makes the
-driver probe as nicely as before, retrieving the resolutions
-supported by the "QEMU monitor" and overall just working nice.
-Cc: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
-Message-id: 20180227104903.21353-6-linus.walleij@linaro.org
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/vexpress.c               | 6 +++++-
- default-configs/arm-softmmu.mak | 2 ++
-files changed, 7 insertions(+), 1 deletion(-)
-diff --git a/hw/arm/vexpress.c b/hw/arm/vexpress.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/vexpress.c
-+++ b/hw/arm/vexpress.c
-@@ -XXX,XX +XXX,XX @@
- #include "hw/arm/arm.h"
- #include "hw/arm/primecell.h"
- #include "hw/devices.h"
-+#include "hw/i2c/i2c.h"
- #include "net/net.h"
- #include "sysemu/sysemu.h"
- #include "hw/boards.h"
-@@ -XXX,XX +XXX,XX @@ static void vexpress_common_init(MachineState *machine)
-     uint32_t sys_id;
-     DriveInfo *dinfo;
-     pflash_t *pflash0;
-+    I2CBus *i2c;
-     ram_addr_t vram_size, sram_size;
-     MemoryRegion *sysmem = get_system_memory();
-     MemoryRegion *vram = g_new(MemoryRegion, 1);
-@@ -XXX,XX +XXX,XX @@ static void vexpress_common_init(MachineState *machine)
-     sysbus_create_simple("sp804", map[VE_TIMER01], pic[2]);
-     sysbus_create_simple("sp804", map[VE_TIMER23], pic[3]);
--    /* VE_SERIALDVI: not modelled */
-+    dev = sysbus_create_simple("versatile_i2c", map[VE_SERIALDVI], NULL);
-+    i2c = (I2CBus *)qdev_get_child_bus(dev, "i2c");
-+    i2c_create_slave(i2c, "sii9022", 0x39);
-     sysbus_create_simple("pl031", map[VE_RTC], pic[4]); /* RTC */
-diff --git a/default-configs/arm-softmmu.mak b/default-configs/arm-softmmu.mak
-index XXXXXXX..XXXXXXX 100644
---- a/default-configs/arm-softmmu.mak
-+++ b/default-configs/arm-softmmu.mak
-@@ -XXX,XX +XXX,XX @@ CONFIG_STELLARIS_INPUT=y
- CONFIG_STELLARIS_ENET=y
- CONFIG_SSD0303=y
- CONFIG_SSD0323=y
-+CONFIG_DDC=y
-+CONFIG_SII9022=y
- CONFIG_ADS7846=y
- CONFIG_MAX111X=y
- CONFIG_SSI=y
---
-.16.2

-[Qemu-devel] [PULL 09/42] include/exec/helper-head.h: support f16 in helper calls
+Deleted patch
-From: Alex Bennée <alex.bennee@linaro.org>
-This allows us to explicitly pass float16 to helpers rather than
-assuming uint32_t and dealing with the result. Of course they will be
-passed in i32 sized registers by default.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-2-alex.bennee@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- include/exec/helper-head.h | 3 +++
-file changed, 3 insertions(+)
-diff --git a/include/exec/helper-head.h b/include/exec/helper-head.h
-index XXXXXXX..XXXXXXX 100644
---- a/include/exec/helper-head.h
-+++ b/include/exec/helper-head.h
-@@ -XXX,XX +XXX,XX @@
- #define dh_alias_int i32
- #define dh_alias_i64 i64
- #define dh_alias_s64 i64
-+#define dh_alias_f16 i32
- #define dh_alias_f32 i32
- #define dh_alias_f64 i64
- #define dh_alias_ptr ptr
-@@ -XXX,XX +XXX,XX @@
- #define dh_ctype_int int
- #define dh_ctype_i64 uint64_t
- #define dh_ctype_s64 int64_t
-+#define dh_ctype_f16 float16
- #define dh_ctype_f32 float32
- #define dh_ctype_f64 float64
- #define dh_ctype_ptr void *
-@@ -XXX,XX +XXX,XX @@
- #define dh_is_signed_s32 1
- #define dh_is_signed_i64 0
- #define dh_is_signed_s64 1
-+#define dh_is_signed_f16 0
- #define dh_is_signed_f32 0
- #define dh_is_signed_f64 0
- #define dh_is_signed_tl  0
---
-.16.2

-[Qemu-devel] [PULL 38/42] arm/translate-a64: implement simd_scalar_three_reg_same_fp16
+[PULL 18/36] target/arm: Refactor M-profile VMSR/VMRS handling
-From: Alex Bennée <alex.bennee@linaro.org>
+Currently M-profile borrows the A-profile code for VMSR and VMRS
+(access to the FP system registers), because all it needs to support
-This covers the encoding group:
+is the FPSCR.  In v8.1M things become significantly more complicated
+in two ways:
-  Advanced SIMD scalar three same FP16
+ * there are several new FP system registers; some have side effects
-As all the helpers are already there it is simply a case of calling the
+   on read, and one (FPCXT_NS) needs to avoid the usual
-existing helpers in the scalar context.
+   vfp_access_check() and the "only if FPU implemented" check
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+ * all sysregs are now accessible both by VMRS/VMSR (which
    reads/writes a general purpose register) and also by VLDR/VSTR
    (which reads/writes them directly to memory)
 Refactor the structure of how we handle VMSR/VMRS to cope with this:
  * keep the M-profile code entirely separate from the A-profile code
  * abstract out the "read or write the general purpose register" part
    of the code into a loadfn or storefn function pointer, so we can
    reuse it for VLDR/VSTR.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-31-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-8-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 99 ++++++++++++++++++++++++++++++++++++++++++++++
+ target/arm/cpu.h               |   3 +
-file changed, 99 insertions(+)
+ target/arm/translate-vfp.c.inc | 182 ++++++++++++++++++++++++++++++---
+files changed, 171 insertions(+), 14 deletions(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/cpu.h
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_three_reg_same(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ enum arm_cpu_mode {
-     tcg_temp_free_i64(tcg_rd);
+ #define ARM_VFP_FPINST  9
  #define ARM_VFP_FPINST2 10
 +/* QEMU-internal value meaning "FPSCR, but we care only about NZCV" */
 +#define QEMU_VFP_FPSCR_NZCV 0xffff
 +
  /* iwMMXt coprocessor control registers.  */
  #define ARM_IWMMXT_wCID  0
  #define ARM_IWMMXT_wCon  1
 diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-vfp.c.inc
 +++ b/target/arm/translate-vfp.c.inc
@@ -XXX,XX +XXX,XX @@ static bool trans_VDUP(DisasContext *s, arg_VDUP *a)
      return true;
  }
-+/* AdvSIMD scalar three same FP16
++/*
-+ *  31 30  29 28       24 23  22 21 20  16 15 14 13    11 10  9  5 4  0
++ * M-profile provides two different sets of instructions that can
-+ * +-----+---+-----------+---+-----+------+-----+--------+---+----+----+
++ * access floating point system registers: VMSR/VMRS (which move
-+ * | 0 1 | U | 1 1 1 1 0 | a | 1 0 |  Rm  | 0 0 | opcode | 1 | Rn | Rd |
++ * to/from a general purpose register) and VLDR/VSTR sysreg (which
-+ * +-----+---+-----------+---+-----+------+-----+--------+---+----+----+
++ * move directly to/from memory). In some cases there are also side
-+ * v: 0101 1110 0100 0000 0000 0100 0000 0000 => 5e400400
++ * effects which must happen after any write to memory (which could
-+ * m: 1101 1111 0110 0000 1100 0100 0000 0000 => df60c400
++ * cause an exception). So we implement the common logic for the
 + * sysreg access in gen_M_fp_sysreg_write() and gen_M_fp_sysreg_read(),
 + * which take pointers to callback functions which will perform the
 + * actual "read/write general purpose register" and "read/write
 + * memory" operations.
 + */
-+static void disas_simd_scalar_three_reg_same_fp16(DisasContext *s,
++
-+                                                  uint32_t insn)
++/*
-+{
++ * Emit code to store the sysreg to its final destination; frees the
-+    int rd = extract32(insn, 0, 5);
++ * TCG temp 'value' it is passed.
-+    int rn = extract32(insn, 5, 5);
++ */
-+    int opcode = extract32(insn, 11, 3);
++typedef void fp_sysreg_storefn(DisasContext *s, void *opaque, TCGv_i32 value);
-+    int rm = extract32(insn, 16, 5);
++/*
-+    bool u = extract32(insn, 29, 1);
++ * Emit code to load the value to be copied to the sysreg; returns
-+    bool a = extract32(insn, 23, 1);
++ * a new TCG temporary
-+    int fpopcode = opcode | (a << 3) |  (u << 4);
++ */
-+    TCGv_ptr fpst;
++typedef TCGv_i32 fp_sysreg_loadfn(DisasContext *s, void *opaque);
-+    TCGv_i32 tcg_op1;
++
-+    TCGv_i32 tcg_op2;
++/* Common decode/access checks for fp sysreg read/write */
-+    TCGv_i32 tcg_res;
++typedef enum FPSysRegCheckResult {
-+
++    FPSysRegCheckFailed, /* caller should return false */
-+    switch (fpopcode) {
++    FPSysRegCheckDone, /* caller should return true */
-+    case 0x03: /* FMULX */
++    FPSysRegCheckContinue, /* caller should continue generating code */
-+    case 0x04: /* FCMEQ (reg) */
++} FPSysRegCheckResult;
-+    case 0x07: /* FRECPS */
++
-+    case 0x0f: /* FRSQRTS */
++static FPSysRegCheckResult fp_sysreg_checks(DisasContext *s, int regno)
-+    case 0x14: /* FCMGE (reg) */
++{
-+    case 0x15: /* FACGE */
++    if (!dc_isar_feature(aa32_fpsp_v2, s)) {
-+    case 0x1a: /* FABD */
++        return FPSysRegCheckFailed;
-+    case 0x1c: /* FCMGT (reg) */
++    }
-+    case 0x1d: /* FACGT */
++
 +    switch (regno) {
 +    case ARM_VFP_FPSCR:
 +    case QEMU_VFP_FPSCR_NZCV:
 +        break;
 +    default:
-+        unallocated_encoding(s);
++        return FPSysRegCheckFailed;
-+        return;
++    }
-+    }
++
-+
++    if (!vfp_access_check(s)) {
-+    if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
++        return FPSysRegCheckDone;
-+        unallocated_encoding(s);
++    }
-+    }
++
-+
++    return FPSysRegCheckContinue;
-+    if (!fp_access_check(s)) {
++}
-+        return;
++
-+    }
++static bool gen_M_fp_sysreg_write(DisasContext *s, int regno,
 +
-+    fpst = get_fpstatus_ptr(true);
++                                  fp_sysreg_loadfn *loadfn,
-+
++                                 void *opaque)
-+    tcg_op1 = tcg_temp_new_i32();
++{
-+    tcg_op2 = tcg_temp_new_i32();
++    /* Do a write to an M-profile floating point system register */
-+    tcg_res = tcg_temp_new_i32();
++    TCGv_i32 tmp;
 +
-+    read_vec_element_i32(s, tcg_op1, rn, 0, MO_16);
++    switch (fp_sysreg_checks(s, regno)) {
-+    read_vec_element_i32(s, tcg_op2, rm, 0, MO_16);
++    case FPSysRegCheckFailed:
-+
++        return false;
-+    switch (fpopcode) {
++    case FPSysRegCheckDone:
-+    case 0x03: /* FMULX */
++        return true;
-+        gen_helper_advsimd_mulxh(tcg_res, tcg_op1, tcg_op2, fpst);
++    case FPSysRegCheckContinue:
 +        break;
-+    case 0x04: /* FCMEQ (reg) */
++    }
-+        gen_helper_advsimd_ceq_f16(tcg_res, tcg_op1, tcg_op2, fpst);
++
-+        break;
++    switch (regno) {
-+    case 0x07: /* FRECPS */
++    case ARM_VFP_FPSCR:
-+        gen_helper_recpsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
++        tmp = loadfn(s, opaque);
-+        break;
++        gen_helper_vfp_set_fpscr(cpu_env, tmp);
-+    case 0x0f: /* FRSQRTS */
++        tcg_temp_free_i32(tmp);
-+        gen_helper_rsqrtsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
++        gen_lookup_tb(s);
 +        break;
 +    case 0x14: /* FCMGE (reg) */
 +        gen_helper_advsimd_cge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +        break;
 +    case 0x15: /* FACGE */
 +        gen_helper_advsimd_acge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +        break;
 +    case 0x1a: /* FABD */
 +        gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
 +        tcg_gen_andi_i32(tcg_res, tcg_res, 0x7fff);
 +        break;
 +    case 0x1c: /* FCMGT (reg) */
 +        gen_helper_advsimd_cgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +        break;
 +    case 0x1d: /* FACGT */
 +        gen_helper_advsimd_acgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +        break;
 +    default:
 +        g_assert_not_reached();
 +    }
-+
++    return true;
-+    write_fp_sreg(s, rd, tcg_res);
++}
 +
-+
++static bool gen_M_fp_sysreg_read(DisasContext *s, int regno,
-+    tcg_temp_free_i32(tcg_res);
++                                fp_sysreg_storefn *storefn,
-+    tcg_temp_free_i32(tcg_op1);
++                                void *opaque)
-+    tcg_temp_free_i32(tcg_op2);
++{
-+    tcg_temp_free_ptr(fpst);
++    /* Do a read from an M-profile floating point system register */
-+}
++    TCGv_i32 tmp;
 +
- static void handle_2misc_64(DisasContext *s, int opcode, bool u,
++    switch (fp_sysreg_checks(s, regno)) {
-                             TCGv_i64 tcg_rd, TCGv_i64 tcg_rn,
++    case FPSysRegCheckFailed:
-                             TCGv_i32 tcg_rmode, TCGv_ptr tcg_fpstatus)
++        return false;
-@@ -XXX,XX +XXX,XX @@ static const AArch64DecodeTable data_proc_simd[] = {
++    case FPSysRegCheckDone:
-     { 0xce408000, 0xffe0c000, disas_crypto_three_reg_imm2 },
++        return true;
-     { 0x0e400400, 0x9f60c400, disas_simd_three_reg_same_fp16 },
++    case FPSysRegCheckContinue:
-     { 0x0e780800, 0x8f7e0c00, disas_simd_two_reg_misc_fp16 },
++        break;
-+    { 0x5e400400, 0xdf60c400, disas_simd_scalar_three_reg_same_fp16 },
++    }
-     { 0x00000000, 0x00000000, NULL }
++
- };
++    switch (regno) {
++    case ARM_VFP_FPSCR:
 +        tmp = tcg_temp_new_i32();
 +        gen_helper_vfp_get_fpscr(tmp, cpu_env);
 +        storefn(s, opaque, tmp);
 +        break;
 +    case QEMU_VFP_FPSCR_NZCV:
 +        /*
 +         * Read just NZCV; this is a special case to avoid the
 +         * helper call for the "VMRS to CPSR.NZCV" insn.
 +         */
 +        tmp = load_cpu_field(vfp.xregs[ARM_VFP_FPSCR]);
 +        tcg_gen_andi_i32(tmp, tmp, 0xf0000000);
 +        storefn(s, opaque, tmp);
 +        break;
 +    default:
 +        g_assert_not_reached();
 +    }
 +    return true;
 +}
 +
 +static void fp_sysreg_to_gpr(DisasContext *s, void *opaque, TCGv_i32 value)
 +{
 +    arg_VMSR_VMRS *a = opaque;
 +
 +    if (a->rt == 15) {
 +        /* Set the 4 flag bits in the CPSR */
 +        gen_set_nzcv(value);
 +        tcg_temp_free_i32(value);
 +    } else {
 +        store_reg(s, a->rt, value);
 +    }
 +}
 +
 +static TCGv_i32 gpr_to_fp_sysreg(DisasContext *s, void *opaque)
 +{
 +    arg_VMSR_VMRS *a = opaque;
 +
 +    return load_reg(s, a->rt);
 +}
 +
 +static bool gen_M_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
 +{
 +    /*
 +     * Accesses to R15 are UNPREDICTABLE; we choose to undef.
 +     * FPSCR -> r15 is a special case which writes to the PSR flags;
 +     * set a->reg to a special value to tell gen_M_fp_sysreg_read()
 +     * we only care about the top 4 bits of FPSCR there.
 +     */
 +    if (a->rt == 15) {
 +        if (a->l && a->reg == ARM_VFP_FPSCR) {
 +            a->reg = QEMU_VFP_FPSCR_NZCV;
 +        } else {
 +            return false;
 +        }
 +    }
 +
 +    if (a->l) {
 +        /* VMRS, move FP system register to gp register */
 +        return gen_M_fp_sysreg_read(s, a->reg, fp_sysreg_to_gpr, a);
 +    } else {
 +        /* VMSR, move gp register to FP system register */
 +        return gen_M_fp_sysreg_write(s, a->reg, gpr_to_fp_sysreg, a);
 +    }
 +}
 +
  static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
  {
      TCGv_i32 tmp;
      bool ignore_vfp_enabled = false;
 -    if (!dc_isar_feature(aa32_fpsp_v2, s)) {
 -        return false;
 +    if (arm_dc_feature(s, ARM_FEATURE_M)) {
 +        return gen_M_VMSR_VMRS(s, a);
      }
 -    if (arm_dc_feature(s, ARM_FEATURE_M)) {
 -        /*
 -         * The only M-profile VFP vmrs/vmsr sysreg is FPSCR.
 -         * Accesses to R15 are UNPREDICTABLE; we choose to undef.
 -         * (FPSCR -> r15 is a special case which writes to the PSR flags.)
 -         */
 -        if (a->reg != ARM_VFP_FPSCR) {
 -            return false;
 -        }
 -        if (a->rt == 15 && !a->l) {
 -            return false;
 -        }
 +    if (!dc_isar_feature(aa32_fpsp_v2, s)) {
 +        return false;
      }
      switch (a->reg) {
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 18/42] arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16
+[PULL 19/36] target/arm: Move general-use constant expanders up in translate.c
-From: Alex Bennée <alex.bennee@linaro.org>
+The constant-expander functions like negate, plus_2, etc, are
 generally useful; move them up in translate.c so we can use them in
 the VFP/Neon decoders as well as in the A32/T32/T16 decoders.
-These use the generic float16_compare functionality which in turn uses
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-the common float_compare code from the softfloat re-factor.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-9-peter.maydell@linaro.org
 ---
  target/arm/translate.c | 46 +++++++++++++++++++++++-------------------
 file changed, 25 insertions(+), 21 deletions(-)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/target/arm/translate.c b/target/arm/translate.c
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-11-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/helper-a64.h    |  5 +++++
  target/arm/helper-a64.c    | 49 ++++++++++++++++++++++++++++++++++++++++++++++
  target/arm/translate-a64.c | 15 ++++++++++++++
 files changed, 69 insertions(+)
 diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/target/arm/translate.c
-+++ b/target/arm/helper-a64.h
++++ b/target/arm/translate.c
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(advsimd_addh, f16, f16, f16, ptr)
+@@ -XXX,XX +XXX,XX @@ static void arm_gen_condlabel(DisasContext *s)
- DEF_HELPER_3(advsimd_subh, f16, f16, f16, ptr)
+     }
- DEF_HELPER_3(advsimd_mulh, f16, f16, f16, ptr)
+ }
- DEF_HELPER_3(advsimd_divh, f16, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_ceq_f16, i32, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_cge_f16, i32, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_cgt_f16, i32, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_acge_f16, i32, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_acgt_f16, i32, f16, f16, ptr)
 diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.c
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ ADVSIMD_HALFOP(min)
  ADVSIMD_HALFOP(max)
  ADVSIMD_HALFOP(minnum)
  ADVSIMD_HALFOP(maxnum)
 +
 +/*
-+ * Floating point comparisons produce an integer result. Softfloat
++ * Constant expanders for the decoders.
 + * routines return float_relation types which we convert to the 0/-1
 + * Neon requires.
 + */
 +
-+#define ADVSIMD_CMPRES(test) (test) ? 0xffff : 0
++static int negate(DisasContext *s, int x)
 +
 +uint32_t HELPER(advsimd_ceq_f16)(float16 a, float16 b, void *fpstp)
 +{
-+    float_status *fpst = fpstp;
++    return -x;
 +    int compare = float16_compare_quiet(a, b, fpst);
 +    return ADVSIMD_CMPRES(compare == float_relation_equal);
 +}
 +
-+uint32_t HELPER(advsimd_cge_f16)(float16 a, float16 b, void *fpstp)
++static int plus_2(DisasContext *s, int x)
 +{
-+    float_status *fpst = fpstp;
++    return x + 2;
 +    int compare = float16_compare(a, b, fpst);
 +    return ADVSIMD_CMPRES(compare == float_relation_greater ||
 +                          compare == float_relation_equal);
 +}
 +
-+uint32_t HELPER(advsimd_cgt_f16)(float16 a, float16 b, void *fpstp)
++static int times_2(DisasContext *s, int x)
 +{
-+    float_status *fpst = fpstp;
++    return x * 2;
 +    int compare = float16_compare(a, b, fpst);
 +    return ADVSIMD_CMPRES(compare == float_relation_greater);
 +}
 +
-+uint32_t HELPER(advsimd_acge_f16)(float16 a, float16 b, void *fpstp)
++static int times_4(DisasContext *s, int x)
 +{
-+    float_status *fpst = fpstp;
++    return x * 4;
 +    float16 f0 = float16_abs(a);
 +    float16 f1 = float16_abs(b);
 +    int compare = float16_compare(f0, f1, fpst);
 +    return ADVSIMD_CMPRES(compare == float_relation_greater ||
 +                          compare == float_relation_equal);
 +}
 +
-+uint32_t HELPER(advsimd_acgt_f16)(float16 a, float16 b, void *fpstp)
+ /* Flags for the disas_set_da_iss info argument:
-+{
+  * lower bits hold the Rt register number, higher bits are flags.
-+    float_status *fpst = fpstp;
+  */
-+    float16 f0 = float16_abs(a);
+@@ -XXX,XX +XXX,XX @@ static void arm_skip_unless(DisasContext *s, uint32_t cond)
-+    float16 f1 = float16_abs(b);
-+    int compare = float16_compare(f0, f1, fpst);
-+    return ADVSIMD_CMPRES(compare == float_relation_greater);
+ /*
-+}
+- * Constant expanders for the decoders.
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
++ * Constant expanders used by T16/T32 decode
-index XXXXXXX..XXXXXXX 100644
+  */
---- a/target/arm/translate-a64.c
-+++ b/target/arm/translate-a64.c
+-static int negate(DisasContext *s, int x)
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+-{
-         case 0x2: /* FADD */
+-    return -x;
-             gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
+-}
-             break;
+-
-+        case 0x4: /* FCMEQ */
+-static int plus_2(DisasContext *s, int x)
-+            gen_helper_advsimd_ceq_f16(tcg_res, tcg_op1, tcg_op2, fpst);
+-{
-+            break;
+-    return x + 2;
-         case 0x6: /* FMAX */
+-}
-             gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
+-
-             break;
+-static int times_2(DisasContext *s, int x)
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+-{
-         case 0x13: /* FMUL */
+-    return x * 2;
-             gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
+-}
-             break;
+-
-+        case 0x14: /* FCMGE */
+-static int times_4(DisasContext *s, int x)
-+            gen_helper_advsimd_cge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
+-{
-+            break;
+-    return x * 4;
-+        case 0x15: /* FACGE */
+-}
-+            gen_helper_advsimd_acge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
+-
-+            break;
+ /* Return only the rotation part of T32ExpandImm.  */
-         case 0x17: /* FDIV */
+ static int t32_expandimm_rot(DisasContext *s, int x)
-             gen_helper_advsimd_divh(tcg_res, tcg_op1, tcg_op2, fpst);
+ {
              break;
@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
              gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
              tcg_gen_andi_i32(tcg_res, tcg_res, 0x7fff);
              break;
 +        case 0x1c: /* FCMGT */
 +            gen_helper_advsimd_cgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +            break;
 +        case 0x1d: /* FACGT */
 +            gen_helper_advsimd_acgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +            break;
          default:
              fprintf(stderr, "%s: insn %#04x, fpop %#2x @ %#" PRIx64 "\n",
                      __func__, insn, fpopcode, s->pc);
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 32/42] arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16
+[PULL 20/36] target/arm: Implement VLDR/VSTR system register
-From: Alex Bennée <alex.bennee@linaro.org>
+Implement the new-in-v8.1M VLDR/VSTR variants which directly
 read or write FP system registers to memory.
-We go with the localised helper.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-10-peter.maydell@linaro.org
 ---
  target/arm/vfp.decode          | 14 ++++++
  target/arm/translate-vfp.c.inc | 91 ++++++++++++++++++++++++++++++++++
 files changed, 105 insertions(+)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-25-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/helper-a64.h    |  1 +
  target/arm/helper-a64.c    | 29 +++++++++++++++++++++++++++++
  target/arm/translate-a64.c |  4 ++++
 files changed, 34 insertions(+)
 diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/target/arm/vfp.decode
-+++ b/target/arm/helper-a64.h
++++ b/target/arm/vfp.decode
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_1(neon_addlp_s16, TCG_CALL_NO_RWG_SE, i64, i64)
+@@ -XXX,XX +XXX,XX @@ VLDR_VSTR_hp ---- 1101 u:1 .0 l:1 rn:4 .... 1001 imm:8      vd=%vd_sp
- DEF_HELPER_FLAGS_1(neon_addlp_u16, TCG_CALL_NO_RWG_SE, i64, i64)
+ VLDR_VSTR_sp ---- 1101 u:1 .0 l:1 rn:4 .... 1010 imm:8      vd=%vd_sp
- DEF_HELPER_FLAGS_2(frecpx_f64, TCG_CALL_NO_RWG, f64, f64, ptr)
+ VLDR_VSTR_dp ---- 1101 u:1 .0 l:1 rn:4 .... 1011 imm:8      vd=%vd_dp
- DEF_HELPER_FLAGS_2(frecpx_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
-+DEF_HELPER_FLAGS_2(frecpx_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
++# M-profile VLDR/VSTR to sysreg
- DEF_HELPER_FLAGS_2(fcvtx_f64_to_f32, TCG_CALL_NO_RWG, f32, f64, env)
++%vldr_sysreg 22:1 13:3
- DEF_HELPER_FLAGS_3(crc32_64, TCG_CALL_NO_RWG_SE, i64, i64, i64, i32)
++%imm7_0x4 0:7 !function=times_4
- DEF_HELPER_FLAGS_3(crc32c_64, TCG_CALL_NO_RWG_SE, i64, i64, i64, i32)
++
-diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
++&vldr_sysreg rn reg imm a w p
 +@vldr_sysreg .... ... . a:1 . . . rn:4 ... . ... .. ....... \
 +             reg=%vldr_sysreg imm=%imm7_0x4 &vldr_sysreg
 +
 +# P=0 W=0 is SEE "Related encodings", so split into two patterns
 +VLDR_sysreg  ---- 110 1 . . w:1 1 .... ... 0 111 11 ....... @vldr_sysreg p=1
 +VLDR_sysreg  ---- 110 0 . . 1   1 .... ... 0 111 11 ....... @vldr_sysreg p=0 w=1
 +VSTR_sysreg  ---- 110 1 . . w:1 0 .... ... 0 111 11 ....... @vldr_sysreg p=1
 +VSTR_sysreg  ---- 110 0 . . 1   0 .... ... 0 111 11 ....... @vldr_sysreg p=0 w=1
 +
  # We split the load/store multiple up into two patterns to avoid
  # overlap with other insns in the "Advanced SIMD load/store and 64-bit move"
  # grouping:
 diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.c
+--- a/target/arm/translate-vfp.c.inc
-+++ b/target/arm/helper-a64.c
++++ b/target/arm/translate-vfp.c.inc
-@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(neon_addlp_u16)(uint64_t a)
+@@ -XXX,XX +XXX,XX @@ static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
      return true;
  }
- /* Floating-point reciprocal exponent - see FPRecpX in ARM ARM */
++static void fp_sysreg_to_memory(DisasContext *s, void *opaque, TCGv_i32 value)
 +float16 HELPER(frecpx_f16)(float16 a, void *fpstp)
 +{
-+    float_status *fpst = fpstp;
++    arg_vldr_sysreg *a = opaque;
-+    uint16_t val16, sbit;
++    uint32_t offset = a->imm;
-+    int16_t exp;
++    TCGv_i32 addr;
 +
-+    if (float16_is_any_nan(a)) {
++    if (!a->a) {
-+        float16 nan = a;
++        offset = - offset;
 +        if (float16_is_signaling_nan(a, fpst)) {
 +            float_raise(float_flag_invalid, fpst);
 +            nan = float16_maybe_silence_nan(a, fpst);
 +        }
 +        if (fpst->default_nan_mode) {
 +            nan = float16_default_nan(fpst);
 +        }
 +        return nan;
 +    }
 +
-+    val16 = float16_val(a);
++    addr = load_reg(s, a->rn);
-+    sbit = 0x8000 & val16;
++    if (a->p) {
-+    exp = extract32(val16, 10, 5);
++        tcg_gen_addi_i32(addr, addr, offset);
 +    }
 +
-+    if (exp == 0) {
++    if (s->v8m_stackcheck && a->rn == 13 && a->w) {
-+        return make_float16(deposit32(sbit, 10, 5, 0x1e));
++        gen_helper_v8m_stackcheck(cpu_env, addr);
 +    }
 +
 +    gen_aa32_st_i32(s, value, addr, get_mem_index(s),
 +                    MO_UL | MO_ALIGN | s->be_data);
 +    tcg_temp_free_i32(value);
 +
 +    if (a->w) {
 +        /* writeback */
 +        if (!a->p) {
 +            tcg_gen_addi_i32(addr, addr, offset);
 +        }
 +        store_reg(s, a->rn, addr);
 +    } else {
-+        return make_float16(deposit32(sbit, 10, 5, ~exp));
++        tcg_temp_free_i32(addr);
 +    }
 +}
 +
- float32 HELPER(frecpx_f32)(float32 a, void *fpstp)
++static TCGv_i32 memory_to_fp_sysreg(DisasContext *s, void *opaque)
 +{
 +    arg_vldr_sysreg *a = opaque;
 +    uint32_t offset = a->imm;
 +    TCGv_i32 addr;
 +    TCGv_i32 value = tcg_temp_new_i32();
 +
 +    if (!a->a) {
 +        offset = - offset;
 +    }
 +
 +    addr = load_reg(s, a->rn);
 +    if (a->p) {
 +        tcg_gen_addi_i32(addr, addr, offset);
 +    }
 +
 +    if (s->v8m_stackcheck && a->rn == 13 && a->w) {
 +        gen_helper_v8m_stackcheck(cpu_env, addr);
 +    }
 +
 +    gen_aa32_ld_i32(s, value, addr, get_mem_index(s),
 +                    MO_UL | MO_ALIGN | s->be_data);
 +
 +    if (a->w) {
 +        /* writeback */
 +        if (!a->p) {
 +            tcg_gen_addi_i32(addr, addr, offset);
 +        }
 +        store_reg(s, a->rn, addr);
 +    } else {
 +        tcg_temp_free_i32(addr);
 +    }
 +    return value;
 +}
 +
 +static bool trans_VLDR_sysreg(DisasContext *s, arg_vldr_sysreg *a)
 +{
 +    if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
 +        return false;
 +    }
 +    if (a->rn == 15) {
 +        return false;
 +    }
 +    return gen_M_fp_sysreg_write(s, a->reg, memory_to_fp_sysreg, a);
 +}
 +
 +static bool trans_VSTR_sysreg(DisasContext *s, arg_vldr_sysreg *a)
 +{
 +    if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
 +        return false;
 +    }
 +    if (a->rn == 15) {
 +        return false;
 +    }
 +    return gen_M_fp_sysreg_read(s, a->reg, fp_sysreg_to_memory, a);
 +}
 +
  static bool trans_VMOV_half(DisasContext *s, arg_VMOV_single *a)
  {
-     float_status *fpst = fpstp;
+     TCGv_i32 tmp;
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
          handle_2misc_fcmp_zero(s, fpop, is_scalar, 0, is_q, MO_16, rn, rd);
          return;
      case 0x3d: /* FRECPE */
 +    case 0x3f: /* FRECPX */
          break;
      case 0x18: /* FRINTN */
          need_rmode = true;
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
          case 0x3d: /* FRECPE */
              gen_helper_recpe_f16(tcg_res, tcg_op, tcg_fpstatus);
              break;
 +        case 0x3f: /* FRECPX */
 +            gen_helper_frecpx_f16(tcg_res, tcg_op, tcg_fpstatus);
 +            break;
          case 0x5a: /* FCVTNU */
          case 0x5b: /* FCVTMU */
          case 0x5c: /* FCVTAU */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 10/42] target/arm/cpu64: introduce ARM_V8_FP16 feature bit
+[PULL 21/36] target/arm: Implement M-profile FPSCR_nzcvqc
-From: Alex Bennée <alex.bennee@linaro.org>
+v8.1M defines a new FP system register FPSCR_nzcvqc; this behaves
 like the existing FPSCR, except that it reads and writes only bits
 [31:27] of the FPSCR (the N, Z, C, V and QC flag bits).  (Unlike the
 FPSCR, the special case for Rt=15 of writing the CPSR.NZCV is not
 permitted.)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Implement the register.  Since we don't yet implement MVE, we handle
 the QC bit as RES0, with todo comments for where we will need to add
 support later.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-3-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-11-peter.maydell@linaro.org
 [PMM: postpone actually enabling feature until end of the
  patch series]
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h | 1 +
+ target/arm/cpu.h               | 13 +++++++++++++
-file changed, 1 insertion(+)
+ target/arm/translate-vfp.c.inc | 27 +++++++++++++++++++++++++++
 files changed, 40 insertions(+)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ enum arm_features {
+@@ -XXX,XX +XXX,XX @@ void vfp_set_fpscr(CPUARMState *env, uint32_t val);
-     ARM_FEATURE_V8_SHA3, /* implements SHA3 part of v8 Crypto Extensions */
+ #define FPCR_FZ     (1 << 24)   /* Flush-to-zero enable bit */
-     ARM_FEATURE_V8_SM3, /* implements SM3 part of v8 Crypto Extensions */
+ #define FPCR_DN     (1 << 25)   /* Default NaN enable bit */
-     ARM_FEATURE_V8_SM4, /* implements SM4 part of v8 Crypto Extensions */
+ #define FPCR_QC     (1 << 27)   /* Cumulative saturation bit */
-+    ARM_FEATURE_V8_FP16, /* implements v8.2 half-precision float */
++#define FPCR_V      (1 << 28)   /* FP overflow flag */
- };
++#define FPCR_C      (1 << 29)   /* FP carry flag */
++#define FPCR_Z      (1 << 30)   /* FP zero flag */
- static inline int arm_feature(CPUARMState *env, int feature)
++#define FPCR_N      (1 << 31)   /* FP negative flag */
 +
 +#define FPCR_NZCV_MASK (FPCR_N | FPCR_Z | FPCR_C | FPCR_V)
 +#define FPCR_NZCVQC_MASK (FPCR_NZCV_MASK | FPCR_QC)
  static inline uint32_t vfp_get_fpsr(CPUARMState *env)
  {
@@ -XXX,XX +XXX,XX @@ enum arm_cpu_mode {
  #define ARM_VFP_FPEXC   8
  #define ARM_VFP_FPINST  9
  #define ARM_VFP_FPINST2 10
 +/* These ones are M-profile only */
 +#define ARM_VFP_FPSCR_NZCVQC 2
 +#define ARM_VFP_VPR 12
 +#define ARM_VFP_P0 13
 +#define ARM_VFP_FPCXT_NS 14
 +#define ARM_VFP_FPCXT_S 15
  /* QEMU-internal value meaning "FPSCR, but we care only about NZCV" */
  #define QEMU_VFP_FPSCR_NZCV 0xffff
 diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-vfp.c.inc
 +++ b/target/arm/translate-vfp.c.inc
@@ -XXX,XX +XXX,XX @@ static FPSysRegCheckResult fp_sysreg_checks(DisasContext *s, int regno)
      case ARM_VFP_FPSCR:
      case QEMU_VFP_FPSCR_NZCV:
          break;
 +    case ARM_VFP_FPSCR_NZCVQC:
 +        if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
 +            return false;
 +        }
 +        break;
      default:
          return FPSysRegCheckFailed;
      }
@@ -XXX,XX +XXX,XX @@ static bool gen_M_fp_sysreg_write(DisasContext *s, int regno,
          tcg_temp_free_i32(tmp);
          gen_lookup_tb(s);
          break;
 +    case ARM_VFP_FPSCR_NZCVQC:
 +    {
 +        TCGv_i32 fpscr;
 +        tmp = loadfn(s, opaque);
 +        /*
 +         * TODO: when we implement MVE, write the QC bit.
 +         * For non-MVE, QC is RES0.
 +         */
 +        tcg_gen_andi_i32(tmp, tmp, FPCR_NZCV_MASK);
 +        fpscr = load_cpu_field(vfp.xregs[ARM_VFP_FPSCR]);
 +        tcg_gen_andi_i32(fpscr, fpscr, ~FPCR_NZCV_MASK);
 +        tcg_gen_or_i32(fpscr, fpscr, tmp);
 +        store_cpu_field(fpscr, vfp.xregs[ARM_VFP_FPSCR]);
 +        tcg_temp_free_i32(tmp);
 +        break;
 +    }
      default:
          g_assert_not_reached();
      }
@@ -XXX,XX +XXX,XX @@ static bool gen_M_fp_sysreg_read(DisasContext *s, int regno,
          gen_helper_vfp_get_fpscr(tmp, cpu_env);
          storefn(s, opaque, tmp);
          break;
 +    case ARM_VFP_FPSCR_NZCVQC:
 +        /*
 +         * TODO: MVE has a QC bit, which we probably won't store
 +         * in the xregs[] field. For non-MVE, where QC is RES0,
 +         * we can just fall through to the FPSCR_NZCV case.
 +         */
      case QEMU_VFP_FPSCR_NZCV:
          /*
           * Read just NZCV; this is a special case to avoid the
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 35/42] arm/translate-a64: add FP16 FRSQRTE to simd_two_reg_misc_fp16
+[PULL 22/36] target/arm: Use new FPCR_NZCV_MASK constant
-From: Alex Bennée <alex.bennee@linaro.org>
+We defined a constant name for the mask of NZCV bits in the FPCR/FPSCR
 in the previous commit; use it in a couple of places in existing code,
 where we're masking out everything except NZCV for the "load to Rt=15
 sets CPSR.NZCV" special case.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-28-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-12-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 7 +++++++
+ target/arm/translate-vfp.c.inc | 4 ++--
-file changed, 7 insertions(+)
+file changed, 2 insertions(+), 2 deletions(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/translate-vfp.c.inc
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/translate-vfp.c.inc
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static bool gen_M_fp_sysreg_read(DisasContext *s, int regno,
-     case 0x6f: /* FNEG */
+          * helper call for the "VMRS to CPSR.NZCV" insn.
-         need_fpst = false;
+          */
-         break;
+         tmp = load_cpu_field(vfp.xregs[ARM_VFP_FPSCR]);
-+    case 0x7d: /* FRSQRTE */
+-        tcg_gen_andi_i32(tmp, tmp, 0xf0000000);
-     case 0x7f: /* FSQRT (vector) */
++        tcg_gen_andi_i32(tmp, tmp, FPCR_NZCV_MASK);
          storefn(s, opaque, tmp);
          break;
      default:
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
-         case 0x6f: /* FNEG */
+         case ARM_VFP_FPSCR:
-             tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
+             if (a->rt == 15) {
-             break;
+                 tmp = load_cpu_field(vfp.xregs[ARM_VFP_FPSCR]);
-+        case 0x7d: /* FRSQRTE */
+-                tcg_gen_andi_i32(tmp, tmp, 0xf0000000);
-+            gen_helper_rsqrte_f16(tcg_res, tcg_op, tcg_fpstatus);
++                tcg_gen_andi_i32(tmp, tmp, FPCR_NZCV_MASK);
-+            break;
+             } else {
-         default:
+                 tmp = tcg_temp_new_i32();
-             g_assert_not_reached();
+                 gen_helper_vfp_get_fpscr(tmp, cpu_env);
          }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
              case 0x6f: /* FNEG */
                  tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
                  break;
 +            case 0x7d: /* FRSQRTE */
 +                gen_helper_rsqrte_f16(tcg_res, tcg_op, tcg_fpstatus);
 +                break;
              case 0x7f: /* FSQRT */
                  gen_helper_sqrt_f16(tcg_res, tcg_op, tcg_fpstatus);
                  break;
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 23/42] arm/translate-a64: add FP16 x2 ops for simd_indexed
+[PULL 23/36] target/arm: Factor out preserve-fp-state from full_vfp_access_check()
-From: Alex Bennée <alex.bennee@linaro.org>
+Factor out the code which handles M-profile lazy FP state preservation
 from full_vfp_access_check(); accesses to the FPCXT_NS register are
 a special case which need to do just this part (corresponding in the
 pseudocode to the PreserveFPState() function), and not the full
 set of actions matching the pseudocode ExecuteFPCheck() which
 normal FP instructions need to do.
-A bunch of the vectorised bitwise operations just operate on larger
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-chunks at a time. We can do the same for the new half-precision
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-operations by introducing some TWOHALFOP helpers which work on each
+Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
-half of a pair of half-precision operations at once.
+Message-id: 20201119215617.29887-13-peter.maydell@linaro.org
 ---
  target/arm/translate-vfp.c.inc | 45 ++++++++++++++++++++--------------
 file changed, 27 insertions(+), 18 deletions(-)
-Hopefully all this hoop jumping will get simpler once we have
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 generically vectorised helpers here.
 Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-16-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/helper-a64.h    | 10 ++++++++++
  target/arm/helper-a64.c    | 46 +++++++++++++++++++++++++++++++++++++++++++++-
  target/arm/translate-a64.c | 26 +++++++++++++++++++++-----
 files changed, 76 insertions(+), 6 deletions(-)
 diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/target/arm/translate-vfp.c.inc
-+++ b/target/arm/helper-a64.h
++++ b/target/arm/translate-vfp.c.inc
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(advsimd_acge_f16, i32, f16, f16, ptr)
+@@ -XXX,XX +XXX,XX @@ static inline long vfp_f16_offset(unsigned reg, bool top)
- DEF_HELPER_3(advsimd_acgt_f16, i32, f16, f16, ptr)
+     return offs;
  DEF_HELPER_3(advsimd_mulxh, f16, f16, f16, ptr)
  DEF_HELPER_4(advsimd_muladdh, f16, f16, f16, f16, ptr)
 +DEF_HELPER_3(advsimd_add2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_sub2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_mul2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_div2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_max2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_min2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_maxnum2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_minnum2h, i32, i32, i32, ptr)
 +DEF_HELPER_3(advsimd_mulx2h, i32, i32, i32, ptr)
 +DEF_HELPER_4(advsimd_muladd2h, i32, i32, i32, i32, ptr)
 diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.c
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ ADVSIMD_HALFOP(max)
  ADVSIMD_HALFOP(minnum)
  ADVSIMD_HALFOP(maxnum)
 +#define ADVSIMD_TWOHALFOP(name)                                         \
 +uint32_t ADVSIMD_HELPER(name, 2h)(uint32_t two_a, uint32_t two_b, void *fpstp) \
 +{ \
 +    float16  a1, a2, b1, b2;                        \
 +    uint32_t r1, r2;                                \
 +    float_status *fpst = fpstp;                     \
 +    a1 = extract32(two_a, 0, 16);                   \
 +    a2 = extract32(two_a, 16, 16);                  \
 +    b1 = extract32(two_b, 0, 16);                   \
 +    b2 = extract32(two_b, 16, 16);                  \
 +    r1 = float16_ ## name(a1, b1, fpst);            \
 +    r2 = float16_ ## name(a2, b2, fpst);            \
 +    return deposit32(r1, 16, 16, r2);               \
 +}
 +
 +ADVSIMD_TWOHALFOP(add)
 +ADVSIMD_TWOHALFOP(sub)
 +ADVSIMD_TWOHALFOP(mul)
 +ADVSIMD_TWOHALFOP(div)
 +ADVSIMD_TWOHALFOP(min)
 +ADVSIMD_TWOHALFOP(max)
 +ADVSIMD_TWOHALFOP(minnum)
 +ADVSIMD_TWOHALFOP(maxnum)
 +
  /* Data processing - scalar floating-point and advanced SIMD */
 -float16 HELPER(advsimd_mulxh)(float16 a, float16 b, void *fpstp)
 +static float16 float16_mulx(float16 a, float16 b, void *fpstp)
  {
      float_status *fpst = fpstp;
@@ -XXX,XX +XXX,XX @@ float16 HELPER(advsimd_mulxh)(float16 a, float16 b, void *fpstp)
      return float16_mul(a, b, fpst);
  }
-+ADVSIMD_HALFOP(mulx)
++/*
-+ADVSIMD_TWOHALFOP(mulx)
++ * Generate code for M-profile lazy FP state preservation if needed;
-+
++ * this corresponds to the pseudocode PreserveFPState() function.
- /* fused multiply-accumulate */
++ */
- float16 HELPER(advsimd_muladdh)(float16 a, float16 b, float16 c, void *fpstp)
++static void gen_preserve_fp_state(DisasContext *s)
  {
@@ -XXX,XX +XXX,XX @@ float16 HELPER(advsimd_muladdh)(float16 a, float16 b, float16 c, void *fpstp)
      return float16_muladd(a, b, c, 0, fpst);
  }
 +uint32_t HELPER(advsimd_muladd2h)(uint32_t two_a, uint32_t two_b,
 +                                  uint32_t two_c, void *fpstp)
 +{
-+    float_status *fpst = fpstp;
++    if (s->v7m_lspact) {
-+    float16  a1, a2, b1, b2, c1, c2;
++        /*
-+    uint32_t r1, r2;
++         * Lazy state saving affects external memory and also the NVIC,
-+    a1 = extract32(two_a, 0, 16);
++         * so we must mark it as an IO operation for icount (and cause
-+    a2 = extract32(two_a, 16, 16);
++         * this to be the last insn in the TB).
-+    b1 = extract32(two_b, 0, 16);
++         */
-+    b2 = extract32(two_b, 16, 16);
++        if (tb_cflags(s->base.tb) & CF_USE_ICOUNT) {
-+    c1 = extract32(two_c, 0, 16);
++            s->base.is_jmp = DISAS_UPDATE_EXIT;
-+    c2 = extract32(two_c, 16, 16);
++            gen_io_start();
-+    r1 = float16_muladd(a1, b1, c1, 0, fpst);
++        }
-+    r2 = float16_muladd(a2, b2, c2, 0, fpst);
++        gen_helper_v7m_preserve_fp_state(cpu_env);
-+    return deposit32(r1, 16, 16, r2);
++        /*
 +         * If the preserve_fp_state helper doesn't throw an exception
 +         * then it will clear LSPACT; we don't need to repeat this for
 +         * any further FP insns in this TB.
 +         */
 +        s->v7m_lspact = false;
 +    }
 +}
 +
  /*
-  * Floating point comparisons produce an integer result. Softfloat
+  * Check that VFP access is enabled. If it is, do the necessary
-  * routines return float_relation types which we convert to the 0/-1
+  * M-profile lazy-FP handling and then return true.
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+@@ -XXX,XX +XXX,XX @@ static bool full_vfp_access_check(DisasContext *s, bool ignore_vfp_enabled)
-index XXXXXXX..XXXXXXX 100644
+         /* Handle M-profile lazy FP state mechanics */
---- a/target/arm/translate-a64.c
-+++ b/target/arm/translate-a64.c
+         /* Trigger lazy-state preservation if necessary */
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
+-        if (s->v7m_lspact) {
-                          * multiply-add */
+-            /*
-                         tcg_gen_xori_i32(tcg_op, tcg_op, 0x80008000);
+-             * Lazy state saving affects external memory and also the NVIC,
-                     }
+-             * so we must mark it as an IO operation for icount (and cause
--                    gen_helper_advsimd_muladdh(tcg_res, tcg_op, tcg_idx,
+-             * this to be the last insn in the TB).
--                                               tcg_res, fpst);
+-             */
-+                    if (is_scalar) {
+-            if (tb_cflags(s->base.tb) & CF_USE_ICOUNT) {
-+                        gen_helper_advsimd_muladdh(tcg_res, tcg_op, tcg_idx,
+-                s->base.is_jmp = DISAS_UPDATE_EXIT;
-+                                                   tcg_res, fpst);
+-                gen_io_start();
-+                    } else {
+-            }
-+                        gen_helper_advsimd_muladd2h(tcg_res, tcg_op, tcg_idx,
+-            gen_helper_v7m_preserve_fp_state(cpu_env);
-+                                                    tcg_res, fpst);
+-            /*
-+                    }
+-             * If the preserve_fp_state helper doesn't throw an exception
-                     break;
+-             * then it will clear LSPACT; we don't need to repeat this for
-                 case 2:
+-             * any further FP insns in this TB.
-                     if (opcode == 0x5) {
+-             */
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
+-            s->v7m_lspact = false;
-                 switch (size) {
+-        }
-                 case 1:
++        gen_preserve_fp_state(s);
-                     if (u) {
--                        gen_helper_advsimd_mulxh(tcg_res, tcg_op, tcg_idx,
+         /* Update ownership of FP context: set FPCCR.S to match current state */
--                                                 fpst);
+         if (s->v8m_fpccr_s_wrong) {
 +                        if (is_scalar) {
 +                            gen_helper_advsimd_mulxh(tcg_res, tcg_op,
 +                                                     tcg_idx, fpst);
 +                        } else {
 +                            gen_helper_advsimd_mulx2h(tcg_res, tcg_op,
 +                                                      tcg_idx, fpst);
 +                        }
                      } else {
 -                        g_assert_not_reached();
 +                        if (is_scalar) {
 +                            gen_helper_advsimd_mulh(tcg_res, tcg_op,
 +                                                    tcg_idx, fpst);
 +                        } else {
 +                            gen_helper_advsimd_mul2h(tcg_res, tcg_op,
 +                                                     tcg_idx, fpst);
 +                        }
                      }
                      break;
                  case 2:
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 33/42] arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16
+[PULL 24/36] target/arm: Implement FPCXT_S fp system register
-From: Alex Bennée <alex.bennee@linaro.org>
+Implement the new-in-v8.1M FPCXT_S floating point system register.
 This is for saving and restoring the secure floating point context,
 and it reads and writes bits [27:0] from the FPSCR and the
 CONTROL.SFPA bit in bit [31].
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-26-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-14-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper-a64.h    |  1 +
+ target/arm/translate-vfp.c.inc | 58 ++++++++++++++++++++++++++++++++++
- target/arm/helper-a64.c    | 13 +++++++++++++
+file changed, 58 insertions(+)
  target/arm/translate-a64.c |  5 +++++
 files changed, 19 insertions(+)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/target/arm/translate-vfp.c.inc
-+++ b/target/arm/helper-a64.h
++++ b/target/arm/translate-vfp.c.inc
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_2(advsimd_rinth_exact, f16, f16, ptr)
+@@ -XXX,XX +XXX,XX @@ static FPSysRegCheckResult fp_sysreg_checks(DisasContext *s, int regno)
- DEF_HELPER_2(advsimd_rinth, f16, f16, ptr)
+             return false;
- DEF_HELPER_2(advsimd_f16tosinth, i32, f16, ptr)
+         }
  DEF_HELPER_2(advsimd_f16touinth, i32, f16, ptr)
 +DEF_HELPER_2(sqrt_f16, f16, f16, ptr)
 diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.c
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(advsimd_f16touinth)(float16 a, void *fpstp)
      }
      return float16_to_uint16(a, fpst);
  }
 +
 +/*
 + * Square Root and Reciprocal square root
 + */
 +
 +float16 HELPER(sqrt_f16)(float16 a, void *fpstp)
 +{
 +    float_status *s = fpstp;
 +
 +    return float16_sqrt(a, s);
 +}
 +
 +
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-a64.c
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
      case 0x6f: /* FNEG */
          need_fpst = false;
          break;
-+    case 0x7f: /* FSQRT (vector) */
++    case ARM_VFP_FPCXT_S:
 +        if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
 +            return false;
 +        }
 +        if (!s->v8m_secure) {
 +            return false;
 +        }
 +        break;
      default:
-         fprintf(stderr, "%s: insn %#04x fpop %#2x\n", __func__, insn, fpop);
+         return FPSysRegCheckFailed;
      }
@@ -XXX,XX +XXX,XX @@ static bool gen_M_fp_sysreg_write(DisasContext *s, int regno,
          tcg_temp_free_i32(tmp);
          break;
      }
 +    case ARM_VFP_FPCXT_S:
 +    {
 +        TCGv_i32 sfpa, control, fpscr;
 +        /* Set FPSCR[27:0] and CONTROL.SFPA from value */
 +        tmp = loadfn(s, opaque);
 +        sfpa = tcg_temp_new_i32();
 +        tcg_gen_shri_i32(sfpa, tmp, 31);
 +        control = load_cpu_field(v7m.control[M_REG_S]);
 +        tcg_gen_deposit_i32(control, control, sfpa,
 +                            R_V7M_CONTROL_SFPA_SHIFT, 1);
 +        store_cpu_field(control, v7m.control[M_REG_S]);
 +        fpscr = load_cpu_field(vfp.xregs[ARM_VFP_FPSCR]);
 +        tcg_gen_andi_i32(fpscr, fpscr, FPCR_NZCV_MASK);
 +        tcg_gen_andi_i32(tmp, tmp, ~FPCR_NZCV_MASK);
 +        tcg_gen_or_i32(fpscr, fpscr, tmp);
 +        store_cpu_field(fpscr, vfp.xregs[ARM_VFP_FPSCR]);
 +        tcg_temp_free_i32(tmp);
 +        tcg_temp_free_i32(sfpa);
 +        break;
 +    }
      default:
          g_assert_not_reached();
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+     }
-             case 0x6f: /* FNEG */
+@@ -XXX,XX +XXX,XX @@ static bool gen_M_fp_sysreg_read(DisasContext *s, int regno,
-                 tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
+         tcg_gen_andi_i32(tmp, tmp, FPCR_NZCV_MASK);
-                 break;
+         storefn(s, opaque, tmp);
-+            case 0x7f: /* FSQRT */
+         break;
-+                gen_helper_sqrt_f16(tcg_res, tcg_op, tcg_fpstatus);
++    case ARM_VFP_FPCXT_S:
-+                break;
++    {
-             default:
++        TCGv_i32 control, sfpa, fpscr;
-                 g_assert_not_reached();
++        /* Bits [27:0] from FPSCR, bit [31] from CONTROL.SFPA */
-             }
++        tmp = tcg_temp_new_i32();
 +        sfpa = tcg_temp_new_i32();
 +        gen_helper_vfp_get_fpscr(tmp, cpu_env);
 +        tcg_gen_andi_i32(tmp, tmp, ~FPCR_NZCV_MASK);
 +        control = load_cpu_field(v7m.control[M_REG_S]);
 +        tcg_gen_andi_i32(sfpa, control, R_V7M_CONTROL_SFPA_MASK);
 +        tcg_gen_shli_i32(sfpa, sfpa, 31 - R_V7M_CONTROL_SFPA_SHIFT);
 +        tcg_gen_or_i32(tmp, tmp, sfpa);
 +        tcg_temp_free_i32(sfpa);
 +        /*
 +         * Store result before updating FPSCR etc, in case
 +         * it is a memory write which causes an exception.
 +         */
 +        storefn(s, opaque, tmp);
 +        /*
 +         * Now we must reset FPSCR from FPDSCR_NS, and clear
 +         * CONTROL.SFPA; so we'll end the TB here.
 +         */
 +        tcg_gen_andi_i32(control, control, ~R_V7M_CONTROL_SFPA_MASK);
 +        store_cpu_field(control, v7m.control[M_REG_S]);
 +        fpscr = load_cpu_field(v7m.fpdscr[M_REG_NS]);
 +        gen_helper_vfp_set_fpscr(cpu_env, fpscr);
 +        tcg_temp_free_i32(fpscr);
 +        gen_lookup_tb(s);
 +        break;
 +    }
      default:
          g_assert_not_reached();
      }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 12/42] target/arm/cpu.h: add additional float_status flags
+[PULL 25/36] hw/intc/armv7m_nvic: Update FPDSCR masking for v8.1M
-From: Alex Bennée <alex.bennee@linaro.org>
+The FPDSCR register has a similar layout to the FPSCR.  In v8.1M it
 gains new fields FZ16 (if half-precision floating point is supported)
 and LTPSIZE (always reads as 4).  Update the reset value and the code
 that handles writes to this register accordingly.
-Half-precision flush to zero behaviour is controlled by a separate
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 FZ16 bit in the FPCR. To handle this we pass a pointer to
 fp_status_fp16 when working on half-precision operations. The value of
 the presented FPCR is calculated from an amalgam of the two when read.
 Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-5-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-16-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h           | 32 ++++++++++++++++++++++------
+ target/arm/cpu.h      | 5 +++++
- target/arm/helper.c        | 26 ++++++++++++++++++-----
+ hw/intc/armv7m_nvic.c | 9 ++++++++-
- target/arm/translate-a64.c | 53 +++++++++++++++++++++++++---------------------
+ target/arm/cpu.c      | 3 +++
-files changed, 75 insertions(+), 36 deletions(-)
+files changed, 16 insertions(+), 1 deletion(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUARMState {
+@@ -XXX,XX +XXX,XX @@ void vfp_set_fpscr(CPUARMState *env, uint32_t val);
-         /* scratch space when Tn are not sufficient.  */
+ #define FPCR_IXE    (1 << 12)   /* Inexact exception trap enable */
-         uint32_t scratch[8];
+ #define FPCR_IDE    (1 << 15)   /* Input Denormal exception trap enable */
+ #define FPCR_FZ16   (1 << 19)   /* ARMv8.2+, FP16 flush-to-zero */
--        /* fp_status is the "normal" fp status. standard_fp_status retains
++#define FPCR_RMODE_MASK (3 << 22) /* Rounding mode */
--         * values corresponding to the ARM "Standard FPSCR Value", ie
+ #define FPCR_FZ     (1 << 24)   /* Flush-to-zero enable bit */
--         * default-NaN, flush-to-zero, round-to-nearest and is used by
+ #define FPCR_DN     (1 << 25)   /* Default NaN enable bit */
--         * any operations (generally Neon) which the architecture defines
++#define FPCR_AHP    (1 << 26)   /* Alternative half-precision */
--         * as controlled by the standard FPSCR value rather than the FPSCR.
+ #define FPCR_QC     (1 << 27)   /* Cumulative saturation bit */
-+        /* There are a number of distinct float control structures:
+ #define FPCR_V      (1 << 28)   /* FP overflow flag */
-+         *
+ #define FPCR_C      (1 << 29)   /* FP carry flag */
-+         *  fp_status: is the "normal" fp status.
+ #define FPCR_Z      (1 << 30)   /* FP zero flag */
-+         *  fp_status_fp16: used for half-precision calculations
+ #define FPCR_N      (1 << 31)   /* FP negative flag */
-+         *  standard_fp_status : the ARM "Standard FPSCR Value"
-+         *
++#define FPCR_LTPSIZE_SHIFT 16   /* LTPSIZE, M-profile only */
-+         * Half-precision operations are governed by a separate
++#define FPCR_LTPSIZE_MASK (7 << FPCR_LTPSIZE_SHIFT)
 +         * flush-to-zero control bit in FPSCR:FZ16. We pass a separate
 +         * status structure to control this.
 +         *
 +         * The "Standard FPSCR", ie default-NaN, flush-to-zero,
 +         * round-to-nearest and is used by any operations (generally
 +         * Neon) which the architecture defines as controlled by the
 +         * standard FPSCR value rather than the FPSCR.
           *
           * To avoid having to transfer exception bits around, we simply
           * say that the FPSCR cumulative exception flags are the logical
 -         * OR of the flags in the two fp statuses. This relies on the
 +         * OR of the flags in the three fp statuses. This relies on the
           * only thing which needs to read the exception flags being
           * an explicit FPSCR read.
           */
          float_status fp_status;
 +        float_status fp_status_f16;
          float_status standard_fp_status;
          /* ZCR_EL[1-3] */
@@ -XXX,XX +XXX,XX @@ static inline void xpsr_write(CPUARMState *env, uint32_t val, uint32_t mask)
  uint32_t vfp_get_fpscr(CPUARMState *env);
  void vfp_set_fpscr(CPUARMState *env, uint32_t val);
 -/* For A64 the FPSCR is split into two logically distinct registers,
 +/* FPCR, Floating Point Control Register
 + * FPSR, Floating Poiht Status Register
 + *
 + * For A64 the FPSCR is split into two logically distinct registers,
   * FPCR and FPSR. However since they still use non-overlapping bits
   * we store the underlying state in fpscr and just mask on read/write.
   */
  #define FPSR_MASK 0xf800009f
  #define FPCR_MASK 0x07f79f00
 +
-+#define FPCR_FZ16   (1 << 19)   /* ARMv8.2+, FP16 flush-to-zero */
+ #define FPCR_NZCV_MASK (FPCR_N | FPCR_Z | FPCR_C | FPCR_V)
-+#define FPCR_FZ     (1 << 24)   /* Flush-to-zero enable bit */
+ #define FPCR_NZCVQC_MASK (FPCR_NZCV_MASK | FPCR_QC)
-+#define FPCR_DN     (1 << 25)   /* Default NaN enable bit */
-+
+diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
  static inline uint32_t vfp_get_fpsr(CPUARMState *env)
  {
      return vfp_get_fpscr(env) & FPSR_MASK;
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
+--- a/hw/intc/armv7m_nvic.c
-+++ b/target/arm/helper.c
++++ b/hw/intc/armv7m_nvic.c
-@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(vfp_get_fpscr)(CPUARMState *env)
+@@ -XXX,XX +XXX,XX @@ static void nvic_writel(NVICState *s, uint32_t offset, uint32_t value,
-             | (env->vfp.vec_stride << 20);
+         break;
-     i = get_float_exception_flags(&env->vfp.fp_status);
+     case 0xf3c: /* FPDSCR */
-     i |= get_float_exception_flags(&env->vfp.standard_fp_status);
+         if (cpu_isar_feature(aa32_vfp_simd, cpu)) {
-+    i |= get_float_exception_flags(&env->vfp.fp_status_f16);
+-            value &= 0x07c00000;
-     fpscr |= vfp_exceptbits_from_host(i);
++            uint32_t mask = FPCR_AHP | FPCR_DN | FPCR_FZ | FPCR_RMODE_MASK;
-     return fpscr;
++            if (cpu_isar_feature(any_fp16, cpu)) {
- }
++                mask |= FPCR_FZ16;
-@@ -XXX,XX +XXX,XX @@ void HELPER(vfp_set_fpscr)(CPUARMState *env, uint32_t val)
++            }
-             break;
++            value &= mask;
 +            if (cpu_isar_feature(aa32_lob, cpu)) {
 +                value |= 4 << FPCR_LTPSIZE_SHIFT;
 +            }
              cpu->env.v7m.fpdscr[attrs.secure] = value;
          }
-         set_float_rounding_mode(i, &env->vfp.fp_status);
+         break;
-+        set_float_rounding_mode(i, &env->vfp.fp_status_f16);
+diff --git a/target/arm/cpu.c b/target/arm/cpu.c
      }
 -    if (changed & (1 << 24)) {
 -        set_flush_to_zero((val & (1 << 24)) != 0, &env->vfp.fp_status);
 -        set_flush_inputs_to_zero((val & (1 << 24)) != 0, &env->vfp.fp_status);
 +    if (changed & FPCR_FZ16) {
 +        bool ftz_enabled = val & FPCR_FZ16;
 +        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
 +        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status_f16);
 +    }
 +    if (changed & FPCR_FZ) {
 +        bool ftz_enabled = val & FPCR_FZ;
 +        set_flush_to_zero(ftz_enabled, &env->vfp.fp_status);
 +        set_flush_inputs_to_zero(ftz_enabled, &env->vfp.fp_status);
 +    }
 +    if (changed & FPCR_DN) {
 +        bool dnan_enabled = val & FPCR_DN;
 +        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status);
 +        set_default_nan_mode(dnan_enabled, &env->vfp.fp_status_f16);
      }
 -    if (changed & (1 << 25))
 -        set_default_nan_mode((val & (1 << 25)) != 0, &env->vfp.fp_status);
 +    /* The exception flags are ORed together when we read fpscr so we
 +     * only need to preserve the current state in one of our
 +     * float_status values.
 +     */
      i = vfp_exceptbits_to_host(val);
      set_float_exception_flags(i, &env->vfp.fp_status);
 +    set_float_exception_flags(0, &env->vfp.fp_status_f16);
      set_float_exception_flags(0, &env->vfp.standard_fp_status);
  }
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/cpu.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ static void write_fp_sreg(DisasContext *s, int reg, TCGv_i32 v)
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset(DeviceState *dev)
-     tcg_temp_free_i64(tmp);
+              * always reset to 4.
- }
+              */
+             env->v7m.ltpsize = 4;
--static TCGv_ptr get_fpstatus_ptr(void)
++            /* The LTPSIZE field in FPDSCR is constant and reads as 4. */
-+static TCGv_ptr get_fpstatus_ptr(bool is_f16)
++            env->v7m.fpdscr[M_REG_NS] = 4 << FPCR_LTPSIZE_SHIFT;
- {
++            env->v7m.fpdscr[M_REG_S] = 4 << FPCR_LTPSIZE_SHIFT;
      TCGv_ptr statusptr = tcg_temp_new_ptr();
      int offset;
 -    /* In A64 all instructions (both FP and Neon) use the FPCR;
 -     * there is no equivalent of the A32 Neon "standard FPSCR value"
 -     * and all operations use vfp.fp_status.
 +    /* In A64 all instructions (both FP and Neon) use the FPCR; there
 +     * is no equivalent of the A32 Neon "standard FPSCR value".
 +     * However half-precision operations operate under a different
 +     * FZ16 flag and use vfp.fp_status_f16 instead of vfp.fp_status.
       */
 -    offset = offsetof(CPUARMState, vfp.fp_status);
 +    if (is_f16) {
 +        offset = offsetof(CPUARMState, vfp.fp_status_f16);
 +    } else {
 +        offset = offsetof(CPUARMState, vfp.fp_status);
 +    }
      tcg_gen_addi_ptr(statusptr, cpu_env, offset);
      return statusptr;
  }
@@ -XXX,XX +XXX,XX @@ static void handle_fp_compare(DisasContext *s, bool is_double,
                                bool cmp_with_zero, bool signal_all_nans)
  {
      TCGv_i64 tcg_flags = tcg_temp_new_i64();
 -    TCGv_ptr fpst = get_fpstatus_ptr();
 +    TCGv_ptr fpst = get_fpstatus_ptr(false);
      if (is_double) {
          TCGv_i64 tcg_vn, tcg_vm;
@@ -XXX,XX +XXX,XX @@ static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
      TCGv_i32 tcg_op;
      TCGv_i32 tcg_res;
 -    fpst = get_fpstatus_ptr();
 +    fpst = get_fpstatus_ptr(false);
      tcg_op = read_fp_sreg(s, rn);
      tcg_res = tcg_temp_new_i32();
@@ -XXX,XX +XXX,XX @@ static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
          return;
      }
 -    fpst = get_fpstatus_ptr();
 +    fpst = get_fpstatus_ptr(false);
      tcg_op = read_fp_dreg(s, rn);
      tcg_res = tcg_temp_new_i64();
@@ -XXX,XX +XXX,XX @@ static void handle_fp_2src_single(DisasContext *s, int opcode,
      TCGv_ptr fpst;
      tcg_res = tcg_temp_new_i32();
 -    fpst = get_fpstatus_ptr();
 +    fpst = get_fpstatus_ptr(false);
      tcg_op1 = read_fp_sreg(s, rn);
      tcg_op2 = read_fp_sreg(s, rm);
@@ -XXX,XX +XXX,XX @@ static void handle_fp_2src_double(DisasContext *s, int opcode,
      TCGv_ptr fpst;
      tcg_res = tcg_temp_new_i64();
 -    fpst = get_fpstatus_ptr();
 +    fpst = get_fpstatus_ptr(false);
      tcg_op1 = read_fp_dreg(s, rn);
      tcg_op2 = read_fp_dreg(s, rm);
@@ -XXX,XX +XXX,XX @@ static void handle_fp_3src_single(DisasContext *s, bool o0, bool o1,
  {
      TCGv_i32 tcg_op1, tcg_op2, tcg_op3;
      TCGv_i32 tcg_res = tcg_temp_new_i32();
 -    TCGv_ptr fpst = get_fpstatus_ptr();
 +    TCGv_ptr fpst = get_fpstatus_ptr(false);
      tcg_op1 = read_fp_sreg(s, rn);
      tcg_op2 = read_fp_sreg(s, rm);
@@ -XXX,XX +XXX,XX @@ static void handle_fp_3src_double(DisasContext *s, bool o0, bool o1,
  {
      TCGv_i64 tcg_op1, tcg_op2, tcg_op3;
      TCGv_i64 tcg_res = tcg_temp_new_i64();
 -    TCGv_ptr fpst = get_fpstatus_ptr();
 +    TCGv_ptr fpst = get_fpstatus_ptr(false);
      tcg_op1 = read_fp_dreg(s, rn);
      tcg_op2 = read_fp_dreg(s, rm);
@@ -XXX,XX +XXX,XX @@ static void handle_fpfpcvt(DisasContext *s, int rd, int rn, int opcode,
      TCGv_ptr tcg_fpstatus;
      TCGv_i32 tcg_shift;
 -    tcg_fpstatus = get_fpstatus_ptr();
 +    tcg_fpstatus = get_fpstatus_ptr(false);
      tcg_shift = tcg_const_i32(64 - scale);
@@ -XXX,XX +XXX,XX @@ static void disas_simd_across_lanes(DisasContext *s, uint32_t insn)
          TCGv_i32 tcg_elt1 = tcg_temp_new_i32();
          TCGv_i32 tcg_elt2 = tcg_temp_new_i32();
          TCGv_i32 tcg_elt3 = tcg_temp_new_i32();
 -        TCGv_ptr fpst = get_fpstatus_ptr();
 +        TCGv_ptr fpst = get_fpstatus_ptr(false);
          assert(esize == 32);
          assert(elements == 4);
@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_pairwise(DisasContext *s, uint32_t insn)
          }
-         size = extract32(size, 0, 1) ? 3 : 2;
+         if (arm_feature(env, ARM_FEATURE_M_SECURITY)) {
 -        fpst = get_fpstatus_ptr();
 +        fpst = get_fpstatus_ptr(false);
          break;
      default:
          unallocated_encoding(s);
@@ -XXX,XX +XXX,XX @@ static void handle_simd_intfp_conv(DisasContext *s, int rd, int rn,
                                     int fracbits, int size)
  {
      bool is_double = size == 3 ? true : false;
 -    TCGv_ptr tcg_fpst = get_fpstatus_ptr();
 +    TCGv_ptr tcg_fpst = get_fpstatus_ptr(false);
      TCGv_i32 tcg_shift = tcg_const_i32(fracbits);
      TCGv_i64 tcg_int = tcg_temp_new_i64();
      TCGMemOp mop = size | (is_signed ? MO_SIGN : 0);
@@ -XXX,XX +XXX,XX @@ static void handle_simd_shift_fpint_conv(DisasContext *s, bool is_scalar,
      tcg_rmode = tcg_const_i32(arm_rmode_to_sf(FPROUNDING_ZERO));
      gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 -    tcg_fpstatus = get_fpstatus_ptr();
 +    tcg_fpstatus = get_fpstatus_ptr(false);
      tcg_shift = tcg_const_i32(fracbits);
      if (is_double) {
@@ -XXX,XX +XXX,XX @@ static void handle_3same_float(DisasContext *s, int size, int elements,
                                 int fpopcode, int rd, int rn, int rm)
  {
      int pass;
 -    TCGv_ptr fpst = get_fpstatus_ptr();
 +    TCGv_ptr fpst = get_fpstatus_ptr(false);
      for (pass = 0; pass < elements; pass++) {
          if (size) {
@@ -XXX,XX +XXX,XX @@ static void handle_2misc_fcmp_zero(DisasContext *s, int opcode,
          return;
      }
 -    fpst = get_fpstatus_ptr();
 +    fpst = get_fpstatus_ptr(false);
      if (is_double) {
          TCGv_i64 tcg_op = tcg_temp_new_i64();
@@ -XXX,XX +XXX,XX @@ static void handle_2misc_reciprocal(DisasContext *s, int opcode,
                                      int size, int rn, int rd)
  {
      bool is_double = (size == 3);
 -    TCGv_ptr fpst = get_fpstatus_ptr();
 +    TCGv_ptr fpst = get_fpstatus_ptr(false);
      if (is_double) {
          TCGv_i64 tcg_op = tcg_temp_new_i64();
@@ -XXX,XX +XXX,XX @@ static void disas_simd_scalar_two_reg_misc(DisasContext *s, uint32_t insn)
      if (is_fcvt) {
          tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
          gen_helper_set_rmode(tcg_rmode, tcg_rmode, cpu_env);
 -        tcg_fpstatus = get_fpstatus_ptr();
 +        tcg_fpstatus = get_fpstatus_ptr(false);
      } else {
          tcg_rmode = NULL;
          tcg_fpstatus = NULL;
@@ -XXX,XX +XXX,XX @@ static void handle_simd_3same_pair(DisasContext *s, int is_q, int u, int opcode,
      /* Floating point operations need fpst */
      if (opcode >= 0x58) {
 -        fpst = get_fpstatus_ptr();
 +        fpst = get_fpstatus_ptr(false);
      } else {
          fpst = NULL;
      }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
      }
      if (need_fpstatus) {
 -        tcg_fpstatus = get_fpstatus_ptr();
 +        tcg_fpstatus = get_fpstatus_ptr(false);
      } else {
          tcg_fpstatus = NULL;
      }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
      }
      if (is_fp) {
 -        fpst = get_fpstatus_ptr();
 +        fpst = get_fpstatus_ptr(false);
      } else {
          fpst = NULL;
      }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 31/42] arm/translate-a64: add FP16 FRECPE
+[PULL 26/36] target/arm: For v8.1M, always clear R0-R3, R12, APSR, EPSR on exception entry
-From: Alex Bennée <alex.bennee@linaro.org>
+In v8.0M, on exception entry the registers R0-R3, R12, APSR and EPSR
 are zeroed for an exception taken to Non-secure state; for an
 exception taken to Secure state they become UNKNOWN, and we chose to
 leave them at their previous values.
-Now we have added f16 during the re-factoring we can simply call the
+In v8.1M the behaviour is specified more tightly and these registers
-helper.
+are always zeroed regardless of the security state that the exception
 targets (see rule R_KPZV).  Implement this.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-24-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-17-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 8 ++++++++
+ target/arm/m_helper.c | 16 ++++++++++++----
-file changed, 8 insertions(+)
+file changed, 12 insertions(+), 4 deletions(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/target/arm/m_helper.c b/target/arm/m_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/m_helper.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/m_helper.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static void v7m_exception_taken(ARMCPU *cpu, uint32_t lr, bool dotailchain,
-     case 0x6d: /* FCMLE (zero) */
+          * Clear registers if necessary to prevent non-secure exception
-         handle_2misc_fcmp_zero(s, fpop, is_scalar, 0, is_q, MO_16, rn, rd);
+          * code being able to see register values from secure code.
-         return;
+          * Where register values become architecturally UNKNOWN we leave
-+    case 0x3d: /* FRECPE */
+-         * them with their previous values.
-+        break;
++         * them with their previous values. v8.1M is tighter than v8.0M
-     case 0x18: /* FRINTN */
++         * here and always zeroes the caller-saved registers regardless
-         need_rmode = true;
++         * of the security state the exception is targeting.
-         only_in_vector = true;
+          */
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
+         if (arm_feature(env, ARM_FEATURE_M_SECURITY)) {
-         case 0x3b: /* FCVTZS */
+-            if (!targets_secure) {
-             gen_helper_advsimd_f16tosinth(tcg_res, tcg_op, tcg_fpstatus);
++            if (!targets_secure || arm_feature(env, ARM_FEATURE_V8_1M)) {
-             break;
+                 /*
-+        case 0x3d: /* FRECPE */
+                  * Always clear the caller-saved registers (they have been
-+            gen_helper_recpe_f16(tcg_res, tcg_op, tcg_fpstatus);
+                  * pushed to the stack earlier in v7m_push_stack()).
-+            break;
+@@ -XXX,XX +XXX,XX @@ static void v7m_exception_taken(ARMCPU *cpu, uint32_t lr, bool dotailchain,
-         case 0x5a: /* FCVTNU */
+                  * v7m_push_callee_stack()).
-         case 0x5b: /* FCVTMU */
+                  */
-         case 0x5c: /* FCVTAU */
+                 int i;
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
++                /*
-             case 0x3b: /* FCVTZS */
++                 * r4..r11 are callee-saves, zero only if background
-                 gen_helper_advsimd_f16tosinth(tcg_res, tcg_op, tcg_fpstatus);
++                 * state was Secure (EXCRET.S == 1) and exception
-                 break;
++                 * targets Non-secure state
-+            case 0x3d: /* FRECPE */
++                 */
-+                gen_helper_recpe_f16(tcg_res, tcg_op, tcg_fpstatus);
++                bool zero_callee_saves = !targets_secure &&
-+                break;
++                    (lr & R_V7M_EXCRET_S_MASK);
-             case 0x5a: /* FCVTNU */
-             case 0x5b: /* FCVTMU */
+                 for (i = 0; i < 13; i++) {
-             case 0x5c: /* FCVTAU */
+-                    /* r4..r11 are callee-saves, zero only if EXCRET.S == 1 */
 -                    if (i < 4 || i > 11 || (lr & R_V7M_EXCRET_S_MASK)) {
 +                    if (i < 4 || i > 11 || zero_callee_saves) {
                          env->regs[i] = 0;
                      }
                  }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 40/42] target/arm: Enable ARM_V8_FP16 feature bit for the AArch64 "any" CPU
+[PULL 27/36] target/arm: In v8.1M, don't set HFSR.FORCED on vector table fetch failures
-Now we have implemented FP16 we can enable it for the "any" CPU.
+In v8.1M, vector table fetch failures don't set HFSR.FORCED (see rule
 R_LLRP).  (In previous versions of the architecture this was either
 required or IMPDEF.)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-[PMM: split out from an earlier patch in the series]
+Message-id: 20201119215617.29887-18-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu64.c | 1 +
+ target/arm/m_helper.c | 6 +++++-
-file changed, 1 insertion(+)
+file changed, 5 insertions(+), 1 deletion(-)
-diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
+diff --git a/target/arm/m_helper.c b/target/arm/m_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/cpu64.c
+--- a/target/arm/m_helper.c
-+++ b/target/arm/cpu64.c
++++ b/target/arm/m_helper.c
-@@ -XXX,XX +XXX,XX @@ static void aarch64_any_initfn(Object *obj)
+@@ -XXX,XX +XXX,XX @@ load_fail:
-     set_feature(&cpu->env, ARM_FEATURE_V8_SM4);
+      * The HardFault is Secure if BFHFNMINS is 0 (meaning that all HFs are
-     set_feature(&cpu->env, ARM_FEATURE_V8_PMULL);
+      * secure); otherwise it targets the same security state as the
-     set_feature(&cpu->env, ARM_FEATURE_CRC);
+      * underlying exception.
-+    set_feature(&cpu->env, ARM_FEATURE_V8_FP16);
++     * In v8.1M HardFaults from vector table fetch fails don't set FORCED.
-     cpu->ctr = 0x80038003; /* 32 byte I and D cacheline size, VIPT icache */
+      */
-     cpu->dcz_blocksize = 7; /*  512 bytes */
+     if (!(cpu->env.v7m.aircr & R_V7M_AIRCR_BFHFNMINS_MASK)) {
          exc_secure = true;
      }
 -    env->v7m.hfsr |= R_V7M_HFSR_VECTTBL_MASK | R_V7M_HFSR_FORCED_MASK;
 +    env->v7m.hfsr |= R_V7M_HFSR_VECTTBL_MASK;
 +    if (!arm_feature(env, ARM_FEATURE_V8_1M)) {
 +        env->v7m.hfsr |= R_V7M_HFSR_FORCED_MASK;
 +    }
      armv7m_nvic_set_pending_derived(env->nvic, ARMV7M_EXCP_HARD, exc_secure);
      return false;
  }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 22/42] arm/translate-a64: add FP16 FMULX/MLS/FMLA to simd_indexed
+[PULL 28/36] target/arm: Implement v8.1M REVIDR register
-From: Alex Bennée <alex.bennee@linaro.org>
+In v8.1M a REVIDR register is defined, which is at address 0xe00ecfc
 and is a read-only IMPDEF register providing implementation specific
 minor revision information, like the v8A REVIDR_EL1. Implement this.
-The helpers use the new re-factored muladd support in SoftFloat for
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-the float16 work.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-19-peter.maydell@linaro.org
 ---
  hw/intc/armv7m_nvic.c | 5 +++++
 file changed, 5 insertions(+)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 Message-id: 20180227143852.11175-15-alex.bennee@linaro.org
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/translate-a64.c | 82 +++++++++++++++++++++++++++++++++++++---------
 file changed, 66 insertions(+), 16 deletions(-)
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/hw/intc/armv7m_nvic.c
-+++ b/target/arm/translate-a64.c
++++ b/hw/intc/armv7m_nvic.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static uint32_t nvic_readl(NVICState *s, uint32_t offset, MemTxAttrs attrs)
      int rd = extract32(insn, 0, 5);
      bool is_long = false;
      bool is_fp = false;
 +    bool is_fp16 = false;
      int index;
      TCGv_ptr fpst;
@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
          }
-         /* fall through */
+         return val;
      case 0x9: /* FMUL, FMULX */
 -        if (!extract32(size, 1, 1)) {
 +        if (size == 1) {
              unallocated_encoding(s);
              return;
          }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
      }
++    case 0xcfc:
-     if (is_fp) {
++        if (!arm_feature(&cpu->env, ARM_FEATURE_V8_1M)) {
--        /* low bit of size indicates single/double */
++            goto bad_offset;
--        size = extract32(size, 0, 1) ? 3 : 2;
++        }
--        if (size == 2) {
++        return cpu->revidr;
-+        /* convert insn encoded size to TCGMemOp size */
+     case 0xd00: /* CPUID Base.  */
-+        switch (size) {
+         return cpu->midr;
-+        case 2: /* single precision */
+     case 0xd04: /* Interrupt Control State (ICSR) */
 +            size = MO_32;
              index = h << 1 | l;
 -        } else {
 +            rm |= (m << 4);
 +            break;
 +        case 3: /* double precision */
 +            size = MO_64;
              if (l || !is_q) {
                  unallocated_encoding(s);
                  return;
              }
              index = h;
 +            rm |= (m << 4);
 +            break;
 +        case 0: /* half precision */
 +            size = MO_16;
 +            index = h << 2 | l << 1 | m;
 +            is_fp16 = true;
 +            if (arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
 +                break;
 +            }
 +            /* fallthru */
 +        default: /* unallocated */
 +            unallocated_encoding(s);
 +            return;
          }
 -        rm |= (m << 4);
      } else {
          switch (size) {
          case 1:
@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
      }
      if (is_fp) {
 -        fpst = get_fpstatus_ptr(false);
 +        fpst = get_fpstatus_ptr(is_fp16);
      } else {
          fpst = NULL;
      }
@@ -XXX,XX +XXX,XX @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
                  break;
              }
              case 0x5: /* FMLS */
 -                /* As usual for ARM, separate negation for fused multiply-add */
 -                gen_helper_vfp_negs(tcg_op, tcg_op);
 -                /* fall through */
              case 0x1: /* FMLA */
 -                read_vec_element_i32(s, tcg_res, rd, pass, MO_32);
 -                gen_helper_vfp_muladds(tcg_res, tcg_op, tcg_idx, tcg_res, fpst);
 +                read_vec_element_i32(s, tcg_res, rd, pass,
 +                                     is_scalar ? size : MO_32);
 +                switch (size) {
 +                case 1:
 +                    if (opcode == 0x5) {
 +                        /* As usual for ARM, separate negation for fused
 +                         * multiply-add */
 +                        tcg_gen_xori_i32(tcg_op, tcg_op, 0x80008000);
 +                    }
 +                    gen_helper_advsimd_muladdh(tcg_res, tcg_op, tcg_idx,
 +                                               tcg_res, fpst);
 +                    break;
 +                case 2:
 +                    if (opcode == 0x5) {
 +                        /* As usual for ARM, separate negation for
 +                         * fused multiply-add */
 +                        tcg_gen_xori_i32(tcg_op, tcg_op, 0x80000000);
 +                    }
 +                    gen_helper_vfp_muladds(tcg_res, tcg_op, tcg_idx,
 +                                           tcg_res, fpst);
 +                    break;
 +                default:
 +                    g_assert_not_reached();
 +                }
                  break;
              case 0x9: /* FMUL, FMULX */
 -                if (u) {
 -                    gen_helper_vfp_mulxs(tcg_res, tcg_op, tcg_idx, fpst);
 -                } else {
 -                    gen_helper_vfp_muls(tcg_res, tcg_op, tcg_idx, fpst);
 +                switch (size) {
 +                case 1:
 +                    if (u) {
 +                        gen_helper_advsimd_mulxh(tcg_res, tcg_op, tcg_idx,
 +                                                 fpst);
 +                    } else {
 +                        g_assert_not_reached();
 +                    }
 +                    break;
 +                case 2:
 +                    if (u) {
 +                        gen_helper_vfp_mulxs(tcg_res, tcg_op, tcg_idx, fpst);
 +                    } else {
 +                        gen_helper_vfp_muls(tcg_res, tcg_op, tcg_idx, fpst);
 +                    }
 +                    break;
 +                default:
 +                    g_assert_not_reached();
                  }
                  break;
              case 0xc: /* SQDMULH */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 41/42] linux-user: Report AArch64 FP16 support via hwcap bits
+[PULL 29/36] target/arm: Implement new v8.1M NOCP check for exception return
-Set the appropriate Linux hwcap bits to tell the guest binary if we
+In v8.1M a new exception return check is added which may cause a NOCP
-have implemented half-precision floating point support.
+UsageFault (see rule R_XLTP): before we clear s0..s15 and the FPSCR
 we must check whether access to CP10 from the Security state of the
 returning exception is disabled; if it is then we must take a fault.
 (Note that for our implementation CPPWR is always RAZ/WI and so can
 never cause CP10 accesses to fail.)
 The other v8.1M change to this register-clearing code is that if MVE
 is implemented VPR must also be cleared, so add a TODO comment to
 that effect.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Message-id: 20201119215617.29887-20-peter.maydell@linaro.org
 ---
- linux-user/elfload.c | 2 ++
+ target/arm/m_helper.c | 22 +++++++++++++++++++++-
-file changed, 2 insertions(+)
+file changed, 21 insertions(+), 1 deletion(-)
-diff --git a/linux-user/elfload.c b/linux-user/elfload.c
+diff --git a/target/arm/m_helper.c b/target/arm/m_helper.c
 index XXXXXXX..XXXXXXX 100644
---- a/linux-user/elfload.c
+--- a/target/arm/m_helper.c
-+++ b/linux-user/elfload.c
++++ b/target/arm/m_helper.c
-@@ -XXX,XX +XXX,XX @@ static uint32_t get_elf_hwcap(void)
+@@ -XXX,XX +XXX,XX @@ static void do_v7m_exception_exit(ARMCPU *cpu)
-     GET_FEATURE(ARM_FEATURE_V8_SM3, ARM_HWCAP_A64_SM3);
+             v7m_exception_taken(cpu, excret, true, false);
-     GET_FEATURE(ARM_FEATURE_V8_SM4, ARM_HWCAP_A64_SM4);
+             return;
-     GET_FEATURE(ARM_FEATURE_V8_SHA512, ARM_HWCAP_A64_SHA512);
+         } else {
-+    GET_FEATURE(ARM_FEATURE_V8_FP16,
+-            /* Clear s0..s15 and FPSCR */
-+                ARM_HWCAP_A64_FPHP | ARM_HWCAP_A64_ASIMDHP);
++            if (arm_feature(env, ARM_FEATURE_V8_1M)) {
- #undef GET_FEATURE
++                /* v8.1M adds this NOCP check */
++                bool nsacr_pass = exc_secure ||
-     return hwcaps;
++                    extract32(env->v7m.nsacr, 10, 1);
 +                bool cpacr_pass = v7m_cpacr_pass(env, exc_secure, true);
 +                if (!nsacr_pass) {
 +                    armv7m_nvic_set_pending(env->nvic, ARMV7M_EXCP_USAGE, true);
 +                    env->v7m.cfsr[M_REG_S] |= R_V7M_CFSR_NOCP_MASK;
 +                    qemu_log_mask(CPU_LOG_INT, "...taking UsageFault on existing "
 +                        "stackframe: NSACR prevents clearing FPU registers\n");
 +                    v7m_exception_taken(cpu, excret, true, false);
 +                } else if (!cpacr_pass) {
 +                    armv7m_nvic_set_pending(env->nvic, ARMV7M_EXCP_USAGE,
 +                                            exc_secure);
 +                    env->v7m.cfsr[exc_secure] |= R_V7M_CFSR_NOCP_MASK;
 +                    qemu_log_mask(CPU_LOG_INT, "...taking UsageFault on existing "
 +                        "stackframe: CPACR prevents clearing FPU registers\n");
 +                    v7m_exception_taken(cpu, excret, true, false);
 +                }
 +            }
 +            /* Clear s0..s15 and FPSCR; TODO also VPR when MVE is implemented */
              int i;
              for (i = 0; i < 16; i += 2) {
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 21/42] arm/translate-a64: add FP16 pairwise ops simd_three_reg_same_fp16
+[PULL 30/36] target/arm: Implement new v8.1M VLLDM and VLSTM encodings
-From: Alex Bennée <alex.bennee@linaro.org>
+v8.1M adds new encodings of VLLDM and VLSTM (where bit 7 is set).
 The only difference is that:
  * the old T1 encodings UNDEF if the implementation implements 32
    Dregs (this is currently architecturally impossible for M-profile)
  * the new T2 encodings have the implementation-defined option to
    read from memory (discarding the data) or write UNKNOWN values to
    memory for the stack slots that would be D16-D31
-This includes FMAXNMP, FADDP, FMAXP, FMINNMP, FMINP.
+We choose not to make those accesses, so for us the two
 instructions behave identically assuming they don't UNDEF.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-14-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-21-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.c | 208 +++++++++++++++++++++++++++++----------------
+ target/arm/m-nocp.decode       |  2 +-
-file changed, 133 insertions(+), 75 deletions(-)
+ target/arm/translate-vfp.c.inc | 25 +++++++++++++++++++++++++
 files changed, 26 insertions(+), 1 deletion(-)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+diff --git a/target/arm/m-nocp.decode b/target/arm/m-nocp.decode
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/m-nocp.decode
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/m-nocp.decode
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@
-     int datasize, elements;
-     int pass;
+ {
-     TCGv_ptr fpst;
+   # Special cases which do not take an early NOCP: VLLDM and VLSTM
-+    bool pairwise = false;
+-  VLLDM_VLSTM  1110 1100 001 l:1 rn:4 0000 1010 0000 0000
++  VLLDM_VLSTM  1110 1100 001 l:1 rn:4 0000 1010 op:1 000 0000
-     if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
+   # VSCCLRM (new in v8.1M) is similar:
-         unallocated_encoding(s);
+   VSCCLRM      1110 1100 1.01 1111 .... 1011 imm:7 0   vd=%vd_dp size=3
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+   VSCCLRM      1110 1100 1.01 1111 .... 1010 imm:8     vd=%vd_sp size=2
-     datasize = is_q ? 128 : 64;
+diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
-     elements = datasize / 16;
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/translate-vfp.c.inc
-+    switch (fpopcode) {
++++ b/target/arm/translate-vfp.c.inc
-+    case 0x10: /* FMAXNMP */
+@@ -XXX,XX +XXX,XX @@ static bool trans_VLLDM_VLSTM(DisasContext *s, arg_VLLDM_VLSTM *a)
-+    case 0x12: /* FADDP */
+         !arm_dc_feature(s, ARM_FEATURE_V8)) {
-+    case 0x16: /* FMAXP */
+         return false;
-+    case 0x18: /* FMINNMP */
+     }
-+    case 0x1e: /* FMINP */
++
-+        pairwise = true;
++    if (a->op) {
-+        break;
++        /*
 +         * T2 encoding ({D0-D31} reglist): v8.1M and up. We choose not
 +         * to take the IMPDEF option to make memory accesses to the stack
 +         * slots that correspond to the D16-D31 registers (discarding
 +         * read data and writing UNKNOWN values), so for us the T2
 +         * encoding behaves identically to the T1 encoding.
 +         */
 +        if (!arm_dc_feature(s, ARM_FEATURE_V8_1M)) {
 +            return false;
 +        }
 +    } else {
 +        /*
 +         * T1 encoding ({D0-D15} reglist); undef if we have 32 Dregs.
 +         * This is currently architecturally impossible, but we add the
 +         * check to stay in line with the pseudocode. Note that we must
 +         * emit code for the UNDEF so it takes precedence over the NOCP.
 +         */
 +        if (dc_isar_feature(aa32_simd_r32, s)) {
 +            unallocated_encoding(s);
 +            return true;
 +        }
 +    }
 +
-     fpst = get_fpstatus_ptr(true);
+     /*
+      * If not secure, UNDEF. We must emit code for this
--    for (pass = 0; pass < elements; pass++) {
+      * rather than returning false so that this takes
 +    if (pairwise) {
 +        int maxpass = is_q ? 8 : 4;
          TCGv_i32 tcg_op1 = tcg_temp_new_i32();
          TCGv_i32 tcg_op2 = tcg_temp_new_i32();
 -        TCGv_i32 tcg_res = tcg_temp_new_i32();
 +        TCGv_i32 tcg_res[8];
 -        read_vec_element_i32(s, tcg_op1, rn, pass, MO_16);
 -        read_vec_element_i32(s, tcg_op2, rm, pass, MO_16);
 +        for (pass = 0; pass < maxpass; pass++) {
 +            int passreg = pass < (maxpass / 2) ? rn : rm;
 +            int passelt = (pass << 1) & (maxpass - 1);
 -        switch (fpopcode) {
 -        case 0x0: /* FMAXNM */
 -            gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x1: /* FMLA */
 -            read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 -            gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
 -                                       fpst);
 -            break;
 -        case 0x2: /* FADD */
 -            gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x3: /* FMULX */
 -            gen_helper_advsimd_mulxh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x4: /* FCMEQ */
 -            gen_helper_advsimd_ceq_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x6: /* FMAX */
 -            gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x7: /* FRECPS */
 -            gen_helper_recpsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x8: /* FMINNM */
 -            gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x9: /* FMLS */
 -             /* As usual for ARM, separate negation for fused multiply-add */
 -            tcg_gen_xori_i32(tcg_op1, tcg_op1, 0x8000);
 -            read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 -            gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
 -                                       fpst);
 -            break;
 -        case 0xa: /* FSUB */
 -            gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0xe: /* FMIN */
 -            gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0xf: /* FRSQRTS */
 -            gen_helper_rsqrtsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x13: /* FMUL */
 -            gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x14: /* FCMGE */
 -            gen_helper_advsimd_cge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x15: /* FACGE */
 -            gen_helper_advsimd_acge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x17: /* FDIV */
 -            gen_helper_advsimd_divh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x1a: /* FABD */
 -            gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
 -            tcg_gen_andi_i32(tcg_res, tcg_res, 0x7fff);
 -            break;
 -        case 0x1c: /* FCMGT */
 -            gen_helper_advsimd_cgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        case 0x1d: /* FACGT */
 -            gen_helper_advsimd_acgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 -            break;
 -        default:
 -            fprintf(stderr, "%s: insn %#04x, fpop %#2x @ %#" PRIx64 "\n",
 -                    __func__, insn, fpopcode, s->pc);
 -            g_assert_not_reached();
 +            read_vec_element_i32(s, tcg_op1, passreg, passelt, MO_16);
 +            read_vec_element_i32(s, tcg_op2, passreg, passelt + 1, MO_16);
 +            tcg_res[pass] = tcg_temp_new_i32();
 +
 +            switch (fpopcode) {
 +            case 0x10: /* FMAXNMP */
 +                gen_helper_advsimd_maxnumh(tcg_res[pass], tcg_op1, tcg_op2,
 +                                           fpst);
 +                break;
 +            case 0x12: /* FADDP */
 +                gen_helper_advsimd_addh(tcg_res[pass], tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x16: /* FMAXP */
 +                gen_helper_advsimd_maxh(tcg_res[pass], tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x18: /* FMINNMP */
 +                gen_helper_advsimd_minnumh(tcg_res[pass], tcg_op1, tcg_op2,
 +                                           fpst);
 +                break;
 +            case 0x1e: /* FMINP */
 +                gen_helper_advsimd_minh(tcg_res[pass], tcg_op1, tcg_op2, fpst);
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
 +        }
 +
 +        for (pass = 0; pass < maxpass; pass++) {
 +            write_vec_element_i32(s, tcg_res[pass], rd, pass, MO_16);
 +            tcg_temp_free_i32(tcg_res[pass]);
          }
 -        write_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 -        tcg_temp_free_i32(tcg_res);
          tcg_temp_free_i32(tcg_op1);
          tcg_temp_free_i32(tcg_op2);
 +
 +    } else {
 +        for (pass = 0; pass < elements; pass++) {
 +            TCGv_i32 tcg_op1 = tcg_temp_new_i32();
 +            TCGv_i32 tcg_op2 = tcg_temp_new_i32();
 +            TCGv_i32 tcg_res = tcg_temp_new_i32();
 +
 +            read_vec_element_i32(s, tcg_op1, rn, pass, MO_16);
 +            read_vec_element_i32(s, tcg_op2, rm, pass, MO_16);
 +
 +            switch (fpopcode) {
 +            case 0x0: /* FMAXNM */
 +                gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x1: /* FMLA */
 +                read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 +                gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
 +                                           fpst);
 +                break;
 +            case 0x2: /* FADD */
 +                gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x3: /* FMULX */
 +                gen_helper_advsimd_mulxh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x4: /* FCMEQ */
 +                gen_helper_advsimd_ceq_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x6: /* FMAX */
 +                gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x7: /* FRECPS */
 +                gen_helper_recpsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x8: /* FMINNM */
 +                gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x9: /* FMLS */
 +                /* As usual for ARM, separate negation for fused multiply-add */
 +                tcg_gen_xori_i32(tcg_op1, tcg_op1, 0x8000);
 +                read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 +                gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
 +                                           fpst);
 +                break;
 +            case 0xa: /* FSUB */
 +                gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xe: /* FMIN */
 +                gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0xf: /* FRSQRTS */
 +                gen_helper_rsqrtsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x13: /* FMUL */
 +                gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x14: /* FCMGE */
 +                gen_helper_advsimd_cge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x15: /* FACGE */
 +                gen_helper_advsimd_acge_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x17: /* FDIV */
 +                gen_helper_advsimd_divh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x1a: /* FABD */
 +                gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
 +                tcg_gen_andi_i32(tcg_res, tcg_res, 0x7fff);
 +                break;
 +            case 0x1c: /* FCMGT */
 +                gen_helper_advsimd_cgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            case 0x1d: /* FACGT */
 +                gen_helper_advsimd_acgt_f16(tcg_res, tcg_op1, tcg_op2, fpst);
 +                break;
 +            default:
 +                fprintf(stderr, "%s: insn %#04x, fpop %#2x @ %#" PRIx64 "\n",
 +                        __func__, insn, fpopcode, s->pc);
 +                g_assert_not_reached();
 +            }
 +
 +            write_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 +            tcg_temp_free_i32(tcg_res);
 +            tcg_temp_free_i32(tcg_op1);
 +            tcg_temp_free_i32(tcg_op2);
 +        }
      }
      tcg_temp_free_ptr(fpst);
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 11/42] target/arm/cpu.h: update comment for half-precision values
+[PULL 31/36] hw/intc/armv7m_nvic: Support v8.1M CCR.TRD bit
-From: Alex Bennée <alex.bennee@linaro.org>
+v8.1M introduces a new TRD flag in the CCR register, which enables
 checking for stack frame integrity signatures on SG instructions.
 This bit is not banked, and is always RAZ/WI to Non-secure code.
 Adjust the code for handling CCR reads and writes to handle this.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-4-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-23-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/cpu.h | 1 +
+ target/arm/cpu.h      |  2 ++
-file changed, 1 insertion(+)
+ hw/intc/armv7m_nvic.c | 26 ++++++++++++++++++--------
 files changed, 20 insertions(+), 8 deletions(-)
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ typedef struct {
+@@ -XXX,XX +XXX,XX @@ FIELD(V7M_CCR, STKOFHFNMIGN, 10, 1)
-  *  Qn = regs[n].d[1]:regs[n].d[0]
+ FIELD(V7M_CCR, DC, 16, 1)
-  *  Dn = regs[n].d[0]
+ FIELD(V7M_CCR, IC, 17, 1)
-  *  Sn = regs[n].d[0] bits 31..0
+ FIELD(V7M_CCR, BP, 18, 1)
-+ *  Hn = regs[n].d[0] bits 15..0
++FIELD(V7M_CCR, LOB, 19, 1)
-  *
++FIELD(V7M_CCR, TRD, 20, 1)
-  * This corresponds to the architecturally defined mapping between
-  * the two execution states, and means we do not need to explicitly
+ /* V7M SCR bits */
  FIELD(V7M_SCR, SLEEPONEXIT, 1, 1)
 diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 index XXXXXXX..XXXXXXX 100644
 --- a/hw/intc/armv7m_nvic.c
 +++ b/hw/intc/armv7m_nvic.c
@@ -XXX,XX +XXX,XX @@ static uint32_t nvic_readl(NVICState *s, uint32_t offset, MemTxAttrs attrs)
          }
          return cpu->env.v7m.scr[attrs.secure];
      case 0xd14: /* Configuration Control.  */
 -        /* The BFHFNMIGN bit is the only non-banked bit; we
 -         * keep it in the non-secure copy of the register.
 +        /*
 +         * Non-banked bits: BFHFNMIGN (stored in the NS copy of the register)
 +         * and TRD (stored in the S copy of the register)
           */
          val = cpu->env.v7m.ccr[attrs.secure];
          val |= cpu->env.v7m.ccr[M_REG_NS] & R_V7M_CCR_BFHFNMIGN_MASK;
@@ -XXX,XX +XXX,XX @@ static void nvic_writel(NVICState *s, uint32_t offset, uint32_t value,
          cpu->env.v7m.scr[attrs.secure] = value;
          break;
      case 0xd14: /* Configuration Control.  */
 +    {
 +        uint32_t mask;
 +
          if (!arm_feature(&cpu->env, ARM_FEATURE_M_MAIN)) {
              goto bad_offset;
          }
          /* Enforce RAZ/WI on reserved and must-RAZ/WI bits */
 -        value &= (R_V7M_CCR_STKALIGN_MASK |
 -                  R_V7M_CCR_BFHFNMIGN_MASK |
 -                  R_V7M_CCR_DIV_0_TRP_MASK |
 -                  R_V7M_CCR_UNALIGN_TRP_MASK |
 -                  R_V7M_CCR_USERSETMPEND_MASK |
 -                  R_V7M_CCR_NONBASETHRDENA_MASK);
 +        mask = R_V7M_CCR_STKALIGN_MASK |
 +            R_V7M_CCR_BFHFNMIGN_MASK |
 +            R_V7M_CCR_DIV_0_TRP_MASK |
 +            R_V7M_CCR_UNALIGN_TRP_MASK |
 +            R_V7M_CCR_USERSETMPEND_MASK |
 +            R_V7M_CCR_NONBASETHRDENA_MASK;
 +        if (arm_feature(&cpu->env, ARM_FEATURE_V8_1M) && attrs.secure) {
 +            /* TRD is always RAZ/WI from NS */
 +            mask |= R_V7M_CCR_TRD_MASK;
 +        }
 +        value &= mask;
          if (arm_feature(&cpu->env, ARM_FEATURE_V8)) {
              /* v8M makes NONBASETHRDENA and STKALIGN be RES1 */
@@ -XXX,XX +XXX,XX @@ static void nvic_writel(NVICState *s, uint32_t offset, uint32_t value,
          cpu->env.v7m.ccr[attrs.secure] = value;
          break;
 +    }
      case 0xd24: /* System Handler Control and State (SHCSR) */
          if (!arm_feature(&cpu->env, ARM_FEATURE_V7)) {
              goto bad_offset;
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 39/42] arm/translate-a64: add all single op FP16 to handle_fp_1src_half
+[PULL 32/36] target/arm: Implement CCR_S.TRD behaviour for SG insns
-From: Alex Bennée <alex.bennee@linaro.org>
+v8.1M introduces a new TRD flag in the CCR register, which enables
 checking for stack frame integrity signatures on SG instructions.
 Add the code in the SG insn implementation for the new behaviour.
-This includes FMOV, FABS, FNEG, FSQRT and  FRINT[NPMZAXI]. We re-use
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-existing helpers to achieve this.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-24-peter.maydell@linaro.org
 ---
  target/arm/m_helper.c | 86 +++++++++++++++++++++++++++++++++++++++++++
 file changed, 86 insertions(+)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/target/arm/m_helper.c b/target/arm/m_helper.c
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-32-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/translate-a64.c | 71 ++++++++++++++++++++++++++++++++++++++++++++++
 file changed, 71 insertions(+)
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/m_helper.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/m_helper.c
-@@ -XXX,XX +XXX,XX @@ static void disas_fp_csel(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static bool v7m_read_half_insn(ARMCPU *cpu, ARMMMUIdx mmu_idx,
-     tcg_temp_free_i64(t_true);
+     return true;
  }
-+/* Floating-point data-processing (1 source) - half precision */
++static bool v7m_read_sg_stack_word(ARMCPU *cpu, ARMMMUIdx mmu_idx,
-+static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
++                                   uint32_t addr, uint32_t *spdata)
 +{
-+    TCGv_ptr fpst = NULL;
++    /*
-+    TCGv_i32 tcg_op = tcg_temp_new_i32();
++     * Read a word of data from the stack for the SG instruction,
-+    TCGv_i32 tcg_res = tcg_temp_new_i32();
++     * writing the value into *spdata. If the load succeeds, return
 +     * true; otherwise pend an appropriate exception and return false.
 +     * (We can't use data load helpers here that throw an exception
 +     * because of the context we're called in, which is halfway through
 +     * arm_v7m_cpu_do_interrupt().)
 +     */
 +    CPUState *cs = CPU(cpu);
 +    CPUARMState *env = &cpu->env;
 +    MemTxAttrs attrs = {};
 +    MemTxResult txres;
 +    target_ulong page_size;
 +    hwaddr physaddr;
 +    int prot;
 +    ARMMMUFaultInfo fi = {};
 +    ARMCacheAttrs cacheattrs = {};
 +    uint32_t value;
 +
-+    read_vec_element_i32(s, tcg_op, rn, 0, MO_16);
++    if (get_phys_addr(env, addr, MMU_DATA_LOAD, mmu_idx, &physaddr,
-+
++                      &attrs, &prot, &page_size, &fi, &cacheattrs)) {
-+    switch (opcode) {
++        /* MPU/SAU lookup failed */
-+    case 0x0: /* FMOV */
++        if (fi.type == ARMFault_QEMU_SFault) {
-+        tcg_gen_mov_i32(tcg_res, tcg_op);
++            qemu_log_mask(CPU_LOG_INT,
-+        break;
++                          "...SecureFault during stack word read\n");
-+    case 0x1: /* FABS */
++            env->v7m.sfsr |= R_V7M_SFSR_AUVIOL_MASK | R_V7M_SFSR_SFARVALID_MASK;
-+        tcg_gen_andi_i32(tcg_res, tcg_op, 0x7fff);
++            env->v7m.sfar = addr;
-+        break;
++            armv7m_nvic_set_pending(env->nvic, ARMV7M_EXCP_SECURE, false);
-+    case 0x2: /* FNEG */
++        } else {
-+        tcg_gen_xori_i32(tcg_res, tcg_op, 0x8000);
++            qemu_log_mask(CPU_LOG_INT,
-+        break;
++                          "...MemManageFault during stack word read\n");
-+    case 0x3: /* FSQRT */
++            env->v7m.cfsr[M_REG_S] |= R_V7M_CFSR_DACCVIOL_MASK |
-+        gen_helper_sqrt_f16(tcg_res, tcg_op, cpu_env);
++                R_V7M_CFSR_MMARVALID_MASK;
-+        break;
++            env->v7m.mmfar[M_REG_S] = addr;
-+    case 0x8: /* FRINTN */
++            armv7m_nvic_set_pending(env->nvic, ARMV7M_EXCP_MEM, false);
-+    case 0x9: /* FRINTP */
++        }
-+    case 0xa: /* FRINTM */
++        return false;
 +    case 0xb: /* FRINTZ */
 +    case 0xc: /* FRINTA */
 +    {
 +        TCGv_i32 tcg_rmode = tcg_const_i32(arm_rmode_to_sf(opcode & 7));
 +        fpst = get_fpstatus_ptr(true);
 +
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
 +        gen_helper_advsimd_rinth(tcg_res, tcg_op, fpst);
 +
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
 +        tcg_temp_free_i32(tcg_rmode);
 +        break;
 +    }
-+    case 0xe: /* FRINTX */
++    value = address_space_ldl(arm_addressspace(cs, attrs), physaddr,
-+        fpst = get_fpstatus_ptr(true);
++                              attrs, &txres);
-+        gen_helper_advsimd_rinth_exact(tcg_res, tcg_op, fpst);
++    if (txres != MEMTX_OK) {
-+        break;
++        /* BusFault trying to read the data */
-+    case 0xf: /* FRINTI */
++        qemu_log_mask(CPU_LOG_INT,
-+        fpst = get_fpstatus_ptr(true);
++                      "...BusFault during stack word read\n");
-+        gen_helper_advsimd_rinth(tcg_res, tcg_op, fpst);
++        env->v7m.cfsr[M_REG_NS] |=
-+        break;
++            (R_V7M_CFSR_PRECISERR_MASK | R_V7M_CFSR_BFARVALID_MASK);
-+    default:
++        env->v7m.bfar = addr;
-+        abort();
++        armv7m_nvic_set_pending(env->nvic, ARMV7M_EXCP_BUS, false);
 +        return false;
 +    }
 +
-+    write_fp_sreg(s, rd, tcg_res);
++    *spdata = value;
-+
++    return true;
 +    if (fpst) {
 +        tcg_temp_free_ptr(fpst);
 +    }
 +    tcg_temp_free_i32(tcg_op);
 +    tcg_temp_free_i32(tcg_res);
 +}
 +
- /* Floating-point data-processing (1 source) - single precision */
+ static bool v7m_handle_execute_nsc(ARMCPU *cpu)
  static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
  {
-@@ -XXX,XX +XXX,XX @@ static void disas_fp_1src(DisasContext *s, uint32_t insn)
+     /*
+@@ -XXX,XX +XXX,XX @@ static bool v7m_handle_execute_nsc(ARMCPU *cpu)
-             handle_fp_1src_double(s, opcode, rd, rn);
+      */
-             break;
+     qemu_log_mask(CPU_LOG_INT, "...really an SG instruction at 0x%08" PRIx32
-+        case 3:
+                   ", executing it\n", env->regs[15]);
-+            if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
++
-+                unallocated_encoding(s);
++    if (cpu_isar_feature(aa32_m_sec_state, cpu) &&
-+                return;
++        !arm_v7m_is_handler_mode(env)) {
 +        /*
 +         * v8.1M exception stack frame integrity check. Note that we
 +         * must perform the memory access even if CCR_S.TRD is zero
 +         * and we aren't going to check what the data loaded is.
 +         */
 +        uint32_t spdata, sp;
 +
 +        /*
 +         * We know we are currently NS, so the S stack pointers must be
 +         * in other_ss_{psp,msp}, not in regs[13]/other_sp.
 +         */
 +        sp = v7m_using_psp(env) ? env->v7m.other_ss_psp : env->v7m.other_ss_msp;
 +        if (!v7m_read_sg_stack_word(cpu, mmu_idx, sp, &spdata)) {
 +            /* Stack access failed and an exception has been pended */
 +            return false;
 +        }
 +
 +        if (env->v7m.ccr[M_REG_S] & R_V7M_CCR_TRD_MASK) {
 +            if (((spdata & ~1) == 0xfefa125a) ||
 +                !(env->v7m.control[M_REG_S] & 1)) {
 +                goto gen_invep;
 +            }
++        }
++    }
 +
-+            if (!fp_access_check(s)) {
+     env->regs[14] &= ~1;
-+                return;
+     env->v7m.control[M_REG_S] &= ~R_V7M_CONTROL_SFPA_MASK;
-+            }
+     switch_v7m_security_state(env, true);
 +
 +            handle_fp_1src_half(s, opcode, rd, rn);
 +            break;
          default:
              unallocated_encoding(s);
          }
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 34/42] arm/helper.c: re-factor rsqrte and add rsqrte_f16
+[PULL 33/36] hw/intc/armv7m_nvic: Fix "return from inactive handler" check
-From: Alex Bennée <alex.bennee@linaro.org>
+In commit 077d7449100d824a4 we added code to handle the v8M
 requirement that returns from NMI or HardFault forcibly deactivate
 those exceptions regardless of what interrupt the guest is trying to
 deactivate.  Unfortunately this broke the handling of the "illegal
 exception return because the returning exception number is not
 active" check for those cases.  In the pseudocode this test is done
 on the exception the guest asks to return from, but because our
 implementation was doing this in armv7m_nvic_complete_irq() after the
 new "deactivate NMI/HardFault regardless" code we ended up doing the
 test on the VecInfo for that exception instead, which usually meant
 failing to raise the illegal exception return fault.
-Much like recpe the ARM ARM has simplified the pseudo code for the
+In the case for "configurable exception targeting the opposite
-calculation which is done on a fixed point 9 bit integer maths. So
+security state" we detected the illegal-return case but went ahead
-while adding f16 we can also clean this up to be a little less heavy
+and deactivated the VecInfo anyway, which is wrong because that is
-on the floating point and just return the fractional part and leave
+the VecInfo for the other security state.
 the calle's to do the final packing of the result.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Rearrange the code so that we first identify the illegal return
 cases, then see if we really need to deactivate NMI or HardFault
 instead, and finally do the deactivation.
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-27-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-25-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper.h |   1 +
+ hw/intc/armv7m_nvic.c | 59 +++++++++++++++++++++++--------------------
- target/arm/helper.c | 221 ++++++++++++++++++++++++----------------------------
+file changed, 32 insertions(+), 27 deletions(-)
 files changed, 104 insertions(+), 118 deletions(-)
-diff --git a/target/arm/helper.h b/target/arm/helper.h
+diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.h
+--- a/hw/intc/armv7m_nvic.c
-+++ b/target/arm/helper.h
++++ b/hw/intc/armv7m_nvic.c
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(rsqrts_f32, f32, f32, f32, env)
+@@ -XXX,XX +XXX,XX @@ int armv7m_nvic_complete_irq(void *opaque, int irq, bool secure)
- DEF_HELPER_FLAGS_2(recpe_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
+ {
- DEF_HELPER_FLAGS_2(recpe_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
+     NVICState *s = (NVICState *)opaque;
- DEF_HELPER_FLAGS_2(recpe_f64, TCG_CALL_NO_RWG, f64, f64, ptr)
+     VecInfo *vec = NULL;
-+DEF_HELPER_FLAGS_2(rsqrte_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
+-    int ret;
- DEF_HELPER_FLAGS_2(rsqrte_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
++    int ret = 0;
- DEF_HELPER_FLAGS_2(rsqrte_f64, TCG_CALL_NO_RWG, f64, f64, ptr)
- DEF_HELPER_2(recpe_u32, i32, i32, ptr)
+     assert(irq > ARMV7M_EXCP_RESET && irq < s->num_irq);
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
++    trace_nvic_complete_irq(irq, secure);
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ float64 HELPER(recpe_f64)(float64 input, void *fpstp)
  /* The algorithm that must be used to calculate the estimate
   * is specified by the ARM ARM.
   */
 -static float64 recip_sqrt_estimate(float64 a, float_status *real_fp_status)
 +
-+static int do_recip_sqrt_estimate(int a)
++    if (secure && exc_is_banked(irq)) {
- {
++        vec = &s->sec_vectors[irq];
--    /* These calculations mustn't set any fp exception flags,
++    } else {
--     * so we use a local copy of the fp_status.
++        vec = &s->vectors[irq];
 -     */
 -    float_status dummy_status = *real_fp_status;
 -    float_status *s = &dummy_status;
 -    float64 q;
 -    int64_t q_int;
 +    int b, estimate;
 -    if (float64_lt(a, float64_half, s)) {
 -        /* range 0.25 <= a < 0.5 */
 -
 -        /* a in units of 1/512 rounded down */
 -        /* q0 = (int)(a * 512.0);  */
 -        q = float64_mul(float64_512, a, s);
 -        q_int = float64_to_int64_round_to_zero(q, s);
 -
 -        /* reciprocal root r */
 -        /* r = 1.0 / sqrt(((double)q0 + 0.5) / 512.0);  */
 -        q = int64_to_float64(q_int, s);
 -        q = float64_add(q, float64_half, s);
 -        q = float64_div(q, float64_512, s);
 -        q = float64_sqrt(q, s);
 -        q = float64_div(float64_one, q, s);
 +    assert(128 <= a && a < 512);
 +    if (a < 256) {
 +        a = a * 2 + 1;
      } else {
 -        /* range 0.5 <= a < 1.0 */
 -
 -        /* a in units of 1/256 rounded down */
 -        /* q1 = (int)(a * 256.0); */
 -        q = float64_mul(float64_256, a, s);
 -        int64_t q_int = float64_to_int64_round_to_zero(q, s);
 -
 -        /* reciprocal root r */
 -        /* r = 1.0 /sqrt(((double)q1 + 0.5) / 256); */
 -        q = int64_to_float64(q_int, s);
 -        q = float64_add(q, float64_half, s);
 -        q = float64_div(q, float64_256, s);
 -        q = float64_sqrt(q, s);
 -        q = float64_div(float64_one, q, s);
 +        a = (a >> 1) << 1;
 +        a = (a + 1) * 2;
      }
 -    /* r in units of 1/256 rounded to nearest */
 -    /* s = (int)(256.0 * r + 0.5); */
 +    b = 512;
 +    while (a * (b + 1) * (b + 1) < (1 << 28)) {
 +        b += 1;
 +    }
 +    estimate = (b + 1) / 2;
 +    assert(256 <= estimate && estimate < 512);
 -    q = float64_mul(q, float64_256,s );
 -    q = float64_add(q, float64_half, s);
 -    q_int = float64_to_int64_round_to_zero(q, s);
 +    return estimate;
 +}
 -    /* return (double)s / 256.0;*/
 -    return float64_div(int64_to_float64(q_int, s), float64_256, s);
 +
 +static uint64_t recip_sqrt_estimate(int *exp , int exp_off, uint64_t frac)
 +{
 +    int estimate;
 +    uint32_t scaled;
 +
 +    if (*exp == 0) {
 +        while (extract64(frac, 51, 1) == 0) {
 +            frac = frac << 1;
 +            *exp -= 1;
 +        }
 +        frac = extract64(frac, 0, 51) << 1;
 +    }
 +
-+    if (*exp & 1) {
++    /*
-+        /* scaled = UInt('01':fraction<51:45>) */
++     * Identify illegal exception return cases. We can't immediately
-+        scaled = deposit32(1 << 7, 0, 7, extract64(frac, 45, 7));
++     * return at this point because we still need to deactivate
 +     * (either this exception or NMI/HardFault) first.
 +     */
 +    if (!exc_is_banked(irq) && exc_targets_secure(s, irq) != secure) {
 +        /*
 +         * Return from a configurable exception targeting the opposite
 +         * security state from the one we're trying to complete it for.
 +         * Clear vec because it's not really the VecInfo for this
 +         * (irq, secstate) so we mustn't deactivate it.
 +         */
 +        ret = -1;
 +        vec = NULL;
 +    } else if (!vec->active) {
 +        /* Return from an inactive interrupt */
 +        ret = -1;
 +    } else {
-+        /* scaled = UInt('1':fraction<51:44>) */
++        /* Legal return, we will return the RETTOBASE bit value to the caller */
-+        scaled = deposit32(1 << 8, 0, 8, extract64(frac, 44, 8));
++        ret = nvic_rettobase(s);
 +    }
 +    estimate = do_recip_sqrt_estimate(scaled);
 +
 +    *exp = (exp_off - *exp) / 2;
 +    return extract64(estimate, 0, 8) << 44;
 +}
 +
 +float16 HELPER(rsqrte_f16)(float16 input, void *fpstp)
 +{
 +    float_status *s = fpstp;
 +    float16 f16 = float16_squash_input_denormal(input, s);
 +    uint16_t val = float16_val(f16);
 +    bool f16_sign = float16_is_neg(f16);
 +    int f16_exp = extract32(val, 10, 5);
 +    uint16_t f16_frac = extract32(val, 0, 10);
 +    uint64_t f64_frac;
 +
 +    if (float16_is_any_nan(f16)) {
 +        float16 nan = f16;
 +        if (float16_is_signaling_nan(f16, s)) {
 +            float_raise(float_flag_invalid, s);
 +            nan = float16_maybe_silence_nan(f16, s);
 +        }
 +        if (s->default_nan_mode) {
 +            nan =  float16_default_nan(s);
 +        }
 +        return nan;
 +    } else if (float16_is_zero(f16)) {
 +        float_raise(float_flag_divbyzero, s);
 +        return float16_set_sign(float16_infinity, f16_sign);
 +    } else if (f16_sign) {
 +        float_raise(float_flag_invalid, s);
 +        return float16_default_nan(s);
 +    } else if (float16_is_infinity(f16)) {
 +        return float16_zero;
 +    }
 +
-+    /* Scale and normalize to a double-precision value between 0.25 and 1.0,
+     /*
-+     * preserving the parity of the exponent.  */
+      * For negative priorities, v8M will forcibly deactivate the appropriate
-+
+      * NMI or HardFault regardless of what interrupt we're being asked to
-+    f64_frac = ((uint64_t) f16_frac) << (52 - 10);
+@@ -XXX,XX +XXX,XX @@ int armv7m_nvic_complete_irq(void *opaque, int irq, bool secure)
-+
+     }
-+    f64_frac = recip_sqrt_estimate(&f16_exp, 44, f64_frac);
-+
+     if (!vec) {
-+    /* result = sign : result_exp<4:0> : estimate<7:0> : Zeros(2) */
+-        if (secure && exc_is_banked(irq)) {
-+    val = deposit32(0, 15, 1, f16_sign);
+-            vec = &s->sec_vectors[irq];
-+    val = deposit32(val, 10, 5, f16_exp);
+-        } else {
-+    val = deposit32(val, 2, 8, extract64(f64_frac, 52 - 8, 8));
+-            vec = &s->vectors[irq];
 +    return make_float16(val);
  }
  float32 HELPER(rsqrte_f32)(float32 input, void *fpstp)
@@ -XXX,XX +XXX,XX @@ float32 HELPER(rsqrte_f32)(float32 input, void *fpstp)
      float_status *s = fpstp;
      float32 f32 = float32_squash_input_denormal(input, s);
      uint32_t val = float32_val(f32);
 -    uint32_t f32_sbit = 0x80000000 & val;
 -    int32_t f32_exp = extract32(val, 23, 8);
 +    uint32_t f32_sign = float32_is_neg(f32);
 +    int f32_exp = extract32(val, 23, 8);
      uint32_t f32_frac = extract32(val, 0, 23);
      uint64_t f64_frac;
 -    uint64_t val64;
 -    int result_exp;
 -    float64 f64;
      if (float32_is_any_nan(f32)) {
          float32 nan = f32;
@@ -XXX,XX +XXX,XX @@ float32 HELPER(rsqrte_f32)(float32 input, void *fpstp)
       * preserving the parity of the exponent.  */
      f64_frac = ((uint64_t) f32_frac) << 29;
 -    if (f32_exp == 0) {
 -        while (extract64(f64_frac, 51, 1) == 0) {
 -            f64_frac = f64_frac << 1;
 -            f32_exp = f32_exp-1;
 -        }
--        f64_frac = extract64(f64_frac, 0, 51) << 1;
--    }
--    if (extract64(f32_exp, 0, 1) == 0) {
--        f64 = make_float64(((uint64_t) f32_sbit) << 32
--                           | (0x3feULL << 52)
--                           | f64_frac);
--    } else {
--        f64 = make_float64(((uint64_t) f32_sbit) << 32
--                           | (0x3fdULL << 52)
--                           | f64_frac);
--    }
-+    f64_frac = recip_sqrt_estimate(&f32_exp, 380, f64_frac);
--    result_exp = (380 - f32_exp) / 2;
--
--    f64 = recip_sqrt_estimate(f64, s);
--
--    val64 = float64_val(f64);
--
--    val = ((result_exp & 0xff) << 23)
--        | ((val64 >> 29)  & 0x7fffff);
-+    /* result = sign : result_exp<4:0> : estimate<7:0> : Zeros(15) */
-+    val = deposit32(0, 31, 1, f32_sign);
-+    val = deposit32(val, 23, 8, f32_exp);
-+    val = deposit32(val, 15, 8, extract64(f64_frac, 52 - 8, 8));
-     return make_float32(val);
- }
-@@ -XXX,XX +XXX,XX @@ float64 HELPER(rsqrte_f64)(float64 input, void *fpstp)
-     float_status *s = fpstp;
-     float64 f64 = float64_squash_input_denormal(input, s);
-     uint64_t val = float64_val(f64);
--    uint64_t f64_sbit = 0x8000000000000000ULL & val;
--    int64_t f64_exp = extract64(val, 52, 11);
-+    bool f64_sign = float64_is_neg(f64);
-+    int f64_exp = extract64(val, 52, 11);
-     uint64_t f64_frac = extract64(val, 0, 52);
--    int64_t result_exp;
--    uint64_t result_frac;
-     if (float64_is_any_nan(f64)) {
-         float64 nan = f64;
-@@ -XXX,XX +XXX,XX @@ float64 HELPER(rsqrte_f64)(float64 input, void *fpstp)
-         return float64_zero;
-     }
--    /* Scale and normalize to a double-precision value between 0.25 and 1.0,
--     * preserving the parity of the exponent.  */
-+    f64_frac = recip_sqrt_estimate(&f64_exp, 3068, f64_frac);
--    if (f64_exp == 0) {
--        while (extract64(f64_frac, 51, 1) == 0) {
--            f64_frac = f64_frac << 1;
--            f64_exp = f64_exp - 1;
--        }
--        f64_frac = extract64(f64_frac, 0, 51) << 1;
 -    }
 -
--    if (extract64(f64_exp, 0, 1) == 0) {
+-    trace_nvic_complete_irq(irq, secure);
--        f64 = make_float64(f64_sbit
+-
--                           | (0x3feULL << 52)
+-    if (!vec->active) {
--                           | f64_frac);
+-        /* Tell the caller this was an illegal exception return */
--    } else {
+-        return -1;
 -        f64 = make_float64(f64_sbit
 -                           | (0x3fdULL << 52)
 -                           | f64_frac);
 -    }
 -
--    result_exp = (3068 - f64_exp) / 2;
+-    /*
--
+-     * If this is a configurable exception and it is currently
--    f64 = recip_sqrt_estimate(f64, s);
+-     * targeting the opposite security state from the one we're trying
--
+-     * to complete it for, this counts as an illegal exception return.
--    result_frac = extract64(float64_val(f64), 0, 52);
+-     * We still need to deactivate whatever vector the logic above has
--
+-     * selected, though, as it might not be the same as the one for the
--    return make_float64(f64_sbit |
+-     * requested exception number.
--                        ((result_exp & 0x7ff) << 52) |
+-     */
--                        result_frac);
+-    if (!exc_is_banked(irq) && exc_targets_secure(s, irq) != secure) {
-+    /* result = sign : result_exp<4:0> : estimate<7:0> : Zeros(44) */
+-        ret = -1;
-+    val = deposit64(0, 61, 1, f64_sign);
+-    } else {
-+    val = deposit64(val, 52, 11, f64_exp);
+-        ret = nvic_rettobase(s);
-+    val = deposit64(val, 44, 8, extract64(f64_frac, 52 - 8, 8));
++        return ret;
 +    return make_float64(val);
  }
  uint32_t HELPER(recpe_u32)(uint32_t a, void *fpstp)
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(recpe_u32)(uint32_t a, void *fpstp)
  uint32_t HELPER(rsqrte_u32)(uint32_t a, void *fpstp)
  {
 -    float_status *fpst = fpstp;
 -    float64 f64;
 +    int estimate;
      if ((a & 0xc0000000) == 0) {
          return 0xffffffff;
      }
--    if (a & 0x80000000) {
+     vec->active = 0;
 -        f64 = make_float64((0x3feULL << 52)
 -                           | ((uint64_t)(a & 0x7fffffff) << 21));
 -    } else { /* bits 31-30 == '01' */
 -        f64 = make_float64((0x3fdULL << 52)
 -                           | ((uint64_t)(a & 0x3fffffff) << 22));
 -    }
 +    estimate = do_recip_sqrt_estimate(extract32(a, 23, 9));
 -    f64 = recip_sqrt_estimate(f64, fpst);
 -
 -    return 0x80000000 | ((float64_val(f64) >> 21) & 0x7fffffff);
 +    return deposit32(0, 23, 9, estimate);
  }
  /* VFPv4 fused multiply-accumulate */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 20/42] arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16
+[PULL 34/36] target/arm: Implement M-profile "minimal RAS implementation"
-From: Alex Bennée <alex.bennee@linaro.org>
+For v8.1M the architecture mandates that CPUs must provide at
 least the "minimal RAS implementation" from the Reliability,
 Availability and Serviceability extension. This consists of:
  * an ESB instruction which is a NOP
    -- since it is in the HINT space we need only add a comment
  * an RFSR register which will RAZ/WI
  * a RAZ/WI AIRCR.IESB bit
    -- the code which handles writes to AIRCR does not allow setting
       of RES0 bits, so we already treat this as RAZ/WI; add a comment
       noting that this is deliberate
  * minimal implementation of the RAS register block at 0xe0005000
    -- this will be in a subsequent commit
  * setting the ID_PFR0.RAS field to 0b0010
    -- we will do this when we add the Cortex-M55 CPU model
-As some of the constants here will also be needed
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-elsewhere (specifically for the upcoming SVE support) we move them out
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-to softfloat.h.
+Message-id: 20201119215617.29887-26-peter.maydell@linaro.org
 ---
  target/arm/cpu.h      | 14 ++++++++++++++
  target/arm/t32.decode |  4 ++++
  hw/intc/armv7m_nvic.c | 13 +++++++++++++
 files changed, 31 insertions(+)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-13-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  include/fpu/softfloat.h    | 18 +++++++++++++-----
  target/arm/helper-a64.h    |  2 ++
  target/arm/helper-a64.c    | 34 ++++++++++++++++++++++++++++++++++
  target/arm/translate-a64.c |  6 ++++++
 files changed, 55 insertions(+), 5 deletions(-)
 diff --git a/include/fpu/softfloat.h b/include/fpu/softfloat.h
 index XXXXXXX..XXXXXXX 100644
---- a/include/fpu/softfloat.h
+--- a/target/arm/cpu.h
-+++ b/include/fpu/softfloat.h
++++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ static inline float16 float16_set_sign(float16 a, int sign)
+@@ -XXX,XX +XXX,XX @@ FIELD(ID_MMFR4, LSM, 20, 4)
  FIELD(ID_MMFR4, CCIDX, 24, 4)
  FIELD(ID_MMFR4, EVT, 28, 4)
 +FIELD(ID_PFR0, STATE0, 0, 4)
 +FIELD(ID_PFR0, STATE1, 4, 4)
 +FIELD(ID_PFR0, STATE2, 8, 4)
 +FIELD(ID_PFR0, STATE3, 12, 4)
 +FIELD(ID_PFR0, CSV2, 16, 4)
 +FIELD(ID_PFR0, AMU, 20, 4)
 +FIELD(ID_PFR0, DIT, 24, 4)
 +FIELD(ID_PFR0, RAS, 28, 4)
 +
  FIELD(ID_PFR1, PROGMOD, 0, 4)
  FIELD(ID_PFR1, SECURITY, 4, 4)
  FIELD(ID_PFR1, MPROGMOD, 8, 4)
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa32_predinv(const ARMISARegisters *id)
      return FIELD_EX32(id->id_isar6, ID_ISAR6, SPECRES) != 0;
  }
- #define float16_zero make_float16(0)
++static inline bool isar_feature_aa32_ras(const ARMISARegisters *id)
 -#define float16_one make_float16(0x3c00)
  #define float16_half make_float16(0x3800)
 +#define float16_one make_float16(0x3c00)
 +#define float16_one_point_five make_float16(0x3e00)
 +#define float16_two make_float16(0x4000)
 +#define float16_three make_float16(0x4200)
  #define float16_infinity make_float16(0x7c00)
  /*----------------------------------------------------------------------------
@@ -XXX,XX +XXX,XX @@ static inline float32 float32_set_sign(float32 a, int sign)
  }
  #define float32_zero make_float32(0)
 -#define float32_one make_float32(0x3f800000)
  #define float32_half make_float32(0x3f000000)
 +#define float32_one make_float32(0x3f800000)
 +#define float32_one_point_five make_float32(0x3fc00000)
 +#define float32_two make_float32(0x40000000)
 +#define float32_three make_float32(0x40400000)
  #define float32_infinity make_float32(0x7f800000)
 -
  /*----------------------------------------------------------------------------
  | The pattern for a default generated single-precision NaN.
  *----------------------------------------------------------------------------*/
@@ -XXX,XX +XXX,XX @@ static inline float64 float64_set_sign(float64 a, int sign)
  }
  #define float64_zero make_float64(0)
 -#define float64_one make_float64(0x3ff0000000000000LL)
 -#define float64_ln2 make_float64(0x3fe62e42fefa39efLL)
  #define float64_half make_float64(0x3fe0000000000000LL)
 +#define float64_one make_float64(0x3ff0000000000000LL)
 +#define float64_one_point_five make_float64(0x3FF8000000000000ULL)
 +#define float64_two make_float64(0x4000000000000000ULL)
 +#define float64_three make_float64(0x4008000000000000ULL)
 +#define float64_ln2 make_float64(0x3fe62e42fefa39efLL)
  #define float64_infinity make_float64(0x7ff0000000000000LL)
  /*----------------------------------------------------------------------------
 diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.h
 +++ b/target/arm/helper-a64.h
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_3(vfp_mulxd, TCG_CALL_NO_RWG, f64, f64, f64, ptr)
  DEF_HELPER_FLAGS_3(neon_ceq_f64, TCG_CALL_NO_RWG, i64, i64, i64, ptr)
  DEF_HELPER_FLAGS_3(neon_cge_f64, TCG_CALL_NO_RWG, i64, i64, i64, ptr)
  DEF_HELPER_FLAGS_3(neon_cgt_f64, TCG_CALL_NO_RWG, i64, i64, i64, ptr)
 +DEF_HELPER_FLAGS_3(recpsf_f16, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
  DEF_HELPER_FLAGS_3(recpsf_f32, TCG_CALL_NO_RWG, f32, f32, f32, ptr)
  DEF_HELPER_FLAGS_3(recpsf_f64, TCG_CALL_NO_RWG, f64, f64, f64, ptr)
 +DEF_HELPER_FLAGS_3(rsqrtsf_f16, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
  DEF_HELPER_FLAGS_3(rsqrtsf_f32, TCG_CALL_NO_RWG, f32, f32, f32, ptr)
  DEF_HELPER_FLAGS_3(rsqrtsf_f64, TCG_CALL_NO_RWG, f64, f64, f64, ptr)
  DEF_HELPER_FLAGS_1(neon_addlp_s8, TCG_CALL_NO_RWG_SE, i64, i64)
 diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper-a64.c
 +++ b/target/arm/helper-a64.c
@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(neon_cgt_f64)(float64 a, float64 b, void *fpstp)
   * versions, these do a fully fused multiply-add or
   * multiply-add-and-halve.
   */
 +#define float16_two make_float16(0x4000)
 +#define float16_three make_float16(0x4200)
 +#define float16_one_point_five make_float16(0x3e00)
 +
  #define float32_two make_float32(0x40000000)
  #define float32_three make_float32(0x40400000)
  #define float32_one_point_five make_float32(0x3fc00000)
@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(neon_cgt_f64)(float64 a, float64 b, void *fpstp)
  #define float64_three make_float64(0x4008000000000000ULL)
  #define float64_one_point_five make_float64(0x3FF8000000000000ULL)
 +float16 HELPER(recpsf_f16)(float16 a, float16 b, void *fpstp)
 +{
-+    float_status *fpst = fpstp;
++    return FIELD_EX32(id->id_pfr0, ID_PFR0, RAS) != 0;
 +
 +    a = float16_squash_input_denormal(a, fpst);
 +    b = float16_squash_input_denormal(b, fpst);
 +
 +    a = float16_chs(a);
 +    if ((float16_is_infinity(a) && float16_is_zero(b)) ||
 +        (float16_is_infinity(b) && float16_is_zero(a))) {
 +        return float16_two;
 +    }
 +    return float16_muladd(a, b, float16_two, 0, fpst);
 +}
 +
- float32 HELPER(recpsf_f32)(float32 a, float32 b, void *fpstp)
+ static inline bool isar_feature_aa32_mprofile(const ARMISARegisters *id)
  {
-     float_status *fpst = fpstp;
+     return FIELD_EX32(id->id_pfr1, ID_PFR1, MPROGMOD) != 0;
-@@ -XXX,XX +XXX,XX @@ float64 HELPER(recpsf_f64)(float64 a, float64 b, void *fpstp)
+diff --git a/target/arm/t32.decode b/target/arm/t32.decode
-     return float64_muladd(a, b, float64_two, 0, fpst);
+index XXXXXXX..XXXXXXX 100644
- }
+--- a/target/arm/t32.decode
++++ b/target/arm/t32.decode
-+float16 HELPER(rsqrtsf_f16)(float16 a, float16 b, void *fpstp)
+@@ -XXX,XX +XXX,XX @@ CLZ              1111 1010 1011 ---- 1111 .... 1000 ....      @rdm
-+{
+       # SEV      1111 0011 1010 1111 1000 0000 0000 0100
-+    float_status *fpst = fpstp;
+       # SEVL     1111 0011 1010 1111 1000 0000 0000 0101
 +      # For M-profile minimal-RAS ESB can be a NOP, which is the
 +      # default behaviour since it is in the hint space.
 +      # ESB      1111 0011 1010 1111 1000 0000 0001 0000
 +
-+    a = float16_squash_input_denormal(a, fpst);
+       # The canonical nop ends in 0000 0000, but the whole rest
-+    b = float16_squash_input_denormal(b, fpst);
+       # of the space is "reserved hint, behaves as nop".
-+
+       NOP        1111 0011 1010 1111 1000 0000 ---- ----
-+    a = float16_chs(a);
+diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 +    if ((float16_is_infinity(a) && float16_is_zero(b)) ||
 +        (float16_is_infinity(b) && float16_is_zero(a))) {
 +        return float16_one_point_five;
 +    }
 +    return float16_muladd(a, b, float16_three, float_muladd_halve_result, fpst);
 +}
 +
  float32 HELPER(rsqrtsf_f32)(float32 a, float32 b, void *fpstp)
  {
      float_status *fpst = fpstp;
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/hw/intc/armv7m_nvic.c
-+++ b/target/arm/translate-a64.c
++++ b/hw/intc/armv7m_nvic.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ static uint32_t nvic_readl(NVICState *s, uint32_t offset, MemTxAttrs attrs)
-         case 0x6: /* FMAX */
+             return 0;
-             gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
+         }
-             break;
+         return cpu->env.v7m.sfar;
-+        case 0x7: /* FRECPS */
++    case 0xf04: /* RFSR */
-+            gen_helper_recpsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
++        if (!cpu_isar_feature(aa32_ras, cpu)) {
-+            break;
++            goto bad_offset;
-         case 0x8: /* FMINNM */
++        }
-             gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
++        /* We provide minimal-RAS only: RFSR is RAZ/WI */
-             break;
++        return 0;
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
+     case 0xf34: /* FPCCR */
-         case 0xe: /* FMIN */
+         if (!cpu_isar_feature(aa32_vfp_simd, cpu)) {
-             gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
+             return 0;
-             break;
+@@ -XXX,XX +XXX,XX @@ static void nvic_writel(NVICState *s, uint32_t offset, uint32_t value,
-+        case 0xf: /* FRSQRTS */
+                               R_V7M_AIRCR_PRIGROUP_SHIFT,
-+            gen_helper_rsqrtsf_f16(tcg_res, tcg_op1, tcg_op2, fpst);
+                               R_V7M_AIRCR_PRIGROUP_LENGTH);
-+            break;
+             }
-         case 0x13: /* FMUL */
++            /* AIRCR.IESB is RAZ/WI because we implement only minimal RAS */
-             gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
+             if (attrs.secure) {
-             break;
+                 /* These bits are only writable by secure */
                  cpu->env.v7m.aircr = value &
@@ -XXX,XX +XXX,XX @@ static void nvic_writel(NVICState *s, uint32_t offset, uint32_t value,
          }
          break;
      }
 +    case 0xf04: /* RFSR */
 +        if (!cpu_isar_feature(aa32_ras, cpu)) {
 +            goto bad_offset;
 +        }
 +        /* We provide minimal-RAS only: RFSR is RAZ/WI */
 +        break;
      case 0xf34: /* FPCCR */
          if (cpu_isar_feature(aa32_vfp_simd, cpu)) {
              /* Not all bits here are banked. */
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 25/42] arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16
+[PULL 35/36] hw/intc/armv7m_nvic: Implement read/write for RAS register block
-From: Alex Bennée <alex.bennee@linaro.org>
+The RAS feature has a block of memory-mapped registers at offset
 x5000 within the PPB.  For a "minimal RAS" implementation we provide
 no error records and so the only registers that exist in the block
 are ERRIIDR and ERRDEVID.
-This adds the full range of half-precision floating point to integral
+The "RAZ/WI for privileged, BusFault for nonprivileged" behaviour
-instructions.
+of the "nvic-default" region is actually valid for minimal-RAS,
 so the main benefit of providing an explicit implementation of
 the register block is more accurate LOG_UNIMP messages, and a
 framework for where we could add a real RAS implementation later
 if necessary.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-18-alex.bennee@linaro.org
+Message-id: 20201119215617.29887-27-peter.maydell@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/helper-a64.h    |   2 +
+ include/hw/intc/armv7m_nvic.h |  1 +
- target/arm/helper-a64.c    |  22 ++++++++
+ hw/intc/armv7m_nvic.c         | 56 +++++++++++++++++++++++++++++++++++
- target/arm/translate-a64.c | 123 +++++++++++++++++++++++++++++++++++++++++++--
+files changed, 57 insertions(+)
 files changed, 142 insertions(+), 5 deletions(-)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
+diff --git a/include/hw/intc/armv7m_nvic.h b/include/hw/intc/armv7m_nvic.h
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
+--- a/include/hw/intc/armv7m_nvic.h
-+++ b/target/arm/helper-a64.h
++++ b/include/hw/intc/armv7m_nvic.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(advsimd_maxnum2h, i32, i32, i32, ptr)
+@@ -XXX,XX +XXX,XX @@ struct NVICState {
- DEF_HELPER_3(advsimd_minnum2h, i32, i32, i32, ptr)
+     MemoryRegion sysreg_ns_mem;
- DEF_HELPER_3(advsimd_mulx2h, i32, i32, i32, ptr)
+     MemoryRegion systickmem;
- DEF_HELPER_4(advsimd_muladd2h, i32, i32, i32, i32, ptr)
+     MemoryRegion systick_ns_mem;
-+DEF_HELPER_2(advsimd_rinth_exact, f16, f16, ptr)
++    MemoryRegion ras_mem;
-+DEF_HELPER_2(advsimd_rinth, f16, f16, ptr)
+     MemoryRegion container;
-diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
+     MemoryRegion defaultmem;
 diff --git a/hw/intc/armv7m_nvic.c b/hw/intc/armv7m_nvic.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.c
+--- a/hw/intc/armv7m_nvic.c
-+++ b/target/arm/helper-a64.c
++++ b/hw/intc/armv7m_nvic.c
-@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(advsimd_acgt_f16)(float16 a, float16 b, void *fpstp)
+@@ -XXX,XX +XXX,XX @@ static const MemoryRegionOps nvic_systick_ops = {
-     int compare = float16_compare(f0, f1, fpst);
+     .endianness = DEVICE_NATIVE_ENDIAN,
-     return ADVSIMD_CMPRES(compare == float_relation_greater);
+ };
- }
 +
-+/* round to integral */
++static MemTxResult ras_read(void *opaque, hwaddr addr,
-+float16 HELPER(advsimd_rinth_exact)(float16 x, void *fp_status)
++                            uint64_t *data, unsigned size,
 +                            MemTxAttrs attrs)
 +{
-+    return float16_round_to_int(x, fp_status);
++    if (attrs.user) {
 +        return MEMTX_ERROR;
 +    }
 +
 +    switch (addr) {
 +    case 0xe10: /* ERRIIDR */
 +        /* architect field = Arm; product/variant/revision 0 */
 +        *data = 0x43b;
 +        break;
 +    case 0xfc8: /* ERRDEVID */
 +        /* Minimal RAS: we implement 0 error record indexes */
 +        *data = 0;
 +        break;
 +    default:
 +        qemu_log_mask(LOG_UNIMP, "Read RAS register offset 0x%x\n",
 +                      (uint32_t)addr);
 +        *data = 0;
 +        break;
 +    }
 +    return MEMTX_OK;
 +}
 +
-+float16 HELPER(advsimd_rinth)(float16 x, void *fp_status)
++static MemTxResult ras_write(void *opaque, hwaddr addr,
 +                             uint64_t value, unsigned size,
 +                             MemTxAttrs attrs)
 +{
-+    int old_flags = get_float_exception_flags(fp_status), new_flags;
++    if (attrs.user) {
-+    float16 ret;
++        return MEMTX_ERROR;
 +
 +    ret = float16_round_to_int(x, fp_status);
 +
 +    /* Suppress any inexact exceptions the conversion produced */
 +    if (!(old_flags & float_flag_inexact)) {
 +        new_flags = get_float_exception_flags(fp_status);
 +        set_float_exception_flags(new_flags & ~float_flag_inexact, fp_status);
 +    }
 +
-+    return ret;
++    switch (addr) {
 +    default:
 +        qemu_log_mask(LOG_UNIMP, "Write to RAS register offset 0x%x\n",
 +                      (uint32_t)addr);
 +        break;
 +    }
 +    return MEMTX_OK;
 +}
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
-+++ b/target/arm/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_two_reg_misc(DisasContext *s, uint32_t insn)
-  */
- static void disas_simd_two_reg_misc_fp16(DisasContext *s, uint32_t insn)
- {
--    int fpop, opcode, a;
-+    int fpop, opcode, a, u;
-+    int rn, rd;
-+    bool is_q;
-+    bool is_scalar;
-+    bool only_in_vector = false;
 +
-+    int pass;
++static const MemoryRegionOps ras_ops = {
-+    TCGv_i32 tcg_rmode = NULL;
++    .read_with_attrs = ras_read,
-+    TCGv_ptr tcg_fpstatus = NULL;
++    .write_with_attrs = ras_write,
-+    bool need_rmode = false;
++    .endianness = DEVICE_NATIVE_ENDIAN,
-+    int rmode;
++};
++
-     if (!arm_dc_feature(s, ARM_FEATURE_V8_FP16)) {
+ /*
-         unallocated_encoding(s);
+  * Unassigned portions of the PPB space are RAZ/WI for privileged
-         return;
+  * accesses, and fault for non-privileged accesses.
@@ -XXX,XX +XXX,XX @@ static void armv7m_nvic_realize(DeviceState *dev, Error **errp)
                                              &s->systick_ns_mem, 1);
      }
--    if (!fp_access_check(s)) {
++    if (cpu_isar_feature(aa32_ras, s->cpu)) {
--        return;
++        memory_region_init_io(&s->ras_mem, OBJECT(s),
--    }
++                              &ras_ops, s, "nvic_ras", 0x1000);
-+    rd = extract32(insn, 0, 5);
++        memory_region_add_subregion(&s->container, 0x5000, &s->ras_mem);
 +    rn = extract32(insn, 5, 5);
 -    opcode = extract32(insn, 12, 4);
      a = extract32(insn, 23, 1);
 +    u = extract32(insn, 29, 1);
 +    is_scalar = extract32(insn, 28, 1);
 +    is_q = extract32(insn, 30, 1);
 +
 +    opcode = extract32(insn, 12, 5);
      fpop = deposit32(opcode, 5, 1, a);
 +    fpop = deposit32(fpop, 6, 1, u);
      switch (fpop) {
 +    case 0x18: /* FRINTN */
 +        need_rmode = true;
 +        only_in_vector = true;
 +        rmode = FPROUNDING_TIEEVEN;
 +        break;
 +    case 0x19: /* FRINTM */
 +        need_rmode = true;
 +        only_in_vector = true;
 +        rmode = FPROUNDING_NEGINF;
 +        break;
 +    case 0x38: /* FRINTP */
 +        need_rmode = true;
 +        only_in_vector = true;
 +        rmode = FPROUNDING_POSINF;
 +        break;
 +    case 0x39: /* FRINTZ */
 +        need_rmode = true;
 +        only_in_vector = true;
 +        rmode = FPROUNDING_ZERO;
 +        break;
 +    case 0x58: /* FRINTA */
 +        need_rmode = true;
 +        only_in_vector = true;
 +        rmode = FPROUNDING_TIEAWAY;
 +        break;
 +    case 0x59: /* FRINTX */
 +    case 0x79: /* FRINTI */
 +        only_in_vector = true;
 +        /* current rounding mode */
 +        break;
      default:
          fprintf(stderr, "%s: insn %#04x fpop %#2x\n", __func__, insn, fpop);
          g_assert_not_reached();
      }
 +
 +    /* Check additional constraints for the scalar encoding */
 +    if (is_scalar) {
 +        if (!is_q) {
 +            unallocated_encoding(s);
 +            return;
 +        }
 +        /* FRINTxx is only in the vector form */
 +        if (only_in_vector) {
 +            unallocated_encoding(s);
 +            return;
 +        }
 +    }
 +
-+    if (!fp_access_check(s)) {
+     sysbus_init_mmio(SYS_BUS_DEVICE(dev), &s->container);
 +        return;
 +    }
 +
 +    if (need_rmode) {
 +        tcg_fpstatus = get_fpstatus_ptr(true);
 +    }
 +
 +    if (need_rmode) {
 +        tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
 +    }
 +
 +    if (is_scalar) {
 +        /* no operations yet */
 +    } else {
 +        for (pass = 0; pass < (is_q ? 8 : 4); pass++) {
 +            TCGv_i32 tcg_op = tcg_temp_new_i32();
 +            TCGv_i32 tcg_res = tcg_temp_new_i32();
 +
 +            read_vec_element_i32(s, tcg_op, rn, pass, MO_16);
 +
 +            switch (fpop) {
 +            case 0x18: /* FRINTN */
 +            case 0x19: /* FRINTM */
 +            case 0x38: /* FRINTP */
 +            case 0x39: /* FRINTZ */
 +            case 0x58: /* FRINTA */
 +            case 0x79: /* FRINTI */
 +                gen_helper_advsimd_rinth(tcg_res, tcg_op, tcg_fpstatus);
 +                break;
 +            case 0x59: /* FRINTX */
 +                gen_helper_advsimd_rinth_exact(tcg_res, tcg_op, tcg_fpstatus);
 +                break;
 +            default:
 +                g_assert_not_reached();
 +            }
 +
 +            write_vec_element_i32(s, tcg_res, rd, pass, MO_16);
 +
 +            tcg_temp_free_i32(tcg_res);
 +            tcg_temp_free_i32(tcg_op);
 +        }
 +
 +        clear_vec_high(s, is_q, rd);
 +    }
 +
 +    if (tcg_rmode) {
 +        gen_helper_set_rmode(tcg_rmode, tcg_rmode, tcg_fpstatus);
 +        tcg_temp_free_i32(tcg_rmode);
 +    }
 +
 +    if (tcg_fpstatus) {
 +        tcg_temp_free_ptr(tcg_fpstatus);
 +    }
  }
- /* AdvSIMD scalar x indexed element
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 15/42] arm/translate-a64: handle_3same_64 comment fix
+[PULL 36/36] hw/arm/armv7m: Correct typo in QOM object name
-From: Alex Bennée <alex.bennee@linaro.org>
+Correct a typo in the name we give the NVIC object.
-We do implement all the opcodes.
+Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20201119215617.29887-28-peter.maydell@linaro.org
 ---
  hw/arm/armv7m.c | 2 +-
 file changed, 1 insertion(+), 1 deletion(-)
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+diff --git a/hw/arm/armv7m.c b/hw/arm/armv7m.c
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Message-id: 20180227143852.11175-8-alex.bennee@linaro.org
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
  target/arm/translate-a64.c | 3 +--
 file changed, 1 insertion(+), 2 deletions(-)
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/hw/arm/armv7m.c
-+++ b/target/arm/translate-a64.c
++++ b/hw/arm/armv7m.c
-@@ -XXX,XX +XXX,XX @@ static void handle_3same_64(DisasContext *s, int opcode, bool u,
+@@ -XXX,XX +XXX,XX @@ static void armv7m_instance_init(Object *obj)
-     /* Handle 64x64->64 opcodes which are shared between the scalar
-      * and vector 3-same groups. We cover every opcode where size == 3
+     memory_region_init(&s->container, obj, "armv7m-container", UINT64_MAX);
-      * is valid in either the three-reg-same (integer, not pairwise)
--     * or scalar-three-reg-same groups. (Some opcodes are not yet
+-    object_initialize_child(obj, "nvnic", &s->nvic, TYPE_NVIC);
--     * implemented.)
++    object_initialize_child(obj, "nvic", &s->nvic, TYPE_NVIC);
-+     * or scalar-three-reg-same groups.
+     object_property_add_alias(obj, "num-irq",
-      */
+                               OBJECT(&s->nvic), "num-irq");
      TCGCond cond;
 --
-.16.2
+.20.1

-[Qemu-devel] [PULL 17/42] arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16
+Deleted patch
-From: Alex Bennée <alex.bennee@linaro.org>
-The fprintf is only there for debugging as the skeleton is added to,
-it will be removed once the skeleton is complete.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-10-alex.bennee@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/helper-a64.h    |  4 ++++
- target/arm/helper-a64.c    |  4 ++++
- target/arm/translate-a64.c | 28 ++++++++++++++++++++++++++++
-files changed, 36 insertions(+)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
-+++ b/target/arm/helper-a64.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_3(advsimd_maxh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
- DEF_HELPER_FLAGS_3(advsimd_minh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
- DEF_HELPER_FLAGS_3(advsimd_maxnumh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
- DEF_HELPER_FLAGS_3(advsimd_minnumh, TCG_CALL_NO_RWG, f16, f16, f16, ptr)
-+DEF_HELPER_3(advsimd_addh, f16, f16, f16, ptr)
-+DEF_HELPER_3(advsimd_subh, f16, f16, f16, ptr)
-+DEF_HELPER_3(advsimd_mulh, f16, f16, f16, ptr)
-+DEF_HELPER_3(advsimd_divh, f16, f16, f16, ptr)
-diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.c
-+++ b/target/arm/helper-a64.c
-@@ -XXX,XX +XXX,XX @@ float16 ADVSIMD_HELPER(name, h)(float16 a, float16 b, void *fpstp) \
-     return float16_ ## name(a, b, fpst);    \
- }
-+ADVSIMD_HALFOP(add)
-+ADVSIMD_HALFOP(sub)
-+ADVSIMD_HALFOP(mul)
-+ADVSIMD_HALFOP(div)
- ADVSIMD_HALFOP(min)
- ADVSIMD_HALFOP(max)
- ADVSIMD_HALFOP(minnum)
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
-+++ b/target/arm/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
-         read_vec_element_i32(s, tcg_op2, rm, pass, MO_16);
-         switch (fpopcode) {
-+        case 0x0: /* FMAXNM */
-+            gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x2: /* FADD */
-+            gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x6: /* FMAX */
-+            gen_helper_advsimd_maxh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x8: /* FMINNM */
-+            gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0xa: /* FSUB */
-+            gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0xe: /* FMIN */
-+            gen_helper_advsimd_minh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x13: /* FMUL */
-+            gen_helper_advsimd_mulh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x17: /* FDIV */
-+            gen_helper_advsimd_divh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-+        case 0x1a: /* FABD */
-+            gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            tcg_gen_andi_i32(tcg_res, tcg_res, 0x7fff);
-+            break;
-         default:
-             fprintf(stderr, "%s: insn %#04x, fpop %#2x @ %#" PRIx64 "\n",
-                     __func__, insn, fpopcode, s->pc);
---
-.16.2

-[Qemu-devel] [PULL 19/42] arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16
+Deleted patch
-From: Alex Bennée <alex.bennee@linaro.org>
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20180227143852.11175-12-alex.bennee@linaro.org
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/helper-a64.h    |  2 ++
- target/arm/helper-a64.c    | 24 ++++++++++++++++++++++++
- target/arm/translate-a64.c | 15 +++++++++++++++
-files changed, 41 insertions(+)
-diff --git a/target/arm/helper-a64.h b/target/arm/helper-a64.h
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.h
-+++ b/target/arm/helper-a64.h
-@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(advsimd_cge_f16, i32, f16, f16, ptr)
- DEF_HELPER_3(advsimd_cgt_f16, i32, f16, f16, ptr)
- DEF_HELPER_3(advsimd_acge_f16, i32, f16, f16, ptr)
- DEF_HELPER_3(advsimd_acgt_f16, i32, f16, f16, ptr)
-+DEF_HELPER_3(advsimd_mulxh, f16, f16, f16, ptr)
-+DEF_HELPER_4(advsimd_muladdh, f16, f16, f16, f16, ptr)
-diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper-a64.c
-+++ b/target/arm/helper-a64.c
-@@ -XXX,XX +XXX,XX @@ ADVSIMD_HALFOP(max)
- ADVSIMD_HALFOP(minnum)
- ADVSIMD_HALFOP(maxnum)
-+/* Data processing - scalar floating-point and advanced SIMD */
-+float16 HELPER(advsimd_mulxh)(float16 a, float16 b, void *fpstp)
-+{
-+    float_status *fpst = fpstp;
-+
-+    a = float16_squash_input_denormal(a, fpst);
-+    b = float16_squash_input_denormal(b, fpst);
-+
-+    if ((float16_is_zero(a) && float16_is_infinity(b)) ||
-+        (float16_is_infinity(a) && float16_is_zero(b))) {
-+        /* 2.0 with the sign bit set to sign(A) XOR sign(B) */
-+        return make_float16((1U << 14) |
-+                            ((float16_val(a) ^ float16_val(b)) & (1U << 15)));
-+    }
-+    return float16_mul(a, b, fpst);
-+}
-+
-+/* fused multiply-accumulate */
-+float16 HELPER(advsimd_muladdh)(float16 a, float16 b, float16 c, void *fpstp)
-+{
-+    float_status *fpst = fpstp;
-+    return float16_muladd(a, b, c, 0, fpst);
-+}
-+
- /*
-  * Floating point comparisons produce an integer result. Softfloat
-  * routines return float_relation types which we convert to the 0/-1
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
-+++ b/target/arm/translate-a64.c
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
-         case 0x0: /* FMAXNM */
-             gen_helper_advsimd_maxnumh(tcg_res, tcg_op1, tcg_op2, fpst);
-             break;
-+        case 0x1: /* FMLA */
-+            read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
-+            gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
-+                                       fpst);
-+            break;
-         case 0x2: /* FADD */
-             gen_helper_advsimd_addh(tcg_res, tcg_op1, tcg_op2, fpst);
-             break;
-+        case 0x3: /* FMULX */
-+            gen_helper_advsimd_mulxh(tcg_res, tcg_op1, tcg_op2, fpst);
-+            break;
-         case 0x4: /* FCMEQ */
-             gen_helper_advsimd_ceq_f16(tcg_res, tcg_op1, tcg_op2, fpst);
-             break;
-@@ -XXX,XX +XXX,XX @@ static void disas_simd_three_reg_same_fp16(DisasContext *s, uint32_t insn)
-         case 0x8: /* FMINNM */
-             gen_helper_advsimd_minnumh(tcg_res, tcg_op1, tcg_op2, fpst);
-             break;
-+        case 0x9: /* FMLS */
-+             /* As usual for ARM, separate negation for fused multiply-add */
-+            tcg_gen_xori_i32(tcg_op1, tcg_op1, 0x8000);
-+            read_vec_element_i32(s, tcg_res, rd, pass, MO_16);
-+            gen_helper_advsimd_muladdh(tcg_res, tcg_op1, tcg_op2, tcg_res,
-+                                       fpst);
-+            break;
-         case 0xa: /* FSUB */
-             gen_helper_advsimd_subh(tcg_res, tcg_op1, tcg_op2, fpst);
-             break;
---
-.16.2

Arm queue -- I have more stuff pending but I prefer to push
this first lot out and keep the pull below 50 patches.
Most of this is Alex's FP16 support work.

-- PMM

The following changes since commit 6697439794f72b3501ee16bb95d16854f9981421:

Merge remote-tracking branch 'remotes/kraxel/tags/usb-20180227-pull-request' into staging (2018-02-27 17:50:46 +0000)

are available in the Git repository at:

git://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20180301

for you to fetch changes up to c22e580c2ad1cccef582e1490e732f254d4ac064:

MAINTAINERS: Update my email address (2018-03-01 11:13:59 +0000)

----------------------------------------------------------------
target-arm queue:
 * update MAINTAINERS for Alistair's new email address
 * add Arm v8.2 FP16 arithmetic extension for linux-user
 * implement display connector emulation for vexpress board
 * xilinx_spips: Enable only two slaves when reading/writing with stripe
 * xilinx_spips: Use 8 dummy cycles with the QIOR/QIOR4 commands
 * hw: register: Run post_write hook on reset

----------------------------------------------------------------
Alex Bennée (31):
      include/exec/helper-head.h: support f16 in helper calls
      target/arm/cpu64: introduce ARM_V8_FP16 feature bit
      target/arm/cpu.h: update comment for half-precision values
      target/arm/cpu.h: add additional float_status flags
      target/arm/helper: pass explicit fpst to set_rmode
      arm/translate-a64: implement half-precision F(MIN|MAX)(V|NMV)
      arm/translate-a64: handle_3same_64 comment fix
      arm/translate-a64: initial decode for simd_three_reg_same_fp16
      arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16
      arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16
      arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16
      arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16
      arm/translate-a64: add FP16 pairwise ops simd_three_reg_same_fp16
      arm/translate-a64: add FP16 FMULX/MLS/FMLA to simd_indexed
      arm/translate-a64: add FP16 x2 ops for simd_indexed
      arm/translate-a64: initial decode for simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16
      arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 FCMxx (zero) to simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 FNEG/FABS to simd_two_reg_misc_fp16
      arm/helper.c: re-factor recpe and add recepe_f16
      arm/translate-a64: add FP16 FRECPE
      arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16
      arm/helper.c: re-factor rsqrte and add rsqrte_f16
      arm/translate-a64: add FP16 FRSQRTE to simd_two_reg_misc_fp16
      arm/translate-a64: add FP16 FMOV to simd_mod_imm
      arm/translate-a64: add all FP16 ops in simd_scalar_pairwise
      arm/translate-a64: implement simd_scalar_three_reg_same_fp16
      arm/translate-a64: add all single op FP16 to handle_fp_1src_half

Alistair Francis (2):
      hw: register: Run post_write hook on reset
      MAINTAINERS: Update my email address

Corey Minyard (2):
      i2c: Fix some brace style issues
      i2c: Move the bus class to i2c.h

Francisco Iglesias (2):
      xilinx_spips: Enable only two slaves when reading/writing with stripe
      xilinx_spips: Use 8 dummy cycles with the QIOR/QIOR4 commands

Linus Walleij (3):
      hw/i2c-ddc: Do not fail writes
      hw/sii9022: Add support for Silicon Image SII9022
      arm/vexpress: Add proper display connector emulation

Peter Maydell (2):
      target/arm: Enable ARM_V8_FP16 feature bit for the AArch64 "any" CPU
      linux-user: Report AArch64 FP16 support via hwcap bits

From: Alistair Francis <alistair.francis@xilinx.com>

Ensure that the post write hook is called during reset. This allows us
to rely on the post write functions instead of having to call them from
the reset() function.

Signed-off-by: Alistair Francis <alistair.francis@xilinx.com>
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Message-id: d131e24b911653a945e46ca2d8f90f572469e1dd.1517856214.git.alistair.francis@xilinx.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/hw/register.h | 6 +++---
 hw/core/register.c    | 8 ++++++++
 2 files changed, 11 insertions(+), 3 deletions(-)

diff --git a/include/hw/register.h b/include/hw/register.h
index XXXXXXX..XXXXXXX 100644
--- a/include/hw/register.h
+++ b/include/hw/register.h
@@ -XXX,XX +XXX,XX @@ typedef struct RegisterInfoArray RegisterInfoArray;
  * immediately before the actual write. The returned value is what is written,
  * giving the handler a chance to modify the written value.
  * @post_write: Post write callback. Passed the written value. Most write side
- * effects should be implemented here.
+ * effects should be implemented here. This is called during device reset.
  *
  * @post_read: Post read callback. Passes the value that is about to be returned
  * for a read. The return value from this function is what is ultimately read,
@@ -XXX,XX +XXX,XX @@ uint64_t register_read(RegisterInfo *reg, uint64_t re, const char* prefix,
                        bool debug);
 
 /**
- * reset a register
- * @reg: register to reset
+ * Resets a register. This will also call the post_write hook if it exists.
+ * @reg: The register to reset.
  */
 
 void register_reset(RegisterInfo *reg);
diff --git a/hw/core/register.c b/hw/core/register.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/core/register.c
+++ b/hw/core/register.c
@@ -XXX,XX +XXX,XX @@ uint64_t register_read(RegisterInfo *reg, uint64_t re, const char* prefix,
 
 void register_reset(RegisterInfo *reg)
 {
+    const RegisterAccessInfo *ac;
+
     g_assert(reg);
 
     if (!reg->data || !reg->access) {
         return;
     }
 
+    ac = reg->access;
+
     register_write_val(reg, reg->access->reset);
+
+    if (ac->post_write) {
+        ac->post_write(reg, reg->access->reset);
+    }
 }
 
 void register_init(RegisterInfo *reg)
-- 
2.16.2

From: Francisco Iglesias <frasse.iglesias@gmail.com>

Assert only the lower cs on bus 0 and upper cs on bus 1 when both buses and
chip selects are enabled (e.g reading/writing with stripe).

Signed-off-by: Francisco Iglesias <frasse.iglesias@gmail.com>
Reviewed-by: Alistair Francis <alistair.francis@xilinx.com>
Tested-by: Alistair Francis <alistair.francis@xilinx.com>
Message-id: 20180223232233.31482-2-frasse.iglesias@gmail.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/ssi/xilinx_spips.c | 41 +++++++++++++++++++++++++++++++++++++----
 1 file changed, 37 insertions(+), 4 deletions(-)

diff --git a/hw/ssi/xilinx_spips.c b/hw/ssi/xilinx_spips.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/ssi/xilinx_spips.c
+++ b/hw/ssi/xilinx_spips.c
@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs(XilinxSPIPS *s, int field)
 {
     int i;
 
-    for (i = 0; i < s->num_cs; i++) {
+    for (i = 0; i < s->num_cs * s->num_busses; i++) {
         bool old_state = s->cs_lines_state[i];
         bool new_state = field & (1 << i);
 
@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs(XilinxSPIPS *s, int field)
         }
         qemu_set_irq(s->cs_lines[i], !new_state);
     }
-    if (!(field & ((1 << s->num_cs) - 1))) {
+    if (!(field & ((1 << (s->num_cs * s->num_busses)) - 1))) {
         s->snoop_state = SNOOP_CHECKING;
         s->cmd_dummies = 0;
         s->link_state = 1;
@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_qspips_update_cs_lines(XlnxZynqMPQSPIPS *s)
 {
     if (s->regs[R_GQSPI_GF_SNAPSHOT]) {
         int field = ARRAY_FIELD_EX32(s->regs, GQSPI_GF_SNAPSHOT, CHIP_SELECT);
-        xilinx_spips_update_cs(XILINX_SPIPS(s), field);
+        bool upper_cs_sel = field & (1 << 1);
+        bool lower_cs_sel = field & 1;
+        bool bus0_enabled;
+        bool bus1_enabled;
+        uint8_t buses;
+        int cs = 0;
+
+        buses = ARRAY_FIELD_EX32(s->regs, GQSPI_GF_SNAPSHOT, DATA_BUS_SELECT);
+        bus0_enabled = buses & 1;
+        bus1_enabled = buses & (1 << 1);
+
+        if (bus0_enabled && bus1_enabled) {
+            if (lower_cs_sel) {
+                cs |= 1;
+            }
+            if (upper_cs_sel) {
+                cs |= 1 << 3;
+            }
+        } else if (bus0_enabled) {
+            if (lower_cs_sel) {
+                cs |= 1;
+            }
+            if (upper_cs_sel) {
+                cs |= 1 << 1;
+            }
+        } else if (bus1_enabled) {
+            if (lower_cs_sel) {
+                cs |= 1 << 2;
+            }
+            if (upper_cs_sel) {
+                cs |= 1 << 3;
+            }
+        }
+        xilinx_spips_update_cs(XILINX_SPIPS(s), cs);
     }
 }
 
@@ -XXX,XX +XXX,XX @@ static void xilinx_spips_update_cs_lines(XilinxSPIPS *s)
     if (num_effective_busses(s) == 2) {
         /* Single bit chip-select for qspi */
         field &= 0x1;
-        field |= field << 1;
+        field |= field << 3;
     /* Dual stack U-Page */
     } else if (s->regs[R_LQSPI_CFG] & LQSPI_CFG_TWO_MEM &&
                s->regs[R_LQSPI_STS] & LQSPI_CFG_U_PAGE) {
-- 
2.16.2

From: Corey Minyard <cminyard@mvista.com>

Signed-off-by: Corey Minyard <cminyard@mvista.com>
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
Message-id: 20180227104903.21353-2-linus.walleij@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/hw/i2c/i2c.h | 6 ++----
 hw/i2c/core.c        | 3 +--
 2 files changed, 3 insertions(+), 6 deletions(-)

diff --git a/include/hw/i2c/i2c.h b/include/hw/i2c/i2c.h
index XXXXXXX..XXXXXXX 100644
--- a/include/hw/i2c/i2c.h
+++ b/include/hw/i2c/i2c.h
@@ -XXX,XX +XXX,XX @@ typedef struct I2CSlave I2CSlave;
 #define I2C_SLAVE_GET_CLASS(obj) \
      OBJECT_GET_CLASS(I2CSlaveClass, (obj), TYPE_I2C_SLAVE)
 
-typedef struct I2CSlaveClass
-{
+typedef struct I2CSlaveClass {
     DeviceClass parent_class;
 
     /* Callbacks provided by the device.  */
@@ -XXX,XX +XXX,XX @@ typedef struct I2CSlaveClass
     int (*event)(I2CSlave *s, enum i2c_event event);
 } I2CSlaveClass;
 
-struct I2CSlave
-{
+struct I2CSlave {
     DeviceState qdev;
 
     /* Remaining fields for internal use by the I2C code.  */
diff --git a/hw/i2c/core.c b/hw/i2c/core.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/i2c/core.c
+++ b/hw/i2c/core.c
@@ -XXX,XX +XXX,XX @@ struct I2CNode {
 
 #define I2C_BROADCAST 0x00
 
-struct I2CBus
-{
+struct I2CBus {
     BusState qbus;
     QLIST_HEAD(, I2CNode) current_devs;
     uint8_t saved_address;
-- 
2.16.2

From: Corey Minyard <cminyard@mvista.com>

Some devices need access to it.

Signed-off-by: Corey Minyard <cminyard@mvista.com>
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
Message-id: 20180227104903.21353-3-linus.walleij@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/hw/i2c/i2c.h | 17 +++++++++++++++++
 hw/i2c/core.c        | 17 -----------------
 2 files changed, 17 insertions(+), 17 deletions(-)

diff --git a/include/hw/i2c/i2c.h b/include/hw/i2c/i2c.h
index XXXXXXX..XXXXXXX 100644
--- a/include/hw/i2c/i2c.h
+++ b/include/hw/i2c/i2c.h
@@ -XXX,XX +XXX,XX @@ struct I2CSlave {
     uint8_t address;
 };
 
+#define TYPE_I2C_BUS "i2c-bus"
+#define I2C_BUS(obj) OBJECT_CHECK(I2CBus, (obj), TYPE_I2C_BUS)
+
+typedef struct I2CNode I2CNode;
+
+struct I2CNode {
+    I2CSlave *elt;
+    QLIST_ENTRY(I2CNode) next;
+};
+
+struct I2CBus {
+    BusState qbus;
+    QLIST_HEAD(, I2CNode) current_devs;
+    uint8_t saved_address;
+    bool broadcast;
+};
+
 I2CBus *i2c_init_bus(DeviceState *parent, const char *name);
 void i2c_set_slave_address(I2CSlave *dev, uint8_t address);
 int i2c_bus_busy(I2CBus *bus);
diff --git a/hw/i2c/core.c b/hw/i2c/core.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/i2c/core.c
+++ b/hw/i2c/core.c
@@ -XXX,XX +XXX,XX @@
 #include "qemu/osdep.h"
 #include "hw/i2c/i2c.h"
 
-typedef struct I2CNode I2CNode;
-
-struct I2CNode {
-    I2CSlave *elt;
-    QLIST_ENTRY(I2CNode) next;
-};
-
 #define I2C_BROADCAST 0x00
 
-struct I2CBus {
-    BusState qbus;
-    QLIST_HEAD(, I2CNode) current_devs;
-    uint8_t saved_address;
-    bool broadcast;
-};
-
 static Property i2c_props[] = {
     DEFINE_PROP_UINT8("address", struct I2CSlave, address, 0),
     DEFINE_PROP_END_OF_LIST(),
 };
 
-#define TYPE_I2C_BUS "i2c-bus"
-#define I2C_BUS(obj) OBJECT_CHECK(I2CBus, (obj), TYPE_I2C_BUS)
-
 static const TypeInfo i2c_bus_info = {
     .name = TYPE_I2C_BUS,
     .parent = TYPE_BUS,
-- 
2.16.2

From: Linus Walleij <linus.walleij@linaro.org>

The tx function of the DDC I2C slave emulation was returning 1
on all writes resulting in NACK in the I2C bus. Changing it to
0 makes the DDC I2C work fine with bit-banged I2C such as the
versatile I2C.

I guess it was not affecting whatever I2C controller this was
used with until now, but with the Versatile I2C it surely
does not work.

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
Message-id: 20180227104903.21353-4-linus.walleij@linaro.org
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/i2c/i2c-ddc.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/hw/i2c/i2c-ddc.c b/hw/i2c/i2c-ddc.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/i2c/i2c-ddc.c
+++ b/hw/i2c/i2c-ddc.c
@@ -XXX,XX +XXX,XX @@ static int i2c_ddc_tx(I2CSlave *i2c, uint8_t data)
         s->reg = data;
         s->firstbyte = false;
         DPRINTF("[EDID] Written new pointer: %u\n", data);
-        return 1;
+        return 0;
     }
 
     /* Ignore all writes */
     s->reg++;
-    return 1;
+    return 0;
 }
 
 static void i2c_ddc_init(Object *obj)
-- 
2.16.2

From: Linus Walleij <linus.walleij@linaro.org>

This adds support for emulating the Silicon Image SII9022 DVI/HDMI
bridge. It's not very clever right now, it just acknowledges
the switch into DDC I2C mode and back. Combining this with the
existing DDC I2C emulation gives the right behavior on the Versatile
Express emulation passing through the QEMU EDID to the emulated
platform.

Cc: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
Message-id: 20180227104903.21353-5-linus.walleij@linaro.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
[PMM: explictly reset ddc_req/ddc_skip_finish/ddc]
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/display/Makefile.objs |   1 +
 hw/display/sii9022.c     | 191 +++++++++++++++++++++++++++++++++++++++++++++++
 hw/display/trace-events  |   5 ++
 3 files changed, 197 insertions(+)
 create mode 100644 hw/display/sii9022.c

diff --git a/hw/display/Makefile.objs b/hw/display/Makefile.objs
index XXXXXXX..XXXXXXX 100644
--- a/hw/display/Makefile.objs
+++ b/hw/display/Makefile.objs
@@ -XXX,XX +XXX,XX @@ common-obj-$(CONFIG_VGA_CIRRUS) += cirrus_vga.o
 common-obj-$(CONFIG_G364FB) += g364fb.o
 common-obj-$(CONFIG_JAZZ_LED) += jazz_led.o
 common-obj-$(CONFIG_PL110) += pl110.o
+common-obj-$(CONFIG_SII9022) += sii9022.o
 common-obj-$(CONFIG_SSD0303) += ssd0303.o
 common-obj-$(CONFIG_SSD0323) += ssd0323.o
 common-obj-$(CONFIG_XEN) += xenfb.o
diff --git a/hw/display/sii9022.c b/hw/display/sii9022.c
new file mode 100644
index XXXXXXX..XXXXXXX
--- /dev/null
+++ b/hw/display/sii9022.c
@@ -XXX,XX +XXX,XX @@
+/*
+ * Silicon Image SiI9022
+ *
+ * This is a pretty hollow emulation: all we do is acknowledge that we
+ * exist (chip ID) and confirm that we get switched over into DDC mode
+ * so the emulated host can proceed to read out EDID data. All subsequent
+ * set-up of connectors etc will be acknowledged and ignored.
+ *
+ * Copyright (C) 2018 Linus Walleij
+ *
+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
+ * See the COPYING file in the top-level directory.
+ * SPDX-License-Identifier: GPL-2.0-or-later
+ */
+
+#include "qemu/osdep.h"
+#include "qemu-common.h"
+#include "hw/i2c/i2c.h"
+#include "hw/i2c/i2c-ddc.h"
+#include "trace.h"
+
+#define SII9022_SYS_CTRL_DATA 0x1a
+#define SII9022_SYS_CTRL_PWR_DWN 0x10
+#define SII9022_SYS_CTRL_AV_MUTE 0x08
+#define SII9022_SYS_CTRL_DDC_BUS_REQ 0x04
+#define SII9022_SYS_CTRL_DDC_BUS_GRTD 0x02
+#define SII9022_SYS_CTRL_OUTPUT_MODE 0x01
+#define SII9022_SYS_CTRL_OUTPUT_HDMI 1
+#define SII9022_SYS_CTRL_OUTPUT_DVI 0
+#define SII9022_REG_CHIPID 0x1b
+#define SII9022_INT_ENABLE 0x3c
+#define SII9022_INT_STATUS 0x3d
+#define SII9022_INT_STATUS_HOTPLUG 0x01;
+#define SII9022_INT_STATUS_PLUGGED 0x04;
+
+#define TYPE_SII9022 "sii9022"
+#define SII9022(obj) OBJECT_CHECK(sii9022_state, (obj), TYPE_SII9022)
+
+typedef struct sii9022_state {
+    I2CSlave parent_obj;
+    uint8_t ptr;
+    bool addr_byte;
+    bool ddc_req;
+    bool ddc_skip_finish;
+    bool ddc;
+} sii9022_state;
+
+static const VMStateDescription vmstate_sii9022 = {
+    .name = "sii9022",
+    .version_id = 1,
+    .minimum_version_id = 1,
+    .fields = (VMStateField[]) {
+        VMSTATE_I2C_SLAVE(parent_obj, sii9022_state),
+        VMSTATE_UINT8(ptr, sii9022_state),
+        VMSTATE_BOOL(addr_byte, sii9022_state),
+        VMSTATE_BOOL(ddc_req, sii9022_state),
+        VMSTATE_BOOL(ddc_skip_finish, sii9022_state),
+        VMSTATE_BOOL(ddc, sii9022_state),
+        VMSTATE_END_OF_LIST()
+    }
+};
+
+static int sii9022_event(I2CSlave *i2c, enum i2c_event event)
+{
+    sii9022_state *s = SII9022(i2c);
+
+    switch (event) {
+    case I2C_START_SEND:
+        s->addr_byte = true;
+        break;
+    case I2C_START_RECV:
+        break;
+    case I2C_FINISH:
+        break;
+    case I2C_NACK:
+        break;
+    }
+
+    return 0;
+}
+
+static int sii9022_rx(I2CSlave *i2c)
+{
+    sii9022_state *s = SII9022(i2c);
+    uint8_t res = 0x00;
+
+    switch (s->ptr) {
+    case SII9022_SYS_CTRL_DATA:
+        if (s->ddc_req) {
+            /* Acknowledge DDC bus request */
+            res = SII9022_SYS_CTRL_DDC_BUS_GRTD | SII9022_SYS_CTRL_DDC_BUS_REQ;
+        }
+        break;
+    case SII9022_REG_CHIPID:
+        res = 0xb0;
+        break;
+    case SII9022_INT_STATUS:
+        /* Something is cold-plugged in, no interrupts */
+        res = SII9022_INT_STATUS_PLUGGED;
+        break;
+    default:
+        break;
+    }
+
+    trace_sii9022_read_reg(s->ptr, res);
+    s->ptr++;
+
+    return res;
+}
+
+static int sii9022_tx(I2CSlave *i2c, uint8_t data)
+{
+    sii9022_state *s = SII9022(i2c);
+
+    if (s->addr_byte) {
+        s->ptr = data;
+        s->addr_byte = false;
+        return 0;
+    }
+
+    switch (s->ptr) {
+    case SII9022_SYS_CTRL_DATA:
+        if (data & SII9022_SYS_CTRL_DDC_BUS_REQ) {
+            s->ddc_req = true;
+            if (data & SII9022_SYS_CTRL_DDC_BUS_GRTD) {
+                s->ddc = true;
+                /* Skip this finish since we just switched to DDC */
+                s->ddc_skip_finish = true;
+                trace_sii9022_switch_mode("DDC");
+            }
+        } else {
+            s->ddc_req = false;
+            s->ddc = false;
+            trace_sii9022_switch_mode("normal");
+        }
+        break;
+    default:
+        break;
+    }
+
+    trace_sii9022_write_reg(s->ptr, data);
+    s->ptr++;
+
+    return 0;
+}
+
+static void sii9022_reset(DeviceState *dev)
+{
+    sii9022_state *s = SII9022(dev);
+
+    s->ptr = 0;
+    s->addr_byte = false;
+    s->ddc_req = false;
+    s->ddc_skip_finish = false;
+    s->ddc = false;
+}
+
+static void sii9022_realize(DeviceState *dev, Error **errp)
+{
+    I2CBus *bus;
+
+    bus = I2C_BUS(qdev_get_parent_bus(dev));
+    i2c_create_slave(bus, TYPE_I2CDDC, 0x50);
+}
+
+static void sii9022_class_init(ObjectClass *klass, void *data)
+{
+    DeviceClass *dc = DEVICE_CLASS(klass);
+    I2CSlaveClass *k = I2C_SLAVE_CLASS(klass);
+
+    k->event = sii9022_event;
+    k->recv = sii9022_rx;
+    k->send = sii9022_tx;
+    dc->reset = sii9022_reset;
+    dc->realize = sii9022_realize;
+    dc->vmsd = &vmstate_sii9022;
+}
+
+static const TypeInfo sii9022_info = {
+    .name          = TYPE_SII9022,
+    .parent        = TYPE_I2C_SLAVE,
+    .instance_size = sizeof(sii9022_state),
+    .class_init    = sii9022_class_init,
+};
+
+static void sii9022_register_types(void)
+{
+    type_register_static(&sii9022_info);
+}
+
+type_init(sii9022_register_types)
diff --git a/hw/display/trace-events b/hw/display/trace-events
index XXXXXXX..XXXXXXX 100644
--- a/hw/display/trace-events
+++ b/hw/display/trace-events
@@ -XXX,XX +XXX,XX @@ vga_cirrus_read_io(uint32_t addr, uint32_t val) "addr 0x%x, val 0x%x"
 vga_cirrus_write_io(uint32_t addr, uint32_t val) "addr 0x%x, val 0x%x"
 vga_cirrus_read_blt(uint32_t offset, uint32_t val) "offset 0x%x, val 0x%x"
 vga_cirrus_write_blt(uint32_t offset, uint32_t val) "offset 0x%x, val 0x%x"
+
+# hw/display/sii9022.c
+sii9022_read_reg(uint8_t addr, uint8_t val) "addr 0x%02x, val 0x%02x"
+sii9022_write_reg(uint8_t addr, uint8_t val) "addr 0x%02x, val 0x%02x"
+sii9022_switch_mode(const char *mode) "mode: %s"
-- 
2.16.2

From: Linus Walleij <linus.walleij@linaro.org>

This adds the SiI9022 (and implicitly EDID I2C) device to the ARM
Versatile Express machine, and selects the two I2C devices necessary
in the arm-softmmu.mak configuration so everything will build
smoothly.

I am implementing proper handling of the graphics in the Linux
kernel and adding proper emulation of SiI9022 and EDID makes the
driver probe as nicely as before, retrieving the resolutions
supported by the "QEMU monitor" and overall just working nice.

Cc: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
Message-id: 20180227104903.21353-6-linus.walleij@linaro.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/vexpress.c               | 6 +++++-
 default-configs/arm-softmmu.mak | 2 ++
 2 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/hw/arm/vexpress.c b/hw/arm/vexpress.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/vexpress.c
+++ b/hw/arm/vexpress.c
@@ -XXX,XX +XXX,XX @@
 #include "hw/arm/arm.h"
 #include "hw/arm/primecell.h"
 #include "hw/devices.h"
+#include "hw/i2c/i2c.h"
 #include "net/net.h"
 #include "sysemu/sysemu.h"
 #include "hw/boards.h"
@@ -XXX,XX +XXX,XX @@ static void vexpress_common_init(MachineState *machine)
     uint32_t sys_id;
     DriveInfo *dinfo;
     pflash_t *pflash0;
+    I2CBus *i2c;
     ram_addr_t vram_size, sram_size;
     MemoryRegion *sysmem = get_system_memory();
     MemoryRegion *vram = g_new(MemoryRegion, 1);
@@ -XXX,XX +XXX,XX @@ static void vexpress_common_init(MachineState *machine)
     sysbus_create_simple("sp804", map[VE_TIMER01], pic[2]);
     sysbus_create_simple("sp804", map[VE_TIMER23], pic[3]);
 
-    /* VE_SERIALDVI: not modelled */
+    dev = sysbus_create_simple("versatile_i2c", map[VE_SERIALDVI], NULL);
+    i2c = (I2CBus *)qdev_get_child_bus(dev, "i2c");
+    i2c_create_slave(i2c, "sii9022", 0x39);
 
     sysbus_create_simple("pl031", map[VE_RTC], pic[4]); /* RTC */
 
diff --git a/default-configs/arm-softmmu.mak b/default-configs/arm-softmmu.mak
index XXXXXXX..XXXXXXX 100644
--- a/default-configs/arm-softmmu.mak
+++ b/default-configs/arm-softmmu.mak
@@ -XXX,XX +XXX,XX @@ CONFIG_STELLARIS_INPUT=y
 CONFIG_STELLARIS_ENET=y
 CONFIG_SSD0303=y
 CONFIG_SSD0323=y
+CONFIG_DDC=y
+CONFIG_SII9022=y
 CONFIG_ADS7846=y
 CONFIG_MAX111X=y
 CONFIG_SSI=y
-- 
2.16.2