Series comparison

-[Qemu-devel] [PULL 00/21] target-arm queue
+[PULL 00/11] target-arm queue
-target-arm queue: this time around is all small fixes
+target-arm queue: two bug fixes, plus the KVM/SVE patchset,
-and changes.
+which is a new feature but one which was in my pre-softfreeze
 pullreq (it just had to be dropped due to an unexpected test failure.)
 thanks
 -- PMM
-The following changes since commit fec105c2abda8567ec15230429c41429b5ee307c:
+The following changes since commit b7c9a7f353c0e260519bf735ff0d4aa01e72784b:
-  Merge remote-tracking branch 'remotes/kraxel/tags/audio-20190828-pull-request' into staging (2019-09-03 14:03:15 +0100)
+  Merge remote-tracking branch 'remotes/jnsnow/tags/ide-pull-request' into staging (2019-10-31 15:57:30 +0000)
 are available in the Git repository at:
-  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20190903
+  https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20191101-1
-for you to fetch changes up to 5e5584c89f36b302c666bc6db535fd3f7ff35ad2:
+for you to fetch changes up to d9ae7624b659362cb2bb2b04fee53bf50829ca56:
-  target/arm: Don't abort on M-profile exception return in linux-user mode (2019-09-03 16:20:35 +0100)
+  target/arm: Allow reading flags from FPSCR for M-profile (2019-11-01 08:49:10 +0000)
 ----------------------------------------------------------------
 target-arm queue:
- * Revert and correctly fix refactoring of unallocated_encoding()
+ * Support SVE in KVM guests
- * Take exceptions on ATS instructions when needed
+ * Don't UNDEF on M-profile 'vmrs apsr_nzcv, fpscr'
- * aspeed/timer: Provide back-pressure information for short periods
+ * Update hflags after boot.c modifies CPU state
  * memory: Remove unused memory_region_iommu_replay_all()
  * hw/arm/smmuv3: Log a guest error when decoding an invalid STE
  * hw/arm/smmuv3: Remove spurious error messages on IOVA invalidations
  * target/arm: Fix SMMLS argument order
  * hw/arm: Use ARM_CPU_TYPE_NAME() macro when appropriate
  * hw/arm: Correct reference counting for creation of various objects
  * includes: remove stale [smp|max]_cpus externs
  * tcg/README: fix typo
  * atomic_template: fix indentation in GEN_ATOMIC_HELPER
  * include/exec/cpu-defs.h: fix typo
  * target/arm: Free TCG temps in trans_VMOV_64_sp()
  * target/arm: Don't abort on M-profile exception return in linux-user mode
 ----------------------------------------------------------------
-Alex Bennée (2):
+Andrew Jones (9):
-      includes: remove stale [smp|max]_cpus externs
+      target/arm/monitor: Introduce qmp_query_cpu_model_expansion
-      include/exec/cpu-defs.h: fix typo
+      tests: arm: Introduce cpu feature tests
       target/arm: Allow SVE to be disabled via a CPU property
       target/arm/cpu64: max cpu: Introduce sve<N> properties
       target/arm/kvm64: Add kvm_arch_get/put_sve
       target/arm/kvm64: max cpu: Enable SVE when available
       target/arm/kvm: scratch vcpu: Preserve input kvm_vcpu_init features
       target/arm/cpu64: max cpu: Support sve properties with KVM
       target/arm/kvm: host cpu: Add support for sve<N> properties
-Andrew Jeffery (1):
+Christophe Lyon (1):
-      aspeed/timer: Provide back-pressure information for short periods
+      target/arm: Allow reading flags from FPSCR for M-profile
-Emilio G. Cota (2):
+Edgar E. Iglesias (1):
-      tcg/README: fix typo s/afterwise/afterwards/
+      hw/arm/boot: Rebuild hflags when modifying CPUState at boot
       atomic_template: fix indentation in GEN_ATOMIC_HELPER
-Eric Auger (3):
+ tests/Makefile.include         |   5 +-
-      memory: Remove unused memory_region_iommu_replay_all()
+ qapi/machine-target.json       |   6 +-
-      hw/arm/smmuv3: Log a guest error when decoding an invalid STE
+ include/qemu/bitops.h          |   1 +
-      hw/arm/smmuv3: Remove spurious error messages on IOVA invalidations
+ target/arm/cpu.h               |  21 ++
  target/arm/kvm_arm.h           |  39 +++
  hw/arm/boot.c                  |   1 +
  target/arm/cpu.c               |  25 +-
  target/arm/cpu64.c             | 364 +++++++++++++++++++++++++--
  target/arm/helper.c            |  10 +-
  target/arm/kvm.c               |  25 +-
  target/arm/kvm32.c             |   6 +-
  target/arm/kvm64.c             | 325 +++++++++++++++++++++---
  target/arm/monitor.c           | 158 ++++++++++++
  target/arm/translate-vfp.inc.c |   5 +-
  tests/arm-cpu-features.c       | 551 +++++++++++++++++++++++++++++++++++++++++
  docs/arm-cpu-features.rst      | 317 ++++++++++++++++++++++++
 files changed, 1795 insertions(+), 64 deletions(-)
  create mode 100644 tests/arm-cpu-features.c
  create mode 100644 docs/arm-cpu-features.rst
-Peter Maydell (4):
-      target/arm: Allow ARMCPRegInfo read/write functions to throw exceptions
-      target/arm: Take exceptions on ATS instructions when needed
-      target/arm: Free TCG temps in trans_VMOV_64_sp()
-      target/arm: Don't abort on M-profile exception return in linux-user mode
-Philippe Mathieu-Daudé (6):
-      hw/arm: Use ARM_CPU_TYPE_NAME() macro when appropriate
-      hw/arm: Use object_initialize_child for correct reference counting
-      hw/arm: Use sysbus_init_child_obj for correct reference counting
-      hw/arm/fsl-imx: Add the cpu as child of the SoC object
-      hw/dma/xilinx_axi: Use object_initialize_child for correct ref. counting
-      hw/net/xilinx_axi: Use object_initialize_child for correct ref. counting
-Richard Henderson (3):
-      Revert "target/arm: Use unallocated_encoding for aarch32"
-      target/arm: Factor out unallocated_encoding for aarch32
-      target/arm: Fix SMMLS argument order
- accel/tcg/atomic_template.h    |   2 +-
- hw/arm/smmuv3-internal.h       |   1 +
- include/exec/cpu-defs.h        |   2 +-
- include/exec/memory.h          |  10 ----
- include/sysemu/sysemu.h        |   2 -
- target/arm/cpu.h               |   6 ++-
- target/arm/translate-a64.h     |   2 +
- target/arm/translate.h         |   2 -
- hw/arm/allwinner-a10.c         |   3 +-
- hw/arm/cubieboard.c            |   3 +-
- hw/arm/digic.c                 |   3 +-
- hw/arm/exynos4_boards.c        |   4 +-
- hw/arm/fsl-imx25.c             |   4 +-
- hw/arm/fsl-imx31.c             |   4 +-
- hw/arm/fsl-imx6.c              |   3 +-
- hw/arm/fsl-imx6ul.c            |   2 +-
- hw/arm/mcimx7d-sabre.c         |   9 ++--
- hw/arm/mps2-tz.c               |  15 +++---
- hw/arm/musca.c                 |   9 ++--
- hw/arm/smmuv3.c                |  18 ++++---
- hw/arm/xlnx-zynqmp.c           |   8 +--
- hw/dma/xilinx_axidma.c         |  16 +++---
- hw/net/xilinx_axienet.c        |  17 +++----
- hw/timer/aspeed_timer.c        |  17 ++++++-
- memory.c                       |   9 ----
- target/arm/helper.c            | 107 +++++++++++++++++++++++++++++++++++------
- target/arm/translate-a64.c     |  13 +++++
- target/arm/translate-vfp.inc.c |   2 +
- target/arm/translate.c         |  50 +++++++++++++++++--
- tcg/README                     |   2 +-
-files changed, 244 insertions(+), 101 deletions(-)

-[Qemu-devel] [PULL 21/21] target/arm: Don't abort on M-profile exception return in linux-user mode
+[PULL 01/11] target/arm/monitor: Introduce qmp_query_cpu_model_expansion
-An attempt to do an exception-return (branch to one of the magic
+From: Andrew Jones <drjones@redhat.com>
-addresses) in linux-user mode for M-profile should behave like
-a normal branch, because linux-user mode is always going to be
+Add support for the query-cpu-model-expansion QMP command to Arm. We
-in 'handler' mode. This used to work, but we broke it when we added
+do this selectively, only exposing CPU properties which represent
-support for the M-profile security extension in commit d02a8698d7ae2bfed.
+optional CPU features which the user may want to enable/disable.
+Additionally we restrict the list of queryable cpu models to 'max',
-In that commit we allowed even handler-mode calls to magic return
+'host', or the current type when KVM is in use. And, finally, we only
-values to be checked for and dealt with by causing an
+implement expansion type 'full', as Arm does not yet have a "base"
-EXCP_EXCEPTION_EXIT exception to be taken, because this is
+CPU type. More details and example queries are described in a new
-needed for the FNC_RETURN return-from-non-secure-function-call
+document (docs/arm-cpu-features.rst).
-handling. For system mode we added a check in do_v7m_exception_exit()
-to make any spurious calls from Handler mode behave correctly, but
+Note, certainly more features may be added to the list of advertised
-forgot that linux-user mode would also be affected.
+features, e.g. 'vfp' and 'neon'. The only requirement is that we can
+detect invalid configurations and emit failures at QMP query time.
-How an attempted return-from-non-secure-function-call in linux-user
+For 'vfp' and 'neon' this will require some refactoring to share a
-mode should be handled is not clear -- on real hardware it would
+validation function between the QMP query and the CPU realize
-result in return to secure code (not to the Linux kernel) which
+functions.
-could then handle the error in any way it chose. For QEMU we take
-the simple approach of treating this erroneous return the same way
+Signed-off-by: Andrew Jones <drjones@redhat.com>
 it would be handled on a CPU without the security extensions --
 treat it as a normal branch.
 The upshot of all this is that for linux-user mode we should never
 do any of the bx_excret magic, so the code change is simple.
 This ought to be a weird corner case that only affects broken guest
 code (because Linux user processes should never be attempting to do
 exception returns or NS function returns), except that the code that
 assigns addresses in RAM for the process and stack in our linux-user
 code does not attempt to avoid this magic address range, so
 legitimate code attempting to return to a trampoline routine on the
 stack can fall into this case. This change fixes those programs,
 but we should also look at restricting the range of memory we
 use for M-profile linux-user guests to the area that would be
 real RAM in hardware.
 Cc: qemu-stable@nongnu.org
 Reported-by: Christophe Lyon <christophe.lyon@linaro.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 20190822131534.16602-1-peter.maydell@linaro.org
+Reviewed-by: Beata Michalska <beata.michalska@linaro.org>
-Fixes: https://bugs.launchpad.net/qemu/+bug/1840922
+Message-id: 20191031142734.8590-2-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate.c | 21 ++++++++++++++++++++-
+ qapi/machine-target.json  |   6 +-
-file changed, 20 insertions(+), 1 deletion(-)
+ target/arm/monitor.c      | 146 ++++++++++++++++++++++++++++++++++++++
+ docs/arm-cpu-features.rst | 137 +++++++++++++++++++++++++++++++++++
-diff --git a/target/arm/translate.c b/target/arm/translate.c
+files changed, 286 insertions(+), 3 deletions(-)
  create mode 100644 docs/arm-cpu-features.rst
 diff --git a/qapi/machine-target.json b/qapi/machine-target.json
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.c
+--- a/qapi/machine-target.json
-+++ b/target/arm/translate.c
++++ b/qapi/machine-target.json
-@@ -XXX,XX +XXX,XX @@ static inline void gen_bx(DisasContext *s, TCGv_i32 var)
+@@ -XXX,XX +XXX,XX @@
-     store_cpu_field(var, thumb);
+ ##
  { 'struct': 'CpuModelExpansionInfo',
    'data': { 'model': 'CpuModelInfo' },
 -  'if': 'defined(TARGET_S390X) || defined(TARGET_I386)' }
 +  'if': 'defined(TARGET_S390X) || defined(TARGET_I386) || defined(TARGET_ARM)' }
  ##
  # @query-cpu-model-expansion:
@@ -XXX,XX +XXX,XX @@
  #   query-cpu-model-expansion while using these is not advised.
  #
  # Some architectures may not support all expansion types. s390x supports
 -# "full" and "static".
 +# "full" and "static". Arm only supports "full".
  #
  # Returns: a CpuModelExpansionInfo. Returns an error if expanding CPU models is
  #          not supported, if the model cannot be expanded, if the model contains
@@ -XXX,XX +XXX,XX @@
    'data': { 'type': 'CpuModelExpansionType',
              'model': 'CpuModelInfo' },
    'returns': 'CpuModelExpansionInfo',
 -  'if': 'defined(TARGET_S390X) || defined(TARGET_I386)' }
 +  'if': 'defined(TARGET_S390X) || defined(TARGET_I386) || defined(TARGET_ARM)' }
  ##
  # @CpuDefinitionInfo:
 diff --git a/target/arm/monitor.c b/target/arm/monitor.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/monitor.c
 +++ b/target/arm/monitor.c
@@ -XXX,XX +XXX,XX @@
   */
  #include "qemu/osdep.h"
 +#include "hw/boards.h"
  #include "kvm_arm.h"
 +#include "qapi/error.h"
 +#include "qapi/visitor.h"
 +#include "qapi/qobject-input-visitor.h"
 +#include "qapi/qapi-commands-machine-target.h"
  #include "qapi/qapi-commands-misc-target.h"
 +#include "qapi/qmp/qerror.h"
 +#include "qapi/qmp/qdict.h"
 +#include "qom/qom-qobject.h"
  static GICCapability *gic_cap_new(int version)
  {
@@ -XXX,XX +XXX,XX @@ GICCapabilityList *qmp_query_gic_capabilities(Error **errp)
      return head;
  }
++
 -/* Set PC and Thumb state from var. var is marked as dead.
 +/*
-+ * Set PC and Thumb state from var. var is marked as dead.
++ * These are cpu model features we want to advertise. The order here
-  * For M-profile CPUs, include logic to detect exception-return
++ * matters as this is the order in which qmp_query_cpu_model_expansion
-  * branches and handle them. This is needed for Thumb POP/LDM to PC, LDR to PC,
++ * will attempt to set them. If there are dependencies between features,
-  * and BX reg, and no others, and happens only for code in Handler mode.
++ * then the order that considers those dependencies must be used.
-+ * The Security Extension also requires us to check for the FNC_RETURN
++ */
-+ * which signals a function return from non-secure state; this can happen
++static const char *cpu_model_advertised_features[] = {
-+ * in both Handler and Thread mode.
++    "aarch64", "pmu",
-+ * To avoid having to do multiple comparisons in inline generated code,
++    NULL
-+ * we make the check we do here loose, so it will match for EXC_RETURN
++};
-+ * in Thread mode. For system emulation do_v7m_exception_exit() checks
++
-+ * for these spurious cases and returns without doing anything (giving
++CpuModelExpansionInfo *qmp_query_cpu_model_expansion(CpuModelExpansionType type,
-+ * the same behaviour as for a branch to a non-magic address).
++                                                     CpuModelInfo *model,
-+ *
++                                                     Error **errp)
-+ * In linux-user mode it is unclear what the right behaviour for an
++{
-+ * attempted FNC_RETURN should be, because in real hardware this will go
++    CpuModelExpansionInfo *expansion_info;
-+ * directly to Secure code (ie not the Linux kernel) which will then treat
++    const QDict *qdict_in = NULL;
-+ * the error in any way it chooses. For QEMU we opt to make the FNC_RETURN
++    QDict *qdict_out;
-+ * attempt behave the way it would on a CPU without the security extension,
++    ObjectClass *oc;
-+ * which is to say "like a normal branch". That means we can simply treat
++    Object *obj;
-+ * all branches as normal with no magic address behaviour.
++    const char *name;
-  */
++    int i;
- static inline void gen_bx_excret(DisasContext *s, TCGv_i32 var)
++
- {
++    if (type != CPU_MODEL_EXPANSION_TYPE_FULL) {
-@@ -XXX,XX +XXX,XX @@ static inline void gen_bx_excret(DisasContext *s, TCGv_i32 var)
++        error_setg(errp, "The requested expansion type is not supported");
-      * s->base.is_jmp that we need to do the rest of the work later.
++        return NULL;
-      */
++    }
-     gen_bx(s, var);
++
-+#ifndef CONFIG_USER_ONLY
++    if (!kvm_enabled() && !strcmp(model->name, "host")) {
-     if (arm_dc_feature(s, ARM_FEATURE_M_SECURITY) ||
++        error_setg(errp, "The CPU type '%s' requires KVM", model->name);
-         (s->v7m_handler_mode && arm_dc_feature(s, ARM_FEATURE_M))) {
++        return NULL;
-         s->base.is_jmp = DISAS_BX_EXCRET;
++    }
-     }
++
-+#endif
++    oc = cpu_class_by_name(TYPE_ARM_CPU, model->name);
- }
++    if (!oc) {
++        error_setg(errp, "The CPU type '%s' is not a recognized ARM CPU type",
- static inline void gen_bx_excret_final_code(DisasContext *s)
++                   model->name);
 +        return NULL;
 +    }
 +
 +    if (kvm_enabled()) {
 +        const char *cpu_type = current_machine->cpu_type;
 +        int len = strlen(cpu_type) - strlen(ARM_CPU_TYPE_SUFFIX);
 +        bool supported = false;
 +
 +        if (!strcmp(model->name, "host") || !strcmp(model->name, "max")) {
 +            /* These are kvmarm's recommended cpu types */
 +            supported = true;
 +        } else if (strlen(model->name) == len &&
 +                   !strncmp(model->name, cpu_type, len)) {
 +            /* KVM is enabled and we're using this type, so it works. */
 +            supported = true;
 +        }
 +        if (!supported) {
 +            error_setg(errp, "We cannot guarantee the CPU type '%s' works "
 +                             "with KVM on this host", model->name);
 +            return NULL;
 +        }
 +    }
 +
 +    if (model->props) {
 +        qdict_in = qobject_to(QDict, model->props);
 +        if (!qdict_in) {
 +            error_setg(errp, QERR_INVALID_PARAMETER_TYPE, "props", "dict");
 +            return NULL;
 +        }
 +    }
 +
 +    obj = object_new(object_class_get_name(oc));
 +
 +    if (qdict_in) {
 +        Visitor *visitor;
 +        Error *err = NULL;
 +
 +        visitor = qobject_input_visitor_new(model->props);
 +        visit_start_struct(visitor, NULL, NULL, 0, &err);
 +        if (err) {
 +            visit_free(visitor);
 +            object_unref(obj);
 +            error_propagate(errp, err);
 +            return NULL;
 +        }
 +
 +        i = 0;
 +        while ((name = cpu_model_advertised_features[i++]) != NULL) {
 +            if (qdict_get(qdict_in, name)) {
 +                object_property_set(obj, visitor, name, &err);
 +                if (err) {
 +                    break;
 +                }
 +            }
 +        }
 +
 +        if (!err) {
 +            visit_check_struct(visitor, &err);
 +        }
 +        visit_end_struct(visitor, NULL);
 +        visit_free(visitor);
 +        if (err) {
 +            object_unref(obj);
 +            error_propagate(errp, err);
 +            return NULL;
 +        }
 +    }
 +
 +    expansion_info = g_new0(CpuModelExpansionInfo, 1);
 +    expansion_info->model = g_malloc0(sizeof(*expansion_info->model));
 +    expansion_info->model->name = g_strdup(model->name);
 +
 +    qdict_out = qdict_new();
 +
 +    i = 0;
 +    while ((name = cpu_model_advertised_features[i++]) != NULL) {
 +        ObjectProperty *prop = object_property_find(obj, name, NULL);
 +        if (prop) {
 +            Error *err = NULL;
 +            QObject *value;
 +
 +            assert(prop->get);
 +            value = object_property_get_qobject(obj, name, &err);
 +            assert(!err);
 +
 +            qdict_put_obj(qdict_out, name, value);
 +        }
 +    }
 +
 +    if (!qdict_size(qdict_out)) {
 +        qobject_unref(qdict_out);
 +    } else {
 +        expansion_info->model->props = QOBJECT(qdict_out);
 +        expansion_info->model->has_props = true;
 +    }
 +
 +    object_unref(obj);
 +
 +    return expansion_info;
 +}
 diff --git a/docs/arm-cpu-features.rst b/docs/arm-cpu-features.rst
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/docs/arm-cpu-features.rst
@@ -XXX,XX +XXX,XX @@
 +================
 +ARM CPU Features
 +================
 +
 +Examples of probing and using ARM CPU features
 +
 +Introduction
 +============
 +
 +CPU features are optional features that a CPU of supporting type may
 +choose to implement or not.  In QEMU, optional CPU features have
 +corresponding boolean CPU proprieties that, when enabled, indicate
 +that the feature is implemented, and, conversely, when disabled,
 +indicate that it is not implemented. An example of an ARM CPU feature
 +is the Performance Monitoring Unit (PMU).  CPU types such as the
 +Cortex-A15 and the Cortex-A57, which respectively implement ARM
 +architecture reference manuals ARMv7-A and ARMv8-A, may both optionally
 +implement PMUs.  For example, if a user wants to use a Cortex-A15 without
 +a PMU, then the `-cpu` parameter should contain `pmu=off` on the QEMU
 +command line, i.e. `-cpu cortex-a15,pmu=off`.
 +
 +As not all CPU types support all optional CPU features, then whether or
 +not a CPU property exists depends on the CPU type.  For example, CPUs
 +that implement the ARMv8-A architecture reference manual may optionally
 +support the AArch32 CPU feature, which may be enabled by disabling the
 +`aarch64` CPU property.  A CPU type such as the Cortex-A15, which does
 +not implement ARMv8-A, will not have the `aarch64` CPU property.
 +
 +QEMU's support may be limited for some CPU features, only partially
 +supporting the feature or only supporting the feature under certain
 +configurations.  For example, the `aarch64` CPU feature, which, when
 +disabled, enables the optional AArch32 CPU feature, is only supported
 +when using the KVM accelerator and when running on a host CPU type that
 +supports the feature.
 +
 +CPU Feature Probing
 +===================
 +
 +Determining which CPU features are available and functional for a given
 +CPU type is possible with the `query-cpu-model-expansion` QMP command.
 +Below are some examples where `scripts/qmp/qmp-shell` (see the top comment
 +block in the script for usage) is used to issue the QMP commands.
 +
 +(1) Determine which CPU features are available for the `max` CPU type
 +    (Note, we started QEMU with qemu-system-aarch64, so `max` is
 +     implementing the ARMv8-A reference manual in this case)::
 +
 +      (QEMU) query-cpu-model-expansion type=full model={"name":"max"}
 +      { "return": {
 +        "model": { "name": "max", "props": {
 +        "pmu": true, "aarch64": true
 +      }}}}
 +
 +We see that the `max` CPU type has the `pmu` and `aarch64` CPU features.
 +We also see that the CPU features are enabled, as they are all `true`.
 +
 +(2) Let's try to disable the PMU::
 +
 +      (QEMU) query-cpu-model-expansion type=full model={"name":"max","props":{"pmu":false}}
 +      { "return": {
 +        "model": { "name": "max", "props": {
 +        "pmu": false, "aarch64": true
 +      }}}}
 +
 +We see it worked, as `pmu` is now `false`.
 +
 +(3) Let's try to disable `aarch64`, which enables the AArch32 CPU feature::
 +
 +      (QEMU) query-cpu-model-expansion type=full model={"name":"max","props":{"aarch64":false}}
 +      {"error": {
 +       "class": "GenericError", "desc":
 +       "'aarch64' feature cannot be disabled unless KVM is enabled and 32-bit EL1 is supported"
 +      }}
 +
 +It looks like this feature is limited to a configuration we do not
 +currently have.
 +
 +(4) Let's try probing CPU features for the Cortex-A15 CPU type::
 +
 +      (QEMU) query-cpu-model-expansion type=full model={"name":"cortex-a15"}
 +      {"return": {"model": {"name": "cortex-a15", "props": {"pmu": true}}}}
 +
 +Only the `pmu` CPU feature is available.
 +
 +A note about CPU feature dependencies
 +-------------------------------------
 +
 +It's possible for features to have dependencies on other features. I.e.
 +it may be possible to change one feature at a time without error, but
 +when attempting to change all features at once an error could occur
 +depending on the order they are processed.  It's also possible changing
 +all at once doesn't generate an error, because a feature's dependencies
 +are satisfied with other features, but the same feature cannot be changed
 +independently without error.  For these reasons callers should always
 +attempt to make their desired changes all at once in order to ensure the
 +collection is valid.
 +
 +A note about CPU models and KVM
 +-------------------------------
 +
 +Named CPU models generally do not work with KVM.  There are a few cases
 +that do work, e.g. using the named CPU model `cortex-a57` with KVM on a
 +seattle host, but mostly if KVM is enabled the `host` CPU type must be
 +used.  This means the guest is provided all the same CPU features as the
 +host CPU type has.  And, for this reason, the `host` CPU type should
 +enable all CPU features that the host has by default.  Indeed it's even
 +a bit strange to allow disabling CPU features that the host has when using
 +the `host` CPU type, but in the absence of CPU models it's the best we can
 +do if we want to launch guests without all the host's CPU features enabled.
 +
 +Enabling KVM also affects the `query-cpu-model-expansion` QMP command.  The
 +affect is not only limited to specific features, as pointed out in example
 +(3) of "CPU Feature Probing", but also to which CPU types may be expanded.
 +When KVM is enabled, only the `max`, `host`, and current CPU type may be
 +expanded.  This restriction is necessary as it's not possible to know all
 +CPU types that may work with KVM, but it does impose a small risk of users
 +experiencing unexpected errors.  For example on a seattle, as mentioned
 +above, the `cortex-a57` CPU type is also valid when KVM is enabled.
 +Therefore a user could use the `host` CPU type for the current type, but
 +then attempt to query `cortex-a57`, however that query will fail with our
 +restrictions.  This shouldn't be an issue though as management layers and
 +users have been preferring the `host` CPU type for use with KVM for quite
 +some time.  Additionally, if the KVM-enabled QEMU instance running on a
 +seattle host is using the `cortex-a57` CPU type, then querying `cortex-a57`
 +will work.
 +
 +Using CPU Features
 +==================
 +
 +After determining which CPU features are available and supported for a
 +given CPU type, then they may be selectively enabled or disabled on the
 +QEMU command line with that CPU type::
 +
 +  $ qemu-system-aarch64 -M virt -cpu max,pmu=off
 +
 +The example above disables the PMU for the `max` CPU type.
 +
 --
 .20.1

-[Qemu-devel] [PULL 19/21] include/exec/cpu-defs.h: fix typo
+[PULL 02/11] tests: arm: Introduce cpu feature tests
-From: Alex Bennée <alex.bennee@linaro.org>
+From: Andrew Jones <drjones@redhat.com>
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Now that Arm CPUs have advertised features lets add tests to ensure
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+we maintain their expected availability with and without KVM.
-Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
+Signed-off-by: Andrew Jones <drjones@redhat.com>
-Message-id: 20190828165307.18321-10-alex.bennee@linaro.org
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
 Message-id: 20191031142734.8590-3-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/exec/cpu-defs.h | 2 +-
+ tests/Makefile.include   |   5 +-
-file changed, 1 insertion(+), 1 deletion(-)
+ tests/arm-cpu-features.c | 253 +++++++++++++++++++++++++++++++++++++++
+files changed, 257 insertions(+), 1 deletion(-)
-diff --git a/include/exec/cpu-defs.h b/include/exec/cpu-defs.h
+ create mode 100644 tests/arm-cpu-features.c
 diff --git a/tests/Makefile.include b/tests/Makefile.include
 index XXXXXXX..XXXXXXX 100644
---- a/include/exec/cpu-defs.h
+--- a/tests/Makefile.include
-+++ b/include/exec/cpu-defs.h
++++ b/tests/Makefile.include
-@@ -XXX,XX +XXX,XX @@ typedef struct CPUTLB { } CPUTLB;
+@@ -XXX,XX +XXX,XX @@ check-qtest-sparc64-$(CONFIG_ISA_TESTDEV) = tests/endianness-test$(EXESUF)
- #endif  /* !CONFIG_USER_ONLY && CONFIG_TCG */
+ check-qtest-sparc64-y += tests/prom-env-test$(EXESUF)
+ check-qtest-sparc64-y += tests/boot-serial-test$(EXESUF)
- /*
-- * This structure must be placed in ArchCPU immedately
++check-qtest-arm-y += tests/arm-cpu-features$(EXESUF)
-+ * This structure must be placed in ArchCPU immediately
+ check-qtest-arm-y += tests/microbit-test$(EXESUF)
-  * before CPUArchState, as a field named "neg".
+ check-qtest-arm-y += tests/m25p80-test$(EXESUF)
-  */
+ check-qtest-arm-y += tests/test-arm-mptimer$(EXESUF)
- typedef struct CPUNegativeOffsetState {
+@@ -XXX,XX +XXX,XX @@ check-qtest-arm-y += tests/boot-serial-test$(EXESUF)
  check-qtest-arm-y += tests/hexloader-test$(EXESUF)
  check-qtest-arm-$(CONFIG_PFLASH_CFI02) += tests/pflash-cfi02-test$(EXESUF)
 -check-qtest-aarch64-y = tests/numa-test$(EXESUF)
 +check-qtest-aarch64-y += tests/arm-cpu-features$(EXESUF)
 +check-qtest-aarch64-y += tests/numa-test$(EXESUF)
  check-qtest-aarch64-y += tests/boot-serial-test$(EXESUF)
  check-qtest-aarch64-y += tests/migration-test$(EXESUF)
  # TODO: once aarch64 TCG is fixed on ARM 32 bit host, make test unconditional
@@ -XXX,XX +XXX,XX @@ tests/test-qapi-util$(EXESUF): tests/test-qapi-util.o $(test-util-obj-y)
  tests/numa-test$(EXESUF): tests/numa-test.o
  tests/vmgenid-test$(EXESUF): tests/vmgenid-test.o tests/boot-sector.o tests/acpi-utils.o
  tests/cdrom-test$(EXESUF): tests/cdrom-test.o tests/boot-sector.o $(libqos-obj-y)
 +tests/arm-cpu-features$(EXESUF): tests/arm-cpu-features.o
  tests/migration/stress$(EXESUF): tests/migration/stress.o
      $(call quiet-command, $(LINKPROG) -static -O3 $(PTHREAD_LIB) -o $@ $< ,"LINK","$(TARGET_DIR)$@")
 diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 new file mode 100644
 index XXXXXXX..XXXXXXX
 --- /dev/null
 +++ b/tests/arm-cpu-features.c
@@ -XXX,XX +XXX,XX @@
 +/*
 + * Arm CPU feature test cases
 + *
 + * Copyright (c) 2019 Red Hat Inc.
 + * Authors:
 + *  Andrew Jones <drjones@redhat.com>
 + *
 + * This work is licensed under the terms of the GNU GPL, version 2 or later.
 + * See the COPYING file in the top-level directory.
 + */
 +#include "qemu/osdep.h"
 +#include "libqtest.h"
 +#include "qapi/qmp/qdict.h"
 +#include "qapi/qmp/qjson.h"
 +
 +#define MACHINE     "-machine virt,gic-version=max,accel=tcg "
 +#define MACHINE_KVM "-machine virt,gic-version=max,accel=kvm:tcg "
 +#define QUERY_HEAD  "{ 'execute': 'query-cpu-model-expansion', " \
 +                    "  'arguments': { 'type': 'full', "
 +#define QUERY_TAIL  "}}"
 +
 +static bool kvm_enabled(QTestState *qts)
 +{
 +    QDict *resp, *qdict;
 +    bool enabled;
 +
 +    resp = qtest_qmp(qts, "{ 'execute': 'query-kvm' }");
 +    g_assert(qdict_haskey(resp, "return"));
 +    qdict = qdict_get_qdict(resp, "return");
 +    g_assert(qdict_haskey(qdict, "enabled"));
 +    enabled = qdict_get_bool(qdict, "enabled");
 +    qobject_unref(resp);
 +
 +    return enabled;
 +}
 +
 +static QDict *do_query_no_props(QTestState *qts, const char *cpu_type)
 +{
 +    return qtest_qmp(qts, QUERY_HEAD "'model': { 'name': %s }"
 +                          QUERY_TAIL, cpu_type);
 +}
 +
 +static QDict *do_query(QTestState *qts, const char *cpu_type,
 +                       const char *fmt, ...)
 +{
 +    QDict *resp;
 +
 +    if (fmt) {
 +        QDict *args;
 +        va_list ap;
 +
 +        va_start(ap, fmt);
 +        args = qdict_from_vjsonf_nofail(fmt, ap);
 +        va_end(ap);
 +
 +        resp = qtest_qmp(qts, QUERY_HEAD "'model': { 'name': %s, "
 +                                                    "'props': %p }"
 +                              QUERY_TAIL, cpu_type, args);
 +    } else {
 +        resp = do_query_no_props(qts, cpu_type);
 +    }
 +
 +    return resp;
 +}
 +
 +static const char *resp_get_error(QDict *resp)
 +{
 +    QDict *qdict;
 +
 +    g_assert(resp);
 +
 +    qdict = qdict_get_qdict(resp, "error");
 +    if (qdict) {
 +        return qdict_get_str(qdict, "desc");
 +    }
 +
 +    return NULL;
 +}
 +
 +#define assert_error(qts, cpu_type, expected_error, fmt, ...)          \
 +({                                                                     \
 +    QDict *_resp;                                                      \
 +    const char *_error;                                                \
 +                                                                       \
 +    _resp = do_query(qts, cpu_type, fmt, ##__VA_ARGS__);               \
 +    g_assert(_resp);                                                   \
 +    _error = resp_get_error(_resp);                                    \
 +    g_assert(_error);                                                  \
 +    g_assert(g_str_equal(_error, expected_error));                     \
 +    qobject_unref(_resp);                                              \
 +})
 +
 +static bool resp_has_props(QDict *resp)
 +{
 +    QDict *qdict;
 +
 +    g_assert(resp);
 +
 +    if (!qdict_haskey(resp, "return")) {
 +        return false;
 +    }
 +    qdict = qdict_get_qdict(resp, "return");
 +
 +    if (!qdict_haskey(qdict, "model")) {
 +        return false;
 +    }
 +    qdict = qdict_get_qdict(qdict, "model");
 +
 +    return qdict_haskey(qdict, "props");
 +}
 +
 +static QDict *resp_get_props(QDict *resp)
 +{
 +    QDict *qdict;
 +
 +    g_assert(resp);
 +    g_assert(resp_has_props(resp));
 +
 +    qdict = qdict_get_qdict(resp, "return");
 +    qdict = qdict_get_qdict(qdict, "model");
 +    qdict = qdict_get_qdict(qdict, "props");
 +
 +    return qdict;
 +}
 +
 +#define assert_has_feature(qts, cpu_type, feature)                     \
 +({                                                                     \
 +    QDict *_resp = do_query_no_props(qts, cpu_type);                   \
 +    g_assert(_resp);                                                   \
 +    g_assert(resp_has_props(_resp));                                   \
 +    g_assert(qdict_get(resp_get_props(_resp), feature));               \
 +    qobject_unref(_resp);                                              \
 +})
 +
 +#define assert_has_not_feature(qts, cpu_type, feature)                 \
 +({                                                                     \
 +    QDict *_resp = do_query_no_props(qts, cpu_type);                   \
 +    g_assert(_resp);                                                   \
 +    g_assert(!resp_has_props(_resp) ||                                 \
 +             !qdict_get(resp_get_props(_resp), feature));              \
 +    qobject_unref(_resp);                                              \
 +})
 +
 +static void assert_type_full(QTestState *qts)
 +{
 +    const char *error;
 +    QDict *resp;
 +
 +    resp = qtest_qmp(qts, "{ 'execute': 'query-cpu-model-expansion', "
 +                            "'arguments': { 'type': 'static', "
 +                                           "'model': { 'name': 'foo' }}}");
 +    g_assert(resp);
 +    error = resp_get_error(resp);
 +    g_assert(error);
 +    g_assert(g_str_equal(error,
 +                         "The requested expansion type is not supported"));
 +    qobject_unref(resp);
 +}
 +
 +static void assert_bad_props(QTestState *qts, const char *cpu_type)
 +{
 +    const char *error;
 +    QDict *resp;
 +
 +    resp = qtest_qmp(qts, "{ 'execute': 'query-cpu-model-expansion', "
 +                            "'arguments': { 'type': 'full', "
 +                                           "'model': { 'name': %s, "
 +                                                      "'props': false }}}",
 +                     cpu_type);
 +    g_assert(resp);
 +    error = resp_get_error(resp);
 +    g_assert(error);
 +    g_assert(g_str_equal(error,
 +                         "Invalid parameter type for 'props', expected: dict"));
 +    qobject_unref(resp);
 +}
 +
 +static void test_query_cpu_model_expansion(const void *data)
 +{
 +    QTestState *qts;
 +
 +    qts = qtest_init(MACHINE "-cpu max");
 +
 +    /* Test common query-cpu-model-expansion input validation */
 +    assert_type_full(qts);
 +    assert_bad_props(qts, "max");
 +    assert_error(qts, "foo", "The CPU type 'foo' is not a recognized "
 +                 "ARM CPU type", NULL);
 +    assert_error(qts, "max", "Parameter 'not-a-prop' is unexpected",
 +                 "{ 'not-a-prop': false }");
 +    assert_error(qts, "host", "The CPU type 'host' requires KVM", NULL);
 +
 +    /* Test expected feature presence/absence for some cpu types */
 +    assert_has_feature(qts, "max", "pmu");
 +    assert_has_feature(qts, "cortex-a15", "pmu");
 +    assert_has_not_feature(qts, "cortex-a15", "aarch64");
 +
 +    if (g_str_equal(qtest_get_arch(), "aarch64")) {
 +        assert_has_feature(qts, "max", "aarch64");
 +        assert_has_feature(qts, "cortex-a57", "pmu");
 +        assert_has_feature(qts, "cortex-a57", "aarch64");
 +
 +        /* Test that features that depend on KVM generate errors without. */
 +        assert_error(qts, "max",
 +                     "'aarch64' feature cannot be disabled "
 +                     "unless KVM is enabled and 32-bit EL1 "
 +                     "is supported",
 +                     "{ 'aarch64': false }");
 +    }
 +
 +    qtest_quit(qts);
 +}
 +
 +static void test_query_cpu_model_expansion_kvm(const void *data)
 +{
 +    QTestState *qts;
 +
 +    qts = qtest_init(MACHINE_KVM "-cpu max");
 +
 +    /*
 +     * These tests target the 'host' CPU type, so KVM must be enabled.
 +     */
 +    if (!kvm_enabled(qts)) {
 +        qtest_quit(qts);
 +        return;
 +    }
 +
 +    if (g_str_equal(qtest_get_arch(), "aarch64")) {
 +        assert_has_feature(qts, "host", "aarch64");
 +        assert_has_feature(qts, "host", "pmu");
 +
 +        assert_error(qts, "cortex-a15",
 +            "We cannot guarantee the CPU type 'cortex-a15' works "
 +            "with KVM on this host", NULL);
 +    } else {
 +        assert_has_not_feature(qts, "host", "aarch64");
 +        assert_has_not_feature(qts, "host", "pmu");
 +    }
 +
 +    qtest_quit(qts);
 +}
 +
 +int main(int argc, char **argv)
 +{
 +    g_test_init(&argc, &argv, NULL);
 +
 +    qtest_add_data_func("/arm/query-cpu-model-expansion",
 +                        NULL, test_query_cpu_model_expansion);
 +    qtest_add_data_func("/arm/kvm/query-cpu-model-expansion",
 +                        NULL, test_query_cpu_model_expansion_kvm);
 +
 +    return g_test_run();
 +}
 --
 .20.1

-[Qemu-devel] [PULL 17/21] tcg/README: fix typo s/afterwise/afterwards/
+[PULL 03/11] target/arm: Allow SVE to be disabled via a CPU property
-From: "Emilio G. Cota" <cota@braap.org>
+From: Andrew Jones <drjones@redhat.com>
-Afterwise is "wise after the fact", as in "hindsight".
+Since 97a28b0eeac14 ("target/arm: Allow VFP and Neon to be disabled via
-Here we meant "afterwards" (as in "subsequently"). Fix it.
+a CPU property") we can disable the 'max' cpu model's VFP and neon
 features, but there's no way to disable SVE. Add the 'sve=on|off'
 property to give it that flexibility. We also rename
 cpu_max_get/set_sve_vq to cpu_max_get/set_sve_max_vq in order for them
 to follow the typical *_get/set_<property-name> pattern.
-Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+Signed-off-by: Andrew Jones <drjones@redhat.com>
 Signed-off-by: Emilio G. Cota <cota@braap.org>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
+Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
-Message-id: 20190828165307.18321-7-alex.bennee@linaro.org
+Reviewed-by: Beata Michalska <beata.michalska@linaro.org>
 Message-id: 20191031142734.8590-4-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- tcg/README | 2 +-
+ target/arm/cpu.c         |  3 ++-
-file changed, 1 insertion(+), 1 deletion(-)
+ target/arm/cpu64.c       | 52 ++++++++++++++++++++++++++++++++++------
  target/arm/monitor.c     |  2 +-
  tests/arm-cpu-features.c |  1 +
 files changed, 49 insertions(+), 9 deletions(-)
-diff --git a/tcg/README b/tcg/README
+diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
---- a/tcg/README
+--- a/target/arm/cpu.c
-+++ b/tcg/README
++++ b/target/arm/cpu.c
-@@ -XXX,XX +XXX,XX @@ This can be overridden using the following function modifiers:
+@@ -XXX,XX +XXX,XX @@ static void arm_cpu_reset(CPUState *s)
-   canonical locations before calling the helper.
+         env->cp15.cpacr_el1 = deposit64(env->cp15.cpacr_el1, 16, 2, 3);
- - TCG_CALL_NO_WRITE_GLOBALS means that the helper does not modify any globals.
+         env->cp15.cptr_el[3] |= CPTR_EZ;
-   They will only be saved to their canonical location before calling helpers,
+         /* with maximum vector length */
--  but they won't be reloaded afterwise.
+-        env->vfp.zcr_el[1] = cpu->sve_max_vq - 1;
-+  but they won't be reloaded afterwards.
++        env->vfp.zcr_el[1] = cpu_isar_feature(aa64_sve, cpu) ?
- - TCG_CALL_NO_SIDE_EFFECTS means that the call to the function is removed if
++                             cpu->sve_max_vq - 1 : 0;
-   the return value is not used.
+         env->vfp.zcr_el[2] = env->vfp.zcr_el[1];
          env->vfp.zcr_el[3] = env->vfp.zcr_el[1];
          /*
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_a72_initfn(Object *obj)
      define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
  }
 -static void cpu_max_get_sve_vq(Object *obj, Visitor *v, const char *name,
 -                               void *opaque, Error **errp)
 +static void cpu_max_get_sve_max_vq(Object *obj, Visitor *v, const char *name,
 +                                   void *opaque, Error **errp)
  {
      ARMCPU *cpu = ARM_CPU(obj);
 -    visit_type_uint32(v, name, &cpu->sve_max_vq, errp);
 +    uint32_t value;
 +
 +    /* All vector lengths are disabled when SVE is off. */
 +    if (!cpu_isar_feature(aa64_sve, cpu)) {
 +        value = 0;
 +    } else {
 +        value = cpu->sve_max_vq;
 +    }
 +    visit_type_uint32(v, name, &value, errp);
  }
 -static void cpu_max_set_sve_vq(Object *obj, Visitor *v, const char *name,
 -                               void *opaque, Error **errp)
 +static void cpu_max_set_sve_max_vq(Object *obj, Visitor *v, const char *name,
 +                                   void *opaque, Error **errp)
  {
      ARMCPU *cpu = ARM_CPU(obj);
      Error *err = NULL;
@@ -XXX,XX +XXX,XX @@ static void cpu_max_set_sve_vq(Object *obj, Visitor *v, const char *name,
      error_propagate(errp, err);
  }
 +static void cpu_arm_get_sve(Object *obj, Visitor *v, const char *name,
 +                            void *opaque, Error **errp)
 +{
 +    ARMCPU *cpu = ARM_CPU(obj);
 +    bool value = cpu_isar_feature(aa64_sve, cpu);
 +
 +    visit_type_bool(v, name, &value, errp);
 +}
 +
 +static void cpu_arm_set_sve(Object *obj, Visitor *v, const char *name,
 +                            void *opaque, Error **errp)
 +{
 +    ARMCPU *cpu = ARM_CPU(obj);
 +    Error *err = NULL;
 +    bool value;
 +    uint64_t t;
 +
 +    visit_type_bool(v, name, &value, &err);
 +    if (err) {
 +        error_propagate(errp, err);
 +        return;
 +    }
 +
 +    t = cpu->isar.id_aa64pfr0;
 +    t = FIELD_DP64(t, ID_AA64PFR0, SVE, value);
 +    cpu->isar.id_aa64pfr0 = t;
 +}
 +
  /* -cpu max: if KVM is enabled, like -cpu host (best possible with this host);
   * otherwise, a CPU with as many features enabled as our emulation supports.
   * The version of '-cpu max' for qemu-system-arm is defined in cpu.c;
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
  #endif
          cpu->sve_max_vq = ARM_MAX_VQ;
 -        object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_vq,
 -                            cpu_max_set_sve_vq, NULL, NULL, &error_fatal);
 +        object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
 +                            cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
 +        object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
 +                            cpu_arm_set_sve, NULL, NULL, &error_fatal);
      }
  }
 diff --git a/target/arm/monitor.c b/target/arm/monitor.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/monitor.c
 +++ b/target/arm/monitor.c
@@ -XXX,XX +XXX,XX @@ GICCapabilityList *qmp_query_gic_capabilities(Error **errp)
   * then the order that considers those dependencies must be used.
   */
  static const char *cpu_model_advertised_features[] = {
 -    "aarch64", "pmu",
 +    "aarch64", "pmu", "sve",
      NULL
  };
 diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 index XXXXXXX..XXXXXXX 100644
 --- a/tests/arm-cpu-features.c
 +++ b/tests/arm-cpu-features.c
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion(const void *data)
      if (g_str_equal(qtest_get_arch(), "aarch64")) {
          assert_has_feature(qts, "max", "aarch64");
 +        assert_has_feature(qts, "max", "sve");
          assert_has_feature(qts, "cortex-a57", "pmu");
          assert_has_feature(qts, "cortex-a57", "aarch64");
 --
 .20.1

-[Qemu-devel] [PULL 03/21] target/arm: Allow ARMCPRegInfo read/write functions to throw exceptions
+[PULL 04/11] target/arm/cpu64: max cpu: Introduce sve<N> properties
-Currently the only part of an ARMCPRegInfo which is allowed to cause
+From: Andrew Jones <drjones@redhat.com>
 a CPU exception is the access function, which returns a value indicating
 that some flavour of UNDEF should be generated.
-For the ATS system instructions, we would like to conditionally
+Introduce cpu properties to give fine control over SVE vector lengths.
-generate exceptions as part of the writefn, because some faults
+We introduce a property for each valid length up to the current
-during the page table walk (like external aborts) should cause
+maximum supported, which is 2048-bits. The properties are named, e.g.
-an exception to be raised rather than returning a value.
+sve128, sve256, sve384, sve512, ..., where the number is the number of
 bits. See the updates to docs/arm-cpu-features.rst for a description
 of the semantics and for example uses.
-There are several ways we could do this:
+Note, as sve-max-vq is still present and we'd like to be able to
- * plumb the GETPC() value from the top level set_cp_reg/get_cp_reg
+support qmp_query_cpu_model_expansion with guests launched with e.g.
-   helper functions through into the readfn and writefn hooks
+-cpu max,sve-max-vq=8 on their command lines, then we do allow
- * add extra readfn_with_ra/writefn_with_ra hooks that take the GETPC()
+sve-max-vq and sve<N> properties to be provided at the same time, but
-   value
+this is not recommended, and is why sve-max-vq is not mentioned in the
- * require the ATS instructions to provide a dummy accessfn,
+document.  If sve-max-vq is provided then it enables all lengths smaller
-   which serves no purpose except to cause the code generation
+than and including the max and disables all lengths larger. It also has
-   to emit TCG ops to sync the CPU state
+the side-effect that no larger lengths may be enabled and that the max
- * add an ARM_CP_ flag to mark the ARMCPRegInfo as possibly
+itself cannot be disabled. Smaller non-power-of-two lengths may,
-   throwing an exception in its read/write hooks, and make the
+however, be disabled, e.g. -cpu max,sve-max-vq=4,sve384=off provides a
-   codegen sync the CPU state before calling the hooks if the
+guest the vector lengths 128, 256, and 512 bits.
    flag is set
-This patch opts for the last of these, as it is fairly simple
+This patch has been co-authored with Richard Henderson, who reworked
-to implement and doesn't require invasive changes like updating
+the target/arm/cpu64.c changes in order to push all the validation and
-the readfn/writefn hook function prototype signature.
+auto-enabling/disabling steps into the finalizer, resulting in a nice
 LOC reduction.
+Signed-off-by: Andrew Jones <drjones@redhat.com>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
+Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
+Reviewed-by: Beata Michalska <beata.michalska@linaro.org>
+Message-id: 20191031142734.8590-5-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Tested-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
-Message-id: 20190816125802.25877-2-peter.maydell@linaro.org
 ---
- target/arm/cpu.h           | 6 +++++-
+ include/qemu/bitops.h     |   1 +
- target/arm/translate-a64.c | 6 ++++++
+ target/arm/cpu.h          |  19 ++++
- target/arm/translate.c     | 7 +++++++
+ target/arm/cpu.c          |  19 ++++
-files changed, 18 insertions(+), 1 deletion(-)
+ target/arm/cpu64.c        | 192 ++++++++++++++++++++++++++++++++++++-
  target/arm/helper.c       |  10 +-
  target/arm/monitor.c      |  12 +++
  tests/arm-cpu-features.c  | 194 ++++++++++++++++++++++++++++++++++++++
  docs/arm-cpu-features.rst | 168 +++++++++++++++++++++++++++++++--
 files changed, 606 insertions(+), 9 deletions(-)
+diff --git a/include/qemu/bitops.h b/include/qemu/bitops.h
+index XXXXXXX..XXXXXXX 100644
+--- a/include/qemu/bitops.h
++++ b/include/qemu/bitops.h
+@@ -XXX,XX +XXX,XX @@
+ #define BITS_PER_LONG           (sizeof (unsigned long) * BITS_PER_BYTE)
+ #define BIT(nr)                 (1UL << (nr))
++#define BIT_ULL(nr)             (1ULL << (nr))
+ #define BIT_MASK(nr)            (1UL << ((nr) % BITS_PER_LONG))
+ #define BIT_WORD(nr)            ((nr) / BITS_PER_LONG)
+ #define BITS_TO_LONGS(nr)       DIV_ROUND_UP(nr, BITS_PER_BYTE * sizeof(long))
 diff --git a/target/arm/cpu.h b/target/arm/cpu.h
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.h
 +++ b/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@ static inline uint64_t cpreg_to_kvm_id(uint32_t cpregid)
+@@ -XXX,XX +XXX,XX @@ typedef struct {
-  * IO indicates that this register does I/O and therefore its accesses
-  * need to be surrounded by gen_io_start()/gen_io_end(). In particular,
+ #ifdef TARGET_AARCH64
-  * registers which implement clocks or timers require this.
+ # define ARM_MAX_VQ    16
-+ * RAISES_EXC is for when the read or write hook might raise an exception;
++void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp);
-+ * the generated code will synchronize the CPU state before calling the hook
++uint32_t arm_cpu_vq_map_next_smaller(ARMCPU *cpu, uint32_t vq);
-+ * so that it is safe for the hook to call raise_exception().
+ #else
  # define ARM_MAX_VQ    1
 +static inline void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp) { }
 +static inline uint32_t arm_cpu_vq_map_next_smaller(ARMCPU *cpu, uint32_t vq)
 +{ return 0; }
  #endif
  typedef struct ARMVectorReg {
@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
      /* Used to set the maximum vector length the cpu will support.  */
      uint32_t sve_max_vq;
 +
 +    /*
 +     * In sve_vq_map each set bit is a supported vector length of
 +     * (bit-number + 1) * 16 bytes, i.e. each bit number + 1 is the vector
 +     * length in quadwords.
 +     *
 +     * While processing properties during initialization, corresponding
 +     * sve_vq_init bits are set for bits in sve_vq_map that have been
 +     * set by properties.
 +     */
 +    DECLARE_BITMAP(sve_vq_map, ARM_MAX_VQ);
 +    DECLARE_BITMAP(sve_vq_init, ARM_MAX_VQ);
  };
  void arm_cpu_post_init(Object *obj);
@@ -XXX,XX +XXX,XX @@ static inline int arm_feature(CPUARMState *env, int feature)
      return (env->features & (1ULL << feature)) != 0;
  }
 +void arm_cpu_finalize_features(ARMCPU *cpu, Error **errp);
 +
  #if !defined(CONFIG_USER_ONLY)
  /* Return true if exception levels below EL3 are in secure state,
   * or would be following an exception return to that level.
 diff --git a/target/arm/cpu.c b/target/arm/cpu.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_finalizefn(Object *obj)
  #endif
  }
 +void arm_cpu_finalize_features(ARMCPU *cpu, Error **errp)
 +{
 +    Error *local_err = NULL;
 +
 +    if (arm_feature(&cpu->env, ARM_FEATURE_AARCH64)) {
 +        arm_cpu_sve_finalize(cpu, &local_err);
 +        if (local_err != NULL) {
 +            error_propagate(errp, local_err);
 +            return;
 +        }
 +    }
 +}
 +
  static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
  {
      CPUState *cs = CPU(dev);
@@ -XXX,XX +XXX,XX @@ static void arm_cpu_realizefn(DeviceState *dev, Error **errp)
          return;
      }
 +    arm_cpu_finalize_features(cpu, &local_err);
 +    if (local_err != NULL) {
 +        error_propagate(errp, local_err);
 +        return;
 +    }
 +
      if (arm_feature(env, ARM_FEATURE_AARCH64) &&
          cpu->has_vfp != cpu->has_neon) {
          /*
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void aarch64_a72_initfn(Object *obj)
      define_arm_cp_regs(cpu, cortex_a72_a57_a53_cp_reginfo);
  }
 +void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
 +{
 +    /*
 +     * If any vector lengths are explicitly enabled with sve<N> properties,
 +     * then all other lengths are implicitly disabled.  If sve-max-vq is
 +     * specified then it is the same as explicitly enabling all lengths
 +     * up to and including the specified maximum, which means all larger
 +     * lengths will be implicitly disabled.  If no sve<N> properties
 +     * are enabled and sve-max-vq is not specified, then all lengths not
 +     * explicitly disabled will be enabled.  Additionally, all power-of-two
 +     * vector lengths less than the maximum enabled length will be
 +     * automatically enabled and all vector lengths larger than the largest
 +     * disabled power-of-two vector length will be automatically disabled.
 +     * Errors are generated if the user provided input that interferes with
 +     * any of the above.  Finally, if SVE is not disabled, then at least one
 +     * vector length must be enabled.
 +     */
 +    DECLARE_BITMAP(tmp, ARM_MAX_VQ);
 +    uint32_t vq, max_vq = 0;
 +
 +    /*
 +     * Process explicit sve<N> properties.
 +     * From the properties, sve_vq_map<N> implies sve_vq_init<N>.
 +     * Check first for any sve<N> enabled.
 +     */
 +    if (!bitmap_empty(cpu->sve_vq_map, ARM_MAX_VQ)) {
 +        max_vq = find_last_bit(cpu->sve_vq_map, ARM_MAX_VQ) + 1;
 +
 +        if (cpu->sve_max_vq && max_vq > cpu->sve_max_vq) {
 +            error_setg(errp, "cannot enable sve%d", max_vq * 128);
 +            error_append_hint(errp, "sve%d is larger than the maximum vector "
 +                              "length, sve-max-vq=%d (%d bits)\n",
 +                              max_vq * 128, cpu->sve_max_vq,
 +                              cpu->sve_max_vq * 128);
 +            return;
 +        }
 +
 +        /* Propagate enabled bits down through required powers-of-two. */
 +        for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 +            if (!test_bit(vq - 1, cpu->sve_vq_init)) {
 +                set_bit(vq - 1, cpu->sve_vq_map);
 +            }
 +        }
 +    } else if (cpu->sve_max_vq == 0) {
 +        /*
 +         * No explicit bits enabled, and no implicit bits from sve-max-vq.
 +         */
 +        if (!cpu_isar_feature(aa64_sve, cpu)) {
 +            /* SVE is disabled and so are all vector lengths.  Good. */
 +            return;
 +        }
 +
 +        /* Disabling a power-of-two disables all larger lengths. */
 +        if (test_bit(0, cpu->sve_vq_init)) {
 +            error_setg(errp, "cannot disable sve128");
 +            error_append_hint(errp, "Disabling sve128 results in all vector "
 +                              "lengths being disabled.\n");
 +            error_append_hint(errp, "With SVE enabled, at least one vector "
 +                              "length must be enabled.\n");
 +            return;
 +        }
 +        for (vq = 2; vq <= ARM_MAX_VQ; vq <<= 1) {
 +            if (test_bit(vq - 1, cpu->sve_vq_init)) {
 +                break;
 +            }
 +        }
 +        max_vq = vq <= ARM_MAX_VQ ? vq - 1 : ARM_MAX_VQ;
 +
 +        bitmap_complement(cpu->sve_vq_map, cpu->sve_vq_init, max_vq);
 +        max_vq = find_last_bit(cpu->sve_vq_map, max_vq) + 1;
 +    }
 +
 +    /*
 +     * Process the sve-max-vq property.
 +     * Note that we know from the above that no bit above
 +     * sve-max-vq is currently set.
 +     */
 +    if (cpu->sve_max_vq != 0) {
 +        max_vq = cpu->sve_max_vq;
 +
 +        if (!test_bit(max_vq - 1, cpu->sve_vq_map) &&
 +            test_bit(max_vq - 1, cpu->sve_vq_init)) {
 +            error_setg(errp, "cannot disable sve%d", max_vq * 128);
 +            error_append_hint(errp, "The maximum vector length must be "
 +                              "enabled, sve-max-vq=%d (%d bits)\n",
 +                              max_vq, max_vq * 128);
 +            return;
 +        }
 +
 +        /* Set all bits not explicitly set within sve-max-vq. */
 +        bitmap_complement(tmp, cpu->sve_vq_init, max_vq);
 +        bitmap_or(cpu->sve_vq_map, cpu->sve_vq_map, tmp, max_vq);
 +    }
 +
 +    /*
 +     * We should know what max-vq is now.  Also, as we're done
 +     * manipulating sve-vq-map, we ensure any bits above max-vq
 +     * are clear, just in case anybody looks.
 +     */
 +    assert(max_vq != 0);
 +    bitmap_clear(cpu->sve_vq_map, max_vq, ARM_MAX_VQ - max_vq);
 +
 +    /* Ensure all required powers-of-two are enabled. */
 +    for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 +        if (!test_bit(vq - 1, cpu->sve_vq_map)) {
 +            error_setg(errp, "cannot disable sve%d", vq * 128);
 +            error_append_hint(errp, "sve%d is required as it "
 +                              "is a power-of-two length smaller than "
 +                              "the maximum, sve%d\n",
 +                              vq * 128, max_vq * 128);
 +            return;
 +        }
 +    }
 +
 +    /*
 +     * Now that we validated all our vector lengths, the only question
 +     * left to answer is if we even want SVE at all.
 +     */
 +    if (!cpu_isar_feature(aa64_sve, cpu)) {
 +        error_setg(errp, "cannot enable sve%d", max_vq * 128);
 +        error_append_hint(errp, "SVE must be enabled to enable vector "
 +                          "lengths.\n");
 +        error_append_hint(errp, "Add sve=on to the CPU property list.\n");
 +        return;
 +    }
 +
 +    /* From now on sve_max_vq is the actual maximum supported length. */
 +    cpu->sve_max_vq = max_vq;
 +}
 +
 +uint32_t arm_cpu_vq_map_next_smaller(ARMCPU *cpu, uint32_t vq)
 +{
 +    uint32_t bitnum;
 +
 +    /*
 +     * We allow vq == ARM_MAX_VQ + 1 to be input because the caller may want
 +     * to find the maximum vq enabled, which may be ARM_MAX_VQ, but this
 +     * function always returns the next smaller than the input.
 +     */
 +    assert(vq && vq <= ARM_MAX_VQ + 1);
 +
 +    bitnum = find_last_bit(cpu->sve_vq_map, vq - 1);
 +    return bitnum == vq - 1 ? 0 : bitnum + 1;
 +}
 +
  static void cpu_max_get_sve_max_vq(Object *obj, Visitor *v, const char *name,
                                     void *opaque, Error **errp)
  {
@@ -XXX,XX +XXX,XX @@ static void cpu_max_set_sve_max_vq(Object *obj, Visitor *v, const char *name,
      error_propagate(errp, err);
  }
 +static void cpu_arm_get_sve_vq(Object *obj, Visitor *v, const char *name,
 +                               void *opaque, Error **errp)
 +{
 +    ARMCPU *cpu = ARM_CPU(obj);
 +    uint32_t vq = atoi(&name[3]) / 128;
 +    bool value;
 +
 +    /* All vector lengths are disabled when SVE is off. */
 +    if (!cpu_isar_feature(aa64_sve, cpu)) {
 +        value = false;
 +    } else {
 +        value = test_bit(vq - 1, cpu->sve_vq_map);
 +    }
 +    visit_type_bool(v, name, &value, errp);
 +}
 +
 +static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
 +                               void *opaque, Error **errp)
 +{
 +    ARMCPU *cpu = ARM_CPU(obj);
 +    uint32_t vq = atoi(&name[3]) / 128;
 +    Error *err = NULL;
 +    bool value;
 +
 +    visit_type_bool(v, name, &value, &err);
 +    if (err) {
 +        error_propagate(errp, err);
 +        return;
 +    }
 +
 +    if (value) {
 +        set_bit(vq - 1, cpu->sve_vq_map);
 +    } else {
 +        clear_bit(vq - 1, cpu->sve_vq_map);
 +    }
 +    set_bit(vq - 1, cpu->sve_vq_init);
 +}
 +
  static void cpu_arm_get_sve(Object *obj, Visitor *v, const char *name,
                              void *opaque, Error **errp)
  {
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, Visitor *v, const char *name,
  static void aarch64_max_initfn(Object *obj)
  {
      ARMCPU *cpu = ARM_CPU(obj);
 +    uint32_t vq;
      if (kvm_enabled()) {
          kvm_arm_set_cpu_features_from_host(cpu);
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
          cpu->dcz_blocksize = 7; /*  512 bytes */
  #endif
 -        cpu->sve_max_vq = ARM_MAX_VQ;
          object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
                              cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
          object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
                              cpu_arm_set_sve, NULL, NULL, &error_fatal);
 +
 +        for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
 +            char name[8];
 +            sprintf(name, "sve%d", vq * 128);
 +            object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
 +                                cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
 +        }
      }
  }
 diff --git a/target/arm/helper.c b/target/arm/helper.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/helper.c
 +++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ int sve_exception_el(CPUARMState *env, int el)
      return 0;
  }
 +static uint32_t sve_zcr_get_valid_len(ARMCPU *cpu, uint32_t start_len)
 +{
 +    uint32_t start_vq = (start_len & 0xf) + 1;
 +
 +    return arm_cpu_vq_map_next_smaller(cpu, start_vq + 1) - 1;
 +}
 +
  /*
   * Given that SVE is enabled, return the vector length for EL.
   */
- #define ARM_CP_SPECIAL           0x0001
+@@ -XXX,XX +XXX,XX @@ uint32_t sve_zcr_len_for_el(CPUARMState *env, int el)
- #define ARM_CP_CONST             0x0002
+     if (arm_feature(env, ARM_FEATURE_EL3)) {
-@@ -XXX,XX +XXX,XX @@ static inline uint64_t cpreg_to_kvm_id(uint32_t cpregid)
+         zcr_len = MIN(zcr_len, 0xf & (uint32_t)env->vfp.zcr_el[3]);
- #define ARM_CP_FPU               0x1000
+     }
- #define ARM_CP_SVE               0x2000
+-    return zcr_len;
- #define ARM_CP_NO_GDB            0x4000
++
-+#define ARM_CP_RAISES_EXC        0x8000
++    return sve_zcr_get_valid_len(cpu, zcr_len);
- /* Used only as a terminator for ARMCPRegInfo lists */
+ }
- #define ARM_CP_SENTINEL          0xffff
- /* Mask of only the flag bits in a type field */
+ static void zcr_write(CPUARMState *env, const ARMCPRegInfo *ri,
--#define ARM_CP_FLAG_MASK         0x70ff
+diff --git a/target/arm/monitor.c b/target/arm/monitor.c
 +#define ARM_CP_FLAG_MASK         0xf0ff
  /* Valid values for ARMCPRegInfo state field, indicating which of
   * the AArch32 and AArch64 execution states this register is visible in.
 diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate-a64.c
+--- a/target/arm/monitor.c
-+++ b/target/arm/translate-a64.c
++++ b/target/arm/monitor.c
-@@ -XXX,XX +XXX,XX @@ static void handle_sys(DisasContext *s, uint32_t insn, bool isread,
+@@ -XXX,XX +XXX,XX @@ GICCapabilityList *qmp_query_gic_capabilities(Error **errp)
-         tcg_temp_free_ptr(tmpptr);
+     return head;
-         tcg_temp_free_i32(tcg_syn);
+ }
-         tcg_temp_free_i32(tcg_isread);
-+    } else if (ri->type & ARM_CP_RAISES_EXC) {
++QEMU_BUILD_BUG_ON(ARM_MAX_VQ > 16);
-+        /*
++
-+         * The readfn or writefn might raise an exception;
+ /*
-+         * synchronize the CPU state in case it does.
+  * These are cpu model features we want to advertise. The order here
-+         */
+  * matters as this is the order in which qmp_query_cpu_model_expansion
-+        gen_a64_set_pc_im(s->pc_curr);
+@@ -XXX,XX +XXX,XX @@ GICCapabilityList *qmp_query_gic_capabilities(Error **errp)
   */
  static const char *cpu_model_advertised_features[] = {
      "aarch64", "pmu", "sve",
 +    "sve128", "sve256", "sve384", "sve512",
 +    "sve640", "sve768", "sve896", "sve1024", "sve1152", "sve1280",
 +    "sve1408", "sve1536", "sve1664", "sve1792", "sve1920", "sve2048",
      NULL
  };
@@ -XXX,XX +XXX,XX @@ CpuModelExpansionInfo *qmp_query_cpu_model_expansion(CpuModelExpansionType type,
          if (!err) {
              visit_check_struct(visitor, &err);
          }
 +        if (!err) {
 +            arm_cpu_finalize_features(ARM_CPU(obj), &err);
 +        }
          visit_end_struct(visitor, NULL);
          visit_free(visitor);
          if (err) {
@@ -XXX,XX +XXX,XX @@ CpuModelExpansionInfo *qmp_query_cpu_model_expansion(CpuModelExpansionType type,
              error_propagate(errp, err);
              return NULL;
          }
 +    } else {
 +        Error *err = NULL;
 +        arm_cpu_finalize_features(ARM_CPU(obj), &err);
 +        assert(err == NULL);
      }
-     /* Handle special cases first */
+     expansion_info = g_new0(CpuModelExpansionInfo, 1);
-diff --git a/target/arm/translate.c b/target/arm/translate.c
+diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.c
+--- a/tests/arm-cpu-features.c
-+++ b/target/arm/translate.c
++++ b/tests/arm-cpu-features.c
-@@ -XXX,XX +XXX,XX @@ static int disas_coproc_insn(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@
-             tcg_temp_free_ptr(tmpptr);
+  * See the COPYING file in the top-level directory.
-             tcg_temp_free_i32(tcg_syn);
+  */
-             tcg_temp_free_i32(tcg_isread);
+ #include "qemu/osdep.h"
-+        } else if (ri->type & ARM_CP_RAISES_EXC) {
++#include "qemu/bitops.h"
-+            /*
+ #include "libqtest.h"
-+             * The readfn or writefn might raise an exception;
+ #include "qapi/qmp/qdict.h"
-+             * synchronize the CPU state in case it does.
+ #include "qapi/qmp/qjson.h"
-+             */
-+            gen_set_condexec(s);
++/*
-+            gen_set_pc_im(s, s->pc_curr);
++ * We expect the SVE max-vq to be 16. Also it must be <= 64
-         }
++ * for our test code, otherwise 'vls' can't just be a uint64_t.
++ */
-         /* Handle special cases first */
++#define SVE_MAX_VQ 16
 +
  #define MACHINE     "-machine virt,gic-version=max,accel=tcg "
  #define MACHINE_KVM "-machine virt,gic-version=max,accel=kvm:tcg "
  #define QUERY_HEAD  "{ 'execute': 'query-cpu-model-expansion', " \
@@ -XXX,XX +XXX,XX @@ static void assert_bad_props(QTestState *qts, const char *cpu_type)
      qobject_unref(resp);
  }
 +static uint64_t resp_get_sve_vls(QDict *resp)
 +{
 +    QDict *props;
 +    const QDictEntry *e;
 +    uint64_t vls = 0;
 +    int n = 0;
 +
 +    g_assert(resp);
 +    g_assert(resp_has_props(resp));
 +
 +    props = resp_get_props(resp);
 +
 +    for (e = qdict_first(props); e; e = qdict_next(props, e)) {
 +        if (strlen(e->key) > 3 && !strncmp(e->key, "sve", 3) &&
 +            g_ascii_isdigit(e->key[3])) {
 +            char *endptr;
 +            int bits;
 +
 +            bits = g_ascii_strtoll(&e->key[3], &endptr, 10);
 +            if (!bits || *endptr != '\0') {
 +                continue;
 +            }
 +
 +            if (qdict_get_bool(props, e->key)) {
 +                vls |= BIT_ULL((bits / 128) - 1);
 +            }
 +            ++n;
 +        }
 +    }
 +
 +    g_assert(n == SVE_MAX_VQ);
 +
 +    return vls;
 +}
 +
 +#define assert_sve_vls(qts, cpu_type, expected_vls, fmt, ...)          \
 +({                                                                     \
 +    QDict *_resp = do_query(qts, cpu_type, fmt, ##__VA_ARGS__);        \
 +    g_assert(_resp);                                                   \
 +    g_assert(resp_has_props(_resp));                                   \
 +    g_assert(resp_get_sve_vls(_resp) == expected_vls);                 \
 +    qobject_unref(_resp);                                              \
 +})
 +
 +static void sve_tests_default(QTestState *qts, const char *cpu_type)
 +{
 +    /*
 +     * With no sve-max-vq or sve<N> properties on the command line
 +     * the default is to have all vector lengths enabled. This also
 +     * tests that 'sve' is 'on' by default.
 +     */
 +    assert_sve_vls(qts, cpu_type, BIT_ULL(SVE_MAX_VQ) - 1, NULL);
 +
 +    /* With SVE off, all vector lengths should also be off. */
 +    assert_sve_vls(qts, cpu_type, 0, "{ 'sve': false }");
 +
 +    /* With SVE on, we must have at least one vector length enabled. */
 +    assert_error(qts, cpu_type, "cannot disable sve128", "{ 'sve128': false }");
 +
 +    /* Basic enable/disable tests. */
 +    assert_sve_vls(qts, cpu_type, 0x7, "{ 'sve384': true }");
 +    assert_sve_vls(qts, cpu_type, ((BIT_ULL(SVE_MAX_VQ) - 1) & ~BIT_ULL(2)),
 +                   "{ 'sve384': false }");
 +
 +    /*
 +     * ---------------------------------------------------------------------
 +     *               power-of-two(vq)   all-power-            can      can
 +     *                                  of-two(< vq)        enable   disable
 +     * ---------------------------------------------------------------------
 +     * vq < max_vq      no                MUST*              yes      yes
 +     * vq < max_vq      yes               MUST*              yes      no
 +     * ---------------------------------------------------------------------
 +     * vq == max_vq     n/a               MUST*              yes**    yes**
 +     * ---------------------------------------------------------------------
 +     * vq > max_vq      n/a               no                 no       yes
 +     * vq > max_vq      n/a               yes                yes      yes
 +     * ---------------------------------------------------------------------
 +     *
 +     * [*] "MUST" means this requirement must already be satisfied,
 +     *     otherwise 'max_vq' couldn't itself be enabled.
 +     *
 +     * [**] Not testable with the QMP interface, only with the command line.
 +     */
 +
 +    /* max_vq := 8 */
 +    assert_sve_vls(qts, cpu_type, 0x8b, "{ 'sve1024': true }");
 +
 +    /* max_vq := 8, vq < max_vq, !power-of-two(vq) */
 +    assert_sve_vls(qts, cpu_type, 0x8f,
 +                   "{ 'sve1024': true, 'sve384': true }");
 +    assert_sve_vls(qts, cpu_type, 0x8b,
 +                   "{ 'sve1024': true, 'sve384': false }");
 +
 +    /* max_vq := 8, vq < max_vq, power-of-two(vq) */
 +    assert_sve_vls(qts, cpu_type, 0x8b,
 +                   "{ 'sve1024': true, 'sve256': true }");
 +    assert_error(qts, cpu_type, "cannot disable sve256",
 +                 "{ 'sve1024': true, 'sve256': false }");
 +
 +    /* max_vq := 3, vq > max_vq, !all-power-of-two(< vq) */
 +    assert_error(qts, cpu_type, "cannot disable sve512",
 +                 "{ 'sve384': true, 'sve512': false, 'sve640': true }");
 +
 +    /*
 +     * We can disable power-of-two vector lengths when all larger lengths
 +     * are also disabled. We only need to disable the power-of-two length,
 +     * as all non-enabled larger lengths will then be auto-disabled.
 +     */
 +    assert_sve_vls(qts, cpu_type, 0x7, "{ 'sve512': false }");
 +
 +    /* max_vq := 3, vq > max_vq, all-power-of-two(< vq) */
 +    assert_sve_vls(qts, cpu_type, 0x1f,
 +                   "{ 'sve384': true, 'sve512': true, 'sve640': true }");
 +    assert_sve_vls(qts, cpu_type, 0xf,
 +                   "{ 'sve384': true, 'sve512': true, 'sve640': false }");
 +}
 +
 +static void sve_tests_sve_max_vq_8(const void *data)
 +{
 +    QTestState *qts;
 +
 +    qts = qtest_init(MACHINE "-cpu max,sve-max-vq=8");
 +
 +    assert_sve_vls(qts, "max", BIT_ULL(8) - 1, NULL);
 +
 +    /*
 +     * Disabling the max-vq set by sve-max-vq is not allowed, but
 +     * of course enabling it is OK.
 +     */
 +    assert_error(qts, "max", "cannot disable sve1024", "{ 'sve1024': false }");
 +    assert_sve_vls(qts, "max", 0xff, "{ 'sve1024': true }");
 +
 +    /*
 +     * Enabling anything larger than max-vq set by sve-max-vq is not
 +     * allowed, but of course disabling everything larger is OK.
 +     */
 +    assert_error(qts, "max", "cannot enable sve1152", "{ 'sve1152': true }");
 +    assert_sve_vls(qts, "max", 0xff, "{ 'sve1152': false }");
 +
 +    /*
 +     * We can enable/disable non power-of-two lengths smaller than the
 +     * max-vq set by sve-max-vq, but, while we can enable power-of-two
 +     * lengths, we can't disable them.
 +     */
 +    assert_sve_vls(qts, "max", 0xff, "{ 'sve384': true }");
 +    assert_sve_vls(qts, "max", 0xfb, "{ 'sve384': false }");
 +    assert_sve_vls(qts, "max", 0xff, "{ 'sve256': true }");
 +    assert_error(qts, "max", "cannot disable sve256", "{ 'sve256': false }");
 +
 +    qtest_quit(qts);
 +}
 +
 +static void sve_tests_sve_off(const void *data)
 +{
 +    QTestState *qts;
 +
 +    qts = qtest_init(MACHINE "-cpu max,sve=off");
 +
 +    /* SVE is off, so the map should be empty. */
 +    assert_sve_vls(qts, "max", 0, NULL);
 +
 +    /* The map stays empty even if we turn lengths off. */
 +    assert_sve_vls(qts, "max", 0, "{ 'sve128': false }");
 +
 +    /* It's an error to enable lengths when SVE is off. */
 +    assert_error(qts, "max", "cannot enable sve128", "{ 'sve128': true }");
 +
 +    /* With SVE re-enabled we should get all vector lengths enabled. */
 +    assert_sve_vls(qts, "max", BIT_ULL(SVE_MAX_VQ) - 1, "{ 'sve': true }");
 +
 +    /* Or enable SVE with just specific vector lengths. */
 +    assert_sve_vls(qts, "max", 0x3,
 +                   "{ 'sve': true, 'sve128': true, 'sve256': true }");
 +
 +    qtest_quit(qts);
 +}
 +
  static void test_query_cpu_model_expansion(const void *data)
  {
      QTestState *qts;
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion(const void *data)
      if (g_str_equal(qtest_get_arch(), "aarch64")) {
          assert_has_feature(qts, "max", "aarch64");
          assert_has_feature(qts, "max", "sve");
 +        assert_has_feature(qts, "max", "sve128");
          assert_has_feature(qts, "cortex-a57", "pmu");
          assert_has_feature(qts, "cortex-a57", "aarch64");
 +        sve_tests_default(qts, "max");
 +
          /* Test that features that depend on KVM generate errors without. */
          assert_error(qts, "max",
                       "'aarch64' feature cannot be disabled "
@@ -XXX,XX +XXX,XX @@ int main(int argc, char **argv)
      qtest_add_data_func("/arm/kvm/query-cpu-model-expansion",
                          NULL, test_query_cpu_model_expansion_kvm);
 +    if (g_str_equal(qtest_get_arch(), "aarch64")) {
 +        qtest_add_data_func("/arm/max/query-cpu-model-expansion/sve-max-vq-8",
 +                            NULL, sve_tests_sve_max_vq_8);
 +        qtest_add_data_func("/arm/max/query-cpu-model-expansion/sve-off",
 +                            NULL, sve_tests_sve_off);
 +    }
 +
      return g_test_run();
  }
 diff --git a/docs/arm-cpu-features.rst b/docs/arm-cpu-features.rst
 index XXXXXXX..XXXXXXX 100644
 --- a/docs/arm-cpu-features.rst
 +++ b/docs/arm-cpu-features.rst
@@ -XXX,XX +XXX,XX @@ block in the script for usage) is used to issue the QMP commands.
        (QEMU) query-cpu-model-expansion type=full model={"name":"max"}
        { "return": {
          "model": { "name": "max", "props": {
 -        "pmu": true, "aarch64": true
 +        "sve1664": true, "pmu": true, "sve1792": true, "sve1920": true,
 +        "sve128": true, "aarch64": true, "sve1024": true, "sve": true,
 +        "sve640": true, "sve768": true, "sve1408": true, "sve256": true,
 +        "sve1152": true, "sve512": true, "sve384": true, "sve1536": true,
 +        "sve896": true, "sve1280": true, "sve2048": true
        }}}}
 -We see that the `max` CPU type has the `pmu` and `aarch64` CPU features.
 -We also see that the CPU features are enabled, as they are all `true`.
 +We see that the `max` CPU type has the `pmu`, `aarch64`, `sve`, and many
 +`sve<N>` CPU features.  We also see that all the CPU features are
 +enabled, as they are all `true`.  (The `sve<N>` CPU features are all
 +optional SVE vector lengths (see "SVE CPU Properties").  While with TCG
 +all SVE vector lengths can be supported, when KVM is in use it's more
 +likely that only a few lengths will be supported, if SVE is supported at
 +all.)
  (2) Let's try to disable the PMU::
        (QEMU) query-cpu-model-expansion type=full model={"name":"max","props":{"pmu":false}}
        { "return": {
          "model": { "name": "max", "props": {
 -        "pmu": false, "aarch64": true
 +        "sve1664": true, "pmu": false, "sve1792": true, "sve1920": true,
 +        "sve128": true, "aarch64": true, "sve1024": true, "sve": true,
 +        "sve640": true, "sve768": true, "sve1408": true, "sve256": true,
 +        "sve1152": true, "sve512": true, "sve384": true, "sve1536": true,
 +        "sve896": true, "sve1280": true, "sve2048": true
        }}}}
  We see it worked, as `pmu` is now `false`.
@@ -XXX,XX +XXX,XX @@ We see it worked, as `pmu` is now `false`.
  It looks like this feature is limited to a configuration we do not
  currently have.
 -(4) Let's try probing CPU features for the Cortex-A15 CPU type::
 +(4) Let's disable `sve` and see what happens to all the optional SVE
 +    vector lengths::
 +
 +      (QEMU) query-cpu-model-expansion type=full model={"name":"max","props":{"sve":false}}
 +      { "return": {
 +        "model": { "name": "max", "props": {
 +        "sve1664": false, "pmu": true, "sve1792": false, "sve1920": false,
 +        "sve128": false, "aarch64": true, "sve1024": false, "sve": false,
 +        "sve640": false, "sve768": false, "sve1408": false, "sve256": false,
 +        "sve1152": false, "sve512": false, "sve384": false, "sve1536": false,
 +        "sve896": false, "sve1280": false, "sve2048": false
 +      }}}}
 +
 +As expected they are now all `false`.
 +
 +(5) Let's try probing CPU features for the Cortex-A15 CPU type::
        (QEMU) query-cpu-model-expansion type=full model={"name":"cortex-a15"}
        {"return": {"model": {"name": "cortex-a15", "props": {"pmu": true}}}}
@@ -XXX,XX +XXX,XX @@ After determining which CPU features are available and supported for a
  given CPU type, then they may be selectively enabled or disabled on the
  QEMU command line with that CPU type::
 -  $ qemu-system-aarch64 -M virt -cpu max,pmu=off
 +  $ qemu-system-aarch64 -M virt -cpu max,pmu=off,sve=on,sve128=on,sve256=on
 -The example above disables the PMU for the `max` CPU type.
 +The example above disables the PMU and enables the first two SVE vector
 +lengths for the `max` CPU type.  Note, the `sve=on` isn't actually
 +necessary, because, as we observed above with our probe of the `max` CPU
 +type, `sve` is already on by default.  Also, based on our probe of
 +defaults, it would seem we need to disable many SVE vector lengths, rather
 +than only enabling the two we want.  This isn't the case, because, as
 +disabling many SVE vector lengths would be quite verbose, the `sve<N>` CPU
 +properties have special semantics (see "SVE CPU Property Parsing
 +Semantics").
 +
 +SVE CPU Properties
 +==================
 +
 +There are two types of SVE CPU properties: `sve` and `sve<N>`.  The first
 +is used to enable or disable the entire SVE feature, just as the `pmu`
 +CPU property completely enables or disables the PMU.  The second type
 +is used to enable or disable specific vector lengths, where `N` is the
 +number of bits of the length.  The `sve<N>` CPU properties have special
 +dependencies and constraints, see "SVE CPU Property Dependencies and
 +Constraints" below.  Additionally, as we want all supported vector lengths
 +to be enabled by default, then, in order to avoid overly verbose command
 +lines (command lines full of `sve<N>=off`, for all `N` not wanted), we
 +provide the parsing semantics listed in "SVE CPU Property Parsing
 +Semantics".
 +
 +SVE CPU Property Dependencies and Constraints
 +---------------------------------------------
 +
 +  1) At least one vector length must be enabled when `sve` is enabled.
 +
 +  2) If a vector length `N` is enabled, then all power-of-two vector
 +     lengths smaller than `N` must also be enabled.  E.g. if `sve512`
 +     is enabled, then the 128-bit and 256-bit vector lengths must also
 +     be enabled.
 +
 +SVE CPU Property Parsing Semantics
 +----------------------------------
 +
 +  1) If SVE is disabled (`sve=off`), then which SVE vector lengths
 +     are enabled or disabled is irrelevant to the guest, as the entire
 +     SVE feature is disabled and that disables all vector lengths for
 +     the guest.  However QEMU will still track any `sve<N>` CPU
 +     properties provided by the user.  If later an `sve=on` is provided,
 +     then the guest will get only the enabled lengths.  If no `sve=on`
 +     is provided and there are explicitly enabled vector lengths, then
 +     an error is generated.
 +
 +  2) If SVE is enabled (`sve=on`), but no `sve<N>` CPU properties are
 +     provided, then all supported vector lengths are enabled, including
 +     the non-power-of-two lengths.
 +
 +  3) If SVE is enabled, then an error is generated when attempting to
 +     disable the last enabled vector length (see constraint (1) of "SVE
 +     CPU Property Dependencies and Constraints").
 +
 +  4) If one or more vector lengths have been explicitly enabled and at
 +     at least one of the dependency lengths of the maximum enabled length
 +     has been explicitly disabled, then an error is generated (see
 +     constraint (2) of "SVE CPU Property Dependencies and Constraints").
 +
 +  5) If one or more `sve<N>` CPU properties are set `off`, but no `sve<N>`,
 +     CPU properties are set `on`, then the specified vector lengths are
 +     disabled but the default for any unspecified lengths remains enabled.
 +     Disabling a power-of-two vector length also disables all vector
 +     lengths larger than the power-of-two length (see constraint (2) of
 +     "SVE CPU Property Dependencies and Constraints").
 +
 +  6) If one or more `sve<N>` CPU properties are set to `on`, then they
 +     are enabled and all unspecified lengths default to disabled, except
 +     for the required lengths per constraint (2) of "SVE CPU Property
 +     Dependencies and Constraints", which will even be auto-enabled if
 +     they were not explicitly enabled.
 +
 +  7) If SVE was disabled (`sve=off`), allowing all vector lengths to be
 +     explicitly disabled (i.e. avoiding the error specified in (3) of
 +     "SVE CPU Property Parsing Semantics"), then if later an `sve=on` is
 +     provided an error will be generated.  To avoid this error, one must
 +     enable at least one vector length prior to enabling SVE.
 +
 +SVE CPU Property Examples
 +-------------------------
 +
 +  1) Disable SVE::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve=off
 +
 +  2) Implicitly enable all vector lengths for the `max` CPU type::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max
 +
 +  3) Only enable the 128-bit vector length::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve128=on
 +
 +  4) Disable the 512-bit vector length and all larger vector lengths,
 +     since 512 is a power-of-two.  This results in all the smaller,
 +     uninitialized lengths (128, 256, and 384) defaulting to enabled::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve512=off
 +
 +  5) Enable the 128-bit, 256-bit, and 512-bit vector lengths::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve128=on,sve256=on,sve512=on
 +
 +  6) The same as (5), but since the 128-bit and 256-bit vector
 +     lengths are required for the 512-bit vector length to be enabled,
 +     then allow them to be auto-enabled::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve512=on
 +
 +  7) Do the same as (6), but by first disabling SVE and then re-enabling it::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve=off,sve512=on,sve=on
 +
 +  8) Force errors regarding the last vector length::
 +
 +     $ qemu-system-aarch64 -M virt -cpu max,sve128=off
 +     $ qemu-system-aarch64 -M virt -cpu max,sve=off,sve128=off,sve=on
 +
 +SVE CPU Property Recommendations
 +--------------------------------
 +
 +The examples in "SVE CPU Property Examples" exhibit many ways to select
 +vector lengths which developers may find useful in order to avoid overly
 +verbose command lines.  However, the recommended way to select vector
 +lengths is to explicitly enable each desired length.  Therefore only
 +example's (1), (3), and (5) exhibit recommended uses of the properties.
 --
 .20.1

-[Qemu-devel] [PULL 18/21] atomic_template: fix indentation in GEN_ATOMIC_HELPER
+[PULL 05/11] target/arm/kvm64: Add kvm_arch_get/put_sve
-From: "Emilio G. Cota" <cota@braap.org>
+From: Andrew Jones <drjones@redhat.com>
-Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
+These are the SVE equivalents to kvm_arch_get/put_fpsimd. Note, the
-Signed-off-by: Emilio G. Cota <cota@braap.org>
+swabbing is different than it is for fpsmid because the vector format
 is a little-endian stream of words.
 Signed-off-by: Andrew Jones <drjones@redhat.com>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
+Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
-Message-id: 20190828165307.18321-8-alex.bennee@linaro.org
+Message-id: 20191031142734.8590-6-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- accel/tcg/atomic_template.h | 2 +-
+ target/arm/kvm64.c | 185 ++++++++++++++++++++++++++++++++++++++-------
-file changed, 1 insertion(+), 1 deletion(-)
+file changed, 156 insertions(+), 29 deletions(-)
-diff --git a/accel/tcg/atomic_template.h b/accel/tcg/atomic_template.h
+diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
 index XXXXXXX..XXXXXXX 100644
---- a/accel/tcg/atomic_template.h
+--- a/target/arm/kvm64.c
-+++ b/accel/tcg/atomic_template.h
++++ b/target/arm/kvm64.c
-@@ -XXX,XX +XXX,XX @@ ABI_TYPE ATOMIC_NAME(xchg)(CPUArchState *env, target_ulong addr,
+@@ -XXX,XX +XXX,XX @@ int kvm_arch_destroy_vcpu(CPUState *cs)
+ bool kvm_arm_reg_syncs_via_cpreg_list(uint64_t regidx)
- #define GEN_ATOMIC_HELPER(X)                                        \
+ {
- ABI_TYPE ATOMIC_NAME(X)(CPUArchState *env, target_ulong addr,       \
+     /* Return true if the regidx is a register we should synchronize
--                 ABI_TYPE val EXTRA_ARGS)                           \
+-     * via the cpreg_tuples array (ie is not a core reg we sync by
-+                        ABI_TYPE val EXTRA_ARGS)                    \
+-     * hand in kvm_arch_get/put_registers())
- {                                                                   \
++     * via the cpreg_tuples array (ie is not a core or sve reg that
-     ATOMIC_MMU_DECLS;                                               \
++     * we sync by hand in kvm_arch_get/put_registers())
-     DATA_TYPE *haddr = ATOMIC_MMU_LOOKUP;                           \
+      */
      switch (regidx & KVM_REG_ARM_COPROC_MASK) {
      case KVM_REG_ARM_CORE:
 +    case KVM_REG_ARM64_SVE:
          return false;
      default:
          return true;
@@ -XXX,XX +XXX,XX @@ int kvm_arm_cpreg_level(uint64_t regidx)
  static int kvm_arch_put_fpsimd(CPUState *cs)
  {
 -    ARMCPU *cpu = ARM_CPU(cs);
 -    CPUARMState *env = &cpu->env;
 +    CPUARMState *env = &ARM_CPU(cs)->env;
      struct kvm_one_reg reg;
 -    uint32_t fpr;
      int i, ret;
      for (i = 0; i < 32; i++) {
@@ -XXX,XX +XXX,XX @@ static int kvm_arch_put_fpsimd(CPUState *cs)
          }
      }
 -    reg.addr = (uintptr_t)(&fpr);
 -    fpr = vfp_get_fpsr(env);
 -    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpsr);
 -    ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
 -    if (ret) {
 -        return ret;
 +    return 0;
 +}
 +
 +/*
 + * SVE registers are encoded in KVM's memory in an endianness-invariant format.
 + * The byte at offset i from the start of the in-memory representation contains
 + * the bits [(7 + 8 * i) : (8 * i)] of the register value. As this means the
 + * lowest offsets are stored in the lowest memory addresses, then that nearly
 + * matches QEMU's representation, which is to use an array of host-endian
 + * uint64_t's, where the lower offsets are at the lower indices. To complete
 + * the translation we just need to byte swap the uint64_t's on big-endian hosts.
 + */
 +static uint64_t *sve_bswap64(uint64_t *dst, uint64_t *src, int nr)
 +{
 +#ifdef HOST_WORDS_BIGENDIAN
 +    int i;
 +
 +    for (i = 0; i < nr; ++i) {
 +        dst[i] = bswap64(src[i]);
      }
 -    reg.addr = (uintptr_t)(&fpr);
 -    fpr = vfp_get_fpcr(env);
 -    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpcr);
 +    return dst;
 +#else
 +    return src;
 +#endif
 +}
 +
 +/*
 + * KVM SVE registers come in slices where ZREGs have a slice size of 2048 bits
 + * and PREGS and the FFR have a slice size of 256 bits. However we simply hard
 + * code the slice index to zero for now as it's unlikely we'll need more than
 + * one slice for quite some time.
 + */
 +static int kvm_arch_put_sve(CPUState *cs)
 +{
 +    ARMCPU *cpu = ARM_CPU(cs);
 +    CPUARMState *env = &cpu->env;
 +    uint64_t tmp[ARM_MAX_VQ * 2];
 +    uint64_t *r;
 +    struct kvm_one_reg reg;
 +    int n, ret;
 +
 +    for (n = 0; n < KVM_ARM64_SVE_NUM_ZREGS; ++n) {
 +        r = sve_bswap64(tmp, &env->vfp.zregs[n].d[0], cpu->sve_max_vq * 2);
 +        reg.addr = (uintptr_t)r;
 +        reg.id = KVM_REG_ARM64_SVE_ZREG(n, 0);
 +        ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
 +        if (ret) {
 +            return ret;
 +        }
 +    }
 +
 +    for (n = 0; n < KVM_ARM64_SVE_NUM_PREGS; ++n) {
 +        r = sve_bswap64(tmp, r = &env->vfp.pregs[n].p[0],
 +                        DIV_ROUND_UP(cpu->sve_max_vq * 2, 8));
 +        reg.addr = (uintptr_t)r;
 +        reg.id = KVM_REG_ARM64_SVE_PREG(n, 0);
 +        ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
 +        if (ret) {
 +            return ret;
 +        }
 +    }
 +
 +    r = sve_bswap64(tmp, &env->vfp.pregs[FFR_PRED_NUM].p[0],
 +                    DIV_ROUND_UP(cpu->sve_max_vq * 2, 8));
 +    reg.addr = (uintptr_t)r;
 +    reg.id = KVM_REG_ARM64_SVE_FFR(0);
      ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
      if (ret) {
          return ret;
@@ -XXX,XX +XXX,XX @@ int kvm_arch_put_registers(CPUState *cs, int level)
  {
      struct kvm_one_reg reg;
      uint64_t val;
 +    uint32_t fpr;
      int i, ret;
      unsigned int el;
@@ -XXX,XX +XXX,XX @@ int kvm_arch_put_registers(CPUState *cs, int level)
          }
      }
 -    ret = kvm_arch_put_fpsimd(cs);
 +    if (cpu_isar_feature(aa64_sve, cpu)) {
 +        ret = kvm_arch_put_sve(cs);
 +    } else {
 +        ret = kvm_arch_put_fpsimd(cs);
 +    }
 +    if (ret) {
 +        return ret;
 +    }
 +
 +    reg.addr = (uintptr_t)(&fpr);
 +    fpr = vfp_get_fpsr(env);
 +    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpsr);
 +    ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
 +    if (ret) {
 +        return ret;
 +    }
 +
 +    reg.addr = (uintptr_t)(&fpr);
 +    fpr = vfp_get_fpcr(env);
 +    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpcr);
 +    ret = kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
      if (ret) {
          return ret;
      }
@@ -XXX,XX +XXX,XX @@ int kvm_arch_put_registers(CPUState *cs, int level)
  static int kvm_arch_get_fpsimd(CPUState *cs)
  {
 -    ARMCPU *cpu = ARM_CPU(cs);
 -    CPUARMState *env = &cpu->env;
 +    CPUARMState *env = &ARM_CPU(cs)->env;
      struct kvm_one_reg reg;
 -    uint32_t fpr;
      int i, ret;
      for (i = 0; i < 32; i++) {
@@ -XXX,XX +XXX,XX @@ static int kvm_arch_get_fpsimd(CPUState *cs)
          }
      }
 -    reg.addr = (uintptr_t)(&fpr);
 -    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpsr);
 -    ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
 -    if (ret) {
 -        return ret;
 -    }
 -    vfp_set_fpsr(env, fpr);
 +    return 0;
 +}
 -    reg.addr = (uintptr_t)(&fpr);
 -    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpcr);
 +/*
 + * KVM SVE registers come in slices where ZREGs have a slice size of 2048 bits
 + * and PREGS and the FFR have a slice size of 256 bits. However we simply hard
 + * code the slice index to zero for now as it's unlikely we'll need more than
 + * one slice for quite some time.
 + */
 +static int kvm_arch_get_sve(CPUState *cs)
 +{
 +    ARMCPU *cpu = ARM_CPU(cs);
 +    CPUARMState *env = &cpu->env;
 +    struct kvm_one_reg reg;
 +    uint64_t *r;
 +    int n, ret;
 +
 +    for (n = 0; n < KVM_ARM64_SVE_NUM_ZREGS; ++n) {
 +        r = &env->vfp.zregs[n].d[0];
 +        reg.addr = (uintptr_t)r;
 +        reg.id = KVM_REG_ARM64_SVE_ZREG(n, 0);
 +        ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
 +        if (ret) {
 +            return ret;
 +        }
 +        sve_bswap64(r, r, cpu->sve_max_vq * 2);
 +    }
 +
 +    for (n = 0; n < KVM_ARM64_SVE_NUM_PREGS; ++n) {
 +        r = &env->vfp.pregs[n].p[0];
 +        reg.addr = (uintptr_t)r;
 +        reg.id = KVM_REG_ARM64_SVE_PREG(n, 0);
 +        ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
 +        if (ret) {
 +            return ret;
 +        }
 +        sve_bswap64(r, r, DIV_ROUND_UP(cpu->sve_max_vq * 2, 8));
 +    }
 +
 +    r = &env->vfp.pregs[FFR_PRED_NUM].p[0];
 +    reg.addr = (uintptr_t)r;
 +    reg.id = KVM_REG_ARM64_SVE_FFR(0);
      ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
      if (ret) {
          return ret;
      }
 -    vfp_set_fpcr(env, fpr);
 +    sve_bswap64(r, r, DIV_ROUND_UP(cpu->sve_max_vq * 2, 8));
      return 0;
  }
@@ -XXX,XX +XXX,XX @@ int kvm_arch_get_registers(CPUState *cs)
      struct kvm_one_reg reg;
      uint64_t val;
      unsigned int el;
 +    uint32_t fpr;
      int i, ret;
      ARMCPU *cpu = ARM_CPU(cs);
@@ -XXX,XX +XXX,XX @@ int kvm_arch_get_registers(CPUState *cs)
          env->spsr = env->banked_spsr[i];
      }
 -    ret = kvm_arch_get_fpsimd(cs);
 +    if (cpu_isar_feature(aa64_sve, cpu)) {
 +        ret = kvm_arch_get_sve(cs);
 +    } else {
 +        ret = kvm_arch_get_fpsimd(cs);
 +    }
      if (ret) {
          return ret;
      }
 +    reg.addr = (uintptr_t)(&fpr);
 +    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpsr);
 +    ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
 +    if (ret) {
 +        return ret;
 +    }
 +    vfp_set_fpsr(env, fpr);
 +
 +    reg.addr = (uintptr_t)(&fpr);
 +    reg.id = AARCH64_SIMD_CTRL_REG(fp_regs.fpcr);
 +    ret = kvm_vcpu_ioctl(cs, KVM_GET_ONE_REG, &reg);
 +    if (ret) {
 +        return ret;
 +    }
 +    vfp_set_fpcr(env, fpr);
 +
      ret = kvm_get_vcpu_events(cpu);
      if (ret) {
          return ret;
 --
 .20.1

-[Qemu-devel] [PULL 02/21] target/arm: Factor out unallocated_encoding for aarch32
+[PULL 06/11] target/arm/kvm64: max cpu: Enable SVE when available
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Andrew Jones <drjones@redhat.com>
-Make this a static function private to translate.c.
+Enable SVE in the KVM guest when the 'max' cpu type is configured
-Thus we can use the same idiom between aarch64 and aarch32
+and KVM supports it. KVM SVE requires use of the new finalize
-without actually sharing function implementations.
+vcpu ioctl, so we add that now too. For starters SVE can only be
+turned on or off, getting all vector lengths the host CPU supports
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+when on. We'll add the other SVE CPU properties in later patches.
-Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
-Message-id: 20190826151536.6771-3-richard.henderson@linaro.org
+Signed-off-by: Andrew Jones <drjones@redhat.com>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Reviewed-by: Eric Auger <eric.auger@redhat.com>
 Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
 Reviewed-by: Beata Michalska <beata.michalska@linaro.org>
 Message-id: 20191031142734.8590-7-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-vfp.inc.c |  3 +--
+ target/arm/kvm_arm.h     | 27 +++++++++++++++++++++++++++
- target/arm/translate.c         | 22 ++++++++++++----------
+ target/arm/cpu64.c       | 17 ++++++++++++++---
-files changed, 13 insertions(+), 12 deletions(-)
+ target/arm/kvm.c         |  5 +++++
+ target/arm/kvm64.c       | 20 +++++++++++++++++++-
-diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
+ tests/arm-cpu-features.c |  4 ++++
-index XXXXXXX..XXXXXXX 100644
+files changed, 69 insertions(+), 4 deletions(-)
---- a/target/arm/translate-vfp.inc.c
-+++ b/target/arm/translate-vfp.inc.c
+diff --git a/target/arm/kvm_arm.h b/target/arm/kvm_arm.h
-@@ -XXX,XX +XXX,XX @@ static bool full_vfp_access_check(DisasContext *s, bool ignore_vfp_enabled)
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/kvm_arm.h
-     if (!s->vfp_enabled && !ignore_vfp_enabled) {
++++ b/target/arm/kvm_arm.h
-         assert(!arm_dc_feature(s, ARM_FEATURE_M));
+@@ -XXX,XX +XXX,XX @@
--        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+  */
--                           default_exception_el(s));
+ int kvm_arm_vcpu_init(CPUState *cs);
-+        unallocated_encoding(s);
-         return false;
++/**
-     }
++ * kvm_arm_vcpu_finalize
++ * @cs: CPUState
-diff --git a/target/arm/translate.c b/target/arm/translate.c
++ * @feature: int
-index XXXXXXX..XXXXXXX 100644
++ *
---- a/target/arm/translate.c
++ * Finalizes the configuration of the specified VCPU feature by
-+++ b/target/arm/translate.c
++ * invoking the KVM_ARM_VCPU_FINALIZE ioctl. Features requiring
-@@ -XXX,XX +XXX,XX @@ static void gen_exception_bkpt_insn(DisasContext *s, uint32_t syn)
++ * this are documented in the "KVM_ARM_VCPU_FINALIZE" section of
-     s->base.is_jmp = DISAS_NORETURN;
++ * KVM's API documentation.
- }
++ *
++ * Returns: 0 if success else < 0 error code
-+static void unallocated_encoding(DisasContext *s)
++ */
 +int kvm_arm_vcpu_finalize(CPUState *cs, int feature);
 +
  /**
   * kvm_arm_register_device:
   * @mr: memory region for this device
@@ -XXX,XX +XXX,XX @@ bool kvm_arm_aarch32_supported(CPUState *cs);
   */
  bool kvm_arm_pmu_supported(CPUState *cs);
 +/**
 + * bool kvm_arm_sve_supported:
 + * @cs: CPUState
 + *
 + * Returns true if the KVM VCPU can enable SVE and false otherwise.
 + */
 +bool kvm_arm_sve_supported(CPUState *cs);
 +
  /**
   * kvm_arm_get_max_vm_ipa_size - Returns the number of bits in the
   * IPA address space supported by KVM
@@ -XXX,XX +XXX,XX @@ static inline bool kvm_arm_pmu_supported(CPUState *cs)
      return false;
  }
 +static inline bool kvm_arm_sve_supported(CPUState *cs)
 +{
-+    /* Unallocated and reserved encodings are uncategorized */
++    return false;
 +    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
 +                       default_exception_el(s));
 +}
 +
- /* Force a TB lookup after an instruction that changes the CPU state.  */
+ static inline int kvm_arm_get_max_vm_ipa_size(MachineState *ms)
  static inline void gen_lookup_tb(DisasContext *s)
  {
-@@ -XXX,XX +XXX,XX @@ static inline void gen_hlt(DisasContext *s, int imm)
+     return -ENOENT;
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, Visitor *v, const char *name,
          return;
      }
--    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++    if (value && kvm_enabled() && !kvm_arm_sve_supported(CPU(cpu))) {
--                       default_exception_el(s));
++        error_setg(errp, "'sve' feature not supported by KVM on this host");
-+    unallocated_encoding(s);
++        return;
- }
++    }
++
- static inline void gen_add_data_offset(DisasContext *s, unsigned int insn,
+     t = cpu->isar.id_aa64pfr0;
-@@ -XXX,XX +XXX,XX @@ static void gen_srs(DisasContext *s,
+     t = FIELD_DP64(t, ID_AA64PFR0, SVE, value);
-     }
+     cpu->isar.id_aa64pfr0 = t;
+@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
-     if (undef) {
+ {
--        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+     ARMCPU *cpu = ARM_CPU(obj);
--                           default_exception_el(s));
+     uint32_t vq;
-+        unallocated_encoding(s);
++    uint64_t t;
-         return;
-     }
+     if (kvm_enabled()) {
+         kvm_arm_set_cpu_features_from_host(cpu);
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
++        if (kvm_arm_sve_supported(CPU(cpu))) {
-             break;
++            t = cpu->isar.id_aa64pfr0;
-         default:
++            t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
-         illegal_op:
++            cpu->isar.id_aa64pfr0 = t;
--            gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++        }
--                               default_exception_el(s));
+     } else {
-+            unallocated_encoding(s);
+-        uint64_t t;
-             break;
+         uint32_t u;
          aarch64_a57_initfn(obj);
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
          object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
                              cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
 -        object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
 -                            cpu_arm_set_sve, NULL, NULL, &error_fatal);
          for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
              char name[8];
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
                                  cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
          }
      }
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
++
-     }
++    object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
-     return;
++                        cpu_arm_set_sve, NULL, NULL, &error_fatal);
- illegal_op:
+ }
--    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
--                       default_exception_el(s));
+ struct ARMCPUInfo {
-+    unallocated_encoding(s);
+diff --git a/target/arm/kvm.c b/target/arm/kvm.c
- }
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/kvm.c
- static void disas_thumb_insn(DisasContext *s, uint32_t insn)
++++ b/target/arm/kvm.c
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb_insn(DisasContext *s, uint32_t insn)
+@@ -XXX,XX +XXX,XX @@ int kvm_arm_vcpu_init(CPUState *cs)
-     return;
+     return kvm_vcpu_ioctl(cs, KVM_ARM_VCPU_INIT, &init);
- illegal_op:
+ }
- undef:
--    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++int kvm_arm_vcpu_finalize(CPUState *cs, int feature)
--                       default_exception_el(s));
++{
-+    unallocated_encoding(s);
++    return kvm_vcpu_ioctl(cs, KVM_ARM_VCPU_FINALIZE, &feature);
- }
++}
++
- static bool insn_crosses_page(CPUARMState *env, DisasContext *s)
+ void kvm_arm_init_serror_injection(CPUState *cs)
  {
      cap_has_inject_serror_esr = kvm_check_extension(cs->kvm_state,
 diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/kvm64.c
 +++ b/target/arm/kvm64.c
@@ -XXX,XX +XXX,XX @@ bool kvm_arm_aarch32_supported(CPUState *cpu)
      return kvm_check_extension(s, KVM_CAP_ARM_EL1_32BIT);
  }
 +bool kvm_arm_sve_supported(CPUState *cpu)
 +{
 +    KVMState *s = KVM_STATE(current_machine->accelerator);
 +
 +    return kvm_check_extension(s, KVM_CAP_ARM_SVE);
 +}
 +
  #define ARM_CPU_ID_MPIDR       3, 0, 0, 0, 5
  int kvm_arch_init_vcpu(CPUState *cs)
@@ -XXX,XX +XXX,XX @@ int kvm_arch_init_vcpu(CPUState *cs)
          cpu->kvm_init_features[0] |= 1 << KVM_ARM_VCPU_EL1_32BIT;
      }
      if (!kvm_check_extension(cs->kvm_state, KVM_CAP_ARM_PMU_V3)) {
 -            cpu->has_pmu = false;
 +        cpu->has_pmu = false;
      }
      if (cpu->has_pmu) {
          cpu->kvm_init_features[0] |= 1 << KVM_ARM_VCPU_PMU_V3;
      } else {
          unset_feature(&env->features, ARM_FEATURE_PMU);
      }
 +    if (cpu_isar_feature(aa64_sve, cpu)) {
 +        assert(kvm_arm_sve_supported(cs));
 +        cpu->kvm_init_features[0] |= 1 << KVM_ARM_VCPU_SVE;
 +    }
      /* Do KVM_ARM_VCPU_INIT ioctl */
      ret = kvm_arm_vcpu_init(cs);
@@ -XXX,XX +XXX,XX @@ int kvm_arch_init_vcpu(CPUState *cs)
          return ret;
      }
 +    if (cpu_isar_feature(aa64_sve, cpu)) {
 +        ret = kvm_arm_vcpu_finalize(cs, KVM_ARM_VCPU_SVE);
 +        if (ret) {
 +            return ret;
 +        }
 +    }
 +
      /*
       * When KVM is in use, PSCI is emulated in-kernel and not by qemu.
       * Currently KVM has its own idea about MPIDR assignment, so we
 diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 index XXXXXXX..XXXXXXX 100644
 --- a/tests/arm-cpu-features.c
 +++ b/tests/arm-cpu-features.c
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
          assert_has_feature(qts, "host", "aarch64");
          assert_has_feature(qts, "host", "pmu");
 +        assert_has_feature(qts, "max", "sve");
 +
          assert_error(qts, "cortex-a15",
              "We cannot guarantee the CPU type 'cortex-a15' works "
              "with KVM on this host", NULL);
      } else {
          assert_has_not_feature(qts, "host", "aarch64");
          assert_has_not_feature(qts, "host", "pmu");
 +
 +        assert_has_not_feature(qts, "max", "sve");
      }
      qtest_quit(qts);
 --
 .20.1

-[Qemu-devel] [PULL 07/21] hw/arm/smmuv3: Log a guest error when decoding an invalid STE
+[PULL 07/11] target/arm/kvm: scratch vcpu: Preserve input kvm_vcpu_init features
-From: Eric Auger <eric.auger@redhat.com>
+From: Andrew Jones <drjones@redhat.com>
-Log a guest error when encountering an invalid STE.
+kvm_arm_create_scratch_host_vcpu() takes a struct kvm_vcpu_init
 parameter. Rather than just using it as an output parameter to
 pass back the preferred target, use it also as an input parameter,
 allowing a caller to pass a selected target if they wish and to
 also pass cpu features. If the caller doesn't want to select a
 target they can pass -1 for the target which indicates they want
 to use the preferred target and have it passed back like before.
-Signed-off-by: Eric Auger <eric.auger@redhat.com>
+Signed-off-by: Andrew Jones <drjones@redhat.com>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190822172350.12008-5-eric.auger@redhat.com
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
 Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
 Reviewed-by: Beata Michalska <beata.michalska@linaro.org>
 Message-id: 20191031142734.8590-8-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/arm/smmuv3.c | 1 +
+ target/arm/kvm.c   | 20 +++++++++++++++-----
-file changed, 1 insertion(+)
+ target/arm/kvm32.c |  6 +++++-
  target/arm/kvm64.c |  6 +++++-
 files changed, 25 insertions(+), 7 deletions(-)
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
+diff --git a/target/arm/kvm.c b/target/arm/kvm.c
 index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
+--- a/target/arm/kvm.c
-+++ b/hw/arm/smmuv3.c
++++ b/target/arm/kvm.c
-@@ -XXX,XX +XXX,XX @@ static int decode_ste(SMMUv3State *s, SMMUTransCfg *cfg,
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_create_scratch_host_vcpu(const uint32_t *cpus_to_try,
-     uint32_t config;
+                                       int *fdarray,
+                                       struct kvm_vcpu_init *init)
-     if (!STE_VALID(ste)) {
+ {
-+        qemu_log_mask(LOG_GUEST_ERROR, "invalid STE\n");
+-    int ret, kvmfd = -1, vmfd = -1, cpufd = -1;
-         goto bad_ste;
++    int ret = 0, kvmfd = -1, vmfd = -1, cpufd = -1;
      kvmfd = qemu_open("/dev/kvm", O_RDWR);
      if (kvmfd < 0) {
@@ -XXX,XX +XXX,XX @@ bool kvm_arm_create_scratch_host_vcpu(const uint32_t *cpus_to_try,
          goto finish;
      }
+-    ret = ioctl(vmfd, KVM_ARM_PREFERRED_TARGET, init);
++    if (init->target == -1) {
++        struct kvm_vcpu_init preferred;
++
++        ret = ioctl(vmfd, KVM_ARM_PREFERRED_TARGET, &preferred);
++        if (!ret) {
++            init->target = preferred.target;
++        }
++    }
+     if (ret >= 0) {
+         ret = ioctl(cpufd, KVM_ARM_VCPU_INIT, init);
+         if (ret < 0) {
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_create_scratch_host_vcpu(const uint32_t *cpus_to_try,
+          * creating one kind of guest CPU which is its preferred
+          * CPU type.
+          */
++        struct kvm_vcpu_init try;
++
+         while (*cpus_to_try != QEMU_KVM_ARM_TARGET_NONE) {
+-            init->target = *cpus_to_try++;
+-            memset(init->features, 0, sizeof(init->features));
+-            ret = ioctl(cpufd, KVM_ARM_VCPU_INIT, init);
++            try.target = *cpus_to_try++;
++            memcpy(try.features, init->features, sizeof(init->features));
++            ret = ioctl(cpufd, KVM_ARM_VCPU_INIT, &try);
+             if (ret >= 0) {
+                 break;
+             }
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_create_scratch_host_vcpu(const uint32_t *cpus_to_try,
+         if (ret < 0) {
+             goto err;
+         }
++        init->target = try.target;
+     } else {
+         /* Treat a NULL cpus_to_try argument the same as an empty
+          * list, which means we will fail the call since this must
+diff --git a/target/arm/kvm32.c b/target/arm/kvm32.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/kvm32.c
++++ b/target/arm/kvm32.c
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf)
+         QEMU_KVM_ARM_TARGET_CORTEX_A15,
+         QEMU_KVM_ARM_TARGET_NONE
+     };
+-    struct kvm_vcpu_init init;
++    /*
++     * target = -1 informs kvm_arm_create_scratch_host_vcpu()
++     * to use the preferred target
++     */
++    struct kvm_vcpu_init init = { .target = -1, };
+     if (!kvm_arm_create_scratch_host_vcpu(cpus_to_try, fdarray, &init)) {
+         return false;
+diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
+index XXXXXXX..XXXXXXX 100644
+--- a/target/arm/kvm64.c
++++ b/target/arm/kvm64.c
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf)
+         KVM_ARM_TARGET_CORTEX_A57,
+         QEMU_KVM_ARM_TARGET_NONE
+     };
+-    struct kvm_vcpu_init init;
++    /*
++     * target = -1 informs kvm_arm_create_scratch_host_vcpu()
++     * to use the preferred target
++     */
++    struct kvm_vcpu_init init = { .target = -1, };
+     if (!kvm_arm_create_scratch_host_vcpu(cpus_to_try, fdarray, &init)) {
+         return false;
 --
 .20.1

-[Qemu-devel] [PULL 05/21] aspeed/timer: Provide back-pressure information for short periods
+[PULL 08/11] target/arm/cpu64: max cpu: Support sve properties with KVM
-From: Andrew Jeffery <andrew@aj.id.au>
+From: Andrew Jones <drjones@redhat.com>
-First up: This is not the way the hardware behaves.
+Extend the SVE vq map initialization and validation with KVM's
 supported vector lengths when KVM is enabled. In order to determine
 and select supported lengths we add two new KVM functions for getting
 and setting the KVM_REG_ARM64_SVE_VLS pseudo-register.
-However, it helps resolve real-world problems with short periods being
+This patch has been co-authored with Richard Henderson, who reworked
-used under Linux. Commit 4451d3f59f2a ("clocksource/drivers/fttmr010:
+the target/arm/cpu64.c changes in order to push all the validation and
-Fix set_next_event handler") in Linux fixed the timer driver to
+auto-enabling/disabling steps into the finalizer, resulting in a nice
-correctly schedule the next event for the Aspeed controller, and in
+LOC reduction.
 combination with 5daa8212c08e ("ARM: dts: aspeed: Describe random number
 device") Linux will now set a timer with a period as low as 1us.
-Configuring a qemu timer with such a short period results in spending
+Signed-off-by: Andrew Jones <drjones@redhat.com>
-time handling the interrupt in the model rather than executing guest
+Reviewed-by: Eric Auger <eric.auger@redhat.com>
-code, leading to noticeable "sticky" behaviour in the guest.
+Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
+Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
-The behaviour of Linux is correct with respect to the hardware, so we
+Message-id: 20191031142734.8590-9-drjones@redhat.com
 need to improve our handling under emulation. The approach chosen is to
 provide back-pressure information by calculating an acceptable minimum
 number of ticks to be set on the model. Under Linux an additional read
 is added in the timer configuration path to detect back-pressure, which
 will never occur on hardware. However if back-pressure is observed, the
 driver alerts the clock event subsystem, which then performs its own
 next event dilation via a config option - d1748302f70b ("clockevents:
 Make minimum delay adjustments configurable")
 A minimum period of 5us was experimentally determined on a Lenovo
 T480s, which I've increased to 20us for "safety".
 Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
 Reviewed-by: Joel Stanley <joel@jms.id.au>
 Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
 Tested-by: Joel Stanley <joel@jms.id.au>
 Signed-off-by: Cédric Le Goater <clg@kaod.org>
 Message-id: 20190704055150.4899-1-clg@kaod.org
 [clg: - changed the computation of min_ticks to be done each time the
         timer value is reloaded. It removes the ordering issue of the
         timer and scu reset handlers but is slightly slower ]
       - introduced TIMER_MIN_NS
       - introduced calculate_min_ticks() ]
 Signed-off-by: Cédric Le Goater <clg@kaod.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- hw/timer/aspeed_timer.c | 17 ++++++++++++++++-
+ target/arm/kvm_arm.h      |  12 +++
-file changed, 16 insertions(+), 1 deletion(-)
+ target/arm/cpu64.c        | 176 ++++++++++++++++++++++++++++----------
  target/arm/kvm64.c        | 100 +++++++++++++++++++++-
  tests/arm-cpu-features.c  | 104 +++++++++++++++++++++-
  docs/arm-cpu-features.rst |  45 +++++++---
 files changed, 379 insertions(+), 58 deletions(-)
-diff --git a/hw/timer/aspeed_timer.c b/hw/timer/aspeed_timer.c
+diff --git a/target/arm/kvm_arm.h b/target/arm/kvm_arm.h
 index XXXXXXX..XXXXXXX 100644
---- a/hw/timer/aspeed_timer.c
+--- a/target/arm/kvm_arm.h
-+++ b/hw/timer/aspeed_timer.c
++++ b/target/arm/kvm_arm.h
-@@ -XXX,XX +XXX,XX @@ enum timer_ctrl_op {
+@@ -XXX,XX +XXX,XX @@ typedef struct ARMHostCPUFeatures {
-     op_pulse_enable
+  */
- };
+ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf);
-+/*
++/**
-+ * Minimum value of the reload register to filter out short period
++ * kvm_arm_sve_get_vls:
-+ * timers which have a noticeable impact in emulation. 5us should be
++ * @cs: CPUState
-+ * enough, use 20us for "safety".
++ * @map: bitmap to fill in
 + *
 + * Get all the SVE vector lengths supported by the KVM host, setting
 + * the bits corresponding to their length in quadwords minus one
 + * (vq - 1) in @map up to ARM_MAX_VQ.
 + */
-+#define TIMER_MIN_NS (20 * SCALE_US)
++void kvm_arm_sve_get_vls(CPUState *cs, unsigned long *map);
 +
  /**
-  * Avoid mutual references between AspeedTimerCtrlState and AspeedTimer
+  * kvm_arm_set_cpu_features_from_host:
-  * structs, as it's a waste of memory. The ptimer BH callback needs to know
+  * @cpu: ARMCPU to set the features for
-@@ -XXX,XX +XXX,XX @@ static inline uint32_t calculate_ticks(struct AspeedTimer *t, uint64_t now_ns)
+@@ -XXX,XX +XXX,XX @@ static inline int kvm_arm_vgic_probe(void)
-     return t->reload - MIN(t->reload, ticks);
+ static inline void kvm_arm_pmu_set_irq(CPUState *cs, int irq) {}
  static inline void kvm_arm_pmu_init(CPUState *cs) {}
 +static inline void kvm_arm_sve_get_vls(CPUState *cs, unsigned long *map) {}
  #endif
  static inline const char *gic_class_name(void)
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
       * any of the above.  Finally, if SVE is not disabled, then at least one
       * vector length must be enabled.
       */
 +    DECLARE_BITMAP(kvm_supported, ARM_MAX_VQ);
      DECLARE_BITMAP(tmp, ARM_MAX_VQ);
      uint32_t vq, max_vq = 0;
 +    /* Collect the set of vector lengths supported by KVM. */
 +    bitmap_zero(kvm_supported, ARM_MAX_VQ);
 +    if (kvm_enabled() && kvm_arm_sve_supported(CPU(cpu))) {
 +        kvm_arm_sve_get_vls(CPU(cpu), kvm_supported);
 +    } else if (kvm_enabled()) {
 +        assert(!cpu_isar_feature(aa64_sve, cpu));
 +    }
 +
      /*
       * Process explicit sve<N> properties.
       * From the properties, sve_vq_map<N> implies sve_vq_init<N>.
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
              return;
          }
 -        /* Propagate enabled bits down through required powers-of-two. */
 -        for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 -            if (!test_bit(vq - 1, cpu->sve_vq_init)) {
 -                set_bit(vq - 1, cpu->sve_vq_map);
 +        if (kvm_enabled()) {
 +            /*
 +             * For KVM we have to automatically enable all supported unitialized
 +             * lengths, even when the smaller lengths are not all powers-of-two.
 +             */
 +            bitmap_andnot(tmp, kvm_supported, cpu->sve_vq_init, max_vq);
 +            bitmap_or(cpu->sve_vq_map, cpu->sve_vq_map, tmp, max_vq);
 +        } else {
 +            /* Propagate enabled bits down through required powers-of-two. */
 +            for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 +                if (!test_bit(vq - 1, cpu->sve_vq_init)) {
 +                    set_bit(vq - 1, cpu->sve_vq_map);
 +                }
              }
          }
      } else if (cpu->sve_max_vq == 0) {
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
              return;
          }
 -        /* Disabling a power-of-two disables all larger lengths. */
 -        if (test_bit(0, cpu->sve_vq_init)) {
 -            error_setg(errp, "cannot disable sve128");
 -            error_append_hint(errp, "Disabling sve128 results in all vector "
 -                              "lengths being disabled.\n");
 -            error_append_hint(errp, "With SVE enabled, at least one vector "
 -                              "length must be enabled.\n");
 -            return;
 -        }
 -        for (vq = 2; vq <= ARM_MAX_VQ; vq <<= 1) {
 -            if (test_bit(vq - 1, cpu->sve_vq_init)) {
 -                break;
 +        if (kvm_enabled()) {
 +            /* Disabling a supported length disables all larger lengths. */
 +            for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
 +                if (test_bit(vq - 1, cpu->sve_vq_init) &&
 +                    test_bit(vq - 1, kvm_supported)) {
 +                    break;
 +                }
              }
 +            max_vq = vq <= ARM_MAX_VQ ? vq - 1 : ARM_MAX_VQ;
 +            bitmap_andnot(cpu->sve_vq_map, kvm_supported,
 +                          cpu->sve_vq_init, max_vq);
 +            if (max_vq == 0 || bitmap_empty(cpu->sve_vq_map, max_vq)) {
 +                error_setg(errp, "cannot disable sve%d", vq * 128);
 +                error_append_hint(errp, "Disabling sve%d results in all "
 +                                  "vector lengths being disabled.\n",
 +                                  vq * 128);
 +                error_append_hint(errp, "With SVE enabled, at least one "
 +                                  "vector length must be enabled.\n");
 +                return;
 +            }
 +        } else {
 +            /* Disabling a power-of-two disables all larger lengths. */
 +            if (test_bit(0, cpu->sve_vq_init)) {
 +                error_setg(errp, "cannot disable sve128");
 +                error_append_hint(errp, "Disabling sve128 results in all "
 +                                  "vector lengths being disabled.\n");
 +                error_append_hint(errp, "With SVE enabled, at least one "
 +                                  "vector length must be enabled.\n");
 +                return;
 +            }
 +            for (vq = 2; vq <= ARM_MAX_VQ; vq <<= 1) {
 +                if (test_bit(vq - 1, cpu->sve_vq_init)) {
 +                    break;
 +                }
 +            }
 +            max_vq = vq <= ARM_MAX_VQ ? vq - 1 : ARM_MAX_VQ;
 +            bitmap_complement(cpu->sve_vq_map, cpu->sve_vq_init, max_vq);
          }
 -        max_vq = vq <= ARM_MAX_VQ ? vq - 1 : ARM_MAX_VQ;
 -        bitmap_complement(cpu->sve_vq_map, cpu->sve_vq_init, max_vq);
          max_vq = find_last_bit(cpu->sve_vq_map, max_vq) + 1;
      }
@@ -XXX,XX +XXX,XX @@ void arm_cpu_sve_finalize(ARMCPU *cpu, Error **errp)
      assert(max_vq != 0);
      bitmap_clear(cpu->sve_vq_map, max_vq, ARM_MAX_VQ - max_vq);
 -    /* Ensure all required powers-of-two are enabled. */
 -    for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 -        if (!test_bit(vq - 1, cpu->sve_vq_map)) {
 -            error_setg(errp, "cannot disable sve%d", vq * 128);
 -            error_append_hint(errp, "sve%d is required as it "
 -                              "is a power-of-two length smaller than "
 -                              "the maximum, sve%d\n",
 -                              vq * 128, max_vq * 128);
 +    if (kvm_enabled()) {
 +        /* Ensure the set of lengths matches what KVM supports. */
 +        bitmap_xor(tmp, cpu->sve_vq_map, kvm_supported, max_vq);
 +        if (!bitmap_empty(tmp, max_vq)) {
 +            vq = find_last_bit(tmp, max_vq) + 1;
 +            if (test_bit(vq - 1, cpu->sve_vq_map)) {
 +                if (cpu->sve_max_vq) {
 +                    error_setg(errp, "cannot set sve-max-vq=%d",
 +                               cpu->sve_max_vq);
 +                    error_append_hint(errp, "This KVM host does not support "
 +                                      "the vector length %d-bits.\n",
 +                                      vq * 128);
 +                    error_append_hint(errp, "It may not be possible to use "
 +                                      "sve-max-vq with this KVM host. Try "
 +                                      "using only sve<N> properties.\n");
 +                } else {
 +                    error_setg(errp, "cannot enable sve%d", vq * 128);
 +                    error_append_hint(errp, "This KVM host does not support "
 +                                      "the vector length %d-bits.\n",
 +                                      vq * 128);
 +                }
 +            } else {
 +                error_setg(errp, "cannot disable sve%d", vq * 128);
 +                error_append_hint(errp, "The KVM host requires all "
 +                                  "supported vector lengths smaller "
 +                                  "than %d bits to also be enabled.\n",
 +                                  max_vq * 128);
 +            }
              return;
          }
 +    } else {
 +        /* Ensure all required powers-of-two are enabled. */
 +        for (vq = pow2floor(max_vq); vq >= 1; vq >>= 1) {
 +            if (!test_bit(vq - 1, cpu->sve_vq_map)) {
 +                error_setg(errp, "cannot disable sve%d", vq * 128);
 +                error_append_hint(errp, "sve%d is required as it "
 +                                  "is a power-of-two length smaller than "
 +                                  "the maximum, sve%d\n",
 +                                  vq * 128, max_vq * 128);
 +                return;
 +            }
 +        }
      }
      /*
@@ -XXX,XX +XXX,XX @@ static void cpu_max_set_sve_max_vq(Object *obj, Visitor *v, const char *name,
  {
      ARMCPU *cpu = ARM_CPU(obj);
      Error *err = NULL;
 +    uint32_t max_vq;
 -    visit_type_uint32(v, name, &cpu->sve_max_vq, &err);
 -
 -    if (!err && (cpu->sve_max_vq == 0 || cpu->sve_max_vq > ARM_MAX_VQ)) {
 -        error_setg(&err, "unsupported SVE vector length");
 -        error_append_hint(&err, "Valid sve-max-vq in range [1-%d]\n",
 -                          ARM_MAX_VQ);
 +    visit_type_uint32(v, name, &max_vq, &err);
 +    if (err) {
 +        error_propagate(errp, err);
 +        return;
      }
 -    error_propagate(errp, err);
 +
 +    if (kvm_enabled() && !kvm_arm_sve_supported(CPU(cpu))) {
 +        error_setg(errp, "cannot set sve-max-vq");
 +        error_append_hint(errp, "SVE not supported by KVM on this host\n");
 +        return;
 +    }
 +
 +    if (max_vq == 0 || max_vq > ARM_MAX_VQ) {
 +        error_setg(errp, "unsupported SVE vector length");
 +        error_append_hint(errp, "Valid sve-max-vq in range [1-%d]\n",
 +                          ARM_MAX_VQ);
 +        return;
 +    }
 +
 +    cpu->sve_max_vq = max_vq;
  }
-+static uint32_t calculate_min_ticks(AspeedTimer *t, uint32_t value)
+ static void cpu_arm_get_sve_vq(Object *obj, Visitor *v, const char *name,
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve_vq(Object *obj, Visitor *v, const char *name,
          return;
      }
 +    if (value && kvm_enabled() && !kvm_arm_sve_supported(CPU(cpu))) {
 +        error_setg(errp, "cannot enable %s", name);
 +        error_append_hint(errp, "SVE not supported by KVM on this host\n");
 +        return;
 +    }
 +
      if (value) {
          set_bit(vq - 1, cpu->sve_vq_map);
      } else {
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
          cpu->ctr = 0x80038003; /* 32 byte I and D cacheline size, VIPT icache */
          cpu->dcz_blocksize = 7; /*  512 bytes */
  #endif
 -
 -        object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
 -                            cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
 -
 -        for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
 -            char name[8];
 -            sprintf(name, "sve%d", vq * 128);
 -            object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
 -                                cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
 -        }
      }
      object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
                          cpu_arm_set_sve, NULL, NULL, &error_fatal);
 +    object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
 +                        cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
 +
 +    for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
 +        char name[8];
 +        sprintf(name, "sve%d", vq * 128);
 +        object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
 +                            cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
 +    }
  }
  struct ARMCPUInfo {
 diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/kvm64.c
 +++ b/target/arm/kvm64.c
@@ -XXX,XX +XXX,XX @@ bool kvm_arm_sve_supported(CPUState *cpu)
      return kvm_check_extension(s, KVM_CAP_ARM_SVE);
  }
 +QEMU_BUILD_BUG_ON(KVM_ARM64_SVE_VQ_MIN != 1);
 +
 +void kvm_arm_sve_get_vls(CPUState *cs, unsigned long *map)
 +{
-+    uint32_t rate = calculate_rate(t);
++    /* Only call this function if kvm_arm_sve_supported() returns true. */
-+    uint32_t min_ticks = muldiv64(TIMER_MIN_NS, rate, NANOSECONDS_PER_SECOND);
++    static uint64_t vls[KVM_ARM64_SVE_VLS_WORDS];
-+
++    static bool probed;
-+    return  value < min_ticks ? min_ticks : value;
++    uint32_t vq = 0;
 +    int i, j;
 +
 +    bitmap_clear(map, 0, ARM_MAX_VQ);
 +
 +    /*
 +     * KVM ensures all host CPUs support the same set of vector lengths.
 +     * So we only need to create the scratch VCPUs once and then cache
 +     * the results.
 +     */
 +    if (!probed) {
 +        struct kvm_vcpu_init init = {
 +            .target = -1,
 +            .features[0] = (1 << KVM_ARM_VCPU_SVE),
 +        };
 +        struct kvm_one_reg reg = {
 +            .id = KVM_REG_ARM64_SVE_VLS,
 +            .addr = (uint64_t)&vls[0],
 +        };
 +        int fdarray[3], ret;
 +
 +        probed = true;
 +
 +        if (!kvm_arm_create_scratch_host_vcpu(NULL, fdarray, &init)) {
 +            error_report("failed to create scratch VCPU with SVE enabled");
 +            abort();
 +        }
 +        ret = ioctl(fdarray[2], KVM_GET_ONE_REG, &reg);
 +        kvm_arm_destroy_scratch_host_vcpu(fdarray);
 +        if (ret) {
 +            error_report("failed to get KVM_REG_ARM64_SVE_VLS: %s",
 +                         strerror(errno));
 +            abort();
 +        }
 +
 +        for (i = KVM_ARM64_SVE_VLS_WORDS - 1; i >= 0; --i) {
 +            if (vls[i]) {
 +                vq = 64 - clz64(vls[i]) + i * 64;
 +                break;
 +            }
 +        }
 +        if (vq > ARM_MAX_VQ) {
 +            warn_report("KVM supports vector lengths larger than "
 +                        "QEMU can enable");
 +        }
 +    }
 +
 +    for (i = 0; i < KVM_ARM64_SVE_VLS_WORDS; ++i) {
 +        if (!vls[i]) {
 +            continue;
 +        }
 +        for (j = 1; j <= 64; ++j) {
 +            vq = j + i * 64;
 +            if (vq > ARM_MAX_VQ) {
 +                return;
 +            }
 +            if (vls[i] & (1UL << (j - 1))) {
 +                set_bit(vq - 1, map);
 +            }
 +        }
 +    }
 +}
 +
- static inline uint64_t calculate_time(struct AspeedTimer *t, uint32_t ticks)
++static int kvm_arm_sve_set_vls(CPUState *cs)
 +{
 +    uint64_t vls[KVM_ARM64_SVE_VLS_WORDS] = {0};
 +    struct kvm_one_reg reg = {
 +        .id = KVM_REG_ARM64_SVE_VLS,
 +        .addr = (uint64_t)&vls[0],
 +    };
 +    ARMCPU *cpu = ARM_CPU(cs);
 +    uint32_t vq;
 +    int i, j;
 +
 +    assert(cpu->sve_max_vq <= KVM_ARM64_SVE_VQ_MAX);
 +
 +    for (vq = 1; vq <= cpu->sve_max_vq; ++vq) {
 +        if (test_bit(vq - 1, cpu->sve_vq_map)) {
 +            i = (vq - 1) / 64;
 +            j = (vq - 1) % 64;
 +            vls[i] |= 1UL << j;
 +        }
 +    }
 +
 +    return kvm_vcpu_ioctl(cs, KVM_SET_ONE_REG, &reg);
 +}
 +
  #define ARM_CPU_ID_MPIDR       3, 0, 0, 0, 5
  int kvm_arch_init_vcpu(CPUState *cs)
@@ -XXX,XX +XXX,XX @@ int kvm_arch_init_vcpu(CPUState *cs)
      if (cpu->kvm_target == QEMU_KVM_ARM_TARGET_NONE ||
          !object_dynamic_cast(OBJECT(cpu), TYPE_AARCH64_CPU)) {
 -        fprintf(stderr, "KVM is not supported for this guest CPU type\n");
 +        error_report("KVM is not supported for this guest CPU type");
          return -EINVAL;
      }
@@ -XXX,XX +XXX,XX @@ int kvm_arch_init_vcpu(CPUState *cs)
      }
      if (cpu_isar_feature(aa64_sve, cpu)) {
 +        ret = kvm_arm_sve_set_vls(cs);
 +        if (ret) {
 +            return ret;
 +        }
          ret = kvm_arm_vcpu_finalize(cs, KVM_ARM_VCPU_SVE);
          if (ret) {
              return ret;
 diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 index XXXXXXX..XXXXXXX 100644
 --- a/tests/arm-cpu-features.c
 +++ b/tests/arm-cpu-features.c
@@ -XXX,XX +XXX,XX @@ static QDict *resp_get_props(QDict *resp)
      return qdict;
  }
 +static bool resp_get_feature(QDict *resp, const char *feature)
 +{
 +    QDict *props;
 +
 +    g_assert(resp);
 +    g_assert(resp_has_props(resp));
 +    props = resp_get_props(resp);
 +    g_assert(qdict_get(props, feature));
 +    return qdict_get_bool(props, feature);
 +}
 +
  #define assert_has_feature(qts, cpu_type, feature)                     \
  ({                                                                     \
      QDict *_resp = do_query_no_props(qts, cpu_type);                   \
@@ -XXX,XX +XXX,XX @@ static void sve_tests_sve_off(const void *data)
      qtest_quit(qts);
  }
 +static void sve_tests_sve_off_kvm(const void *data)
 +{
 +    QTestState *qts;
 +
 +    qts = qtest_init(MACHINE_KVM "-cpu max,sve=off");
 +
 +    /*
 +     * We don't know if this host supports SVE so we don't
 +     * attempt to test enabling anything. We only test that
 +     * everything is disabled (as it should be with sve=off)
 +     * and that using sve<N>=off to explicitly disable vector
 +     * lengths is OK too.
 +     */
 +    assert_sve_vls(qts, "max", 0, NULL);
 +    assert_sve_vls(qts, "max", 0, "{ 'sve128': false }");
 +
 +    qtest_quit(qts);
 +}
 +
  static void test_query_cpu_model_expansion(const void *data)
  {
-     uint64_t delta_ns;
+     QTestState *qts;
-@@ -XXX,XX +XXX,XX @@ static void aspeed_timer_set_value(AspeedTimerCtrlState *s, int timer, int reg,
+@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
-     switch (reg) {
+     }
-     case TIMER_REG_RELOAD:
-         old_reload = t->reload;
+     if (g_str_equal(qtest_get_arch(), "aarch64")) {
--        t->reload = value;
++        bool kvm_supports_sve;
-+        t->reload = calculate_min_ticks(t, value);
++        char max_name[8], name[8];
++        uint32_t max_vq, vq;
-         /* If the reload value was not previously set, or zero, and
++        uint64_t vls;
-          * the current value is valid, try to start the timer if it is
++        QDict *resp;
 +        char *error;
 +
          assert_has_feature(qts, "host", "aarch64");
          assert_has_feature(qts, "host", "pmu");
 -        assert_has_feature(qts, "max", "sve");
 -
          assert_error(qts, "cortex-a15",
              "We cannot guarantee the CPU type 'cortex-a15' works "
              "with KVM on this host", NULL);
 +
 +        assert_has_feature(qts, "max", "sve");
 +        resp = do_query_no_props(qts, "max");
 +        kvm_supports_sve = resp_get_feature(resp, "sve");
 +        vls = resp_get_sve_vls(resp);
 +        qobject_unref(resp);
 +
 +        if (kvm_supports_sve) {
 +            g_assert(vls != 0);
 +            max_vq = 64 - __builtin_clzll(vls);
 +            sprintf(max_name, "sve%d", max_vq * 128);
 +
 +            /* Enabling a supported length is of course fine. */
 +            assert_sve_vls(qts, "max", vls, "{ %s: true }", max_name);
 +
 +            /* Get the next supported length smaller than max-vq. */
 +            vq = 64 - __builtin_clzll(vls & ~BIT_ULL(max_vq - 1));
 +            if (vq) {
 +                /*
 +                 * We have at least one length smaller than max-vq,
 +                 * so we can disable max-vq.
 +                 */
 +                assert_sve_vls(qts, "max", (vls & ~BIT_ULL(max_vq - 1)),
 +                               "{ %s: false }", max_name);
 +
 +                /*
 +                 * Smaller, supported vector lengths cannot be disabled
 +                 * unless all larger, supported vector lengths are also
 +                 * disabled.
 +                 */
 +                sprintf(name, "sve%d", vq * 128);
 +                error = g_strdup_printf("cannot disable %s", name);
 +                assert_error(qts, "max", error,
 +                             "{ %s: true, %s: false }",
 +                             max_name, name);
 +                g_free(error);
 +            }
 +
 +            /*
 +             * The smallest, supported vector length is required, because
 +             * we need at least one vector length enabled.
 +             */
 +            vq = __builtin_ffsll(vls);
 +            sprintf(name, "sve%d", vq * 128);
 +            error = g_strdup_printf("cannot disable %s", name);
 +            assert_error(qts, "max", error, "{ %s: false }", name);
 +            g_free(error);
 +
 +            /* Get an unsupported length. */
 +            for (vq = 1; vq <= max_vq; ++vq) {
 +                if (!(vls & BIT_ULL(vq - 1))) {
 +                    break;
 +                }
 +            }
 +            if (vq <= SVE_MAX_VQ) {
 +                sprintf(name, "sve%d", vq * 128);
 +                error = g_strdup_printf("cannot enable %s", name);
 +                assert_error(qts, "max", error, "{ %s: true }", name);
 +                g_free(error);
 +            }
 +        } else {
 +            g_assert(vls == 0);
 +        }
      } else {
          assert_has_not_feature(qts, "host", "aarch64");
          assert_has_not_feature(qts, "host", "pmu");
@@ -XXX,XX +XXX,XX @@ int main(int argc, char **argv)
                              NULL, sve_tests_sve_max_vq_8);
          qtest_add_data_func("/arm/max/query-cpu-model-expansion/sve-off",
                              NULL, sve_tests_sve_off);
 +        qtest_add_data_func("/arm/kvm/query-cpu-model-expansion/sve-off",
 +                            NULL, sve_tests_sve_off_kvm);
      }
      return g_test_run();
 diff --git a/docs/arm-cpu-features.rst b/docs/arm-cpu-features.rst
 index XXXXXXX..XXXXXXX 100644
 --- a/docs/arm-cpu-features.rst
 +++ b/docs/arm-cpu-features.rst
@@ -XXX,XX +XXX,XX @@ SVE CPU Property Dependencies and Constraints
 ) At least one vector length must be enabled when `sve` is enabled.
 -  2) If a vector length `N` is enabled, then all power-of-two vector
 -     lengths smaller than `N` must also be enabled.  E.g. if `sve512`
 -     is enabled, then the 128-bit and 256-bit vector lengths must also
 -     be enabled.
 +  2) If a vector length `N` is enabled, then, when KVM is enabled, all
 +     smaller, host supported vector lengths must also be enabled.  If
 +     KVM is not enabled, then only all the smaller, power-of-two vector
 +     lengths must be enabled.  E.g. with KVM if the host supports all
 +     vector lengths up to 512-bits (128, 256, 384, 512), then if `sve512`
 +     is enabled, the 128-bit vector length, 256-bit vector length, and
 +     384-bit vector length must also be enabled. Without KVM, the 384-bit
 +     vector length would not be required.
 +
 +  3) If KVM is enabled then only vector lengths that the host CPU type
 +     support may be enabled.  If SVE is not supported by the host, then
 +     no `sve*` properties may be enabled.
  SVE CPU Property Parsing Semantics
  ----------------------------------
@@ -XXX,XX +XXX,XX @@ SVE CPU Property Parsing Semantics
       an error is generated.
 ) If SVE is enabled (`sve=on`), but no `sve<N>` CPU properties are
 -     provided, then all supported vector lengths are enabled, including
 -     the non-power-of-two lengths.
 +     provided, then all supported vector lengths are enabled, which when
 +     KVM is not in use means including the non-power-of-two lengths, and,
 +     when KVM is in use, it means all vector lengths supported by the host
 +     processor.
 ) If SVE is enabled, then an error is generated when attempting to
       disable the last enabled vector length (see constraint (1) of "SVE
@@ -XXX,XX +XXX,XX @@ SVE CPU Property Parsing Semantics
       has been explicitly disabled, then an error is generated (see
       constraint (2) of "SVE CPU Property Dependencies and Constraints").
 -  5) If one or more `sve<N>` CPU properties are set `off`, but no `sve<N>`,
 +  5) When KVM is enabled, if the host does not support SVE, then an error
 +     is generated when attempting to enable any `sve*` properties (see
 +     constraint (3) of "SVE CPU Property Dependencies and Constraints").
 +
 +  6) When KVM is enabled, if the host does support SVE, then an error is
 +     generated when attempting to enable any vector lengths not supported
 +     by the host (see constraint (3) of "SVE CPU Property Dependencies and
 +     Constraints").
 +
 +  7) If one or more `sve<N>` CPU properties are set `off`, but no `sve<N>`,
       CPU properties are set `on`, then the specified vector lengths are
       disabled but the default for any unspecified lengths remains enabled.
 -     Disabling a power-of-two vector length also disables all vector
 -     lengths larger than the power-of-two length (see constraint (2) of
 -     "SVE CPU Property Dependencies and Constraints").
 +     When KVM is not enabled, disabling a power-of-two vector length also
 +     disables all vector lengths larger than the power-of-two length.
 +     When KVM is enabled, then disabling any supported vector length also
 +     disables all larger vector lengths (see constraint (2) of "SVE CPU
 +     Property Dependencies and Constraints").
 -  6) If one or more `sve<N>` CPU properties are set to `on`, then they
 +  8) If one or more `sve<N>` CPU properties are set to `on`, then they
       are enabled and all unspecified lengths default to disabled, except
       for the required lengths per constraint (2) of "SVE CPU Property
       Dependencies and Constraints", which will even be auto-enabled if
       they were not explicitly enabled.
 -  7) If SVE was disabled (`sve=off`), allowing all vector lengths to be
 +  9) If SVE was disabled (`sve=off`), allowing all vector lengths to be
       explicitly disabled (i.e. avoiding the error specified in (3) of
       "SVE CPU Property Parsing Semantics"), then if later an `sve=on` is
       provided an error will be generated.  To avoid this error, one must
 --
 .20.1

-[Qemu-devel] [PULL 01/21] Revert "target/arm: Use unallocated_encoding for aarch32"
+[PULL 09/11] target/arm/kvm: host cpu: Add support for sve<N> properties
-From: Richard Henderson <richard.henderson@linaro.org>
+From: Andrew Jones <drjones@redhat.com>
-This reverts commit 3cb36637157088892e9e33ddb1034bffd1251d3b.
+Allow cpu 'host' to enable SVE when it's available, unless the
+user chooses to disable it with the added 'sve=off' cpu property.
-Despite the fact that the text for the call to gen_exception_insn
+Also give the user the ability to select vector lengths with the
-is identical for aarch64 and aarch32, the implementation inside
+sve<N> properties. We don't adopt 'max' cpu's other sve property,
-gen_exception_insn is totally different.
+sve-max-vq, because that property is difficult to use with KVM.
+That property assumes all vector lengths in the range from 1 up
-This fixes exceptions raised from aarch64.
+to and including the specified maximum length are supported, but
+there may be optional lengths not supported by the host in that
-Reported-by: Laurent Desnogues <laurent.desnogues@gmail.com>
+range. With KVM one must be more specific when enabling vector
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
+lengths.
-Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
-Message-id: 20190826151536.6771-2-richard.henderson@linaro.org
+Signed-off-by: Andrew Jones <drjones@redhat.com>
 Reviewed-by: Eric Auger <eric.auger@redhat.com>
 Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
 Tested-by: Masayoshi Mizuma <m.mizuma@jp.fujitsu.com>
 Message-id: 20191031142734.8590-10-drjones@redhat.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- target/arm/translate-a64.h     |  2 ++
+ target/arm/cpu.h          |  2 ++
- target/arm/translate.h         |  2 --
+ target/arm/cpu.c          |  3 +++
- target/arm/translate-a64.c     |  7 +++++++
+ target/arm/cpu64.c        | 33 +++++++++++++++++----------------
- target/arm/translate-vfp.inc.c |  3 ++-
+ target/arm/kvm64.c        | 14 +++++++++++++-
- target/arm/translate.c         | 22 ++++++++++------------
+ tests/arm-cpu-features.c  | 17 ++++++++---------
-files changed, 21 insertions(+), 15 deletions(-)
+ docs/arm-cpu-features.rst | 19 ++++++++++++-------
+files changed, 55 insertions(+), 33 deletions(-)
-diff --git a/target/arm/translate-a64.h b/target/arm/translate-a64.h
-index XXXXXXX..XXXXXXX 100644
+diff --git a/target/arm/cpu.h b/target/arm/cpu.h
---- a/target/arm/translate-a64.h
+index XXXXXXX..XXXXXXX 100644
-+++ b/target/arm/translate-a64.h
+--- a/target/arm/cpu.h
-@@ -XXX,XX +XXX,XX @@
++++ b/target/arm/cpu.h
- #ifndef TARGET_ARM_TRANSLATE_A64_H
+@@ -XXX,XX +XXX,XX @@ int aarch64_cpu_gdb_write_register(CPUState *cpu, uint8_t *buf, int reg);
- #define TARGET_ARM_TRANSLATE_A64_H
+ void aarch64_sve_narrow_vq(CPUARMState *env, unsigned vq);
+ void aarch64_sve_change_el(CPUARMState *env, int old_el,
-+void unallocated_encoding(DisasContext *s);
+                            int new_el, bool el0_a64);
-+
++void aarch64_add_sve_properties(Object *obj);
- #define unsupported_encoding(s, insn)                                    \
+ #else
-     do {                                                                 \
+ static inline void aarch64_sve_narrow_vq(CPUARMState *env, unsigned vq) { }
-         qemu_log_mask(LOG_UNIMP,                                         \
+ static inline void aarch64_sve_change_el(CPUARMState *env, int o,
-diff --git a/target/arm/translate.h b/target/arm/translate.h
+                                          int n, bool a)
-index XXXXXXX..XXXXXXX 100644
+ { }
---- a/target/arm/translate.h
++static inline void aarch64_add_sve_properties(Object *obj) { }
-+++ b/target/arm/translate.h
+ #endif
-@@ -XXX,XX +XXX,XX @@ typedef struct DisasCompare {
-     bool value_global;
+ #if !defined(CONFIG_TCG)
- } DisasCompare;
+diff --git a/target/arm/cpu.c b/target/arm/cpu.c
+index XXXXXXX..XXXXXXX 100644
--void unallocated_encoding(DisasContext *s);
+--- a/target/arm/cpu.c
 +++ b/target/arm/cpu.c
@@ -XXX,XX +XXX,XX @@ static void arm_host_initfn(Object *obj)
      ARMCPU *cpu = ARM_CPU(obj);
      kvm_arm_set_cpu_features_from_host(cpu);
 +    if (arm_feature(&cpu->env, ARM_FEATURE_AARCH64)) {
 +        aarch64_add_sve_properties(obj);
 +    }
      arm_cpu_post_init(obj);
  }
 diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/cpu64.c
 +++ b/target/arm/cpu64.c
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, Visitor *v, const char *name,
      cpu->isar.id_aa64pfr0 = t;
  }
 +void aarch64_add_sve_properties(Object *obj)
 +{
 +    uint32_t vq;
 +
 +    object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
 +                        cpu_arm_set_sve, NULL, NULL, &error_fatal);
 +
 +    for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
 +        char name[8];
 +        sprintf(name, "sve%d", vq * 128);
 +        object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
 +                            cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
 +    }
 +}
 +
  /* -cpu max: if KVM is enabled, like -cpu host (best possible with this host);
   * otherwise, a CPU with as many features enabled as our emulation supports.
   * The version of '-cpu max' for qemu-system-arm is defined in cpu.c;
@@ -XXX,XX +XXX,XX @@ static void cpu_arm_set_sve(Object *obj, Visitor *v, const char *name,
  static void aarch64_max_initfn(Object *obj)
  {
      ARMCPU *cpu = ARM_CPU(obj);
 -    uint32_t vq;
 -    uint64_t t;
      if (kvm_enabled()) {
          kvm_arm_set_cpu_features_from_host(cpu);
 -        if (kvm_arm_sve_supported(CPU(cpu))) {
 -            t = cpu->isar.id_aa64pfr0;
 -            t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
 -            cpu->isar.id_aa64pfr0 = t;
 -        }
      } else {
 +        uint64_t t;
          uint32_t u;
          aarch64_a57_initfn(obj);
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
  #endif
      }
 -    object_property_add(obj, "sve", "bool", cpu_arm_get_sve,
 -                        cpu_arm_set_sve, NULL, NULL, &error_fatal);
 +    aarch64_add_sve_properties(obj);
      object_property_add(obj, "sve-max-vq", "uint32", cpu_max_get_sve_max_vq,
                          cpu_max_set_sve_max_vq, NULL, NULL, &error_fatal);
 -
- /* Share the TCG temporaries common between 32 and 64 bit modes.  */
+-    for (vq = 1; vq <= ARM_MAX_VQ; ++vq) {
- extern TCGv_i32 cpu_NF, cpu_ZF, cpu_CF, cpu_VF;
+-        char name[8];
- extern TCGv_i64 cpu_exclusive_addr;
+-        sprintf(name, "sve%d", vq * 128);
-diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
+-        object_property_add(obj, name, "bool", cpu_arm_get_sve_vq,
-index XXXXXXX..XXXXXXX 100644
+-                            cpu_arm_set_sve_vq, NULL, NULL, &error_fatal);
---- a/target/arm/translate-a64.c
+-    }
 +++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static inline void gen_goto_tb(DisasContext *s, int n, uint64_t dest)
      }
  }
-+void unallocated_encoding(DisasContext *s)
+ struct ARMCPUInfo {
-+{
+diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
-+    /* Unallocated and reserved encodings are uncategorized */
+index XXXXXXX..XXXXXXX 100644
-+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+--- a/target/arm/kvm64.c
-+                       default_exception_el(s));
++++ b/target/arm/kvm64.c
-+}
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf)
-+
+      * and then query that CPU for the relevant ID registers.
- static void init_tmp_a64_array(DisasContext *s)
+      */
- {
+     int fdarray[3];
- #ifdef CONFIG_DEBUG_TCG
++    bool sve_supported;
-diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
+     uint64_t features = 0;
-index XXXXXXX..XXXXXXX 100644
++    uint64_t t;
---- a/target/arm/translate-vfp.inc.c
+     int err;
-+++ b/target/arm/translate-vfp.inc.c
-@@ -XXX,XX +XXX,XX @@ static bool full_vfp_access_check(DisasContext *s, bool ignore_vfp_enabled)
+     /* Old kernels may not know about the PREFERRED_TARGET ioctl: however
+@@ -XXX,XX +XXX,XX @@ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf)
-     if (!s->vfp_enabled && !ignore_vfp_enabled) {
+                               ARM64_SYS_REG(3, 0, 0, 3, 2));
-         assert(!arm_dc_feature(s, ARM_FEATURE_M));
+     }
--        unallocated_encoding(s);
-+        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++    sve_supported = ioctl(fdarray[0], KVM_CHECK_EXTENSION, KVM_CAP_ARM_SVE) > 0;
-+                           default_exception_el(s));
++
      kvm_arm_destroy_scratch_host_vcpu(fdarray);
      if (err < 0) {
          return false;
      }
-diff --git a/target/arm/translate.c b/target/arm/translate.c
+-   /* We can assume any KVM supporting CPU is at least a v8
-index XXXXXXX..XXXXXXX 100644
++    /* Add feature bits that can't appear until after VCPU init. */
---- a/target/arm/translate.c
++    if (sve_supported) {
-+++ b/target/arm/translate.c
++        t = ahcf->isar.id_aa64pfr0;
-@@ -XXX,XX +XXX,XX @@ static void gen_exception_bkpt_insn(DisasContext *s, uint32_t syn)
++        t = FIELD_DP64(t, ID_AA64PFR0, SVE, 1);
-     s->base.is_jmp = DISAS_NORETURN;
++        ahcf->isar.id_aa64pfr0 = t;
- }
++    }
++
--void unallocated_encoding(DisasContext *s)
++    /*
--{
++     * We can assume any KVM supporting CPU is at least a v8
--    /* Unallocated and reserved encodings are uncategorized */
+      * with VFPv4+Neon; this in turn implies most of the other
--    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+      * feature bits.
--                       default_exception_el(s));
+      */
--}
+diff --git a/tests/arm-cpu-features.c b/tests/arm-cpu-features.c
 index XXXXXXX..XXXXXXX 100644
 --- a/tests/arm-cpu-features.c
 +++ b/tests/arm-cpu-features.c
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
              "We cannot guarantee the CPU type 'cortex-a15' works "
              "with KVM on this host", NULL);
 -        assert_has_feature(qts, "max", "sve");
 -        resp = do_query_no_props(qts, "max");
 +        assert_has_feature(qts, "host", "sve");
 +        resp = do_query_no_props(qts, "host");
          kvm_supports_sve = resp_get_feature(resp, "sve");
          vls = resp_get_sve_vls(resp);
          qobject_unref(resp);
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
              sprintf(max_name, "sve%d", max_vq * 128);
              /* Enabling a supported length is of course fine. */
 -            assert_sve_vls(qts, "max", vls, "{ %s: true }", max_name);
 +            assert_sve_vls(qts, "host", vls, "{ %s: true }", max_name);
              /* Get the next supported length smaller than max-vq. */
              vq = 64 - __builtin_clzll(vls & ~BIT_ULL(max_vq - 1));
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
                   * We have at least one length smaller than max-vq,
                   * so we can disable max-vq.
                   */
 -                assert_sve_vls(qts, "max", (vls & ~BIT_ULL(max_vq - 1)),
 +                assert_sve_vls(qts, "host", (vls & ~BIT_ULL(max_vq - 1)),
                                 "{ %s: false }", max_name);
                  /*
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
                   */
                  sprintf(name, "sve%d", vq * 128);
                  error = g_strdup_printf("cannot disable %s", name);
 -                assert_error(qts, "max", error,
 +                assert_error(qts, "host", error,
                               "{ %s: true, %s: false }",
                               max_name, name);
                  g_free(error);
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
              vq = __builtin_ffsll(vls);
              sprintf(name, "sve%d", vq * 128);
              error = g_strdup_printf("cannot disable %s", name);
 -            assert_error(qts, "max", error, "{ %s: false }", name);
 +            assert_error(qts, "host", error, "{ %s: false }", name);
              g_free(error);
              /* Get an unsupported length. */
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
              if (vq <= SVE_MAX_VQ) {
                  sprintf(name, "sve%d", vq * 128);
                  error = g_strdup_printf("cannot enable %s", name);
 -                assert_error(qts, "max", error, "{ %s: true }", name);
 +                assert_error(qts, "host", error, "{ %s: true }", name);
                  g_free(error);
              }
          } else {
@@ -XXX,XX +XXX,XX @@ static void test_query_cpu_model_expansion_kvm(const void *data)
      } else {
          assert_has_not_feature(qts, "host", "aarch64");
          assert_has_not_feature(qts, "host", "pmu");
 -
- /* Force a TB lookup after an instruction that changes the CPU state.  */
+-        assert_has_not_feature(qts, "max", "sve");
- static inline void gen_lookup_tb(DisasContext *s)
++        assert_has_not_feature(qts, "host", "sve");
- {
+     }
-@@ -XXX,XX +XXX,XX @@ static inline void gen_hlt(DisasContext *s, int imm)
-         return;
+     qtest_quit(qts);
-     }
+diff --git a/docs/arm-cpu-features.rst b/docs/arm-cpu-features.rst
+index XXXXXXX..XXXXXXX 100644
--    unallocated_encoding(s);
+--- a/docs/arm-cpu-features.rst
-+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++++ b/docs/arm-cpu-features.rst
-+                       default_exception_el(s));
+@@ -XXX,XX +XXX,XX @@ SVE CPU Property Examples
- }
+      $ qemu-system-aarch64 -M virt -cpu max
- static inline void gen_add_data_offset(DisasContext *s, unsigned int insn,
-@@ -XXX,XX +XXX,XX @@ static void gen_srs(DisasContext *s,
+-  3) Only enable the 128-bit vector length::
-     }
++  3) When KVM is enabled, implicitly enable all host CPU supported vector
++     lengths with the `host` CPU type::
-     if (undef) {
++
--        unallocated_encoding(s);
++     $ qemu-system-aarch64 -M virt,accel=kvm -cpu host
-+        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++
-+                           default_exception_el(s));
++  4) Only enable the 128-bit vector length::
-         return;
-     }
+      $ qemu-system-aarch64 -M virt -cpu max,sve128=on
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
+-  4) Disable the 512-bit vector length and all larger vector lengths,
-             break;
++  5) Disable the 512-bit vector length and all larger vector lengths,
-         default:
+      since 512 is a power-of-two.  This results in all the smaller,
-         illegal_op:
+      uninitialized lengths (128, 256, and 384) defaulting to enabled::
--            unallocated_encoding(s);
-+            gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+      $ qemu-system-aarch64 -M virt -cpu max,sve512=off
-+                               default_exception_el(s));
-             break;
+-  5) Enable the 128-bit, 256-bit, and 512-bit vector lengths::
-         }
++  6) Enable the 128-bit, 256-bit, and 512-bit vector lengths::
-     }
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
+      $ qemu-system-aarch64 -M virt -cpu max,sve128=on,sve256=on,sve512=on
-     }
-     return;
+-  6) The same as (5), but since the 128-bit and 256-bit vector
- illegal_op:
++  7) The same as (6), but since the 128-bit and 256-bit vector
--    unallocated_encoding(s);
+      lengths are required for the 512-bit vector length to be enabled,
-+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+      then allow them to be auto-enabled::
-+                       default_exception_el(s));
- }
+      $ qemu-system-aarch64 -M virt -cpu max,sve512=on
- static void disas_thumb_insn(DisasContext *s, uint32_t insn)
+-  7) Do the same as (6), but by first disabling SVE and then re-enabling it::
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb_insn(DisasContext *s, uint32_t insn)
++  8) Do the same as (7), but by first disabling SVE and then re-enabling it::
-     return;
- illegal_op:
+      $ qemu-system-aarch64 -M virt -cpu max,sve=off,sve512=on,sve=on
- undef:
--    unallocated_encoding(s);
+-  8) Force errors regarding the last vector length::
-+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
++  9) Force errors regarding the last vector length::
-+                       default_exception_el(s));
- }
+      $ qemu-system-aarch64 -M virt -cpu max,sve128=off
+      $ qemu-system-aarch64 -M virt -cpu max,sve=off,sve128=off,sve=on
- static bool insn_crosses_page(CPUARMState *env, DisasContext *s)
+@@ -XXX,XX +XXX,XX @@ The examples in "SVE CPU Property Examples" exhibit many ways to select
  vector lengths which developers may find useful in order to avoid overly
  verbose command lines.  However, the recommended way to select vector
  lengths is to explicitly enable each desired length.  Therefore only
 -example's (1), (3), and (5) exhibit recommended uses of the properties.
 +example's (1), (4), and (6) exhibit recommended uses of the properties.
 --
 .20.1

-[Qemu-devel] [PULL 04/21] target/arm: Take exceptions on ATS instructions when needed
+Deleted patch
-The translation table walk for an ATS instruction can result in
-various faults.  In general these are just reported back via the
-PAR_EL1 fault status fields, but in some cases the architecture
-requires that the fault is turned into an exception:
- * synchronous stage 2 faults of any kind during AT S1E0* and
-   AT S1E1* instructions executed from NS EL1 fault to EL2 or EL3
- * synchronous external aborts are taken as Data Abort exceptions
-(This is documented in the v8A Arm ARM DDI0487A.e D5.2.11 and
-G5.13.4.)
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Tested-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
-Message-id: 20190816125802.25877-3-peter.maydell@linaro.org
----
- target/arm/helper.c | 107 +++++++++++++++++++++++++++++++++++++-------
-file changed, 92 insertions(+), 15 deletions(-)
-diff --git a/target/arm/helper.c b/target/arm/helper.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/helper.c
-+++ b/target/arm/helper.c
-@@ -XXX,XX +XXX,XX @@ static uint64_t do_ats_write(CPUARMState *env, uint64_t value,
-     ret = get_phys_addr(env, value, access_type, mmu_idx, &phys_addr, &attrs,
-                         &prot, &page_size, &fi, &cacheattrs);
-+    if (ret) {
-+        /*
-+         * Some kinds of translation fault must cause exceptions rather
-+         * than being reported in the PAR.
-+         */
-+        int current_el = arm_current_el(env);
-+        int target_el;
-+        uint32_t syn, fsr, fsc;
-+        bool take_exc = false;
-+
-+        if (fi.s1ptw && current_el == 1 && !arm_is_secure(env)
-+            && (mmu_idx == ARMMMUIdx_S1NSE1 || mmu_idx == ARMMMUIdx_S1NSE0)) {
-+            /*
-+             * Synchronous stage 2 fault on an access made as part of the
-+             * translation table walk for AT S1E0* or AT S1E1* insn
-+             * executed from NS EL1. If this is a synchronous external abort
-+             * and SCR_EL3.EA == 1, then we take a synchronous external abort
-+             * to EL3. Otherwise the fault is taken as an exception to EL2,
-+             * and HPFAR_EL2 holds the faulting IPA.
-+             */
-+            if (fi.type == ARMFault_SyncExternalOnWalk &&
-+                (env->cp15.scr_el3 & SCR_EA)) {
-+                target_el = 3;
-+            } else {
-+                env->cp15.hpfar_el2 = extract64(fi.s2addr, 12, 47) << 4;
-+                target_el = 2;
-+            }
-+            take_exc = true;
-+        } else if (fi.type == ARMFault_SyncExternalOnWalk) {
-+            /*
-+             * Synchronous external aborts during a translation table walk
-+             * are taken as Data Abort exceptions.
-+             */
-+            if (fi.stage2) {
-+                if (current_el == 3) {
-+                    target_el = 3;
-+                } else {
-+                    target_el = 2;
-+                }
-+            } else {
-+                target_el = exception_target_el(env);
-+            }
-+            take_exc = true;
-+        }
-+
-+        if (take_exc) {
-+            /* Construct FSR and FSC using same logic as arm_deliver_fault() */
-+            if (target_el == 2 || arm_el_is_aa64(env, target_el) ||
-+                arm_s1_regime_using_lpae_format(env, mmu_idx)) {
-+                fsr = arm_fi_to_lfsc(&fi);
-+                fsc = extract32(fsr, 0, 6);
-+            } else {
-+                fsr = arm_fi_to_sfsc(&fi);
-+                fsc = 0x3f;
-+            }
-+            /*
-+             * Report exception with ESR indicating a fault due to a
-+             * translation table walk for a cache maintenance instruction.
-+             */
-+            syn = syn_data_abort_no_iss(current_el == target_el,
-+                                        fi.ea, 1, fi.s1ptw, 1, fsc);
-+            env->exception.vaddress = value;
-+            env->exception.fsr = fsr;
-+            raise_exception(env, EXCP_DATA_ABORT, syn, target_el);
-+        }
-+    }
-+
-     if (is_a64(env)) {
-         format64 = true;
-     } else if (arm_feature(env, ARM_FEATURE_LPAE)) {
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo vapa_cp_reginfo[] = {
-     /* This underdecoding is safe because the reginfo is NO_RAW. */
-     { .name = "ATS", .cp = 15, .crn = 7, .crm = 8, .opc1 = 0, .opc2 = CP_ANY,
-       .access = PL1_W, .accessfn = ats_access,
--      .writefn = ats_write, .type = ARM_CP_NO_RAW },
-+      .writefn = ats_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
- #endif
-     REGINFO_SENTINEL
- };
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
-     /* 64 bit address translation operations */
-     { .name = "AT_S1E1R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 0,
--      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S1E1W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 1,
--      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S1E0R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 2,
--      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S1E0W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 3,
--      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S12E1R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 4,
--      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S12E1W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 5,
--      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S12E0R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 6,
--      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S12E0W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 7,
--      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     /* AT S1E2* are elsewhere as they UNDEF from EL3 if EL2 is not present */
-     { .name = "AT_S1E3R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 6, .crn = 7, .crm = 8, .opc2 = 0,
--      .access = PL3_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL3_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "AT_S1E3W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 6, .crn = 7, .crm = 8, .opc2 = 1,
--      .access = PL3_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .access = PL3_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
-+      .writefn = ats_write64 },
-     { .name = "PAR_EL1", .state = ARM_CP_STATE_AA64,
-       .type = ARM_CP_ALIAS,
-       .opc0 = 3, .opc1 = 0, .crn = 7, .crm = 4, .opc2 = 0,
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
-     { .name = "AT_S1E2R", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 0,
-       .access = PL2_W, .accessfn = at_s1e2_access,
--      .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
-     { .name = "AT_S1E2W", .state = ARM_CP_STATE_AA64,
-       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 1,
-       .access = PL2_W, .accessfn = at_s1e2_access,
--      .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
-+      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
-     /* The AArch32 ATS1H* operations are CONSTRAINED UNPREDICTABLE
-      * if EL2 is not implemented; we choose to UNDEF. Behaviour at EL3
-      * with SCR.NS == 0 outside Monitor mode is UNPREDICTABLE; we choose
-@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
-      */
-     { .name = "ATS1HR", .cp = 15, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 0,
-       .access = PL2_W,
--      .writefn = ats1h_write, .type = ARM_CP_NO_RAW },
-+      .writefn = ats1h_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
-     { .name = "ATS1HW", .cp = 15, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 1,
-       .access = PL2_W,
--      .writefn = ats1h_write, .type = ARM_CP_NO_RAW },
-+      .writefn = ats1h_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
-     { .name = "CNTHCTL_EL2", .state = ARM_CP_STATE_BOTH,
-       .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 1, .opc2 = 0,
-       /* ARMv7 requires bit 0 and 1 to reset to 1. ARMv8 defines the
---
-.20.1

-[Qemu-devel] [PULL 06/21] memory: Remove unused memory_region_iommu_replay_all()
+[PULL 10/11] hw/arm/boot: Rebuild hflags when modifying CPUState at boot
-From: Eric Auger <eric.auger@redhat.com>
+From: "Edgar E. Iglesias" <edgar.iglesias@xilinx.com>
-memory_region_iommu_replay_all is not used. Remove it.
+Rebuild hflags when modifying CPUState at boot.
-Signed-off-by: Eric Auger <eric.auger@redhat.com>
+Fixes: e979972a6a
-Reported-by: Peter Maydell <peter.maydell@linaro.org>
+Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
 Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Peter Xu <peterx@redhat.com>
+Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
-Message-id: 20190822172350.12008-2-eric.auger@redhat.com
+Reviewed-by: Luc Michel <luc.michel@greensocs.com>
 Message-id: 20191031040830.18800-2-edgar.iglesias@xilinx.com
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
 ---
- include/exec/memory.h | 10 ----------
+ hw/arm/boot.c | 1 +
- memory.c              |  9 ---------
+file changed, 1 insertion(+)
 files changed, 19 deletions(-)
-diff --git a/include/exec/memory.h b/include/exec/memory.h
+diff --git a/hw/arm/boot.c b/hw/arm/boot.c
 index XXXXXXX..XXXXXXX 100644
---- a/include/exec/memory.h
+--- a/hw/arm/boot.c
-+++ b/include/exec/memory.h
++++ b/hw/arm/boot.c
-@@ -XXX,XX +XXX,XX @@ void memory_region_register_iommu_notifier(MemoryRegion *mr,
+@@ -XXX,XX +XXX,XX @@ static void do_cpu_reset(void *opaque)
-  */
+                 info->secondary_cpu_reset_hook(cpu, info);
- void memory_region_iommu_replay(IOMMUMemoryRegion *iommu_mr, IOMMUNotifier *n);
+             }
+         }
--/**
++        arm_rebuild_hflags(env);
 - * memory_region_iommu_replay_all: replay existing IOMMU translations
 - * to all the notifiers registered.
 - *
 - * Note: this is not related to record-and-replay functionality.
 - *
 - * @iommu_mr: the memory region to observe
 - */
 -void memory_region_iommu_replay_all(IOMMUMemoryRegion *iommu_mr);
 -
  /**
   * memory_region_unregister_iommu_notifier: unregister a notifier for
   * changes to IOMMU translation entries.
 diff --git a/memory.c b/memory.c
 index XXXXXXX..XXXXXXX 100644
 --- a/memory.c
 +++ b/memory.c
@@ -XXX,XX +XXX,XX @@ void memory_region_iommu_replay(IOMMUMemoryRegion *iommu_mr, IOMMUNotifier *n)
      }
  }
--void memory_region_iommu_replay_all(IOMMUMemoryRegion *iommu_mr)
--{
--    IOMMUNotifier *notifier;
--
--    IOMMU_NOTIFIER_FOREACH(notifier, iommu_mr) {
--        memory_region_iommu_replay(iommu_mr, notifier);
--    }
--}
--
- void memory_region_unregister_iommu_notifier(MemoryRegion *mr,
-                                              IOMMUNotifier *n)
- {
 --
 .20.1

-[Qemu-devel] [PULL 08/21] hw/arm/smmuv3: Remove spurious error messages on IOVA invalidations
+Deleted patch
-From: Eric Auger <eric.auger@redhat.com>
-An IOVA/ASID invalidation is notified to all IOMMU Memory Regions
-through smmuv3_inv_notifiers_iova/smmuv3_notify_iova.
-When the notification occurs it is possible that some of the
-PCIe devices associated to the notified regions do not have a
-valid stream table entry. In that case we output a LOG_GUEST_ERROR
-message, for example:
-invalid sid=<SID> (L1STD span=0)
-"smmuv3_notify_iova error decoding the configuration for iommu mr=<MR>
-This is unfortunate as the user gets the impression that there
-are some translation decoding errors whereas there are not.
-This patch adds a new field in SMMUEventInfo that tells whether
-the detection of an invalid STE must lead to an error report.
-invalid_ste_allowed is set before doing the invalidations and
-kept unset on actual translation.
-The other configuration decoding error messages are kept since if the
-STE is valid then the rest of the config must be correct.
-Signed-off-by: Eric Auger <eric.auger@redhat.com>
-Message-id: 20190822172350.12008-6-eric.auger@redhat.com
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/smmuv3-internal.h |  1 +
- hw/arm/smmuv3.c          | 19 +++++++++++--------
-files changed, 12 insertions(+), 8 deletions(-)
-diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3-internal.h
-+++ b/hw/arm/smmuv3-internal.h
-@@ -XXX,XX +XXX,XX @@ typedef struct SMMUEventInfo {
-     uint32_t sid;
-     bool recorded;
-     bool record_trans_faults;
-+    bool inval_ste_allowed;
-     union {
-         struct {
-             uint32_t ssid;
-diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/smmuv3.c
-+++ b/hw/arm/smmuv3.c
-@@ -XXX,XX +XXX,XX @@ static int decode_ste(SMMUv3State *s, SMMUTransCfg *cfg,
-     uint32_t config;
-     if (!STE_VALID(ste)) {
--        qemu_log_mask(LOG_GUEST_ERROR, "invalid STE\n");
-+        if (!event->inval_ste_allowed) {
-+            qemu_log_mask(LOG_GUEST_ERROR, "invalid STE\n");
-+        }
-         goto bad_ste;
-     }
-@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
-         if (!span) {
-             /* l2ptr is not valid */
--            qemu_log_mask(LOG_GUEST_ERROR,
--                          "invalid sid=%d (L1STD span=0)\n", sid);
-+            if (!event->inval_ste_allowed) {
-+                qemu_log_mask(LOG_GUEST_ERROR,
-+                              "invalid sid=%d (L1STD span=0)\n", sid);
-+            }
-             event->type = SMMU_EVT_C_BAD_STREAMID;
-             return -EINVAL;
-         }
-@@ -XXX,XX +XXX,XX @@ static IOMMUTLBEntry smmuv3_translate(IOMMUMemoryRegion *mr, hwaddr addr,
-     SMMUDevice *sdev = container_of(mr, SMMUDevice, iommu);
-     SMMUv3State *s = sdev->smmu;
-     uint32_t sid = smmu_get_sid(sdev);
--    SMMUEventInfo event = {.type = SMMU_EVT_NONE, .sid = sid};
-+    SMMUEventInfo event = {.type = SMMU_EVT_NONE,
-+                           .sid = sid,
-+                           .inval_ste_allowed = false};
-     SMMUPTWEventInfo ptw_info = {};
-     SMMUTranslationStatus status;
-     SMMUState *bs = ARM_SMMU(s);
-@@ -XXX,XX +XXX,XX @@ static void smmuv3_notify_iova(IOMMUMemoryRegion *mr,
-                                dma_addr_t iova)
- {
-     SMMUDevice *sdev = container_of(mr, SMMUDevice, iommu);
--    SMMUEventInfo event = {};
-+    SMMUEventInfo event = {.inval_ste_allowed = true};
-     SMMUTransTableInfo *tt;
-     SMMUTransCfg *cfg;
-     IOMMUTLBEntry entry;
-     cfg = smmuv3_get_config(sdev, &event);
-     if (!cfg) {
--        qemu_log_mask(LOG_GUEST_ERROR,
--                      "%s error decoding the configuration for iommu mr=%s\n",
--                      __func__, mr->parent_obj.name);
-         return;
-     }
---
-.20.1

-[Qemu-devel] [PULL 09/21] target/arm: Fix SMMLS argument order
+Deleted patch
-From: Richard Henderson <richard.henderson@linaro.org>
-The previous simplification got the order of operands to the
-subtraction wrong.  Since the 64-bit product is the subtrahend,
-we must use a 64-bit subtract to properly compute the borrow
-from the low-part of the product.
-Fixes: 5f8cd06ebcf5 ("target/arm: Simplify SMMLA, SMMLAR, SMMLS, SMMLSR")
-Reported-by: Laurent Desnogues <laurent.desnogues@gmail.com>
-Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
-Tested-by: Laurent Desnogues <laurent.desnogues@gmail.com>
-Message-id: 20190829013258.16102-1-richard.henderson@linaro.org
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- target/arm/translate.c | 20 ++++++++++++++++++--
-file changed, 18 insertions(+), 2 deletions(-)
-diff --git a/target/arm/translate.c b/target/arm/translate.c
-index XXXXXXX..XXXXXXX 100644
---- a/target/arm/translate.c
-+++ b/target/arm/translate.c
-@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
-                         if (rd != 15) {
-                             tmp3 = load_reg(s, rd);
-                             if (insn & (1 << 6)) {
--                                tcg_gen_sub_i32(tmp, tmp, tmp3);
-+                                /*
-+                                 * For SMMLS, we need a 64-bit subtract.
-+                                 * Borrow caused by a non-zero multiplicand
-+                                 * lowpart, and the correct result lowpart
-+                                 * for rounding.
-+                                 */
-+                                TCGv_i32 zero = tcg_const_i32(0);
-+                                tcg_gen_sub2_i32(tmp2, tmp, zero, tmp3,
-+                                                 tmp2, tmp);
-+                                tcg_temp_free_i32(zero);
-                             } else {
-                                 tcg_gen_add_i32(tmp, tmp, tmp3);
-                             }
-@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
-                     if (insn & (1 << 20)) {
-                         tcg_gen_add_i32(tmp, tmp, tmp3);
-                     } else {
--                        tcg_gen_sub_i32(tmp, tmp, tmp3);
-+                        /*
-+                         * For SMMLS, we need a 64-bit subtract.
-+                         * Borrow caused by a non-zero multiplicand lowpart,
-+                         * and the correct result lowpart for rounding.
-+                         */
-+                        TCGv_i32 zero = tcg_const_i32(0);
-+                        tcg_gen_sub2_i32(tmp2, tmp, zero, tmp3, tmp2, tmp);
-+                        tcg_temp_free_i32(zero);
-                     }
-                     tcg_temp_free_i32(tmp3);
-                 }
---
-.20.1

-[Qemu-devel] [PULL 10/21] hw/arm: Use ARM_CPU_TYPE_NAME() macro when appropriate
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-Commit ba1ba5cca introduce the ARM_CPU_TYPE_NAME() macro.
-Unify the code base by use it in all places.
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-2-philmd@redhat.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/allwinner-a10.c | 3 ++-
- hw/arm/cubieboard.c    | 3 ++-
- hw/arm/digic.c         | 3 ++-
- hw/arm/fsl-imx25.c     | 2 +-
- hw/arm/fsl-imx31.c     | 2 +-
- hw/arm/fsl-imx6.c      | 3 ++-
- hw/arm/fsl-imx6ul.c    | 2 +-
- hw/arm/xlnx-zynqmp.c   | 8 ++++----
-files changed, 15 insertions(+), 11 deletions(-)
-diff --git a/hw/arm/allwinner-a10.c b/hw/arm/allwinner-a10.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/allwinner-a10.c
-+++ b/hw/arm/allwinner-a10.c
-@@ -XXX,XX +XXX,XX @@ static void aw_a10_init(Object *obj)
-     AwA10State *s = AW_A10(obj);
-     object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
--                            "cortex-a8-" TYPE_ARM_CPU, &error_abort, NULL);
-+                            ARM_CPU_TYPE_NAME("cortex-a8"),
-+                            &error_abort, NULL);
-     sysbus_init_child_obj(obj, "intc", &s->intc, sizeof(s->intc),
-                           TYPE_AW_A10_PIC);
-diff --git a/hw/arm/cubieboard.c b/hw/arm/cubieboard.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/cubieboard.c
-+++ b/hw/arm/cubieboard.c
-@@ -XXX,XX +XXX,XX @@ static void cubieboard_init(MachineState *machine)
- static void cubieboard_machine_init(MachineClass *mc)
- {
--    mc->desc = "cubietech cubieboard";
-+    mc->desc = "cubietech cubieboard (Cortex-A9)";
-+    mc->default_cpu_type = ARM_CPU_TYPE_NAME("cortex-a9");
-     mc->init = cubieboard_init;
-     mc->block_default_type = IF_IDE;
-     mc->units_per_default_bus = 1;
-diff --git a/hw/arm/digic.c b/hw/arm/digic.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/digic.c
-+++ b/hw/arm/digic.c
-@@ -XXX,XX +XXX,XX @@ static void digic_init(Object *obj)
-     int i;
-     object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
--                            "arm946-" TYPE_ARM_CPU, &error_abort, NULL);
-+                            ARM_CPU_TYPE_NAME("arm946"),
-+                            &error_abort, NULL);
-     for (i = 0; i < DIGIC4_NB_TIMERS; i++) {
- #define DIGIC_TIMER_NAME_MLEN    11
-diff --git a/hw/arm/fsl-imx25.c b/hw/arm/fsl-imx25.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx25.c
-+++ b/hw/arm/fsl-imx25.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx25_init(Object *obj)
-     FslIMX25State *s = FSL_IMX25(obj);
-     int i;
--    object_initialize(&s->cpu, sizeof(s->cpu), "arm926-" TYPE_ARM_CPU);
-+    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm926"));
-     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
-                           TYPE_IMX_AVIC);
-diff --git a/hw/arm/fsl-imx31.c b/hw/arm/fsl-imx31.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx31.c
-+++ b/hw/arm/fsl-imx31.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx31_init(Object *obj)
-     FslIMX31State *s = FSL_IMX31(obj);
-     int i;
--    object_initialize(&s->cpu, sizeof(s->cpu), "arm1136-" TYPE_ARM_CPU);
-+    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm1136"));
-     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
-                           TYPE_IMX_AVIC);
-diff --git a/hw/arm/fsl-imx6.c b/hw/arm/fsl-imx6.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx6.c
-+++ b/hw/arm/fsl-imx6.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx6_init(Object *obj)
-     for (i = 0; i < MIN(ms->smp.cpus, FSL_IMX6_NUM_CPUS); i++) {
-         snprintf(name, NAME_SIZE, "cpu%d", i);
-         object_initialize_child(obj, name, &s->cpu[i], sizeof(s->cpu[i]),
--                                "cortex-a9-" TYPE_ARM_CPU, &error_abort, NULL);
-+                                ARM_CPU_TYPE_NAME("cortex-a9"),
-+                                &error_abort, NULL);
-     }
-     sysbus_init_child_obj(obj, "a9mpcore", &s->a9mpcore, sizeof(s->a9mpcore),
-diff --git a/hw/arm/fsl-imx6ul.c b/hw/arm/fsl-imx6ul.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx6ul.c
-+++ b/hw/arm/fsl-imx6ul.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx6ul_init(Object *obj)
-     int i;
-     object_initialize_child(obj, "cpu0", &s->cpu, sizeof(s->cpu),
--                            "cortex-a7-" TYPE_ARM_CPU, &error_abort, NULL);
-+                            ARM_CPU_TYPE_NAME("cortex-a7"), &error_abort, NULL);
-     /*
-      * A7MPCORE
-diff --git a/hw/arm/xlnx-zynqmp.c b/hw/arm/xlnx-zynqmp.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/xlnx-zynqmp.c
-+++ b/hw/arm/xlnx-zynqmp.c
-@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_create_rpu(MachineState *ms, XlnxZynqMPState *s,
-         object_initialize_child(OBJECT(&s->rpu_cluster), "rpu-cpu[*]",
-                                 &s->rpu_cpu[i], sizeof(s->rpu_cpu[i]),
--                                "cortex-r5f-" TYPE_ARM_CPU, &error_abort,
--                                NULL);
-+                                ARM_CPU_TYPE_NAME("cortex-r5f"),
-+                                &error_abort, NULL);
-         name = object_get_canonical_path_component(OBJECT(&s->rpu_cpu[i]));
-         if (strcmp(name, boot_cpu)) {
-@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_init(Object *obj)
-     for (i = 0; i < num_apus; i++) {
-         object_initialize_child(OBJECT(&s->apu_cluster), "apu-cpu[*]",
-                                 &s->apu_cpu[i], sizeof(s->apu_cpu[i]),
--                                "cortex-a53-" TYPE_ARM_CPU, &error_abort,
--                                NULL);
-+                                ARM_CPU_TYPE_NAME("cortex-a53"),
-+                                &error_abort, NULL);
-     }
-     sysbus_init_child_obj(obj, "gic", &s->gic, sizeof(s->gic),
---
-.20.1

-[Qemu-devel] [PULL 11/21] hw/arm: Use object_initialize_child for correct reference counting
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-As explained in commit aff39be0ed97:
-  Both functions, object_initialize() and object_property_add_child()
-  increase the reference counter of the new object, so one of the
-  references has to be dropped afterwards to get the reference
-  counting right. Otherwise the child object will not be properly
-  cleaned up when the parent gets destroyed.
-  Thus let's use now object_initialize_child() instead to get the
-  reference counting here right.
-Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Thomas Huth <thuth@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-3-philmd@redhat.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/mcimx7d-sabre.c |  9 ++++-----
- hw/arm/mps2-tz.c       | 15 +++++++--------
- hw/arm/musca.c         |  9 +++++----
-files changed, 16 insertions(+), 17 deletions(-)
-diff --git a/hw/arm/mcimx7d-sabre.c b/hw/arm/mcimx7d-sabre.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/mcimx7d-sabre.c
-+++ b/hw/arm/mcimx7d-sabre.c
-@@ -XXX,XX +XXX,XX @@ static void mcimx7d_sabre_init(MachineState *machine)
- {
-     static struct arm_boot_info boot_info;
-     MCIMX7Sabre *s = g_new0(MCIMX7Sabre, 1);
--    Object *soc;
-     int i;
-     if (machine->ram_size > FSL_IMX7_MMDC_SIZE) {
-@@ -XXX,XX +XXX,XX @@ static void mcimx7d_sabre_init(MachineState *machine)
-         .nb_cpus = machine->smp.cpus,
-     };
--    object_initialize(&s->soc, sizeof(s->soc), TYPE_FSL_IMX7);
--    soc = OBJECT(&s->soc);
--    object_property_add_child(OBJECT(machine), "soc", soc, &error_fatal);
--    object_property_set_bool(soc, true, "realized", &error_fatal);
-+    object_initialize_child(OBJECT(machine), "soc",
-+                            &s->soc, sizeof(s->soc),
-+                            TYPE_FSL_IMX7, &error_fatal, NULL);
-+    object_property_set_bool(OBJECT(&s->soc), true, "realized", &error_fatal);
-     memory_region_allocate_system_memory(&s->ram, NULL, "mcimx7d-sabre.ram",
-                                          machine->ram_size);
-diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/mps2-tz.c
-+++ b/hw/arm/mps2-tz.c
-@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
-     /* The sec_resp_cfg output from the IoTKit must be split into multiple
-      * lines, one for each of the PPCs we create here, plus one per MSC.
-      */
--    object_initialize(&mms->sec_resp_splitter, sizeof(mms->sec_resp_splitter),
--                      TYPE_SPLIT_IRQ);
--    object_property_add_child(OBJECT(machine), "sec-resp-splitter",
--                              OBJECT(&mms->sec_resp_splitter), &error_abort);
-+    object_initialize_child(OBJECT(machine), "sec-resp-splitter",
-+                            &mms->sec_resp_splitter,
-+                            sizeof(mms->sec_resp_splitter),
-+                            TYPE_SPLIT_IRQ, &error_abort, NULL);
-     object_property_set_int(OBJECT(&mms->sec_resp_splitter),
-                             ARRAY_SIZE(mms->ppc) + ARRAY_SIZE(mms->msc),
-                             "num-lines", &error_fatal);
-@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
-      * Tx, Rx and "combined" IRQs are sent to the NVIC separately.
-      * Create the OR gate for this.
-      */
--    object_initialize(&mms->uart_irq_orgate, sizeof(mms->uart_irq_orgate),
--                      TYPE_OR_IRQ);
--    object_property_add_child(OBJECT(mms), "uart-irq-orgate",
--                              OBJECT(&mms->uart_irq_orgate), &error_abort);
-+    object_initialize_child(OBJECT(mms), "uart-irq-orgate",
-+                            &mms->uart_irq_orgate, sizeof(mms->uart_irq_orgate),
-+                            TYPE_OR_IRQ, &error_abort, NULL);
-     object_property_set_int(OBJECT(&mms->uart_irq_orgate), 10, "num-lines",
-                             &error_fatal);
-     object_property_set_bool(OBJECT(&mms->uart_irq_orgate), true,
-diff --git a/hw/arm/musca.c b/hw/arm/musca.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/musca.c
-+++ b/hw/arm/musca.c
-@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
-      * The sec_resp_cfg output from the SSE-200 must be split into multiple
-      * lines, one for each of the PPCs we create here.
-      */
--    object_initialize(&mms->sec_resp_splitter, sizeof(mms->sec_resp_splitter),
--                      TYPE_SPLIT_IRQ);
--    object_property_add_child(OBJECT(machine), "sec-resp-splitter",
--                              OBJECT(&mms->sec_resp_splitter), &error_fatal);
-+    object_initialize_child(OBJECT(machine), "sec-resp-splitter",
-+                            &mms->sec_resp_splitter,
-+                            sizeof(mms->sec_resp_splitter),
-+                            TYPE_SPLIT_IRQ, &error_fatal, NULL);
-+
-     object_property_set_int(OBJECT(&mms->sec_resp_splitter),
-                             ARRAY_SIZE(mms->ppc), "num-lines", &error_fatal);
-     object_property_set_bool(OBJECT(&mms->sec_resp_splitter), true,
---
-.20.1

-[Qemu-devel] [PULL 12/21] hw/arm: Use sysbus_init_child_obj for correct reference counting
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-Both object_initialize() and qdev_set_parent_bus() increase the
-reference counter of the new object, so one of the references has
-to be dropped afterwards to get the reference counting right.
-In machine model code this refcount leak is not particularly
-problematic because (unlike devices) machines will never be
-created on demand via QMP, and they are never destroyed.
-But in any case let's use the new sysbus_init_child_obj() instead
-to get the reference counting here right.
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-4-philmd@redhat.com
-[PMM: rewrote commit message]
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/exynos4_boards.c | 4 ++--
-file changed, 2 insertions(+), 2 deletions(-)
-diff --git a/hw/arm/exynos4_boards.c b/hw/arm/exynos4_boards.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/exynos4_boards.c
-+++ b/hw/arm/exynos4_boards.c
-@@ -XXX,XX +XXX,XX @@ exynos4_boards_init_common(MachineState *machine,
-     exynos4_boards_init_ram(s, get_system_memory(),
-                             exynos4_board_ram_size[board_type]);
--    object_initialize(&s->soc, sizeof(s->soc), TYPE_EXYNOS4210_SOC);
--    qdev_set_parent_bus(DEVICE(&s->soc), sysbus_get_default());
-+    sysbus_init_child_obj(OBJECT(machine), "soc",
-+                          &s->soc, sizeof(s->soc), TYPE_EXYNOS4210_SOC);
-     object_property_set_bool(OBJECT(&s->soc), true, "realized",
-                              &error_fatal);
---
-.20.1

-[Qemu-devel] [PULL 13/21] hw/arm/fsl-imx: Add the cpu as child of the SoC object
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-Child properties form the composition tree. All objects need to be
-a child of another object. Objects can only be a child of one object.
-Respect this with the i.MX SoC, to get a cleaner composition tree.
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-5-philmd@redhat.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/arm/fsl-imx25.c | 4 +++-
- hw/arm/fsl-imx31.c | 4 +++-
-files changed, 6 insertions(+), 2 deletions(-)
-diff --git a/hw/arm/fsl-imx25.c b/hw/arm/fsl-imx25.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx25.c
-+++ b/hw/arm/fsl-imx25.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx25_init(Object *obj)
-     FslIMX25State *s = FSL_IMX25(obj);
-     int i;
--    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm926"));
-+    object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
-+                            ARM_CPU_TYPE_NAME("arm926"),
-+                            &error_abort, NULL);
-     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
-                           TYPE_IMX_AVIC);
-diff --git a/hw/arm/fsl-imx31.c b/hw/arm/fsl-imx31.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/arm/fsl-imx31.c
-+++ b/hw/arm/fsl-imx31.c
-@@ -XXX,XX +XXX,XX @@ static void fsl_imx31_init(Object *obj)
-     FslIMX31State *s = FSL_IMX31(obj);
-     int i;
--    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm1136"));
-+    object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
-+                            ARM_CPU_TYPE_NAME("arm1136"),
-+                            &error_abort, NULL);
-     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
-                           TYPE_IMX_AVIC);
---
-.20.1

-[Qemu-devel] [PULL 14/21] hw/dma/xilinx_axi: Use object_initialize_child for correct ref. counting
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-As explained in commit aff39be0ed97:
-  Both functions, object_initialize() and object_property_add_child()
-  increase the reference counter of the new object, so one of the
-  references has to be dropped afterwards to get the reference
-  counting right. Otherwise the child object will not be properly
-  cleaned up when the parent gets destroyed.
-  Thus let's use now object_initialize_child() instead to get the
-  reference counting here right.
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Thomas Huth <thuth@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-6-philmd@redhat.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/dma/xilinx_axidma.c | 16 ++++++++--------
-file changed, 8 insertions(+), 8 deletions(-)
-diff --git a/hw/dma/xilinx_axidma.c b/hw/dma/xilinx_axidma.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/dma/xilinx_axidma.c
-+++ b/hw/dma/xilinx_axidma.c
-@@ -XXX,XX +XXX,XX @@ static void xilinx_axidma_init(Object *obj)
-     XilinxAXIDMA *s = XILINX_AXI_DMA(obj);
-     SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
--    object_initialize(&s->rx_data_dev, sizeof(s->rx_data_dev),
--                      TYPE_XILINX_AXI_DMA_DATA_STREAM);
--    object_initialize(&s->rx_control_dev, sizeof(s->rx_control_dev),
--                      TYPE_XILINX_AXI_DMA_CONTROL_STREAM);
--    object_property_add_child(OBJECT(s), "axistream-connected-target",
--                              (Object *)&s->rx_data_dev, &error_abort);
--    object_property_add_child(OBJECT(s), "axistream-control-connected-target",
--                              (Object *)&s->rx_control_dev, &error_abort);
-+    object_initialize_child(OBJECT(s), "axistream-connected-target",
-+                            &s->rx_data_dev, sizeof(s->rx_data_dev),
-+                            TYPE_XILINX_AXI_DMA_DATA_STREAM, &error_abort,
-+                            NULL);
-+    object_initialize_child(OBJECT(s), "axistream-control-connected-target",
-+                            &s->rx_control_dev, sizeof(s->rx_control_dev),
-+                            TYPE_XILINX_AXI_DMA_CONTROL_STREAM, &error_abort,
-+                            NULL);
-     sysbus_init_irq(sbd, &s->streams[0].irq);
-     sysbus_init_irq(sbd, &s->streams[1].irq);
---
-.20.1

-[Qemu-devel] [PULL 15/21] hw/net/xilinx_axi: Use object_initialize_child for correct ref. counting
+Deleted patch
-From: Philippe Mathieu-Daudé <philmd@redhat.com>
-As explained in commit aff39be0ed97:
-  Both functions, object_initialize() and object_property_add_child()
-  increase the reference counter of the new object, so one of the
-  references has to be dropped afterwards to get the reference
-  counting right. Otherwise the child object will not be properly
-  cleaned up when the parent gets destroyed.
-  Thus let's use now object_initialize_child() instead to get the
-  reference counting here right.
-Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
-Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Thomas Huth <thuth@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190823143249.8096-7-philmd@redhat.com
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- hw/net/xilinx_axienet.c | 17 ++++++++---------
-file changed, 8 insertions(+), 9 deletions(-)
-diff --git a/hw/net/xilinx_axienet.c b/hw/net/xilinx_axienet.c
-index XXXXXXX..XXXXXXX 100644
---- a/hw/net/xilinx_axienet.c
-+++ b/hw/net/xilinx_axienet.c
-@@ -XXX,XX +XXX,XX @@ static void xilinx_enet_init(Object *obj)
-     XilinxAXIEnet *s = XILINX_AXI_ENET(obj);
-     SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
--    object_initialize(&s->rx_data_dev, sizeof(s->rx_data_dev),
--                      TYPE_XILINX_AXI_ENET_DATA_STREAM);
--    object_initialize(&s->rx_control_dev, sizeof(s->rx_control_dev),
--                      TYPE_XILINX_AXI_ENET_CONTROL_STREAM);
--    object_property_add_child(OBJECT(s), "axistream-connected-target",
--                              (Object *)&s->rx_data_dev, &error_abort);
--    object_property_add_child(OBJECT(s), "axistream-control-connected-target",
--                              (Object *)&s->rx_control_dev, &error_abort);
--
-+    object_initialize_child(OBJECT(s), "axistream-connected-target",
-+                            &s->rx_data_dev, sizeof(s->rx_data_dev),
-+                            TYPE_XILINX_AXI_ENET_DATA_STREAM, &error_abort,
-+                            NULL);
-+    object_initialize_child(OBJECT(s), "axistream-control-connected-target",
-+                            &s->rx_control_dev, sizeof(s->rx_control_dev),
-+                            TYPE_XILINX_AXI_ENET_CONTROL_STREAM, &error_abort,
-+                            NULL);
-     sysbus_init_irq(sbd, &s->irq);
-     memory_region_init_io(&s->iomem, OBJECT(s), &enet_ops, s, "enet", 0x40000);
---
-.20.1

-[Qemu-devel] [PULL 16/21] includes: remove stale [smp|max]_cpus externs
+Deleted patch
-From: Alex Bennée <alex.bennee@linaro.org>
-Commit a5e0b3311 removed these in favour of querying machine
-properties. Remove the extern declarations as well.
-Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Message-id: 20190828165307.18321-6-alex.bennee@linaro.org
-Cc: Like Xu <like.xu@linux.intel.com>
-Message-Id: <20190711130546.18578-1-alex.bennee@linaro.org>
-Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
----
- include/sysemu/sysemu.h | 2 --
-file changed, 2 deletions(-)
-diff --git a/include/sysemu/sysemu.h b/include/sysemu/sysemu.h
-index XXXXXXX..XXXXXXX 100644
---- a/include/sysemu/sysemu.h
-+++ b/include/sysemu/sysemu.h
-@@ -XXX,XX +XXX,XX @@ extern const char *keyboard_layout;
- extern int win2k_install_hack;
- extern int alt_grab;
- extern int ctrl_grab;
--extern int smp_cpus;
--extern unsigned int max_cpus;
- extern int cursor_hide;
- extern int graphic_rotate;
- extern int no_quit;
---
-.20.1

-[Qemu-devel] [PULL 20/21] target/arm: Free TCG temps in trans_VMOV_64_sp()
+[PULL 11/11] target/arm: Allow reading flags from FPSCR for M-profile
-The function neon_store_reg32() doesn't free the TCG temp that it
+From: Christophe Lyon <christophe.lyon@linaro.org>
-is passed, so the caller must do that. We got this right in most
-places but forgot to free the TCG temps in trans_VMOV_64_sp().
+rt==15 is a special case when reading the flags: it means the
 destination is APSR. This patch avoids rejecting
 vmrs apsr_nzcv, fpscr
 as illegal instruction.
 Cc: qemu-stable@nongnu.org
+Signed-off-by: Christophe Lyon <christophe.lyon@linaro.org>
+Message-id: 20191025095711.10853-1-christophe.lyon@linaro.org
+[PMM: updated the comment]
+Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
 Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
-Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
-Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
-Message-id: 20190827121931.26836-1-peter.maydell@linaro.org
 ---
- target/arm/translate-vfp.inc.c | 2 ++
+ target/arm/translate-vfp.inc.c | 5 +++--
-file changed, 2 insertions(+)
+file changed, 3 insertions(+), 2 deletions(-)
 diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
 index XXXXXXX..XXXXXXX 100644
 --- a/target/arm/translate-vfp.inc.c
 +++ b/target/arm/translate-vfp.inc.c
-@@ -XXX,XX +XXX,XX @@ static bool trans_VMOV_64_sp(DisasContext *s, arg_VMOV_64_sp *a)
+@@ -XXX,XX +XXX,XX @@ static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
-         /* gpreg to fpreg */
+     if (arm_dc_feature(s, ARM_FEATURE_M)) {
-         tmp = load_reg(s, a->rt);
+         /*
-         neon_store_reg32(tmp, a->vm);
+          * The only M-profile VFP vmrs/vmsr sysreg is FPSCR.
-+        tcg_temp_free_i32(tmp);
+-         * Writes to R15 are UNPREDICTABLE; we choose to undef.
-         tmp = load_reg(s, a->rt2);
++         * Accesses to R15 are UNPREDICTABLE; we choose to undef.
-         neon_store_reg32(tmp, a->vm + 1);
++         * (FPSCR -> r15 is a special case which writes to the PSR flags.)
-+        tcg_temp_free_i32(tmp);
+          */
 -        if (a->rt == 15 || a->reg != ARM_VFP_FPSCR) {
 +        if (a->rt == 15 && (!a->l || a->reg != ARM_VFP_FPSCR)) {
              return false;
          }
      }
-     return true;
 --
 .20.1

target-arm queue: this time around is all small fixes
and changes.

thanks
-- PMM

The following changes since commit fec105c2abda8567ec15230429c41429b5ee307c:

Merge remote-tracking branch 'remotes/kraxel/tags/audio-20190828-pull-request' into staging (2019-09-03 14:03:15 +0100)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20190903

for you to fetch changes up to 5e5584c89f36b302c666bc6db535fd3f7ff35ad2:

target/arm: Don't abort on M-profile exception return in linux-user mode (2019-09-03 16:20:35 +0100)

----------------------------------------------------------------
target-arm queue:
 * Revert and correctly fix refactoring of unallocated_encoding()
 * Take exceptions on ATS instructions when needed
 * aspeed/timer: Provide back-pressure information for short periods
 * memory: Remove unused memory_region_iommu_replay_all()
 * hw/arm/smmuv3: Log a guest error when decoding an invalid STE
 * hw/arm/smmuv3: Remove spurious error messages on IOVA invalidations
 * target/arm: Fix SMMLS argument order
 * hw/arm: Use ARM_CPU_TYPE_NAME() macro when appropriate
 * hw/arm: Correct reference counting for creation of various objects
 * includes: remove stale [smp|max]_cpus externs
 * tcg/README: fix typo
 * atomic_template: fix indentation in GEN_ATOMIC_HELPER
 * include/exec/cpu-defs.h: fix typo
 * target/arm: Free TCG temps in trans_VMOV_64_sp()
 * target/arm: Don't abort on M-profile exception return in linux-user mode

----------------------------------------------------------------
Alex Bennée (2):
      includes: remove stale [smp|max]_cpus externs
      include/exec/cpu-defs.h: fix typo

Andrew Jeffery (1):
      aspeed/timer: Provide back-pressure information for short periods

Emilio G. Cota (2):
      tcg/README: fix typo s/afterwise/afterwards/
      atomic_template: fix indentation in GEN_ATOMIC_HELPER

Eric Auger (3):
      memory: Remove unused memory_region_iommu_replay_all()
      hw/arm/smmuv3: Log a guest error when decoding an invalid STE
      hw/arm/smmuv3: Remove spurious error messages on IOVA invalidations

Peter Maydell (4):
      target/arm: Allow ARMCPRegInfo read/write functions to throw exceptions
      target/arm: Take exceptions on ATS instructions when needed
      target/arm: Free TCG temps in trans_VMOV_64_sp()
      target/arm: Don't abort on M-profile exception return in linux-user mode

Philippe Mathieu-Daudé (6):
      hw/arm: Use ARM_CPU_TYPE_NAME() macro when appropriate
      hw/arm: Use object_initialize_child for correct reference counting
      hw/arm: Use sysbus_init_child_obj for correct reference counting
      hw/arm/fsl-imx: Add the cpu as child of the SoC object
      hw/dma/xilinx_axi: Use object_initialize_child for correct ref. counting
      hw/net/xilinx_axi: Use object_initialize_child for correct ref. counting

Richard Henderson (3):
      Revert "target/arm: Use unallocated_encoding for aarch32"
      target/arm: Factor out unallocated_encoding for aarch32
      target/arm: Fix SMMLS argument order

From: Richard Henderson <richard.henderson@linaro.org>

This reverts commit 3cb36637157088892e9e33ddb1034bffd1251d3b.

Despite the fact that the text for the call to gen_exception_insn
is identical for aarch64 and aarch32, the implementation inside
gen_exception_insn is totally different.

This fixes exceptions raised from aarch64.

Reported-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Message-id: 20190826151536.6771-2-richard.henderson@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/translate-a64.h     |  2 ++
 target/arm/translate.h         |  2 --
 target/arm/translate-a64.c     |  7 +++++++
 target/arm/translate-vfp.inc.c |  3 ++-
 target/arm/translate.c         | 22 ++++++++++------------
 5 files changed, 21 insertions(+), 15 deletions(-)

diff --git a/target/arm/translate-a64.h b/target/arm/translate-a64.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-a64.h
+++ b/target/arm/translate-a64.h
@@ -XXX,XX +XXX,XX @@
 #ifndef TARGET_ARM_TRANSLATE_A64_H
 #define TARGET_ARM_TRANSLATE_A64_H
 
+void unallocated_encoding(DisasContext *s);
+
 #define unsupported_encoding(s, insn)                                    \
     do {                                                                 \
         qemu_log_mask(LOG_UNIMP,                                         \
diff --git a/target/arm/translate.h b/target/arm/translate.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.h
+++ b/target/arm/translate.h
@@ -XXX,XX +XXX,XX @@ typedef struct DisasCompare {
     bool value_global;
 } DisasCompare;
 
-void unallocated_encoding(DisasContext *s);
-
 /* Share the TCG temporaries common between 32 and 64 bit modes.  */
 extern TCGv_i32 cpu_NF, cpu_ZF, cpu_CF, cpu_VF;
 extern TCGv_i64 cpu_exclusive_addr;
diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-a64.c
+++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static inline void gen_goto_tb(DisasContext *s, int n, uint64_t dest)
     }
 }
 
+void unallocated_encoding(DisasContext *s)
+{
+    /* Unallocated and reserved encodings are uncategorized */
+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                       default_exception_el(s));
+}
+
 static void init_tmp_a64_array(DisasContext *s)
 {
 #ifdef CONFIG_DEBUG_TCG
diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-vfp.inc.c
+++ b/target/arm/translate-vfp.inc.c
@@ -XXX,XX +XXX,XX @@ static bool full_vfp_access_check(DisasContext *s, bool ignore_vfp_enabled)
 
     if (!s->vfp_enabled && !ignore_vfp_enabled) {
         assert(!arm_dc_feature(s, ARM_FEATURE_M));
-        unallocated_encoding(s);
+        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                           default_exception_el(s));
         return false;
     }
 
diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static void gen_exception_bkpt_insn(DisasContext *s, uint32_t syn)
     s->base.is_jmp = DISAS_NORETURN;
 }
 
-void unallocated_encoding(DisasContext *s)
-{
-    /* Unallocated and reserved encodings are uncategorized */
-    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                       default_exception_el(s));
-}
-
 /* Force a TB lookup after an instruction that changes the CPU state.  */
 static inline void gen_lookup_tb(DisasContext *s)
 {
@@ -XXX,XX +XXX,XX @@ static inline void gen_hlt(DisasContext *s, int imm)
         return;
     }
 
-    unallocated_encoding(s);
+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                       default_exception_el(s));
 }
 
 static inline void gen_add_data_offset(DisasContext *s, unsigned int insn,
@@ -XXX,XX +XXX,XX @@ static void gen_srs(DisasContext *s,
     }
 
     if (undef) {
-        unallocated_encoding(s);
+        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                           default_exception_el(s));
         return;
     }
 
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
             break;
         default:
         illegal_op:
-            unallocated_encoding(s);
+            gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                               default_exception_el(s));
             break;
         }
     }
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
     }
     return;
 illegal_op:
-    unallocated_encoding(s);
+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                       default_exception_el(s));
 }
 
 static void disas_thumb_insn(DisasContext *s, uint32_t insn)
@@ -XXX,XX +XXX,XX @@ static void disas_thumb_insn(DisasContext *s, uint32_t insn)
     return;
 illegal_op:
 undef:
-    unallocated_encoding(s);
+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                       default_exception_el(s));
 }
 
 static bool insn_crosses_page(CPUARMState *env, DisasContext *s)
-- 
2.20.1

From: Richard Henderson <richard.henderson@linaro.org>

Make this a static function private to translate.c.
Thus we can use the same idiom between aarch64 and aarch32
without actually sharing function implementations.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Message-id: 20190826151536.6771-3-richard.henderson@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/translate-vfp.inc.c |  3 +--
 target/arm/translate.c         | 22 ++++++++++++----------
 2 files changed, 13 insertions(+), 12 deletions(-)

diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-vfp.inc.c
+++ b/target/arm/translate-vfp.inc.c
@@ -XXX,XX +XXX,XX @@ static bool full_vfp_access_check(DisasContext *s, bool ignore_vfp_enabled)
 
     if (!s->vfp_enabled && !ignore_vfp_enabled) {
         assert(!arm_dc_feature(s, ARM_FEATURE_M));
-        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                           default_exception_el(s));
+        unallocated_encoding(s);
         return false;
     }
 
diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static void gen_exception_bkpt_insn(DisasContext *s, uint32_t syn)
     s->base.is_jmp = DISAS_NORETURN;
 }
 
+static void unallocated_encoding(DisasContext *s)
+{
+    /* Unallocated and reserved encodings are uncategorized */
+    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
+                       default_exception_el(s));
+}
+
 /* Force a TB lookup after an instruction that changes the CPU state.  */
 static inline void gen_lookup_tb(DisasContext *s)
 {
@@ -XXX,XX +XXX,XX @@ static inline void gen_hlt(DisasContext *s, int imm)
         return;
     }
 
-    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                       default_exception_el(s));
+    unallocated_encoding(s);
 }
 
 static inline void gen_add_data_offset(DisasContext *s, unsigned int insn,
@@ -XXX,XX +XXX,XX @@ static void gen_srs(DisasContext *s,
     }
 
     if (undef) {
-        gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                           default_exception_el(s));
+        unallocated_encoding(s);
         return;
     }
 
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
             break;
         default:
         illegal_op:
-            gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                               default_exception_el(s));
+            unallocated_encoding(s);
             break;
         }
     }
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
     }
     return;
 illegal_op:
-    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                       default_exception_el(s));
+    unallocated_encoding(s);
 }
 
 static void disas_thumb_insn(DisasContext *s, uint32_t insn)
@@ -XXX,XX +XXX,XX @@ static void disas_thumb_insn(DisasContext *s, uint32_t insn)
     return;
 illegal_op:
 undef:
-    gen_exception_insn(s, s->pc_curr, EXCP_UDEF, syn_uncategorized(),
-                       default_exception_el(s));
+    unallocated_encoding(s);
 }
 
 static bool insn_crosses_page(CPUARMState *env, DisasContext *s)
-- 
2.20.1

Currently the only part of an ARMCPRegInfo which is allowed to cause
a CPU exception is the access function, which returns a value indicating
that some flavour of UNDEF should be generated.

For the ATS system instructions, we would like to conditionally
generate exceptions as part of the writefn, because some faults
during the page table walk (like external aborts) should cause
an exception to be raised rather than returning a value.

There are several ways we could do this:
 * plumb the GETPC() value from the top level set_cp_reg/get_cp_reg
   helper functions through into the readfn and writefn hooks
 * add extra readfn_with_ra/writefn_with_ra hooks that take the GETPC()
   value
 * require the ATS instructions to provide a dummy accessfn,
   which serves no purpose except to cause the code generation
   to emit TCG ops to sync the CPU state
 * add an ARM_CP_ flag to mark the ARMCPRegInfo as possibly
   throwing an exception in its read/write hooks, and make the
   codegen sync the CPU state before calling the hooks if the
   flag is set

This patch opts for the last of these, as it is fairly simple
to implement and doesn't require invasive changes like updating
the readfn/writefn hook function prototype signature.

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Tested-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Message-id: 20190816125802.25877-2-peter.maydell@linaro.org
---
 target/arm/cpu.h           | 6 +++++-
 target/arm/translate-a64.c | 6 ++++++
 target/arm/translate.c     | 7 +++++++
 3 files changed, 18 insertions(+), 1 deletion(-)

diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -XXX,XX +XXX,XX @@ static inline uint64_t cpreg_to_kvm_id(uint32_t cpregid)
  * IO indicates that this register does I/O and therefore its accesses
  * need to be surrounded by gen_io_start()/gen_io_end(). In particular,
  * registers which implement clocks or timers require this.
+ * RAISES_EXC is for when the read or write hook might raise an exception;
+ * the generated code will synchronize the CPU state before calling the hook
+ * so that it is safe for the hook to call raise_exception().
  */
 #define ARM_CP_SPECIAL           0x0001
 #define ARM_CP_CONST             0x0002
@@ -XXX,XX +XXX,XX @@ static inline uint64_t cpreg_to_kvm_id(uint32_t cpregid)
 #define ARM_CP_FPU               0x1000
 #define ARM_CP_SVE               0x2000
 #define ARM_CP_NO_GDB            0x4000
+#define ARM_CP_RAISES_EXC        0x8000
 /* Used only as a terminator for ARMCPRegInfo lists */
 #define ARM_CP_SENTINEL          0xffff
 /* Mask of only the flag bits in a type field */
-#define ARM_CP_FLAG_MASK         0x70ff
+#define ARM_CP_FLAG_MASK         0xf0ff
 
 /* Valid values for ARMCPRegInfo state field, indicating which of
  * the AArch32 and AArch64 execution states this register is visible in.
diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-a64.c
+++ b/target/arm/translate-a64.c
@@ -XXX,XX +XXX,XX @@ static void handle_sys(DisasContext *s, uint32_t insn, bool isread,
         tcg_temp_free_ptr(tmpptr);
         tcg_temp_free_i32(tcg_syn);
         tcg_temp_free_i32(tcg_isread);
+    } else if (ri->type & ARM_CP_RAISES_EXC) {
+        /*
+         * The readfn or writefn might raise an exception;
+         * synchronize the CPU state in case it does.
+         */
+        gen_a64_set_pc_im(s->pc_curr);
     }
 
     /* Handle special cases first */
diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static int disas_coproc_insn(DisasContext *s, uint32_t insn)
             tcg_temp_free_ptr(tmpptr);
             tcg_temp_free_i32(tcg_syn);
             tcg_temp_free_i32(tcg_isread);
+        } else if (ri->type & ARM_CP_RAISES_EXC) {
+            /*
+             * The readfn or writefn might raise an exception;
+             * synchronize the CPU state in case it does.
+             */
+            gen_set_condexec(s);
+            gen_set_pc_im(s, s->pc_curr);
         }
 
         /* Handle special cases first */
-- 
2.20.1

The translation table walk for an ATS instruction can result in
various faults.  In general these are just reported back via the
PAR_EL1 fault status fields, but in some cases the architecture
requires that the fault is turned into an exception:
 * synchronous stage 2 faults of any kind during AT S1E0* and
   AT S1E1* instructions executed from NS EL1 fault to EL2 or EL3
 * synchronous external aborts are taken as Data Abort exceptions

(This is documented in the v8A Arm ARM DDI0487A.e D5.2.11 and
G5.13.4.)

Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Tested-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Message-id: 20190816125802.25877-3-peter.maydell@linaro.org
---
 target/arm/helper.c | 107 +++++++++++++++++++++++++++++++++++++-------
 1 file changed, 92 insertions(+), 15 deletions(-)

diff --git a/target/arm/helper.c b/target/arm/helper.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/helper.c
+++ b/target/arm/helper.c
@@ -XXX,XX +XXX,XX @@ static uint64_t do_ats_write(CPUARMState *env, uint64_t value,
     ret = get_phys_addr(env, value, access_type, mmu_idx, &phys_addr, &attrs,
                         &prot, &page_size, &fi, &cacheattrs);
 
+    if (ret) {
+        /*
+         * Some kinds of translation fault must cause exceptions rather
+         * than being reported in the PAR.
+         */
+        int current_el = arm_current_el(env);
+        int target_el;
+        uint32_t syn, fsr, fsc;
+        bool take_exc = false;
+
+        if (fi.s1ptw && current_el == 1 && !arm_is_secure(env)
+            && (mmu_idx == ARMMMUIdx_S1NSE1 || mmu_idx == ARMMMUIdx_S1NSE0)) {
+            /*
+             * Synchronous stage 2 fault on an access made as part of the
+             * translation table walk for AT S1E0* or AT S1E1* insn
+             * executed from NS EL1. If this is a synchronous external abort
+             * and SCR_EL3.EA == 1, then we take a synchronous external abort
+             * to EL3. Otherwise the fault is taken as an exception to EL2,
+             * and HPFAR_EL2 holds the faulting IPA.
+             */
+            if (fi.type == ARMFault_SyncExternalOnWalk &&
+                (env->cp15.scr_el3 & SCR_EA)) {
+                target_el = 3;
+            } else {
+                env->cp15.hpfar_el2 = extract64(fi.s2addr, 12, 47) << 4;
+                target_el = 2;
+            }
+            take_exc = true;
+        } else if (fi.type == ARMFault_SyncExternalOnWalk) {
+            /*
+             * Synchronous external aborts during a translation table walk
+             * are taken as Data Abort exceptions.
+             */
+            if (fi.stage2) {
+                if (current_el == 3) {
+                    target_el = 3;
+                } else {
+                    target_el = 2;
+                }
+            } else {
+                target_el = exception_target_el(env);
+            }
+            take_exc = true;
+        }
+
+        if (take_exc) {
+            /* Construct FSR and FSC using same logic as arm_deliver_fault() */
+            if (target_el == 2 || arm_el_is_aa64(env, target_el) ||
+                arm_s1_regime_using_lpae_format(env, mmu_idx)) {
+                fsr = arm_fi_to_lfsc(&fi);
+                fsc = extract32(fsr, 0, 6);
+            } else {
+                fsr = arm_fi_to_sfsc(&fi);
+                fsc = 0x3f;
+            }
+            /*
+             * Report exception with ESR indicating a fault due to a
+             * translation table walk for a cache maintenance instruction.
+             */
+            syn = syn_data_abort_no_iss(current_el == target_el,
+                                        fi.ea, 1, fi.s1ptw, 1, fsc);
+            env->exception.vaddress = value;
+            env->exception.fsr = fsr;
+            raise_exception(env, EXCP_DATA_ABORT, syn, target_el);
+        }
+    }
+
     if (is_a64(env)) {
         format64 = true;
     } else if (arm_feature(env, ARM_FEATURE_LPAE)) {
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo vapa_cp_reginfo[] = {
     /* This underdecoding is safe because the reginfo is NO_RAW. */
     { .name = "ATS", .cp = 15, .crn = 7, .crm = 8, .opc1 = 0, .opc2 = CP_ANY,
       .access = PL1_W, .accessfn = ats_access,
-      .writefn = ats_write, .type = ARM_CP_NO_RAW },
+      .writefn = ats_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
 #endif
     REGINFO_SENTINEL
 };
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo v8_cp_reginfo[] = {
     /* 64 bit address translation operations */
     { .name = "AT_S1E1R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 0,
-      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S1E1W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 1,
-      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S1E0R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 2,
-      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S1E0W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 0, .crn = 7, .crm = 8, .opc2 = 3,
-      .access = PL1_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL1_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S12E1R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 4,
-      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S12E1W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 5,
-      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S12E0R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 6,
-      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S12E0W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 7,
-      .access = PL2_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL2_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     /* AT S1E2* are elsewhere as they UNDEF from EL3 if EL2 is not present */
     { .name = "AT_S1E3R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 6, .crn = 7, .crm = 8, .opc2 = 0,
-      .access = PL3_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL3_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "AT_S1E3W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 6, .crn = 7, .crm = 8, .opc2 = 1,
-      .access = PL3_W, .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .access = PL3_W, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC,
+      .writefn = ats_write64 },
     { .name = "PAR_EL1", .state = ARM_CP_STATE_AA64,
       .type = ARM_CP_ALIAS,
       .opc0 = 3, .opc1 = 0, .crn = 7, .crm = 4, .opc2 = 0,
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
     { .name = "AT_S1E2R", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 0,
       .access = PL2_W, .accessfn = at_s1e2_access,
-      .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
     { .name = "AT_S1E2W", .state = ARM_CP_STATE_AA64,
       .opc0 = 1, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 1,
       .access = PL2_W, .accessfn = at_s1e2_access,
-      .type = ARM_CP_NO_RAW, .writefn = ats_write64 },
+      .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC, .writefn = ats_write64 },
     /* The AArch32 ATS1H* operations are CONSTRAINED UNPREDICTABLE
      * if EL2 is not implemented; we choose to UNDEF. Behaviour at EL3
      * with SCR.NS == 0 outside Monitor mode is UNPREDICTABLE; we choose
@@ -XXX,XX +XXX,XX @@ static const ARMCPRegInfo el2_cp_reginfo[] = {
      */
     { .name = "ATS1HR", .cp = 15, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 0,
       .access = PL2_W,
-      .writefn = ats1h_write, .type = ARM_CP_NO_RAW },
+      .writefn = ats1h_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
     { .name = "ATS1HW", .cp = 15, .opc1 = 4, .crn = 7, .crm = 8, .opc2 = 1,
       .access = PL2_W,
-      .writefn = ats1h_write, .type = ARM_CP_NO_RAW },
+      .writefn = ats1h_write, .type = ARM_CP_NO_RAW | ARM_CP_RAISES_EXC },
     { .name = "CNTHCTL_EL2", .state = ARM_CP_STATE_BOTH,
       .opc0 = 3, .opc1 = 4, .crn = 14, .crm = 1, .opc2 = 0,
       /* ARMv7 requires bit 0 and 1 to reset to 1. ARMv8 defines the
-- 
2.20.1

From: Andrew Jeffery <andrew@aj.id.au>

First up: This is not the way the hardware behaves.

However, it helps resolve real-world problems with short periods being
used under Linux. Commit 4451d3f59f2a ("clocksource/drivers/fttmr010:
Fix set_next_event handler") in Linux fixed the timer driver to
correctly schedule the next event for the Aspeed controller, and in
combination with 5daa8212c08e ("ARM: dts: aspeed: Describe random number
device") Linux will now set a timer with a period as low as 1us.

Configuring a qemu timer with such a short period results in spending
time handling the interrupt in the model rather than executing guest
code, leading to noticeable "sticky" behaviour in the guest.

The behaviour of Linux is correct with respect to the hardware, so we
need to improve our handling under emulation. The approach chosen is to
provide back-pressure information by calculating an acceptable minimum
number of ticks to be set on the model. Under Linux an additional read
is added in the timer configuration path to detect back-pressure, which
will never occur on hardware. However if back-pressure is observed, the
driver alerts the clock event subsystem, which then performs its own
next event dilation via a config option - d1748302f70b ("clockevents:
Make minimum delay adjustments configurable")

A minimum period of 5us was experimentally determined on a Lenovo
T480s, which I've increased to 20us for "safety".

Signed-off-by: Andrew Jeffery <andrew@aj.id.au>
Reviewed-by: Joel Stanley <joel@jms.id.au>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Tested-by: Joel Stanley <joel@jms.id.au>
Signed-off-by: Cédric Le Goater <clg@kaod.org>
Message-id: 20190704055150.4899-1-clg@kaod.org
[clg: - changed the computation of min_ticks to be done each time the
        timer value is reloaded. It removes the ordering issue of the
        timer and scu reset handlers but is slightly slower ]
      - introduced TIMER_MIN_NS
      - introduced calculate_min_ticks() ]
Signed-off-by: Cédric Le Goater <clg@kaod.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/timer/aspeed_timer.c | 17 ++++++++++++++++-
 1 file changed, 16 insertions(+), 1 deletion(-)

diff --git a/hw/timer/aspeed_timer.c b/hw/timer/aspeed_timer.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/timer/aspeed_timer.c
+++ b/hw/timer/aspeed_timer.c
@@ -XXX,XX +XXX,XX @@ enum timer_ctrl_op {
     op_pulse_enable
 };
 
+/*
+ * Minimum value of the reload register to filter out short period
+ * timers which have a noticeable impact in emulation. 5us should be
+ * enough, use 20us for "safety".
+ */
+#define TIMER_MIN_NS (20 * SCALE_US)
+
 /**
  * Avoid mutual references between AspeedTimerCtrlState and AspeedTimer
  * structs, as it's a waste of memory. The ptimer BH callback needs to know
@@ -XXX,XX +XXX,XX @@ static inline uint32_t calculate_ticks(struct AspeedTimer *t, uint64_t now_ns)
     return t->reload - MIN(t->reload, ticks);
 }
 
+static uint32_t calculate_min_ticks(AspeedTimer *t, uint32_t value)
+{
+    uint32_t rate = calculate_rate(t);
+    uint32_t min_ticks = muldiv64(TIMER_MIN_NS, rate, NANOSECONDS_PER_SECOND);
+
+    return  value < min_ticks ? min_ticks : value;
+}
+
 static inline uint64_t calculate_time(struct AspeedTimer *t, uint32_t ticks)
 {
     uint64_t delta_ns;
@@ -XXX,XX +XXX,XX @@ static void aspeed_timer_set_value(AspeedTimerCtrlState *s, int timer, int reg,
     switch (reg) {
     case TIMER_REG_RELOAD:
         old_reload = t->reload;
-        t->reload = value;
+        t->reload = calculate_min_ticks(t, value);
 
         /* If the reload value was not previously set, or zero, and
          * the current value is valid, try to start the timer if it is
-- 
2.20.1

From: Eric Auger <eric.auger@redhat.com>

memory_region_iommu_replay_all is not used. Remove it.

Signed-off-by: Eric Auger <eric.auger@redhat.com>
Reported-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Peter Xu <peterx@redhat.com>
Message-id: 20190822172350.12008-2-eric.auger@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/exec/memory.h | 10 ----------
 memory.c              |  9 ---------
 2 files changed, 19 deletions(-)

diff --git a/include/exec/memory.h b/include/exec/memory.h
index XXXXXXX..XXXXXXX 100644
--- a/include/exec/memory.h
+++ b/include/exec/memory.h
@@ -XXX,XX +XXX,XX @@ void memory_region_register_iommu_notifier(MemoryRegion *mr,
  */
 void memory_region_iommu_replay(IOMMUMemoryRegion *iommu_mr, IOMMUNotifier *n);
 
-/**
- * memory_region_iommu_replay_all: replay existing IOMMU translations
- * to all the notifiers registered.
- *
- * Note: this is not related to record-and-replay functionality.
- *
- * @iommu_mr: the memory region to observe
- */
-void memory_region_iommu_replay_all(IOMMUMemoryRegion *iommu_mr);
-
 /**
  * memory_region_unregister_iommu_notifier: unregister a notifier for
  * changes to IOMMU translation entries.
diff --git a/memory.c b/memory.c
index XXXXXXX..XXXXXXX 100644
--- a/memory.c
+++ b/memory.c
@@ -XXX,XX +XXX,XX @@ void memory_region_iommu_replay(IOMMUMemoryRegion *iommu_mr, IOMMUNotifier *n)
     }
 }
 
-void memory_region_iommu_replay_all(IOMMUMemoryRegion *iommu_mr)
-{
-    IOMMUNotifier *notifier;
-
-    IOMMU_NOTIFIER_FOREACH(notifier, iommu_mr) {
-        memory_region_iommu_replay(iommu_mr, notifier);
-    }
-}
-
 void memory_region_unregister_iommu_notifier(MemoryRegion *mr,
                                              IOMMUNotifier *n)
 {
-- 
2.20.1

From: Eric Auger <eric.auger@redhat.com>

An IOVA/ASID invalidation is notified to all IOMMU Memory Regions
through smmuv3_inv_notifiers_iova/smmuv3_notify_iova.

When the notification occurs it is possible that some of the
PCIe devices associated to the notified regions do not have a
valid stream table entry. In that case we output a LOG_GUEST_ERROR
message, for example:

invalid sid=<SID> (L1STD span=0)
"smmuv3_notify_iova error decoding the configuration for iommu mr=<MR>

This is unfortunate as the user gets the impression that there
are some translation decoding errors whereas there are not.

This patch adds a new field in SMMUEventInfo that tells whether
the detection of an invalid STE must lead to an error report.
invalid_ste_allowed is set before doing the invalidations and
kept unset on actual translation.

The other configuration decoding error messages are kept since if the
STE is valid then the rest of the config must be correct.

Signed-off-by: Eric Auger <eric.auger@redhat.com>
Message-id: 20190822172350.12008-6-eric.auger@redhat.com
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/smmuv3-internal.h |  1 +
 hw/arm/smmuv3.c          | 19 +++++++++++--------
 2 files changed, 12 insertions(+), 8 deletions(-)

diff --git a/hw/arm/smmuv3-internal.h b/hw/arm/smmuv3-internal.h
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3-internal.h
+++ b/hw/arm/smmuv3-internal.h
@@ -XXX,XX +XXX,XX @@ typedef struct SMMUEventInfo {
     uint32_t sid;
     bool recorded;
     bool record_trans_faults;
+    bool inval_ste_allowed;
     union {
         struct {
             uint32_t ssid;
diff --git a/hw/arm/smmuv3.c b/hw/arm/smmuv3.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/smmuv3.c
+++ b/hw/arm/smmuv3.c
@@ -XXX,XX +XXX,XX @@ static int decode_ste(SMMUv3State *s, SMMUTransCfg *cfg,
     uint32_t config;
 
     if (!STE_VALID(ste)) {
-        qemu_log_mask(LOG_GUEST_ERROR, "invalid STE\n");
+        if (!event->inval_ste_allowed) {
+            qemu_log_mask(LOG_GUEST_ERROR, "invalid STE\n");
+        }
         goto bad_ste;
     }
 
@@ -XXX,XX +XXX,XX @@ static int smmu_find_ste(SMMUv3State *s, uint32_t sid, STE *ste,
 
         if (!span) {
             /* l2ptr is not valid */
-            qemu_log_mask(LOG_GUEST_ERROR,
-                          "invalid sid=%d (L1STD span=0)\n", sid);
+            if (!event->inval_ste_allowed) {
+                qemu_log_mask(LOG_GUEST_ERROR,
+                              "invalid sid=%d (L1STD span=0)\n", sid);
+            }
             event->type = SMMU_EVT_C_BAD_STREAMID;
             return -EINVAL;
         }
@@ -XXX,XX +XXX,XX @@ static IOMMUTLBEntry smmuv3_translate(IOMMUMemoryRegion *mr, hwaddr addr,
     SMMUDevice *sdev = container_of(mr, SMMUDevice, iommu);
     SMMUv3State *s = sdev->smmu;
     uint32_t sid = smmu_get_sid(sdev);
-    SMMUEventInfo event = {.type = SMMU_EVT_NONE, .sid = sid};
+    SMMUEventInfo event = {.type = SMMU_EVT_NONE,
+                           .sid = sid,
+                           .inval_ste_allowed = false};
     SMMUPTWEventInfo ptw_info = {};
     SMMUTranslationStatus status;
     SMMUState *bs = ARM_SMMU(s);
@@ -XXX,XX +XXX,XX @@ static void smmuv3_notify_iova(IOMMUMemoryRegion *mr,
                                dma_addr_t iova)
 {
     SMMUDevice *sdev = container_of(mr, SMMUDevice, iommu);
-    SMMUEventInfo event = {};
+    SMMUEventInfo event = {.inval_ste_allowed = true};
     SMMUTransTableInfo *tt;
     SMMUTransCfg *cfg;
     IOMMUTLBEntry entry;
 
     cfg = smmuv3_get_config(sdev, &event);
     if (!cfg) {
-        qemu_log_mask(LOG_GUEST_ERROR,
-                      "%s error decoding the configuration for iommu mr=%s\n",
-                      __func__, mr->parent_obj.name);
         return;
     }
 
-- 
2.20.1

From: Richard Henderson <richard.henderson@linaro.org>

The previous simplification got the order of operands to the
subtraction wrong.  Since the 64-bit product is the subtrahend,
we must use a 64-bit subtract to properly compute the borrow
from the low-part of the product.

Fixes: 5f8cd06ebcf5 ("target/arm: Simplify SMMLA, SMMLAR, SMMLS, SMMLSR")
Reported-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
Tested-by: Laurent Desnogues <laurent.desnogues@gmail.com>
Message-id: 20190829013258.16102-1-richard.henderson@linaro.org
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/translate.c | 20 ++++++++++++++++++--
 1 file changed, 18 insertions(+), 2 deletions(-)

diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static void disas_arm_insn(DisasContext *s, unsigned int insn)
                         if (rd != 15) {
                             tmp3 = load_reg(s, rd);
                             if (insn & (1 << 6)) {
-                                tcg_gen_sub_i32(tmp, tmp, tmp3);
+                                /*
+                                 * For SMMLS, we need a 64-bit subtract.
+                                 * Borrow caused by a non-zero multiplicand
+                                 * lowpart, and the correct result lowpart
+                                 * for rounding.
+                                 */
+                                TCGv_i32 zero = tcg_const_i32(0);
+                                tcg_gen_sub2_i32(tmp2, tmp, zero, tmp3,
+                                                 tmp2, tmp);
+                                tcg_temp_free_i32(zero);
                             } else {
                                 tcg_gen_add_i32(tmp, tmp, tmp3);
                             }
@@ -XXX,XX +XXX,XX @@ static void disas_thumb2_insn(DisasContext *s, uint32_t insn)
                     if (insn & (1 << 20)) {
                         tcg_gen_add_i32(tmp, tmp, tmp3);
                     } else {
-                        tcg_gen_sub_i32(tmp, tmp, tmp3);
+                        /*
+                         * For SMMLS, we need a 64-bit subtract.
+                         * Borrow caused by a non-zero multiplicand lowpart,
+                         * and the correct result lowpart for rounding.
+                         */
+                        TCGv_i32 zero = tcg_const_i32(0);
+                        tcg_gen_sub2_i32(tmp2, tmp, zero, tmp3, tmp2, tmp);
+                        tcg_temp_free_i32(zero);
                     }
                     tcg_temp_free_i32(tmp3);
                 }
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

Commit ba1ba5cca introduce the ARM_CPU_TYPE_NAME() macro.
Unify the code base by use it in all places.

Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-2-philmd@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/allwinner-a10.c | 3 ++-
 hw/arm/cubieboard.c    | 3 ++-
 hw/arm/digic.c         | 3 ++-
 hw/arm/fsl-imx25.c     | 2 +-
 hw/arm/fsl-imx31.c     | 2 +-
 hw/arm/fsl-imx6.c      | 3 ++-
 hw/arm/fsl-imx6ul.c    | 2 +-
 hw/arm/xlnx-zynqmp.c   | 8 ++++----
 8 files changed, 15 insertions(+), 11 deletions(-)

diff --git a/hw/arm/allwinner-a10.c b/hw/arm/allwinner-a10.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/allwinner-a10.c
+++ b/hw/arm/allwinner-a10.c
@@ -XXX,XX +XXX,XX @@ static void aw_a10_init(Object *obj)
     AwA10State *s = AW_A10(obj);
 
     object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
-                            "cortex-a8-" TYPE_ARM_CPU, &error_abort, NULL);
+                            ARM_CPU_TYPE_NAME("cortex-a8"),
+                            &error_abort, NULL);
 
     sysbus_init_child_obj(obj, "intc", &s->intc, sizeof(s->intc),
                           TYPE_AW_A10_PIC);
diff --git a/hw/arm/cubieboard.c b/hw/arm/cubieboard.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/cubieboard.c
+++ b/hw/arm/cubieboard.c
@@ -XXX,XX +XXX,XX @@ static void cubieboard_init(MachineState *machine)
 
 static void cubieboard_machine_init(MachineClass *mc)
 {
-    mc->desc = "cubietech cubieboard";
+    mc->desc = "cubietech cubieboard (Cortex-A9)";
+    mc->default_cpu_type = ARM_CPU_TYPE_NAME("cortex-a9");
     mc->init = cubieboard_init;
     mc->block_default_type = IF_IDE;
     mc->units_per_default_bus = 1;
diff --git a/hw/arm/digic.c b/hw/arm/digic.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/digic.c
+++ b/hw/arm/digic.c
@@ -XXX,XX +XXX,XX @@ static void digic_init(Object *obj)
     int i;
 
     object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
-                            "arm946-" TYPE_ARM_CPU, &error_abort, NULL);
+                            ARM_CPU_TYPE_NAME("arm946"),
+                            &error_abort, NULL);
 
     for (i = 0; i < DIGIC4_NB_TIMERS; i++) {
 #define DIGIC_TIMER_NAME_MLEN    11
diff --git a/hw/arm/fsl-imx25.c b/hw/arm/fsl-imx25.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx25.c
+++ b/hw/arm/fsl-imx25.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx25_init(Object *obj)
     FslIMX25State *s = FSL_IMX25(obj);
     int i;
 
-    object_initialize(&s->cpu, sizeof(s->cpu), "arm926-" TYPE_ARM_CPU);
+    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm926"));
 
     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
                           TYPE_IMX_AVIC);
diff --git a/hw/arm/fsl-imx31.c b/hw/arm/fsl-imx31.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx31.c
+++ b/hw/arm/fsl-imx31.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx31_init(Object *obj)
     FslIMX31State *s = FSL_IMX31(obj);
     int i;
 
-    object_initialize(&s->cpu, sizeof(s->cpu), "arm1136-" TYPE_ARM_CPU);
+    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm1136"));
 
     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
                           TYPE_IMX_AVIC);
diff --git a/hw/arm/fsl-imx6.c b/hw/arm/fsl-imx6.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx6.c
+++ b/hw/arm/fsl-imx6.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx6_init(Object *obj)
     for (i = 0; i < MIN(ms->smp.cpus, FSL_IMX6_NUM_CPUS); i++) {
         snprintf(name, NAME_SIZE, "cpu%d", i);
         object_initialize_child(obj, name, &s->cpu[i], sizeof(s->cpu[i]),
-                                "cortex-a9-" TYPE_ARM_CPU, &error_abort, NULL);
+                                ARM_CPU_TYPE_NAME("cortex-a9"),
+                                &error_abort, NULL);
     }
 
     sysbus_init_child_obj(obj, "a9mpcore", &s->a9mpcore, sizeof(s->a9mpcore),
diff --git a/hw/arm/fsl-imx6ul.c b/hw/arm/fsl-imx6ul.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx6ul.c
+++ b/hw/arm/fsl-imx6ul.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx6ul_init(Object *obj)
     int i;
 
     object_initialize_child(obj, "cpu0", &s->cpu, sizeof(s->cpu),
-                            "cortex-a7-" TYPE_ARM_CPU, &error_abort, NULL);
+                            ARM_CPU_TYPE_NAME("cortex-a7"), &error_abort, NULL);
 
     /*
      * A7MPCORE
diff --git a/hw/arm/xlnx-zynqmp.c b/hw/arm/xlnx-zynqmp.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/xlnx-zynqmp.c
+++ b/hw/arm/xlnx-zynqmp.c
@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_create_rpu(MachineState *ms, XlnxZynqMPState *s,
 
         object_initialize_child(OBJECT(&s->rpu_cluster), "rpu-cpu[*]",
                                 &s->rpu_cpu[i], sizeof(s->rpu_cpu[i]),
-                                "cortex-r5f-" TYPE_ARM_CPU, &error_abort,
-                                NULL);
+                                ARM_CPU_TYPE_NAME("cortex-r5f"),
+                                &error_abort, NULL);
 
         name = object_get_canonical_path_component(OBJECT(&s->rpu_cpu[i]));
         if (strcmp(name, boot_cpu)) {
@@ -XXX,XX +XXX,XX @@ static void xlnx_zynqmp_init(Object *obj)
     for (i = 0; i < num_apus; i++) {
         object_initialize_child(OBJECT(&s->apu_cluster), "apu-cpu[*]",
                                 &s->apu_cpu[i], sizeof(s->apu_cpu[i]),
-                                "cortex-a53-" TYPE_ARM_CPU, &error_abort,
-                                NULL);
+                                ARM_CPU_TYPE_NAME("cortex-a53"),
+                                &error_abort, NULL);
     }
 
     sysbus_init_child_obj(obj, "gic", &s->gic, sizeof(s->gic),
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

As explained in commit aff39be0ed97:

Both functions, object_initialize() and object_property_add_child()
  increase the reference counter of the new object, so one of the
  references has to be dropped afterwards to get the reference
  counting right. Otherwise the child object will not be properly
  cleaned up when the parent gets destroyed.
  Thus let's use now object_initialize_child() instead to get the
  reference counting here right.

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Thomas Huth <thuth@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-3-philmd@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/mcimx7d-sabre.c |  9 ++++-----
 hw/arm/mps2-tz.c       | 15 +++++++--------
 hw/arm/musca.c         |  9 +++++----
 3 files changed, 16 insertions(+), 17 deletions(-)

diff --git a/hw/arm/mcimx7d-sabre.c b/hw/arm/mcimx7d-sabre.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/mcimx7d-sabre.c
+++ b/hw/arm/mcimx7d-sabre.c
@@ -XXX,XX +XXX,XX @@ static void mcimx7d_sabre_init(MachineState *machine)
 {
     static struct arm_boot_info boot_info;
     MCIMX7Sabre *s = g_new0(MCIMX7Sabre, 1);
-    Object *soc;
     int i;
 
     if (machine->ram_size > FSL_IMX7_MMDC_SIZE) {
@@ -XXX,XX +XXX,XX @@ static void mcimx7d_sabre_init(MachineState *machine)
         .nb_cpus = machine->smp.cpus,
     };
 
-    object_initialize(&s->soc, sizeof(s->soc), TYPE_FSL_IMX7);
-    soc = OBJECT(&s->soc);
-    object_property_add_child(OBJECT(machine), "soc", soc, &error_fatal);
-    object_property_set_bool(soc, true, "realized", &error_fatal);
+    object_initialize_child(OBJECT(machine), "soc",
+                            &s->soc, sizeof(s->soc),
+                            TYPE_FSL_IMX7, &error_fatal, NULL);
+    object_property_set_bool(OBJECT(&s->soc), true, "realized", &error_fatal);
 
     memory_region_allocate_system_memory(&s->ram, NULL, "mcimx7d-sabre.ram",
                                          machine->ram_size);
diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/mps2-tz.c
+++ b/hw/arm/mps2-tz.c
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
     /* The sec_resp_cfg output from the IoTKit must be split into multiple
      * lines, one for each of the PPCs we create here, plus one per MSC.
      */
-    object_initialize(&mms->sec_resp_splitter, sizeof(mms->sec_resp_splitter),
-                      TYPE_SPLIT_IRQ);
-    object_property_add_child(OBJECT(machine), "sec-resp-splitter",
-                              OBJECT(&mms->sec_resp_splitter), &error_abort);
+    object_initialize_child(OBJECT(machine), "sec-resp-splitter",
+                            &mms->sec_resp_splitter,
+                            sizeof(mms->sec_resp_splitter),
+                            TYPE_SPLIT_IRQ, &error_abort, NULL);
     object_property_set_int(OBJECT(&mms->sec_resp_splitter),
                             ARRAY_SIZE(mms->ppc) + ARRAY_SIZE(mms->msc),
                             "num-lines", &error_fatal);
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
      * Tx, Rx and "combined" IRQs are sent to the NVIC separately.
      * Create the OR gate for this.
      */
-    object_initialize(&mms->uart_irq_orgate, sizeof(mms->uart_irq_orgate),
-                      TYPE_OR_IRQ);
-    object_property_add_child(OBJECT(mms), "uart-irq-orgate",
-                              OBJECT(&mms->uart_irq_orgate), &error_abort);
+    object_initialize_child(OBJECT(mms), "uart-irq-orgate",
+                            &mms->uart_irq_orgate, sizeof(mms->uart_irq_orgate),
+                            TYPE_OR_IRQ, &error_abort, NULL);
     object_property_set_int(OBJECT(&mms->uart_irq_orgate), 10, "num-lines",
                             &error_fatal);
     object_property_set_bool(OBJECT(&mms->uart_irq_orgate), true,
diff --git a/hw/arm/musca.c b/hw/arm/musca.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/musca.c
+++ b/hw/arm/musca.c
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
      * The sec_resp_cfg output from the SSE-200 must be split into multiple
      * lines, one for each of the PPCs we create here.
      */
-    object_initialize(&mms->sec_resp_splitter, sizeof(mms->sec_resp_splitter),
-                      TYPE_SPLIT_IRQ);
-    object_property_add_child(OBJECT(machine), "sec-resp-splitter",
-                              OBJECT(&mms->sec_resp_splitter), &error_fatal);
+    object_initialize_child(OBJECT(machine), "sec-resp-splitter",
+                            &mms->sec_resp_splitter,
+                            sizeof(mms->sec_resp_splitter),
+                            TYPE_SPLIT_IRQ, &error_fatal, NULL);
+
     object_property_set_int(OBJECT(&mms->sec_resp_splitter),
                             ARRAY_SIZE(mms->ppc), "num-lines", &error_fatal);
     object_property_set_bool(OBJECT(&mms->sec_resp_splitter), true,
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

Both object_initialize() and qdev_set_parent_bus() increase the
reference counter of the new object, so one of the references has
to be dropped afterwards to get the reference counting right.
In machine model code this refcount leak is not particularly
problematic because (unlike devices) machines will never be
created on demand via QMP, and they are never destroyed.
But in any case let's use the new sysbus_init_child_obj() instead
to get the reference counting here right.

Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-4-philmd@redhat.com
[PMM: rewrote commit message]
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/exynos4_boards.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/hw/arm/exynos4_boards.c b/hw/arm/exynos4_boards.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/exynos4_boards.c
+++ b/hw/arm/exynos4_boards.c
@@ -XXX,XX +XXX,XX @@ exynos4_boards_init_common(MachineState *machine,
     exynos4_boards_init_ram(s, get_system_memory(),
                             exynos4_board_ram_size[board_type]);
 
-    object_initialize(&s->soc, sizeof(s->soc), TYPE_EXYNOS4210_SOC);
-    qdev_set_parent_bus(DEVICE(&s->soc), sysbus_get_default());
+    sysbus_init_child_obj(OBJECT(machine), "soc",
+                          &s->soc, sizeof(s->soc), TYPE_EXYNOS4210_SOC);
     object_property_set_bool(OBJECT(&s->soc), true, "realized",
                              &error_fatal);
 
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

Child properties form the composition tree. All objects need to be
a child of another object. Objects can only be a child of one object.

Respect this with the i.MX SoC, to get a cleaner composition tree.

Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-5-philmd@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/arm/fsl-imx25.c | 4 +++-
 hw/arm/fsl-imx31.c | 4 +++-
 2 files changed, 6 insertions(+), 2 deletions(-)

diff --git a/hw/arm/fsl-imx25.c b/hw/arm/fsl-imx25.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx25.c
+++ b/hw/arm/fsl-imx25.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx25_init(Object *obj)
     FslIMX25State *s = FSL_IMX25(obj);
     int i;
 
-    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm926"));
+    object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
+                            ARM_CPU_TYPE_NAME("arm926"),
+                            &error_abort, NULL);
 
     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
                           TYPE_IMX_AVIC);
diff --git a/hw/arm/fsl-imx31.c b/hw/arm/fsl-imx31.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/arm/fsl-imx31.c
+++ b/hw/arm/fsl-imx31.c
@@ -XXX,XX +XXX,XX @@ static void fsl_imx31_init(Object *obj)
     FslIMX31State *s = FSL_IMX31(obj);
     int i;
 
-    object_initialize(&s->cpu, sizeof(s->cpu), ARM_CPU_TYPE_NAME("arm1136"));
+    object_initialize_child(obj, "cpu", &s->cpu, sizeof(s->cpu),
+                            ARM_CPU_TYPE_NAME("arm1136"),
+                            &error_abort, NULL);
 
     sysbus_init_child_obj(obj, "avic", &s->avic, sizeof(s->avic),
                           TYPE_IMX_AVIC);
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

As explained in commit aff39be0ed97:

Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Thomas Huth <thuth@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-6-philmd@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/dma/xilinx_axidma.c | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

diff --git a/hw/dma/xilinx_axidma.c b/hw/dma/xilinx_axidma.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/dma/xilinx_axidma.c
+++ b/hw/dma/xilinx_axidma.c
@@ -XXX,XX +XXX,XX @@ static void xilinx_axidma_init(Object *obj)
     XilinxAXIDMA *s = XILINX_AXI_DMA(obj);
     SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
 
-    object_initialize(&s->rx_data_dev, sizeof(s->rx_data_dev),
-                      TYPE_XILINX_AXI_DMA_DATA_STREAM);
-    object_initialize(&s->rx_control_dev, sizeof(s->rx_control_dev),
-                      TYPE_XILINX_AXI_DMA_CONTROL_STREAM);
-    object_property_add_child(OBJECT(s), "axistream-connected-target",
-                              (Object *)&s->rx_data_dev, &error_abort);
-    object_property_add_child(OBJECT(s), "axistream-control-connected-target",
-                              (Object *)&s->rx_control_dev, &error_abort);
+    object_initialize_child(OBJECT(s), "axistream-connected-target",
+                            &s->rx_data_dev, sizeof(s->rx_data_dev),
+                            TYPE_XILINX_AXI_DMA_DATA_STREAM, &error_abort,
+                            NULL);
+    object_initialize_child(OBJECT(s), "axistream-control-connected-target",
+                            &s->rx_control_dev, sizeof(s->rx_control_dev),
+                            TYPE_XILINX_AXI_DMA_CONTROL_STREAM, &error_abort,
+                            NULL);
 
     sysbus_init_irq(sbd, &s->streams[0].irq);
     sysbus_init_irq(sbd, &s->streams[1].irq);
-- 
2.20.1

From: Philippe Mathieu-Daudé <philmd@redhat.com>

As explained in commit aff39be0ed97:

Reviewed-by: Alistair Francis <alistair.francis@wdc.com>
Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Thomas Huth <thuth@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190823143249.8096-7-philmd@redhat.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 hw/net/xilinx_axienet.c | 17 ++++++++---------
 1 file changed, 8 insertions(+), 9 deletions(-)

diff --git a/hw/net/xilinx_axienet.c b/hw/net/xilinx_axienet.c
index XXXXXXX..XXXXXXX 100644
--- a/hw/net/xilinx_axienet.c
+++ b/hw/net/xilinx_axienet.c
@@ -XXX,XX +XXX,XX @@ static void xilinx_enet_init(Object *obj)
     XilinxAXIEnet *s = XILINX_AXI_ENET(obj);
     SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
 
-    object_initialize(&s->rx_data_dev, sizeof(s->rx_data_dev),
-                      TYPE_XILINX_AXI_ENET_DATA_STREAM);
-    object_initialize(&s->rx_control_dev, sizeof(s->rx_control_dev),
-                      TYPE_XILINX_AXI_ENET_CONTROL_STREAM);
-    object_property_add_child(OBJECT(s), "axistream-connected-target",
-                              (Object *)&s->rx_data_dev, &error_abort);
-    object_property_add_child(OBJECT(s), "axistream-control-connected-target",
-                              (Object *)&s->rx_control_dev, &error_abort);
-
+    object_initialize_child(OBJECT(s), "axistream-connected-target",
+                            &s->rx_data_dev, sizeof(s->rx_data_dev),
+                            TYPE_XILINX_AXI_ENET_DATA_STREAM, &error_abort,
+                            NULL);
+    object_initialize_child(OBJECT(s), "axistream-control-connected-target",
+                            &s->rx_control_dev, sizeof(s->rx_control_dev),
+                            TYPE_XILINX_AXI_ENET_CONTROL_STREAM, &error_abort,
+                            NULL);
     sysbus_init_irq(sbd, &s->irq);
 
     memory_region_init_io(&s->iomem, OBJECT(s), &enet_ops, s, "enet", 0x40000);
-- 
2.20.1

From: Alex Bennée <alex.bennee@linaro.org>

Commit a5e0b3311 removed these in favour of querying machine
properties. Remove the extern declarations as well.

Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Message-id: 20190828165307.18321-6-alex.bennee@linaro.org
Cc: Like Xu <like.xu@linux.intel.com>
Message-Id: <20190711130546.18578-1-alex.bennee@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 include/sysemu/sysemu.h | 2 --
 1 file changed, 2 deletions(-)

diff --git a/include/sysemu/sysemu.h b/include/sysemu/sysemu.h
index XXXXXXX..XXXXXXX 100644
--- a/include/sysemu/sysemu.h
+++ b/include/sysemu/sysemu.h
@@ -XXX,XX +XXX,XX @@ extern const char *keyboard_layout;
 extern int win2k_install_hack;
 extern int alt_grab;
 extern int ctrl_grab;
-extern int smp_cpus;
-extern unsigned int max_cpus;
 extern int cursor_hide;
 extern int graphic_rotate;
 extern int no_quit;
-- 
2.20.1

From: "Emilio G. Cota" <cota@braap.org>

Afterwise is "wise after the fact", as in "hindsight".
Here we meant "afterwards" (as in "subsequently"). Fix it.

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
Signed-off-by: Emilio G. Cota <cota@braap.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 20190828165307.18321-7-alex.bennee@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 tcg/README | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tcg/README b/tcg/README
index XXXXXXX..XXXXXXX 100644
--- a/tcg/README
+++ b/tcg/README
@@ -XXX,XX +XXX,XX @@ This can be overridden using the following function modifiers:
   canonical locations before calling the helper.
 - TCG_CALL_NO_WRITE_GLOBALS means that the helper does not modify any globals.
   They will only be saved to their canonical location before calling helpers,
-  but they won't be reloaded afterwise.
+  but they won't be reloaded afterwards.
 - TCG_CALL_NO_SIDE_EFFECTS means that the call to the function is removed if
   the return value is not used.
 
-- 
2.20.1

From: "Emilio G. Cota" <cota@braap.org>

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
Signed-off-by: Emilio G. Cota <cota@braap.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Alex Bennée <alex.bennee@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 20190828165307.18321-8-alex.bennee@linaro.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 accel/tcg/atomic_template.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/accel/tcg/atomic_template.h b/accel/tcg/atomic_template.h
index XXXXXXX..XXXXXXX 100644
--- a/accel/tcg/atomic_template.h
+++ b/accel/tcg/atomic_template.h
@@ -XXX,XX +XXX,XX @@ ABI_TYPE ATOMIC_NAME(xchg)(CPUArchState *env, target_ulong addr,
 
 #define GEN_ATOMIC_HELPER(X)                                        \
 ABI_TYPE ATOMIC_NAME(X)(CPUArchState *env, target_ulong addr,       \
-                 ABI_TYPE val EXTRA_ARGS)                           \
+                        ABI_TYPE val EXTRA_ARGS)                    \
 {                                                                   \
     ATOMIC_MMU_DECLS;                                               \
     DATA_TYPE *haddr = ATOMIC_MMU_LOOKUP;                           \
-- 
2.20.1

The function neon_store_reg32() doesn't free the TCG temp that it
is passed, so the caller must do that. We got this right in most
places but forgot to free the TCG temps in trans_VMOV_64_sp().

Cc: qemu-stable@nongnu.org
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
Message-id: 20190827121931.26836-1-peter.maydell@linaro.org
---
 target/arm/translate-vfp.inc.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/target/arm/translate-vfp.inc.c b/target/arm/translate-vfp.inc.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate-vfp.inc.c
+++ b/target/arm/translate-vfp.inc.c
@@ -XXX,XX +XXX,XX @@ static bool trans_VMOV_64_sp(DisasContext *s, arg_VMOV_64_sp *a)
         /* gpreg to fpreg */
         tmp = load_reg(s, a->rt);
         neon_store_reg32(tmp, a->vm);
+        tcg_temp_free_i32(tmp);
         tmp = load_reg(s, a->rt2);
         neon_store_reg32(tmp, a->vm + 1);
+        tcg_temp_free_i32(tmp);
     }
 
     return true;
-- 
2.20.1

An attempt to do an exception-return (branch to one of the magic
addresses) in linux-user mode for M-profile should behave like
a normal branch, because linux-user mode is always going to be
in 'handler' mode. This used to work, but we broke it when we added
support for the M-profile security extension in commit d02a8698d7ae2bfed.

In that commit we allowed even handler-mode calls to magic return
values to be checked for and dealt with by causing an
EXCP_EXCEPTION_EXIT exception to be taken, because this is
needed for the FNC_RETURN return-from-non-secure-function-call
handling. For system mode we added a check in do_v7m_exception_exit()
to make any spurious calls from Handler mode behave correctly, but
forgot that linux-user mode would also be affected.

How an attempted return-from-non-secure-function-call in linux-user
mode should be handled is not clear -- on real hardware it would
result in return to secure code (not to the Linux kernel) which
could then handle the error in any way it chose. For QEMU we take
the simple approach of treating this erroneous return the same way
it would be handled on a CPU without the security extensions --
treat it as a normal branch.

The upshot of all this is that for linux-user mode we should never
do any of the bx_excret magic, so the code change is simple.

This ought to be a weird corner case that only affects broken guest
code (because Linux user processes should never be attempting to do
exception returns or NS function returns), except that the code that
assigns addresses in RAM for the process and stack in our linux-user
code does not attempt to avoid this magic address range, so
legitimate code attempting to return to a trampoline routine on the
stack can fall into this case. This change fixes those programs,
but we should also look at restricting the range of memory we
use for M-profile linux-user guests to the area that would be
real RAM in hardware.

Cc: qemu-stable@nongnu.org
Reported-by: Christophe Lyon <christophe.lyon@linaro.org>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Message-id: 20190822131534.16602-1-peter.maydell@linaro.org
Fixes: https://bugs.launchpad.net/qemu/+bug/1840922
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
---
 target/arm/translate.c | 21 ++++++++++++++++++++-
 1 file changed, 20 insertions(+), 1 deletion(-)

diff --git a/target/arm/translate.c b/target/arm/translate.c
index XXXXXXX..XXXXXXX 100644
--- a/target/arm/translate.c
+++ b/target/arm/translate.c
@@ -XXX,XX +XXX,XX @@ static inline void gen_bx(DisasContext *s, TCGv_i32 var)
     store_cpu_field(var, thumb);
 }
 
-/* Set PC and Thumb state from var. var is marked as dead.
+/*
+ * Set PC and Thumb state from var. var is marked as dead.
  * For M-profile CPUs, include logic to detect exception-return
  * branches and handle them. This is needed for Thumb POP/LDM to PC, LDR to PC,
  * and BX reg, and no others, and happens only for code in Handler mode.
+ * The Security Extension also requires us to check for the FNC_RETURN
+ * which signals a function return from non-secure state; this can happen
+ * in both Handler and Thread mode.
+ * To avoid having to do multiple comparisons in inline generated code,
+ * we make the check we do here loose, so it will match for EXC_RETURN
+ * in Thread mode. For system emulation do_v7m_exception_exit() checks
+ * for these spurious cases and returns without doing anything (giving
+ * the same behaviour as for a branch to a non-magic address).
+ *
+ * In linux-user mode it is unclear what the right behaviour for an
+ * attempted FNC_RETURN should be, because in real hardware this will go
+ * directly to Secure code (ie not the Linux kernel) which will then treat
+ * the error in any way it chooses. For QEMU we opt to make the FNC_RETURN
+ * attempt behave the way it would on a CPU without the security extension,
+ * which is to say "like a normal branch". That means we can simply treat
+ * all branches as normal with no magic address behaviour.
  */
 static inline void gen_bx_excret(DisasContext *s, TCGv_i32 var)
 {
@@ -XXX,XX +XXX,XX @@ static inline void gen_bx_excret(DisasContext *s, TCGv_i32 var)
      * s->base.is_jmp that we need to do the rest of the work later.
      */
     gen_bx(s, var);
+#ifndef CONFIG_USER_ONLY
     if (arm_dc_feature(s, ARM_FEATURE_M_SECURITY) ||
         (s->v7m_handler_mode && arm_dc_feature(s, ARM_FEATURE_M))) {
         s->base.is_jmp = DISAS_BX_EXCRET;
     }
+#endif
 }
 
 static inline void gen_bx_excret_final_code(DisasContext *s)
-- 
2.20.1

target-arm queue: two bug fixes, plus the KVM/SVE patchset,
which is a new feature but one which was in my pre-softfreeze
pullreq (it just had to be dropped due to an unexpected test failure.)

thanks
-- PMM

The following changes since commit b7c9a7f353c0e260519bf735ff0d4aa01e72784b:

Merge remote-tracking branch 'remotes/jnsnow/tags/ide-pull-request' into staging (2019-10-31 15:57:30 +0000)

are available in the Git repository at:

https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20191101-1

for you to fetch changes up to d9ae7624b659362cb2bb2b04fee53bf50829ca56:

target/arm: Allow reading flags from FPSCR for M-profile (2019-11-01 08:49:10 +0000)

----------------------------------------------------------------
target-arm queue:
 * Support SVE in KVM guests
 * Don't UNDEF on M-profile 'vmrs apsr_nzcv, fpscr'
 * Update hflags after boot.c modifies CPU state

----------------------------------------------------------------
Andrew Jones (9):
      target/arm/monitor: Introduce qmp_query_cpu_model_expansion
      tests: arm: Introduce cpu feature tests
      target/arm: Allow SVE to be disabled via a CPU property
      target/arm/cpu64: max cpu: Introduce sve<N> properties
      target/arm/kvm64: Add kvm_arch_get/put_sve
      target/arm/kvm64: max cpu: Enable SVE when available
      target/arm/kvm: scratch vcpu: Preserve input kvm_vcpu_init features
      target/arm/cpu64: max cpu: Support sve properties with KVM
      target/arm/kvm: host cpu: Add support for sve<N> properties

Christophe Lyon (1):
      target/arm: Allow reading flags from FPSCR for M-profile

Edgar E. Iglesias (1):
      hw/arm/boot: Rebuild hflags when modifying CPUState at boot