From: Juergen Gross
To: xen-devel@lists.xenproject.org
Cc: Juergen Gross, Stefano Stabellini, Wei Liu, Konrad Rzeszutek Wilk,
    George Dunlap, Andrew Cooper, Ian Jackson, Tim Deegan, Julien Grall,
    Jan Beulich, Dario Faggioli
Date: Sat, 14 Sep 2019 10:52:28 +0200
Message-Id: <20190914085251.18816-25-jgross@suse.com>
In-Reply-To: <20190914085251.18816-1-jgross@suse.com>
References: <20190914085251.18816-1-jgross@suse.com>
Subject: [Xen-devel] [PATCH v3 24/47] xen: switch from for_each_vcpu() to for_each_sched_unit()

Where appropriate, switch from for_each_vcpu() to for_each_sched_unit()
in order to prepare for core scheduling.

As this is beneficial here already, and will certainly be in the
future, add a unit_scheduler() helper and let vcpu_scheduler() use it.

Signed-off-by: Juergen Gross
---
V2:
- handle affinity_broken correctly (Jan Beulich)
- add unit_scheduler() (Jan Beulich)

V3:
- add const (Jan Beulich)
- add TODOs for missing multiple vcpu per unit support (Jan Beulich)
---
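[A note on the iterators used in the diff below: for_each_sched_unit()
walks a domain's scheduling units, while for_each_sched_unit_vcpu()
walks the vcpus grouped into one unit. A minimal sketch of the
resulting nested pattern; this is illustrative only and not part of
the patch, and domain_vcpu_count() is a made-up helper:

static unsigned int domain_vcpu_count(const struct domain *d)
{
    const struct sched_unit *unit;
    const struct vcpu *v;
    unsigned int n = 0;

    for_each_sched_unit ( d, unit )          /* one iteration per unit */
        for_each_sched_unit_vcpu ( unit, v ) /* only this unit's vcpus */
            n++;

    return n;
}

As long as each unit contains exactly one vcpu, the nested walk visits
the same vcpus as for_each_vcpu() did, which keeps the conversion
mechanical; the TODOs in the diff mark the spots that still assume
unit->vcpu_list is the unit's only vcpu.]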
 xen/common/domain.c   |   9 ++-
 xen/common/schedule.c | 158 +++++++++++++++++++++++++++++++++-------------------
 2 files changed, 103 insertions(+), 64 deletions(-)

diff --git a/xen/common/domain.c b/xen/common/domain.c
index a3e23f2ee7..7a1be85be9 100644
--- a/xen/common/domain.c
+++ b/xen/common/domain.c
@@ -554,7 +554,7 @@ void domain_update_node_affinity(struct domain *d)
     cpumask_var_t dom_cpumask, dom_cpumask_soft;
     cpumask_t *dom_affinity;
     const cpumask_t *online;
-    struct vcpu *v;
+    struct sched_unit *unit;
     unsigned int cpu;

     /* Do we have vcpus already? If not, no need to update node-affinity. */
@@ -587,12 +587,11 @@ void domain_update_node_affinity(struct domain *d)
      * and the full mask of where it would prefer to run (the union of
      * the soft affinity of all its various vcpus). Let's build them.
      */
-    for_each_vcpu ( d, v )
+    for_each_sched_unit ( d, unit )
     {
-        cpumask_or(dom_cpumask, dom_cpumask,
-                   v->sched_unit->cpu_hard_affinity);
+        cpumask_or(dom_cpumask, dom_cpumask, unit->cpu_hard_affinity);
         cpumask_or(dom_cpumask_soft, dom_cpumask_soft,
-                   v->sched_unit->cpu_soft_affinity);
+                   unit->cpu_soft_affinity);
     }
     /* Filter out non-online cpus */
     cpumask_and(dom_cpumask, dom_cpumask, online);
diff --git a/xen/common/schedule.c b/xen/common/schedule.c
index 9c41b2dd4d..7b37461db9 100644
--- a/xen/common/schedule.c
+++ b/xen/common/schedule.c
@@ -150,26 +150,32 @@ static inline struct scheduler *dom_scheduler(const struct domain *d)
     return &ops;
 }

-static inline struct scheduler *vcpu_scheduler(const struct vcpu *v)
+static inline struct scheduler *unit_scheduler(const struct sched_unit *unit)
 {
-    struct domain *d = v->domain;
+    struct domain *d = unit->domain;

     if ( likely(d->cpupool != NULL) )
         return d->cpupool->sched;

     /*
-     * If d->cpupool is NULL, this is a vCPU of the idle domain. And this
+     * If d->cpupool is NULL, this is a unit of the idle domain. And this
      * case is special because the idle domain does not really belong to
      * a cpupool and, hence, doesn't really have a scheduler). In fact, its
-     * vCPUs (may) run on pCPUs which are in different pools, with different
+     * units (may) run on pCPUs which are in different pools, with different
      * schedulers.
      *
      * What we want, in this case, is the scheduler of the pCPU where this
-     * particular idle vCPU is running. And, since v->processor never changes
-     * for idle vCPUs, it is safe to use it, with no locks, to figure that out.
+     * particular idle unit is running. And, since unit->res never changes
+     * for idle units, it is safe to use it, with no locks, to figure that out.
      */
+    ASSERT(is_idle_domain(d));
-    return per_cpu(scheduler, v->processor);
+    return per_cpu(scheduler, unit->res->master_cpu);
+}
+
+static inline struct scheduler *vcpu_scheduler(const struct vcpu *v)
+{
+    return unit_scheduler(v->sched_unit);
 }
 #define VCPU2ONLINE(_v) cpupool_domain_cpumask((_v)->domain)

@@ -491,10 +497,11 @@ static void sched_move_irqs(struct sched_unit *unit)
 int sched_move_domain(struct domain *d, struct cpupool *c)
 {
     struct vcpu *v;
-    unsigned int new_p;
-    void **vcpu_priv;
+    struct sched_unit *unit;
+    unsigned int new_p, unit_idx;
+    void **unit_priv;
     void *domdata;
-    void *vcpudata;
+    void *unitdata;
     struct scheduler *old_ops;
     void *old_domdata;

@@ -508,25 +515,27 @@ int sched_move_domain(struct domain *d, struct cpupool *c)
     if ( IS_ERR(domdata) )
         return PTR_ERR(domdata);

-    vcpu_priv = xzalloc_array(void *, d->max_vcpus);
-    if ( vcpu_priv == NULL )
+    /* TODO: fix array size with multiple vcpus per unit. */
+    unit_priv = xzalloc_array(void *, d->max_vcpus);
+    if ( unit_priv == NULL )
     {
         sched_free_domdata(c->sched, domdata);
         return -ENOMEM;
     }

-    for_each_vcpu ( d, v )
+    unit_idx = 0;
+    for_each_sched_unit ( d, unit )
     {
-        vcpu_priv[v->vcpu_id] = sched_alloc_vdata(c->sched, v->sched_unit,
-                                                  domdata);
-        if ( vcpu_priv[v->vcpu_id] == NULL )
+        unit_priv[unit_idx] = sched_alloc_vdata(c->sched, unit, domdata);
+        if ( unit_priv[unit_idx] == NULL )
         {
-            for_each_vcpu ( d, v )
-                xfree(vcpu_priv[v->vcpu_id]);
-            xfree(vcpu_priv);
+            for ( unit_idx = 0; unit_priv[unit_idx]; unit_idx++ )
+                sched_free_vdata(c->sched, unit_priv[unit_idx]);
+            xfree(unit_priv);
             sched_free_domdata(c->sched, domdata);
             return -ENOMEM;
         }
+        unit_idx++;
     }

     domain_pause(d);
@@ -534,30 +543,36 @@ int sched_move_domain(struct domain *d, struct cpupool *c)
     old_ops = dom_scheduler(d);
     old_domdata = d->sched_priv;

-    for_each_vcpu ( d, v )
+    for_each_sched_unit ( d, unit )
     {
-        sched_remove_unit(old_ops, v->sched_unit);
+        sched_remove_unit(old_ops, unit);
     }

     d->cpupool = c;
     d->sched_priv = domdata;

     new_p = cpumask_first(c->cpu_valid);
-    for_each_vcpu ( d, v )
+    unit_idx = 0;
+    for_each_sched_unit ( d, unit )
     {
         spinlock_t *lock;
+        unsigned int unit_p = new_p;

-        vcpudata = v->sched_unit->priv;
+        unitdata = unit->priv;

-        migrate_timer(&v->periodic_timer, new_p);
-        migrate_timer(&v->singleshot_timer, new_p);
-        migrate_timer(&v->poll_timer, new_p);
+        for_each_sched_unit_vcpu ( unit, v )
+        {
+            migrate_timer(&v->periodic_timer, new_p);
+            migrate_timer(&v->singleshot_timer, new_p);
+            migrate_timer(&v->poll_timer, new_p);
+            new_p = cpumask_cycle(new_p, c->cpu_valid);
+        }

-        lock = unit_schedule_lock_irq(v->sched_unit);
+        lock = unit_schedule_lock_irq(unit);

-        sched_set_affinity(v, &cpumask_all, &cpumask_all);
+        sched_set_affinity(unit->vcpu_list, &cpumask_all, &cpumask_all);

-        sched_set_res(v->sched_unit, get_sched_res(new_p));
+        sched_set_res(unit, get_sched_res(unit_p));
         /*
          * With v->processor modified we must not
          * - make any further changes assuming we hold the scheduler lock,
@@ -565,15 +580,15 @@ int sched_move_domain(struct domain *d, struct cpupool *c)
          */
         spin_unlock_irq(lock);

-        v->sched_unit->priv = vcpu_priv[v->vcpu_id];
+        unit->priv = unit_priv[unit_idx];
         if ( !d->is_dying )
-            sched_move_irqs(v->sched_unit);
+            sched_move_irqs(unit);

-        new_p = cpumask_cycle(new_p, c->cpu_valid);
+        sched_insert_unit(c->sched, unit);

-        sched_insert_unit(c->sched, v->sched_unit);
+        sched_free_vdata(old_ops, unitdata);

-        sched_free_vdata(old_ops, vcpudata);
+        unit_idx++;
     }

     domain_update_node_affinity(d);
@@ -582,7 +597,7 @@ int sched_move_domain(struct domain *d, struct cpupool *c)

     sched_free_domdata(old_ops, old_domdata);

-    xfree(vcpu_priv);
+    xfree(unit_priv);

     return 0;
 }
@@ -866,18 +881,36 @@ static void vcpu_migrate_finish(struct vcpu *v)
         vcpu_wake(v);
 }

+static bool sched_check_affinity_broken(const struct sched_unit *unit)
+{
+    const struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        if ( v->affinity_broken )
+            return true;
+
+    return false;
+}
+
+static void sched_reset_affinity_broken(struct sched_unit *unit)
+{
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        v->affinity_broken = false;
+}
+
 void restore_vcpu_affinity(struct domain *d)
 {
     unsigned int cpu = smp_processor_id();
-    struct vcpu *v;
+    struct sched_unit *unit;

     ASSERT(system_state == SYS_STATE_resume);

-    for_each_vcpu ( d, v )
+    for_each_sched_unit ( d, unit )
     {
         spinlock_t *lock;
-        unsigned int old_cpu = v->processor;
-        struct sched_unit *unit = v->sched_unit;
+        unsigned int old_cpu = sched_unit_cpu(unit);
         struct sched_resource *res;

         ASSERT(!unit_runnable(unit));
@@ -896,18 +929,22 @@ void restore_vcpu_affinity(struct domain *d)
                     cpupool_domain_cpumask(d));
         if ( cpumask_empty(cpumask_scratch_cpu(cpu)) )
         {
-            if ( v->affinity_broken )
+            if ( sched_check_affinity_broken(unit) )
             {
-                sched_set_affinity(v, unit->cpu_hard_affinity_saved, NULL);
-                v->affinity_broken = 0;
+                /* Affinity settings of one vcpu are for the complete unit. */
+                sched_set_affinity(unit->vcpu_list,
+                                   unit->cpu_hard_affinity_saved, NULL);
+                sched_reset_affinity_broken(unit);
                 cpumask_and(cpumask_scratch_cpu(cpu), unit->cpu_hard_affinity,
                             cpupool_domain_cpumask(d));
             }

             if ( cpumask_empty(cpumask_scratch_cpu(cpu)) )
             {
-                printk(XENLOG_DEBUG "Breaking affinity for %pv\n", v);
-                sched_set_affinity(v, &cpumask_all, NULL);
+                /* Affinity settings of one vcpu are for the complete unit. */
+                printk(XENLOG_DEBUG "Breaking affinity for %pv\n",
+                       unit->vcpu_list);
+                sched_set_affinity(unit->vcpu_list, &cpumask_all, NULL);
                 cpumask_and(cpumask_scratch_cpu(cpu), unit->cpu_hard_affinity,
                             cpupool_domain_cpumask(d));
             }
@@ -920,12 +957,12 @@ void restore_vcpu_affinity(struct domain *d)

         /* v->processor might have changed, so reacquire the lock. */
         lock = unit_schedule_lock_irq(unit);
-        res = sched_pick_resource(vcpu_scheduler(v), unit);
+        res = sched_pick_resource(unit_scheduler(unit), unit);
         sched_set_res(unit, res);
         spin_unlock_irq(lock);

-        if ( old_cpu != v->processor )
-            sched_move_irqs(v->sched_unit);
+        if ( old_cpu != sched_unit_cpu(unit) )
+            sched_move_irqs(unit);
     }

     domain_update_node_affinity(d);
@@ -939,7 +976,6 @@ void restore_vcpu_affinity(struct domain *d)
 int cpu_disable_scheduler(unsigned int cpu)
 {
     struct domain *d;
-    struct vcpu *v;
     struct cpupool *c;
     cpumask_t online_affinity;
     int ret = 0;
@@ -950,17 +986,19 @@ int cpu_disable_scheduler(unsigned int cpu)

     for_each_domain_in_cpupool ( d, c )
     {
-        for_each_vcpu ( d, v )
+        struct sched_unit *unit;
+
+        for_each_sched_unit ( d, unit )
         {
             unsigned long flags;
-            struct sched_unit *unit = v->sched_unit;
             spinlock_t *lock = unit_schedule_lock_irqsave(unit, &flags);

             cpumask_and(&online_affinity, unit->cpu_hard_affinity, c->cpu_valid);
             if ( cpumask_empty(&online_affinity) &&
                  cpumask_test_cpu(cpu, unit->cpu_hard_affinity) )
             {
-                if ( v->affinity_broken )
+                /* TODO: multiple vcpus per unit. */
+                if ( unit->vcpu_list->affinity_broken )
                 {
                     /* The vcpu is temporarily pinned, can't move it. */
                     unit_schedule_unlock_irqrestore(lock, flags, unit);
@@ -968,14 +1006,15 @@ int cpu_disable_scheduler(unsigned int cpu)
                     break;
                 }

-                printk(XENLOG_DEBUG "Breaking affinity for %pv\n", v);
+                printk(XENLOG_DEBUG "Breaking affinity for %pv\n",
+                       unit->vcpu_list);

-                sched_set_affinity(v, &cpumask_all, NULL);
+                sched_set_affinity(unit->vcpu_list, &cpumask_all, NULL);
             }

-            if ( v->processor != cpu )
+            if ( sched_unit_cpu(unit) != sched_get_resource_cpu(cpu) )
             {
-                /* The vcpu is not on this cpu, so we can move on. */
+                /* The unit is not on this cpu, so we can move on. */
                 unit_schedule_unlock_irqrestore(lock, flags, unit);
                 continue;
             }
@@ -988,17 +1027,18 @@ int cpu_disable_scheduler(unsigned int cpu)
              *  * the scheduler will always find a suitable solution, or
              *    things would have failed before getting in here.
              */
-            vcpu_migrate_start(v);
+            /* TODO: multiple vcpus per unit. */
+            vcpu_migrate_start(unit->vcpu_list);
             unit_schedule_unlock_irqrestore(lock, flags, unit);

-            vcpu_migrate_finish(v);
+            vcpu_migrate_finish(unit->vcpu_list);

             /*
              * The only caveat, in this case, is that if a vcpu active in
              * the hypervisor isn't migratable. In this case, the caller
              * should try again after releasing and reaquiring all locks.
              */
-            if ( v->processor == cpu )
+            if ( sched_unit_cpu(unit) == sched_get_resource_cpu(cpu) )
                 ret = -EAGAIN;
         }
     }
@@ -1009,8 +1049,8 @@
 static int cpu_disable_scheduler_check(unsigned int cpu)
 {
     struct domain *d;
-    struct vcpu *v;
     struct cpupool *c;
+    struct vcpu *v;

     c = per_cpu(cpupool, cpu);
     if ( c == NULL )
-- 
2.16.4
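[A note on the error unwinding in sched_move_domain() above: unit_priv[]
is now filled via a running unit_idx instead of being indexed by
vcpu_id, so the failure path can no longer revisit entries with
for_each_vcpu(). It instead relies on xzalloc_array() returning a
zeroed array and walks it up to the first NULL slot. A standalone
sketch of that idiom, in plain C with calloc()/free() and a made-up
alloc_unit_priv() standing in for the Xen allocators:

#include <stdlib.h>

/*
 * Allocate n per-unit payloads; on failure free exactly those payloads
 * already allocated. Since calloc() zero-fills the pointer array, the
 * first NULL slot marks how far allocation got, so the unwind loop
 * needs no separate counter.
 */
static void **alloc_unit_priv(size_t n, void *(*alloc_one)(void))
{
    void **priv = calloc(n, sizeof(*priv));
    size_t i;

    if ( priv == NULL )
        return NULL;

    for ( i = 0; i < n; i++ )
    {
        priv[i] = alloc_one();
        if ( priv[i] == NULL )
        {
            for ( i = 0; priv[i] != NULL; i++ )
                free(priv[i]);
            free(priv);
            return NULL;
        }
    }

    return priv;
}

In the patch the same shape appears with xzalloc_array() and
sched_free_vdata(); the TODO next to it notes that sizing the array by
d->max_vcpus becomes an over-allocation once a unit can hold more than
one vcpu.]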