Hello everyone,

There was some interest at OSPM'25 in exploring the use of the push task
mechanism for idle and newidle balance. This series implements one such
idea. The main reason for the RFC is to understand whether this is the
implementation people were in favor of before trying to optimize it for
all the workloads from my test setup.

Note: The current performance of the prototype is rough. I haven't
optimized it yet since I would love some feedback on the approach first.

Current approach
================

The push task framework for the fair class has been cherry-picked from
Vincent's series [1] and has been implemented for the !EAS case.

This series implements the idea from Valentin [2] where, in the presence
of pushable tasks, the CPU sets itself on a per-LLC "overloaded_mask".

The inter-NUMA newidle balance has been modified to traverse the CPUs set
on the overloaded mask, first in the local LLC, and then the CPUs set on
the overloaded mask of other LLCs in the same NUMA node, with the goal of
pulling a single task towards itself rather than performing a
full-fledged load balance.

This implements some of the ideas from David Vernet's SHARED_RUNQ
prototype [3] except that, instead of a single SHARED_RUNQ per LLC /
per shard, the overloaded mask serves as an indicator of the per-CPU rqs
containing pushable tasks that can be migrated to the CPU going idle.
This avoids a per-SHARED_RUNQ lock at the expense of maintaining the
overloaded cpumask.

The push callback itself has been modified to try to push the tasks on
the pushable task list to one of the CPUs on the "nohz.idle_cpus_mask",
taking the load off of idle balancing.
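To make the intended flow concrete, here is a rough, illustrative sketch
of the two halves of the idea. It is written for this cover letter only
and is not lifted from the patches below; helpers such as
llc_overloaded_mask(), rq_has_pushable_tasks() and try_pull_one_task()
are placeholders:

/* Illustrative sketch only -- the real code lives in the patches below. */

/* Producer side: a CPU flags itself when it has pushable fair tasks. */
static void update_llc_overloaded(struct rq *rq)
{
	struct cpumask *mask = llc_overloaded_mask(cpu_of(rq));	/* placeholder */

	if (rq_has_pushable_tasks(rq))				/* placeholder */
		cpumask_set_cpu(cpu_of(rq), mask);
	else
		cpumask_clear_cpu(cpu_of(rq), mask);
}

/*
 * Consumer side: a newly idle CPU pulls at most one task from an
 * overloaded CPU in its LLC instead of doing a full load balance.
 */
static int newidle_pull_from_overloaded(struct rq *this_rq)
{
	const struct cpumask *mask = llc_overloaded_mask(cpu_of(this_rq));
	int cpu;

	for_each_cpu(cpu, mask) {
		if (try_pull_one_task(this_rq, cpu_rq(cpu)))	/* placeholder */
			return 1;	/* one task is enough */
	}

	return 0;
}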
Clarification required
======================

I believe using the per-CPU pushable task list as a proxy for a single
SHARED_RUNQ was the idea Peter was implying during the discussion. Is
this correct, or did I completely misunderstand it? P.S. SHARED_RUNQ
could also be modelled as a large per-LLC push list (see the sketch
below).
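For illustration, the two data-structure shapes being compared would look
roughly as follows. The struct and field names here are made up for the
comparison and are not taken from either prototype:

/*
 * Per-CPU pushable list (this series): one plist per rq, serialized by
 * the existing rq->lock, plus a shared per-LLC cpumask as the index.
 */
struct per_cpu_push_list {
	struct plist_head	pushable_tasks;	/* under rq->lock */
};

/*
 * Per-LLC shared push list (SHARED_RUNQ-like): one queue for the whole
 * LLC, which needs its own LLC-wide lock.
 */
struct per_llc_push_list {
	raw_spinlock_t		lock;
	struct list_head	tasks;
};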
An alternate implementation is to allow CPUs to go idle as quickly as
possible and then rely completely on the push mechanism and the
"idle_cpu_mask" to push tasks to an idle CPU; however, this puts the
burden of moving tasks on a busy, overloaded CPU, which may not be ideal.

Since folks mentioned using the "push mechanism" for newidle balance, was
the above idea the one they had in mind?

There seems to be some clear advantage to doing a complete balance in the
newidle path. Since the schedstats are not rigged up yet for the new
approach, I'm not completely sure where the advantages vs. disadvantages
currently lie.

If the current approach is right, I'll dig deeper to try to address all
the shortcomings of this prototype.

Systems with a unified LLC will likely run into bottlenecks maintaining a
large per-LLC mask that can see multiple concurrent updates. I have plans
to implement an "sd_shard" which shards the large LLC, making the cpumask
maintenance less heavy on these systems.

References
==========

[1] https://lore.kernel.org/lkml/20250302210539.1563190-6-vincent.guittot@linaro.org/
[2] https://lore.kernel.org/lkml/xhsmh1putoxbz.mognet@vschneid-thinkpadt14sgen2i.remote.csb/
[3] https://lore.kernel.org/lkml/20231212003141.216236-1-void@manifault.com/

--
K Prateek Nayak (4):
  sched/fair: Introduce overloaded_mask in sched_domain_shared
  sched/fair: Update overloaded mask in presence of pushable task
  sched/fair: Rework inter-NUMA newidle balancing
  sched/fair: Proactive idle balance using push mechanism

Vincent Guittot (1):
  sched/fair: Add push task framework

 include/linux/sched/topology.h |   1 +
 kernel/sched/fair.c            | 297 +++++++++++++++++++++++++++++++--
 kernel/sched/sched.h           |   2 +
 kernel/sched/topology.c        |  25 ++-
 4 files changed, 306 insertions(+), 19 deletions(-)


base-commit: 6432e163ba1b7d80b5876792ce53e511f041ab91
--
2.34.1


The ADMV1013 is a wideband, microwave upconverter optimized for
point to point microwave radio designs operating in the 24 GHz to
44 GHz radio frequency (RF) range.

Datasheet:
https://www.analog.com/media/en/technical-documentation/data-sheets/ADMV1013.pdf

NOTE:
Currently depends on a 64-bit architecture since the input clock that
serves as the Local Oscillator should support values in the range
5.4 GHz to 10.25 GHz.

We might need some scaling implementation in the clock framework so that
u64 types are supported when using 32-bit architectures.

Antoniu Miclaus (2):
  iio: frequency: admv1013: add support for ADMV1013
  dt-bindings: iio: frequency: add admv1013 doc

 .../bindings/iio/frequency/adi,admv1013.yaml | 110 ++++
 drivers/iio/frequency/Kconfig                |  13 +
 drivers/iio/frequency/Makefile               |   1 +
 drivers/iio/frequency/admv1013.c             | 579 ++++++++++++++++++
 4 files changed, 703 insertions(+)
 create mode 100644 Documentation/devicetree/bindings/iio/frequency/adi,admv1013.yaml
 create mode 100644 drivers/iio/frequency/admv1013.c

--
2.33.1
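As context for the 64-bit dependency above: clk_get_rate() returns an
unsigned long, which is 32 bits wide on 32-bit architectures, so an LO
rate in the 5.4 GHz to 10.25 GHz range cannot be represented there. A
minimal illustration of the limitation (the helper name is made up for
this example):

#include <linux/clk.h>
#include <linux/types.h>

/*
 * Illustration only: on a 32-bit architecture, unsigned long tops out at
 * ~4.29 GHz, so clk_get_rate() cannot report a 5.4 GHz - 10.25 GHz LO.
 * This is why the driver's Kconfig entry carries "depends on 64BIT" for
 * now.
 */
static bool lo_rate_is_representable(struct clk *lo_clk)
{
	u64 rate = clk_get_rate(lo_clk);	/* already truncated on 32-bit */

	return rate >= 5400000000ULL && rate <= 10250000000ULL;
}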
From: Vincent Guittot <vincent.guittot@linaro.org>

Add the skeleton for the push task infrastructure. The empty
push_fair_task() prototype will be used to implement proactive idle
balancing in subsequent commits.

[ prateek: Broke off relevant bits from [1] ]

Link: https://lore.kernel.org/all/20250302210539.1563190-6-vincent.guittot@linaro.org/ [1]
Signed-off-by: Vincent Guittot <vincent.guittot@linaro.org>
Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com>
---
 kernel/sched/fair.c  | 85 ++++++++++++++++++++++++++++++++++++++++++++
 kernel/sched/sched.h |  2 ++
 2 files changed, 87 insertions(+)
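For readers who have not used it before, the pushable-task list
introduced below is a priority-sorted list (<linux/plist.h>): entries are
kept in ascending prio order, so plist_first_entry() always returns the
most urgent pushable task. A small, self-contained sketch of those
semantics (the struct and helper names are illustrative, not from this
patch):

#include <linux/plist.h>

struct push_candidate {
	struct plist_node	node;
	int			id;
};

/* Track a candidate: plist keeps the list sorted by ascending prio. */
static void track_candidate(struct plist_head *head,
			    struct push_candidate *pc, int prio)
{
	plist_node_init(&pc->node, prio);
	plist_add(&pc->node, head);
}

/* Peek at the most urgent candidate, i.e. the lowest prio value. */
static struct push_candidate *most_urgent_candidate(struct plist_head *head)
{
	if (plist_head_empty(head))
		return NULL;

	return plist_first_entry(head, struct push_candidate, node);
}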
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
18
index XXXXXXX..XXXXXXX 100644
19
--- a/kernel/sched/fair.c
20
+++ b/kernel/sched/fair.c
21
@@ -XXX,XX +XXX,XX @@ enqueue_task_fair(struct rq *rq, struct task_struct *p, int flags)
22
    hrtick_update(rq);
23
}
24
25
+static void fair_remove_pushable_task(struct rq *rq, struct task_struct *p);
26
static void set_next_buddy(struct sched_entity *se);
27
28
/*
29
@@ -XXX,XX +XXX,XX @@ static int dequeue_entities(struct rq *rq, struct sched_entity *se, int flags)
30
        h_nr_idle = task_has_idle_policy(p);
31
        if (task_sleep || task_delayed || !se->sched_delayed)
32
            h_nr_runnable = 1;
33
+
34
+        fair_remove_pushable_task(rq, p);
35
    } else {
36
        cfs_rq = group_cfs_rq(se);
37
        slice = cfs_rq_min_slice(cfs_rq);
38
@@ -XXX,XX +XXX,XX @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu)
39
    return target;
40
}
41
42
+static inline bool fair_push_task(struct task_struct *p)
43
+{
44
+    if (!task_on_rq_queued(p))
45
+        return false;
46
+
47
+    if (p->se.sched_delayed)
48
+        return false;
49
+
50
+    if (p->nr_cpus_allowed == 1)
51
+        return false;
52
+
53
+    return true;
54
+}
55
+
56
+static inline int has_pushable_tasks(struct rq *rq)
57
+{
58
+    return !plist_head_empty(&rq->cfs.pushable_tasks);
59
+}
60
+
61
+/*
62
+ * See if the non running fair tasks on this rq can be sent on other CPUs
63
+ * that fits better with their profile.
64
+ */
65
+static bool push_fair_task(struct rq *rq)
66
+{
67
+    return false;
68
+}
69
+
70
+static void push_fair_tasks(struct rq *rq)
71
+{
72
+    /* push_fair_task() will return true if it moved a fair task */
73
+    while (push_fair_task(rq))
74
+        ;
75
+}
76
+
77
+static DEFINE_PER_CPU(struct balance_callback, fair_push_head);
78
+
79
+static inline void fair_queue_pushable_tasks(struct rq *rq)
80
+{
81
+    if (!has_pushable_tasks(rq))
82
+        return;
83
+
84
+    queue_balance_callback(rq, &per_cpu(fair_push_head, rq->cpu), push_fair_tasks);
85
+}
86
+static void fair_remove_pushable_task(struct rq *rq, struct task_struct *p)
87
+{
88
+    plist_del(&p->pushable_tasks, &rq->cfs.pushable_tasks);
89
+}
90
+
91
+static void fair_add_pushable_task(struct rq *rq, struct task_struct *p)
92
+{
93
+    if (fair_push_task(p)) {
94
+        plist_del(&p->pushable_tasks, &rq->cfs.pushable_tasks);
95
+        plist_node_init(&p->pushable_tasks, p->prio);
96
+        plist_add(&p->pushable_tasks, &rq->cfs.pushable_tasks);
97
+    }
98
+}
99
+
100
/*
101
* select_task_rq_fair: Select target runqueue for the waking task in domains
102
* that have the relevant SD flag set. In practice, this is SD_BALANCE_WAKE,
103
@@ -XXX,XX +XXX,XX @@ balance_fair(struct rq *rq, struct task_struct *prev, struct rq_flags *rf)
104
    return sched_balance_newidle(rq, rf) != 0;
105
}
106
#else
107
+static inline void fair_queue_pushable_tasks(struct rq *rq) {}
108
+static void fair_remove_pushable_task(struct rq *rq, struct task_struct *p) {}
109
+static inline void fair_add_pushable_task(struct rq *rq, struct task_struct *p) {}
110
static inline void set_task_max_allowed_capacity(struct task_struct *p) {}
111
#endif /* CONFIG_SMP */
112
113
@@ -XXX,XX +XXX,XX @@ pick_next_task_fair(struct rq *rq, struct task_struct *prev, struct rq_flags *rf
114
        put_prev_entity(cfs_rq, pse);
115
        set_next_entity(cfs_rq, se);
116
117
+        /*
118
+         * The previous task might be eligible for being pushed on
119
+         * another cpu if it is still active.
120
+         */
121
+        fair_add_pushable_task(rq, prev);
122
+
123
        __set_next_task_fair(rq, p, true);
124
    }
125
126
@@ -XXX,XX +XXX,XX @@ static void put_prev_task_fair(struct rq *rq, struct task_struct *prev, struct t
127
        cfs_rq = cfs_rq_of(se);
128
        put_prev_entity(cfs_rq, se);
129
    }
130
+
131
+    /*
132
+     * The previous task might be eligible for being pushed on another cpu
133
+     * if it is still active.
134
+     */
135
+    fair_add_pushable_task(rq, prev);
136
+
137
}
138
139
/*
140
@@ -XXX,XX +XXX,XX @@ static void __set_next_task_fair(struct rq *rq, struct task_struct *p, bool firs
141
{
142
    struct sched_entity *se = &p->se;
143
144
+    fair_remove_pushable_task(rq, p);
145
+
146
#ifdef CONFIG_SMP
147
    if (task_on_rq_queued(p)) {
148
        /*
149
@@ -XXX,XX +XXX,XX @@ static void __set_next_task_fair(struct rq *rq, struct task_struct *p, bool firs
150
    if (hrtick_enabled_fair(rq))
151
        hrtick_start_fair(rq, p);
152
153
+    /*
154
+     * Try to push prev task before checking misfit for next task as
155
+     * the migration of prev can make next fitting the CPU
156
+     */
157
+    fair_queue_pushable_tasks(rq);
158
    update_misfit_status(p, rq);
159
    sched_fair_update_stop_tick(rq, p);
160
}
161
@@ -XXX,XX +XXX,XX @@ void init_cfs_rq(struct cfs_rq *cfs_rq)
162
    cfs_rq->tasks_timeline = RB_ROOT_CACHED;
163
    cfs_rq->min_vruntime = (u64)(-(1LL << 20));
164
#ifdef CONFIG_SMP
165
+    plist_head_init(&cfs_rq->pushable_tasks);
166
    raw_spin_lock_init(&cfs_rq->removed.lock);
167
#endif
168
}
169
diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h
170
index XXXXXXX..XXXXXXX 100644
171
--- a/kernel/sched/sched.h
172
+++ b/kernel/sched/sched.h
173
@@ -XXX,XX +XXX,XX @@ struct cfs_rq {
174
    struct list_head    leaf_cfs_rq_list;
175
    struct task_group    *tg;    /* group that "owns" this runqueue */
176
177
+    struct plist_head    pushable_tasks;
178
+
179
    /* Locally cached copy of our task_group's idle value */
180
    int            idle;
181
182
--
183
2.34.1
Introduce a new cpumask member "overloaded_mask" in sched_domain_shared.
This mask will be used to keep track of overloaded CPUs with pushable
tasks on them and will later be used by newidle balance to only scan
through the overloaded CPUs when pulling a task.

Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com>
---
 include/linux/sched/topology.h |  1 +
 kernel/sched/topology.c        | 25 ++++++++++++++++++-------
 2 files changed, 19 insertions(+), 7 deletions(-)
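For context, the later patches in the series find this mask through the
per-CPU sd_llc_shared pointer under RCU. Roughly (an illustrative sketch,
not part of this patch):

/*
 * Illustrative sketch of a consumer walking its LLC's overloaded_mask.
 * The real users are added by the later patches in this series.
 */
static int count_llc_overloaded_cpus(int this_cpu)
{
	struct sched_domain_shared *sds;
	int cpu, nr = 0;

	rcu_read_lock();
	sds = rcu_dereference(per_cpu(sd_llc_shared, this_cpu));
	if (sds && sds->overloaded_mask) {
		for_each_cpu(cpu, sds->overloaded_mask)
			nr++;	/* cpu_rq(cpu) recently had pushable tasks */
	}
	rcu_read_unlock();

	return nr;
}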
diff --git a/include/linux/sched/topology.h b/include/linux/sched/topology.h
13
index XXXXXXX..XXXXXXX 100644
14
--- a/include/linux/sched/topology.h
15
+++ b/include/linux/sched/topology.h
16
@@ -XXX,XX +XXX,XX @@ struct sched_domain_shared {
17
    atomic_t    nr_busy_cpus;
18
    int        has_idle_cores;
19
    int        nr_idle_scan;
20
+    cpumask_var_t    overloaded_mask;
21
};
22
23
struct sched_domain {
24
diff --git a/kernel/sched/topology.c b/kernel/sched/topology.c
25
index XXXXXXX..XXXXXXX 100644
26
--- a/kernel/sched/topology.c
27
+++ b/kernel/sched/topology.c
28
@@ -XXX,XX +XXX,XX @@ static void destroy_sched_domain(struct sched_domain *sd)
29
     */
30
    free_sched_groups(sd->groups, 1);
31
32
-    if (sd->shared && atomic_dec_and_test(&sd->shared->ref))
33
+    if (sd->shared && atomic_dec_and_test(&sd->shared->ref)) {
34
+        free_cpumask_var(sd->shared->overloaded_mask);
35
        kfree(sd->shared);
36
+    }
37
    kfree(sd);
38
}
39
40
@@ -XXX,XX +XXX,XX @@ static int __sdt_alloc(const struct cpumask *cpu_map)
41
            return -ENOMEM;
42
43
        for_each_cpu(j, cpu_map) {
44
+            int node = cpu_to_node(j);
45
            struct sched_domain *sd;
46
            struct sched_domain_shared *sds;
47
            struct sched_group *sg;
48
            struct sched_group_capacity *sgc;
49
50
            sd = kzalloc_node(sizeof(struct sched_domain) + cpumask_size(),
51
-                    GFP_KERNEL, cpu_to_node(j));
52
+                    GFP_KERNEL, node);
53
            if (!sd)
54
                return -ENOMEM;
55
56
            *per_cpu_ptr(sdd->sd, j) = sd;
57
58
            sds = kzalloc_node(sizeof(struct sched_domain_shared),
59
-                    GFP_KERNEL, cpu_to_node(j));
60
+                    GFP_KERNEL, node);
61
            if (!sds)
62
                return -ENOMEM;
63
64
+            if (!zalloc_cpumask_var_node(&sds->overloaded_mask, GFP_KERNEL, node))
65
+                return -ENOMEM;
66
+
67
            *per_cpu_ptr(sdd->sds, j) = sds;
68
69
            sg = kzalloc_node(sizeof(struct sched_group) + cpumask_size(),
70
-                    GFP_KERNEL, cpu_to_node(j));
71
+                    GFP_KERNEL, node);
72
            if (!sg)
73
                return -ENOMEM;
74
75
@@ -XXX,XX +XXX,XX @@ static int __sdt_alloc(const struct cpumask *cpu_map)
76
            *per_cpu_ptr(sdd->sg, j) = sg;
77
78
            sgc = kzalloc_node(sizeof(struct sched_group_capacity) + cpumask_size(),
79
-                    GFP_KERNEL, cpu_to_node(j));
80
+                    GFP_KERNEL, node);
81
            if (!sgc)
82
                return -ENOMEM;
83
84
@@ -XXX,XX +XXX,XX @@ static void __sdt_free(const struct cpumask *cpu_map)
85
                kfree(*per_cpu_ptr(sdd->sd, j));
86
            }
87
88
-            if (sdd->sds)
89
-                kfree(*per_cpu_ptr(sdd->sds, j));
90
+            if (sdd->sds) {
91
+                struct sched_domain_shared *sds = *per_cpu_ptr(sdd->sds, j);
92
+
93
+                if (sds)
94
+                    free_cpumask_var(sds->overloaded_mask);
95
+                kfree(sds);
96
+            }
97
            if (sdd->sg)
98
                kfree(*per_cpu_ptr(sdd->sg, j));
99
            if (sdd->sgc)
100
--
101
2.34.1
In the presence of pushable tasks on a CPU, set it on the newly
introduced "overloaded_mask" in the sched_domain_shared struct. This will
be used by the newidle balance to limit the scanning to these overloaded
CPUs since they contain tasks that could be run on the newly idle target.

Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com>
---
 kernel/sched/fair.c | 24 ++++++++++++++++++++++++
 1 file changed, 24 insertions(+)
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
12
index XXXXXXX..XXXXXXX 100644
13
--- a/kernel/sched/fair.c
14
+++ b/kernel/sched/fair.c
15
@@ -XXX,XX +XXX,XX @@ static int find_energy_efficient_cpu(struct task_struct *p, int prev_cpu)
16
    return target;
17
}
18
19
+static inline void update_overloaded_mask(int cpu, bool contains_pushable)
20
+{
21
+    struct sched_domain_shared *sd_share = rcu_dereference(per_cpu(sd_llc_shared, cpu));
22
+    cpumask_var_t overloaded_mask;
23
+
24
+    if (!sd_share)
25
+        return;
26
+
27
+    overloaded_mask = sd_share->overloaded_mask;
28
+    if (!overloaded_mask)
29
+        return;
30
+
31
+    if (contains_pushable)
32
+        cpumask_set_cpu(cpu, overloaded_mask);
33
+    else
34
+        cpumask_clear_cpu(cpu, overloaded_mask);
35
+}
36
+
37
static inline bool fair_push_task(struct task_struct *p)
38
{
39
    if (!task_on_rq_queued(p))
40
@@ -XXX,XX +XXX,XX @@ static inline void fair_queue_pushable_tasks(struct rq *rq)
41
static void fair_remove_pushable_task(struct rq *rq, struct task_struct *p)
42
{
43
    plist_del(&p->pushable_tasks, &rq->cfs.pushable_tasks);
44
+
45
+    if (!has_pushable_tasks(rq))
46
+        update_overloaded_mask(rq->cpu, false);
47
}
48
49
static void fair_add_pushable_task(struct rq *rq, struct task_struct *p)
50
{
51
    if (fair_push_task(p)) {
52
+        if (!has_pushable_tasks(rq))
53
+            update_overloaded_mask(rq->cpu, true);
54
+
55
        plist_del(&p->pushable_tasks, &rq->cfs.pushable_tasks);
56
        plist_node_init(&p->pushable_tasks, p->prio);
57
        plist_add(&p->pushable_tasks, &rq->cfs.pushable_tasks);
58
--
59
2.34.1
Proactively try to push tasks to one of the CPUs set in the
"nohz.idle_cpus_mask" from the push callback.

pick_next_pushable_fair_task() is taken from Vincent's series [1] as is,
but the locking rules in push_fair_task() have been relaxed to release
the local rq lock after dequeuing the task and to reacquire it after
pushing the task to the idle target.

double_lock_balance() used in RT seems necessary to maintain strict
priority ordering; however, that may not be necessary for fair tasks.

Link: https://lore.kernel.org/all/20250302210539.1563190-6-vincent.guittot@linaro.org/ [1]
Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com>
---
 kernel/sched/fair.c | 59 +++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 59 insertions(+)


The ADMV1013 is a wideband, microwave upconverter optimized
for point to point microwave radio designs operating in the
24 GHz to 44 GHz radio frequency (RF) range.

Datasheet:
https://www.analog.com/media/en/technical-documentation/data-sheets/ADMV1013.pdf

Signed-off-by: Antoniu Miclaus <antoniu.miclaus@analog.com>
---
 drivers/iio/frequency/Kconfig    |  13 +
 drivers/iio/frequency/Makefile   |   1 +
 drivers/iio/frequency/admv1013.c | 579 +++++++++++++++++++++++++++++++
 3 files changed, 593 insertions(+)
 create mode 100644 drivers/iio/frequency/admv1013.c

diff --git a/drivers/iio/frequency/Kconfig b/drivers/iio/frequency/Kconfig
index XXXXXXX..XXXXXXX 100644
--- a/drivers/iio/frequency/Kconfig
+++ b/drivers/iio/frequency/Kconfig
@@ -XXX,XX +XXX,XX @@ config ADF4371
21
22
     To compile this driver as a module, choose M here: the
23
     module will be called adf4371.
24
+
25
+config ADMV1013
26
+    tristate "Analog Devices ADMV1013 Microwave Upconverter"
27
+    depends on SPI
28
+    depends on COMMON_CLK
29
+    depends on 64BIT
30
+    help
31
+     Say yes here to build support for Analog Devices ADMV1013
32
+     24 GHz to 44 GHz, Wideband, Microwave Upconverter.
33
+
34
+     To compile this driver as a module, choose M here: the
35
+     module will be called admv1013.
36
+
37
endmenu
38
endmenu
39
diff --git a/drivers/iio/frequency/Makefile b/drivers/iio/frequency/Makefile
40
index XXXXXXX..XXXXXXX 100644
41
--- a/drivers/iio/frequency/Makefile
42
+++ b/drivers/iio/frequency/Makefile
43
@@ -XXX,XX +XXX,XX @@
44
obj-$(CONFIG_AD9523) += ad9523.o
45
obj-$(CONFIG_ADF4350) += adf4350.o
46
obj-$(CONFIG_ADF4371) += adf4371.o
47
+obj-$(CONFIG_ADMV1013) += admv1013.o
48
diff --git a/drivers/iio/frequency/admv1013.c b/drivers/iio/frequency/admv1013.c
49
new file mode 100644
50
index XXXXXXX..XXXXXXX
51
--- /dev/null
52
+++ b/drivers/iio/frequency/admv1013.c
53
@@ -XXX,XX +XXX,XX @@
54
+// SPDX-License-Identifier: GPL-2.0-only
55
+/*
56
+ * ADMV1013 driver
57
+ *
58
+ * Copyright 2021 Analog Devices Inc.
59
+ */
60
+
61
+#include <linux/bitfield.h>
62
+#include <linux/bitops.h>
63
+#include <linux/bits.h>
64
+#include <linux/clk.h>
65
+#include <linux/clkdev.h>
66
+#include <linux/clk-provider.h>
67
+#include <linux/device.h>
68
+#include <linux/iio/iio.h>
69
+#include <linux/module.h>
70
+#include <linux/notifier.h>
71
+#include <linux/regmap.h>
72
+#include <linux/regulator/consumer.h>
73
+#include <linux/spi/spi.h>
74
+
75
+#include <asm/unaligned.h>
76
+
77
+/* ADMV1013 Register Map */
78
+#define ADMV1013_REG_SPI_CONTROL        0x00
79
+#define ADMV1013_REG_ALARM            0x01
80
+#define ADMV1013_REG_ALARM_MASKS        0x02
81
+#define ADMV1013_REG_ENABLE            0x03
82
+#define ADMV1013_REG_LO_AMP_I            0x05
83
+#define ADMV1013_REG_LO_AMP_Q            0x06
84
+#define ADMV1013_REG_OFFSET_ADJUST_I        0x07
85
+#define ADMV1013_REG_OFFSET_ADJUST_Q        0x08
86
+#define ADMV1013_REG_QUAD            0x09
87
+#define ADMV1013_REG_VVA_TEMP_COMP        0x0A
88
+
89
+/* ADMV1013_REG_SPI_CONTROL Map */
90
+#define ADMV1013_PARITY_EN_MSK            BIT(15)
91
+#define ADMV1013_SPI_SOFT_RESET_MSK        BIT(14)
92
+#define ADMV1013_CHIP_ID_MSK            GENMASK(11, 4)
93
+#define ADMV1013_CHIP_ID            0xA
94
+#define ADMV1013_REVISION_ID_MSK        GENMASK(3, 0)
95
+
96
+/* ADMV1013_REG_ALARM Map */
97
+#define ADMV1013_PARITY_ERROR_MSK        BIT(15)
98
+#define ADMV1013_TOO_FEW_ERRORS_MSK        BIT(14)
99
+#define ADMV1013_TOO_MANY_ERRORS_MSK        BIT(13)
100
+#define ADMV1013_ADDRESS_RANGE_ERROR_MSK    BIT(12)
101
+
102
+/* ADMV1013_REG_ENABLE Map */
103
+#define ADMV1013_VGA_PD_MSK            BIT(15)
104
+#define ADMV1013_MIXER_PD_MSK            BIT(14)
105
+#define ADMV1013_QUAD_PD_MSK            GENMASK(13, 11)
106
+#define ADMV1013_BG_PD_MSK            BIT(10)
107
+#define ADMV1013_MIXER_IF_EN_MSK        BIT(7)
108
+#define ADMV1013_DET_EN_MSK            BIT(5)
109
+
110
+/* ADMV1013_REG_LO_AMP_I Map */
111
+#define ADMV1013_LOAMP_PH_ADJ_I_FINE_MSK    GENMASK(13, 7)
112
+#define ADMV1013_MIXER_VGATE_MSK        GENMASK(6, 0)
113
+
114
+/* ADMV1013_REG_LO_AMP_Q Map */
115
+#define ADMV1013_LOAMP_PH_ADJ_Q_FINE_MSK    GENMASK(13, 7)
116
+
117
+/* ADMV1013_REG_OFFSET_ADJUST_I Map */
118
+#define ADMV1013_MIXER_OFF_ADJ_I_P_MSK        GENMASK(15, 9)
119
+#define ADMV1013_MIXER_OFF_ADJ_I_N_MSK        GENMASK(8, 2)
120
+
121
+/* ADMV1013_REG_OFFSET_ADJUST_Q Map */
122
+#define ADMV1013_MIXER_OFF_ADJ_Q_P_MSK        GENMASK(15, 9)
123
+#define ADMV1013_MIXER_OFF_ADJ_Q_N_MSK        GENMASK(8, 2)
124
+
125
+/* ADMV1013_REG_QUAD Map */
126
+#define ADMV1013_QUAD_SE_MODE_MSK        GENMASK(9, 6)
127
+#define ADMV1013_QUAD_FILTERS_MSK        GENMASK(3, 0)
128
+
129
+/* ADMV1013_REG_VVA_TEMP_COMP Map */
130
+#define ADMV1013_VVA_TEMP_COMP_MSK        GENMASK(15, 0)
131
+
132
+struct admv1013_state {
133
+    struct spi_device    *spi;
134
+    struct clk        *clkin;
135
+    /* Protect against concurrent accesses to the device */
136
+    struct mutex        lock;
137
+    struct regulator    *reg;
138
+    struct notifier_block    nb;
139
+    unsigned int        quad_se_mode;
140
+    bool            vga_pd;
141
+    bool            mixer_pd;
142
+    bool            quad_pd;
143
+    bool            bg_pd;
144
+    bool            mixer_if_en;
145
+    bool            det_en;
146
+    u8            data[3] ____cacheline_aligned;
147
+};
148
+
149
+static int __admv1013_spi_read(struct admv1013_state *st, unsigned int reg,
150
+             unsigned int *val)
151
+{
152
+    int ret;
153
+    struct spi_transfer t = {0};
154
+
155
+    st->data[0] = 0x80 | (reg << 1);
156
+    st->data[1] = 0x0;
157
+    st->data[2] = 0x0;
158
+
159
+    t.rx_buf = &st->data[0];
160
+    t.tx_buf = &st->data[0];
161
+    t.len = 3;
162
+
163
+    ret = spi_sync_transfer(st->spi, &t, 1);
164
+    if (ret)
165
+        return ret;
166
+
167
+    *val = (get_unaligned_be24(&st->data[0]) >> 1) & GENMASK(15, 0);
168
+
169
+    return ret;
170
+}
171
+
172
+static int admv1013_spi_read(struct admv1013_state *st, unsigned int reg,
173
+             unsigned int *val)
174
+{
175
+    int ret;
176
+
177
+    mutex_lock(&st->lock);
178
+    ret = __admv1013_spi_read(st, reg, val);
179
+    mutex_unlock(&st->lock);
180
+
181
+    return ret;
182
+}
183
+
184
+static int __admv1013_spi_write(struct admv1013_state *st,
185
+                unsigned int reg,
186
+                unsigned int val)
187
+{
188
+    put_unaligned_be24((val << 1) | (reg << 17), &st->data[0]);
189
+
190
+    return spi_write(st->spi, &st->data[0], 3);
191
+}
192
+
193
+static int admv1013_spi_write(struct admv1013_state *st, unsigned int reg,
194
+             unsigned int val)
195
+{
196
+    int ret;
197
+
198
+    mutex_lock(&st->lock);
199
+    ret = __admv1013_spi_write(st, reg, val);
200
+    mutex_unlock(&st->lock);
201
+
202
+    return ret;
203
+}
204
+
205
+static int __admv1013_spi_update_bits(struct admv1013_state *st, unsigned int reg,
206
+                 unsigned int mask, unsigned int val)
207
+{
208
+    int ret;
209
+    unsigned int data, temp;
210
+
211
+    ret = __admv1013_spi_read(st, reg, &data);
212
+    if (ret)
213
+        return ret;
214
+
215
+    temp = (data & ~mask) | (val & mask);
216
+
217
+    return __admv1013_spi_write(st, reg, temp);
218
+}
219
+
220
+static int admv1013_spi_update_bits(struct admv1013_state *st, unsigned int reg,
221
+                 unsigned int mask, unsigned int val)
222
+{
223
+    int ret;
224
+
225
+    mutex_lock(&st->lock);
226
+    ret = __admv1013_spi_update_bits(st, reg, mask, val);
227
+    mutex_unlock(&st->lock);
228
+
229
+    return ret;
230
+}
231
+
232
+static int admv1013_read_raw(struct iio_dev *indio_dev,
233
+             struct iio_chan_spec const *chan,
234
+             int *val, int *val2, long info)
235
+{
236
+    struct admv1013_state *st = iio_priv(indio_dev);
237
+    unsigned int data;
238
+    int ret;
239
+
240
+    switch (info) {
241
+    case IIO_CHAN_INFO_OFFSET:
242
+        if (chan->channel2 == IIO_MOD_I) {
243
+            ret = admv1013_spi_read(st, ADMV1013_REG_OFFSET_ADJUST_I, &data);
244
+            if (ret)
245
+                return ret;
246
+
247
+            *val = FIELD_GET(ADMV1013_MIXER_OFF_ADJ_I_P_MSK, data);
248
+            *val2 = FIELD_GET(ADMV1013_MIXER_OFF_ADJ_I_N_MSK, data);
249
+        } else {
250
+            ret = admv1013_spi_read(st, ADMV1013_REG_OFFSET_ADJUST_Q, &data);
251
+            if (ret)
252
+                return ret;
253
+
254
+            *val = FIELD_GET(ADMV1013_MIXER_OFF_ADJ_Q_P_MSK, data);
255
+            *val2 = FIELD_GET(ADMV1013_MIXER_OFF_ADJ_Q_N_MSK, data);
256
+        }
257
+
258
+        return IIO_VAL_INT_MULTIPLE;
259
+    case IIO_CHAN_INFO_PHASE:
260
+        if (chan->channel2 == IIO_MOD_I) {
261
+            ret = admv1013_spi_read(st, ADMV1013_REG_LO_AMP_I, &data);
262
+            if (ret)
263
+                return ret;
264
+
265
+            *val = FIELD_GET(ADMV1013_LOAMP_PH_ADJ_I_FINE_MSK, data);
266
+        } else {
267
+            ret = admv1013_spi_read(st, ADMV1013_REG_LO_AMP_Q, &data);
268
+            if (ret)
269
+                return ret;
270
+
271
+            *val = FIELD_GET(ADMV1013_LOAMP_PH_ADJ_Q_FINE_MSK, data);
272
+        }
273
+
274
+        return IIO_VAL_INT;
275
+    default:
276
+        return -EINVAL;
277
+    }
278
+}
279
+
280
+static int admv1013_write_raw(struct iio_dev *indio_dev,
281
+             struct iio_chan_spec const *chan,
282
+             int val, int val2, long info)
283
+{
284
+    struct admv1013_state *st = iio_priv(indio_dev);
285
+    int ret;
286
+
287
+    switch (info) {
288
+    case IIO_CHAN_INFO_OFFSET:
289
+        val2 /= 100000;
290
+
291
+        if (chan->channel2 == IIO_MOD_I)
292
+            ret = admv1013_spi_update_bits(st, ADMV1013_REG_OFFSET_ADJUST_I,
293
+                         ADMV1013_MIXER_OFF_ADJ_I_P_MSK |
294
+                         ADMV1013_MIXER_OFF_ADJ_I_N_MSK,
295
+                         FIELD_PREP(ADMV1013_MIXER_OFF_ADJ_I_P_MSK, val) |
296
+                         FIELD_PREP(ADMV1013_MIXER_OFF_ADJ_I_N_MSK, val2));
297
+        else
298
+            ret = admv1013_spi_update_bits(st, ADMV1013_REG_OFFSET_ADJUST_Q,
299
+                         ADMV1013_MIXER_OFF_ADJ_Q_P_MSK |
300
+                         ADMV1013_MIXER_OFF_ADJ_Q_N_MSK,
301
+                         FIELD_PREP(ADMV1013_MIXER_OFF_ADJ_Q_P_MSK, val) |
302
+                         FIELD_PREP(ADMV1013_MIXER_OFF_ADJ_Q_N_MSK, val2));
303
+
304
+        return ret;
305
+    case IIO_CHAN_INFO_PHASE:
306
+        if (chan->channel2 == IIO_MOD_I)
307
+            return admv1013_spi_update_bits(st, ADMV1013_REG_LO_AMP_I,
308
+                            ADMV1013_LOAMP_PH_ADJ_I_FINE_MSK,
309
+                            FIELD_PREP(ADMV1013_LOAMP_PH_ADJ_I_FINE_MSK, val));
310
+        else
311
+            return admv1013_spi_update_bits(st, ADMV1013_REG_LO_AMP_Q,
312
+                            ADMV1013_LOAMP_PH_ADJ_Q_FINE_MSK,
313
+                            FIELD_PREP(ADMV1013_LOAMP_PH_ADJ_Q_FINE_MSK, val));
314
+    default:
315
+        return -EINVAL;
316
+    }
317
+}
318
+
319
+static int admv1013_update_quad_filters(struct admv1013_state *st)
320
+{
321
+    unsigned int filt_raw;
322
+    u64 rate = clk_get_rate(st->clkin);
323
+
324
+    if (rate >= 5400000000 && rate <= 7000000000)
325
+        filt_raw = 15;
326
+    else if (rate >= 5400000000 && rate <= 8000000000)
327
+        filt_raw = 10;
328
+    else if (rate >= 6600000000 && rate <= 9200000000)
329
+        filt_raw = 5;
330
+    else
331
+        filt_raw = 0;
332
+
333
+    return __admv1013_spi_update_bits(st, ADMV1013_REG_QUAD,
334
+                    ADMV1013_QUAD_FILTERS_MSK,
335
+                    FIELD_PREP(ADMV1013_QUAD_FILTERS_MSK, filt_raw));
336
+}
337
+
338
+static int admv1013_update_mixer_vgate(struct admv1013_state *st)
339
+{
340
+    unsigned int vcm, mixer_vgate;
341
+
342
+    vcm = regulator_get_voltage(st->reg);
343
+
344
+    if (vcm >= 0 && vcm < 1800000)
345
+        mixer_vgate = (2389 * vcm / 1000000 + 8100) / 100;
346
+    else if (vcm > 1800000 && vcm < 2600000)
347
+        mixer_vgate = (2375 * vcm / 1000000 + 125) / 100;
348
+    else
349
+        return -EINVAL;
350
+
351
+    return __admv1013_spi_update_bits(st, ADMV1013_REG_LO_AMP_I,
352
+                 ADMV1013_MIXER_VGATE_MSK,
353
+                 FIELD_PREP(ADMV1013_MIXER_VGATE_MSK, mixer_vgate));
354
+}
355
+
356
+static int admv1013_reg_access(struct iio_dev *indio_dev,
357
+             unsigned int reg,
358
+             unsigned int write_val,
359
+             unsigned int *read_val)
360
+{
361
+    struct admv1013_state *st = iio_priv(indio_dev);
362
+    int ret;
363
+
364
+    if (read_val)
365
+        ret = admv1013_spi_read(st, reg, read_val);
366
+    else
367
+        ret = admv1013_spi_write(st, reg, write_val);
368
+
369
+    return ret;
370
+}
371
+
372
+static const struct iio_info admv1013_info = {
373
+    .read_raw = admv1013_read_raw,
374
+    .write_raw = admv1013_write_raw,
375
+    .debugfs_reg_access = &admv1013_reg_access,
376
+};
377
+
378
+static int admv1013_freq_change(struct notifier_block *nb, unsigned long action, void *data)
379
+{
380
+    struct admv1013_state *st = container_of(nb, struct admv1013_state, nb);
381
+    int ret;
382
+
383
+    if (action == POST_RATE_CHANGE) {
384
+        mutex_lock(&st->lock);
385
+        ret = notifier_from_errno(admv1013_update_quad_filters(st));
386
+        mutex_unlock(&st->lock);
387
+        return ret;
388
+    }
389
+
390
+    return NOTIFY_OK;
391
+}
392
+
393
+static void admv1013_clk_notifier_unreg(void *data)
394
+{
395
+    struct admv1013_state *st = data;
396
+
397
+    clk_notifier_unregister(st->clkin, &st->nb);
398
+}
399
+
400
+#define ADMV1013_CHAN(_channel, rf_comp) {            \
401
+    .type = IIO_ALTVOLTAGE,                    \
402
+    .modified = 1,                        \
403
+    .output = 1,                        \
404
+    .indexed = 1,                        \
405
+    .channel2 = IIO_MOD_##rf_comp,                \
406
+    .channel = _channel,                    \
407
+    .info_mask_separate = BIT(IIO_CHAN_INFO_PHASE) |    \
408
+        BIT(IIO_CHAN_INFO_OFFSET)            \
409
+    }
410
+
411
+static const struct iio_chan_spec admv1013_channels[] = {
412
+    ADMV1013_CHAN(0, I),
413
+    ADMV1013_CHAN(0, Q),
414
+};
415
+
416
+static int admv1013_init(struct admv1013_state *st)
417
+{
418
+    int ret;
419
+    unsigned int chip_id, enable_reg, enable_reg_msk;
420
+    struct spi_device *spi = st->spi;
421
+
422
+    /* Perform a software reset */
423
+    ret = __admv1013_spi_update_bits(st, ADMV1013_REG_SPI_CONTROL,
424
+                     ADMV1013_SPI_SOFT_RESET_MSK,
425
+                     FIELD_PREP(ADMV1013_SPI_SOFT_RESET_MSK, 1));
426
+    if (ret)
427
+        return ret;
428
+
429
+    ret = __admv1013_spi_update_bits(st, ADMV1013_REG_SPI_CONTROL,
430
+                     ADMV1013_SPI_SOFT_RESET_MSK,
431
+                     FIELD_PREP(ADMV1013_SPI_SOFT_RESET_MSK, 0));
432
+    if (ret)
433
+        return ret;
434
+
435
+    ret = __admv1013_spi_read(st, ADMV1013_REG_SPI_CONTROL, &chip_id);
436
+    if (ret)
437
+        return ret;
438
+
439
+    chip_id = FIELD_GET(ADMV1013_CHIP_ID_MSK, chip_id);
440
+    if (chip_id != ADMV1013_CHIP_ID) {
441
+        dev_err(&spi->dev, "Invalid Chip ID.\n");
442
+        return -EINVAL;
443
+    }
444
+
445
+    ret = __admv1013_spi_write(st, ADMV1013_REG_VVA_TEMP_COMP, 0xE700);
446
+    if (ret)
447
+        return ret;
448
+
449
+    ret = __admv1013_spi_update_bits(st, ADMV1013_REG_QUAD,
450
+                     ADMV1013_QUAD_SE_MODE_MSK,
451
+                     FIELD_PREP(ADMV1013_QUAD_SE_MODE_MSK, st->quad_se_mode));
452
+    if (ret)
453
+        return ret;
454
+
455
+    ret = admv1013_update_mixer_vgate(st);
456
+    if (ret)
457
+        return ret;
458
+
459
+    ret = admv1013_update_quad_filters(st);
460
+    if (ret)
461
+        return ret;
462
+
463
+    enable_reg_msk = ADMV1013_VGA_PD_MSK |
464
+            ADMV1013_MIXER_PD_MSK |
465
+            ADMV1013_QUAD_PD_MSK |
466
+            ADMV1013_BG_PD_MSK |
467
+            ADMV1013_MIXER_IF_EN_MSK |
468
+            ADMV1013_DET_EN_MSK;
469
+
470
+    enable_reg = FIELD_PREP(ADMV1013_VGA_PD_MSK, st->vga_pd) |
471
+            FIELD_PREP(ADMV1013_MIXER_PD_MSK, st->mixer_pd) |
472
+            FIELD_PREP(ADMV1013_QUAD_PD_MSK, st->quad_pd ? 7 : 0) |
473
+            FIELD_PREP(ADMV1013_BG_PD_MSK, st->bg_pd) |
474
+            FIELD_PREP(ADMV1013_MIXER_IF_EN_MSK, st->mixer_if_en) |
475
+            FIELD_PREP(ADMV1013_DET_EN_MSK, st->det_en);
476
+
477
+    return __admv1013_spi_update_bits(st, ADMV1013_REG_ENABLE, enable_reg_msk, enable_reg);
478
+}
479
+
480
+static void admv1013_clk_disable(void *data)
481
+{
482
+    clk_disable_unprepare(data);
483
+}
484
+
485
+static void admv1013_reg_disable(void *data)
486
+{
487
+    regulator_disable(data);
488
+}
489
+
490
+static void admv1013_powerdown(void *data)
491
+{
492
+    unsigned int enable_reg, enable_reg_msk;
493
+
494
+    /* Disable all components in the Enable Register */
495
+    enable_reg_msk = ADMV1013_VGA_PD_MSK |
496
+            ADMV1013_MIXER_PD_MSK |
497
+            ADMV1013_QUAD_PD_MSK |
498
+            ADMV1013_BG_PD_MSK |
499
+            ADMV1013_MIXER_IF_EN_MSK |
500
+            ADMV1013_DET_EN_MSK;
501
+
502
+    enable_reg = FIELD_PREP(ADMV1013_VGA_PD_MSK, 1) |
503
+            FIELD_PREP(ADMV1013_MIXER_PD_MSK, 1) |
504
+            FIELD_PREP(ADMV1013_QUAD_PD_MSK, 7) |
505
+            FIELD_PREP(ADMV1013_BG_PD_MSK, 1) |
506
+            FIELD_PREP(ADMV1013_MIXER_IF_EN_MSK, 0) |
507
+            FIELD_PREP(ADMV1013_DET_EN_MSK, 0);
508
+
509
+    admv1013_spi_update_bits(data, ADMV1013_REG_ENABLE, enable_reg_msk, enable_reg);
510
+}
511
+
512
+static int admv1013_properties_parse(struct admv1013_state *st)
513
+{
514
+    int ret;
515
+    struct spi_device *spi = st->spi;
516
+
517
+    st->vga_pd = device_property_read_bool(&spi->dev, "adi,vga-pd");
518
+    st->mixer_pd = device_property_read_bool(&spi->dev, "adi,mixer-pd");
519
+    st->quad_pd = device_property_read_bool(&spi->dev, "adi,quad-pd");
520
+    st->bg_pd = device_property_read_bool(&spi->dev, "adi,bg-pd");
521
+    st->mixer_if_en = device_property_read_bool(&spi->dev, "adi,mixer-if-en");
522
+    st->det_en = device_property_read_bool(&spi->dev, "adi,det-en");
523
+
524
+    ret = device_property_read_u32(&spi->dev, "adi,quad-se-mode", &st->quad_se_mode);
525
+    if (ret)
526
+        st->quad_se_mode = 12;
527
+
528
+    st->reg = devm_regulator_get(&spi->dev, "vcm");
529
+    if (IS_ERR(st->reg))
530
+        return dev_err_probe(&spi->dev, PTR_ERR(st->reg),
531
+                 "failed to get the common-mode voltage\n");
532
+
533
+    st->clkin = devm_clk_get(&spi->dev, "lo_in");
534
+    if (IS_ERR(st->clkin))
535
+        return dev_err_probe(&spi->dev, PTR_ERR(st->clkin),
536
+                 "failed to get the LO input clock\n");
537
+
538
+    return 0;
539
+}
540
+
541
+static int admv1013_probe(struct spi_device *spi)
542
+{
543
+    struct iio_dev *indio_dev;
544
+    struct admv1013_state *st;
545
+    int ret;
546
+
547
+    indio_dev = devm_iio_device_alloc(&spi->dev, sizeof(*st));
548
+    if (!indio_dev)
549
+        return -ENOMEM;
550
+
551
+    st = iio_priv(indio_dev);
552
+
553
+    indio_dev->dev.parent = &spi->dev;
554
+    indio_dev->info = &admv1013_info;
555
+    indio_dev->name = "admv1013";
556
+    indio_dev->channels = admv1013_channels;
557
+    indio_dev->num_channels = ARRAY_SIZE(admv1013_channels);
558
+
559
+    st->spi = spi;
560
+
561
+    ret = admv1013_properties_parse(st);
562
+    if (ret)
563
+        return ret;
564
+
565
+    ret = regulator_enable(st->reg);
566
+    if (ret) {
567
+        dev_err(&spi->dev, "Failed to enable specified Common-Mode Voltage!\n");
568
+        return ret;
569
+    }
570
+
571
+    ret = devm_add_action_or_reset(&spi->dev, admv1013_reg_disable,
572
+                 st->reg);
573
+    if (ret)
574
+        return ret;
575
+
576
+    ret = clk_prepare_enable(st->clkin);
577
+    if (ret)
578
+        return ret;
579
+
580
+    ret = devm_add_action_or_reset(&spi->dev, admv1013_clk_disable, st->clkin);
581
+    if (ret)
582
+        return ret;
583
+
584
+    st->nb.notifier_call = admv1013_freq_change;
585
+    ret = clk_notifier_register(st->clkin, &st->nb);
586
+    if (ret)
587
+        return ret;
588
+
589
+    ret = devm_add_action_or_reset(&spi->dev, admv1013_clk_notifier_unreg, st);
590
+    if (ret)
591
+        return ret;
592
+
593
+    mutex_init(&st->lock);
594
+
595
+    ret = admv1013_init(st);
596
+    if (ret) {
597
+        dev_err(&spi->dev, "admv1013 init failed\n");
598
+        return ret;
599
+    }
600
+
601
+    ret = devm_add_action_or_reset(&spi->dev, admv1013_powerdown, st);
602
+    if (ret)
603
+        return ret;
604
+
605
+    return devm_iio_device_register(&spi->dev, indio_dev);
606
+}
607
+
608
+static const struct spi_device_id admv1013_id[] = {
609
+    { "admv1013", 0},
610
+    {}
611
+};
612
+MODULE_DEVICE_TABLE(spi, admv1013_id);
613
+
614
+static const struct of_device_id admv1013_of_match[] = {
615
+    { .compatible = "adi,admv1013" },
616
+    {},
617
+};
618
+MODULE_DEVICE_TABLE(of, admv1013_of_match);
619
+
620
+static struct spi_driver admv1013_driver = {
621
+    .driver = {
622
+        .name = "admv1013",
623
+        .of_match_table = admv1013_of_match,
624
+    },
625
+    .probe = admv1013_probe,
626
+    .id_table = admv1013_id,
627
+};
628
+module_spi_driver(admv1013_driver);
629
+
630
+MODULE_AUTHOR("Antoniu Miclaus <antoniu.miclaus@analog.com");
631
+MODULE_DESCRIPTION("Analog Devices ADMV1013");
632
+MODULE_LICENSE("GPL v2");
633
--
634
2.33.1
17
635
18
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
19
index XXXXXXX..XXXXXXX 100644
20
--- a/kernel/sched/fair.c
21
+++ b/kernel/sched/fair.c
22
@@ -XXX,XX +XXX,XX @@ static inline int has_pushable_tasks(struct rq *rq)
23
    return !plist_head_empty(&rq->cfs.pushable_tasks);
24
}
25
26
+static struct task_struct *pick_next_pushable_fair_task(struct rq *rq)
27
+{
28
+    struct task_struct *p;
29
+
30
+    if (!has_pushable_tasks(rq))
31
+        return NULL;
32
+
33
+    p = plist_first_entry(&rq->cfs.pushable_tasks,
34
+             struct task_struct, pushable_tasks);
35
+
36
+    WARN_ON_ONCE(rq->cpu != task_cpu(p));
37
+    WARN_ON_ONCE(task_current(rq, p));
38
+    WARN_ON_ONCE(p->nr_cpus_allowed <= 1);
39
+    WARN_ON_ONCE(!task_on_rq_queued(p));
40
+
41
+    /*
42
+     * Remove task from the pushable list as we try only once after that
43
+     * the task has been put back in enqueued list.
44
+     */
45
+    plist_del(&p->pushable_tasks, &rq->cfs.pushable_tasks);
46
+
47
+    return p;
48
+}
49
+
50
+static void fair_add_pushable_task(struct rq *rq, struct task_struct *p);
51
+static void attach_one_task(struct rq *rq, struct task_struct *p);
52
+
53
/*
54
* See if the non running fair tasks on this rq can be sent on other CPUs
55
* that fits better with their profile.
56
*/
57
static bool push_fair_task(struct rq *rq)
58
{
59
+    struct cpumask *cpus = this_cpu_cpumask_var_ptr(load_balance_mask);
60
+    struct task_struct *p = pick_next_pushable_fair_task(rq);
61
+    int cpu, this_cpu = cpu_of(rq);
62
+
63
+    if (!p)
64
+        return false;
65
+
66
+    if (!cpumask_and(cpus, nohz.idle_cpus_mask, housekeeping_cpumask(HK_TYPE_KERNEL_NOISE)))
67
+        goto requeue;
68
+
69
+    if (!cpumask_and(cpus, cpus, p->cpus_ptr))
70
+        goto requeue;
71
+
72
+    for_each_cpu_wrap(cpu, cpus, this_cpu + 1) {
73
+        struct rq *target_rq;
74
+
75
+        if (!idle_cpu(cpu))
76
+            continue;
77
+
78
+        target_rq = cpu_rq(cpu);
79
+        deactivate_task(rq, p, 0);
80
+        set_task_cpu(p, cpu);
81
+        raw_spin_rq_unlock(rq);
82
+
83
+        attach_one_task(target_rq, p);
84
+        raw_spin_rq_lock(rq);
85
+
86
+        return true;
87
+    }
88
+
89
+requeue:
90
+    fair_add_pushable_task(rq, p);
91
    return false;
92
}
93
94
--
95
2.34.1
With the introduction of "overloaded_mask" in sched_domain_shared
1
Add device tree bindings for the ADMV1013 Upconverter.
2
struct, it is now possible to scan through the CPUs that contain
3
pushable tasks that could be run on the CPU going newly idle.
4
2
5
Redesign the inter-NUMA newidle balancing to opportunistically pull a
3
Signed-off-by: Antoniu Miclaus <antoniu.miclaus@analog.com>
6
task to the CPU going idle from the overloaded CPUs only.
4
---
5
.../bindings/iio/frequency/adi,admv1013.yaml | 110 ++++++++++++++++++
6
1 file changed, 110 insertions(+)
7
create mode 100644 Documentation/devicetree/bindings/iio/frequency/adi,admv1013.yaml
7
8
8
The search starts from sd_llc and moves up until sd_numa. Since
9
diff --git a/Documentation/devicetree/bindings/iio/frequency/adi,admv1013.yaml b/Documentation/devicetree/bindings/iio/frequency/adi,admv1013.yaml
9
"overloaded_mask" is per-LLC, each LLC domain is visited individually
10
new file mode 100644
10
using per-CPU sd_llc struct shared by all CPUs in an LLC.
11
index XXXXXXX..XXXXXXX
12
--- /dev/null
13
+++ b/Documentation/devicetree/bindings/iio/frequency/adi,admv1013.yaml
14
@@ -XXX,XX +XXX,XX @@
15
+# SPDX-License-Identifier: (GPL-2.0-only OR BSD-2-Clause)
16
+%YAML 1.2
17
+---
18
+$id: http://devicetree.org/schemas/iio/frequency/adi,admv1013.yaml#
19
+$schema: http://devicetree.org/meta-schemas/core.yaml#
20
+
21
+title: ADMV1013 Microwave Upconverter
22
+
23
+maintainers:
24
+ - Antoniu Miclaus <antoniu.miclaus@analog.com>
25
+
26
+description: |
27
+ Wideband, microwave upconverter optimized for point to point microwave
28
+ radio designs operating in the 24 GHz to 44 GHz frequency range.
29
+
30
+ https://www.analog.com/en/products/admv1013.html
31
+
32
+properties:
33
+ compatible:
34
+ enum:
35
+ - adi,admv1013
36
+
37
+ reg:
38
+ maxItems: 1
39
+
40
+ spi-max-frequency:
41
+ maximum: 1000000
42
+
43
+ clocks:
44
+ description:
45
+ Definition of the external clock.
46
+ minItems: 1
47
+
48
+ clock-names:
49
+ items:
50
+ - const: lo_in
51
+
52
+ clock-output-names:
53
+ maxItems: 1
54
+
55
+ vcm-supply:
56
+ description:
57
+ Analog voltage regulator.
58
+
59
+ adi,vga-pd:
60
+ description:
61
+ Power Down the Voltage Gain Amplifier Circuit.
62
+ type: boolean
63
+
64
+ adi,mixer-pd:
65
+ description:
66
+ Power Down the Mixer Circuit.
67
+ type: boolean
68
+
69
+ adi,quad-pd:
70
+ description:
71
+ Power Down the Quadrupler.
72
+ type: boolean
73
+
74
+ adi,bg-pd:
75
+ description:
76
+ Power Down the Transmitter Band Gap.
77
+ type: boolean
78
+
79
+ adi,mixer-if-en:
80
+ description:
81
+ Enable the Intermediate Frequency Mode.
82
+ type: boolean
83
+
84
+ adi,det-en:
85
+ description:
86
+ Enable the Envelope Detector.
87
+ type: boolean
88
+
89
+ adi,quad-se-mode:
90
+ description:
91
+ Switch the LO path from differential to single-ended operation.
92
+ $ref: /schemas/types.yaml#/definitions/uint32
93
+ enum: [6, 9, 12]
94
+
95
+ '#clock-cells':
96
+ const: 0
97
+
98
+required:
99
+ - compatible
100
+ - reg
101
+ - clocks
102
+ - clock-names
103
+ - vcm-supply
104
+
105
+additionalProperties: false
106
+
107
+examples:
108
+ - |
109
+ spi {
110
+ #address-cells = <1>;
111
+ #size-cells = <0>;
112
+ admv1013@0{
113
+ compatible = "adi,admv1013";
114
+ reg = <0>;
115
+ spi-max-frequency = <1000000>;
116
+ clocks = <&admv1013_lo>;
117
+ clock-names = "lo_in";
118
+ vcm-supply = <&vcm>;
119
+ adi,quad-se-mode = <12>;
120
+ adi,mixer-if-en;
121
+ adi,det-en;
122
+ };
123
+ };
124
+...
125
--
126
2.33.1
11
127
12
Once one CPU of an LLC has been visited, all CPUs in that LLC are marked
visited and the search resumes for the LLCs of the CPUs that remain to be
visited.

detach_one_task() was used instead of pick_next_pushable_fair_task()
since detach_one_task() also considers the CPU affinity of the task
being pulled, as opposed to pick_next_pushable_fair_task(), which returns
the first pushable task.

Since each iteration over the overloaded_mask rechecks the idle state of
the CPU doing the newidle balance, the initial gating factor based on
"rq->avg_idle" has been removed.

Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com>
---
26
kernel/sched/fair.c | 129 +++++++++++++++++++++++++++++++++++++++-----
27
1 file changed, 117 insertions(+), 12 deletions(-)
28
29
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
30
index XXXXXXX..XXXXXXX 100644
31
--- a/kernel/sched/fair.c
32
+++ b/kernel/sched/fair.c
33
@@ -XXX,XX +XXX,XX @@ static inline bool nohz_idle_balance(struct rq *this_rq, enum cpu_idle_type idle
34
static inline void nohz_newidle_balance(struct rq *this_rq) { }
35
#endif /* CONFIG_NO_HZ_COMMON */
36
37
+static inline bool sched_newidle_continue_balance(struct rq *rq)
38
+{
39
+    return !rq->nr_running && !rq->ttwu_pending;
40
+}
41
+
42
+static inline int sched_newidle_pull_overloaded(struct sched_domain *sd,
43
+                        struct rq *this_rq,
44
+                        int *continue_balancing)
45
+{
46
+    struct cpumask *cpus = this_cpu_cpumask_var_ptr(load_balance_mask);
47
+    int cpu, this_cpu = cpu_of(this_rq);
48
+    struct sched_domain *sd_parent;
49
+    struct lb_env env = {
50
+        .dst_cpu    = this_cpu,
51
+        .dst_rq        = this_rq,
52
+        .idle        = CPU_NEWLY_IDLE,
53
+    };
54
+
55
+
56
+    cpumask_and(cpus, sched_domain_span(sd), cpu_active_mask);
57
+
58
+next_domain:
59
+    env.sd = sd;
60
+    /* Allow migrating cache_hot tasks too. */
61
+    sd->nr_balance_failed = sd->cache_nice_tries + 1;
62
+
63
+    for_each_cpu_wrap(cpu, cpus, this_cpu) {
64
+        struct sched_domain_shared *sd_share;
65
+        struct cpumask *overloaded_mask;
66
+        struct sched_domain *cpu_llc;
67
+        int overloaded_cpu;
68
+
69
+        cpu_llc = rcu_dereference(per_cpu(sd_llc, cpu));
70
+        if (!cpu_llc)
71
+            break;
72
+
73
+        sd_share = cpu_llc->shared;
74
+        if (!sd_share)
75
+            break;
76
+
77
+        overloaded_mask = sd_share->overloaded_mask;
78
+        if (!overloaded_mask)
79
+            break;
80
+
81
+        for_each_cpu_wrap(overloaded_cpu, overloaded_mask, this_cpu + 1) {
82
+            struct rq *overloaded_rq = cpu_rq(overloaded_cpu);
83
+            struct task_struct *p = NULL;
84
+
85
+            if (sched_newidle_continue_balance(this_rq)) {
86
+                *continue_balancing = 0;
87
+                return 0;
88
+            }
89
+
90
+            /* Quick peek to find if pushable tasks exist. */
91
+            if (!has_pushable_tasks(overloaded_rq))
92
+                continue;
93
+
94
+            scoped_guard (rq_lock, overloaded_rq) {
95
+                update_rq_clock(overloaded_rq);
96
+
97
+                if (!has_pushable_tasks(overloaded_rq))
98
+                    break;
99
+
100
+                env.src_cpu = overloaded_cpu;
101
+                env.src_rq = overloaded_rq;
102
+
103
+                p = detach_one_task(&env);
104
+            }
105
+
106
+            if (!p)
107
+                continue;
108
+
109
+            attach_one_task(this_rq, p);
110
+            return 1;
111
+        }
112
+
113
+        cpumask_andnot(cpus, cpus, sched_domain_span(cpu_llc));
114
+    }
115
+
116
+    if (sched_newidle_continue_balance(this_rq)) {
117
+        *continue_balancing = 0;
118
+        return 0;
119
+    }
120
+
121
+    sd_parent = sd->parent;
122
+    if (sd_parent && !(sd_parent->flags & SD_NUMA)) {
123
+        cpumask_andnot(cpus, sched_domain_span(sd_parent), sched_domain_span(sd));
124
+        sd = sd_parent;
125
+        goto next_domain;
126
+    }
127
+
128
+    return 0;
129
+}
130
+
131
/*
132
* sched_balance_newidle is called by schedule() if this_cpu is about to become
133
* idle. Attempts to pull tasks from other CPUs.
134
@@ -XXX,XX +XXX,XX @@ static int sched_balance_newidle(struct rq *this_rq, struct rq_flags *rf)
135
    u64 t0, t1, curr_cost = 0;
136
    struct sched_domain *sd;
137
    int pulled_task = 0;
138
+    u64 domain_cost;
139
140
    update_misfit_status(NULL, this_rq);
141
142
@@ -XXX,XX +XXX,XX @@ static int sched_balance_newidle(struct rq *this_rq, struct rq_flags *rf)
143
    rq_unpin_lock(this_rq, rf);
144
145
    rcu_read_lock();
146
-    sd = rcu_dereference_check_sched_domain(this_rq->sd);
147
-
148
-    if (!get_rd_overloaded(this_rq->rd) ||
149
-     (sd && this_rq->avg_idle < sd->max_newidle_lb_cost)) {
150
-
151
-        if (sd)
152
-            update_next_balance(sd, &next_balance);
153
+    if (!get_rd_overloaded(this_rq->rd)) {
154
        rcu_read_unlock();
155
-
156
        goto out;
157
    }
158
    rcu_read_unlock();
159
160
    raw_spin_rq_unlock(this_rq);
161
162
+    rcu_read_lock();
163
    t0 = sched_clock_cpu(this_cpu);
164
-    sched_balance_update_blocked_averages(this_cpu);
165
166
-    rcu_read_lock();
167
-    for_each_domain(this_cpu, sd) {
168
-        u64 domain_cost;
169
+    sd = rcu_dereference(per_cpu(sd_llc, this_cpu));
170
+    if (sd) {
171
+        pulled_task = sched_newidle_pull_overloaded(sd, this_rq, &continue_balancing);
172
+
173
+        t1 = sched_clock_cpu(this_cpu);
174
+        domain_cost = t1 - t0;
175
+        curr_cost += domain_cost;
176
+        t0 = t1;
177
178
+        if (pulled_task || !continue_balancing)
179
+            goto skip_numa;
180
+    }
181
+
182
+    sched_balance_update_blocked_averages(this_cpu);
183
+
184
+    sd = rcu_dereference(per_cpu(sd_numa, this_cpu));
185
+    while (sd) {
186
        update_next_balance(sd, &next_balance);
187
188
        if (this_rq->avg_idle < curr_cost + sd->max_newidle_lb_cost)
189
@@ -XXX,XX +XXX,XX @@ static int sched_balance_newidle(struct rq *this_rq, struct rq_flags *rf)
190
         */
191
        if (pulled_task || !continue_balancing)
192
            break;
193
+
194
+        sd = sd->parent;
195
    }
196
+
197
+skip_numa:
198
    rcu_read_unlock();
199
200
    raw_spin_rq_lock(this_rq);
201
--
202
2.34.1