No numbers to back this up, but it seemed obvious to me that if there
are competing lru_add_drain_all()ers, the total work is minimized if each
flushes its own local queues before taking the lock and doing the
cross-CPU drains.
Signed-off-by: Hugh Dickins <hughd@google.com>
Acked-by: David Hildenbrand <david@redhat.com>
---
 mm/swap.c | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/mm/swap.c b/mm/swap.c
index b74ebe865dd9..881e53b2877e 100644
--- a/mm/swap.c
+++ b/mm/swap.c
@@ -834,6 +834,9 @@ static inline void __lru_add_drain_all(bool force_all_cpus)
 	 */
 	this_gen = smp_load_acquire(&lru_drain_gen);
 
+	/* It helps everyone if we do our own local drain immediately. */
+	lru_add_drain();
+
 	mutex_lock(&lock);
 
 	/*
--
2.51.0