[PATCH v2 0/7] accel/tcg: Rewrite user-only vma tracking

Richard Henderson posted 7 patches 1 year, 6 months ago
Failed in applying to current master (apply log)
Maintainers: Richard Henderson <richard.henderson@linaro.org>, Paolo Bonzini <pbonzini@redhat.com>, Riku Voipio <riku.voipio@iki.fi>, "Alex Bennée" <alex.bennee@linaro.org>
There is a newer version of this series
accel/tcg/internal.h            |  85 +--
include/exec/exec-all.h         |  43 +-
include/exec/translate-all.h    |   6 -
include/qemu/interval-tree.h    |  99 ++++
accel/tcg/tb-maint.c            | 856 +++++++++++++++++++++++++------
accel/tcg/translate-all.c       | 746 ---------------------------
accel/tcg/user-exec.c           | 659 +++++++++++++++++++++++-
tests/tcg/multiarch/test-vma.c  |  22 +
tests/unit/test-interval-tree.c | 209 ++++++++
util/interval-tree.c            | 882 ++++++++++++++++++++++++++++++++
tests/unit/meson.build          |   1 +
util/meson.build                |   1 +
12 files changed, 2592 insertions(+), 1017 deletions(-)
create mode 100644 include/qemu/interval-tree.h
create mode 100644 tests/tcg/multiarch/test-vma.c
create mode 100644 tests/unit/test-interval-tree.c
create mode 100644 util/interval-tree.c
[PATCH v2 0/7] accel/tcg: Rewrite user-only vma tracking
Posted by Richard Henderson 1 year, 6 months ago
The primary motivator here are the numerous bug reports (e.g. #290)
about not being able to handle very large memory allocations.
I presume all or most of these are due to guest use of the clang
address sanitizer, which allocates a massive shadow vma.

This patch set copies the linux kernel code for interval trees,
which is what the kernel itself uses for managing vmas.  I then
purge all (real) use of PageDesc from user-only.  This is easy
for user-only because everything tricky happens under mmap_lock();

I have thought only briefly about using interval trees for system
mode too, but the locking situation there is more difficult.  So
for now that code gets moved around but not substantially changed.

The test case from #290 is added to test/tcg/multiarch/.
Before this patch set, on my moderately beefy laptop, it takes 39s
and has an RSS of 28GB before the qemu process is killed.  After
the patch set, the test case successfully allocates 16TB and
completes in 0.013s.

Changes for v2:
  * Rebase on master, 17 patches merged.
  * Structure of page_get_target_data adjusted (ajb).


r~


Richard Henderson (7):
  util: Add interval-tree.c
  accel/tcg: Use interval tree for TBs in user-only mode
  accel/tcg: Use interval tree for TARGET_PAGE_DATA_SIZE
  accel/tcg: Move page_{get,set}_flags to user-exec.c
  accel/tcg: Use interval tree for user-only page tracking
  accel/tcg: Move PageDesc tree into tb-maint.c for system
  accel/tcg: Move remainder of page locking to tb-maint.c

 accel/tcg/internal.h            |  85 +--
 include/exec/exec-all.h         |  43 +-
 include/exec/translate-all.h    |   6 -
 include/qemu/interval-tree.h    |  99 ++++
 accel/tcg/tb-maint.c            | 856 +++++++++++++++++++++++++------
 accel/tcg/translate-all.c       | 746 ---------------------------
 accel/tcg/user-exec.c           | 659 +++++++++++++++++++++++-
 tests/tcg/multiarch/test-vma.c  |  22 +
 tests/unit/test-interval-tree.c | 209 ++++++++
 util/interval-tree.c            | 882 ++++++++++++++++++++++++++++++++
 tests/unit/meson.build          |   1 +
 util/meson.build                |   1 +
 12 files changed, 2592 insertions(+), 1017 deletions(-)
 create mode 100644 include/qemu/interval-tree.h
 create mode 100644 tests/tcg/multiarch/test-vma.c
 create mode 100644 tests/unit/test-interval-tree.c
 create mode 100644 util/interval-tree.c

-- 
2.34.1