Series comparison

-[PULL 0/3] Block patches
+[Qemu-devel] [PULL for-2.11-rc1 v2 0/2] Block patches
-The following changes since commit 8c5f94cd4182753959c8be8de415120dc879d8f0:
+The following changes since commit b0fbe46ad82982b289a44ee2495b59b0bad8a842:
-  Merge tag 'pull-loong-20211221-2' of https://gitlab.com/rth7680/qemu into staging (2021-12-21 13:30:35 -0800)
+  Update version for v2.11.0-rc0 release (2017-11-07 16:05:28 +0000)
-are available in the Git repository at:
+are available in the git repository at:
-  https://gitlab.com/hreitz/qemu.git tags/pull-block-2021-12-22
+  git://github.com/stefanha/qemu.git tags/block-pull-request
-for you to fetch changes up to 722f87df2545b308aec49b459b028f0802b4fd9e:
+for you to fetch changes up to ef6dada8b44e1e7c4bec5c1115903af9af415b50:
-  iotests: check: multiprocessing support (2021-12-22 16:29:48 +0100)
+  util/async: use atomic_mb_set in qemu_bh_cancel (2017-11-08 19:09:15 +0000)
 ----------------------------------------------------------------
-Block patches:
+Pull request
-- Added support to the iotests for running tests in several parallel
-  jobs (using the new -j parameter)
+v2:
  * v1 emails 2/3 and 3/3 weren't sent due to an email failure
  * Included Sergio's updated wording in the commit description
 ----------------------------------------------------------------
-Vladimir Sementsov-Ogievskiy (3):
-  iotests/testrunner.py: add doc string for run_test()
-  iotests/testrunner.py: move updating last_elapsed to run_tests
-  iotests: check: multiprocessing support
- tests/qemu-iotests/check         |  4 +-
+Sergio Lopez (1):
- tests/qemu-iotests/testrunner.py | 86 ++++++++++++++++++++++++++++----
+  util/async: use atomic_mb_set in qemu_bh_cancel
- 2 files changed, 80 insertions(+), 10 deletions(-)
 Stefan Hajnoczi (1):
   tests-aio-multithread: fix /aio/multi/schedule race condition
  tests/test-aio-multithread.c | 5 ++---
  util/async.c                 | 2 +-
  2 files changed, 3 insertions(+), 4 deletions(-)
 --
-2.33.1
+2.13.6

-[PULL 3/3] iotests: check: multiprocessing support
+[Qemu-devel] [PULL for-2.11-rc1 v2 1/2] tests-aio-multithread: fix /aio/multi/schedule race condition
-From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
+test_multi_co_schedule_entry() set to_schedule[id] in the final loop
 iteration before terminating the coroutine.  There is a race condition
 where the main thread attempts to enter the terminating or terminated
 coroutine when signalling coroutines to stop:
-Add -j <JOBS> parameter, to run tests in several jobs simultaneously.
+  atomic_mb_set(&now_stopping, true);
-The implementation simply uses the multiprocessing.Pool class.
+  for (i = 0; i < NUM_CONTEXTS; i++) {
       ctx_run(i, finish_cb, NULL);  <--- enters dead coroutine!
       to_schedule[i] = NULL;
   }
-Notes:
+Make sure only to set to_schedule[id] if this coroutine really needs to
 be scheduled!
-1. Of course, tests can't run simultaneously in same TEST_DIR. So,
+Reported-by: "R.Nageswara Sastry" <nasastry@in.ibm.com>
-   use subdirectories TEST_DIR/testname/ and SOCK_DIR/testname/
+Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
-   instead of simply TEST_DIR and SOCK_DIR
+Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
 Message-id: 20171106190233.1175-1-stefanha@redhat.com
 Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
 ---
  tests/test-aio-multithread.c | 5 ++---
  1 file changed, 2 insertions(+), 3 deletions(-)
-2. multiprocessing.Pool.starmap function doesn't support passing
+diff --git a/tests/test-aio-multithread.c b/tests/test-aio-multithread.c
    context managers, so we can't simply pass "self". Happily, we need
    self only for read-only access, and it just works if it is defined
    in global space. So, add a temporary link TestRunner.shared_self
    during run_tests().
 Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
 Message-Id: <20211203122223.2780098-4-vsementsov@virtuozzo.com>
 Reviewed-by: John Snow <jsnow@redhat.com>
 Tested-by: John Snow <jsnow@redhat.com>
 Signed-off-by: Hanna Reitz <hreitz@redhat.com>
 ---
  tests/qemu-iotests/check         |  4 +-
  tests/qemu-iotests/testrunner.py | 69 ++++++++++++++++++++++++++++----
  2 files changed, 64 insertions(+), 9 deletions(-)
 diff --git a/tests/qemu-iotests/check b/tests/qemu-iotests/check
 index XXXXXXX..XXXXXXX 100755
 --- a/tests/qemu-iotests/check
 +++ b/tests/qemu-iotests/check
@@ -XXX,XX +XXX,XX @@ def make_argparser() -> argparse.ArgumentParser:
                     help='show me, do not run tests')
      p.add_argument('-makecheck', action='store_true',
                     help='pretty print output for make check')
 +    p.add_argument('-j', dest='jobs', type=int, default=1,
 +                   help='run tests in multiple parallel jobs')
      p.add_argument('-d', dest='debug', action='store_true', help='debug')
      p.add_argument('-p', dest='print', action='store_true',
@@ -XXX,XX +XXX,XX @@ if __name__ == '__main__':
          with TestRunner(env, makecheck=args.makecheck,
                          color=args.color) as tr:
              paths = [os.path.join(env.source_iotests, t) for t in tests]
 -            ok = tr.run_tests(paths)
 +            ok = tr.run_tests(paths, args.jobs)
              if not ok:
                  sys.exit(1)
 diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
 index XXXXXXX..XXXXXXX 100644
---- a/tests/qemu-iotests/testrunner.py
+--- a/tests/test-aio-multithread.c
-+++ b/tests/qemu-iotests/testrunner.py
++++ b/tests/test-aio-multithread.c
-@@ -XXX,XX +XXX,XX @@
+@@ -XXX,XX +XXX,XX @@ static void finish_cb(void *opaque)
- import json
+ static coroutine_fn void test_multi_co_schedule_entry(void *opaque)
- import termios
+ {
- import sys
+     g_assert(to_schedule[id] == NULL);
-+from multiprocessing import Pool
+-    atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
- from contextlib import contextmanager
- from typing import List, Optional, Iterator, Any, Sequence, Dict, \
+     while (!atomic_mb_read(&now_stopping)) {
-         ContextManager
+         int n;
-@@ -XXX,XX +XXX,XX @@ def __init__(self, status: str, description: str = '',
+         n = g_test_rand_int_range(0, NUM_CONTEXTS);
+         schedule_next(n);
  class TestRunner(ContextManager['TestRunner']):
 +    shared_self = None
 +
-+    @staticmethod
++        atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
-+    def proc_run_test(test: str, test_field_width: int) -> TestResult:
+         qemu_coroutine_yield();
-+        # We are in a subprocess, we can't change the runner object!
+-
-+        runner = TestRunner.shared_self
+         g_assert(to_schedule[id] == NULL);
-+        assert runner is not None
+-        atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
-+        return runner.run_test(test, test_field_width, mp=True)
+     }
-+
+ }
 +    def run_tests_pool(self, tests: List[str],
 +                       test_field_width: int, jobs: int) -> List[TestResult]:
 +
 +        # passing self directly to Pool.starmap() just doesn't work, because
 +        # it's a context manager.
 +        assert TestRunner.shared_self is None
 +        TestRunner.shared_self = self
 +
 +        with Pool(jobs) as p:
 +            results = p.starmap(self.proc_run_test,
 +                                zip(tests, [test_field_width] * len(tests)))
 +
 +        TestRunner.shared_self = None
 +
 +        return results
 +
      def __init__(self, env: TestEnv, makecheck: bool = False,
                   color: str = 'auto') -> None:
          self.env = env
@@ -XXX,XX +XXX,XX @@ def find_reference(self, test: str) -> str:
          return f'{test}.out'
 -    def do_run_test(self, test: str) -> TestResult:
 +    def do_run_test(self, test: str, mp: bool) -> TestResult:
          """
          Run one test
          :param test: test file path
 +        :param mp: if true, we are in a multiprocessing environment, use
 +                   personal subdirectories for test run
 +
 +        Note: this method may be called from subprocess, so it does not
 +        change ``self`` object in any way!
          """
          f_test = Path(test)
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
          args = [str(f_test.resolve())]
          env = self.env.prepare_subprocess(args)
 +        if mp:
 +            # Split test directories, so that tests running in parallel don't
 +            # break each other.
 +            for d in ['TEST_DIR', 'SOCK_DIR']:
 +                env[d] = os.path.join(env[d], f_test.name)
 +                Path(env[d]).mkdir(parents=True, exist_ok=True)
          t0 = time.time()
          with f_bad.open('w', encoding="utf-8") as f:
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
                                casenotrun=casenotrun)
      def run_test(self, test: str,
 -                 test_field_width: Optional[int] = None) -> TestResult:
 +                 test_field_width: Optional[int] = None,
 +                 mp: bool = False) -> TestResult:
          """
          Run one test and print short status
          :param test: test file path
          :param test_field_width: width for first field of status format
 +        :param mp: if true, we are in a multiprocessing environment, don't try
 +                   to rewrite things in stdout
 +
 +        Note: this method may be called from subprocess, so it does not
 +        change ``self`` object in any way!
          """
          last_el = self.last_elapsed.get(test)
          start = datetime.datetime.now().strftime('%H:%M:%S')
          if not self.makecheck:
 -            self.test_print_one_line(test=test, starttime=start,
 -                                     lasttime=last_el, end='\r',
 +            self.test_print_one_line(test=test,
 +                                     status = 'started' if mp else '...',
 +                                     starttime=start,
 +                                     lasttime=last_el,
 +                                     end = '\n' if mp else '\r',
                                       test_field_width=test_field_width)
 -        res = self.do_run_test(test)
 +        res = self.do_run_test(test, mp)
          end = datetime.datetime.now().strftime('%H:%M:%S')
          self.test_print_one_line(test=test, status=res.status,
@@ -XXX,XX +XXX,XX @@ def run_test(self, test: str,
          return res
 -    def run_tests(self, tests: List[str]) -> bool:
 +    def run_tests(self, tests: List[str], jobs: int = 1) -> bool:
          n_run = 0
          failed = []
          notrun = []
@@ -XXX,XX +XXX,XX @@ def run_tests(self, tests: List[str]) -> bool:
          test_field_width = max(len(os.path.basename(t)) for t in tests) + 2
 -        for t in tests:
 +        if jobs > 1:
 +            results = self.run_tests_pool(tests, test_field_width, jobs)
 +
 +        for i, t in enumerate(tests):
              name = os.path.basename(t)
 -            res = self.run_test(t, test_field_width=test_field_width)
 +
 +            if jobs > 1:
 +                res = results[i]
 +            else:
 +                res = self.run_test(t, test_field_width)
              assert res.status in ('pass', 'fail', 'not run')
 --
-2.33.1
+2.13.6

-[PULL 1/3] iotests/testrunner.py: add doc string for run_test()
+[Qemu-devel] [PULL for-2.11-rc1 v2 2/2] util/async: use atomic_mb_set in qemu_bh_cancel
-From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
+From: Sergio Lopez <slp@redhat.com>
-We are going to modify these methods and will add more documentation in
+Commit b7a745d added a qemu_bh_cancel call to the completion function
-further commits. As preparation, add basic documentation.
+as an optimization to prevent it from unnecessarily rescheduling itself.
-Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
+This completion function is scheduled from worker_thread, after setting
-Message-Id: <20211203122223.2780098-2-vsementsov@virtuozzo.com>
+the state of a ThreadPoolElement to THREAD_DONE.
-Reviewed-by: John Snow <jsnow@redhat.com>
-Tested-by: John Snow <jsnow@redhat.com>
+This was considered to be safe, as the completion function restarts the
-Signed-off-by: Hanna Reitz <hreitz@redhat.com>
+loop just after the call to qemu_bh_cancel. But, as this loop lacks a HW
 memory barrier, the read of req->state may actually happen _before_ the
 call, seeing it still as THREAD_QUEUED, and ending the completion
 function without having processed a pending TPE linked at pool->head:
          worker thread             |            I/O thread
 ------------------------------------------------------------------------
                                    | speculatively read req->state
 req->state = THREAD_DONE;          |
 qemu_bh_schedule(p->completion_bh) |
   bh->scheduled = 1;               |
                                    | qemu_bh_cancel(p->completion_bh)
                                    |   bh->scheduled = 0;
                                    | if (req->state == THREAD_DONE)
                                    |   // sees THREAD_QUEUED
 The source of the misunderstanding was that qemu_bh_cancel is now being
 used by the _consumer_ rather than the producer, and therefore now needs
 to have acquire semantics just like e.g. aio_bh_poll.
 In some situations, if there are no other independent requests in the
 same aio context that could eventually trigger the scheduling of the
 completion function, the omitted TPE and all operations pending on it
 will get stuck forever.
 [Added Sergio's updated wording about the HW memory barrier.
 --Stefan]
 Signed-off-by: Sergio Lopez <slp@redhat.com>
 Message-id: 20171108063447.2842-1-slp@redhat.com
 Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
 ---
- tests/qemu-iotests/testrunner.py | 13 +++++++++++++
+ util/async.c | 2 +-
- 1 file changed, 13 insertions(+)
+ 1 file changed, 1 insertion(+), 1 deletion(-)
-diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
+diff --git a/util/async.c b/util/async.c
 index XXXXXXX..XXXXXXX 100644
---- a/tests/qemu-iotests/testrunner.py
+--- a/util/async.c
-+++ b/tests/qemu-iotests/testrunner.py
++++ b/util/async.c
-@@ -XXX,XX +XXX,XX @@ def find_reference(self, test: str) -> str:
+@@ -XXX,XX +XXX,XX @@ void qemu_bh_schedule(QEMUBH *bh)
-         return f'{test}.out'
+  */
+ void qemu_bh_cancel(QEMUBH *bh)
-     def do_run_test(self, test: str) -> TestResult:
+ {
-+        """
+-    bh->scheduled = 0;
-+        Run one test
++    atomic_mb_set(&bh->scheduled, 0);
-+
+ }
-+        :param test: test file path
-+        """
+ /* This func is async.The bottom half will do the delete action at the finial
 +
          f_test = Path(test)
          f_bad = Path(f_test.name + '.out.bad')
          f_notrun = Path(f_test.name + '.notrun')
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
      def run_test(self, test: str,
                   test_field_width: Optional[int] = None) -> TestResult:
 +        """
 +        Run one test and print short status
 +
 +        :param test: test file path
 +        :param test_field_width: width for first field of status format
 +        """
 +
          last_el = self.last_elapsed.get(test)
          start = datetime.datetime.now().strftime('%H:%M:%S')
 --
-2.33.1
+2.13.6

-[PULL 2/3] iotests/testrunner.py: move updating last_elapsed to run_tests
+Deleted patch
-From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
-We are going to use do_run_test() in multiprocessing environment, where
-we'll not be able to change original runner object.
-Happily, the only thing we change is that last_elapsed and it's simple
-to do it in run_tests() instead. All other accesses to self in
-do_run_test() and in run_test() are read-only.
-Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
-Message-Id: <20211203122223.2780098-3-vsementsov@virtuozzo.com>
-Reviewed-by: John Snow <jsnow@redhat.com>
-Tested-by: John Snow <jsnow@redhat.com>
-Signed-off-by: Hanna Reitz <hreitz@redhat.com>
----
- tests/qemu-iotests/testrunner.py | 4 +++-
- 1 file changed, 3 insertions(+), 1 deletion(-)
-diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
-index XXXXXXX..XXXXXXX 100644
---- a/tests/qemu-iotests/testrunner.py
-+++ b/tests/qemu-iotests/testrunner.py
-@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
-                               diff=diff, casenotrun=casenotrun)
-         else:
-             f_bad.unlink()
--            self.last_elapsed.update(test, elapsed)
-             return TestResult(status='pass', elapsed=elapsed,
-                               casenotrun=casenotrun)
-@@ -XXX,XX +XXX,XX @@ def run_tests(self, tests: List[str]) -> bool:
-                     print('\n'.join(res.diff))
-             elif res.status == 'not run':
-                 notrun.append(name)
-+            elif res.status == 'pass':
-+                assert res.elapsed is not None
-+                self.last_elapsed.update(t, res.elapsed)
-             sys.stdout.flush()
-             if res.interrupted:
---
-2.33.1

The following changes since commit 8c5f94cd4182753959c8be8de415120dc879d8f0:

Merge tag 'pull-loong-20211221-2' of https://gitlab.com/rth7680/qemu into staging (2021-12-21 13:30:35 -0800)

are available in the Git repository at:

https://gitlab.com/hreitz/qemu.git tags/pull-block-2021-12-22

for you to fetch changes up to 722f87df2545b308aec49b459b028f0802b4fd9e:

iotests: check: multiprocessing support (2021-12-22 16:29:48 +0100)

----------------------------------------------------------------
Block patches:
- Added support to the iotests for running tests in several parallel
  jobs (using the new -j parameter)

----------------------------------------------------------------
Vladimir Sementsov-Ogievskiy (3):
  iotests/testrunner.py: add doc string for run_test()
  iotests/testrunner.py: move updating last_elapsed to run_tests
  iotests: check: multiprocessing support

 tests/qemu-iotests/check         |  4 +-
 tests/qemu-iotests/testrunner.py | 86 ++++++++++++++++++++++++++++----
 2 files changed, 80 insertions(+), 10 deletions(-)

-- 
2.33.1

From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>

We are going to modify these methods and will add more documentation in
further commits. As preparation, add basic documentation.

Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
Message-Id: <20211203122223.2780098-2-vsementsov@virtuozzo.com>
Reviewed-by: John Snow <jsnow@redhat.com>
Tested-by: John Snow <jsnow@redhat.com>
Signed-off-by: Hanna Reitz <hreitz@redhat.com>
---
 tests/qemu-iotests/testrunner.py | 13 +++++++++++++
 1 file changed, 13 insertions(+)

diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
index XXXXXXX..XXXXXXX 100644
--- a/tests/qemu-iotests/testrunner.py
+++ b/tests/qemu-iotests/testrunner.py
@@ -XXX,XX +XXX,XX @@ def find_reference(self, test: str) -> str:
         return f'{test}.out'
 
     def do_run_test(self, test: str) -> TestResult:
+        """
+        Run one test
+
+        :param test: test file path
+        """
+
         f_test = Path(test)
         f_bad = Path(f_test.name + '.out.bad')
         f_notrun = Path(f_test.name + '.notrun')
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
 
     def run_test(self, test: str,
                  test_field_width: Optional[int] = None) -> TestResult:
+        """
+        Run one test and print short status
+
+        :param test: test file path
+        :param test_field_width: width for first field of status format
+        """
+
         last_el = self.last_elapsed.get(test)
         start = datetime.datetime.now().strftime('%H:%M:%S')
 
-- 
2.33.1

From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>

We are going to use do_run_test() in a multiprocessing environment, where
we will not be able to change the original runner object.

Happily, the only thing we change is last_elapsed, and it is simple
to update it in run_tests() instead. All other accesses to self in
do_run_test() and in run_test() are read-only.
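
The idea can be sketched outside the iotests code as follows. This is only an illustration: Result and record_elapsed are hypothetical names, and a plain dict stands in for the runner's last_elapsed cache. The code that runs a test only returns a value; the caller, which owns the mutable state, records timings for passing tests.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Result:
    # Hypothetical stand-in for the runner's result type.
    status: str
    elapsed: Optional[float] = None


def record_elapsed(last_elapsed: Dict[str, float],
                   tests: List[str], results: List[Result]) -> None:
    """Caller-side bookkeeping: store timings for passing tests only."""
    for test, res in zip(tests, results):
        if res.status == 'pass':
            assert res.elapsed is not None
            last_elapsed[test] = res.elapsed


if __name__ == '__main__':
    cache: Dict[str, float] = {}
    record_elapsed(cache, ['001', '002'],
                   [Result('pass', 1.5), Result('fail', 2.0)])
    print(cache)
```

Because the per-test code never writes to shared state, it does not matter whether a result was produced in the calling process or in a worker.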

Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
Message-Id: <20211203122223.2780098-3-vsementsov@virtuozzo.com>
Reviewed-by: John Snow <jsnow@redhat.com>
Tested-by: John Snow <jsnow@redhat.com>
Signed-off-by: Hanna Reitz <hreitz@redhat.com>
---
 tests/qemu-iotests/testrunner.py | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
index XXXXXXX..XXXXXXX 100644
--- a/tests/qemu-iotests/testrunner.py
+++ b/tests/qemu-iotests/testrunner.py
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
                               diff=diff, casenotrun=casenotrun)
         else:
             f_bad.unlink()
-            self.last_elapsed.update(test, elapsed)
             return TestResult(status='pass', elapsed=elapsed,
                               casenotrun=casenotrun)
 
@@ -XXX,XX +XXX,XX @@ def run_tests(self, tests: List[str]) -> bool:
                     print('\n'.join(res.diff))
             elif res.status == 'not run':
                 notrun.append(name)
+            elif res.status == 'pass':
+                assert res.elapsed is not None
+                self.last_elapsed.update(t, res.elapsed)
 
             sys.stdout.flush()
             if res.interrupted:
-- 
2.33.1

From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>

Add a -j <JOBS> parameter to run tests in several jobs simultaneously.
The implementation simply uses the multiprocessing.Pool class.
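
As a rough sketch of how such an option can be wired up (this is not the check script itself; make_parser and describe_mode are hypothetical names), the following parses -j with a default of one job and chooses between the serial and the pooled path:

```python
import argparse
from typing import List, Optional


def make_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for the real option parser; only -j is modelled.
    p = argparse.ArgumentParser(description='toy test driver')
    p.add_argument('-j', dest='jobs', type=int, default=1,
                   help='run tests in multiple parallel jobs')
    return p


def describe_mode(argv: Optional[List[str]] = None) -> str:
    args = make_parser().parse_args(argv)
    # Mirror the "jobs > 1" check: a single job keeps the serial code path.
    return f'pool of {args.jobs}' if args.jobs > 1 else 'serial'


if __name__ == '__main__':
    print(describe_mode(['-j', '4']))
```

Keeping 1 as the default means existing invocations behave as before unless -j is given.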

Notes:

1. Of course, tests can't run simultaneously in the same TEST_DIR. So,
   use subdirectories TEST_DIR/testname/ and SOCK_DIR/testname/
   instead of simply TEST_DIR and SOCK_DIR.

2. The multiprocessing.Pool.starmap function doesn't support passing
   context managers, so we can't simply pass "self". Happily, we need
   self only for read-only access, and it just works if it is defined
   in global space. So, add a temporary link TestRunner.shared_self
   during run_tests().
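
The second note can be illustrated with a minimal, self-contained sketch. Runner and its methods are hypothetical names, not the iotests classes. The sketch assumes the "fork" start method, since a class attribute set at run time is inherited by workers only when they are forked from the parent; the try/finally is an addition of this sketch.

```python
import multiprocessing
from typing import List, Optional, Tuple


class Runner:
    # Class-level slot that workers read instead of receiving the
    # runner object as a pickled argument.
    shared_self: Optional['Runner'] = None

    def __init__(self, label: str) -> None:
        self.label = label

    def run_one(self, test: str, width: int) -> str:
        # Read-only use of self, so it is safe to call in a worker.
        return f'{test:<{width}} {self.label}'

    @staticmethod
    def proc_run_one(test: str, width: int) -> str:
        runner = Runner.shared_self
        assert runner is not None
        return runner.run_one(test, width)

    def run_pool(self, tests: List[str], width: int, jobs: int) -> List[str]:
        assert Runner.shared_self is None
        Runner.shared_self = self
        try:
            # Assumption: forked workers see the attribute set above.
            ctx = multiprocessing.get_context('fork')
            args: List[Tuple[str, int]] = [(t, width) for t in tests]
            with ctx.Pool(jobs) as pool:
                # starmap() returns results in the order of its input.
                return pool.starmap(Runner.proc_run_one, args)
        finally:
            Runner.shared_self = None
```

Calling Runner('ok').run_pool(['001', '002'], 5, 2) from a script returns one formatted line per test, in input order. Only the test name and the field width cross the process boundary as arguments.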

Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
Message-Id: <20211203122223.2780098-4-vsementsov@virtuozzo.com>
Reviewed-by: John Snow <jsnow@redhat.com>
Tested-by: John Snow <jsnow@redhat.com>
Signed-off-by: Hanna Reitz <hreitz@redhat.com>
---
 tests/qemu-iotests/check         |  4 +-
 tests/qemu-iotests/testrunner.py | 69 ++++++++++++++++++++++++++++----
 2 files changed, 64 insertions(+), 9 deletions(-)
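
The first note of the commit message (a private TEST_DIR and SOCK_DIR for each test) can be sketched in isolation as below. split_test_dirs is a hypothetical helper, not part of the patch, and unlike the patch it returns a modified copy of the environment instead of editing it in place.

```python
import os
import tempfile
from pathlib import Path
from typing import Dict


def split_test_dirs(env: Dict[str, str], test_path: str) -> Dict[str, str]:
    """Return a copy of env whose TEST_DIR and SOCK_DIR belong to one test."""
    name = Path(test_path).name
    new_env = dict(env)
    for key in ('TEST_DIR', 'SOCK_DIR'):
        new_env[key] = os.path.join(env[key], name)
        # exist_ok=True lets a rerun of the same test reuse its directory.
        Path(new_env[key]).mkdir(parents=True, exist_ok=True)
    return new_env


if __name__ == '__main__':
    with tempfile.TemporaryDirectory() as tmp:
        base = {'TEST_DIR': os.path.join(tmp, 'scratch'),
                'SOCK_DIR': os.path.join(tmp, 'sock')}
        print(split_test_dirs(base, 'tests/qemu-iotests/041')['TEST_DIR'])
```

Two tests started with the same base environment then get distinct directories, so files and sockets with identical names cannot collide.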

diff --git a/tests/qemu-iotests/check b/tests/qemu-iotests/check
index XXXXXXX..XXXXXXX 100755
--- a/tests/qemu-iotests/check
+++ b/tests/qemu-iotests/check
@@ -XXX,XX +XXX,XX @@ def make_argparser() -> argparse.ArgumentParser:
                    help='show me, do not run tests')
     p.add_argument('-makecheck', action='store_true',
                    help='pretty print output for make check')
+    p.add_argument('-j', dest='jobs', type=int, default=1,
+                   help='run tests in multiple parallel jobs')
 
     p.add_argument('-d', dest='debug', action='store_true', help='debug')
     p.add_argument('-p', dest='print', action='store_true',
@@ -XXX,XX +XXX,XX @@ if __name__ == '__main__':
         with TestRunner(env, makecheck=args.makecheck,
                         color=args.color) as tr:
             paths = [os.path.join(env.source_iotests, t) for t in tests]
-            ok = tr.run_tests(paths)
+            ok = tr.run_tests(paths, args.jobs)
             if not ok:
                 sys.exit(1)
diff --git a/tests/qemu-iotests/testrunner.py b/tests/qemu-iotests/testrunner.py
index XXXXXXX..XXXXXXX 100644
--- a/tests/qemu-iotests/testrunner.py
+++ b/tests/qemu-iotests/testrunner.py
@@ -XXX,XX +XXX,XX @@
 import json
 import termios
 import sys
+from multiprocessing import Pool
 from contextlib import contextmanager
 from typing import List, Optional, Iterator, Any, Sequence, Dict, \
         ContextManager
@@ -XXX,XX +XXX,XX @@ def __init__(self, status: str, description: str = '',
 
 
 class TestRunner(ContextManager['TestRunner']):
+    shared_self = None
+
+    @staticmethod
+    def proc_run_test(test: str, test_field_width: int) -> TestResult:
+        # We are in a subprocess, we can't change the runner object!
+        runner = TestRunner.shared_self
+        assert runner is not None
+        return runner.run_test(test, test_field_width, mp=True)
+
+    def run_tests_pool(self, tests: List[str],
+                       test_field_width: int, jobs: int) -> List[TestResult]:
+
+        # passing self directly to Pool.starmap() just doesn't work, because
+        # it's a context manager.
+        assert TestRunner.shared_self is None
+        TestRunner.shared_self = self
+
+        with Pool(jobs) as p:
+            results = p.starmap(self.proc_run_test,
+                                zip(tests, [test_field_width] * len(tests)))
+
+        TestRunner.shared_self = None
+
+        return results
+
     def __init__(self, env: TestEnv, makecheck: bool = False,
                  color: str = 'auto') -> None:
         self.env = env
@@ -XXX,XX +XXX,XX @@ def find_reference(self, test: str) -> str:
 
         return f'{test}.out'
 
-    def do_run_test(self, test: str) -> TestResult:
+    def do_run_test(self, test: str, mp: bool) -> TestResult:
         """
         Run one test
 
         :param test: test file path
+        :param mp: if true, we are in a multiprocessing environment, use
+                   personal subdirectories for test run
+
+        Note: this method may be called from subprocess, so it does not
+        change ``self`` object in any way!
         """
 
         f_test = Path(test)
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
 
         args = [str(f_test.resolve())]
         env = self.env.prepare_subprocess(args)
+        if mp:
+            # Split test directories, so that tests running in parallel don't
+            # break each other.
+            for d in ['TEST_DIR', 'SOCK_DIR']:
+                env[d] = os.path.join(env[d], f_test.name)
+                Path(env[d]).mkdir(parents=True, exist_ok=True)
 
         t0 = time.time()
         with f_bad.open('w', encoding="utf-8") as f:
@@ -XXX,XX +XXX,XX @@ def do_run_test(self, test: str) -> TestResult:
                               casenotrun=casenotrun)
 
     def run_test(self, test: str,
-                 test_field_width: Optional[int] = None) -> TestResult:
+                 test_field_width: Optional[int] = None,
+                 mp: bool = False) -> TestResult:
         """
         Run one test and print short status
 
         :param test: test file path
         :param test_field_width: width for first field of status format
+        :param mp: if true, we are in a multiprocessing environment, don't try
+                   to rewrite things in stdout
+
+        Note: this method may be called from subprocess, so it does not
+        change ``self`` object in any way!
         """
 
         last_el = self.last_elapsed.get(test)
         start = datetime.datetime.now().strftime('%H:%M:%S')
 
         if not self.makecheck:
-            self.test_print_one_line(test=test, starttime=start,
-                                     lasttime=last_el, end='\r',
+            self.test_print_one_line(test=test,
+                                     status = 'started' if mp else '...',
+                                     starttime=start,
+                                     lasttime=last_el,
+                                     end = '\n' if mp else '\r',
                                      test_field_width=test_field_width)
 
-        res = self.do_run_test(test)
+        res = self.do_run_test(test, mp)
 
         end = datetime.datetime.now().strftime('%H:%M:%S')
         self.test_print_one_line(test=test, status=res.status,
@@ -XXX,XX +XXX,XX @@ def run_test(self, test: str,
 
         return res
 
-    def run_tests(self, tests: List[str]) -> bool:
+    def run_tests(self, tests: List[str], jobs: int = 1) -> bool:
         n_run = 0
         failed = []
         notrun = []
@@ -XXX,XX +XXX,XX @@ def run_tests(self, tests: List[str]) -> bool:
 
         test_field_width = max(len(os.path.basename(t)) for t in tests) + 2
 
-        for t in tests:
+        if jobs > 1:
+            results = self.run_tests_pool(tests, test_field_width, jobs)
+
+        for i, t in enumerate(tests):
             name = os.path.basename(t)
-            res = self.run_test(t, test_field_width=test_field_width)
+
+            if jobs > 1:
+                res = results[i]
+            else:
+                res = self.run_test(t, test_field_width)
 
             assert res.status in ('pass', 'fail', 'not run')
 
-- 
2.33.1

test_multi_co_schedule_entry() set to_schedule[id] in the final loop
iteration before terminating the coroutine.  There is a race condition
where the main thread attempts to enter the terminating or terminated
coroutine when signalling coroutines to stop:

  atomic_mb_set(&now_stopping, true);
  for (i = 0; i < NUM_CONTEXTS; i++) {
      ctx_run(i, finish_cb, NULL);  <--- enters dead coroutine!
      to_schedule[i] = NULL;
  }

Make sure only to set to_schedule[id] if this coroutine really needs to
be scheduled!
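
The ordering problem can be replayed in a small single-threaded Python model. This is only an illustration of the logic, not the C test: generators stand in for QEMU coroutines, one slot stands in for to_schedule[id], and every name below is hypothetical. The model replays one interleaving, in which the stop flag is set while the coroutine is parked and a peer then wakes it.

```python
import inspect
from typing import Generator, List, Optional

Co = Generator[None, None, None]


class Model:
    """One coroutine slot plus the global stop flag."""

    def __init__(self) -> None:
        self.now_stopping = False
        self.to_schedule: Optional[Co] = None


def make_entry(m: Model, fixed: bool) -> Co:
    me: List[Co] = []  # lets the generator refer to itself

    def entry() -> Co:
        assert m.to_schedule is None
        if not fixed:
            m.to_schedule = me[0]      # old: publish before the loop
        while not m.now_stopping:
            if fixed:
                m.to_schedule = me[0]  # new: publish only right before yielding
            yield
            assert m.to_schedule is None
            if not fixed:
                m.to_schedule = me[0]  # old: republish even if about to exit

    co = entry()
    me.append(co)
    return co


def wake(m: Model) -> None:
    """Take the published coroutine out of the slot and resume it."""
    co, m.to_schedule = m.to_schedule, None
    if co is not None:
        next(co, None)


def stale_after_stop(fixed: bool) -> bool:
    """True if the slot still names a terminated coroutine after stopping."""
    m = Model()
    next(make_entry(m, fixed), None)  # run until the first yield
    m.now_stopping = True             # main thread asks coroutines to stop
    wake(m)                           # a peer wakes the coroutine; it exits
    co = m.to_schedule
    return co is not None and inspect.getgeneratorstate(co) == inspect.GEN_CLOSED
```

With fixed=False the slot is left pointing at a finished generator, which corresponds to finish_cb entering a dead coroutine. With fixed=True the slot is empty once the coroutine has exited.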

Reported-by: "R.Nageswara Sastry" <nasastry@in.ibm.com>
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Message-id: 20171106190233.1175-1-stefanha@redhat.com
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
---
 tests/test-aio-multithread.c | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/tests/test-aio-multithread.c b/tests/test-aio-multithread.c
index XXXXXXX..XXXXXXX 100644
--- a/tests/test-aio-multithread.c
+++ b/tests/test-aio-multithread.c
@@ -XXX,XX +XXX,XX @@ static void finish_cb(void *opaque)
 static coroutine_fn void test_multi_co_schedule_entry(void *opaque)
 {
     g_assert(to_schedule[id] == NULL);
-    atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
 
     while (!atomic_mb_read(&now_stopping)) {
         int n;
 
         n = g_test_rand_int_range(0, NUM_CONTEXTS);
         schedule_next(n);
+
+        atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
         qemu_coroutine_yield();
-
         g_assert(to_schedule[id] == NULL);
-        atomic_mb_set(&to_schedule[id], qemu_coroutine_self());
     }
 }
 
-- 
2.13.6

From: Sergio Lopez <slp@redhat.com>

Commit b7a745d added a qemu_bh_cancel call to the completion function
as an optimization to prevent it from unnecessarily rescheduling itself.

This completion function is scheduled from worker_thread, after setting
the state of a ThreadPoolElement to THREAD_DONE.

This was considered to be safe, as the completion function restarts the
loop just after the call to qemu_bh_cancel. But, as this loop lacks a HW
memory barrier, the read of req->state may actually happen _before_ the
call, seeing it still as THREAD_QUEUED, and ending the completion
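function without having processed a pending TPE linked at pool->head:

         worker thread             |            I/O thread
------------------------------------------------------------------------
                                   | speculatively read req->state
req->state = THREAD_DONE;          |
qemu_bh_schedule(p->completion_bh) |
  bh->scheduled = 1;               |
                                   | qemu_bh_cancel(p->completion_bh)
                                   |   bh->scheduled = 0;
                                   | if (req->state == THREAD_DONE)
                                   |   // sees THREAD_QUEUED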

The source of the misunderstanding was that qemu_bh_cancel is now being
used by the _consumer_ rather than the producer, and therefore now needs
to have acquire semantics just like e.g. aio_bh_poll.

In some situations, if there are no other independent requests in the
same aio context that could eventually trigger the scheduling of the
completion function, the omitted TPE and all operations pending on it
will get stuck forever.

[Added Sergio's updated wording about the HW memory barrier.
--Stefan]

Signed-off-by: Sergio Lopez <slp@redhat.com>
Message-id: 20171108063447.2842-1-slp@redhat.com
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
---
 util/async.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/util/async.c b/util/async.c
index XXXXXXX..XXXXXXX 100644
--- a/util/async.c
+++ b/util/async.c
@@ -XXX,XX +XXX,XX @@ void qemu_bh_schedule(QEMUBH *bh)
  */
 void qemu_bh_cancel(QEMUBH *bh)
 {
-    bh->scheduled = 0;
+    atomic_mb_set(&bh->scheduled, 0);
 }
 
 /* This func is async.The bottom half will do the delete action at the finial
-- 
2.13.6