[PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky

Thomas Huth posted 1 patch 1 year ago
Patches applied successfully (tree, apply log)
git fetch https://github.com/patchew-project/qemu tags/patchew/20231114153019.295131-1-thuth@redhat.com
Maintainers: Pavel Dovgalyuk <pavel.dovgaluk@ispras.ru>, Paolo Bonzini <pbonzini@redhat.com>, Cleber Rosa <crosa@redhat.com>, "Philippe Mathieu-Daudé" <philmd@linaro.org>, Wainer dos Santos Moschetta <wainersm@redhat.com>, Beraldo Leal <bleal@redhat.com>
tests/avocado/replay_kernel.py | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
[PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky
Posted by Thomas Huth 1 year ago
It's failing very often, so don't run this by default anymore
until it gets fixed.

Signed-off-by: Thomas Huth <thuth@redhat.com>
---
 tests/avocado/replay_kernel.py | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/tests/avocado/replay_kernel.py b/tests/avocado/replay_kernel.py
index a18610542e..53cb7e5091 100644
--- a/tests/avocado/replay_kernel.py
+++ b/tests/avocado/replay_kernel.py
@@ -81,7 +81,8 @@ def run_rr(self, kernel_path, kernel_command_line, console_pattern,
         logger.info('replay overhead {:.2%}'.format(t2 / t1 - 1))
 
 class ReplayKernelNormal(ReplayKernelBase):
-    @skipIf(os.getenv('GITLAB_CI'), 'Running on GitLab')
+
+    @skipUnless(os.getenv('QEMU_TEST_FLAKY_TESTS'), 'Test sometimes gets stuck')
     def test_x86_64_pc(self):
         """
         :avocado: tags=arch:x86_64
-- 
2.41.0
Re: [PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky
Posted by Philippe Mathieu-Daudé 1 year ago
On 14/11/23 16:30, Thomas Huth wrote:
> It's failing very often, so don't run this by default anymore
> until it gets fixed.
> 
> Signed-off-by: Thomas Huth <thuth@redhat.com>
> ---
>   tests/avocado/replay_kernel.py | 3 ++-
>   1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/tests/avocado/replay_kernel.py b/tests/avocado/replay_kernel.py
> index a18610542e..53cb7e5091 100644
> --- a/tests/avocado/replay_kernel.py
> +++ b/tests/avocado/replay_kernel.py
> @@ -81,7 +81,8 @@ def run_rr(self, kernel_path, kernel_command_line, console_pattern,
>           logger.info('replay overhead {:.2%}'.format(t2 / t1 - 1))
>   
>   class ReplayKernelNormal(ReplayKernelBase):
> -    @skipIf(os.getenv('GITLAB_CI'), 'Running on GitLab')
> +
> +    @skipUnless(os.getenv('QEMU_TEST_FLAKY_TESTS'), 'Test sometimes gets stuck')
>       def test_x86_64_pc(self):
>           """
>           :avocado: tags=arch:x86_64

Hmm I suspect https://gitlab.com/qemu-project/qemu/-/issues/1961 which
has a fix ready:
https://lore.kernel.org/qemu-devel/20231110170831.185001-1-richard.henderson@linaro.org/
Re: [PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky
Posted by Thomas Huth 1 year ago
On 14/11/2023 17.36, Philippe Mathieu-Daudé wrote:
> On 14/11/23 16:30, Thomas Huth wrote:
>> It's failing very often, so don't run this by default anymore
>> until it gets fixed.
>>
>> Signed-off-by: Thomas Huth <thuth@redhat.com>
>> ---
>>   tests/avocado/replay_kernel.py | 3 ++-
>>   1 file changed, 2 insertions(+), 1 deletion(-)
>>
>> diff --git a/tests/avocado/replay_kernel.py b/tests/avocado/replay_kernel.py
>> index a18610542e..53cb7e5091 100644
>> --- a/tests/avocado/replay_kernel.py
>> +++ b/tests/avocado/replay_kernel.py
>> @@ -81,7 +81,8 @@ def run_rr(self, kernel_path, kernel_command_line, 
>> console_pattern,
>>           logger.info('replay overhead {:.2%}'.format(t2 / t1 - 1))
>>   class ReplayKernelNormal(ReplayKernelBase):
>> -    @skipIf(os.getenv('GITLAB_CI'), 'Running on GitLab')
>> +
>> +    @skipUnless(os.getenv('QEMU_TEST_FLAKY_TESTS'), 'Test sometimes gets 
>> stuck')
>>       def test_x86_64_pc(self):
>>           """
>>           :avocado: tags=arch:x86_64
> 
> Hmm I suspect https://gitlab.com/qemu-project/qemu/-/issues/1961 which
> has a fix ready:
> https://lore.kernel.org/qemu-devel/20231110170831.185001-1-richard.henderson@linaro.org/

I just gave it a try, too, and it does not fix the issue for me.

Is this test still working for anybody?

  Thomas


Re: [PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky
Posted by Peter Maydell 1 year ago
On Tue, 14 Nov 2023 at 15:30, Thomas Huth <thuth@redhat.com> wrote:
>
> It's failing very often, so don't run this by default anymore
> until it gets fixed.
>
> Signed-off-by: Thomas Huth <thuth@redhat.com>
> ---
>  tests/avocado/replay_kernel.py | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
>
> diff --git a/tests/avocado/replay_kernel.py b/tests/avocado/replay_kernel.py
> index a18610542e..53cb7e5091 100644
> --- a/tests/avocado/replay_kernel.py
> +++ b/tests/avocado/replay_kernel.py
> @@ -81,7 +81,8 @@ def run_rr(self, kernel_path, kernel_command_line, console_pattern,
>          logger.info('replay overhead {:.2%}'.format(t2 / t1 - 1))
>
>  class ReplayKernelNormal(ReplayKernelBase):
> -    @skipIf(os.getenv('GITLAB_CI'), 'Running on GitLab')
> +
> +    @skipUnless(os.getenv('QEMU_TEST_FLAKY_TESTS'), 'Test sometimes gets stuck')
>      def test_x86_64_pc(self):
>          """
>          :avocado: tags=arch:x86_64

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

We should replace all of the "skip if GITLAB_CI" checks
with "skip on QEMU_TEST_FLAKY_TESTS" (or other check as
appropriate), I think.

thanks
-- PMM
Re: [PATCH] tests/avocado/replay_kernel: Mark the test_x86_64_pc as flaky
Posted by Thomas Huth 1 year ago
On 14/11/2023 17.22, Peter Maydell wrote:
> On Tue, 14 Nov 2023 at 15:30, Thomas Huth <thuth@redhat.com> wrote:
>>
>> It's failing very often, so don't run this by default anymore
>> until it gets fixed.
>>
>> Signed-off-by: Thomas Huth <thuth@redhat.com>
>> ---
>>   tests/avocado/replay_kernel.py | 3 ++-
>>   1 file changed, 2 insertions(+), 1 deletion(-)
>>
>> diff --git a/tests/avocado/replay_kernel.py b/tests/avocado/replay_kernel.py
>> index a18610542e..53cb7e5091 100644
>> --- a/tests/avocado/replay_kernel.py
>> +++ b/tests/avocado/replay_kernel.py
>> @@ -81,7 +81,8 @@ def run_rr(self, kernel_path, kernel_command_line, console_pattern,
>>           logger.info('replay overhead {:.2%}'.format(t2 / t1 - 1))
>>
>>   class ReplayKernelNormal(ReplayKernelBase):
>> -    @skipIf(os.getenv('GITLAB_CI'), 'Running on GitLab')
>> +
>> +    @skipUnless(os.getenv('QEMU_TEST_FLAKY_TESTS'), 'Test sometimes gets stuck')
>>       def test_x86_64_pc(self):
>>           """
>>           :avocado: tags=arch:x86_64
> 
> Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
> 
> We should replace all of the "skip if GITLAB_CI" checks
> with "skip on QEMU_TEST_FLAKY_TESTS" (or other check as
> appropriate), I think.

Yes ... the question is whether they just time out because gitlab is too 
slow (in that case we should increase the timeout), or whether they are 
really flaky ... needs some time for investigation, I guess.

  Thomas