From nobody Mon Apr 29 15:09:40 2024
Subject: [PATCH v2 1/3] x86/time: change initiation of the calibration timer
From: Jan Beulich
To: "xen-devel@lists.xenproject.org"
Cc: Andrew Cooper, Wei Liu,
 Roger Pau Monné, Claudemir Todo Bom
Date: Mon, 1 Feb 2021 13:42:35 +0100

Setting the timer a second (EPOCH) into the future at a random point
during boot (prior to bringing up APs and prior to launching Dom0) does
not yield predictable results: The timer may expire while we're still
bringing up APs (too early) or when Dom0 already boots (too late).

Instead invoke the timer handler function explicitly at a predictable
point in time, once we've established the rendezvous function to use
(and hence also once all APs are online). This will, through the raising
and handling of TIMER_SOFTIRQ, then also have the effect of arming the
timer.

Signed-off-by: Jan Beulich
Acked-by: Roger Pau Monné

--- a/xen/arch/x86/time.c
+++ b/xen/arch/x86/time.c
@@ -854,9 +854,7 @@ static void resume_platform_timer(void)
 
 static void __init reset_platform_timer(void)
 {
-    /* Deactivate any timers running */
     kill_timer(&plt_overflow_timer);
-    kill_timer(&calibration_timer);
 
     /* Reset counters and stamps */
     spin_lock_irq(&platform_timer_lock);
@@ -1956,19 +1954,13 @@ static void __init reset_percpu_time(voi
     t->stamp.master_stime = t->stamp.local_stime;
 }
 
-static void __init try_platform_timer_tail(bool late)
+static void __init try_platform_timer_tail(void)
 {
     init_timer(&plt_overflow_timer, plt_overflow, NULL, 0);
     plt_overflow(NULL);
 
     platform_timer_stamp = plt_stamp64;
     stime_platform_stamp = NOW();
-
-    if ( !late )
-        init_percpu_time();
-
-    init_timer(&calibration_timer, time_calibration, NULL, 0);
-    set_timer(&calibration_timer, NOW() + EPOCH);
 }
 
 /* Late init function, after all cpus have booted */
@@ -2009,10 +2001,13 @@ static int __init verify_tsc_reliability
             time_calibration_rendezvous_fn = time_calibration_nop_rendezvous;
 
             /* Finish platform timer switch. */
-            try_platform_timer_tail(true);
+            try_platform_timer_tail();
 
             printk("Switched to Platform timer %s TSC\n",
                    freq_string(plt_src.frequency));
+
+            time_calibration(NULL);
+
             return 0;
         }
     }
@@ -2033,6 +2028,8 @@ static int __init verify_tsc_reliability
          !boot_cpu_has(X86_FEATURE_TSC_RELIABLE) )
         time_calibration_rendezvous_fn = time_calibration_tsc_rendezvous;
 
+    time_calibration(NULL);
+
     return 0;
 }
 __initcall(verify_tsc_reliability);
@@ -2048,7 +2045,11 @@ int __init init_xen_time(void)
     do_settime(get_wallclock_time(), 0, NOW());
 
     /* Finish platform timer initialization. */
-    try_platform_timer_tail(false);
+    try_platform_timer_tail();
+
+    init_percpu_time();
+
+    init_timer(&calibration_timer, time_calibration, NULL, 0);
 
     /*
      * Setup space to track per-socket TSC_ADJUST values. Don't fiddle with

From nobody Mon Apr 29 15:09:40 2024
Subject: [PATCH v2 2/3] x86/time: adjust time recording time_calibration_tsc_rendezvous()
From: Jan Beulich
To: "xen-devel@lists.xenproject.org"
Cc: Andrew Cooper, Wei Liu, Roger Pau Monné, Claudemir Todo Bom
Message-ID: <26b71f94-d1c7-d906-5b2a-4e7994d6f7c0@suse.com>
Date: Mon, 1 Feb 2021 13:43:04 +0100

The (stime,tsc) tuple is the basis for extrapolation by get_s_time().
Therefore the two better get taken as close to one another as possible.
This means two things: First, reading platform time is too early when
done on the first iteration. The closest we can get is on the last
iteration, immediately before telling other CPUs to write their TSCs
(and then also writing CPU0's).
While at first glance it may seem not overly relevant when exactly
platform time is read (when assuming that only stime is ever relevant
anywhere, and hence the association with the precise TSC values is of
lower interest), both CPU frequency changes and the effects of SMT make
it unpredictable (between individual rendezvous instances) how long the
loop iterations will take. This will in turn lead to a higher error than
necessary in how close to linear stime movement we can get.

Second, re-reading the TSC for local recording is increasing the overall
error as well, when we already know a more precise value - the one just
written.

Signed-off-by: Jan Beulich
Acked-by: Roger Pau Monné
---
v2: New.

--- a/xen/arch/x86/time.c
+++ b/xen/arch/x86/time.c
@@ -1662,11 +1662,12 @@ struct calibration_rendezvous {
 };
 
 static void
-time_calibration_rendezvous_tail(const struct calibration_rendezvous *r)
+time_calibration_rendezvous_tail(const struct calibration_rendezvous *r,
+                                 uint64_t tsc)
 {
     struct cpu_time_stamp *c = &this_cpu(cpu_calibration);
 
-    c->local_tsc = rdtsc_ordered();
+    c->local_tsc = tsc;
     c->local_stime = get_s_time_fixed(c->local_tsc);
     c->master_stime = r->master_stime;
 
@@ -1691,11 +1692,11 @@ static void time_calibration_tsc_rendezv
             while ( atomic_read(&r->semaphore) != (total_cpus - 1) )
                 cpu_relax();
 
-            if ( r->master_stime == 0 )
-            {
-                r->master_stime = read_platform_stime(NULL);
+            if ( r->master_tsc_stamp == 0 )
                 r->master_tsc_stamp = rdtsc_ordered();
-            }
+            else if ( i == 0 )
+                r->master_stime = read_platform_stime(NULL);
+
             atomic_inc(&r->semaphore);
 
             if ( i == 0 )
@@ -1720,7 +1721,7 @@ static void time_calibration_tsc_rendezv
         }
     }
 
-    time_calibration_rendezvous_tail(r);
+    time_calibration_rendezvous_tail(r, r->master_tsc_stamp);
 }
 
 /* Ordinary rendezvous function which does not modify TSC values. */
@@ -1745,7 +1746,7 @@ static void time_calibration_std_rendezv
         smp_rmb(); /* receive signal /then/ read r->master_stime */
     }
 
-    time_calibration_rendezvous_tail(r);
+    time_calibration_rendezvous_tail(r, rdtsc_ordered());
 }
 
 /*

From nobody Mon Apr 29 15:09:40 2024
Subject: [PATCH v2 3/3] x86/time: don't move TSC backwards in time_calibration_tsc_rendezvous()
From: Jan Beulich
To: "xen-devel@lists.xenproject.org"
Cc: Andrew Cooper, Wei Liu, Roger Pau Monné, Claudemir Todo Bom
Message-ID: <80d05abb-4d53-3229-8326-21d79e32dfe4@suse.com>
Date: Mon, 1 Feb 2021 13:43:28 +0100

While moving the TSC backwards by small amounts may be okay, the
unconditional use of CPU0's value here has been found to be a problem
when the boot time TSC of the BSP was behind that of all APs by more
than a second. In particular because of get_s_time_fixed() producing
insane output when the calculated delta is negative, we can't allow this
to happen.

On the first iteration have all other CPUs sort out the highest TSC
value any one of them has read. On the second iteration, if that maximum
is higher than CPU0's, update its recorded value from that taken in the
first iteration. Use the resulting value on the last iteration to write
everyone's TSCs.

To account for the possible discontinuity, have
time_calibration_rendezvous_tail() record the newly written value, but
extrapolate local stime using the value read.

Reported-by: Claudemir Todo Bom
Signed-off-by: Jan Beulich
---
v2: Don't update r->master_stime by calculation. Re-base over new
    earlier patch. Make time_calibration_rendezvous_tail() take two TSC
    values.
---
Since CPU0 reads its TSC last on the first iteration, if TSCs were
perfectly sync-ed there shouldn't ever be a need to update.
However, even on the TSC-reliable system I first tested this on (using
"tsc=skewed" to get this rendezvous function into use in the first
place) updates by up to several thousand clocks did happen. I wonder
whether this points at some problem with the approach that I'm not (yet)
seeing.

Considering the sufficiently modern CPU it's using, I suspect the
reporter's system wouldn't even need to turn off TSC_RELIABLE, if only
there wasn't the boot time skew. Hence another approach might be to fix
this boot time skew. Of course to recognize whether the TSCs then still
aren't in sync we'd need to run tsc_check_reliability() sufficiently
long after that adjustment. That comes on top of the need for this
"fixing" to be precise enough for the TSCs to not look skewed anymore
afterwards.

As per the comment ahead of it, the original purpose of the function was
to deal with TSCs halted in deep C states. While this probably explains
why only forward moves were ever expected, I don't see how this could
have been reliable in case CPU0 was deep-sleeping for a sufficiently
long time. My only guess here is a hidden assumption of CPU0 never being
idle for long enough.
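
For illustration only (not part of the patch): the lock-free "raise the
shared maximum" step applied to max_tsc_stamp could be sketched in
standalone C11 roughly as below. This is a minimal sketch under stated
assumptions: it uses <stdatomic.h> instead of Xen's cmpxchg(), and
max_stamp and record_max() are hypothetical names that do not exist in
the tree.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical stand-in for r->max_tsc_stamp; zero-initialized. */
static _Atomic uint64_t max_stamp;

/*
 * Raise max_stamp to at least 'val'. The stored value is never lowered,
 * so concurrent callers may arrive in any order.
 */
static void record_max(uint64_t val)
{
    uint64_t cur = atomic_load(&max_stamp);

    /*
     * A failed compare-exchange reloads 'cur' with the value currently
     * stored; retry only while 'val' is still the larger of the two.
     */
    while ( val > cur &&
            !atomic_compare_exchange_weak(&max_stamp, &cur, val) )
        continue;
}
```

Each participant would call record_max() with its own reading; a reading
below the stored maximum leaves it untouched, which is the property the
rendezvous relies on to avoid moving any TSC backwards.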
--- a/xen/arch/x86/time.c
+++ b/xen/arch/x86/time.c
@@ -1658,17 +1658,17 @@ struct calibration_rendezvous {
     cpumask_t cpu_calibration_map;
     atomic_t semaphore;
     s_time_t master_stime;
-    u64 master_tsc_stamp;
+    uint64_t master_tsc_stamp, max_tsc_stamp;
 };
 
 static void
 time_calibration_rendezvous_tail(const struct calibration_rendezvous *r,
-                                 uint64_t tsc)
+                                 uint64_t old_tsc, uint64_t new_tsc)
 {
     struct cpu_time_stamp *c = &this_cpu(cpu_calibration);
 
-    c->local_tsc = tsc;
-    c->local_stime = get_s_time_fixed(c->local_tsc);
+    c->local_tsc = new_tsc;
+    c->local_stime = get_s_time_fixed(old_tsc ?: new_tsc);
     c->master_stime = r->master_stime;
 
     raise_softirq(TIME_CALIBRATE_SOFTIRQ);
@@ -1683,6 +1683,7 @@ static void time_calibration_tsc_rendezv
     int i;
     struct calibration_rendezvous *r = _r;
     unsigned int total_cpus = cpumask_weight(&r->cpu_calibration_map);
+    uint64_t tsc = 0;
 
     /* Loop to get rid of cache effects on TSC skew. */
     for ( i = 4; i >= 0; i-- )
@@ -1692,8 +1693,15 @@ static void time_calibration_tsc_rendezv
             while ( atomic_read(&r->semaphore) != (total_cpus - 1) )
                 cpu_relax();
 
-            if ( r->master_tsc_stamp == 0 )
-                r->master_tsc_stamp = rdtsc_ordered();
+            if ( tsc == 0 )
+                r->master_tsc_stamp = tsc = rdtsc_ordered();
+            else if ( r->master_tsc_stamp < r->max_tsc_stamp )
+                /*
+                 * We want to avoid moving the TSC backwards for any CPU.
+                 * Use the largest value observed anywhere on the first
+                 * iteration.
+                 */
+                r->master_tsc_stamp = r->max_tsc_stamp;
             else if ( i == 0 )
                 r->master_stime = read_platform_stime(NULL);
 
@@ -1712,6 +1720,16 @@ static void time_calibration_tsc_rendezv
             while ( atomic_read(&r->semaphore) < total_cpus )
                 cpu_relax();
 
+            if ( tsc == 0 )
+            {
+                uint64_t cur;
+
+                tsc = rdtsc_ordered();
+                while ( tsc > (cur = r->max_tsc_stamp) )
+                    if ( cmpxchg(&r->max_tsc_stamp, cur, tsc) == cur )
+                        break;
+            }
+
             if ( i == 0 )
                 write_tsc(r->master_tsc_stamp);
 
@@ -1719,9 +1737,12 @@ static void time_calibration_tsc_rendezv
             while ( atomic_read(&r->semaphore) > total_cpus )
                 cpu_relax();
         }
+
+        /* Just in case a read above ended up reading zero. */
+        tsc += !tsc;
     }
 
-    time_calibration_rendezvous_tail(r, r->master_tsc_stamp);
+    time_calibration_rendezvous_tail(r, tsc, r->master_tsc_stamp);
 }
 
 /* Ordinary rendezvous function which does not modify TSC values. */
@@ -1746,7 +1767,7 @@ static void time_calibration_std_rendezv
         smp_rmb(); /* receive signal /then/ read r->master_stime */
     }
 
-    time_calibration_rendezvous_tail(r, 0, rdtsc_ordered());
+    time_calibration_rendezvous_tail(r, 0, rdtsc_ordered());
 }
 
 /*