[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH linux-next v2] x86/xen/time: prefer tsc as clocksource when it is invariant


  • To: Krister Johansen <kjlx@xxxxxxxxxxxxxxxxxx>
  • From: Boris Ostrovsky <boris.ostrovsky@xxxxxxxxxx>
  • Date: Wed, 14 Dec 2022 16:46:10 -0500
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=oracle.com; dmarc=pass action=none header.from=oracle.com; dkim=pass header.d=oracle.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=WCVQxcn5xfuG2qoVHLb5zOS1maUjNeo6BQXYosGStl4=; b=jidfQfCnNYs3pK7W71p1yE+aRylvbrnJf3PJX2KSbs38DLk4sDyT9zfd2RZueAsL9Y6kP7mFhDRE3rTxk+koClx8m9vSW1qFioGj9M5e6P6lbAplD1RFWwDKaM2EO6o3Cke6lMcwRDlU6wirGjHHIZIwZjGuDNXUwKZNQK3zD7hseMyHLE5Ls/VpFd6rj7tET9FdeyNRifpsw38emzF7PbaTZcDDx1GPp7t0KPxKyXxbHqOR8UqM3q6tLAUFeymgyUtMpZJEFviW5l3aUanLzsQg+9YOMhug5G9/uZqDBbJOR2k7A1YyQpRNxTlHOJ4o/GEPOQFBouUO9iD4E4jTeQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=WY3ZTrMzZkI8UP2IDroRQBUl57Xm8GW76lISTzTQt4KZPaXDsusTiCVAR+WT/o8IgmV7VCtJBQrN3wcPnwxbaKmfcFZErHKyrdur0fxxSOJqH7JgyGLYreP4uJzwipoZkvv+NJxBeUyB5fiDTPSbmFVaNrbD+pa6tSedoxvOMviso7doqd7bfMmXRP91RYn5wpezk6OWRAcLCYkOYpRNgQ9Ai5dKtq+luAN2h2XP5Zac5NotU7mEYRcf/CzxqiMXbdF5jPVYkzFJNGOYAmwsMH9uX6XOTc63uBLggKOiyVwzh0ebaygpiBe33NAEvnvfYKi6EQ+QTSDxBOpDtkdyCQ==
  • Cc: Juergen Gross <jgross@xxxxxxxx>, Thomas Gleixner <tglx@xxxxxxxxxxxxx>, Ingo Molnar <mingo@xxxxxxxxxx>, Borislav Petkov <bp@xxxxxxxxx>, Dave Hansen <dave.hansen@xxxxxxxxxxxxxxx>, x86@xxxxxxxxxx, "H. Peter Anvin" <hpa@xxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx, linux-kernel@xxxxxxxxxxxxxxx, Marcelo Tosatti <mtosatti@xxxxxxxxxx>, Anthony Liguori <aliguori@xxxxxxxxxx>, David Reaver <me@xxxxxxxxxxxxxxx>, Brendan Gregg <brendan@xxxxxxxxx>
  • Delivery-date: Wed, 14 Dec 2022 21:47:07 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>


On 12/14/22 1:01 PM, Krister Johansen wrote:
On Tue, Dec 13, 2022 at 04:25:32PM -0500, Boris Ostrovsky wrote:
On 12/12/22 5:09 PM, Krister Johansen wrote:
On Mon, Dec 12, 2022 at 01:48:24PM -0500, Boris Ostrovsky wrote:
On 12/12/22 11:05 AM, Krister Johansen wrote:
diff --git a/arch/x86/include/asm/xen/cpuid.h b/arch/x86/include/asm/xen/cpuid.h
index 6daa9b0c8d11..d9d7432481e9 100644
--- a/arch/x86/include/asm/xen/cpuid.h
+++ b/arch/x86/include/asm/xen/cpuid.h
@@ -88,6 +88,12 @@
     *             EDX: shift amount for tsc->ns conversion
     * Sub-leaf 2: EAX: host tsc frequency in kHz
     */
+#define XEN_CPUID_TSC_EMULATED       (1u << 0)
+#define XEN_CPUID_HOST_TSC_RELIABLE  (1u << 1)
+#define XEN_CPUID_RDTSCP_INSTR_AVAIL (1u << 2)
+#define XEN_CPUID_TSC_MODE_DEFAULT   (0)
+#define XEN_CPUID_TSC_MODE_EMULATE   (1u)
+#define XEN_CPUID_TSC_MODE_NOEMULATE (2u)
This file is a copy of Xen public interface so this change should go to Xen 
first.
Ok, should I split this into a separate patch on the linux side too?
Yes. Once the Xen patch has been accepted you will either submit the same patch 
for Linux or sync Linux file with Xen (if there are more differences).
Thanks.  Based upon the feedback I received from you and Jan, I may try
to shrink the check in xen_tsc_safe_clocksource() down a bit.  In that
case, I may only need to refer to a single field in the leaf that
provides this information.  In that case, are you alright with dropping
the change to the header and referring to the value directly, or would
you prefer that I proceed with adding these to the public API?


It would certainly be appreciated if you updated the header files but it's up 
to maintainers to decide whether it's required.


+static int __init xen_tsc_safe_clocksource(void)
+{
+       u32 eax, ebx, ecx, edx;
+
+       if (!(xen_hvm_domain() || xen_pvh_domain()))
+               return 0;
+
+       if (!(boot_cpu_has(X86_FEATURE_CONSTANT_TSC)))
+               return 0;
+
+       if (!(boot_cpu_has(X86_FEATURE_NONSTOP_TSC)))
+               return 0;
+
+       if (check_tsc_unstable())
+               return 0;
+
+       cpuid(xen_cpuid_base() + 3, &eax, &ebx, &ecx, &edx);
+
+       if (eax & XEN_CPUID_TSC_EMULATED)
+               return 0;
+
+       if (ebx != XEN_CPUID_TSC_MODE_NOEMULATE)
+               return 0;
Why is the last test needed?
I was under the impression that if the mode was 0 (default) it would be
possible for the tsc to become emulated in the future, perhaps after a
migration.  The presence of the tsc_mode noemulate meant that we could
count on the falseneess of the XEN_CPUID_TSC_EMULATED check remaining
constant.
This will filter out most modern processors with TSC scaling support where in 
default mode we don't intercept RDTCS after migration. But I don't think we 
have proper interface to determine this so we don't have much choice but to 
indeed make this check.
Yes, if this remains a single boot-time check, I'm not sure that knowing
whether the processor supports tsc scaling helps us.  If tsc_mode is
default, there's always a possibility of the tsc becoming emulated later
on as part of migration, correct?


If the processor supports TSC scaling I don't think it's possible (it can 
happen in theory) but if it doesn't and you migrate to a CPU running at 
different frequency then yes, hypervisor will start emulating RDTSC.



The other thing that might be possible here is to add a background
timer that periodically checks if the tsc is still not emulated, and if
it suddenly becomes so, change the rating again to prefer the xen
clocksource.  I had written this off initially as an impractical
solution, since it seemed like a lot more mechanism and because it meant
the performance characteristics of the system would change without user
intervention.  However, if this seems like a good idea, I'm not opposed
to giving it a try.


I don't think we should do it. Having the kernel suddenly change clocksource 
will probably be somewhat of a surprise to users.


-boris




 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.