[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2 02/17] xsm/silo: Support hardware & xenstore domains


  • To: Jan Beulich <jbeulich@xxxxxxxx>
  • From: Jason Andryuk <jason.andryuk@xxxxxxx>
  • Date: Wed, 30 Jul 2025 17:05:59 -0400
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 165.204.84.17) smtp.rcpttodomain=suse.com smtp.mailfrom=amd.com; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=amd.com; dkim=none (message not signed); arc=none (0)
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=Gi9tgzhIs4miN1EK+9aVL2wZ6Sk+LH/xIyL3np/Wr8g=; b=Dj7iQEvtJyfWusFtUIXLB73s0Y1hbnC3M8FKvcVrniHCT6FWbSFJHu0qgZ91Z6THEU54p1TNBBP05qL6mA/GJTFzPIrj+thYLiRmV2GsU52OpyQdb44e1wQULBUWZXUoNHpsqyDxBB9ZvpcLGSYiB4DeC3LCy4tyuaiA2BtkfKmwR/S1swcNUk4TDIXQXmm5M58DRKg1JYqEgBzV5KCTVl9aOAVQwZO5+woDArXeJtK0R7t9zBbUA07ANXRSrfJN+d+I+g7zgBjaJVnXR+UM1u4MigX/X5bqfUaZd85kRSulOF1xdfp3tEBu70vmmsqdVFMwa1FC27WGuYtcHXaSVQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=sJ6AHKVwajdeW1QiVIhLPEhp7LKs07Wg+3KJMUQvMNmCOzVMQqfY1R0Y17bsZlfau7Bodv4+VW9gL0tXt9mDMYdiCtGWh9EEM02Xh3ZMtlWCkP1+lJSWtzUr1Hc56OYAAVQu9CDlmq5M9MNwgh3VRHCSdZdAZdG4RTlqLVhWShSyeKR255qCsHcxUi89mpJoXNOVBtqhoD8XS/+PD3DPCCvDpfO11LCWqevYPjctJWXK+ptgBQgpxF++V5Fb3zMYYIzLEo7e9QGihnuFDqTO4D/7dK1abkD/cyuIhEat1LB8E8b8p6n1EvC+sjFpHXi6/ZPWPzwCQOdFQ0InuDwxqg==
  • Cc: "Daniel P. Smith" <dpsmith@xxxxxxxxxxxxxxxxxxxx>, <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Wed, 30 Jul 2025 21:06:16 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 2025-07-30 06:17, Jan Beulich wrote:
On 16.07.2025 23:14, Jason Andryuk wrote:
In a disaggregated environment, dom0 is split into Control, Hardware,
and Xenstore domains, along with domUs.

Here we are with terminology again. In a truly disaggregated env, yet
more (service) domains would come into play. What you mean here is
only coarse disaggregation, as you're trying to get away without using
Flask.

If disaggregation only means, fine grain disaggregation, then I'm not sure how to differentiate coarse. I could write "split control and hardware domain".

  The is_control_domain() check
is not sufficient to handle all these cases.  Add is_priv_domain() to
support allowing for the various domains.

The purpose of SILO mode is to prevent domUs from interacting with each
other.  But dom0 was allowed to communicate with domUs to provide
services.

To provide xenstore connections, the Xenstore domain must be allowed to
connect via grants and event channels.  Xenstore domain must also be
allowed to connect to Control and Hardware to provide xenstore to them.

Hardware domain will provide PV devices to domains, so it must be
allowed to connect to domains.

That leaves Control.  Xenstore and Hardware would already allow access
to Control, so it can obtain services that way.  Control should be
"privileged", which would mean it can make the connections.  But with
Xenstore and Hardware providing their services to domUs, there may not
be a reason to allow Control to use grants or event channels with domUs.

"may not be" is too weak for my taste to forbid such.

I can't come up with a concrete example of why Control needs to directly communicate with a domU. Originally I allowed it, but it was your previous feedback which made me remove Control.

I don't have a strong opinion on the handling of Control. I can see it argued either way.

This silo check is for grants, event channels and argo.  The dummy
policy handles other calls, so Hardware is prevented from foreign
mapping Control's memory with that.

By "foreign mapping" you only mean what would result in p2m_foreign
entries? But grant mapping is okay?

Yes, "foreign mapping" = p2m_foreign.

Using grants requires explicit actions on both sides. silo_mode_dom_check() passing still requires action by both sides to establish a communication channel.

This is different from a foreign mapping, which is a unilateral action by the privileged side.

My intent was to highlight that allowing hardware domain to pass the silo_mode_dom_check() does not grant additional is_privilege permissions.

@@ -29,8 +40,8 @@ static bool silo_mode_dom_check(const struct domain *ldom,
  {
      const struct domain *currd = current->domain;
- return (is_control_domain(currd) || is_control_domain(ldom) ||
-            is_control_domain(rdom) || ldom == rdom);
+    return (is_priv_domain(currd) || is_priv_domain(ldom) ||
+            is_priv_domain(rdom) || ldom == rdom);
  }

IOW we're turning by 180°? Interesting ...

While the previous code is written "is_control_domain()" its use in silo mode is really "is_dom0()".

(Working on this, I've been thinking about how dom0 is like the god Janus with two faces. A single entity with two faces. We use is_control_domain() and is_hardware_domain(), but often it is just a different name for the single dom0)

As stated above, domU <-> Xenstore and domU <-> Hardware (for PV devices) are needed to allow those services. So while it looks like a 180°, maybe is_control_domain() was the wrong name for dom0?

Regards,
Jason



 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.