[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 2/2] NUMA: replace phys_to_nid()



Hi Jan,

On 13/12/2022 14:08, Jan Beulich wrote:
On 13.12.2022 14:48, Julien Grall wrote:
On 13/12/2022 12:46, Jan Beulich wrote:
On 13.12.2022 13:06, Julien Grall wrote:
On 13/12/2022 11:38, Jan Beulich wrote:
All callers convert frame numbers (perhaps in turn derived from struct
page_info pointers) to an address, just for the function to convert it
back to a frame number (as the first step of paddr_to_pdx()). Replace
the function by mfn_to_nid() plus a page_to_nid() wrapper macro. Replace
call sites by the respectively most suitable one.

While there also introduce a !NUMA stub, eliminating the need for Arm
(and potentially other ports) to carry one individually.

Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>
---
At the top of free_heap_pages() mfn_to_nid() could also be used, since
the MFN is calculated immediately ahead. The choice of using
page_to_nid() (for now at least) was with the earlier patch's RFC in
mind, addressing of which may require to make mfn_to_nid() do weaker
checking than page_to_nid().

I haven't looked in details at the previous patch. However, I don't like
the idea of making mfn_to_nid() do weaker checking because this could
easily confuse the reader/developper.

If you want to use weaker check, then it would be better if a separate
helper is provided with a name reflecting its purpose.

Well, the purpose then still is the very same conversion, so the name
is quite appropriate. I don't view mfn_to_nid_bug_dont_look_very_closely()
(exaggerating) as very sensible a name.

I understand they are both doing the same conversion. But the checks
will be different. With your proposal, we are now going to say if the
caller is "buggy" then use mfn_to_nid() if not then you can use any.

I think this is wrong to hide the "bug" just because the name is longer.
In fact, it means that any non-buggy caller will still have relaxed
check. The risk if we are going to introduce more "buggy" caller in the
future.

While I, too, have taken your perspective as one possible one, I've
also been considering a slightly different perspective: page_to_nid()
implies the caller to have a struct page_info *, which in turn implies
you pass in something identifying valid memory (which hence should have
a valid node ID associated with it). mfn_to_nid(), otoh, has nothing
to pre-qualify (see patch 1's RFC remark as to mfn_valid() not being
sufficient). Hence less rigid checking there can make sense (and you'll
notice that mfn_to_nid() was also used quite sparingly in the course of
the conversion.)

So from my perspective there are only two acceptable solutions:
    1. Provide a different helper that will be used for just "buggy"
caller. This will make super clear that the helper should only be used
in very limited circumstances.
    2. Fix the "buggy" callers.

  From your previous e-mails, it wasn't clear whether 2) is possible. So
that's leave us only with 1).

The buggy callers are the ones touched by patch 1; see (again) the RFC
remark there for limitations of that approach.

Even with what you wrote above, I still think that relaxing the check for everyone is wrong. Anyway, this patch is not changing the helper. So I will wait and see a formal proposal.


--- a/xen/common/numa.c
+++ b/xen/common/numa.c
@@ -671,15 +671,15 @@ static void cf_check dump_numa(unsigned
for_each_online_node ( i )
        {
-        paddr_t pa = pfn_to_paddr(node_start_pfn(i) + 1);
+        mfn_t mfn = _mfn(node_start_pfn(i) + 1);
printk("NODE%u start->%lu size->%lu free->%lu\n",
                   i, node_start_pfn(i), node_spanned_pages(i),
                   avail_node_heap_pages(i));
-        /* Sanity check phys_to_nid() */
-        if ( phys_to_nid(pa) != i )
-            printk("phys_to_nid(%"PRIpaddr") -> %d should be %u\n",
-                   pa, phys_to_nid(pa), i);
+        /* Sanity check mfn_to_nid() */
+        if ( node_spanned_pages(i) && mfn_to_nid(mfn) != i )


   From the commit message, I would have expected that we would only
replace phys_to_nid() with either mfn_to_nid() or page_to_nid().
However, here you added node_spanned_pages(). Can you explain why?

Oh, indeed, I meant to say a word on this but then forgot. This
simply is because the adding of 1 to the start PFN (which by
itself is imo a little funny) makes it so that the printk()
inside the conditional would be certain to be called for an
empty (e.g. CPU-only) node.

Ok. I think this wants to be a separate patch as this sounds like bug
and we should avoid mixing code conversion with bug fix.

Yet then this is only in a debug key handler. (Else I would have made
it a separate patch, yes.)

IMO, the fact it is a debug key handler doesn't matter. While I am generally OK if we do minor swapin patch modifying the behavior. I think the other way around is quite confusing. And therefore, I would rather prefer the split unless another maintainer thinks otherwise.

Cheers,

--
Julien Grall



 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.