[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [PATCH v2 3/3] xen/riscv: update mfn calculation in pt_mapping_level()
On 03.02.2025 14:12, Oleksii Kurochko wrote: > --- a/xen/arch/riscv/include/asm/page.h > +++ b/xen/arch/riscv/include/asm/page.h > @@ -55,6 +55,22 @@ > #define PTE_SMALL BIT(10, UL) > #define PTE_POPULATE BIT(11, UL) > > +/* > + * In the case when modifying or destroying a mapping, it is necessary to > + * search until a leaf node is found, instead of searching for a page table > + * entry based on the precalculated `level` and `order` (look at > pt_update()). > + * This is because when `mfn` == INVALID_MFN, the `mask`(in > pt_mapping_level()) > + * will take into account only `vfn`, which could accidentally return an > + * incorrect level, leading to the discovery of an incorrect page table > entry. > + * > + * For example, if `vfn` is page table level 1 aligned, but it was mapped as > + * page table level 0, then pt_mapping_level() will return `level` = 1, > + * since only `vfn` (which is page table level 1 aligned) is taken into > account > + * when `mfn` == INVALID_MFN (look at pt_mapping_level()). > + */ > + > +#define PTE_LEAF_SEARCH BIT(12, UL) Is it intended for callers outside of pt.c to make use of this? If not, it better wouldn't be globally exposed. Furthermore, this isn't a property of the PTE(s) to be created, so is likely wrong to mix with PTE_* flags. (PTE_POPULATE is on the edge of also falling in this category, btw.) Perhaps ... > --- a/xen/arch/riscv/pt.c > +++ b/xen/arch/riscv/pt.c > @@ -187,11 +187,10 @@ static int pt_next_level(bool alloc_tbl, pte_t **table, > unsigned int offset) > > /* Update an entry at the level @target. */ > static int pt_update_entry(mfn_t root, vaddr_t virt, > - mfn_t mfn, unsigned int target, > + mfn_t mfn, unsigned int *target, ... you instead want to have callers of this function preset *target to a special value (e.g. UINT_MAX or CONFIG_PAGING_LEVELS) indicating the level is wanted as an output. > @@ -205,39 +204,48 @@ static int pt_update_entry(mfn_t root, vaddr_t virt, > bool alloc_tbl = !mfn_eq(mfn, INVALID_MFN) || (flags & PTE_POPULATE); > pte_t pte, *entry; > > - /* convenience aliases */ > - DECLARE_OFFSETS(offsets, virt); > - > - table = map_table(root); > - for ( ; level > target; level-- ) > + if ( flags & PTE_LEAF_SEARCH ) > { > - rc = pt_next_level(alloc_tbl, &table, offsets[level]); > - if ( rc == XEN_TABLE_MAP_NOMEM ) > + entry = pt_walk(virt, target); > + BUG_ON(!pte_is_mapping(*entry)); Is this really necessarily a bug? Can't one want to determine how deep the (populated) page tables are for a given VA? Hmm, here I can see why you have pt_walk() return a pointer. As per the comment on the earlier patch, I don't think this is a good idea. You may want to have static pte_t *_pt_walk(...) { ... } pte_t pt_walk(...) { return *_pt_walk(...); } > @@ -345,9 +353,6 @@ static int pt_mapping_level(unsigned long vfn, mfn_t mfn, > unsigned long nr, > return level; > > /* > - * Don't take into account the MFN when removing mapping (i.e > - * MFN_INVALID) to calculate the correct target order. > - * > * `vfn` and `mfn` must be both superpage aligned. > * They are or-ed together and then checked against the size of > * each level. You drop part of the comment without altering the code being commented. What's the deal? > @@ -415,19 +420,33 @@ static int pt_update(vaddr_t virt, mfn_t mfn, > > spin_lock(&pt_lock); > > - while ( left ) > + /* look at the comment above the definition of PTE_LEAF_SEARCH */ > + if ( mfn_eq(mfn, INVALID_MFN) && !(flags & PTE_POPULATE) ) > { > - unsigned int order, level; > + flags |= PTE_LEAF_SEARCH; > + } For readability I think it would be better if the figure braces were dropped. > - level = pt_mapping_level(vfn, mfn, left, flags); > - order = XEN_PT_LEVEL_ORDER(level); > + while ( left ) > + { > + unsigned int order = 0, level; > > - ASSERT(left >= BIT(order, UL)); > + if ( !(flags & PTE_LEAF_SEARCH) ) > + { > + level = pt_mapping_level(vfn, mfn, left, flags); > + order = XEN_PT_LEVEL_ORDER(level); > + ASSERT(left >= BIT(order, UL)); Assignment to order and assertion are ... > + } > > - rc = pt_update_entry(root, vfn << PAGE_SHIFT, mfn, level, flags); > + rc = pt_update_entry(root, vfn << PAGE_SHIFT, mfn, &level, flags); > if ( rc ) > break; > > + if ( flags & PTE_LEAF_SEARCH ) > + { > + order = XEN_PT_LEVEL_ORDER(level); > + ASSERT(left >= BIT(order, UL)); > + } ... repeated here, with neither left nor order being passed into pt_update_entry(). Does this really need doing twice? (I have to admit that I have trouble determining what the assertion is about. For order alone it clearly could be done centrally after the call.) Jan
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |