[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH] x86: fix map_domain_page() last resort fallback



On 12/06/2013 16:59, "Jan Beulich" <JBeulich@xxxxxxxx> wrote:

> Guests with vCPU count not divisible by 4 have unused bits in the last
> word of their inuse bitmap, and the garbage collection code therefore
> would get mislead believing that some entries were actually recoverable
> for use.
> 
> Also use an earlier established local variable in mapcache_vcpu_init()
> instead of re-calculating the value (noticed while investigating the
> generally better option of setting those overhanging bits once during
> setup - this didn't work out in a simple enough fashion because the
> mapping getting established there isn't in the current address space,
> and hence the bitmap isn't directly accessible there).
> 
> Reported-by: Konrad Wilk <konrad.wilk@xxxxxxxxxx>
> Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>

Whilst I can't argue against this as the obvious bugfix to the existing
code, I personally object to clawing back hash-table entries at all. The
size of the per-vcpu hashtable is small, and it should be perfectly possible
to always allow enough extra entries in the mapcache to always be able to
allocate an entry even when all vcpu's maphash buckets are in use.

Perhaps this is the right fix for 4.3 at this point, but in that case I am
quite inclined to simplify this down after 4.3, sidestepping the whole
issue.

 -- Keir

> --- a/xen/arch/x86/domain_page.c
> +++ b/xen/arch/x86/domain_page.c
> @@ -111,16 +111,17 @@ void *map_domain_page(unsigned long mfn)
>      idx = find_next_zero_bit(dcache->inuse, dcache->entries, dcache->cursor);
>      if ( unlikely(idx >= dcache->entries) )
>      {
> -        unsigned long accum = 0;
> +        unsigned long accum = 0, prev = 0;
>  
>          /* /First/, clean the garbage map and update the inuse list. */
>          for ( i = 0; i < BITS_TO_LONGS(dcache->entries); i++ )
>          {
> +            accum |= prev;
>              dcache->inuse[i] &= ~xchg(&dcache->garbage[i], 0);
> -            accum |= ~dcache->inuse[i];
> +            prev = ~dcache->inuse[i];
>          }
>  
> -        if ( accum )
> +        if ( accum | (prev & BITMAP_LAST_WORD_MASK(dcache->entries)) )
>              idx = find_first_zero_bit(dcache->inuse, dcache->entries);
>          else
>          {
> @@ -280,8 +281,7 @@ int mapcache_vcpu_init(struct vcpu *v)
>      if ( ents > dcache->entries )
>      {
>          /* Populate page tables. */
> -        int rc = create_perdomain_mapping(d, MAPCACHE_VIRT_START,
> -                                          d->max_vcpus *
> MAPCACHE_VCPU_ENTRIES,
> +        int rc = create_perdomain_mapping(d, MAPCACHE_VIRT_START, ents,
>                                            NIL(l1_pgentry_t *), NULL);
>  
>          /* Populate bit maps. */
> 
> 
> 



_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.