[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: IRQ latency measurements in hypervisor
On 15/01/2021 23:41, Stefano Stabellini wrote: >>>>> This is very interestingi too. Did you get any spikes with the >>>>> period >>>>> set to 100us? It would be fantastic if there were none. >>>>> >>>>>> 3. Huge latency spike during domain creation. I conducted some >>>>>> additional tests, including use of PV drivers, but this didn't >>>>>> affected the latency in my "real time" domain. But attempt to >>>>>> create another domain with relatively large memory size of 2GB >>>>>> led >>>>>> to huge spike in latency. Debugging led to this call path: >>>>>> >>>>>> XENMEM_populate_physmap -> populate_physmap() -> >>>>>> alloc_domheap_pages() -> alloc_heap_pages()-> huge >>>>>> "for ( i = 0; i < (1 << order); i++ )" loop. >>>> There are two for loops in alloc_heap_pages() using this syntax. Which >>>> one are your referring to? >>> I did some tracing with Lautrebach. It pointed to the first loop and >>> especially to flush_page_to_ram() call if I remember correctly. >> Thanks, I am not entirely surprised because we are clean and invalidating the >> region line by line and across all the CPUs. >> >> If we are assuming 128 bytes cacheline, we will need to issue 32 cache >> instructions per page. This going to involve quite a bit of traffic on the >> system. > I think Julien is most likely right. It would be good to verify this > with an experiment. For instance, you could remove the > flush_page_to_ram() call for one test and see if you see any latency > problems. > > >> One possibility would be to defer the cache flush when the domain is created >> and use the hypercall XEN_DOMCTL_cacheflush to issue the flush. >> >> Note that XEN_DOMCTL_cacheflush would need some modification to be >> preemptible. But at least, it will work on a GFN which is easier to track. > > This looks like a solid suggestion. XEN_DOMCTL_cacheflush is already > used by the toolstack in a few places. > > I am also wondering if we can get away with fewer flush_page_to_ram() > calls from alloc_heap_pages() for memory allocations done at boot time > soon after global boot memory scrubbing. I'm pretty sure there is room to improve Xen's behaviour in general, by not scrubbing pages already known to be zero. As far as I'm aware, there are improvements which never got completed when lazy scrubbing was added, and I think it is giving us a hit on x86, where we don't even have to do any cache maintenance on the side. ~Andrew
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |