Re: [PATCH v6 01/15] xen/common: add cache coloring common code
On 03.02.2024 11:57, Carlo Nonato wrote:
> On Wed, Jan 31, 2024 at 4:57 PM Jan Beulich <jbeulich@xxxxxxxx> wrote:
>> On 29.01.2024 18:17, Carlo Nonato wrote:
>>> +Background
>>> +**********
>>> +
>>> +Cache hierarchy of a modern multi-core CPU typically has first levels dedicated
>>> +to each core (hence using multiple cache units), while the last level is shared
>>> +among all of them. Such a configuration implies that memory operations on one
>>> +core (e.g. running a DomU) are able to generate interference on another core
>>> +(e.g. hosting another DomU). Cache coloring allows eliminating this
>>> +mutual interference, and thus guaranteeing higher and more predictable
>>> +performance for memory accesses.
>>> +The key concept underlying cache coloring is a fragmentation of the memory
>>> +space into a set of sub-spaces called colors that are mapped to disjoint cache
>>> +partitions. Technically, the whole memory space is first divided into a number
>>> +of subsequent regions. Then each region is in turn divided into a number of
>>> +subsequent sub-colors. The generic i-th color is then obtained by all the
>>> +i-th sub-colors in each region.
>>> +
>>> +::
>>> +
>>> +                     Region j                    Region j+1
>>> +                .....................   ............
>>> +                .                     .            .
>>> +                .                      .
>>> +            _ _ _______________ _ _____________________ _ _
>>> +              |     |     |     |     |     |     |
>>> +              | c_0 | c_1 |     | c_n | c_0 | c_1 |
>>> +            _ _ _|_____|_____|_ _ _|_____|_____|_____|_ _ _
>>> +                  :                       :
>>> +                  :                       :...           ... .
>>> +                  :                            color 0
>>> +                  :...........................           ... .
>>> +                  :
>>> +                  . . ..................................:
>>> +
>>> +There are two pragmatic lessons to be learnt.
>>> +
>>> +1. If one wants to avoid cache interference between two domains, different
>>> +   colors need to be used for their memory.
>>> +
>>> +2. Color assignment must privilege contiguity in the partitioning. E.g.,
>>> +   assigning colors (0,1) to domain I and (2,3) to domain J is better than
>>> +   assigning colors (0,2) to I and (1,3) to J.
>>
>> I can't connect this 2nd point with any of what was said above.
>
> If colors are contiguous then a greater spatial locality is achievable. You
> mean we should better explain this?

Yes, but not just that. See how your using "must" in the text contradicts you
now suggesting this is merely an optimization.

>>> +How to compute the number of colors
>>> +***********************************
>>> +
>>> +To compute the number of available colors for a specific platform, the size of
>>> +an LLC way and the page size used by Xen must be known. The first parameter can
>>> +be found in the processor manual or can also be computed by dividing the total
>>> +cache size by the number of its ways. The second parameter is the minimum
>>> +amount of memory that can be mapped by the hypervisor,
>>
>> I find "amount of memory that can be mapped" quite confusing here. Don't you
>> really mean the granularity at which memory can be mapped?
>
> Yes that's what I wanted to describe. I'll change it.
>
>>> thus dividing the way
>>> +size by the page size, the number of total cache partitions is found. So for
>>> +example, an Arm Cortex-A53 with a 16-way associative 1 MiB LLC can isolate up
>>> +to 16 colors when pages are 4 KiB in size.
>>
>> I guess it's a matter of what one's used to, but to me talking of "way size"
>> and how the calculation is described is, well, unusual. What I would start
>> from is the smallest entity, i.e. a cache line.
>> Then it would be relevant to describe how, after removing the low so many
>> bits to cover for cache line size, the remaining address bits are used to
>> map to a particular set. It looks to me as if you're assuming that this
>> mapping is linear, using the next so many bits from the address. Afaik this
>> isn't true on various modern CPUs; instead hash functions are used. Without
>> knowing at least certain properties of such a hash function, I'm afraid your
>> mapping from address to color isn't necessarily guaranteeing the promised
>> isolation. The guarantee may hold for processors you specifically target,
>> but then I think in this description it would help if you would fully spell
>> out any assumptions you make on how hardware maps addresses to elements of
>> the cache.
>
> You're right, we are assuming a linear mapping. We are going to review and
> extend the documentation in order to fully specify when coloring can be
> applied.
>
> About the "way size": it's a way of summarizing all the parameters into one.
> We could ask for different cache parameters as you said, but in the end what
> we are interested in is how many partitions the cache is capable of isolating
> and how big they are. The answer is, in theory, as many partitions as the
> number of sets, each one as big as a cache line, because we can't have
> isolation inside a set.
> Then memory mapping comes into play and the minimum granularity at which
> mapping can happen actually lowers the number of partitions.
> To recap, we can isolate:
> nr_sets * line_size / page_size
> Then we simply named:
> way_size = nr_sets * line_size
> Another way of computing it:
> way_size = cache_size / nr_ways
>
> We are ok with having two parameters: cache_size and nr_ways, which are even
> easier and more intuitive to find for a normal user.

Right, that's the aspect I was actually after.

Jan
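
For readers following the arithmetic above, here is a minimal C sketch of the
computation the thread converges on. It assumes the linear (non-hashed)
set-index mapping that Jan points out is only an assumption, and it uses the
Cortex-A53 figures from the quoted document; the identifiers (way_size,
nr_colors, and the address-to-color expression) are illustrative only and are
not taken from Xen's actual code.

/*
 * Sketch of the color arithmetic discussed in the thread, assuming a linear
 * (non-hashed) set-index function. Constants follow the Cortex-A53 example:
 * 1 MiB, 16-way LLC, 4 KiB pages.
 */
#include <stdio.h>

#define PAGE_SIZE   4096UL          /* mapping granularity used by Xen */
#define CACHE_SIZE  (1UL << 20)     /* total LLC size: 1 MiB */
#define NR_WAYS     16UL            /* 16-way set associative */

int main(void)
{
    /* way_size = cache_size / nr_ways = nr_sets * line_size */
    unsigned long way_size  = CACHE_SIZE / NR_WAYS;

    /* Number of isolatable partitions once the page-sized mapping
     * granularity is taken into account. */
    unsigned long nr_colors = way_size / PAGE_SIZE;

    /* With a linear index function, a page's color is just its page frame
     * number modulo the number of colors. */
    unsigned long addr  = 0x40123000UL;   /* arbitrary physical address */
    unsigned long color = (addr / PAGE_SIZE) % nr_colors;

    printf("way_size = %lu KiB, nr_colors = %lu, color(%#lx) = %lu\n",
           way_size >> 10, nr_colors, addr, color);   /* 64 KiB, 16, 3 */

    return 0;
}

With these numbers the sketch yields 16 colors, matching the Cortex-A53 example
in the quoted document; on CPUs that hash the set index, the modulo expression
above would not be valid, which is exactly the caveat raised in the review.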