Xen project Mailing List

Re: [Xen-devel] [RFC PATCH] docs: add README.atomic

To: Andre Przywara <andre.przywara@xxxxxxx>

From: Stefano Stabellini <sstabellini@xxxxxxxxxx>

Date: Tue, 13 Jun 2017 15:21:19 -0700 (PDT)

Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Julien Grall <julien.grall@xxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Jan Beulich <JBeulich@xxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx

Delivery-date: Tue, 13 Jun 2017 22:21:26 +0000

Dmarc-filter: OpenDMARC Filter v1.3.2 mail.kernel.org 4EC2323698

List-id: Xen developer discussion <xen-devel.lists.xen.org>

On Tue, 13 Jun 2017, Andre Przywara wrote: > Recently there were some discussions about the nature and guarantees of > the atomic primitives that Xen provides. > This README.atomic file tries to document our expectations in those > functions and macros. > > Signed-off-by: Andre Przywara <andre.przywara@xxxxxxx> > --- > Hi, > > as mentioned in my previous mail, I consider this more of a discussion > base that an actual patch. I am by no means an expert in this area, so > part of this exercise here is to write down my understanding and see it > corrected by more knowledgable people ;-) > > Cheers, > Andre. > > docs/README.atomic | 116 > +++++++++++++++++++++++++++++++++++++++++++++++++++++ > 1 file changed, 116 insertions(+) > create mode 100644 docs/README.atomic > > diff --git a/docs/README.atomic b/docs/README.atomic > new file mode 100644 > index 0000000..57ec59b > --- /dev/null > +++ b/docs/README.atomic > @@ -0,0 +1,116 @@ > +Atomic operations in Xen > +======================== > + > +Data structures in Xen memory which can be accessed by multiple CPUs > +at the same time need to be protected against getting corrupted. > +The easiest way to do this is using a lock (spinlock in Xen's case), > +that guarantees that only one CPU can access that memory at a given point > +in time, also allows protecting whole data structures against becoming > +inconsistent. For most use cases this should be the way to go and programmers > +should stop reading here. > + > +However sometimes taking and releasing a lock is too costly or creates > +deadlocks or potential contention, so some lockless accesses are used. > +Those atomic accesses need to be done very carefully to be correct. > + > +Xen offers three kinds of atomic primitives that deal with those accesses: > + > +ACCESS_ONCE() > +------------- > +A macro basically casting the access to a volatile data type. That prevents > +the compiler from accessing that memory location multiple times, effectively > +caching this value (either in a register or on the stack). > +In terms of atomicity this prevents inconsistent values for a certain shared > +variable if used multiple times across the same function, as the compiler is > +normally free to re-read the value from memory at any time - given that it > +doesn't know that another entity might change this value. Consider the > +following code: > +=========== > + int x = shared_pointer->counter; > + > + some_var = x + something_else; > + function_call(x); > +=========== > +The compiler is free to actually *not* use a local variable, instead > derefence > +the pointer and directly access the shared memory twice when using the value > +of "x" in the assignment and in the function call. Now if some other CPU > +changes the value of "counter" meanwhile, the value of "x" is different, > +which violates the program semantic. The compiler is not to blame here, > +because it cannot know that this memory could change behind its back. > +The solution here is to use ACCESS_ONCE, which casts the access with the > +"volatile" keyword, thus making sure that the compiler knows that > +accessing this memory has side effects, so it needs to cache the value: > +=========== > + int x = ACCESS_ONCE(shared_pointer->counter); > + > + some_var = x + something_else; > + function_call(x); > +=========== > + > +What ACCESS_ONCE does *not* guarantee though is this access is done in a > +single instruction, so complex or non-native or unaligned data types are > +not guaranteed to be atomic. If for instance counter would be a 64-bit value > +on a 32-bit system, the compiler would probably generate two load > instructions, > +which could end up in reading a wrong value if some other CPU changes the > other > +half of the variable in between those two reads. > +However accessing _aligned and native_ data types is guaranteed to be atomic > +in the architectures supported by Xen, so ACCESS_ONCE is safe to use when > +these conditions are met. > +We expect a variable to be aligned if it comes from a non-packed struct or > +some other compiler-generated address, as sane compilers will not generate > +unaligned accesses by default. > +Extra care must be taken however if the address is coming from a crafted > +pointer or some address passed in by a non-trusted source (guests). > + > +read_atomic()/write_atomic() > +---------------------------- > +read_atomic() and write_atomic() are macros that make sure that the access > +to this variable happens using a single machine instruction. This guarantees > +the access to be atomic when the address is aligned (on all architectures > +supported by Xen). > +For most practical cases the generated code does not differ from what > +the compiler would generate anyway, but it makes sure that a change to an > +unsuitable data type would break compilation, also it annotates this access > +as being potentially concurrent (to a human reader). > +Also this macro does not check whether the address is actually aligned, > +though it is assumed that the compiler only generates aligned addresses > +unless being told otherwise explicitly. In the latter case it would be the > +responsibility of the coder to ensure atomicity using other means. > + > +atomic_read()/atomic_write() (and other variants starting with "atomic_") > +------------------------------------------------------------------------- > +(Not to be confused with the above!) > +Those (group of) functions work on a special atomic_t data type, which wraps > +an "int" in a structure to avoid accidential messing with the data type > +(for instance due to implicit casts). As a side effect this guarantees that > +this variable is aligned (though this would apply to most other "int" > +declarations as well). This special data type also makes sure that any > +accesses are only valid using those special accessor functions, which > +make sure that the atomic property is always preserved. > +On top of the basic read/write accesses this also provides read-modify-write > +primitives which use architecture specific means to guarantee atomic accesses > +(atomic_inc(), atomic_dec_and_test(), ...). > + > + > +It has to be noted that following the letters of the C standard a > +standards-compliant compiler has quite some freedom to generate code which > +would make a lot of accesses non-atomic (for instance splitting a 32-bit > +read into a series of four 8-bit reads). > +However a sane compiler, especially when following a certain ABI, would > +for instance always try to align variables and use as few machine > +instructions as possible, so for practical purposes most accesses are > +actually atomic without further ado, especially when being confined to > +certain architectures (like x86, ARM and ARM64 in Xen). > + > +So for practical purposes we assume a sane compiler to be used for Xen, > +with the following properties: > +- Compiler generated addresses for native-data-typed variables are aligned. > +- Simple read/write accesses to a native and aligned data type from compiler > + generated addresses are done using a single machine instruction. > + > +This makes read and write accesses to ints and longs (and their respective > +unsigned counter parts) naturally atomic. > +However it would be beneficial to use atomic primitives anyway to annotate > +the code as being concurrent and to prevent silent breakage when changing > +the code. Modern compilers tend to optimize quite well, often resulting in > +the same code to be generated for both the annotated and naive version. By "atomic primitives" you mean atomic_read()/atomic_write() or read_atomic()/write_atomic(), correct? If so, it makes a lot of sense to me. Thanks for writing this down! _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx https://lists.xen.org/xen-devel

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.