Re: Hypercall fault injection (Was [PATCH 0/3] xen/domain: More structured teardown)
On 22/12/2020 10:00, Jan Beulich wrote:
> On 21.12.2020 20:36, Andrew Cooper wrote:
>> Hello,
>>
>> We have some very complicated hypercalls - createdomain, with max_vcpus
>> a close second - with immense complexity and very hard-to-test error
>> handling.
>>
>> It is no surprise that the error handling is riddled with bugs.
>>
>> Random failures from core functions are one way, but I'm not sure that
>> will be especially helpful.  In particular, we'd need a way to exclude
>> "dom0 critical" operations so we've got a usable system to run testing on.
>>
>> As an alternative, how about adding a fault_ttl field into the hypercall?
>>
>> The exact paths taken in {domain,vcpu}_create() are sensitive to the
>> hardware, Xen Kconfig, and other parameters passed into the
>> hypercall(s).  The testing logic doesn't really want to care about what
>> failed; simply that the error was handled correctly.
>>
>> So a test for this might look like:
>>
>>     cfg = { ... };
>>     while ( xc_create_domain(xch, cfg) < 0 )
>>         cfg.fault_ttl++;
>>
>> The pros of this approach are that, for a specific build of Xen on a
>> piece of hardware, it ought to check every failure path in
>> domain_create(), until the ttl finally gets higher than the number of
>> fail-able actions required to construct a domain.  Also, the test
>> doesn't need changing as the complexity of domain_create() changes.
>>
>> The main con will most likely be the invasiveness of code in Xen, but
>> I suppose any fault injection is going to be invasive to a certain extent.
> While I like the idea in principle, the innocent looking
>
>     cfg = { ... };
>
> is quite a bit of a concern here as well: Depending on the precise
> settings, paths taken in the hypervisor may heavily vary, and hence
> such a test will only end up being useful if it covers a wide
> variety of settings.  Even if the number of tests to execute turned
> out to still be manageable today, it may quickly turn out not
> sufficiently scalable as we add new settings controllable right at
> domain creation (which I understand is the plan).

Well - there are two aspects here.

First, 99% of all VMs in practice are one of 3 or 4 configurations.  An
individual configuration is O(n) time complexity to test with fault_ttl,
depending on the size of Xen's logic, and we absolutely want to be able
to test these deterministically and to completion.

For the plethora of other configurations, I agree that it is infeasible
to test them all.  However, a hypercall like this is easy to wire up
into a fuzzing harness.

TBH, I was thinking of something like
https://github.com/intel/kernel-fuzzer-for-xen-project with a PVH Xen
and XTF "dom0" poking just this hypercall.  All the other complicated
bits of wiring AFL up appear to have been done.

Perhaps when we exhaust that as a source of bugs, we move on to fuzzing
the L0 Xen, because running on native will give it more paths to
explore.  We'd need some way of reporting path/trace data back to AFL
in dom0, which might require a bit of plumbing.

~Andrew
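For illustration, a minimal sketch of how a fault_ttl countdown might be
threaded through the create path.  The struct name, the
should_inject_fault() helper, and the particular failure points are
assumptions made for the example, not existing Xen code:

    #include <errno.h>
    #include <stdbool.h>
    #include <stdint.h>

    /*
     * Hypothetical per-call fault-injection state.  In Xen this would
     * live in the createdomain hypercall arguments or a local create
     * context.  fault_ttl == 0 means fault injection is disabled.
     */
    struct create_ctx {
        uint32_t fault_ttl;
    };

    /* Return true exactly once: on the fault_ttl'th fail-able action. */
    static bool should_inject_fault(struct create_ctx *ctx)
    {
        if ( !ctx->fault_ttl )
            return false;

        return --ctx->fault_ttl == 0;
    }

    /*
     * Illustrative domain_create()-shaped function: each fail-able step
     * is guarded, so walking fault_ttl up from 1 exercises each error
     * path in turn, and a sufficiently large ttl lets the call succeed.
     */
    static int fake_domain_create(struct create_ctx *ctx)
    {
        int rc;

        if ( should_inject_fault(ctx) )
        {
            rc = -ENOMEM;   /* simulate e.g. struct domain allocation failing */
            goto fail;
        }

        if ( should_inject_fault(ctx) )
        {
            rc = -ENOMEM;   /* simulate e.g. vcpu allocation failing */
            goto fail;
        }

        return 0;

     fail:
        /* Unwind whatever was set up before the injected failure. */
        return rc;
    }

Under these assumptions, the userspace loop from the mail simply walks
fault_ttl upwards until creation finally succeeds, having forced and
unwound each failure point along the way once.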