[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [PATCH v2 1/7] x86: Introduce x86_decode_lite()
On 02.10.2024 17:27, Andrew Cooper wrote: > --- /dev/null > +++ b/xen/arch/x86/x86_emulate/decode-lite.c > @@ -0,0 +1,311 @@ > +/* SPDX-License-Identifier: GPL-2.0-only */ > + > +#include "private.h" > + > +#define Imm8 (1 << 0) > +#define Imm (1 << 1) > +#define Moffs (1 << 2) > +#define Branch (1 << 5) /* ... that we care about */ > +/* ModRM (1 << 6) */ > +#define Known (1 << 7) > + > +#define ALU_OPS \ > + (Known|ModRM), \ > + (Known|ModRM), \ > + (Known|ModRM), \ > + (Known|ModRM), \ > + (Known|Imm8), \ > + (Known|Imm) > + > +static const uint8_t init_or_livepatch_const onebyte[256] = { > + [0x00] = ALU_OPS, /* ADD */ [0x08] = ALU_OPS, /* OR */ > + [0x10] = ALU_OPS, /* ADC */ [0x18] = ALU_OPS, /* SBB */ > + [0x20] = ALU_OPS, /* AND */ [0x28] = ALU_OPS, /* SUB */ > + [0x30] = ALU_OPS, /* XOR */ [0x38] = ALU_OPS, /* CMP */ > +/* [0x40 ... 0x4f] = REX prefixes */ For these and other aspects further down, may I ask for a comment at the top of the file setting the scope for this new function (and its associated data) as being strictly 64-bit only? And perhaps even strictly covering only what Xen may legitimately use (largely excluding APX for the foreseeable future, i.e. until such time that we might decide to allow Xen itself to use APX throughout its code). Besides APX, with more immediate uses in mind, I wonder about e.g. BMI/BMI2 insns, which live outside the one/two-byte maps. I would further appreciate if we could be consistent with the mentioning (or not) of prefixes: The REX ones here are the only ones that the table mentions right now. In fact I wonder whether a Prefix attribute wouldn't be nice to have, so you don't need to open-code all of them in the function itself. > + [0x50 ... 0x5f] = (Known), /* PUSH/POP %reg */ > + > + [0x63] = (Known|ModRM), /* MOVSxd */ > + > + [0x68] = (Known|Imm), /* PUSH $imm */ > + [0x69] = (Known|ModRM|Imm), /* IMUL $imm/r/rm */ > + [0x6a] = (Known|Imm8), /* PUSH $imm8 */ > + [0x6b] = (Known|ModRM|Imm8), /* PUSH $imm8/r/rm */ IMUL? I'm also slightly irritated by $imm{,8}/r/rm - better to use commas to separate operands, and then rm is the middle one (2nd source) while r is the destination (last), if already you want AT&T operand order. > + [0x6c ... 0x6f] = (Known), /* INS/OUTS */ > + > + [0x70 ... 0x7f] = (Known|Branch|Imm8), /* Jcc disp8 */ > + [0x80] = (Known|ModRM|Imm8), /* Grp1 */ > + [0x81] = (Known|ModRM|Imm), /* Grp1 */ > + > + [0x83] = (Known|ModRM|Imm8), /* Grp1 */ > + [0x84 ... 0x8e] = (Known|ModRM), /* TEST/XCHG/MOV/MOV-SREG/LEA > r/rm */ > + > + [0x90 ... 0x99] = (Known), /* NOP/XCHG rAX/CLTQ/CQTO */ Omitting PUSH (at 0x8f) is somewhat odd, considering that it's a pretty "normal" insn. > + [0x9b ... 0x9f] = (Known), /* FWAIT/PUSHF/POPF/SAHF/LAHF */ > + > + [0xa0 ... 0xa3] = (Known|Moffs), /* MOVABS */ > + [0xa4 ... 0xa7] = (Known), /* MOVS/CMPS */ > + [0xa8] = (Known|Imm8), /* TEST %al */ > + [0xa9] = (Known|Imm), /* TEST %al */ %rAX? > + [0xaa ... 0xaf] = (Known), /* STOS/LODS/SCAS */ > + [0xb0 ... 0xb7] = (Known|Imm8), /* MOV $imm8, %reg */ > + [0xb8 ... 0xbf] = (Known|Imm), /* MOV $imm{16,32,64}, %reg */ > + [0xc0 ... 0xc1] = (Known|ModRM|Imm8), /* Grp2 (ROL..SAR $imm8, %reg) */ > + > + [0xc3] = (Known), /* RET */ With the absence of Branch here I think it would be nice if at least the description went into a little more detail as to the comment higher up saying "... that we care about". > + [0xc6] = (Known|ModRM|Imm8), /* Grp11, Further ModRM decode */ > + [0xc7] = (Known|ModRM|Imm), /* Grp11, Further ModRM decode */ > + > + [0xcb ... 0xcc] = (Known), /* LRET/INT3 */ > + [0xcd] = (Known|Imm8), /* INT $imm8 */ No IRET, when you have things like e.g. ICEBP and CLTS? > + [0xd0 ... 0xd3] = (Known|ModRM), /* Grp2 (ROL..SAR {$1,%cl}, %reg) > */ > + > + [0xe4 ... 0xe7] = (Known|Imm8), /* IN/OUT $imm8 */ > + [0xe8 ... 0xe9] = (Known|Branch|Imm), /* CALL/JMP disp32 */ > + [0xeb] = (Known|Branch|Imm8), /* JMP disp8 */ > + [0xec ... 0xef] = (Known), /* IN/OUT %dx */ > + > + [0xf1] = (Known), /* ICEBP */ > + [0xf4] = (Known), /* HLT */ > + [0xf5] = (Known), /* CMC */ > + [0xf6 ... 0xf7] = (Known|ModRM), /* Grp3, Further ModRM decode */ > + [0xf8 ... 0xfd] = (Known), /* CLC ... STD */ > + [0xfe ... 0xff] = (Known|ModRM), /* Grp4 */ > +}; > +static const uint8_t init_or_livepatch_const twobyte[256] = { > + [0x00 ... 0x03] = (Known|ModRM), /* Grp6/Grp7/LAR/LSL */ > + [0x06] = (Known), /* CLTS */ > + [0x09] = (Known), /* WBINVD */ > + [0x0b] = (Known), /* UD2 */ > + > + [0x18 ... 0x1f] = (Known|ModRM), /* Grp16 (Hint Nop) */ > + [0x20 ... 0x23] = (Known|ModRM), /* MOV %cr/%dr */ > + > + [0x30 ... 0x33] = (Known), /* WRMSR/RDTSC/RDMSR/RDPMC */ > + > + [0x40 ... 0x4f] = (Known|ModRM), /* CMOVcc */ > + > + [0x80 ... 0x8f] = (Known|Branch|Imm), /* Jcc disp32 */ > + [0x90 ... 0x9f] = (Known|ModRM), /* SETcc */ > + > + [0xa0 ... 0xa2] = (Known), /* PUSH/POP %fs/CPUID */ > + [0xa3] = (Known|ModRM), /* BT r/rm */ > + [0xa4] = (Known|ModRM|Imm8), /* SHLD $imm8 */ > + [0xa5] = (Known|ModRM), /* SHLD %cl */ > + > + [0xa8 ... 0xa9] = (Known), /* PUSH/POP %gs */ > + > + [0xab] = (Known|ModRM), /* BTS */ > + [0xac] = (Known|ModRM|Imm8), /* SHRD $imm8 */ > + [0xad ... 0xaf] = (Known|ModRM), /* SHRD %cl/Grp15/IMUL */ > + > + [0xb0 ... 0xb9] = (Known|ModRM), /* > CMPXCHG/LSS/BTR/LFS/LGS/MOVZxx/POPCNT/Grp10 */ Grp10 is kind of odd - I think UD1 would be more concise, despite the SDM indeed using group 10 there. > + [0xba] = (Known|ModRM|Imm8), /* Grp8 */ > + [0xbb ... 0xbf] = (Known|ModRM), /* BSR/BSF/BSR/MOVSX */ 0xbb is BTC I think. > + [0xc0 ... 0xc1] = (Known|ModRM), /* XADD */ > + [0xc7] = (Known|ModRM), /* Grp9 */ > + [0xc8 ... 0xcf] = (Known), /* BSWAP */ With UD1 and UD2, perhaps UD0 would make sense to also have in the table? > +}; > + > +/* > + * Bare minimum x86 instruction decoder to parse the alternative replacement > + * instructions and locate the IP-relative references that may need updating. > + * > + * These are: > + * - disp8/32 from near branches > + * - RIP-relative memory references > + * > + * The following simplifications are used: > + * - All code is 64bit, and the instruction stream is safe to read. > + * - The 67 prefix is not implemented, so the address size is only 64bit. As to the earlier remark - maybe this part of the comment could simply move to the top of the file? > + * Inputs: > + * @ip The position to start decoding from. > + * @end End of the replacement block. Exceeding this is considered an > error. > + * > + * Returns: x86_decode_lite_t > + * - On failure, length of 0. > + * - On success, length > 0. For rel_sz > 0, rel points at the relative > + * field in the instruction stream. > + */ > +x86_decode_lite_t init_or_livepatch x86_decode_lite(void *ip, void *end) Imo both pointers would be nice to be to-const. > +{ > + void *start = ip, *rel = NULL; > + unsigned int opc, rel_sz = 0; > + uint8_t b, d, rex = 0, osize = 4; > + > +#define OPC_TWOBYTE (1 << 8) > + > + /* Mutates IP, uses END. */ > +#define FETCH(ty) \ > + ({ \ > + ty _val; \ > + \ > + if ( (ip + sizeof(ty)) > end ) \ > + goto overrun; \ > + _val = *(ty *)ip; \ > + ip += sizeof(ty); \ > + _val; \ > + }) > + > + for ( ;; ) /* Prefixes */ > + { > + switch ( b = FETCH(uint8_t) ) > + { > + case 0x26: /* ES override */ > + case 0x2e: /* CS override */ > + case 0x36: /* DS override */ > + case 0x3e: /* SS override */ > + case 0x64: /* FS override */ > + case 0x65: /* GS override */ If you don't like the idea of a Prefix attribute I wonder in how far we actually need all of the above, when you already ... > + case 0xf0: /* LOCK */ > + case 0xf2: /* REPNE */ > + case 0xf3: /* REP */ > + break; > + > + case 0x66: /* Operand size override */ > + osize = 2; > + break; > + > + /* case 0x67: Address size override, not implemented */ ... deliberately leave of this one. > + case 0x40 ... 0x4f: /* REX */ > + rex = b; > + continue; > + > + default: > + goto prefixes_done; > + } > + rex = 0; /* REX cancelled by subsequent legacy prefix. */ > + } > + prefixes_done: > + > + if ( rex & 0x08 ) /* REX.W */ Can you please use REX_W here? > + osize = 8; > + > + /* Fetch the main opcode byte(s) */ > + if ( b == 0x0f ) > + { > + b = FETCH(uint8_t); > + opc = OPC_TWOBYTE | b; > + > + d = twobyte[b]; > + } > + else > + { > + opc = b; > + d = onebyte[b]; > + } > + > + if ( unlikely(!(d & Known)) ) > + goto unknown; > + > + if ( d & ModRM ) > + { > + uint8_t modrm = FETCH(uint8_t); > + uint8_t mod = modrm >> 6; > + uint8_t reg = (modrm >> 3) & 7; > + uint8_t rm = modrm & 7; > + > + /* ModRM/SIB decode */ > + if ( mod == 0 && rm == 5 ) /* RIP relative */ > + { > + rel = ip; > + rel_sz = 4; > + FETCH(uint32_t); > + } > + else if ( mod != 3 && rm == 4 ) /* SIB */ > + { > + uint8_t sib = FETCH(uint8_t); > + uint8_t base = sib & 7; > + > + if ( mod == 0 && base == 5 ) > + goto disp32; > + } > + > + if ( mod == 1 ) /* disp8 */ > + FETCH(uint8_t); > + else if ( mod == 2 ) /* disp32 */ > + { > + disp32: > + FETCH(uint32_t); The values aren't used, so the types don't matter overly much, yet int8_t and int32_t would be a more accurate representation of what's being fetched. > + } > + > + /* ModRM based decode adjustements */ > + switch ( opc ) > + { > + case 0xc7: /* Grp11 XBEGIN is a branch. */ > + if ( modrm == 0xf8 ) > + d |= Branch; > + break; > + case 0xf6: /* Grp3 TEST(s) have extra Imm8 */ > + if ( reg == 0 || reg == 1 ) > + d |= Imm8; > + break; > + case 0xf7: /* Grp3 TEST(s) have extra Imm */ > + if ( reg == 0 || reg == 1 ) > + d |= Imm; > + break; > + } In this switch() you don't distinguish 1- and 2-byte maps at all. This is an issue in particular for 0xc7. Blank lines between case blocks would also be nice, as this switch() is going to only ever grow. > + } > + > + if ( d & Branch ) > + { > + /* > + * We don't tolerate 66-prefixed call/jmp in alternatives. Some are > + * genuinely decoded differently between Intel and AMD CPUs. > + * > + * We also don't support APX instructions, so don't have to cope with > + * JMPABS which is the first branch to have an 8-byte immediate. > + */ > + if ( osize < 4 ) > + goto bad_osize; > + > + rel = ip; > + rel_sz = (d & Imm8) ? 1 : 4; > + } > + > + if ( d & (Imm | Imm8 | Moffs) ) > + { > + if ( d & Imm8 ) > + osize = 1; > + else if ( d & Moffs ) > + osize = 8; > + else if ( osize == 8 && !(opc >= 0xb8 && opc <= 0xbf) ) Again want to also take the opcode map into account, even if - by luck - this would work as is for now. Also could I talk you into converting the two comparisons into one, along the lines of "(opc | 7) != 0xbf"? > --- a/xen/arch/x86/x86_emulate/private.h > +++ b/xen/arch/x86/x86_emulate/private.h > @@ -9,7 +9,9 @@ > #ifdef __XEN__ > > # include <xen/bug.h> > +# include <xen/init.h> > # include <xen/kernel.h> > +# include <xen/livepatch.h> > # include <asm/endbr.h> > # include <asm/msr-index.h> > # include <asm/x86-vendors.h> It's only the new file that needs these - can we limit the dependencies to just that one by putting these new #include-s there? Jan
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |