binutils-gdb

Author	SHA1	Message	Date
H.J. Lu	d871f3f483	x86: Add CpuCMOV and CpuFXSR There are separate CPUID feature bits for fxsave/fxrstor and cmovCC instructions. This patch adds CpuCMOV and CpuFXSR to replace Cpu686 on corresponding instructions. gas/ * config/tc-i386.c (cpu_arch): Add .cmov and .fxsr. (cpu_noarch): Add nocmov and nofxsr. * doc/c-i386.texi: Document cmov and fxsr. opcodes/ * i386-gen.c (cpu_flag_init): Add CpuCMOV and CpuFXSR to CPU_I686_FLAGS. Add CPU_CMOV_FLAGS, CPU_FXSR_FLAGS, CPU_ANY_CMOV_FLAGS and CPU_ANY_FXSR_FLAGS. (cpu_flags): Add CpuCMOV and CpuFXSR. * i386-opc.tbl: Replace Cpu686 with CpuFXSR on fxsave, fxsave64, fxrstor and fxrstor64. Replace Cpu686 with CpuCMOV on cmovCC. * i386-init.h: Regenerated. * i386-tbl.h: Likewise.	2018-08-11 14:37:32 -07:00
Jan Beulich	e968fc9b63	x86: fold RegEip/RegRip and RegEiz/RegRiz This allows to simplify the code in a number of places.	2018-08-06 08:34:36 +02:00
Jan Beulich	dbf8be89ed	x86: drop NoRex64 from {,v}pmov{s,z}x* They're pointless with IgnoreSize also specified, and even more so when no Qword operand exists.	2018-08-03 09:30:58 +02:00
Jan Beulich	c48dadc9a8	x86: drop "mem" operand type attribute No template specifies this bit, so there's no point recording it in the templates. Use a flags[] bit instead.	2018-08-03 09:30:02 +02:00
Jan Beulich	1424ad8677	x86: also optimize KXOR{D,Q} and KANDN{D,Q} These can be converted to 2-byte VEX encoding when both source registers are the same, by using KXORW / KANDNW as replacement.	2018-07-31 10:58:05 +02:00
Jan Beulich	ae2387feae	x86: fold various AVX512 templates with so far differing Masking attributes There's no insn allowing ZEROING_MASKING alone. Re-purpose its value for handling the not uncommon case of insns allowing either form of masking with register operands, but only merging masking with a memory operand.	2018-07-31 10:57:09 +02:00
Jan Beulich	6ff00b5e12	x86/Intel: correct permitted operand sizes for AVX512 scatter/gather AVX gather insns correctly allow the element size to be specified rather than the full vector size. Make AVX512 ones match.	2018-07-31 10:55:17 +02:00
Jan Beulich	e951d5ca3d	x86: drop CpuVREX It is fully redundant with CpuAVX512F.	2018-07-31 10:52:37 +02:00
H.J. Lu	4a1b91eabb	x86: Expand Broadcast to 3 bits Expand Broadcast to 3 bits so that the number of bytes to broadcast can be computed as 1 << (Broadcast - 1). Use it to simplify x86 assembler. gas/ * config/tc-i386.c (Broadcast_Operation): Add bytes. (build_evex_prefix): Use i.broadcast->bytes. (match_broadcast_size): New function. (check_VecOperands): Use the broadcast field to compute the number of bytes to broadcast directly. Set i.broadcast->bytes. Use match_broadcast_size. opcodes/ * i386-gen.c (adjust_broadcast_modifier): New function. (process_i386_opcode_modifier): Add an argument for operands. Adjust the Broadcast value based on operands. (output_i386_opcode): Pass operand_types to process_i386_opcode_modifier. (process_i386_opcodes): Pass NULL as operands to process_i386_opcode_modifier. * i386-opc.h (BYTE_BROADCAST): New. (WORD_BROADCAST): Likewise. (DWORD_BROADCAST): Likewise. (QWORD_BROADCAST): Likewise. (i386_opcode_modifier): Expand broadcast to 3 bits. * i386-tbl.h: Regenerated.	2018-07-25 15:28:24 -07:00
Jan Beulich	4174bfff8a	x86-64: correct AVX512F vcvtsi2s{d,s} handling Just like for their AVX counterparts and CVTSI2S{D,S}, a memory source here is ambiguous and hence - in source files should be qualified with a suitable suffix or operand size specifier (not doing so is an error in Intel mode, and will gain a diagnostic in AT&T mode in the future), - in disassembly should be properly suffixed (the Intel operand size specifiers were emitted correctly already).	2018-07-24 09:46:27 +02:00
Jan Beulich	5f32791e1e	x86: fold narrowing VCVT* templates When multiple (here: two) forms of an insn take different width inputs but produce identical size outputs (here: RegXMM), the templates can be combined. Also drop IgnoreSize (and the now redundant size specifiers) wherever applicable.	2018-07-19 08:36:19 +02:00
Jan Beulich	625cbd7ac1	x86: fold VFPCLASSP{D,S} templates These are special because they may not have a register operand to derive the vector length from, which requires to also deal with the braodcast case when determining vector length in build_evex_prefix(). Also drop IgnoreSize (and the now redundant size specifiers) from their suffixed counterparts.	2018-07-19 08:35:38 +02:00
Jan Beulich	86b15c3204	x86: fold various AVX512* templates	2018-07-19 08:34:45 +02:00
Jan Beulich	cf769ed505	x86: fold various AVX512DQ templates	2018-07-19 08:34:01 +02:00
Jan Beulich	8282b7ad0d	x86: fold various AVX512BW templates	2018-07-19 08:32:17 +02:00
Jan Beulich	755908cce5	x86: fold various AVX512CD templates	2018-07-19 08:31:24 +02:00
Jan Beulich	7091c61201	x86: fold various AVX512VL templates into their AVX512F counterparts	2018-07-19 08:29:35 +02:00
H.J. Lu	11a322db5c	x86: Split vcvtps2{,u}qq and vcvttps2{,u}qq After commit `1b54b8d7e4` Author: Jan Beulich <jbeulich@novell.com> Date: Mon Dec 18 09:36:14 2017 +0100 x86: fold RegXMM/RegYMM/RegZMM into RegSIMD ... qualified by their respective sizes, allowing to drop FirstXmm0 at the same time. folded RegXMM, RegYMM and RegZMM into RegSIMD, it's no longer impossible to distinguish if Xmmword can represent a memory reference when operand specification contains SIMD register. For example, template operands specification like these RegXMM\|...\|Xmmword\|... and RegXMM\|... The Xmmword bitfield is always set by RegXMM which is represented by "RegSIMD\|Xmmword". This patch splits each of vcvtps2qq, vcvtps2uqq, vcvttps2qq and vcvttps2uqq into 2 templates: one template only has RegXMM source operand and the other only has mempry source operand. gas/ PR gas/23418 * testsuite/gas/i386/xmmword.s: Add tests for vcvtps2qq, vcvtps2uqq, vcvttps2qq and vcvttps2uqq. * testsuite/gas/i386/xmmword.l: Updated. opcodes/ PR gas/23418 * i386-opc.h (Byte): Update comments. (Word): Likewise. (Dword): Likewise. (Fword): Likewise. (Qword): Likewise. (Tbyte): Likewise. (Xmmword): Likewise. (Ymmword): Likewise. (Zmmword): Likewise. * i386-opc.tbl: Split vcvtps2qq, vcvtps2uqq, vcvttps2qq and vcvttps2uqq. * i386-tbl.h: Regenerated.	2018-07-18 05:33:50 -07:00
Jan Beulich	a79eaed6a5	x86: adjust monitor/mwait templates Architecturally, MONITOR's and MONITORX'es memory operand is a 16- or 32-bit register outside of 64-bit mode, and a 64- or 32-bit register inside 64-bit mode. The other register operands, including all of them for MWAIT and MWAITX, are uniformly 32-bit, irrespective of mode. Retain the original 64-bit MONITOR{,X} templates for compatibility only, and fold the MWAIT{,X} ones.	2018-07-11 10:30:00 +02:00
Jan Beulich	7f5cad3047	x86/Intel: accept memory operand size specifiers for CET insns	2018-07-11 10:25:40 +02:00
Jan Beulich	733bd0abdd	x86: fold MOV to/from segment register templates First of all there's no point in having separate Cpu386 templates - the respective SReg3 registers can't be specified for pre-386 anyway; see parse_real_register(). And then we can also make use of D here for the memory forms of the insn. This cannot be done for the non-64bit GPR forms because of the IgnoreSize that cannot be dropped from the to-SREG variant.	2018-06-01 08:41:16 +02:00
Jan Beulich	dfd27d4183	x86: don't emit REX.W for SLDT and STR Just like for other selector register reads, they're unnecessary and should hence be avoided.	2018-06-01 08:40:38 +02:00
Jan Beulich	6479571075	x86/Intel: accept "oword ptr" for INVPCID The insn is no different in this reagrd from INVEPT and INVVPID.	2018-06-01 08:37:24 +02:00
H.J. Lu	c0a30a9f0a	Enable Intel MOVDIRI, MOVDIR64B instructions gas/ * config/tc-i386.c (cpu_arch): Add .movdir, .movdir64b. (cpu_noarch): Likewise. (process_suffix): Add check for register size. * doc/c-i386.texi: Document movdiri, movdir64b. * testsuite/gas/i386/i386.exp: Run MOVDIR{I,64B} tests. * testsuite/gas/i386/movdir-intel.d: New file. * testsuite/gas/i386/movdir.d: Likewise. * testsuite/gas/i386/movdir.s: Likewise. * testsuite/gas/i386/movdir64b-reg.s: Likewise. * testsuite/gas/i386/movdir64b-reg.l: Likewise. * testsuite/gas/i386/x86-64-movdir-intel.d: Likewise. * testsuite/gas/i386/x86-64-movdir.d: Likewise. * testsuite/gas/i386/x86-64-movdir.s: Likewise. * testsuite/gas/i386/x86-64-movdir64b-reg.s: Likewise. * testsuite/gas/i386/x86-64-movdir64b-reg.l: Likewise. opcodes/ * i386-dis.c (Gva): New. (enum): Add PREFIX_0F38F8, PREFIX_0F38F9, MOD_0F38F8_PREFIX_2, MOD_0F38F9_PREFIX_0. (prefix_table): New instructions (see prefix above). (mod_table): New instructions (see prefix above). (OP_G): Handle va_mode. * i386-gen.c (cpu_flag_init): Add CPU_MOVDIRI_FLAGS, CPU_MOVDIR64B_FLAGS. (cpu_flags): Add CpuMOVDIRI and CpuMOVDIR64B. * i386-opc.h (enum): Add CpuMOVDIRI, CpuMOVDIR64B. (i386_cpu_flags): Add cpumovdiri and cpumovdir64b. * i386-opc.tbl: Add movidir{i,64b}. * i386-init.h: Regenerated. * i386-tbl.h: Likewise.	2018-05-07 16:57:48 -07:00
Igor Tsimbalist	aa17843739	Revert "Enable Intel MOVDIRI, MOVDIR64B instructions." This reverts commit `a914a7c958`.	2018-04-27 14:34:13 +02:00
Igor Tsimbalist	a914a7c958	Enable Intel MOVDIRI, MOVDIR64B instructions. gas/ * config/tc-i386.c (cpu_arch): Add .movdir, .movdir64b. (cpu_noarch): Likewise. (process_suffix): Add check for register size. * doc/c-i386.texi: Document movdiri, movdir64b. * testsuite/gas/i386/i386.exp: Run MOVDIR{I,64B} tests. * testsuite/gas/i386/movdir-intel.d: New test. * testsuite/gas/i386/movdir.d: Likewise. * testsuite/gas/i386/movdir.s: Likewise. * testsuite/gas/i386/movdir64b-reg.s: Likewise. * testsuite/gas/i386/movdir64b-reg.l: Likewise. * testsuite/gas/i386/x86-64-movdir-intel.d: Likewise. * testsuite/gas/i386/x86-64-movdir.d: Likewise. * testsuite/gas/i386/x86-64-movdir.s: Likewise. * testsuite/gas/i386/x86-64-movdir64b-reg.s: Likewise. * testsuite/gas/i386/x86-64-movdir64b-reg.l: Likewise. opcodes/ * i386-dis.c (enum): Add PREFIX_0F38F8, PREFIX_0F38F9. (prefix_table): New instructions (see prefix above). Add Gva macro and handling in OP_G. * i386-gen.c (cpu_flag_init): Add CPU_MOVDIRI_FLAGS, CPU_MOVDIR64B_FLAGS. (cpu_flags): Likewise. (opcode_modifiers): Add AddrPrefixOpReg. (i386_opcode_modifier): Likewise. * i386-opc.h (enum): Add CpuMOVDIRI, CpuMOVDIR64B. (i386_cpu_flags): Likewise. * i386-opc.tbl: Add movidir{i,64b}. * i386-init.h: Regenerate. * i386-tbl.h: Likewise.	2018-04-26 23:34:04 +02:00
Jan Beulich	e2195274d4	x86: fold various non-memory operand AVX512VL templates There's little point carrying up to three templates per insn flavor when the sole difference is operand size and the dependency on AVX512VL being enabled. Instead the need for AVX512VL can be derived from an operand allowing for ZMMword as well as one or both or XMMword and YMMword (irrespective of whether this is a register or memory operand). Without further abstraction to deal with the different Disp8MemShift values between the templates, only a limited set (mostly ones only allowing for non-memory operands) can be folded, which is being done here. Also drop IgnoreSize wherever possible from anything that's being touched anyway.	2018-04-26 08:55:02 +02:00
Jan Beulich	6e041cf4b0	x86: drop CpuRegMMX, CpuReg[XYZ]MM, and CpuRegMask It's not clear to me why they had been introduced - the respective comments in opcodes/i386-gen.c are certainly wrong: ymm<N> registers are very well supported (and necessary) with just AVX512F.	2018-04-26 08:48:01 +02:00
Jan Beulich	2f1bada2dc	x86: drop VexImmExt It's only used in assertions, and hence not really needed for correct code generation.	2018-04-26 08:30:06 +02:00
Jan Beulich	bacd145775	x86: drop redundant AVX512VL shift templates These were wrongly left in place by commit `ed438a93f1` ("x86: fold certain AVX512 rotate and shift templates").	2018-04-25 16:26:10 +02:00
Igor Tsimbalist	c48935d75f	Enable Intel CLDEMOTE instruction. gas/ * config/tc-i386.c (cpu_arch): Add .cldemote. * doc/c-i386.texi: Document cldemote/.cldemote. * testsuite/gas/i386/cldemote-intel.d: New. * testsuite/gas/i386/cldemote.d: Likewise. * testsuite/gas/i386/cldemote.s: Likewise. * testsuite/gas/i386/i386.exp: Run new tests. * testsuite/gas/i386/x86-64-cldemote-intel.d: New. * testsuite/gas/i386/x86-64-cldemote.d: Likewise. * testsuite/gas/i386/x86-64-cldemote.s: Likewise. * testsuite/gas/i386/ilp32/x86-64-nops.d: Remove 0x0f1c NOP encoding that maps to cldemote. * testsuite/gas/i386/nops.d: Likewise. * testsuite/gas/i386/nops.s: Likewise. * testsuite/gas/i386/x86-64-nops.d: Likewise. * testsuite/gas/i386/x86-64-nops.s: Likewise. opcode/ * i386-dis.c: Add REG_0F1C_MOD_0, MOD_0F1C_PREFIX_0, PREFIX_0F1C. * i386-gen.c (cpu_flag_init): Add CPU_CLDEMOTE_FLAGS, (cpu_flags): Add CpuCLDEMOTE. * i386-init.h: Regenerate. * i386-opc.h (enum): Add CpuCLDEMOTE, (i386_cpu_flags): Add cpucldemote. * i386-opc.tbl: Add cldemote. * i386-tbl.h: Regenerate.	2018-04-17 11:56:34 +02:00
H.J. Lu	ae1d384372	x86: Allow 32-bit registers for tpause and umwait Since only the first 32 bits of input operand are used for tpause and umwait, the REX.W bit is skipped. Both 32-bit registers and 64-bit registers are allowed. gas/ * testsuite/gas/i386/x86-64-waitpkg.s: Add 32-bit registers tests for tpause and umwait. * testsuite/gas/i386/x86-64-waitpkg-intel.d: Updated. * testsuite/gas/i386/x86-64-waitpkg.d: Likewise. opcodes/ * i386-dis.c (prefix_table): Replace Em with Edq on tpause and umwait. * i386-opc.tbl: Allow 32-bit registers for tpause and umwait in 64-bit mode. * i386-tbl.h: Regenerated.	2018-04-15 08:38:36 -07:00
Igor Tsimbalist	de89d0a34d	Enable Intel WAITPKG instructions. Intel has disclosed a set of new instructions for Tremont processor. The spec is https://software.intel.com/en-us/intel-architecture-instruction-set-extensions-programming-reference This patch enables Intel WAITPKG instructions. gas/ * config/tc-i386.c (cpu_arch): Add WAITPKG. (cpu_noarch): Likewise. * doc/c-i386.texi: Document WAITPKG. * i386/i386.exp: Run WAITPKG tests. * testsuite/gas/i386/waitpkg-intel.d: New test. * testsuite/gas/i386/waitpkg.d: Likewise. * testsuite/gas/i386/waitpkg.s: Likewise. * testsuite/gas/i386/x86-64-waitpkg-intel.d: Likewise. * testsuite/gas/i386/x86-64-waitpkg.d: Likewise. * testsuite/gas/i386/x86-64-waitpkg.s: Likewise. opcodes/ * i386-dis.c (enum): Add PREFIX_MOD_0_0FAE_REG_6, PREFIX_MOD_1_0FAE_REG_6. (va_mode): New. (OP_E_register): Use va_mode. * i386-dis-evex.h (prefix_table): New instructions (see prefixes above). * i386-gen.c (cpu_flag_init): Add WAITPKG. (cpu_flags): Likewise. * i386-opc.h (enum): Likewise. (i386_cpu_flags): Likewise. * i386-opc.tbl: Add umonitor, umwait, tpause. * i386-init.h: Regenerate. * i386-tbl.h: Likewise.	2018-04-11 21:37:12 +02:00
Jan Beulich	c39e5b2671	x86: drop VecESize It again can be inferred from other information. The vpopcntd templates all need to have Dword added to their memory operands; the lack thereof was actually a bug preventing certain Intel syntax code to assemble, so test cases get extended.	2018-03-28 14:25:07 +02:00
Jan Beulich	8e6e0792d1	x86: convert broadcast insn attribute to boolean The (only) valid broadcast type for an insn can be inferred from other information.	2018-03-28 14:24:05 +02:00
Jan Beulich	9f123b911e	x86: fold to-scalar-int conversion insns	2018-03-28 14:22:56 +02:00
Jan Beulich	b8c169f359	x86: drop pointless VecESize The attribute is meaningful only in templates allowing embedded broadcast. Drop them everywhere else.	2018-03-22 08:46:25 +01:00
Jan Beulich	9f79e88693	x86: fix swapped operand handling for BNDMOV The wrong placement of the Load attribute in the templates prevented this from working. The disassembler also didn't handle this consistently with other similar dual-encoding insns.	2018-03-22 08:32:50 +01:00
Jan Beulich	d6793fa1ac	x86/Intel: fix fallout from earlier template folding While many templates allowing multiple suitably matching XMM/YMM/ZMM operand sizes can be folded, a few need to be split in order to not wrongly accept "xmmword ptr" operands when only XMM registers are permitted (and memory operands are more narrow). Add a test case validating this.	2018-03-22 08:31:43 +01:00
Jan Beulich	f776822506	x86: fold a few XOP templates Also add a new test case verifying that mixed operands of SIMD insns with a size-less memory operand in the middle are properly rejected.	2018-03-22 08:29:45 +01:00
H.J. Lu	d3d50934a9	x86-64: Also optimize "clr reg64" "clr reg" is an alias of "xor reg, reg". We can encode "clr reg64" as "xor reg32, reg32". gas/ * config/tc-i386.c (optimize_encoding): Also encode "clr reg64" as "xor reg32, reg32". * testsuite/gas/i386/x86-64-optimize-1.s: Add "clr reg64" tests. * testsuite/gas/i386/x86-64-optimize-1.d: Updated. opcodes/ * i386-opc.tbl: Add Optimize to clr. * i386-tbl.h: Regenerated.	2018-03-08 06:41:34 -08:00
H.J. Lu	bd5dea8822	x86: Remove support for old (<= 2.8.1) versions of gcc Old (<= 2.8.1) versions of gcc generate broken fsubp, fsubrp, fdivp and fdivrp instructions. Assembler translates them to correct ones with a warning: [hjl@gnu-cfl-1 gas]$ cat x.s fsubp %st(3),%st [hjl@gnu-cfl-1 gas]$ gcc -c x.s x.s: Assembler messages: x.s:1: Warning: translating to `fsubp %st,%st(3)' [hjl@gnu-cfl-1 gas]$ This patch removes support for old (<= 2.8.1) versions of gcc: [hjl@gnu-cfl-1 gas]$ ./as-new -o x.o x.s x.s: Assembler messages: x.s:1: Error: operand type mismatch for `fsubp' [hjl@gnu-cfl-1 gas]$ gas/ * NEWS: Mention -mold-gcc removal. * config/tc-i386.c (i386_error): Remove old_gcc_only. (old_gcc): Removed. (match_template): Remove old gcc support. (OPTION_MOLD_GCC): Removed. (OPTION_MRELAX_RELOCATIONS): Updated. (md_longopts): Remove OPTION_MOLD_GCC. (md_parse_option): Likewise. (md_show_usage): Remove -mold-gcc. * testsuite/gas/i386/general.s: Convert fsub/fdiv tests for old (<= 2.8.1) versions of gcc. * testsuite/gas/i386/intel.s: Likewise. * testsuite/gas/i386/general.l: Updated. * testsuite/gas/i386/intel-intel.d: Likewise. * testsuite/gas/i386/intel.d: Likewise. * testsuite/gas/i386/intel.e: Likewise. * testsuite/gas/i386/i386.exp: Don't pass -mold-gcc to general. include/ * opcode/i386 (OLDGCC_COMPAT): Removed. opcodes/ * i386-gen.c (opcode_modifiers): Remove OldGcc. * i386-opc.h (OldGcc): Removed. (i386_opcode_modifier): Remove oldgcc. * i386-opc.tbl: Remove fsubp, fsubrp, fdivp and fdivrp instructions for old (<= 2.8.1) versions of gcc. * i386-tbl.h: Regenerated.	2018-03-08 06:31:52 -08:00
Jan Beulich	e771e7c9fb	x86: fold several AVX512VL templates The differences between some of the register and memory forms of the same insn often don't really require the templates to be separate. For example, Disp8MemShift is simply irrelevant to register forms. Fold these as far as possible, and also fold register-only forms. Further folding is possible, but needs other prereq work done first. A note regarding EVEXDYN: This is intended to be used only when no other properties of the template would make is_evex_encoding() return true. In all "normal" cases I think it is preferable to omit this indicator, to keep the table half way readable.	2018-03-08 08:58:55 +01:00
Jan Beulich	ed438a93f1	x86: fold certain AVX512 rotate and shift templates Their memory forms were bogusly using VexLWP instead of VexNDD. Adjust VexNDD handling to cope with these, allowing their register and memory forms to be folded.	2018-03-08 08:58:05 +01:00
Jan Beulich	454172a99e	x86: fold VEX-encoded GFNI templates	2018-03-08 08:57:19 +01:00
Jan Beulich	3682415023	x86: fold a few AVX512F templates The differences between some of the register and memory forms of the same insn often don't really require the templates to be separate. For example, Disp8MemShift is simply irrelevant to register forms. Fold them as far as possible. Further folding is possible, but needs other prereq work done first.	2018-03-08 08:56:47 +01:00
Jan Beulich	e7f5c0a99e	x86: fold LWP templates Also drop the no longer necessary explicit Disp<N> from them.	2018-03-08 08:56:08 +01:00
Jan Beulich	25a4277fec	x86: fold FMA and FMA4 templates	2018-03-08 08:55:37 +01:00
Jan Beulich	d2224064f1	x86: drop {X,Y,Z}MMWORD_MNEM_SUFFIX They aren't really useful (anymore?): The conflicting operand size check isn't applicable to any insn validly using respective memory operand sizes (and if they're used wrongly, another error would result), and the logic in process_suffix() can be easily changed to work without them. While re-structuring conditionals in process_suffix() also drop the CMPXCHG8B special case in favor of a NoRex64 attribute in the opcode table.	2018-03-08 08:52:27 +01:00
Jan Beulich	1b193f0b12	x86: drop bogus NoAVX These are meaningful only for insns with CpuSSE* (and a few other Cpu*) attribute.	2018-03-08 08:36:41 +01:00

1 2 3 4 5 ...

258 commits