paweld/fpc - fpc - brudnopis.ovh

paweld/fpc

mirror of https://gitlab.com/freepascal.org/fpc/source.git synced 2025-10-17 23:11:47 +02:00

Author	SHA1	Message	Date
Jonas Maebe	8b9e90dc7a	* keep track of whether a routine has a C-style variadic parameter in the procoptions even when it's through an array-of-const parameter * always call create_varargs_paraloc_info() instead of create_paraloc_info() in the former case, even when no varargs parameters are specified (because on some platforms even some non-variadic parameters need to be passed differently, such as on ARM with gnueabihf) git-svn-id: trunk@41420 -	2019-02-23 15:42:45 +00:00
florian	597a23d278	+ tls support for x86_64-linux (not yet enabled by default) git-svn-id: trunk@41081 -	2019-01-27 09:37:25 +00:00
florian	c7bb028d35	* factored out OptPass1FLD, used bx x86-64 now as well - removed unused function git-svn-id: trunk@41061 -	2019-01-24 20:10:29 +00:00
florian	24f6cc0da5	* factored out OptPass1FSTP, used by x86-64 now as well git-svn-id: trunk@41059 -	2019-01-24 19:38:32 +00:00
florian	4f0da5fcc3	+ patch by Marģers to support the x86 assembler instructions blsi, blsr, blsmsk, adcx, adox, movbe, pclmulqdq, resolves #34815 and #34799 + avxopcodes tests also movbe and pclmulqdq git-svn-id: trunk@40951 -	2019-01-20 18:50:12 +00:00
Jonas Maebe	91d5457b38	* moved around/replaced the following procedures to stop nflw from depending on pdecsub (node units should not depend on parser units): o maybe_add_public_default_java_constructor() o handle_calling_convention() o create_finalizer_procdef() (replaced with create_outline_procdef()) o insert_record_hidden_paras() o handle_calling_convention() o proc_add_definition() o build_parentfpstruct() o maybe_guarantee_record_typesym() o get_first_proc_str() * factored out the creation of a procinfo for a nested procdef based on a subnodetree of the current procdef into tprocinfo.create_for_outlining() git-svn-id: trunk@40773 -	2019-01-05 16:26:33 +00:00
Károly Balogh	2f7fe0e737	haiku-x86_64: add target to the compiler and ppudump, enable it in fpmake and fpcmake git-svn-id: trunk@40753 -	2019-01-04 02:16:24 +00:00
florian	7f5c2fa3aa	* (modified) patch by Gareth Moreton: Speed improvement in case blocks, resolves #0034762 This patch improves the compiler where "case" statements are concerned, using jump tables more often and creating more efficient machine code in some situations: * If a case block only contains one branch (not including the else block), the initial range check is removed, since this becomes wasted effort. * If the else block is empty, the else label is set to the end label - though this doesn't decrease the code size, it takes a bit of strain off the peephole optimizer. * On -O2 and above, some node analysis is now done on the branch labels. Most of the time this just redirects it to the end label for empty blocks, but if the block contains a goto statement, it will redirect it to its destination instead, thus increasing performance by not having multiple jumps (this won't get picked up by the peephole optimiser if the label addresses are in a jump table). * Some checks now use what I call the 'true count' rather than the 'label count'. The true count includes each individual value in a range - for example, 0..2 counts as 3. This increases the chance that a jump table will be utilised in situations where it is more efficient than a linear list. * For jump tables, if the case block almost covers the entire range (32 entries or fewer from full coverage), the initial range check is removed and the gaps included in the jump table (pointing to the else label). git-svn-id: trunk@40676 -	2018-12-27 18:31:55 +00:00
Jonas Maebe	acf02ab64b	* when creating wrappers, add a prefix to parameter names to prevent them hiding the method name of the wrapped routine o also add a few more '&' prefixes to the generated wrapper code to prevent issues when keywords are used as identifiers git-svn-id: trunk@40634 -	2018-12-24 22:10:06 +00:00
Jonas Maebe	3b9f5a5e96	* moved finalization of code generator temps to a node, so it can be getcopy'd (needed for LLVM, where we need two copies of the finally code: one in case an exception occurs, and one in case none is raised) o also first finalize parameters and only then code generator temps, since in theory the former could create more of the latter git-svn-id: trunk@40345 -	2018-11-17 22:38:36 +00:00
florian	9b0ff05ee8	- get rid of MaxOps, it is redundant with max_operands * MatchOpType with three operands is only available of max_operands>2 git-svn-id: trunk@40190 -	2018-11-02 21:32:29 +00:00
Jonas Maebe	513481ad71	* fixed overzealous removal of an assigned(t1) check in r40142 (it was for a ttryexceptnode instead of for a ttryfinally node) git-svn-id: trunk@40189 -	2018-11-02 21:12:18 +00:00
yury	a026252508	* Commented out unused "size" local var. git-svn-id: trunk@40184 -	2018-11-02 18:47:34 +00:00
yury	4357caaad8	* Removed unused local vars. git-svn-id: trunk@40183 -	2018-11-02 18:44:29 +00:00
florian	59d5d6ec95	+ factored out TX86AsmOptimizer.PrePeepholeOptIMUL, used now by x86-64 and i386 * generalized and simplified the code git-svn-id: trunk@40162 -	2018-11-01 20:49:20 +00:00
Jonas Maebe	0cd85d73e4	* ttryfinallynode now inherits from tbinarynode instead of tloopnode, as its t1 field was always set to tnothingnode git-svn-id: trunk@40142 -	2018-11-01 13:19:33 +00:00
yury	e367577bf1	+ added support for x86_64-android target. git-svn-id: trunk@39956 -	2018-10-17 16:56:27 +00:00
Jonas Maebe	0b246f3dbd	* converted Boolean8 to an internal type, and mapped Boolean to the new internal pasbool1(type) (part of mantis #34411) o apply the _Bool x86-64 parameter passing rules only to pasbool1 git-svn-id: trunk@39949 -	2018-10-16 21:14:18 +00:00
svenbarth	2f74a51eb8	* fix .seh_savereg: the offset is checked with a bitmask, not a divisor, so use "and", not "mod" git-svn-id: trunk@39877 -	2018-10-07 12:25:09 +00:00
florian	8943c0584e	+ patch by J. Gareth Moreton to support BMI2 instructions + extended avx test generator with the newly added BMI2 instructions git-svn-id: trunk@39875 -	2018-10-07 10:10:19 +00:00
florian	64ad48e89e	* forgotten part of r39750 git-svn-id: trunk@39752 -	2018-09-13 20:20:40 +00:00
florian	78943ea843	+ patch by J. Gareth Moreton: x86 optimisations for Jcc and SETcc, resolves #33899 * optimization also added for i386 git-svn-id: trunk@39307 -	2018-06-25 20:40:05 +00:00
Jonas Maebe	4686f61002	* keep track of the temp position separately from the offset in references, so that they can still be freed after the reference has been changed (e.g. in case of array indexing or record field accesses) (mantis #33628) git-svn-id: trunk@38814 -	2018-04-22 17:03:16 +00:00
nickysn	518cdf9674	* replaced the saved_XXX_registers arrays with virtual methods inside tcpuparamanager, very similar to the existing get_volatile_registers_XXX. The new methods are called get_saved_registers_XXX, where XXX is the register type ("int", "address", "fpu" or "mm") git-svn-id: trunk@38794 -	2018-04-19 21:22:16 +00:00
florian	25c5ee2fc3	* patch by J. Gareth Moreton to fix vectorcall (no effect) for linux, resolves #33184 and #33542 git-svn-id: trunk@38663 -	2018-04-02 21:14:26 +00:00
florian	8b0bbdcaab	* fix flag subregs after r38206 git-svn-id: trunk@38502 -	2018-03-11 20:30:11 +00:00
florian	9b18e39c81	* enable Lea2AddBase and Lea2AddIndex in TX86AsmOptimizer.PostPeepholeOptLea as we have flag tracking now * some flag allocations fixed git-svn-id: trunk@38501 -	2018-03-11 20:30:09 +00:00
florian	5fbecc5501	+ use TX86AsmOptimizer.OptPass1SHLSAL on x86-64 git-svn-id: trunk@38499 -	2018-03-11 14:35:22 +00:00
marco	f0042a4719	* vcmppd hardcoded primitives like vcmpeqpd. * required increasing maxinfolen to 9 git-svn-id: trunk@38404 -	2018-03-03 23:32:54 +00:00
marco	f21a141144	* mantis #32001 , add 32 vcmpps variants. git-svn-id: trunk@38403 -	2018-03-03 23:10:03 +00:00
florian	8c5606b41d	+ support mmx shifting git-svn-id: trunk@38367 -	2018-02-27 21:40:12 +00:00
florian	3b779278e2	+ (slightly) patch by Emelyanov Roman to add support of SEH directive in FPC internal assembler with INTEL syntax, resolves #29894 git-svn-id: trunk@38331 -	2018-02-24 16:14:08 +00:00
florian	91514da267	* factored out TX86AsmOptimizer.PostPeepholeOptCall + use TX86AsmOptimizer.PostPeepholeOptCall on x86-64 git-svn-id: trunk@38278 -	2018-02-17 23:25:01 +00:00
florian	112373a124	* fix passing floats on unix after vectorcall patch git-svn-id: trunk@38209 -	2018-02-11 17:50:43 +00:00
florian	31f78ea2b6	+ implementation of the vectorcall calling convention by J. Gareth Moreton + tests git-svn-id: trunk@38206 -	2018-02-11 17:50:37 +00:00
florian	810acd82b2	* patch by J. Gareth Moreton that makes some improvements to the Peephole Optimizer for x86 and x86-64 code, as well as some cleanup with formatting, code syntax consistency, and debug messages. - xorq %reg,%reg (identical registers) is now changed to xorl %reg,%reg if doing so removes the REX prefix. - movw %bx,%ax; andl $0xffff,%eax, for example, is now changed to movzwl %bx,%eax as long as a conditional operation doesn't follow 'and' (checks to see if the CPU flags are in use). - movzbq and movzwq get optimised to movzbl and movzwl respectively if doing so removes the REX prefix. - Removal of optimisation code that zero-extends from 32-bit to 64-bit, because there isn't actually a valid combination of opcodes for MOVZX that allows that (for registers, just use MOV). This is not the case with MOVSX. - movq is now optimised to movl even if the CPU flags are in use (this stops mov %reg,0 from being optimised to xor %reg,%reg if doing so breaks an algorithm that relies on them). - Fixed typo in peephole message regarding movq to movl (it said movd instead). - Made the peephole debug messages more consistent in formatting, some of which now have more detail. * small fixes of the patch git-svn-id: trunk@38070 -	2018-01-28 14:41:54 +00:00
florian	73fda1ccb6	* factored out OptPass1Sub + make use of OptPass1Sub on x86_64 and i8086 as well git-svn-id: trunk@37572 -	2017-11-10 20:55:22 +00:00
nickysn	ae92973196	+ added support for the retw, retnw, retfw, retd, retnd, retfd, retq, retnq and retfq x86 instructions. These are variants of the ret instruction with the return offset size set explicitly, e.g. retfw is a 16-bit far ret (i.e. pops a 16-bit offset and a 16-bit segment), retfd is a 32-bit far ret (pops a 32-bit offset, followed by a 16-bit segment), etc. git-svn-id: trunk@37571 -	2017-11-10 16:53:29 +00:00
pierre	ba3afefa4c	Regenerate register include files after commit 37564: Fix value of NR_DR6 and NR_DR7 git-svn-id: trunk@37565 -	2017-11-07 07:30:42 +00:00
florian	4da4b768ec	* factored out PostPeepholeOptTest + use PostPeepholeOptTest on x86-64 git-svn-id: trunk@37551 -	2017-11-04 19:10:14 +00:00
florian	3097eaf8ee	* made PostPeepholeOptMov a function git-svn-id: trunk@37550 -	2017-11-04 19:10:12 +00:00
florian	a7ea7fb569	* factored out PostPeepholeOptCmp + use PostPeepholeOptCmp for x86_64 git-svn-id: trunk@37549 -	2017-11-04 19:10:09 +00:00
nickysn	80226e3af4	+ added an optimization pass, that optimizes x86 references git-svn-id: trunk@37494 -	2017-10-20 15:55:55 +00:00
nickysn	e8bbc4eef9	+ support the xlat x86 instruction syntax with a memory operand. This allows specifying the address size (e.g. xlat byte ptr [bx] or xlat byte ptr [ebx]) git-svn-id: trunk@37478 -	2017-10-17 16:40:06 +00:00
nickysn	0fb79946a5	+ added support for the parameterized versions of the x86 string instructions (movs, cmps, scas, lods, stos, ins, outs) in the inline asm of the i8086, i386 and x86_64 targets. Both intel and at&t syntax is supported. * NEC V20/V30 instruction 'ins' (available only on the i8086 target, because it is incompatible with 386+ instructions) renamed 'nec_ins', to avoid conflict with the 186+ 'ins' instruction. git-svn-id: trunk@37446 -	2017-10-12 00:07:02 +00:00
nickysn	92a52a9f4d	+ implemented support for instructions with non-native address size on i8086 (16-bit and 32-bit), i386 (16-bit and 32-bit) and x86_64 (32-bit and 64-bit). Known bug: 32-bit addresses with an offset have their offset truncated to its low 16-bits on i8086 git-svn-id: trunk@37409 -	2017-10-06 15:27:14 +00:00
florian	198c53a908	o patch by J. Gareth "Kit" Moreton, resolves partially issue #32037 * generate instructions with shorter imm on x86-64 if possible git-svn-id: trunk@37376 -	2017-10-01 18:40:09 +00:00
florian	15b617546e	+ call TX86AsmOptimizer.OptPass1VOP for logical operations as well git-svn-id: trunk@37367 -	2017-10-01 14:40:21 +00:00
nickysn	aec03309ef	+ added CPUX86_HAS_SSE2 to x86 tcpuflags git-svn-id: trunk@37326 -	2017-09-26 16:02:56 +00:00
nickysn	e701fa8de1	* converted the x86 instruction flags to a set, so they can be extended more easily and so that all the values are now available to the compiler (previously, there were several, which were mapped to the same value and thus were only used to make x86ins.dat easier to read) git-svn-id: trunk@37299 -	2017-09-21 15:48:27 +00:00

1 2 3 4 5 ...

583 Commits