that the type of the parameters can be determined automatically
o added compilerproc declarations for all helpers called in the compiler
via their assembler name, so we can look up the corresponding procdef
git-svn-id: trunk@23325 -
Add simple Mul+Sub/Mul+Add into MLS/MLA optimizations
Fix some other small issues in the optimizer
Implement Interlocked* functions with proper use of LDREX/STREX
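As an illustration (registers and label are arbitrary, not actual
compiler output): the Mul+Add fusion turns
mul r3, r1, r2
add r0, r3, r4
into
mla r0, r1, r2, r4
and an interlocked increment maps onto the usual LDREX/STREX retry
loop:
.Lretry:
ldrex r1, [r0]
add r1, r1, #1
strex r2, r1, [r0]
cmp r2, #0
bne .Lretry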
git-svn-id: branches/laksen/arm-embedded@22801 -
Especially with 64-bit operators the CG sometimes generates:
and r0, r1, #0
which just clears r0 and is equivalent to
mov r0, #0
git-svn-id: trunk@22032 -
In certain cases the CG would emit something like
bic r1, r0, #0
As BIC clears the specified bits, this is equivalent to
mov r1, r0
This patch changes the CG to emit the mov instead, which the register
allocator will hopefully remove most of the time.
git-svn-id: trunk@22024 -
The old code generated a strange IT-sequence:
IT EQ
MOVEQ r0, #1
IT NE
MOVNE r0, #0
Now we generate:
ITE EQ
MOVEQ r0, #1
MOVNE r0, #0
IT stands for If-Then, ITE for If-Then-Else. The instruction has a
couple of other forms where it is extended to handle more of the
following instructions, e.g. ITTE and ITEE; up to 4 instructions can
be handled this way.
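For example, a block of three conditional instructions (hypothetical
code, not actual compiler output) would be encoded as:
ITTE EQ
MOVEQ r0, #1
ADDEQ r1, r1, #1
MOVNE r0, #0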
git-svn-id: trunk@21996 -
If the needed adjustment is not expressible in a shifterconst, the old code
loaded a temporary register (fixed to r12) via a_load_const_reg and used it
to adjust the SP, resulting in:
mov r12, #44
orr r12, r12, #4096
sub sp, sp, r12
The new code will try to split the adjustment into 2 shifterconstants and
will do two separate adjustments:
sub sp, sp, #44
sub sp, sp, #4096
If that doesn't work we'll fall back to the old code. But that should
happen VERY rarely, only for stacks bigger than 256k whose size is not
expressible in 2 shifter constants.
git-svn-id: trunk@21863 -
This removes the duplications in a_op_reg_reg_reg_checkoverflow.
OP_ROL stays separate because it needs some special treatment again.
The code for OP_ROL was changed, previously it generated:
mov tempreg, #32
sub src1, tempreg, src1
mov dst, src2, ror src1
This would trash src1, which MIGHT be a problem, but I'm not totally
sure. The mov/sub pair was replaced with a single rsb, so the new code
looks like this:
rsb tempreg, src1, #32
mov dst, src2, ror tempreg
If src1 is freed afterwards, the register allocator should be able to
change that into:
rsb src1, src1, #32
mov dst, src2, ror src1
git-svn-id: trunk@21804 -
The previous code was full of duplicated code; this new version just
maps each OP_* to the correct SM_* and does some special handling for
OP_ROL, which is implemented via OP_ROR.
git-svn-id: trunk@21801 -
getintparaloc + adapted all call sites of getintparaloc. This
led to a number of additional, related changes:
o corrected the type information for some getintparaloc parameters
o don't allocate some intparalocs in cases where they aren't used
o changed "const tvardata" parameter into "constref tvardata" for
fpc_variant_copy_overwrite to make pass-by-reference semantics
explicit
o moved a number of routines that now have to call find_system_type()
from cgobj to hlcgobj so that cgobj doesn't have to start depending
on the symtable unit
o added versions of the cpureg alloc/dealloc methods to hlcgobj that
call through to their cgobj counterparts, so we can save/restore the
cpu registers before/after calling system helpers from hlcgobj
(not implemented in hlcgobj itself, because all basic register
allocator functionality is still part of cgobj/cgcpu)
git-svn-id: trunk@21696 -
This now generates:
mvn r0, r0, lsl #24/#16
mov r0, r0, lsr/asr #24/#16
The lsr/asr might be folded into a following instruction, making the
whole operation 1 cycle instead of 2-3 with the previous solution.
git-svn-id: trunk@21658 -
OP_ADD, OP_SUB and OP_ORR will be split into two instructions if
possible when a load/const construction would otherwise be required.
OP_AND is a bit different because we can't just split it up, but we
try to find a two-instruction BIC equivalent to it.
Until now code like
a := a and $FFFF;
produced code like
mov r0, #$FF00
orr r0, r0, #$FF
and r1, r1, r0
With this addition we produce code like:
bic r1, r1, #$FF000000
bic r1, r1, #$FF0000
This saves us at least a cycle and in some cases also a load from the
constant pool.
This uses the new split_into_shifter_const function.
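For the OP_ADD/OP_SUB/OP_ORR case the split is direct. For example
(values chosen for illustration), adding $1004, which is not a valid
shifter constant, can be emitted as:
add r0, r0, #$1000
add r0, r0, #$04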
git-svn-id: trunk@21647 -
* use split_into_shifter_const to reduce the MOV/ORR combination to a
single check and allow a broader range of combinations.
* Introduce the MVN/BIC combination to load values which have more 1
than 0 bits set (like small negative values)
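As an illustration (value picked arbitrarily): $FFFF0F0F cannot be
built with a MOV/ORR pair, but its complement $0000F0F0 splits into
the shifter constants $F000 and $F0, so it can be loaded as:
mvn r0, #$F000
bic r0, r0, #$F0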
git-svn-id: trunk@21646 -
BIC clears the specified bits, while AND keeps them. The usage of BIC
allows a broader range of shifterconsts to be used on the ARM cpu, often
saving a cycle.
Previously code like:
Data:=Data and $FFFFFF00
would result in
mvn r1, #255
and r0, r0, r1
This patch changes this to
bic r0, r0, #255
git-svn-id: trunk@21510 -
BX is supported from ARMv4T onwards, but I don't have an ARMv4T device
to test it.
Using BX instead of mov pc,lr allows for better pipeline utilization
by enabling the CPU's branch predictor to work properly.
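So a return that was emitted as
mov pc, lr
is now emitted as
bx lr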
git-svn-id: trunk@21505 -
Reorder unaligned Load sequence on ARM
The old version produced code like this:
ldrb rDEST, [rBASE]
ldrb rTEMP, [rBASE, #1]
orr rDEST, rDEST, rTEMP, lsl #8 (2 stall cycles)
ldrb rTEMP, [rBASE, #2]
orr rDEST, rDEST, rTEMP, lsl #16 (2 stall cycles)
ldrb rTEMP, [rBASE, #3]
orr rDEST, rDEST, rTEMP, lsl #24 (2 stall cycles)
This creates a lot of stall cycles on ARM implementations with load
delay slots like the Marvell Kirkwood or Intel XScale. With the usual
up to 2 stall cycles per load this code requires a total of 13 cycles
(7 instructions + 6 stall cycles) in the best case.
The new code uses a second temp register to avoid the stall cycles.
ldrb rDEST, [rBASE]
ldrb rTEMP1, [rBASE, #1]
ldrb rTEMP2, [rBASE, #2]
orr rDEST, rDEST, rTEMP1, lsl #8
ldrb rTEMP1, [rBASE, #3]
orr rDEST, rDEST, rTEMP2, lsl #16
orr rDEST, rDEST, rTEMP1, lsl #24 (1 stall cycle)
The rescheduling and the second temp register bring the total down to
8 cycles. If a later rescheduling should move the last orr as well, it
can even go down to 7.
git-svn-id: trunk@21363 -
Optimize ARM OP_MUL/OP_IMUL for x*const cases where const+1 is a power
of 2.
Calculations like a*7 can be optimized to a*8-a by using RSB with a
left shift, which can be done in a single cycle.
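E.g. (registers chosen for illustration) a*7 becomes a single
instruction:
rsb r0, r0, r0, lsl #3
which computes (a shl 3) - a, i.e. a*8-a.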
git-svn-id: trunk@21351 -
always points to the previous r7 on the stack (with the saved return
address coming right after it) so that the debugger and crashreporter
can use it for backtraces as specified in the ABI
o changed NR_FRAME_POINTER_REG and RS_FRAME_POINTER_REG from symbolic
into typed constants, and added a new method to tprocinfo that can
be used to initialize them (so they can be initialized to r7/r11
depending on the target platform)
* allow using r9 on Darwin, it was only used by the system on iOS up to
2.x, which we no longer support
* prefer using r9 and r12 before r4..r11 on Darwin, because they are
volatile and hence do not have to be saved
git-svn-id: trunk@20661 -
o new eabihf (hard float) abi
o vfpv3_d16 variant of VFP (default variant used by EABI assemblers: VFPv3
with only 16 double registers instead of 32) and pass it to GNU as
o make the odd-numbered single-precision floating point VFP registers
available for explicit allocation for use by the calling convention
* fixed copy/paste error in stdname of S30 register
-> use -dFPC_ARMHF to create an ARM eabi hard float compiler
(mantis #21554)
git-svn-id: trunk@20660 -