motorhead/qemu

mirror of https://github.com/Motorhead1991/qemu.git synced 2025-07-27 04:13:53 -06:00

Author	SHA1	Message	Date
Richard Henderson	c7739ab83e	tcg/optimize: Use fold_and and fold_masks_z in fold_deposit Avoid the use of the OptContext slots. Find TempOptInfo once. When we fold to and, use fold_and. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	81be07f905	tcg/optimize: Use fold_masks_z in fold_ctpop Add fold_masks_z as a trivial wrapper around fold_masks_zs. Avoid the use of the OptContext slots. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	ce1d663ff8	tcg/optimize: Use fold_masks_zs in fold_count_zeros Avoid the use of the OptContext slots. Find TempOptInfo once. Compute s_mask from the union of the maximum count and the op2 fallback for op1 being zero. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	c1e7b989c8	tcg/optimize: Use fold_masks_zs in fold_bswap Avoid the use of the OptContext slots. Find TempOptInfo once. Always set s_mask along the BSWAP_OS path, since the result is being explicitly sign-extended. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	21e2b5f9fa	tcg/optimize: Use fold_masks_zs in fold_andc Avoid the use of the OptContext slots. Find TempOptInfo once. Avoid double inversion of the value of second const operand. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	1ca7372c03	tcg/optimize: Use fold_masks_zs in fold_and Avoid the use of the OptContext slots. Find TempOptInfo once. Sink mask computation below fold_affected_mask early exit. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	e1b6c141e9	tcg/optimize: Introduce const value accessors for TempOptInfo Introduce ti_is_const, ti_const_val, ti_is_const_val. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 08:32:14 -08:00
Richard Henderson	f3ed3cffb9	tcg/optimize: Use finish_folding in fold_add, fold_add_vec, fold_addsub2 Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:55 -08:00
Richard Henderson	6d70ddc635	tcg/optimize: Change representation of s_mask Change the representation from sign bit repetitions to all bits equal to the sign bit, including the sign bit itself. The previous format has a problem in that it is difficult to recreate a valid sign mask after a shift operation: the "repetitions" part of the previous format meant that applying the same shift as for the value lead to an off-by-one value. The new format, including the sign bit itself, means that the sign mask can be manipulated in exactly the same way as the value, canonicalization is easier. Canonicalize the s_mask in fold_masks_zs, rather than requiring callers to do so. Treat 0 as a non-canonical but typeless input for no sign information, which will be reset as appropriate for the data type. We can easily fold in the data from z_mask while canonicalizing. Temporarily disable optimizations using s_mask while each operation is converted to use fold_masks_zs and to the new form. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:51 -08:00
Richard Henderson	75c3bf324d	tcg/optimize: Augment s_mask from z_mask in fold_masks_zs Consider the passed s_mask to be a minimum deduced from either existing s_mask or from a sign-extension operation. We may be able to deduce more from the set of known zeros. Remove identical logic from several opcode folders. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:51 -08:00
Richard Henderson	d582b14d80	tcg/optimize: Split out fold_masks_zs Add a routine to which masks can be passed directly, rather than storing them into OptContext. To be used in upcoming patches. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:50 -08:00
Richard Henderson	56e06ecfa5	tcg/optimize: Copy mask writeback to fold_masks Use of fold_masks should be restricted to those opcodes that can reliably make use of it -- those with a single output, and from higher-level folders that set up the masks. Prepare for conversion of each folder in turn. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:50 -08:00
Richard Henderson	045ace35a8	tcg/optimize: Split out fold_affected_mask There are only a few logical operations which can compute an "affected" mask. Split out handling of this optimization to a separate function, only to be called when applicable. Remove the a_mask field from OptContext, as the mask is no longer stored anywhere. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:50 -08:00
Richard Henderson	1526855c01	tcg/optimize: Split out finish_bb, finish_ebb Call them directly from the opcode switch statement in tcg_optimize, rather than in finish_folding based on opcode flags. Adjust folding of conditional branches to match. Reviewed-by: Pierrick Bouvier <pierrick.bouvier@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-12-24 07:32:50 -08:00
Richard Henderson	e58b977238	tcg/optimize: Optimize bitsel_vec Fold matching true/false operands. Fold true/false operands with 0/-1 to simpler logicals. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-09-22 06:54:49 +02:00
Richard Henderson	1f106544fd	tcg/optimize: Optimize cmp_vec and cmpsel_vec Place immediate values second in the comparison. Place destination matches first in the true/false values. All of this mirrors what we do for integer setcond and movcond. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-09-22 06:54:49 +02:00
Richard Henderson	141125e08c	tcg/optimize: Fold movcond with true and false values identical Fold "x = cond ? y : y" to "x = y". Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-09-22 06:54:49 +02:00
Richard Henderson	a71d9dfbf6	tcg/optimize: Fix TCG_COND_TST* simplification of setcond2 Argument ordering for setcond2 is: output, a_low, a_high, b_low, b_high, cond The test is supposed to be against b_low, not a_high. Cc: qemu-stable@nongnu.org Fixes: `ceb9ee06b7` ("tcg/optimize: Handle TCG_COND_TST{EQ,NE}") Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2413 Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Tested-by: Alex Bennée <alex.bennee@linaro.org> Message-Id: <20240701024623.1265028-1-richard.henderson@linaro.org>	2024-07-03 10:24:12 -07:00
Richard Henderson	8d65cda728	tcg/optimize: Optimize setcond with zmask If we can show that high bits of an input are zero, then we may optimize away some comparisons. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-05-06 12:55:51 -07:00
Richard Henderson	e25fe886b8	tcg/optimize: Do not attempt to constant fold neg_vec Split out the tail of fold_neg to fold_neg_no_const so that we can avoid attempting to constant fold vector negate. Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2150 Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-04-09 07:43:11 -10:00
Richard Henderson	2911e9b95f	tcg/optimize: Fix sign_mask for logical right-shift The 'sign' computation is attempting to locate the sign bit that has been repeated, so that we can test if that bit is known zero. That computation can be zero if there are no known sign repetitions. Cc: qemu-stable@nongnu.org Fixes: `93a967fbb5` ("tcg/optimize: Propagate sign info for shifting") Resolves: https://gitlab.com/qemu-project/qemu/-/issues/2248 Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org>	2024-03-29 12:15:55 -10:00
Paolo Bonzini	ff202817dc	tcg/optimize: fix uninitialized variable The variables uext_opc and sext_opc are used without initialization if TCG_TARGET_extract_i{32,64}_valid returns false. The result, depending on the compiler, might be the generation of extract and sextract opcodes with invalid offset and count, or just random data in the TCG opcode stream. Fixes: `ceb9ee06b7` ("tcg/optimize: Handle TCG_COND_TST{EQ,NE}", 2024-02-03) Cc: Richard Henderson <pbonzini@redhat.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Reviewed-by: Richard Henderson <richard.henderson@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Message-Id: <20240228110641.287205-1-pbonzini@redhat.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-29 11:36:05 -10:00
Richard Henderson	fb04ab7ddd	tcg/optimize: Lower TCG_COND_TST{EQ,NE} if unsupported After having performed other simplifications, lower any remaining test comparisons with AND. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-03 23:43:50 +00:00
Richard Henderson	ceb9ee06b7	tcg/optimize: Handle TCG_COND_TST{EQ,NE} Fold constant comparisons. Canonicalize "tst x,x" to equality vs zero. Canonicalize "tst x,sign" to sign test vs zero. Fold double-word comparisons with zero parts. Fold setcond of "tst x,pow2" to a bit extract. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-03 23:43:50 +00:00
Richard Henderson	7e64b114d1	tcg/optimize: Do swap_commutative2 in do_constant_folding_cond2 Mirror the new do_constant_folding_cond1 by doing all argument and condition adjustment within one helper. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-03 23:43:50 +00:00
Richard Henderson	246c4b72fa	tcg/optimize: Split out do_constant_folding_cond1 Handle modifications to the arguments and condition in a single place. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-03 23:43:50 +00:00
Richard Henderson	27cdb85d06	tcg/optimize: Split out arg_is_const_val Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2024-02-03 23:43:50 +00:00
Richard Henderson	f245757701	tcg/optimize: Canonicalize sub2 with constants to add2 Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Message-Id: <20231026013945.1152174-4-richard.henderson@linaro.org>	2023-11-06 10:43:04 -08:00
Richard Henderson	6334a968ee	tcg/optimize: Canonicalize subi to addi during optimization Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Message-Id: <20231026013945.1152174-3-richard.henderson@linaro.org>	2023-11-06 10:43:04 -08:00
Richard Henderson	26aac97c84	tcg/optimize: Split out arg_new_constant Fixes a bug wherein raw uses of tcg_constant_internal do not have their TempOptInfo initialized. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-11-06 10:36:56 -08:00
Richard Henderson	3eaadaeb4e	tcg: Eliminate duplicate env store operations Notice when a constant is stored to the same location twice. Reviewed-by: Song Gao <gaosong@loongson.cn> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-11-06 08:27:21 -08:00
Richard Henderson	ab84dc398b	tcg/optimize: Optimize env memory operations Propagate stores to loads, loads to loads. Reviewed-by: Song Gao <gaosong@loongson.cn> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-11-06 08:27:21 -08:00
Richard Henderson	9f75e52828	tcg/optimize: Split out cmp_better_copy Compare two temps for "better", split out from finding the best from a whole list. Use TCGKind, which already gives the proper priority. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-11-06 08:27:21 -08:00
Richard Henderson	986cac1d2a	tcg/optimize: Pipe OptContext into reset_ts Will be needed in the next patch. Reviewed-by: Song Gao <gaosong@loongson.cn> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-11-06 08:27:21 -08:00
Richard Henderson	b701f195d3	tcg: Remove TCG_TARGET_HAS_neg_{i32,i64} The movcond opcode is now mandatory for backends to implement. Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Message-Id: <20231026041404.1229328-7-richard.henderson@linaro.org>	2023-11-06 08:27:21 -08:00
Richard Henderson	d97f8f3941	tcg: Optimize past conditional branches We already register allocate through extended basic blocks, optimize through extended basic blocks as well. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-10-22 16:32:28 -07:00
Richard Henderson	3635502dd0	tcg: Introduce negsetcond opcodes Introduce a new opcode for negative setcond. Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-08-24 11:22:42 -07:00
Richard Henderson	8f7a840d7d	tcg: Fold deposit with zero to and Inserting a zero into a value, or inserting a value into zero at offset 0 may be implemented with AND. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-08-24 11:22:42 -07:00
Richard Henderson	ad3d0e4d5d	tcg: Split tcg/tcg-op-common.h from tcg/tcg-op.h Create tcg/tcg-op-common.h, moving everything that does not concern TARGET_LONG_BITS or TCGv. Adjust tcg/*.c to use the new header instead of tcg-op.h, in preparation for compiling tcg/ only once. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-06-05 12:04:28 -07:00
Richard Henderson	fecccfcc54	tcg: Split INDEX_op_qemu_{ld,st}* for guest address size For 32-bit hosts, we cannot simply rely on TCGContext.addr_bits, as we need one or two host registers to represent the guest address. Create the new opcodes and update all users. Since we have not yet eliminated TARGET_LONG_BITS, only one of the two opcodes will ever be used, so we can get away with treating them the same in the backends. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-05-16 20:07:20 -07:00
Richard Henderson	12fde9bcdb	tcg: Add INDEX_op_qemu_{ld,st}_i128 Add opcodes for backend support for 128-bit memory operations. Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-05-16 16:30:25 -07:00
Richard Henderson	732e89f4c4	tcg: Replace tcg_abort with g_assert_not_reached Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-04-23 08:17:46 +01:00
Richard Henderson	f57c69156c	tcg: Rename TEMP_LOCAL to TEMP_TB Use TEMP_TB as that is more explicit about the default lifetime of the data. While "global" and "local" used to be contrasting, we have more lifetimes than that now. Do not yet rename tcg_temp_local_new_*, just the enum. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-03-01 07:33:27 -10:00
Richard Henderson	39004a71d8	tcg: Reorg function calls Pre-compute the function call layout for each helper at startup. Drop TCG_CALL_DUMMY_ARG, as we no longer need to leave gaps in the op->args[] array. This allows several places to stop checking for NULL TCGTemp, to which TCG_CALL_DUMMY_ARG mapped. For tcg_gen_callN, loop over the arguments once. Allocate the TCGOp for the call early but delay emitting it, collecting arguments first. This allows the argument processing loop to emit code for extensions and have them sequenced before the call. For tcg_reg_alloc_call, loop over the arguments in reverse order, which allows stack slots to be filled first naturally. Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2023-01-05 11:41:29 -08:00
Philippe Mathieu-Daudé	d44789434b	tcg: Pass number of arguments to tcg_emit_op() / tcg_op_insert_*() In order to have variable size allocated TCGOp, pass the number of arguments we use (and would allocate) up to tcg_op_alloc(). This alters tcg_emit_op(), tcg_op_insert_before() and tcg_op_insert_after() prototypes. In tcg_op_alloc() ensure the number of arguments is in range. Reviewed-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> [PMD: Extracted from bigger patch] Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org> Message-Id: <20221218211832.73312-2-philmd@linaro.org>	2023-01-05 11:41:29 -08:00
Richard Henderson	ed5234735a	tcg: Add opcodes for vector nand, nor, eqv We've had placeholders for these opcodes for a while, and should have support on ppc, s390x and avx512 hosts. Tested-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2022-03-04 08:50:41 -10:00
Alex Bennée	9becc36f02	tcg/optimize: only read val after const check valgrind pointed out that arg_info()->val can be undefined which will be the case if the arguments are not constant. The ordering of the checks will have ensured we never relied on an undefined value but for the sake of completeness re-order the code to be clear. Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Signed-off-by: Alex Bennée <alex.bennee@linaro.org> Message-Id: <20220209112142.3367525-1-alex.bennee@linaro.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2022-03-04 08:50:41 -10:00
Richard Henderson	c578ff1858	tcg/optimize: Fix folding of vector ops Bitwise operations are easy to fold, because the operation is identical regardless of element size. But add and sub need extra element size info that is not currently propagated. Fixes: `2f9f08ba43` Cc: qemu-stable@nongnu.org Resolves: https://gitlab.com/qemu-project/qemu/-/issues/799 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2022-01-04 15:14:42 -08:00
Richard Henderson	225bec0c0e	tcg/optimize: Add an extra cast to fold_extract2 There is no bug, but silence a warning about computation in int32_t being assigned to a uint64_t. Reported-by: Coverity CID 1465220 Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2021-11-11 11:47:01 +01:00
Richard Henderson	93a967fbb5	tcg/optimize: Propagate sign info for shifting For constant shifts, we can simply shift the s_mask. For variable shifts, we know that sar does not reduce the s_mask, which helps for sequences like ext32s_i64 t, in sar_i64 t, t, v ext32s_i64 out, t allowing the final extend to be eliminated. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Luis Pires <luis.pires@eldorado.org.br> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>	2021-10-28 20:55:07 -07:00

1 2 3 4 5 ...

340 commits