AlexIndustrial/mesa

Author	SHA1	Message	Date
Emma Anholt	74aea0b840	glsl: Remove optimize_swizzles. It will get turned into SSA and copy-propagated in NIR, no need to walk the IR collapsing it here. iris shader-db results appear to be noise: total instructions in shared programs: 8932195 -> 8932147 (<.01%) instructions in affected programs: 537 -> 489 (-8.94%) LOST: 12 GAINED: 11 lost/gained are simd32 switches in unigine, l4d2, portal2, asphalt9. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17613>	2022-07-20 20:45:09 +00:00
Emma Anholt	f075376823	glsl: Use the same NIR path for shared mem lowering as SPIRV does. Now that we have no non-NIR drivers, we can retire the old code. We just need to pass the variable accesses through to it. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17610>	2022-07-20 19:32:48 +00:00
Alejandro Piñeiro	8d3ce4eb06	nir: call nir_metadata_preserve at nir_remove_unused_io_vars Without it we got a metadata assert: deqp-vk: ../src/compiler/nir/nir_metadata.c:108: nir_metadata_check_validation_flag: Assertion `!(function->impl->valid_metadata & nir_metadata_not_properly_reset)' failed if we try to use NIR_PASS(_, instead of NIR_PASS_V (that among other things, do more validations). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Marcin Ślusarz	5e14445430	nir: convert unused mesh outputs to shared memory Otherwise reads from output in one subgroup may not see writes from other subgroups. Temp variables are later converted to scratch, so even within one subgroup we may not see correct values. Test case in https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/115 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17517>	2022-07-20 09:22:06 +00:00
Timothy Arceri	d1e36634bd	nir/loop_unroll: clean up after complex_unroll_single_terminator() Previously we would just unroll the loop one extra iteration and let other optimisation passes clean up the mess. This worked to a degree but if the loop happened to be nested inside another loop we would end up with phi chains that would block other passes from being able to do the cleanup. With this commit we explicitly clone the variables create by lcsaa and insert them directly in the last continue branch after we are done unrolling. With this optimisation passes can recognise both sides of the if output the same values and can progress further. Help with the issues described in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6051 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17611>	2022-07-20 03:47:45 +00:00
Daniel Schürmann	6f25d45877	spirv: handle phis decorated with RelaxedPrecision If the driver can do 16-bit ALU ops, then store RelaxedPrecision phi values into 16-bit NIR variables with downconverts/upconverts on the way in/out. This has no impact on shader-db on freedreno (not that we have a ton of GLES content there), but it does cause an ANGLE-translated CTS shader on vulkan to get consistent conversions between two copies of a value, and avoid a test bug. Reviewed-by: Emma Anholt <emma@anholt.net> Closes: #6585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14018>	2022-07-19 21:58:01 +00:00
Konstantin Seurer	fab0050223	nir: Add a common gen_rect_vertices implementation Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Iago Toral Quiroga	b18cecbfb6	nir: add nir_address_format_2x32bit_global This adds support for global 64-bit GPU addresses as a pair of 32-bit values. This is useful for platforms with 32-bit GPUs that want to support VK_KHR_buffer_device_address, which makes GPU addresses explicitly 64-bit. With the new format we also add new global intrinsics with 2x32 suffix that consume the new address format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Arvind Yadav	8adbd2a964	ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	30865756db	nir: Add a lowering pass for point smoothing When point smoothing is enabled then this lowering pass will modifies the alpha component of every write to fragment output. Anti-aliased points get rounded with respect to their radius instead of square. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:09 -04:00
Arvind Yadav	cad4908fa0	nir: add load_point_coord_maybe_flipped intrinsics for point smoothing gl_PointCoord can be flipped upside down via a state. To avoid this adding new load_point_coord_maybe_flipped intrinsics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:07:32 -04:00
Arvind Yadav	2709786bde	nir: Add a lowering pass for polygon and line smoothing When poly_line smoothing is enabled then this lowering pass will modify the alpha component of every write to fragment output using sample coverage mask. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16245>	2022-07-16 10:15:22 +00:00
Lionel Landwerlin	a41e8dc588	spirv: switch to uint64 for rayquery internal type Fixes dEQP-VK.ray_query.advanced.using_wrapper_function.comp.* An empty struct is causing problems because when passing it as argument the spirv parser will just drop the argument, considering it does not hold any data. v2: update radv CI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4c703686db` ("spirv: handle ray query intrinsics") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17420>	2022-07-14 09:15:52 +00:00
Jason Ekstrand	1124bee4ba	glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue If framebuffer fetch is used, we have to enable sample shading because the fetched framebuffer value is per-sample. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	3cf103f23d	nir/gather_info: Stop gathering uses_sample_shading Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	b6543470fe	spirv/nir: Set uses_sample_shading from spirv_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The Vulkan spec says that if a sample-qualified input, SampleID, or SamplePosition are in the entry-point's interface, you get per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	830654b7b0	glsl/nir: Set uses_sample_shading from glsl_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The GL spec says that any static use of a sample-qualified input, gl_SampleID, or gl_SamplePosition causes per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. For 1, this doesn't actually get us quite there as GLSL IR may have deleted something already. However, it does get us closer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	23b2d625dd	nir: Add a pass for lowering shaders to single-sampled On Intel, we have to do this because we can't ask for the per-sample barycentrics without setting the per-sample dispatch bit or the GPU will hang. However, nothing we're doing in this pass is Intel-specific and it may be a useful optimization for someone else so we may as well make it a generic NIR pass. This version actually does a bit more than the current brw_nir_demote_sample_qualifiers() pass as it also handles pre-nir_lower_io interp_dref_at* as well as a couple system values which we can easily constant-fold. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Georg Lehmann	aac8ddae2f	nir/opt_algebraic: Optimize [ui](add\|sub)_sat with 0. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	90a8fb0355	nir/lower_io: Fix array length of buffers larger than INT32_MAX. Before, if the ssbo is too large this would always return 0. Also, this code is easier to optimize, so the common case of offset 0 and pot stride results in one ushr instead of 5+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Eric Engestrom	9844a2fb64	nir: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Emma Anholt	0e1fb2d984	nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. Now that we know what it does, it also explains what it's doing in interpolateAtOffset in ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17322>	2022-07-11 16:56:05 +00:00
Alyssa Rosenzweig	befc68ec33	nir/opt_shrink_vectors: Round to supported vec size The set of supported vector sizes in NIR has holes in it. For example, we support vec5 and vec8, but not vec6 or vec7. However, this pass did not take that into account, and would happily shrink a vec8 down to a vec7, causing NIR validation to fail. Instead, the pass should round up to the next supported vector size. Fixes NIR validation fail in OpenCL's test_basic hiloeo subtest. v2: Clamp -> round rename. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17194>	2022-07-10 18:03:46 +00:00
Emma Anholt	421aa5dfac	mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0. This will be used for vc4, where incorrectly exposing 3D textures accounts for most of the GLES2 conformance failures it has. This leaves EXT_texture3d exposed in the (already non-conformant) GL2.1 support it exposes, which has always been a best-effort thing. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17350>	2022-07-10 02:50:09 +00:00
Lionel Landwerlin	a4c5521ea9	nir/serialize: restore ray query variables The ray query status of a variable is tracked in the nir_variable::data. We need to store it in the serialization otherwise restoring NIR from a cache will drop the annotation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5a9cdab170` ("nir: track variables representing ray queries") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16059>	2022-07-09 00:32:00 +00:00
Jason Ekstrand	b2ab6d10e4	mesa,glsl,ttn: Set subgroup_size to UNIFORM Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	8851f50753	spirv,vulkan: Set shader_info::subgroup_size Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	e1ee201722	shader_info: Move subgroup_size out of cs and make it an enum Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Iago Toral Quiroga	2071804f33	nir/serialize: fix missing divergence info after deserialization Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17397>	2022-07-08 06:15:28 +00:00
Rhys Perry	bb0415b697	nir: allow 16-bit fsin_amd/fcos_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	bc1ea2fda9	nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b)) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	69d21a3dee	nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd GCN has better range, but constant folding is the same. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Iago Toral Quiroga	84a0dca9df	nir: fix documentation for uadd_carry and usub_borry opcodes These opcodes where fixed to return an integer instead of a boolean value some time ago but the documentation for them was not updated and still talked about a boolean result. Fixes: `b0d4ee520` ('nir/opcodes: Fix up uadd_carry and usub_borrow') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17372>	2022-07-07 09:16:24 +00:00
Jason Ekstrand	bfbcd966f3	nir: Use util_mask_sign_extend when serializing constants Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17214>	2022-07-06 11:23:18 +00:00
Jason Ekstrand	efc63ea02d	util,nir: Move mask_sign_extend from opt_load_store_vectorize to util While we're moving it, reformat a bit to make it match util_sign_extend better. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17214>	2022-07-06 11:23:18 +00:00
Daniel Schürmann	862f1eacb2	nir/opt_shrink_vectors: fix re-using of components for vecN Cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17276>	2022-07-01 12:01:21 +00:00
Timothy Arceri	f533dfff55	glsl: drop non-nir path for atan in builtin functions All drivers now use NIR. Here we drop the non NIR path and rename the NIR path to drop the extra "_op" chars from the function names. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17308>	2022-07-01 03:05:03 +00:00
Konstantin Seurer	4676b3d3dd	nir: Use nir_test_mask instead of i2b(iand) Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17242>	2022-06-30 18:00:32 +00:00
Konstantin Seurer	1018d0f9c0	nir: Add a nir_test_mask helper nir_ine_imm(b, nir_iand_imm(b, x, mask), 0) and nir_i2b(b, nir_iand_imm(b, x, mask)) are common patterns which become quite messy when they are part of a larger expression. Clang-format does not improve things either and we can end up with some rather interesting looking code. (RADV ray tracing pipeline and query lowering) Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17242>	2022-06-30 18:00:32 +00:00
Mike Blumenkrantz	0e7863c3b0	nir/types: fix glsl_matrix_type_is_row_major() assert interface blocks can have row_major set cc: mesa-stable affects (zink): dEQP-GLES2.functional.shaders* Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17238>	2022-06-30 00:41:46 +00:00
Timothy Arceri	9af897b9ed	glsl: merge lower_buffer_access with lower_shared_reference There are no longer any other users of lower_buffer_access so here we just merge them to make the code easier to follow. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17271>	2022-06-29 02:33:38 +00:00
Boris Brezillon	5e5b54c9d1	nir/serialize: Silence integer-overflow false positive Use util_sign_extend() to silence the following integer-overflow error. src/compiler/nir/nir_serialize.c:1333:40: runtime error: left shift of 1000165000 by 13 places cannot be represented in type 'int' Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17186>	2022-06-28 09:07:32 +00:00
Boris Brezillon	370ca07482	nir/serialize: Support texop >= 16 Extend the packed_instr struct to support texops above nir_texop_fragment_fetch_amd. Fixes: `603e6ba972` ("nir: add two new texture ops for multisample fragment color/mask fetches") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17186>	2022-06-28 09:07:32 +00:00
Lionel Landwerlin	cf44282deb	clc: add new feature options for intel_clc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17253>	2022-06-27 15:31:49 +00:00
Boris Brezillon	ee536ea633	nir/serialize: Put dest last in packed_instr::tex packed_instr::tex::dest must be last to match the packed_instr::any::dest position. Fixes: `35655865cb` ("nir/serialize: pack instructions better") Cc: stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17212>	2022-06-27 09:39:22 +00:00
Qiang Yu	fdf589321c	ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd Also add radv and radeonsi implementation. Will be used in tess lowering. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Jason Ekstrand	7c127ca018	nir/opt_memcpy: Add another case for function_temp Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Jason Ekstrand	dc85065944	nir: Add an options parameter to deref_instr_has_complex_use Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Jason Ekstrand	d6123460fd	nir/opt_memcpy: lower copies to/from tightly packed types v2: Add comment by Jason (Lionel) Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Ian Romanick	6689fa2ab4	nir/range_analysis: Teach range analysis about fdot opcodes This really, really helps on platforms where fabs() isn't free. A great many shaders use a * frsq(fabs(fdot(a, a))) to normalize a vector. Since the result of the fdot must be non-negative, the fabs can be eliminated by an existing algebraic rule. shader-db results: r300 (run on R420 - X800XL) total instructions in shared programs: 1369807 -> 1368550 (-0.09%) instructions in affected programs: 59986 -> 58729 (-2.10%) helped: 609 HURT: 0 total vinst in shared programs: 512899 -> 512861 (<.01%) vinst in affected programs: 1522 -> 1484 (-2.50%) helped: 36 HURT: 0 total sinst in shared programs: 260690 -> 260570 (-0.05%) sinst in affected programs: 1419 -> 1299 (-8.46%) helped: 120 HURT: 0 total consts in shared programs: 957295 -> 957230 (<.01%) consts in affected programs: 849 -> 784 (-7.66%) helped: 65 HURT: 0 LOST: 0 GAINED: 3 The 3 gained shaders are all vertex shaders from XCom: Enemy Unknown. I'm guessing that game is never going to run on my X800XL. :) i915 total instructions in shared programs: 791121 -> 780843 (-1.30%) instructions in affected programs: 220170 -> 209892 (-4.67%) helped: 2085 HURT: 0 total temps in shared programs: 47765 -> 47766 (<.01%) temps in affected programs: 9 -> 10 (11.11%) helped: 0 HURT: 1 total const in shared programs: 93048 -> 92983 (-0.07%) const in affected programs: 784 -> 719 (-8.29%) helped: 65 HURT: 0 LOST: 0 GAINED: 36 Haswell, Ivy Bridge, and Sandy Bridge had similar results. (Haswell shown) total instructions in shared programs: 16702250 -> 16697908 (-0.03%) instructions in affected programs: 119277 -> 114935 (-3.64%) helped: 1065 HURT: 0 helped stats (abs) min: 1 max: 20 x̄: 4.08 x̃: 4 helped stats (rel) min: 0.48% max: 10.17% x̄: 3.66% x̃: 3.94% 95% mean confidence interval for instructions value: -4.26 -3.89 95% mean confidence interval for instructions %-change: -3.76% -3.56% Instructions are helped. total cycles in shared programs: 880772068 -> 880734134 (<.01%) cycles in affected programs: 2134456 -> 2096522 (-1.78%) helped: 941 HURT: 324 helped stats (abs) min: 2 max: 2180 x̄: 123.06 x̃: 44 helped stats (rel) min: 0.04% max: 49.96% x̄: 7.08% x̃: 3.81% HURT stats (abs) min: 2 max: 2098 x̄: 240.33 x̃: 35 HURT stats (rel) min: 0.04% max: 77.07% x̄: 12.34% x̃: 3.00% 95% mean confidence interval for cycles value: -47.93 -12.04 95% mean confidence interval for cycles %-change: -2.87% -1.34% Cycles are helped. No shader-db changes on any other Intel platform. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17181>	2022-06-23 18:46:27 +00:00

1 2 3 4 5 ...

7161 Commits