AlexIndustrial/mesa

Author	SHA1	Message	Date
Georg Lehmann	bb7e8d51b6	nir: delete nir_opt_reuse_constants Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31031>	2024-09-27 05:19:16 +00:00
Georg Lehmann	a9f8089240	nir: replace nir_opt_remove_phis_block with a single source version This is what callers actually want, and it simplifies nir_opt_remove_phis because we can assume dominance meta data is valid. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31031>	2024-09-27 05:19:16 +00:00
Timothy Arceri	60937b5286	nir: add implicit_conversion_prohibited field to nir_parameter Will be used in link time validation in following patches. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	5645495156	nir: store variable mode in nir_parameter This will be used by the nir glsl linker in following patches. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	6ff3e87e5f	nir: add function in/outs to variable modes Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	1cb115abd2	nir: add nir_function_impl_clone_remap_globals() This will be used by the glsl nir linker when we are combining different shaders from the same shader stage that might have multiple declarations of global variables across the different shaders. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Timothy Arceri	7a1061e0dd	nir: add max_ifc_array_access field to vars This will be used in following patches by the nir based glsl linker code. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Timothy Arceri	7c5b21c032	glsl: add support for converting global instructions to NIR NIR doesn't really support global instructions such as global val initialisation. So here we add functionality to glsl_to_nir() to put these instructions into a temporary function that will be later inlined into main. We give the function a name starting with gl_mesa_tmp_ as functions starting with gl_ are reserved and will not have any clashes with user functions. We finish the name with the blake3 of the shader source to avoid conflicts with multiple shaders attached to a single stage. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Boris Brezillon	eeb3512498	nir/lower_ssbo: Extend the load_ssbo_address intrinsic to pass an offset On Mali(Valhall), the bounds checking can be done in hardware, but for this to work properly, we need to pass the offset to the nir_load_ssbo_address() intrinsic. Add an offset source to the intrinsic, and adjust the lowering pass to conditionally lower the offset addition. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31164>	2024-09-18 13:45:57 +00:00
Boris Brezillon	adadb097a3	nir/lower_ssbo: Add an option to conditionally lower loads On Mali(Valhall), we have a way to load SSBO data without going through an SSBO index -> global address translation, so let's provide a way to tell nir_lower_ssbo() when it shouldn't lower loads. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31164>	2024-09-18 13:45:57 +00:00
Ian Romanick	6a09d33549	nir: Add a pass to generate BFI instructions from logical operations Inspired by a commit message in !30934, I set about optimizing the code generated for nir_copysign. It would be possible to just implement an opt_algebraic pattern for the specific values used by nir_copysign, but this casts a slightly larger net. As noted in a comment in the code, there may be variations of the pattern that this pass misses. The opt_algebraic pattern would miss them too. v2: Use nir_def_replace. Suggested by Alyssa. Allow more "root" instruction types. Suggested by Georg. v3: Treat extract_u16(x, 0) as (x & 0x0000ffff), and treat extract_u8(x, 0) as (x & 0x000000ff). v4: Use nir_scalar. Suggested by Georg. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31006>	2024-09-13 00:21:00 +00:00
Rhys Perry	4f44a944bb	nir/opt_if: fix fighting between split_alu_of_phi and peel_initial_break Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `6b4b044739` ("nir/opt_loop: add loop peeling optimization") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11822 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31002>	2024-09-12 23:36:57 +00:00
Qiang Yu	d43c5003fc	nir: add skip_lower_packing_ops shader compile option Drivers like radeonsi and radv prefer not to lower some packing ops. Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30885>	2024-08-30 05:46:51 +00:00
Daniel Schürmann	50d416fe89	nir: add nir_block *nir_src_get_block(src) helper Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7710>	2024-08-29 09:42:55 +00:00
Konstantin Seurer	81e3930ec0	nir/print: Add a helper for generating debug info Prints the shader to a string and assigns source locations based on that. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18903>	2024-08-25 10:26:33 +00:00
Konstantin Seurer	ce24486ee4	nir: Introduce nir_debug_info_instr Adds a new instruction type that stores metadata that might be useful for debugging purposes. Passes must ignore these instructions when making decisions. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18903>	2024-08-25 10:26:33 +00:00
Timur Kristóf	10dcf1fca6	nir: Remove unused nir_assign_linked_io_var_locations. The only user of this pass was RADV. Considering that driver locations are deprecated, nobody should write new code relying on this pass. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29812>	2024-08-08 16:55:02 +00:00
Alyssa Rosenzweig	24b722a692	nir: add derivative intrinsics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30565>	2024-08-08 15:26:07 +00:00
Iván Briano	7fce39484e	nir: add pass to convert ViewIndex to DeviceIndex Used to implement VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT_KHR. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30329>	2024-08-07 19:09:55 +00:00
Zan Dobersek	7fd5f76393	nir/lower_vars_to_scratch: calculate threshold-limited variable size separately ir3's lowering of variables to scratch memory has to treat 8-bit values as 16-bit ones when comparing such value's size against the given threshold since those values are handled through 16-bit half-registers. But those values can still use natural 8-bit size and alignment for storing inside scratch memory. nir_lower_vars_to_scratch now accepts two size-and-alignment functions, one used for calculating the variable size and the other for calculating the size and alignment needed for storing inside scratch memory. Non-ir3 uses of this pass can just duplicate the currently-used function. ir3 provides a separate variable-size function that special-cases 8-bit types. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29875>	2024-08-07 14:32:28 +00:00
Alyssa Rosenzweig	d99c2ef059	nir/opt_uniform_atomics: add fs atomics predicated? flag on agx (and mali), we predicate atomics on "if (!helper)", so doing so again in this pass is redundant. and would cause a problem since we'd then have to lower the "is helper inv?" flag late. so just skip the extra lowering code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30488>	2024-08-06 11:48:17 -04:00
Karol Herbst	14ea102175	nir: add load_global_size intrinsic There is no need to compute it in the shader as the result is known at runtime already. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Tested-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30467>	2024-08-01 17:43:42 +00:00
Timothy Arceri	017770ff14	nir: add nir_tex_src_{sampler,texture}_deref_intrinsic To be used as a placeholder until after function inlining so we can replace function params with bindless handles if needed. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315>	2024-07-29 00:06:10 +00:00
Marek Olšák	d90080b51b	nir/opt_vectorize_io: optionally don't vectorize IO with different types Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11443 Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:17 +00:00
Marek Olšák	9bfea3183a	nir/opt_varyings: improve convergent input handling to fix data corruption Backward inter-shader code motion can move any code into the previous shader if it only uses convergent inputs. The problem is the final input type can end up being integer or FP64, which is incompatible with the assumption that convergent inputs can always be interpolated. If such a case occurs and the type is integer or FP64, either don't do any code motion, or if the driver exposes the new flag, rewrite convergent loads to use load_input. If the new flag is supported, all convergent loads are rewritten to use load_input, and flat varyings are allowed to be classified as convergent, which means they are packed into interpolated vec4 slots if there are unused components. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Daniel Schürmann	540ee1c81a	nir: implement loop invariant code motion (LICM) pass This simple LICM pass hoists all loop-invariant instructions from the loops' top-level control flow, skipping any nested CF. The hoisted instructions are placed right before the loop. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Alyssa Rosenzweig	d238d766c6	nir: add lower_fminmax_signed_zero This implements IEEE-754-2019 signed zero semantics for fmin/fmax, as now required by NIR, for hardware that has busted signed zero behaviour for fmin/fmax. Ian expressed interest in this for Intel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	d4c6fbc4a7	nir: add nir_alu_instr float controls queries These are helpful now that float_controls2 exists, these are common patterns worth factoring out into helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Qiang Yu	3151f5ec47	nir: add filter parameter to nir_lower_array_deref_of_vec To be used by later commits to limit the lowering to specific variables. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Georg Lehmann	99372c1ed7	nir: add ford, funord, fneo, fequ, fltu, fgeu Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:29 +00:00
Alyssa Rosenzweig	bbdd34b4ad	nir: add nir_def_replace helper "Rewrite and remove" is a super common idiom in NIR passes. Let's add a helper to make it more ergonomic. More to the point, I expect that /most/ of the time when a pass rewrites uses, they also want to remove the parent instruction. The principal reason not to is because it takes extra effort to add in the nir_instr_remove and nir_opt_dce will clean up after you eventually, right? From a compile time perspective, it's better to remove earlier to reduce the redundant processing between the pass and the next DCE run. So ... we want to be doing more removes. From a UX perspective - the way to nudge devs towards that is to make the preferred "rewrite-and-remove" pattern more ergonomic than the "rewrite but keep". That justifies the simple "replace" name rather than something silly like "rewrite_uses_and_remove". --- Something else I've wanted for a while. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29817>	2024-06-21 15:36:56 +00:00
Marek Olšák	75777f1dc8	nir: add a NIR option flag nir_io_prefer_scalar_fs_inputs It's a NIR option because passing flags from radeonsi to the GLSL linker is complicated. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29406>	2024-06-17 23:48:35 +00:00
Marek Olšák	2514999c9c	nir: add nir_opt_vectorize_io, vectorizing lowered IO Since nir_opt_varyings requires scalar IO and thus all drivers have to scalarize it, this gives the option to re-vectorize IO after that. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29406>	2024-06-17 23:48:35 +00:00
Alyssa Rosenzweig	90b6dba772	nir: add nir_metadata_control_flow Most passes want to preserve this specific combination of metadata, so let's add an alias for the combination. The alias communicates that the control flow graph is preserved, rather than a particular statement about e.g. dominance preservation. You don't need to understand dominance to write a simple nir_shader_instructions_pass. And since you were going to cargo cult the metadata anyway, this way you'll cargo cult a version you're more likely to understand. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29745>	2024-06-17 16:28:11 -04:00
Daniel Schürmann	9b1a748b5e	nir: remove nir_intrinsic_discard The semantics of discard differ between GLSL and HLSL and their various implementations. Subsequently, numerous application bugs occurred and SPV_EXT_demote_to_helper_invocation was written in order to clarify the behavior. In NIR, we now have 3 different intrinsics for 2 things, and while demote and terminate have clear semantics, discard still doesn't and can mean either of the two. This patch entirely removes nir_intrinsic_discard and nir_intrinsic_discard_if and replaces all occurrences either with nir_intrinsic_terminate{_if} or nir_intrinsic_demote{_if} in the case that the NIR option 'discard_is_demote' is being set. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27617>	2024-06-17 19:37:16 +00:00
Daniel Schürmann	f3d8bd18dd	nir: introduce discard_is_demote compiler option This new option indicates that the driver emits the same code for nir_intrinsic_discard and nir_intrinsic_demote. Otherwise, it is assumed that discard is implemented as terminate. spirv_to_nir uses this option in order to directly emit nir_demote in case of OpKill. RADV GFX11: Totals from 3965 (4.99% of 79439) affected shaders: MaxWaves: 119418 -> 119424 (+0.01%); split: +0.03%, -0.03% Instrs: 1608753 -> 1620830 (+0.75%); split: -0.18%, +0.93% CodeSize: 8759152 -> 8785152 (+0.30%); split: -0.18%, +0.48% VGPRs: 152292 -> 149232 (-2.01%); split: -2.37%, +0.36% Latency: 9162314 -> 10033923 (+9.51%); split: -0.46%, +9.97% InvThroughput: 1491656 -> 1493408 (+0.12%); split: -0.10%, +0.22% VClause: 21424 -> 21452 (+0.13%); split: -0.31%, +0.44% SClause: 53598 -> 55871 (+4.24%); split: -2.15%, +6.39% Copies: 90553 -> 90462 (-0.10%); split: -2.91%, +2.81% Branches: 16283 -> 16311 (+0.17%) PreSGPRs: 113993 -> 113254 (-0.65%); split: -1.84%, +1.19% PreVGPRs: 110951 -> 108914 (-1.84%); split: -2.08%, +0.24% VALU: 963192 -> 963167 (-0.00%); split: -0.01%, +0.01% SALU: 87926 -> 90795 (+3.26%); split: -2.92%, +6.18% VMEM: 25937 -> 25936 (-0.00%) SMEM: 110012 -> 109799 (-0.19%); split: -0.20%, +0.01% Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27617>	2024-06-17 19:37:15 +00:00
Job Noorman	609a56d170	nir/opt_offsets: add option to allow offset wrapping On some ISAs (e.g., ir3) the offset calculation wraps the same way as normal unsigned addition so potentially wrapping operations do not have to be ignored. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28664>	2024-06-14 17:12:59 +00:00
Job Noorman	518c93768b	nir/opt_offsets: add callback for max base offset To support cases where different instructions may be used for the same storage type. For example, to load from an SSBO on ir3, either ldib (max offset 127) or isam.v (max offset 255) can be used. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28664>	2024-06-14 17:12:59 +00:00
Faith Ekstrand	e05cb967e7	nir: Add nir_foreach_block_in_cf_node_safe() iterators Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:46 +00:00
Timothy Arceri	9702570994	nir: clarify and update loop conditional instruction This value is intended to be used to remove out of bounds array access when unrolling loops so it should contain the comparison that contains the induction variable, not the overall condition of the loop terminator. So here we update the instruction when dealing with iand/ior loop terminator conditions. Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28998>	2024-06-11 01:42:23 +00:00
Ian Romanick	7b7e5cf5d4	nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers v2: Add some comments explaining some of the nuance of the shift optimizations. Fix a bug in the shift count calculation of the upper 32-bits. Move the @64 from the variable to the opcode. All suggested by Jordan. No shader-db changes on any Intel platform. fossil-db: Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 154507026 -> 154506576 (-0.00%) Cycle count: 17436298868 -> 17436295016 (-0.00%) Max live registers: 32635309 -> 32635297 (-0.00%) Totals from 42 (0.01% of 632575) affected shaders: Instrs: 5616 -> 5166 (-8.01%) Cycle count: 133680 -> 129828 (-2.88%) Max live registers: 1158 -> 1146 (-1.04%) No fossil-db changes on any other Intel platform. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	22095c60bc	nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64 This allows us to not generate 64-bit iadd3 on Intel but continue generating it for NVIDIA. No shader-db or fossil-db changes. v2: Add nir_lower_iadd3_64 flag so we can continue to generate 64-bit iadd3 on NVIDIA platforms. v3: s/bit_size == 64/s == 64/. This cut-and-paste bug prevented any of the optimizations from ever occurring. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Timur Kristóf	0ea2bad74d	nir/lower_io: Add option to implement mediump as 32-bit. For drivers that don't lower mediump shader inputs / outputs to 16-bit, it's better to ignore the mediump flag completely, letting mediump inputs / outputs work like normal 32-bit IO. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29435>	2024-05-30 12:57:20 +00:00
Timur Kristóf	c1d38b0b37	nir: Add nir_opt_load_store_update_alignments. New pass that shares code with nir_opt_load_store_vectorize but it only updates the alignment of load/store instructions. It is useful before running other passes which may potentially destroy that information (eg. by removing some instructions from which the alignment may be deduced). Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29210>	2024-05-21 16:06:23 +00:00
Francisco Jerez	15a10786e3	nir: Add option to lower 64-bit uadd_sat. C.f. `16be909936`. Intel Xe2 won't support saturation for 64-bit integer addition, regardless of signedness. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28283>	2024-05-15 17:16:51 +00:00
Lionel Landwerlin	c518a176f5	nir: add ptr_bit_size parameter to nir_lower_printf Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Lionel Landwerlin	2be28ee58a	nir: add a base offset for printf indexing This will allow a driver to use a single table of printf strings across all shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00
Alyssa Rosenzweig	2912f531a7	nir: add texops for AGX border colour emulation AGX has limited border colour hardware. To support full customBorderColorWithoutFormat semantics, we're forced to emulate in shaders at a substantial performance penalty. Actually, that's needed just to pass CTS because of other hardware issues stacking on top of each others... Hooray! Add the texops we need to facilitate efficient custom border colour lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29179>	2024-05-14 04:57:24 +00:00
Alyssa Rosenzweig	b9a0c8dc6d	nir/lower_subgroups: add generic scan/reduce lower this is the lowering from NAK, fixed up for common code. the existing code is used for boolean scan/reduce. I make no guarantee that this works for subgroup sizes other than 32. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28993>	2024-05-10 17:00:54 +00:00

1375 Commits