AlexIndustrial/mesa

Author	SHA1	Message	Date
Qiang Yu	84956286a8	nir/lower_gs_intrinsics: fix primitive count for points When primitive is points, EndPrimitive can't be used to count primitive. Need to use vertex count instead. And it's also not needed to do vertex per primitive count and overwrite incomplete primitive work for points. Fixes: `2be99012e9` ("nir: Add ability to count emitted GS primitives.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17805>	2022-08-15 01:39:28 +00:00
Michael Tang	97902a9ef8	nir: add nir_instr_as_str Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12510>	2022-08-11 16:17:46 +00:00
Mike Blumenkrantz	c37c6ac613	nir/validate: add some (light) validation for sampler type matching this adds minimal validation for tex ops with derefs to check that the dest type integer-ness matches the sampled type's integer-ness the aim is to provide the most basic validation that nir is being modified and created consistently, not to perform exact verification that the types are identical fix #6985 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17874>	2022-08-10 19:44:59 +00:00
Mike Blumenkrantz	b7eda568a4	nir/validate: clamp unsized tex dests to 32bit this is the "default" size that's expected cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17874>	2022-08-10 19:44:59 +00:00
Pierre-Eric Pelloux-Prayer	70891edd97	nir: add a nir_opt_if_options enum And don't enable nir_opt_if_optimize_phi_true_false on radeonsi with LLVM 14 because it crashes Blender. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6976 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17949>	2022-08-10 12:55:39 +00:00
Timothy Arceri	8bffd601ed	Revert "nir: Preserve offsets in lower_io_to_scalar_early" This reverts commit `96fa23bca5`. The correct fix to the problem was `a1bc152340`, making this change obsolete as the pass skips any vars marked with always_active_io. There was no real advantage to allowing these vars to be split because they can't be removed anyway. Also there is no way to split varying arrays gracefully here due to the xfb layout rules, and this change didn't handle arrays at all. Removing this obsolete code also fixes an assert in the new CTS test KHR-Single-GL45.enhanced_layouts.xfb_all_stages. The test was legally adding xfb offsets to all vertex stages but since we only mark the varyings in the final vertex stage with the always_active_io flag the other stages were correctly lowering to scalars but when an array with an offset hit this code it asserted since it couldn't handle it. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Fixes: `a1bc152340` ("spirv: mark variables decorated with XfbBuffer as always active") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6928 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17878>	2022-08-08 01:37:20 +00:00
Iago Toral Quiroga	9d6770d20a	nir/lower_alu: drop unnecessary iand on uadd_carry result uadd_carry returns 1 or 0, so ANDing with 1 is unnecessary. Probably this was implemented thinking that it was returning a boolean value. shader-db results for V3D: total instructions in shared programs: 12463571 -> 12462964 (<.01%) instructions in affected programs: 28994 -> 28387 (-2.09%) helped: 110 HURT: 1 total uniforms in shared programs: 3704591 -> 3704588 (<.01%) uniforms in affected programs: 247 -> 244 (-1.21%) helped: 3 HURT: 0 total max-temps in shared programs: 2148138 -> 2148117 (<.01%) max-temps in affected programs: 729 -> 708 (-2.88%) helped: 23 HURT: 2 total sfu-stalls in shared programs: 21230 -> 21232 (<.01%) sfu-stalls in affected programs: 0 -> 2 helped: 0 HURT: 2 Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17903>	2022-08-06 23:11:40 +00:00
Jason Ekstrand	de2065496a	nir: Clean up and improve nir_dedup_inline_samplers It now removes dead inline sampler variables and moves everything to the end so we no longer need nir_move_inline_samplers_to_end(). Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	2b12985465	nir: extract the clc inline sampler dedup pass from clc Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	31ed24cec7	nir/lower_images: extract from clover Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:50 +00:00
Karol Herbst	01500198a6	nir: serialize printf metadata for CL kernels Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Karol Herbst	aa82808645	printf: extract clovers printf impl Also make the code cleaner and simplier. Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17334>	2022-08-04 23:53:49 +00:00
Constantine Shablya	fa5559f272	nir: add a pass to remove non-uniform access qualifier when the operands are uniform Signed-off-by: Constantine Shablya <constantine.shablya@collabora.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17558>	2022-08-03 23:57:50 +00:00
Marek Olšák	e075769a53	nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. AMD will use this to execute a lowering pass conditionally. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	3098000e71	nir: add nir_texop_descriptor_amd AMD will use it to emulate resinfo. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	6483fd394e	nir: add nir_intrinsic_image_descriptor_amd This returns the AMD shader resource descriptor. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Marek Olšák	ea6993f9c7	nir: add nir_intrinsic_image_samples_identical radeonsi will use it Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17693>	2022-08-03 17:44:15 +00:00
Alyssa Rosenzweig	a4a15f500c	nir/lower_idiv: Be less creative about signs I'm sorry to whoever wrote this, but (x - (int) (x < 0)) ^ -((int) (x < 0)) is not an acceptable way to write iabs. Shader-db results on Intel Tiger Lake with lower_idiv enabled: total instructions in shared programs: 21122548 -> 21122570 (<.01%) instructions in affected programs: 2369 -> 2391 (0.93%) helped: 2 HURT: 8 total cycles in shared programs: 791609360 -> 791608062 (<.01%) cycles in affected programs: 114106 -> 112808 (-1.14%) helped: 9 HURT: 1 If we make the Intel back-end less stupid, we get to 9/1 helped/HURT for instructions as well but that's for a different MR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17845>	2022-08-03 14:24:38 +00:00
Jason Ekstrand	25dcb8d201	nir/from_ssa: Ignore undef sources Is a phi source is an undef, there's no point in copying it or really caring about it at all. We would just end up inserting a mov from an undef to a register. Instead, treat phi sources which point to an undef as if the phi source doesn't exist. This also prevents them from being included in phi webs which should reduce the overall interference seen in the shader. Currently, if two phis share an undef, their phi webs are consdiered to interfere. By ignoring undefs we can get rid of this false interference and reduce the size of phi webs. Reducing the number of things being copied by the parallel copy instructions should also free up the paralle copy algorithm and reduce the over-all churn of movs. Shader-db results on Haswell: total instructions in shared programs: 8156608 -> 8155406 (-0.01%) instructions in affected programs: 164838 -> 163636 (-0.73%) Shader-db results on Skylake: total instructions in shared programs: 18227370 -> 18227359 (<.01%) instructions in affected programs: 519 -> 508 (-2.12%) helped: 6 HURT: 0 Shader-db results on Tigerlake: total instructions in shared programs: 21167987 -> 21168025 (<.01%) instructions in affected programs: 23701 -> 23739 (0.16%) helped: 21 HURT: 27 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16817>	2022-08-01 22:13:24 +00:00
Emma Anholt	31b9b04880	nir: Use nir_foreach_phi_src consistently. I copy-and-pasted one of these and people noted that we had a better tool, so make sure nobody else copy and pastes it. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Emma Anholt	3714c89d0e	nir: Add an opt pass for phis after if choosing between true/false. This pattern almost always gets peephole-selected out anyway, but I noticed it once I removed glsl opt_conditional_discard. iris shader-db: total instructions in shared programs: 8933934 -> 8933158 (<.01%) instructions in affected programs: 75575 -> 74799 (-1.03%) helped: 179 HURT: 15 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17664>	2022-08-01 17:39:30 +00:00
Eric Engestrom	2c67457e5e	util/list: rename LIST_ENTRY() to list_entry() This follows the Linux kernel convention, and avoids collision with macOS header macro. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6751 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6840 Cc: mesa-stable Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17772>	2022-07-28 10:10:44 +00:00
Georg Lehmann	df4b5914cd	nir/fold_16bit_tex_image: Default to only_fold_all. No driver doesn't use this option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17757>	2022-07-27 18:57:12 +00:00
Jesse Natalie	d216d32756	nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17603>	2022-07-23 14:48:17 +00:00
Emma Anholt	f6c5b1d6c6	nir: Split usub_sat lowering flag from uadd_sat. Intel vec4 would like to do uadd_sat, but use lowering for usub_sat. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17637>	2022-07-22 17:54:28 +00:00
Georg Lehmann	a93786fc26	nir/lower_mediump: Add an option to only fold if all tex sources can be folded. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Alejandro Piñeiro	8d3ce4eb06	nir: call nir_metadata_preserve at nir_remove_unused_io_vars Without it we got a metadata assert: deqp-vk: ../src/compiler/nir/nir_metadata.c:108: nir_metadata_check_validation_flag: Assertion `!(function->impl->valid_metadata & nir_metadata_not_properly_reset)' failed if we try to use NIR_PASS(_, instead of NIR_PASS_V (that among other things, do more validations). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Marcin Ślusarz	5e14445430	nir: convert unused mesh outputs to shared memory Otherwise reads from output in one subgroup may not see writes from other subgroups. Temp variables are later converted to scratch, so even within one subgroup we may not see correct values. Test case in https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/115 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17517>	2022-07-20 09:22:06 +00:00
Timothy Arceri	d1e36634bd	nir/loop_unroll: clean up after complex_unroll_single_terminator() Previously we would just unroll the loop one extra iteration and let other optimisation passes clean up the mess. This worked to a degree but if the loop happened to be nested inside another loop we would end up with phi chains that would block other passes from being able to do the cleanup. With this commit we explicitly clone the variables create by lcsaa and insert them directly in the last continue branch after we are done unrolling. With this optimisation passes can recognise both sides of the if output the same values and can progress further. Help with the issues described in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6051 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17611>	2022-07-20 03:47:45 +00:00
Konstantin Seurer	fab0050223	nir: Add a common gen_rect_vertices implementation Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Iago Toral Quiroga	b18cecbfb6	nir: add nir_address_format_2x32bit_global This adds support for global 64-bit GPU addresses as a pair of 32-bit values. This is useful for platforms with 32-bit GPUs that want to support VK_KHR_buffer_device_address, which makes GPU addresses explicitly 64-bit. With the new format we also add new global intrinsics with 2x32 suffix that consume the new address format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Arvind Yadav	8adbd2a964	ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	30865756db	nir: Add a lowering pass for point smoothing When point smoothing is enabled then this lowering pass will modifies the alpha component of every write to fragment output. Anti-aliased points get rounded with respect to their radius instead of square. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:09 -04:00
Arvind Yadav	cad4908fa0	nir: add load_point_coord_maybe_flipped intrinsics for point smoothing gl_PointCoord can be flipped upside down via a state. To avoid this adding new load_point_coord_maybe_flipped intrinsics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:07:32 -04:00
Arvind Yadav	2709786bde	nir: Add a lowering pass for polygon and line smoothing When poly_line smoothing is enabled then this lowering pass will modify the alpha component of every write to fragment output using sample coverage mask. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16245>	2022-07-16 10:15:22 +00:00
Jason Ekstrand	3cf103f23d	nir/gather_info: Stop gathering uses_sample_shading Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	23b2d625dd	nir: Add a pass for lowering shaders to single-sampled On Intel, we have to do this because we can't ask for the per-sample barycentrics without setting the per-sample dispatch bit or the GPU will hang. However, nothing we're doing in this pass is Intel-specific and it may be a useful optimization for someone else so we may as well make it a generic NIR pass. This version actually does a bit more than the current brw_nir_demote_sample_qualifiers() pass as it also handles pre-nir_lower_io interp_dref_at* as well as a couple system values which we can easily constant-fold. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Georg Lehmann	aac8ddae2f	nir/opt_algebraic: Optimize [ui](add\|sub)_sat with 0. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	90a8fb0355	nir/lower_io: Fix array length of buffers larger than INT32_MAX. Before, if the ssbo is too large this would always return 0. Also, this code is easier to optimize, so the common case of offset 0 and pot stride results in one ushr instead of 5+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Eric Engestrom	9844a2fb64	nir: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Emma Anholt	0e1fb2d984	nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. Now that we know what it does, it also explains what it's doing in interpolateAtOffset in ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17322>	2022-07-11 16:56:05 +00:00
Alyssa Rosenzweig	befc68ec33	nir/opt_shrink_vectors: Round to supported vec size The set of supported vector sizes in NIR has holes in it. For example, we support vec5 and vec8, but not vec6 or vec7. However, this pass did not take that into account, and would happily shrink a vec8 down to a vec7, causing NIR validation to fail. Instead, the pass should round up to the next supported vector size. Fixes NIR validation fail in OpenCL's test_basic hiloeo subtest. v2: Clamp -> round rename. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17194>	2022-07-10 18:03:46 +00:00
Lionel Landwerlin	a4c5521ea9	nir/serialize: restore ray query variables The ray query status of a variable is tracked in the nir_variable::data. We need to store it in the serialization otherwise restoring NIR from a cache will drop the annotation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5a9cdab170` ("nir: track variables representing ray queries") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16059>	2022-07-09 00:32:00 +00:00
Iago Toral Quiroga	2071804f33	nir/serialize: fix missing divergence info after deserialization Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17397>	2022-07-08 06:15:28 +00:00
Rhys Perry	bb0415b697	nir: allow 16-bit fsin_amd/fcos_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	bc1ea2fda9	nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b)) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	69d21a3dee	nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd GCN has better range, but constant folding is the same. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Iago Toral Quiroga	84a0dca9df	nir: fix documentation for uadd_carry and usub_borry opcodes These opcodes where fixed to return an integer instead of a boolean value some time ago but the documentation for them was not updated and still talked about a boolean result. Fixes: `b0d4ee520` ('nir/opcodes: Fix up uadd_carry and usub_borrow') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17372>	2022-07-07 09:16:24 +00:00
Jason Ekstrand	bfbcd966f3	nir: Use util_mask_sign_extend when serializing constants Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17214>	2022-07-06 11:23:18 +00:00

1 2 3 4 5 ...

3817 Commits