AlexIndustrial/mesa

Author	SHA1	Message	Date
Kenneth Graunke	2b14618daa	glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers iris shader-db stats on Alchemist: total instructions in shared programs: 23150249 -> 23142733 (-0.03%) instructions in affected programs: 157322 -> 149806 (-4.78%) helped: 105 HURT: 2 helped stats (abs) min: 2 max: 821 x̄: 71.61 x̃: 15 helped stats (rel) min: 0.13% max: 27.56% x̄: 6.21% x̃: 2.35% HURT stats (abs) min: 1 max: 2 x̄: 1.50 x̃: 1 HURT stats (rel) min: 0.18% max: 0.23% x̄: 0.20% x̃: 0.20% 95% mean confidence interval for instructions value: -101.99 -38.50 95% mean confidence interval for instructions %-change: -7.59% -4.58% Instructions are helped. total sends in shared programs: 1036916 -> 1035366 (-0.15%) sends in affected programs: 15274 -> 13724 (-10.15%) helped: 108 / HURT: 0 helped stats (abs) min: 1 max: 162 x̄: 14.35 x̃: 3 helped stats (rel) min: 0.88% max: 33.83% x̄: 9.81% x̃: 5.05% 95% mean confidence interval for sends value: -20.79 -7.92 95% mean confidence interval for sends %-change: -11.66% -7.95% Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842>	2023-09-09 04:41:24 +00:00
Kenneth Graunke	fc0aaa81ee	nir: Reduce the scope of shared memory barriers Originally written by Ian Romanick for the Intel backend, but ported to the new nir_opt_barrier_modes() common optimization pass. Ian's original explanation and commit message follows: Shared memory only exists within a workgroup, so synchronizing it beyond workgroup scope is nonsense. Basically every SPIR-V compiler generates operations like OpMemoryBarrier(/Memory/Device, /Semantics/AcquireRelease \| WorkgroupMemory) This is suggested in numerous places, including https://github.com/KhronosGroup/GLSL/blob/master/extensions/khr/GL_KHR_vulkan_glsl.txt. Even Mesa's glsl_to_nir pass does this. This advice, which has been copy-and-pasted everywhere, is contrary to issue 13 in the original GL_ARB_compute_shader spec: "Since shared memory is only accessible to threads within a single work group, memoryBarrierShared() also only requires synchronization with other threads in the same work group." Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842>	2023-09-09 04:41:24 +00:00
Kenneth Graunke	7dd897e1cd	nir: Add an optimization pass to reduce barrier modes Many shaders issue full memory barriers, which may need to synchronize access to images, SSBOs, shared local memory, or global memory. However, many of them only use a subset of those memory types - say, only SSBOs. Shaders may also have patterns such as: 1. shared local memory access 2. barrier with full variable modes 3. more shared local memory access 4. image access In this case, the barrier is needed to ensure synchronization between the various shared memory operations. Image reads and writes do also exist, but they are all on one side of the barrier, so it is a no-op for image access. We can drop the image mode from the barrier here too. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842>	2023-09-09 04:41:24 +00:00
Kenneth Graunke	1c3706fc28	nir: Fix function parameter indentation in nir_opt_barriers.c The first parameter should be on the first line, and any subsequent lines should line up. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24842>	2023-09-09 04:41:24 +00:00
Georg Lehmann	136a698251	nir/opt_algebraic: remove broken fddx/fddy patterns These patterns are broken in the following scenario: %1 = f2fmp %0 %2 = fddx %1 %3 = ... // non quad uniform if %3 { %4 = f2f32 %2 ... } Which would turn into %3 = ... if %3 { %4 = fddx %0 ... } Yet another example that shows why derivative instructions should be intrinsics, not alu. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25014>	2023-09-08 14:14:47 +00:00
Timothy Arceri	84e0f5ce75	nir: remove unused param from nir_alu_src_copy() Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24986>	2023-09-08 03:01:39 +00:00
Timothy Arceri	9b6eae2e67	nir: remove unused nir_src_copy() Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24986>	2023-09-08 03:01:39 +00:00
Timothy Arceri	af1528cc15	nir: replace use of nir_src_copy() Since `03b2c34793` nir_src_copy() no longer does anything useful, it will be removed in the following patch. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24986>	2023-09-08 03:01:39 +00:00
Piotr Kocia	8019a1b929	glsl: ir_function_param_visitor::visit_enter always true condition The condition !param->type->is_vector() \|\| !param->type->is_scalar() always evaluates to true: * type is not scalar or vector -> true * type is vector, i.e. num_components > 1 -> num_components == 1 is false and !is_scalar() == true * type is scalar, i.e. num_components == 1 -> num_components > 1 is false and !is_vector() == true There is no comment explaining why such code has been written, therefore this seems to be a mistake. To maintain consistency with the surrounding code, glsl_type_is_scalar_or_vector has been used instead of replacing \|\| with &&. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24914>	2023-09-07 15:00:26 +10:00
Timothy Arceri	5d203c4ae0	glsl_to_nir: add more unhandled function types These are unhandled but were working ok because a mistake fixed in the following patch caused all functions to be skipped. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24914>	2023-09-07 14:08:02 +10:00
Timothy Arceri	67d1c36bb4	glsl: fix out params in glsl to nir We must use a temp var for out params and later copy the out values to the correct parameter otherwise we can end up overwriting global variables prematurely. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24914>	2023-09-07 14:03:21 +10:00
Marek Olšák	497c40be19	nir: remove nir_op_unpack_64 handling from nir_opt_undef It's no longer needed because undef is replaced with 0 in this case. It also has a bug that it doesn't freeze the undef value if undef has multiple uses. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24059>	2023-09-06 03:24:16 +00:00
Marek Olšák	861d274453	nir: replace undef only used by ALU opcodes with 0 or NaN If undef is consumed by an FP opcode, replace it with NaN to eliminate that opcode, else replace it with 0, but there are exceptions, such as when undef is used by stores or phis, it's not touched. This also contains workarounds for viewperf shaders. radeonsi: TOTALS FROM AFFECTED SHADERS (1987/58918) Code Size: 5158692 -> 5143796 (-0.29 %) bytes Max Waves: 22456 -> 22513 (0.25 %) Outputs: 3726 -> 3726 (0.00 %) Patch Outputs: 0 -> 0 (0.00 %) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24059>	2023-09-06 03:24:16 +00:00
Danylo Piliaiev	d0ab1a6217	isaspec: Make it possible to obtain gpu_id in <expr> blocks Done with the ISA_GPU_ID() macro. This makes it possible to use the GPU generation to select between overrides. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23217>	2023-09-05 16:19:29 +00:00
Alyssa Rosenzweig	31b5f5a51f	nir/opt_if: Simplify if's with general conditions Dolphin ubershaders have a pattern: if (x && y) { } else { discard; } The current code to simplify if's will bail on this pattern, since the condition is not a comparison. However, if that check is dropped and we allow NIR to invert this, we get: if (!(x && y)) { discard; } else { } which is now in a form for nir_opt_conditional_discard to turn it into discard_if(!(x && y)) which may be substantially cheaper than the original code. In general, I see no reason to restrict to conditionals. Assuming the backend is clever enough to delete empty else blocks (I think most are), then this patch is a strict win as long as inot instructions are cheaper than empty else blocks. This matches my intuition for typical GPUs, where simple ALU instructions are cheaper than control flow. Furthermore, it may be possible in practice for backends to fold the inot into a richer set of instructions. For example, most GPUs have a NAND instruction which would fold in the inot in the above code. So just drop the check, simplify the pass, get the win. --- Also, to avoid inflating register pressure, make sure we put the inot right before the if. Android shader-db on is uninspiring due to terrible coalescing decisions in the current RA. But it does fix the Dolphin smell. total instructions in shared programs: 1756571 -> 1756568 (<.01%) instructions in affected programs: 1600 -> 1597 (-0.19%) helped: 1 HURT: 4 Inconclusive result (value mean confidence interval includes 0). total bytes in shared programs: 11521172 -> 11521156 (<.01%) bytes in affected programs: 10080 -> 10064 (-0.16%) helped: 1 HURT: 4 Inconclusive result (value mean confidence interval includes 0). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24965>	2023-09-05 02:36:41 +00:00
Dave Airlie	07ef39ddc6	nir/gather: add support for fbfetch and bindless image loads. If a driver calls gather after lowering the uses_fbfetch_output needs to be set properly if we have bindless image loads. Fixes a regression seen calling gather info later in some llvmpipe work. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24987>	2023-09-04 08:06:08 +10:00
Georg Lehmann	3a715cc9d2	nir: add nir_scalar_equal Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24656>	2023-09-02 00:26:31 +00:00
Georg Lehmann	bce9bba90d	nir: add nir_scalar intrinsic helpers Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24656>	2023-09-02 00:26:31 +00:00
Alyssa Rosenzweig	f80c57c38f	treewide: Use nir_before/after_impl for more elaborate cases Via Coccinelle patch: @@ expression func_impl; @@ -nir_before_block(nir_start_block(func_impl)) +nir_before_impl(func_impl) @@ expression func_impl; @@ -nir_after_block(nir_impl_last_block(func_impl)) +nir_after_impl(func_impl) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24910>	2023-08-30 19:30:58 +00:00
Alyssa Rosenzweig	25cc04c59b	treewide: Use nir_before/after_impl in easy cases These open-code the same idiom as the helper. Via Coccinelle patch: @@ expression func_impl; @@ -nir_before_cf_list(&func_impl->body) +nir_before_impl(func_impl) @@ expression func_impl; @@ -nir_after_cf_list(&func_impl->body) +nir_after_impl(func_impl) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24910>	2023-08-30 19:30:58 +00:00
Alyssa Rosenzweig	4c45503aae	nir: Add nir_before/after_impl cursors These are common enough to merit their own helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24910>	2023-08-30 19:30:58 +00:00
Karol Herbst	513cd29eda	nir: make num_workgroups 32 bit only Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24905>	2023-08-30 07:04:33 +00:00
Karol Herbst	1b22b67199	nir: make workgroup_id 32 bit only No backend supports 64 bit values natively anyway. Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24905>	2023-08-30 07:04:33 +00:00
Alyssa Rosenzweig	011f0b0d7d	nir/lower_shader_calls: Fix warning with clang Implicit conversion warning. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24864>	2023-08-29 14:06:14 +00:00
Konstantin Seurer	a209d76722	nir/lower_shader_calls: Limit the remat chain length There is no way we will rematerialize a 40k instruction long chain and it also won't be beneficial. This improves the replay time of our CP2077 fossil by 350% when compiling only ray tracing pipelines. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24864>	2023-08-29 14:06:14 +00:00
Ian Romanick	5ce6e09ffc	nir/algebraic: Remove redundant pack / unpack lowering patterns No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24900>	2023-08-25 14:54:11 -07:00
Ian Romanick	69d086c6c4	nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers v2: Fix problems with 16-bit src0. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24899>	2023-08-25 20:15:37 +00:00
twisted89	d3e796da6b	util/driconf: add workarounds for the Chronicles of Riddick Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24567>	2023-08-24 22:38:56 +00:00
Alyssa Rosenzweig	cda1961835	treewide: Also handle struct nir_builder form Via Coccinelle patch: @def@ typedef bool; typedef nir_builder; typedef nir_instr; typedef nir_def; identifier fn, instr, intr, x, builder, data; @@ static fn(struct nir_builder* builder, -nir_instr instr, +nir_intrinsic_instr intr, ...) { ( - if (instr->type != nir_instr_type_intrinsic) - return false; - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); \| - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); - if (instr->type != nir_instr_type_intrinsic) - return false; ) <... ( -instr->x +intr->instr.x \| -instr +&intr->instr ) ...> } @pass depends on def@ identifier def.fn; expression shader, progress; @@ ( -nir_shader_instructions_pass(shader, fn, +nir_shader_intrinsics_pass(shader, fn, ...) \| -NIR_PASS_V(shader, nir_shader_instructions_pass, fn, +NIR_PASS_V(shader, nir_shader_intrinsics_pass, fn, ...) \| -NIR_PASS(progress, shader, nir_shader_instructions_pass, fn, +NIR_PASS(progress, shader, nir_shader_intrinsics_pass, fn, ...) ) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24852>	2023-08-24 15:48:02 +00:00
Alyssa Rosenzweig	465b138f01	treewide: Use nir_shader_intrinsic_pass sometimes This converts a lot of trivial passes. Nice boilerplate deletion. Via Coccinelle patch (with a small manual fix-up for panfrost where coccinelle got confused by genxml + ninja clang-format squashed in, and for Zink because my semantic patch was slightly buggy). @def@ typedef bool; typedef nir_builder; typedef nir_instr; typedef nir_def; identifier fn, instr, intr, x, builder, data; @@ static fn(nir_builder* builder, -nir_instr instr, +nir_intrinsic_instr intr, ...) { ( - if (instr->type != nir_instr_type_intrinsic) - return false; - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); \| - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); - if (instr->type != nir_instr_type_intrinsic) - return false; ) <... ( -instr->x +intr->instr.x \| -instr +&intr->instr ) ...> } @pass depends on def@ identifier def.fn; expression shader, progress; @@ ( -nir_shader_instructions_pass(shader, fn, +nir_shader_intrinsics_pass(shader, fn, ...) \| -NIR_PASS_V(shader, nir_shader_instructions_pass, fn, +NIR_PASS_V(shader, nir_shader_intrinsics_pass, fn, ...) \| -NIR_PASS(progress, shader, nir_shader_instructions_pass, fn, +NIR_PASS(progress, shader, nir_shader_intrinsics_pass, fn, ...) ) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24852>	2023-08-24 15:48:02 +00:00
Yonggang Luo	26c5200acf	compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp glsl_print_type is only referenced in ir_print_visitor.cpp, so there is no need to expose it. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24824>	2023-08-24 02:54:09 +00:00
Yonggang Luo	01ddb18427	compiler: use 4 instead of ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h" Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24824>	2023-08-24 02:54:09 +00:00
Piotr Kocia	27eafbcd4e	nir: Remove dead nir_const_value variables The nir_const_value variables in nir_const_value_for_int and nir_const_value_for_uint are unused, resulting in unnecessary dead code. The unused-variable warning has been suppressed by the memset following their declarations. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24851>	2023-08-23 19:29:19 +00:00
Alyssa Rosenzweig	5189bae50c	asahi: Move UBO lowering into GL driver In Vulkan, UBOs are lowered by nir_lower_explicit_io, and the ubo_base_agx sysval is unused (since it doesn't handle descriptor sets). That makes the UBO lowering GL-only and hence belongs with the GL driver rather than the compiler. This lets us delete the ubo_base_agx sysval. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24847>	2023-08-23 15:06:55 +00:00
Alyssa Rosenzweig	1d77fb967d	nir,asahi: Remove texture_base_agx Doing a descriptor crawl with binding tables requires a real binding table in the shader, which won't work for VK or merged shader stages in GL. Instead, let's lower anything that needs a crawl to bindless in the driver, so the compiler code doesn't need to know anything about descriptor binding models. That gets rid of the texture_base_agx sysval, which is problematic when there are multiple descriptor sets worth of textures. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24847>	2023-08-23 15:06:55 +00:00
Alyssa Rosenzweig	ec2ab7d771	nir: Add load_sysval_agx intrinsic For merging shader stages, it will be useful to express a load from an explicit GL "descriptor set", so we can represent things like UBO loads with merged shaders where UBOs can come from either stage. To do so, we add an intrinsic representing a load from the driver's uniform tables, indexed like "descriptor sets" with "bindings". In principle, a layered GL-on-Vulkan implementation would use literal descriptor sets for each stage, so I feel comfortable with the analogy here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24847>	2023-08-23 15:06:54 +00:00
Daniel Schürmann	7e246f7f2b	nir/opt_move: fix handling of if-condition By accident, this used the parent of the nir_src which is a nir_if instead of the parent of the SSA value. Totals from 10814 (8.10% of 133461) affected shaders: (GFX11) Instrs: 21759185 -> 21757190 (-0.01%); split: -0.02%, +0.01% CodeSize: 112320272 -> 112316008 (-0.00%); split: -0.02%, +0.01% SpillSGPRs: 11220 -> 11212 (-0.07%) SpillVGPRs: 911 -> 903 (-0.88%); split: -1.54%, +0.66% Latency: 258334759 -> 258316073 (-0.01%); split: -0.02%, +0.01% InvThroughput: 31428650 -> 31426394 (-0.01%); split: -0.02%, +0.01% VClause: 309119 -> 309090 (-0.01%); split: -0.01%, +0.01% SClause: 657028 -> 657150 (+0.02%); split: -0.03%, +0.04% Copies: 1434209 -> 1432420 (-0.12%); split: -0.28%, +0.15% Branches: 481804 -> 481801 (-0.00%) PreSGPRs: 829995 -> 829966 (-0.00%) PreVGPRs: 758249 -> 758253 (+0.00%) Fixes: `8a78706643` ('nir: refactor nir_opt_move') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24695>	2023-08-22 21:05:18 +00:00
Alyssa Rosenzweig	f9e5534182	nir/lower_gs_intrinsics: Remove end primitive for points EndPrimitive() for points is entirely pointless, so just remove it when lowering EndPrimitive to simplify the IR. This is (maybe) an optimization everywhere, and will be relied on for correctness on Asahi. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24798>	2023-08-22 20:24:40 +00:00
Alyssa Rosenzweig	8c7629524e	nir/print: Print access qualifiers for intrinsics Instead of printing an opaque integer that needs to be manually decoded. Example output: 32x4 %7 = @image_load (%4 (0x0), %6, %5 (0x0), %4 (0x0)) (image_dim=2D, image_array=false, format=r8g8b8a8_snorm, access=readonly\|reorderable, range_base=0, dest_type=float32) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24798>	2023-08-22 20:24:40 +00:00
Caio Oliveira	48b86a877f	compiler/types: Use smaller keys for explicit_matrix_types table Instead of using the name as key, use a shorter struct type. Only build a name string if we are adding a new entry to the table. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23281>	2023-08-22 18:52:15 +00:00
Caio Oliveira	fd1da0f7f5	compiler/types: Extract get_explicit_matrix_instance() function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23281>	2023-08-22 18:52:15 +00:00
Caio Oliveira	b248740e30	compiler/types: Use smaller keys for array_types table Instead of building a string, build a short struct type and use that as key. The only caveat here is to ensure that either there's no internal padding or the internal padding is always the same. Use a static assert to ensure we are in the former case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23281>	2023-08-22 18:52:15 +00:00
Caio Oliveira	d4fcc97a3f	compiler/types: Use ralloc for the key in array_types Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23281>	2023-08-22 18:52:15 +00:00
norablackcat	f744c114d1	rusticl: add cl_khr_expect_assume Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Tested-by: Andrey Alekseenko <al42and@gmail.com> Tested-by: Yifeng Li <tomli@tomli.me> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23115>	2023-08-22 17:28:05 +00:00
norablackcat	25bc3d2824	spirv/nir_to_spirv: add expect assume op codes Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23115>	2023-08-22 17:28:05 +00:00
Friedrich Vock	a28ff7f240	nir/load_store_vectorize: Handle intrinsics with constant base This includes nir_load_stack and nir_store_stack, which are vectorized in nir_lower_shader_calls. If not adjusted, we end up loading from the wrong base. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9596 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9587 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24789>	2023-08-22 13:26:12 +00:00
Georg Lehmann	9cf6984200	nir: unify lower_find_msb with has_{find_msb_rev,uclz} Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24662>	2023-08-22 12:08:37 +00:00
Georg Lehmann	2ac7e6614a	nir: unify lower_bitfield_extract with has_bfe Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24662>	2023-08-22 12:08:37 +00:00
Georg Lehmann	34c3f81614	nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select} Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24662>	2023-08-22 12:08:37 +00:00
Marek Olšák	1ac379c4a0	nir/algebraic: collapse ALU opcodes sourcing NaN Undef will be replaced by NaN whenever it leads to elimination of FP instructions. This implements the elimination part. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24792>	2023-08-19 14:18:52 -04:00
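
The barrier pattern described in commit 7dd897e1cd (shared access, full barrier, shared access, image access) can be illustrated with a small toy model. This is only a sketch under strong assumptions: it treats the shader as a straight-line list of operations, ignores control flow and loops, and every name in it (`mem_mode`, `struct op`, `reduce_barrier_modes`) is hypothetical rather than Mesa's actual `nir_opt_barrier_modes()` implementation. A barrier keeps a mode only if that mode is accessed on both sides of it.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical memory-mode bits; these are NOT Mesa's nir_variable_mode values. */
enum mem_mode {
   MODE_SHARED = 1u << 0,
   MODE_SSBO   = 1u << 1,
   MODE_IMAGE  = 1u << 2,
   MODE_GLOBAL = 1u << 3,
   MODE_ALL    = MODE_SHARED | MODE_SSBO | MODE_IMAGE | MODE_GLOBAL,
};

/* One entry of a straight-line instruction stream (assumption: no control flow). */
struct op {
   int is_barrier;   /* nonzero for a barrier, zero for a memory access */
   unsigned modes;   /* modes synchronized (barrier) or touched (access) */
};

/* Shrink every barrier to the modes that are accessed both before and after it.
 * Returns nonzero if any barrier was changed. */
static int
reduce_barrier_modes(struct op *ops, size_t count)
{
   int progress = 0;

   assert(ops != NULL || count == 0);

   for (size_t i = 0; i < count; i++) {
      if (!ops[i].is_barrier)
         continue;

      /* Collect the modes touched by accesses on each side of the barrier. */
      unsigned before = 0, after = 0;
      for (size_t j = 0; j < i; j++) {
         if (!ops[j].is_barrier)
            before |= ops[j].modes;
      }
      for (size_t j = i + 1; j < count; j++) {
         if (!ops[j].is_barrier)
            after |= ops[j].modes;
      }

      /* A mode accessed on only one side needs no ordering from this barrier. */
      unsigned needed = ops[i].modes & before & after;
      if (needed != ops[i].modes) {
         ops[i].modes = needed;
         progress = 1;
      }
   }

   return progress;
}
```

Fed the four-step sequence from the commit message, the full barrier shrinks to the shared mode alone, because the image access appears only after it. A barrier left with no modes at all could then be removed, although this sketch does not do so.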

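The always-true condition reported in commit 8019a1b929 is easy to check by brute force. The sketch below uses a hypothetical `toy_type` in which only the component count matters (0 stands for a type that is neither scalar nor vector). It is not the real `glsl_type` API, and `fixed_check` is just one way to express what the condition presumably meant.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for a GLSL type: 0 components means "neither scalar
 * nor vector" (for example a struct), 1 is a scalar, more than 1 is a vector. */
struct toy_type {
   unsigned num_components;
};

static bool
is_scalar(const struct toy_type *t)
{
   return t->num_components == 1;
}

static bool
is_vector(const struct toy_type *t)
{
   return t->num_components > 1;
}

/* The condition quoted in the commit message: a type can never be both a
 * scalar and a vector, so at least one negation always holds. */
static bool
buggy_check(const struct toy_type *t)
{
   return !is_vector(t) || !is_scalar(t);
}

/* Presumed intent: true only when the type is neither scalar nor vector. */
static bool
fixed_check(const struct toy_type *t)
{
   return !(is_scalar(t) || is_vector(t));
}
```

Evaluating both functions for component counts 0 through 4 shows `buggy_check` returning true every time, while `fixed_check` is true only for the count of 0.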

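The caveat in commit b248740e30, that a struct used as a hash-table key must not contain padding with unpredictable contents, can be sketched as follows. The field names and layout here are assumptions made for illustration and are not copied from Mesa's `glsl_types` code. The hash is a plain 32-bit FNV-1a over the key bytes, chosen only because it is short.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical key: every field has the same width, so no padding is expected. */
struct array_key {
   uintptr_t element;          /* address of the element type */
   uintptr_t array_size;
   uintptr_t explicit_stride;
};

/* Fail the build if the compiler inserted padding: bytewise hashing and
 * comparison below would otherwise read indeterminate bytes. */
static_assert(sizeof(struct array_key) == 3 * sizeof(uintptr_t),
              "array_key must not contain padding");

static struct array_key
make_array_key(const void *element, unsigned array_size, unsigned explicit_stride)
{
   struct array_key key;
   memset(&key, 0, sizeof(key));
   key.element = (uintptr_t)element;
   key.array_size = array_size;
   key.explicit_stride = explicit_stride;
   return key;
}

/* 32-bit FNV-1a over the raw bytes of the key. */
static uint32_t
array_key_hash(const struct array_key *key)
{
   const unsigned char *bytes = (const unsigned char *)key;
   uint32_t hash = 2166136261u;

   for (size_t i = 0; i < sizeof(*key); i++) {
      hash ^= bytes[i];
      hash *= 16777619u;
   }
   return hash;
}

/* Bytewise equality is valid only because the struct has no padding. */
static int
array_key_equal(const struct array_key *a, const struct array_key *b)
{
   return memcmp(a, b, sizeof(*a)) == 0;
}
```

Two keys built from the same element pointer, size, and stride compare equal and hash identically, which is what lets the struct replace a formatted name string as the table key.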
8514 Commits