AlexIndustrial/mesa

Author	SHA1	Message	Date
antonino	15b3d77b40	nir: only handle flat interpolation when needed in `nir_create_passthrough_gs` When turning primitives into line strips this function needs to move attributes around, but this is not needed in other cases. Fixes: `1a5bdca2dd` ("zink: implement flat shading using inlined uniforms") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22162>	2023-03-31 11:03:48 +00:00
Ian Romanick	71e5530c07	nir/algebraic: Undistribute fsat from fmax To be helpful, the thing inside the fsat has to be used with and without the fsat. Otherwise it just moves a saturate destination modifier around. To not be harmful, the fsat has to only be used by the bcsel. All Broadwell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20174475 -> 20174449 (<.01%) instructions in affected programs: 3913 -> 3887 (-0.66%) helped: 13 / HURT: 0 total cycles in shared programs: 866844832 -> 866844719 (<.01%) cycles in affected programs: 46037 -> 45924 (-0.25%) helped: 10 / HURT: 1 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 161491468 -> 161491372 (-0.0%) helped: 31 / HURT: 8 Cycles in all programs: 10933090736 -> 10933024716 (-0.0%) helped: 32 / HURT: 18 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22169>	2023-03-29 23:48:19 +00:00
antonino	2bd72a4101	nir: keep xfb properties in nir_create_passthrough_gs Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	0b65514775	nir/zink: handle provoking vertex mode in `nir_create_passthrough_gs` Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	1a5bdca2dd	zink: implement flat shading using inlined uniforms Zink will now handle flat interpolation correctly when line loops are generated from primitives. The flat shading information is passed to the emulation gs using constant uniforms which get inlined. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	3b5fb8b060	nir: allow to force line strip out in nir_create_passthrough_gs `nir_create_passthrough_gs` now allows the user to force the generated GS to always output a line strip from the primitive regardless of whether edgeflags are present. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	24535ffb3d	nir: handle edge flags in nir_create_passthrough_gs `nir_create_passthrough_gs` will now take a boolean argument to decide whether it needs to handle edgeflags. When true is passed it will output a line strip where edges that shouldn't be visible are not emitted. This is useful because geometry shaders will generally throw away edgeflags so for a passthrough GS to act transparently it needs to emulate them. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	a0751e8088	nir: calculate number of vertices in nir_create_passthrough_gs `nir_create_passthrough_gs` has been changed to take the type of primitive as opposed to the number of vertices as an argument. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	edecb66b01	nir: avoid generating conflicting output variables Because not all vertex outputs can have corresponding fragment inputs (eg. edgeflags) some logic is needed to correctly generate variables in a passthrough gs. Before this change some output variables ended up with the same location. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:39 +00:00
antonino	ea14579f3d	nir: handle primitives with adjacency `nir_create_passthrough_gs` can now handle primitives with adjacency where some vertices need to be skipped. Fixes: `d0342e28b3` ("nir: Add helper to create passthrough GS shader") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:39 +00:00
Sil Vilerino	0d0221a574	nir: Fix use of alloca() without #include c99_alloca.h Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22150>	2023-03-29 16:56:42 +00:00
Emma Anholt	d3bbbc4c6c	glsl: Drop dead prototype. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21940>	2023-03-29 16:06:03 +00:00
Emma Anholt	d2a3fa7569	glsl: Remove the TessLevel lowering special case from xfb. The NIR vectorized tess level pass applies later, and it leaves the name as-is, so we don't need to mess around with gl_TessLevelInnerMesa/OuterMesa. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21940>	2023-03-29 16:06:03 +00:00
Emma Anholt	84006587d7	glsl: Delete the lower_tess_level pass. NIR i/o lowering and sysval lowering can handle the compact var fine at this point. Affects: nouveau, virgl, svga, radeonsi, r600, llvmpipe. Does not affect PIPE_CAP_NIR_COMPACT_ARRAYS drivers like crocus, iris, d3d12, freedreno, zink. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21940>	2023-03-29 16:06:03 +00:00
Emma Anholt	ceef2b9982	nir/lower_sysvals: Add support for un-lowered tess_level_inner/outer. GLSL has been responsible for doing this, but we can just extract the array index here. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21940>	2023-03-29 16:06:03 +00:00
Timur Kristóf	b688a6d227	nir: Remove IB address and stride intrinsics. RADV used these to emulate firstTask for NV_mesh_shader. They are no longer needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22139>	2023-03-29 15:08:55 +00:00
Qiang Yu	bf9c1699cd	nir: add nir_fisnan helper function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Qiang Yu	c9d60547ef	nir,radeonsi: add and implement nir_load_alpha_reference_amd Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Qiang Yu	6848e05f9c	nir: pack_(s\|u)norm_2x16 support float16 as input For AMD GPU which has instruction to normalize and pack two float16 inputs, and used when fragment shader export color output. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Faith Ekstrand	cf1da3ef40	spirv: Drop a bunch of Authors tags This is what git blame is for Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Faith Ekstrand	01275a1a95	nir: Drop a bunch of Authors tags This is what git blame is for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Danylo Piliaiev	330b64d1d1	spirv: sort spirv_supported_capabilities Makes it easier for a C++ driver to keep the initializer in order. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21931>	2023-03-24 15:49:25 +00:00
Konstantin Seurer	200e551cbb	nir/lower_shader_calls: Remat derefs before lowering resumes Closes: #7923 cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20399>	2023-03-24 14:55:37 +00:00
Alyssa Rosenzweig	47ed0b41be	nir: Add Mali load_output taking conversion Mali's LD_TILE instruction (mapping to NIR's load_output) requires a "conversion descriptor" specifying how to convert from the register format to the tilebuffer format. To implement framebuffer fetch on OpenGL without shader variants, we generate these descriptors in the driver and pass them in a uniform. However, to comply with the Ekstrand Rule, we can't have magically materialized system values -- they should come only from the NIR where the driver can lower as it pleases (e.g. PanVK can lower to a constant because it knows the framebuffer format at pipeline create time). Add intrinsics to model this. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	60bfc4deb9	nir: Add Panfrost intrinsics to lower sample mask We want to lower this in NIR instead of the backend IR to give the driver a chance to lower the "is multisampled?" system value, which makes more sense to do in NIR. This gets rid of one of the magic compiler materialized sysvals. Plus, this will let us constant fold away the lowering in Vulkan when we know that the pipeline is single-sampled / multi-sampled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Amber	8da3494d53	freedreno, nir, ir3: implement GL_EXT_shader_framebuffer_fetch Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Amber	ca92183845	nir: Add memory coherency information to shaders. Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Amber	1462da2a70	nir: allow nir_lower_fb_read to support multiple render targets Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21260>	2023-03-23 16:59:56 +00:00
Emma Anholt	772cacff32	glsl: Write a new test for GLSL and NIR mediump lowering. The mediump lowering tests are important for poking at the lowering pass behavior, since you can't really assert the behavior in any given driver, given that the GLSL spec allows any mediump op to be done in highp. But, in hacking on mediump lowering, I wanted several things that the old test couldn't do: - Be able to assert about the actual NIR code we expect to generate for a hypothetical driver (important if other compiler stages might do invalid transformations like eliminating highp temps, or if we were to move the lowering after GLSL IR) - Run faster (gtest unit tests rather than python forking off the standalone glsl compiler per testcase). - Express expectations with a lot less escaping of typical syntax. - High-quality logs for displaying failures. This new test does all of that, I think, though I haven't converted all of the unit tests over yet. In converting, I dropped some of the combinatorial explosion for float/int variations, instead only doing so when it gets at some different code path (default precision flags). I've also included some new tests I wrote in the process of writing my proposed gl_nir mediump lowering. Even if the conversion isn't complete, getting these tests to run faster is probably a good idea on its own, for anyone iterating running Mesa's unit tests (80 tests in 25ms, compared to 109 tests in 1.5s!). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21886>	2023-03-22 22:52:45 +00:00
Emma Anholt	41f51fe815	glsl/standalone: Make all standalone contexts have NewProgram set. It was in the standalone compiler but not unit tests. Only the standalone compiler had done linking and needed it, so far. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21886>	2023-03-22 22:52:45 +00:00
Emma Anholt	9b5326bdc1	glsl/standalone: Pull out a helper function for adding GLSL source shaders. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21886>	2023-03-22 22:52:45 +00:00
Emma Anholt	1c47609888	glsl/standalone: Pull program create/destroy out to a public function. For reuse with unit tests. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21886>	2023-03-22 22:52:45 +00:00
Rhys Perry	e99ba0b6d3	nir/range_analysis: use perform_analysis() in nir_analyze_range() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	2b03db39b3	nir/range_analysis: use perform_analysis() in nir_unsigned_upper_bound() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	29a38b09cf	nir/range_analysis: add helpers for limiting stack usage Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Rhys Perry	2145cf3dd1	nir/range_analysis: add missing masking of shift amounts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `72ac3f6026` ("nir: add nir_unsigned_upper_bound and nir_addition_might_overflow") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21381>	2023-03-22 09:24:18 +00:00
Alyssa Rosenzweig	2933af7576	nir/builder: Add nir_umod_imm helper Like nir_udiv_imm, we can do a similar power-of-two trick. It's also really convenient. v2: Assert reasonable bounds on the modulus (Faith). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> [v1] Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> [v1] Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22010>	2023-03-22 06:18:18 +00:00
Georg Lehmann	cec04adcee	nir: optimize i2f(f2i(fsign)) Foz-DB Navi10: Totals from 3013 (2.23% of 134906) affected shaders: VGPRs: 138068 -> 136964 (-0.80%); split: -0.80%, +0.00% CodeSize: 10476416 -> 10391800 (-0.81%) MaxWaves: 79118 -> 80088 (+1.23%) Instrs: 1963227 -> 1945003 (-0.93%) Latency: 24734883 -> 24649279 (-0.35%); split: -0.39%, +0.05% InvThroughput: 6366777 -> 6334735 (-0.50%); split: -0.50%, +0.00% VClause: 36845 -> 36882 (+0.10%); split: -0.26%, +0.36% SClause: 59249 -> 59273 (+0.04%); split: -0.25%, +0.29% Copies: 108570 -> 108501 (-0.06%); split: -0.19%, +0.13% PreSGPRs: 105371 -> 105862 (+0.47%) PreVGPRs: 117675 -> 116625 (-0.89%); split: -0.89%, +0.00% Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22003>	2023-03-22 05:34:55 +00:00
Samuel Pitoiset	bb7e0c4280	spirv,nir: add support for SpvBuiltInFullyCoveredEXT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21497>	2023-03-21 08:44:09 +00:00
Samuel Pitoiset	cf2bc83c60	spirv: add SpvCapabilityFragmentFullyCoveredEXT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21497>	2023-03-21 08:44:09 +00:00
Emma Anholt	5873dcb32f	nir/lower_mediump: Fix assertion about copy_deref lowering matching. Copy and paste typo. We shouldn't have copy_derefs during this pass, anyway, but caught a failure with my upcoming unit testing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	1fff562929	glsl/lower_precision: Add actual spec quotes for "check_parameters" Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	4a51944639	glsl: Fix the precision of atomic counter builtin function args. More special-casing dropped from GLSL lower_precision. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	b251f94e15	glsl/lower_precision: Drop most special-casing of builtin arg precision. bitCount is still special in that our lowering would try to demote its arg based on the precision of its output, and it shouldn't do that. But the other special cases now have appropriate qualifiers on them at the IR level. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	18e096769c	glsl: Set the precision of function return value temporaries. The signature should dictate the precision of the temp we store into. This ends up ignored by lower_precision for now, which always rewrites it so as to handle custom lowering of builtin precision. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	b1d228e9d5	glsl: Handle highp promotion of builtin function args in the builtins. It's what the spec says to do. This may help us avoid special-casing these functions if we ever lower precision after builtin inlining. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	be2731f445	glsl: Set the precisions of builtin function arguments and returns. These have precision qualifiers defined in the spec, in which case we should emit them while generating builtin signatures and code. We've been special-casing them in GLSL lower_precision, but now we can just rely on the precision qualifier of the builtin if non-NONE. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	2e85c9a422	glsl/lower_precision: Add a cut-down testcase for #8124 This pattern is the core of the webgl conformance failure, I think. And, I think actually lower_precision was doing the right thing, just the conformance test going through ANGLE was screwing up. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	41be2caa6d	glsl/lower_precision: Add a unit test that I thought we might fail at. If you lowered precision too late, it would be easy to break this. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00
Emma Anholt	9a2d66f5a5	glsl: Simplify vector constructors from scalars. No need to generate a temp in this case. Cleanup I noticed while looking at lower_precision behavior (and I've included a testcase to sanity check that things work out). This causes a tiny amount of scheduling change on freedreno: total instructions in shared programs: 11010012 -> 11010012 (0.00%) instructions in affected programs: 147 -> 147 (0.00%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21666>	2023-03-21 00:51:24 +00:00

7825 Commits