AlexIndustrial/mesa

Author	SHA1	Message	Date
Alyssa Rosenzweig	dad599f15e	panfrost: Don't clobber RT0 if RTn is disabled Fixes: `a124c47b9f` ("panfrost: Fix NULL derefs in pan_cmdstream.c") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	5268a8500a	panfrost: Minor cleanup of blend CSO No need to cast. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	3968f03754	panfrost: Support alpha_to_one Gets rid of a bogus assert in the blend CSO create. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	a368cc022d	panfrost: Make comment less confusing Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	c6bb55ffcf	pan/bi: Lower 8-bit fragment input Same reasons/technique as fragment output lowering, just need the NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	3cc6a4c5d0	pan/bi: Handle swizzles in i2i8 Otherwise they get copypropped away. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	e180374ab1	pan/bi: Add single-component 8-bit mkvec lowering So we can implement scalar i2i8. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	ba17342a1f	pan/bi: Handle different sizes of LD_TILE v2: Fix overflow. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Alyssa Rosenzweig	f412801768	pan/bi: Track dual-src blend type Will be needed for fp16 outputs. I am acutely aware dual-src blending is broken on Bifrost right now anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10393>	2021-05-04 20:04:03 +00:00
Yiwei Zhang	e44b4feb33	venus: query extended resource info from gralloc Creating Android swapchain image from gralloc buffer requires to use VkImageDrmFormatModifierExplicitCreateInfoEXT. To fill the struct info, we need to query extended resource info from gralloc. With the queried modifier from gralloc, we can ask the driver for the plane count of the given format and modifier pair. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10553>	2021-05-04 19:52:13 +00:00
Eric Anholt	a2efa2e833	tgsi: Mark the tgsi_exec_channel and tgsi_double_channel ALIGN16. We allocate them all align16, so mark the unions (and their container structs) that way so the compiler can do aligned SSE load/stores. glmark2 -b loop FPS +0.197265% +/- 0.117633% (n=1906) Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10604>	2021-05-04 18:58:51 +00:00
Charlie Turner	1d418e79b8	radv: Add a STONEY baseline for dEQP. See: https://gitlab.freedesktop.org/tanty/mesa-valve-ci/-/jobs/9286188 https://gitlab.freedesktop.org/tanty/mesa-valve-ci/-/jobs/9297109 https://gitlab.freedesktop.org/tanty/mesa-valve-ci/-/jobs/9297110 v2. - Clarify that the dEQP-VK.texture.explicit_lod.2d tests are skipped due to slow APU-based STONEY test devices. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10559>	2021-05-04 16:42:57 +00:00
Iago Toral Quiroga	f099fc3e07	v3d: choose a larger CSD supergroup size if possible Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Iago Toral Quiroga	3ce249e65e	broadcom/common: move CSD supergroup sizing to a common helper We want to use this in GL too. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Iago Toral Quiroga	afc33a7430	v3dv: limit supergroup size in presence of TSY barriers When a TSY barrier is hit, the entire supergroup will be synchronized. If the supergoup is large and uses all available QPU threads it would mean that we would sychronize and stall all running threads until all of them reach the barrier, which may be inefficient. This patch makes it so that if the compute shader has any such barriers we limit the supergroup size so each supergroup only takes half of the QPU threads available at most, so that if one supergroup hits a barrier we have at least one other supergroup we can run, reducing idle QPU time. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Iago Toral Quiroga	f514280524	broadcom/compiler: track if a shader has control barriers in prog_data Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Iago Toral Quiroga	2e0f6e5705	v3dv: choose a larger CSD supergroup size if possible Each supergroup executes a number batches. Each batch has 16 elements (one per QPU lane), except possibly the last batch which might be incomplete. Until now, we packed a single workgroup in each supergroup, which can lead to more incomplete batches and less efficient use of the QPUs depending on the configuration of workgroups being dispatched. This patch computes a number of workgroups per supergroup so that we reduce or completely eliminate incomplete batches if possible. It should be noted however, that TSY barriers act on supergroups, so larger supergroups lead to larger syncpoints on barriers too. A follow-up patch will try to find a good balance for compute shaders that use such barriers. This improves performance of the Sascha Willem's computecloth demo by ~13%. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Iago Toral Quiroga	aebb47b7d1	compiler/nir: add a divergence analysis option for non-uniform workgroup id The V3D hardware allows us to pack multiple workgroups together to avoid wasting execution lanes in shader cores. For example, if we dispatch 16 workgroups with a local size of 1 element, we can pack all 16 workgroups in a single 16-wide dispatch where each lane executes a different workgroup, instead of 16 1-wide dispatches. When we do this, we don't have a uniform workgroup id any more. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Caio Marcelo de Oliveira Filho	caf9fb1a10	intel/compiler: Remove unused exported functions Now that all drivers are using brw_cs_get_dispatch_info() we can remove one function (which is now unused) and reduce the scope of the other. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	313c80c158	i965: Use brw_cs_get_dispatch_info() Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	279acf1031	anv: Use brw_cs_get_dispatch_info() And since right_mask is already provided as part of dispatch_info, just use that instead of storing it. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	59cbd50bfa	iris: Use brw_cs_get_dispatch_info() Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	5cc758558d	intel/compiler: Add common function for CS dispatch info We have this small calculations repeated in each Intel driver, so move them to a single place to be reused. Also includes "right_mask" since is always used in the same context and depends on the dispatch info values. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	7cc846788c	nir: Remove now unnecessary conditions from emit_load/store helpers The mode one was used before `0bc5a829dd` ("nir: Remove shared support from lower_io"). The others were used before `5f7c7c9a7f` ("nir: add src and dest types to all IO loads and stores for mediump"). All conditions now are always true, so drop them. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10533>	2021-05-04 06:33:24 -07:00
Boris Brezillon	693ae0d3e9	panfrost/ci: Run the full deqp-gles3 testsuite We recently added 5 more VIM3s to the lavalab, this should be more than enough to run the full GLES 3.0 testsuite on G52. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10614>	2021-05-04 11:22:15 +00:00
Erik Faye-Lund	a2140e29c5	docs: update gallium doxygen docs Gallium's background as a Tungstend Graphics technology is no longer significant; it's a historical detail. Besides, since Tungsten Graphics were acquired by VMware more than a decade ago, the website no longer exists. While we're at it, replace the docs link with a link to the mesa docs, and point to archive.org copy of the Tungsten Graphics paper. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2770 Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10452>	2021-05-04 08:38:45 +00:00
Gert Wollny	a199697642	nir/opt_algebraic: optimizations for add umax/umin with zero For unsigned comparisons with zero these ops can be eliminated. v2: Add comparison optimizations with -1 (Rhys Perry) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> (v1) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10583>	2021-05-04 09:33:32 +02:00
Erik Faye-Lund	301ceab7ce	lavapipe: consistently use nir macros NIR provides two helper macros to run transformation passes correctly, NIR_PASS() and NIR_PASS_V(). So far we've seemingly been a bit haphazard about when to use them. Let's correct that, and consistently use the NIR helpers here. This helps us in two ways: 1. We now run nir_validate_shader after each pass, ensuring we didn't break the shader 2. We now respect the NIR_PRINT environment variable for all NIR passes, making debugging much less surprising. In addition, we had an OPT()-macro that doesn't seem to provide much help other than to hiding some trivial details. But they make our code different to other users of NIR, which doesn't seem ideal. So let's drop that macro while we're at it. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10585>	2021-05-04 07:18:55 +00:00
Samuel Pitoiset	53fe74bbb1	radv: implement RADV_FORCE_VRS for the LLVM backend Just to make it consistent compared to ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10432>	2021-05-04 08:23:56 +02:00
Marek Olšák	48d2ac4e88	util: fix (re-enable) L3 cache pinning cores_per_L3 was uninitialized, so it was always disabled. Remove the variable and do it differently. Fixes: `11d2db17c5` - util: rework AMD cpu L3 cache affinity code. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10526>	2021-05-04 01:02:07 -04:00
Marek Olšák	9b58e31f2d	util: print CPU caps in release builds too Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10526>	2021-05-04 01:02:07 -04:00
Dave Airlie	897bcc1e6b	i965: drop old brw ff gs code. This isn't needed anymore. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00
Dave Airlie	8d5f36fe14	i965: port fixed function geom shader to use compiler paths This just moves to the common code in the compiler. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00
Dave Airlie	52e426fd8b	intel/compiler: add support for compiling fixed function gs This is ported from i965, but the interface is cleaned up Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00
Dave Airlie	ac33e2b66b	intel: move brw_ff_gs_prog_key/data to compiler. Step one to moving the ff_gs emitter to compiler for sharing, move BRW_MAX_SOL_BINDINGS up so the keys are in same area Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00
Eric Anholt	7c52a79057	ci/freedreno: Add another db820c flake that's appeared in the last few months. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10597>	2021-05-04 01:03:50 +00:00
Eric Anholt	4c85d47df5	ci/freedreno: Fix the recent-a5xx-texture-flakes matches. We've had about 1/day of the texelfetch group in the IRC flake reports since apr 23, and tex-miplevel-selection that I marked before is actually all the subtests it looks like. Also, you can't include the ",Fail" if you want to actually match a test name. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10597>	2021-05-04 01:03:50 +00:00
Ian Romanick	2d5b64818f	gallivm: Remove unused GALLIVM_NAN_RETURN_NAN In the review, Roland says, "I think the unused nan behaviors was there just for completeness, so it can easily go." Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	61624934f6	gallivm: Use GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN for norm clamping Since the second source is always a constant that is known to be a number, this should have the same performance as GALLIVM_NAN_BEHAVIOR_UNDEFINED. A lofty goal is to eventually remove GALLIVM_NAN_BEHAVIOR_UNDEFINED. There's still a lot of (mostly implicit) users, and I don't feel like tackling that right now. :) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	aaeff52bbe	gallivm: Use range analysis to generate better fmin and fmax code If it is known that one of the source must be a number, then the (more efficient) GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN path can be used. v2: s/know to be/known to be/. Noticed by Roland. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	b3f3287eac	gallivm: Fix NaN behavior of min and max Like softpipe in mesa!10419, llvmpipe suffers from improper handling of NaN in nir_op_fmax and nir_op_fmin. nir_op_fsat is already handled correctly. OpenCL strictly requires the "NaN cleansing" behavior, so all of the functionality is in place. Just make the graphics APIs use the OpenCL path. The majority of the possible performance penalty incurred here should be resolved in the next commit. v2: Add updated checksum for bgfx/39-assao.rdc trace. Rendering goes from mostly garbage to looking correct to me. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	8af325d192	tgsi_exec: Use C99 functions for min and max instead of open coding I don't know what I was thinking when I wrote `939bf7a419` ("tgsi_exec: Fix NaN behavior of min and max") and `d1c0f62b42` ("tgsi_exec: Fix NaN behavior of saturate"). I knew that C99 had fmin and fmax... I just forgot to use them. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Jason Ekstrand	05a37e2422	intel/nir: Set lower txs with non-zero LOD There's a recently discovered HW bug affecting hardware at least as far back as Skylake where, if the LOD is out-of-bounds for any SIMD lane, then garbage may be returned in all SIMD lanes. The easy solution is to set lower_txs_lod so that we always have a constant LOD of 0 which we know a priori is always in-bounds. Fortunately, not many shaders actually use textureSize() with LOD. Shader-db results on Ice Lake: total instructions in shared programs: 19948537 -> 19948564 (<.01%) instructions in affected programs: 3859 -> 3886 (0.70%) helped: 0 HURT: 7 One of the shaders is in Civilization: Beyond Earth, and the rest are all in Civilization VI. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10538>	2021-05-04 00:02:43 +00:00
Jason Ekstrand	3f36e027d3	intel/fs: Don't use pixel_z for Gen4-5 source_depth_to_render_target The source_depth_to_render_target flag can get set on old gen4-5 HW in a few cases which are independent of the app writing gl_FragDepth. It should be safe to just use fetch_payload_reg in that case instead of depending in interpolation setup. This fixes a bug with certain very simple shaders where we might end up not including the depth when we should have. While we're here, rework the logic around setting src_depth and add a comment so it's more clear what's going on. Fixes: `6d4070f3dd` "intel/compiler: add support for fragment coordinate..." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10596>	2021-05-03 23:51:51 +00:00
Rob Clark	71cff8171c	freedreno/query/acc: Set needs_flush Somehow this was missed, but when we emit a query start/stop we need have something that will need to be flushed in the batch. Detected due to TC assert, but this had the potential to cause problems in the non-TC case as well. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10599>	2021-05-03 23:27:31 +00:00
Rob Clark	a9c9a9938d	freedreno: Consolidate needs_flush and clearing last_fence Add a helper to both set batch->needs_flush and clear ctx->last_fence so that the two related bits of state do not get out of sync. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10599>	2021-05-03 23:27:31 +00:00
Adam Jackson	ceba7f6952	i915c: Add a symlink for i830_dri.so The gallium driver doesn't support gen2, so let's make it possible to keep both i915g and i830 drivers installed in parallel. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10554>	2021-05-03 23:03:09 +00:00
Adam Jackson	61b7e6578a	include: Remove unused i810_pci_ids.h Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10554>	2021-05-03 23:03:09 +00:00
Antonio Caggiano	1b87a7d5e0	panfrost: Meson dependency Declare a meson dependency for libpanfrost and wrap some key functions within an extern C block allowing proper compilation by C++ compilers. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10462>	2021-05-03 21:08:47 +00:00
Chia-I Wu	390722620e	venus: clean up vn_device_fix_create_info The extension list should be more correct now. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10556>	2021-05-03 20:51:46 +00:00

... 41 42 43 44 45 ...

141245 Commits