AlexIndustrial/mesa

Author	SHA1	Message	Date
Iago Toral Quiroga	e7a4e97076	nir/schedule: use larger delay for non-filtered memory reads This has been pending for a long time. It is not very consistent to add a significant delay for textures and not do it for UBOs, etc The reason we have not been doing this so far is the accumulated effect on register pressure for V3D as shown by shader-db results below, but from the point of view of a generic scheduler it makes sense to do this. Later patches will address V3D specific issues with register pressure derived from this by letting the driver control its instruction delay settings. total instructions in shared programs: 12662138 -> 13126587 (3.67%) instructions in affected programs: 1813091 -> 2277540 (25.62%) helped: 2410 HURT: 10499 total threads in shared programs: 415858 -> 407208 (-2.08%) threads in affected programs: 17348 -> 8698 (-49.86%) helped: 8 HURT: 4333 total uniforms in shared programs: 3711483 -> 3812698 (2.73%) uniforms in affected programs: 128012 -> 229227 (79.07%) helped: 3474 HURT: 2143 total max-temps in shared programs: 2138763 -> 2318430 (8.40%) max-temps in affected programs: 318780 -> 498447 (56.36%) helped: 588 HURT: 11997 total spills in shared programs: 3860 -> 49086 (1171.66%) spills in affected programs: 709 -> 45935 (6378.84%) helped: 23 HURT: 1595 total fills in shared programs: 5573 -> 55810 (901.44%) fills in affected programs: 1067 -> 51304 (4708.25%) helped: 23 HURT: 1595 LOST: 3 GAINED: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	9ef499b315	broadcom/compiler: stop moving UBO loads before NIR scheduling This doesn't have any significant impact shader-db stats and would reduce our capacity to hide latency from the loads, so it is probably undesirable: total instructions in shared programs: 12663189 -> 12663186 (<.01%) instructions in affected programs: 4222 -> 4219 (-0.07%) helped: 9 HURT: 4 total uniforms in shared programs: 3711624 -> 3711629 (<.01%) uniforms in affected programs: 186 -> 191 (2.69%) helped: 0 HURT: 2 total max-temps in shared programs: 2138822 -> 2138857 (<.01%) max-temps in affected programs: 569 -> 604 (6.15%) helped: 1 HURT: 9 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:03 +00:00
Timur Kristóf	64acec0ef9	nir: Fix lowering terminology of compute system values: "from"->"to". This is to match other NIR terminology. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Iago Toral Quiroga	f761f8fd9e	broadcom/compiler: simplify node/temp translation during register allocation Now that we don't sort our nodes we can arrange them so we can easily translate between nodes and temps without a mapping table, just applying an offset. To do this we have a single array of nodes where twe put first the nodes for accumulators and then the nodes for temps. With this setup we can ensure that for any given temp T, its node is always T + ACC_COUNT. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	871b0a7f6a	broadcom/compiler: don't sort nodes for register allocation Nodes are allocated in order to registers so initially sorting was used to ensure that nodes with smaller life ranges would be assigned first and therefore be more likely to get accumulators. However, since `d81a6e5f1d` now we don't rely on order to make decisions about accumulators and instead we make policy decisions based on actual liveness, so sorting is no longer strictly relevant to this decision. Furthermore, we are not re-sorting nodes after each spill either, since that would probably require that we rebuild the interference graph after each spill (the graph identifies nodes by their index). Shader-db results show a significant improvement in instruction counts, due to more optimal accumulator assignments. The reason for this is that we use a round-robin policy for choosing the next accumulator to assign. The idea behind this is preventing nearby temps to be assigned to the same accumulator so that QPU scheduling is more flexible, but if we sort our nodes, we are basically not assigning temps in program order any more and the round-robin policy becomes less effective: total instructions in shared programs: 13000420 -> 12663189 (-2.59%) instructions in affected programs: 11791267 -> 11454036 (-2.86%) helped: 62890 HURT: 19987 total threads in shared programs: 415874 -> 415870 (<.01%) threads in affected programs: 20 -> 16 (-20.00%) helped: 2 HURT: 4 total uniforms in shared programs: 3711652 -> 3711624 (<.01%) uniforms in affected programs: 43430 -> 43402 (-0.06%) helped: 134 HURT: 173 total max-temps in shared programs: 2144876 -> 2138822 (-0.28%) max-temps in affected programs: 123334 -> 117280 (-4.91%) helped: 4112 HURT: 1195 total spills in shared programs: 3870 -> 3860 (-0.26%) spills in affected programs: 1013 -> 1003 (-0.99%) helped: 14 HURT: 12 total fills in shared programs: 5560 -> 5573 (0.23%) fills in affected programs: 1765 -> 1778 (0.74%) helped: 14 HURT: 17 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	4483cd24af	broadcom/compiler: sink uniform loads total instructions in shared programs: 13014428 -> 13000420 (-0.11%) instructions in affected programs: 743624 -> 729616 (-1.88%) helped: 1392 HURT: 611 total threads in shared programs: 415858 -> 415874 (<.01%) threads in affected programs: 16 -> 32 (100.00%) helped: 8 HURT: 0 total uniforms in shared programs: 3720410 -> 3711652 (-0.24%) uniforms in affected programs: 113442 -> 104684 (-7.72%) helped: 635 HURT: 29 total max-temps in shared programs: 2154268 -> 2144876 (-0.44%) max-temps in affected programs: 61279 -> 51887 (-15.33%) helped: 1124 HURT: 187 total spills in shared programs: 4002 -> 3870 (-3.30%) spills in affected programs: 265 -> 133 (-49.81%) helped: 6 HURT: 0 total fills in shared programs: 5788 -> 5560 (-3.94%) fills in affected programs: 603 -> 375 (-37.81%) helped: 6 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	e228642cf5	broadcom/compiler: move constants before their first user For us they are basically uniforms too so we want to make their lifespans short to facilitate allocating them to accumulators. total instructions in shared programs: 13043585 -> 13015385 (-0.22%) instructions in affected programs: 8326040 -> 8297840 (-0.34%) helped: 24939 HURT: 19894 total threads in shared programs: 415860 -> 415858 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 1 total uniforms in shared programs: 3721953 -> 3720451 (-0.04%) uniforms in affected programs: 96134 -> 94632 (-1.56%) helped: 744 HURT: 435 total max-temps in shared programs: 2173431 -> 2154260 (-0.88%) max-temps in affected programs: 264598 -> 245427 (-7.25%) helped: 10858 HURT: 841 total spills in shared programs: 4005 -> 4010 (0.12%) spills in affected programs: 700 -> 705 (0.71%) helped: 5 HURT: 10 total fills in shared programs: 5801 -> 5817 (0.28%) fills in affected programs: 1346 -> 1362 (1.19%) helped: 6 HURT: 11 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	a1998a9f43	broadcom/compiler: disallow TMU spills if max tmu spills is 0 If we are compiling with a strategy that does not allow TMU spills we should not allow spilling anything that is not a uniform. Otherwise the RA cost/benefit algorithm may choose to spill a temp that is not uniform and that will cause us to immediately fail the strategy and fallback to the next one, even if we could've instead chosen to spill more uniforms to compile the program successfully with that strategy. Some relevant shader-db stats: total instructions in shared programs: 13040711 -> 13043585 (0.02%) instructions in affected programs: 234238 -> 237112 (1.23%) helped: 73 HURT: 172 total threads in shared programs: 415664 -> 415860 (0.05%) threads in affected programs: 196 -> 392 (100.00%) helped: 98 HURT: 0 total uniforms in shared programs: 3717266 -> 3721953 (0.13%) uniforms in affected programs: 12831 -> 17518 (36.53%) helped: 6 HURT: 100 total max-temps in shared programs: 2174177 -> 2173431 (-0.03%) max-temps in affected programs: 4597 -> 3851 (-16.23%) helped: 79 HURT: 21 total spills in shared programs: 4010 -> 4005 (-0.12%) spills in affected programs: 55 -> 50 (-9.09%) helped: 5 HURT: 0 total fills in shared programs: 5820 -> 5801 (-0.33%) fills in affected programs: 186 -> 167 (-10.22%) helped: 5 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	cbb4d0dded	broadcom/compiler: increase cost of TMU spills to 10 Our cost was 5 which matches the number of instructions we have to add for a TMU spill (a fill is 4 instructions). Uniform spills on the other hand add an extra instruction for each fill and remove one instruction for the spill itself. These have a cost of 1. Therefore, if we have a single spill+fill, we end up with +9 instructions if it is a TMU spill and +0 instructions with a uniform spill, so making the former only 5 times more costly is probably not a good idea, and this is without even considering the added latency of the TMU accesses. Relevant shader-db changes show this causes as a marginal instruction count increase in a few shaders but better thread counts and lower TMU spilling overall: total instructions in shared programs: 13037315 -> 13040711 (0.03%) instructions in affected programs: 370106 -> 373502 (0.92%) helped: 187 HURT: 321 total threads in shared programs: 415090 -> 415664 (0.14%) threads in affected programs: 574 -> 1148 (100.00%) helped: 287 HURT: 0 total uniforms in shared programs: 3706674 -> 3717266 (0.29%) uniforms in affected programs: 63075 -> 73667 (16.79%) helped: 40 HURT: 395 total max-temps in shared programs: 2176080 -> 2174177 (-0.09%) max-temps in affected programs: 15838 -> 13935 (-12.02%) helped: 316 HURT: 34 total spills in shared programs: 4247 -> 4010 (-5.58%) spills in affected programs: 2599 -> 2362 (-9.12%) helped: 107 HURT: 14 total fills in shared programs: 6121 -> 5820 (-4.92%) fills in affected programs: 3622 -> 3321 (-8.31%) helped: 108 HURT: 13 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Guilherme Gallo	d1c6185b5a	ci: skqp: Add Vulkan support for a630_skqp job This commit adds support for Vulkan backend on a630_skqp job. = Needed changes - Needed to install libvulkan-dev package on system - Refactored the way the available skqp reports are printed tested in development builds with skia tools Piglit expectations had to be updated in various drivers due to !14750 not having bumped the tags when it tried to uprev. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14686>	2022-02-25 05:50:06 +00:00
Iago Toral Quiroga	cf99584f51	broadcom/compiler: move uniforms right before their first use after scheduling On V3D the quality of the code we generate is significantly affected by how we decide to assign accumulators during register allocation, which is determined by liveness, favoring short-lived temps. There are many shaders that end up doing a whole lot of uniform loads first, and using them later, which is very inconvenient for our register allocation process because this increases uniform liveness and causes us to use accumulators less efficientely, leading to significant churn. To fix this, we move uniforms right before their first use in the same block, but we need to do this after NIR scheduling, which means we are doing it in non-SSA form, since the scheduler has a tendency to undo this optimization and it is not easy to modify it to avoid it, since it works in more abstract terms, using instruction dependencies, estimated register pressure and instruction delay information to do its work, which are very different concepts. total instructions in shared programs: 13316738 -> 13033613 (-2.13%) instructions in affected programs: 10389172 -> 10106047 (-2.73%) helped: 55442 HURT: 16144 total threads in shared programs: 413722 -> 415048 (0.32%) threads in affected programs: 1428 -> 2754 (92.86%) helped: 680 HURT: 17 total loops in shared programs: 1716 -> 1690 (-1.52%) loops in affected programs: 26 -> 0 helped: 26 HURT: 0 total uniforms in shared programs: 3704313 -> 3705181 (0.02%) uniforms in affected programs: 687730 -> 688598 (0.13%) helped: 2920 HURT: 7384 total max-temps in shared programs: 2364785 -> 2175190 (-8.02%) max-temps in affected programs: 1215387 -> 1025792 (-15.60%) helped: 49667 HURT: 1556 total spills in shared programs: 4241 -> 4248 (0.17%) spills in affected programs: 642 -> 649 (1.09%) helped: 11 HURT: 19 total fills in shared programs: 6115 -> 6125 (0.16%) fills in affected programs: 1276 -> 1286 (0.78%) helped: 11 HURT: 21 total sfu-stalls in shared programs: 34381 -> 36578 (6.39%) sfu-stalls in affected programs: 16055 -> 18252 (13.68%) helped: 3647 HURT: 5206 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Iago Toral Quiroga	c4a78a2d2a	broadcom/compiler: fix register class patching for postponed spills If we have a postponed spill, the temp we create at ip is no longer the spilled temp and therefore is affected by the thrsw injection. Fixes corruption in the additive blending animation demo from Three.js. Fixes: `f3c3228522` ('broadcom/compiler: do not rebuild the interference graph after each spill') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15112>	2022-02-22 11:17:10 +00:00
Erik Faye-Lund	25a37fabb7	vulkan/wsi: untangle buffer-images from prime Not all Vulkan implementations allows rendering to linear images, so in order to support scanning out from these on Windows we might have to copy through a buffer like we do in the PRIME path. To avoid reimplementing the same, let's instead generalize the code a bit so it doesn't have to specfy any PRIME-specific details. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Iago Toral Quiroga	a4b164b57b	broadcom/compiler: only patch temps that existed before the current spill When we spill we add new temps. We should be careful not to access liveness for these until we have re-computed it after all spills and fill for that the spilled temp have been processed so as to avoid out-of-bounds accesses to the c->temp_start and c->temp_end arrays. This fixes a crash in a Three.js demo when we try to patch register classes after a TMU spill that was caused because we would incorrectly try to patch the same temps we had just added for the spill itself, which is not only unnecessary but also incorrect since we these temps would not have liveness information available yet and thus would cause out of bounds accesses. Fixes: `f3c3228522` ('broadcom/compiler: do not rebuild the interference graph after each spill') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15107>	2022-02-22 06:41:51 +00:00
Jose Maria Casanova Crespo	90f966e05f	v3dv/v3d: Fix copyright holder to Raspberry Pi Ltd Acked-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15057>	2022-02-18 11:50:07 +01:00
Juan A. Suarez Romero	bfdb1064c5	vc4/ci: make piglit test mandatory Make piglit test jobs to run always, as piglit testsuite offers more coverage for the VC4 driver. On the other hand, make the EGL testing manually, as we don't have enough devices to execute all the tests fast enough. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15045>	2022-02-18 09:02:55 +00:00
Iago Toral Quiroga	750eeecf4e	broadcom/compiler: document that spill_base is used for spills and scratch Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	8883975209	broadcom/compiler: drop spill_count and add spilling boolean We added spill_count to handle uniform batch spills, which we no longer do. What we want now is a way to know if we are spilling registers. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	f3c3228522	broadcom/compiler: do not rebuild the interference graph after each spill Instead, we only recompute liveness and we add new nodes and interferences to the graph manually (we also need to patch register classes in some cases). To assist in this process, we also add an ip counter to our instructions that we also recompute after each spill, which we use to identify registers that cross thrsw boundries introduced with TMU spills and fills and adjust their register classes accordingly (removing their capacity to use accumulators). This significantly reduces the CPU cost of spills. Using shaders/closed/gputest/piano/7.shader_test as reference: Compile time up to the first successful compile strategy in main is ~24s and with this change it is ~11s. With this speed up, we can now try all 2-thread compile strategies (including the fallback scheduler) in only ~15s. A full shader-db run results in: Total CPU time (seconds): 9904.67 -> 9087.98 (-8.25%) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	59caaa7fb3	broadcom/compiler: reset spill/fill counts after lowering thread count. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	92d819aaa0	broadcom/compiler: fix end of TMU sequence check We may be pipelining TMU writes and reads, in which case we can see both TMUWT and LDTMU at the end of a TMU sequence, so we should not assume that a TMUWT always terminates a sequence. Also, we had a bug where we were using inst instead of scan_inst to check if we find another TMUWT after the curent instruction. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	40e091267d	broadcom/compiler: define max number of tmu spills for compile strategies Instead of whether they are allowed to spill or not. This is more flexible. Also, while we are not currently enabling spilling on any 4-thread strategies, should we do that in the future, always prefer a 4-thread compile. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	919aedbfec	broadcom/compiler: choose compile strategy with lowest spilling Until now we would only allow spilling as a last resort in the last 2 strategies, however, it is possible that in some cases earlier strategies may produce less spills if we allowed spilling on them. Likewise, the fallback scheduler can sometimes produce less spills than 2 threads with optimizations disabled. With this change, we start allowing all our 2-thread strategies to spill, and instead of choosing the first strategy that is successful, we choose the one that doesn't spill or the one with the least amount of spilling. It should be noted that this may incur in a significant increase of compile times. We will address this in a follow-up patch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Jason Ekstrand	05e9e7767d	vulkan: Rename vk_image_view::format to view_format When I originally added vk_image_view, I was overly clever when it came to the format field. I decided to make it only contain the bits of the format contained in the selected aspects. However, this is confusing (not generally a good thing) and it's also not always what you want. The Vulkan 1.3.204 spec says: "When using an image view of a depth/stencil image to populate a descriptor set (e.g. for sampling in the shader, or for use as an input attachment), the aspectMask must only include one bit, which selects whether the image view is used for depth reads (i.e. using a floating-point sampler or input attachment in the shader) or stencil reads (i.e. using an unsigned integer sampler or input attachment in the shader). When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." So, while the restricted format makes sense for texturing, it doesn't for when the image is being used as an attachment. What we probably actually want is both versions of the format. We'll call the one given by the VkImageViewCreateInfo vk_image_view::format and the restricted one vk_image_view::view_format. This is just the first commit which switches format to view_format so the compiler will make sure we get them all. The next commit will re-add vk_image_view::format but this time unmodified. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15007>	2022-02-16 00:14:50 +00:00
Juan A. Suarez Romero	801d33b2d9	vc4/ci: update failing piglit tests See https://gitlab.freedesktop.org/mesa/mesa/-/issues/6038. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15012>	2022-02-15 09:21:23 +01:00
Jason Ekstrand	29393a40ee	v3dv: Use the common command pool implementation The only interesting information stored in v3dv_cmd_pool is the list of command buffers and that's already tracked by vk_command_pool. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	fcad979b72	v3dv: Don't use vk_alloc/free2 for command buffers The pool will always have a valid allocator. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	bda4c4f6b6	vulkan: Take a vk_command_pool in vk_command_buffer_init() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	6fb9e4e7ff	v3dv: Use vk_command_pool Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Louis-Francis Ratté-Boulianne	5e263cc324	vulkan/runtime: Add a level field to vk_command_buffer Looks like 3 implementations already have that field in their private command_buffer struct, and having it at the vk_command_buffer opens the door for generic (but suboptimal) secondary command buffer support. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Juan A. Suarez Romero	7955df28a6	v3dv/ci: Update failure list Add more failing tests to the expected failures. These are obtained after executing the full Vulkan CTS. v2: - Add comments in the tests (Alejandro) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14948>	2022-02-09 15:46:23 +00:00
Melissa Wen	70a219d4a3	broadcom/simulator: enable multisync in the simulator Use drmSyncobjSignal to signal out_syncobjs when a GPU job submission ends in the simulator. With this, we can enable multisync support in the simulator and keep the multisync approach to process fence by submitting a serialized no-op job that adds the fence to the array of out syncobjs, i.e. syncobjs to be signaled in the kernel when a job completes (job post deps). Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14768>	2022-02-09 07:22:42 +00:00
Emma Anholt	ef112db311	ci: Bump VK-GL-CTS to 1.3.1.0. The main thing is VK 1.3 testing, but also includes test bugfixes. The 1.3 CTS required an uprev of deqp-runner to handle a new style of test output, and that deqp-runner brings in some neat new features, too (piglit in your deqp-runner suite, and extension list checking). A bunch of VK tests got renamed, so I replaced panvk's custom test list with simple include filters on the main test list. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:36 +00:00
Emma Anholt	3f34251495	ci/broadcom: Remove unused v3dv xfails file. It's actually in broadcom-rpi4-fails.txt. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:35 +00:00
Dylan Baker	2f916f2be6	meson: add support for `meson devenv` with vulkan Meson devenv is a feature added in meson 0.58 (thus the features is version guarded) that allows creating a shell environment with environment variables automatically setup for running the project inside the build dir. Some variables (such as LD_LIBRARY_PATH and PATH) are set automatically, others must be added by the project. For vulkan is is relativley simple, we create a new, uninstalled, icd file for each driver and set the VK_ICD_FILENAMES variable appropriately. This can be used with: ```sh meson devenv -C $builddir ``` then, vulkan applications will automatically use the uninstall vulkan driver, no need to install. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14826>	2022-02-04 09:08:47 -08:00
Emma Anholt	a177f0de8f	ci: Uprev vulkan-cts to 1.2.8.0 This brings in some interesting new vulkan tests and fixes for the spurious KHR-GL TF failures. Also, reduces the runtime of dEQP-GLES31.functional.ssbo.layout.random.all_shared_buffer.36 so that it should stop timing out. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Alejandro Piñeiro	5d8c659678	v3d/drm-shim: remove drm-shim driver After starting to use a new version of the simulator, it got outdated. We made some initial effort to update it, but it was not working. Taking into account that no one is using it, it is better to just remove it. We keep the noop drm drivers, as they could have some value for developers that doesn't have access to the v3dv3 simulator. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14682>	2022-02-03 09:53:29 +00:00
Iago Toral Quiroga	7561ea8fa1	broadcom/compiler: allow ldunifa with read-only SSBOs Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14830>	2022-02-03 07:35:07 +00:00
Iago Toral Quiroga	0a8449b07c	broadcom/compiler: fix offset alignment for ldunifa when skipping The intention was to align the address to 4 bytes (32-bit), not 16 bytes. Fixes: `bdb6201ea1` ("broadcom/compiler: use ldunifa with unaligned constant offset") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14830>	2022-02-03 07:35:07 +00:00
Iago Toral Quiroga	ce99b1a746	v3dv: don't submit noop job if there is nothing to wait on or signal Also, do not unconditionally flag signaling for submits without any command buffers. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14802>	2022-02-01 07:28:46 +00:00
Melissa Wen	db9098f2ef	v3dv: move sems_info from event_wait job to wait_thread info Semaphores info was stored as an info of event_wait cpu jobs and this leads to mem leak when the same event_wait job in the same cmd buffer batch was submitted more than once. As a result, `dEQP-VK.api.command_buffers.record_simul_use_primary` fails due to a double-free of sems_info. In this patch, we no longer use v3dv_event_wait_cpu_job_info to store semaphores from a submit info, since semaphores is related to a queue submission and not to the event_wait job type. If we spawn a wait_thread, we copy semaphores to an auxiliary struct (v3dv_wait_thread_info) that will be used in wait_thread to get job and semaphores information. When the spawned thread finishes, it releases the related v3dv_wait_thread_info and the semaphores copy as well. Fixes: `d5bd18fb` ("v3dv: store wait semaphores in event_wait_cpu_job_info") Suggested-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14736>	2022-01-31 23:01:54 +00:00
Thomas H.P. Andersen	430b1157a1	broadcom: drop unused functions Fixes a clang warning about unused static inlined functions Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Iago Toral Quiroga	5974949c0d	v3dv: expose VK_KHR_depth_stencil_resolve Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Iago Toral Quiroga	668653f830	v3dv: fallback to blit resolve if render area is not aligned to tile boundaries Just as with all other TLB operations, we can only use the TLB if the render area is aligned to tile boundaries. If it is not, then the operation would overwrite pixels outside the render area, which is not allowed. In this case, we can't even emit a previous TLB load to fix this because the TLB has the multisampled attachment, not the resolve attachment, which is just a destination buffer for the tile store. Because the condition for tile alignment has to be determined for each subpass, we handle this by storing this information in the attachment state of the command buffer with the start of each subpass. We store whether the attachment is to be resolved and whether it can use the TLB (considering tile alignment restrictions). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Iago Toral Quiroga	7f87a1256e	v3dv: support resolving depth/stencil attachments This implements the bulk of VK_KHR_depth_stencil_resolve Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Vinson Lee	a97ec3eb13	v3dv: Add missing unlocks on errors. Fix defects reported by Coverity Scan. Missing unlock (LOCK) missing_unlock: Returning without unlocking. Fixes: `a7052dcf2c` ("v3dv: enable multiple semaphores for csd job") Fixes: `ad09e50129` ("v3dv: enable multiple semaphores for tfu job") Fixes: `ff8586c345` ("v3dv: enable multiple semaphores on cl submission") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14663>	2022-01-28 04:15:24 +00:00
Iago Toral Quiroga	764c8867b0	v3dv: document why we don't expose VK_EXT_scalar_block_layout And since this is an optional feature in Vulkan 1.2, fill in the corresponding feature query while we are at it. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14731>	2022-01-27 07:34:19 +00:00
Iago Toral Quiroga	06220a28e7	v3dv: rework Vulkan 1.2 feature queries Fill them into a VkPhysicalDeviceVulkan12Features struct like we do for Vulkan 1.1, and then read them from there. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14731>	2022-01-27 07:34:19 +00:00
Iago Toral Quiroga	692e0dfe27	v3dv: implement VK_KHR_imageless_framebuffer Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14704>	2022-01-27 07:11:20 +00:00
Iago Toral Quiroga	2ee9487ad7	v3dv: drop signature of undefined function This is a left over from when we added multi-version support in the driver, where we turned this helper into a versioned scheme. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14704>	2022-01-27 07:11:20 +00:00

1 2 3 4 5 ...

1910 Commits