AlexIndustrial/mesa

Author	SHA1	Message	Date
Jason Ekstrand	8d7cbe026e	anv: Drop GetPhysicalDeviceQueueFamilyProperties Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 10:10:37 -05:00
Jason Ekstrand	cdaa3a899c	anv: Use layerCount for clears and transitions in BeginRendering The Vulkan spec was recently clerified to say that transitions only happen to the bound layers: "Automatic layout transitions apply to the entire image subresource attached to the framebuffer. If multiview is not enabled and the attachment is a view of a 1D or 2D image, the automatic layout transitions apply to the number of layers specified by VkFramebufferCreateInfo::layers. If multiview is enabled and the attachment is a view of a 1D or 2D image, the automatic layout transitions apply to the layers corresponding to views which are used by some subpass in the render pass, even if that subpass does not reference the given attachment." This is in the context of render passes but it applies to dynamic rendering because the implicit layout transition stuff is a Mesa pseudo- extension and inherits those rules. For clears, the Vulkan spec says: "renderArea is the render area that is affected by the render pass instance. The effects of attachment load, store and multisample resolve operations are restricted to the pixels whose x and y coordinates fall within the render area on all attachments. The render area extends to all layers of framebuffer." Again, this is in the context of render passes but the same principals apply to dynamic rendering where the layerCount and renderArea are specified as part of the vkCmdBeginRendering() call. Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15441>	2022-03-18 09:27:15 -05:00
Lionel Landwerlin	8b71118aa0	anv: flush tile cache with query copy command This fixes the test_resolve_non_issued_query_data vkd3d-proton test. This change is required on TGL+ (maybe ICL?) because on all platforms 3D pipeline writes are not coherent with CS. On previous platform we fixed this by flushing the render cache to make sure data is visble to CS before it writes to memory. But on more recently platforms, flushing the render cache leaves the data in the tile cache which is still not coherent with CS, so we need to flush that one too. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14552>	2022-03-18 10:02:33 +00:00
Lionel Landwerlin	4e30da7874	anv: emit timestamp & availability using the same part of CS We've run into issues before where PIPE_CONTROL races MI_STORE_* commands. So make sure we emit the availability using the same type of CS so that memory writes are properly ordered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14552>	2022-03-18 10:02:33 +00:00
Ian Romanick	19330eeb1d	intel/fs: Force destination types on DP4A instructions Most of the time, this doesn't matter. On the versions with _sat, if the destination type is incorrect, the clamping will not happen correctly. Fixes the following CTS tests: dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_uu_v4i8_out32 v2: Update anv-tgl-fails.txt. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Fixes: `0f809dbf40` ("intel/compiler: Basic support for DP4A instruction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15417>	2022-03-17 22:39:04 +00:00
Felix DeGrood	3bd9b25060	intel: change INTEL_MEASURE output to microseconds Change time event durations from ns -> us. Microseconds are easier to work with. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Felix DeGrood	2e6d14cc7b	intel: increase INTEL_MEASURE batch/buffer sizes Increase default batch_size and buffer_size from 16 -> 64. These are sized to be big enough to service most games. As games have become more demanding, larger sizes become necessary. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Felix DeGrood	e0c9032db8	anv: add indirect draw to INTEL_MEASURE Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Lionel Landwerlin	d68b9f0e6b	anv: zero-out anv_batch_bo anv_batch_bo has a length field that we use to flush cachelines. Not having that field initialized properly leads us to access out of bound memory. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15425>	2022-03-17 15:56:14 +00:00
Lionel Landwerlin	78acae3865	anv: fix variable shadowing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `83fee30e85` ("anv: allow multiple command buffers in anv_queue_submit") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15425>	2022-03-17 15:56:14 +00:00
Sagar Ghuge	2e336c602d	intel/fs: Add Wa_14014435656 For any fence greater than local scope, always set flush type to at least invalidate so that fence goes on properly. v2: Fixup condition to trigger workaround (Lionel) v3: Simplify workaround (Curro) v4: Don't drop the existing WA (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Sagar Ghuge	6031ad4bf6	intel/fs: Add Wa_22013689345 v2: Use a simpler framework (Lionel) v3: Rebase, add task/mesh (Lionel) v4: Fixup fence exec size (SIMDX -> SIMD1) v5: Fix invalidate_analysis, add finishme comment (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Anuj Phogat	5cc4075f95	anv, iris: Add Wa_16011411144 for DG2 v2: Use CS_STALL instead of FLUSH_ENABLE in Iris (Lionel) Add missing CS_STALL after SO_BUFFER change in Anv (Lionel) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Jason Ekstrand	893fa30afe	anv: Include scissors in viewport calculations It's tricky to always get the render area to the viewport code. In particular, it's not provided to secondary command buffers as part of the inheritance info so we have to bend over backwards and look for a framebuffer. With VK_KHR_dynamic_rendering, there is no framebuffer and this approach won't work and we'll need something better if we want competent guardbands in secondary command buffers. The good news is that any client that's sloppily rendering and trusting the clipper to keep things inside the render area will set a scissor and that's something they have to set inside the secondary. We can dig through the scissor state and also include the corresponding scissor (if any) and use that for our render area. This should give us the same secondary command buffer performance with VK_KHR_dynamic_rendering. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	b4e38e174f	anv: Move viewport/scissor emit to genX_cmd_buffer.c There's never been a particularly good reason to stick these in gfx7/8. We mostly did it to deduplicate the binary a bit but this shouldn't emit all that much code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	2c04373c45	anv: Calculate the real guardband based on render area Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	12d815bcac	intel/guardband: Take min/max instead of total size Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	3501a3f9ed	anv: Convert to 100% dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:36 -05:00
Jason Ekstrand	8112e6d601	anv: Drop pipeline pass/subpass in favor of rendering_info This is about the only "small" change we can make in the process of converting from render-pass-based to dynamic-rendering-based. Make everything in pipeline creation work in terms of dynamic rendering and create the dynamic rendering structs from the render pass as-needed. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:16 -05:00
Jason Ekstrand	ee9c068043	anv/pipeline: Stop pretending we're the validator This was ill-conceived at best. Yes, it checks for a few error conditions but it doesn't check much and what checks it has are very far away from the code that relies on those invariants. If we care about these invariants, we should add asserts near the code that makes those assumptions rather than pretending to be the validation layers. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:16 -05:00
Jason Ekstrand	2da152b5e6	anv: Stop treating color input attachments specially Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	1ad0f1b004	anv/pass: Make unused color attachments VK_ATTACHMENT_UNUSED Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	9bbecbed7a	anv: Better null surface state size for dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	fff3f8bfe5	anv: Fix handling of null depth/stencil attachments with dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	83101429bf	anv: Convert to vk_framebuffer Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Vadym Shovkoplias	550f48a826	anv: implement EXT_depth_clip_control A new extension allowing the application to use the OpenGL depth range in NDC, i.e. with depth in range [-1, 1], as opposed to Vulkan’s default of [0, 1]. v2: - call gfx8_cmd_buffer_emit_viewport on ANV_CMD_DIRTY_PIPELINE (Jason) - remove redundant !! operator since negativeOneToOne must be true or false (Tapani) - coding style changes (Lionel) Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15304>	2022-03-16 11:22:24 +00:00
Lionel Landwerlin	a54f5e8e00	anv: silence compiler warnings Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6146 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15387>	2022-03-16 01:02:05 +00:00
Jason Ekstrand	0b4a80b4c4	anv: Use vk_shader_module_to_nir() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Ernst Sjöstrand	e5f3689cff	intel/compiler: Fix non-trivial designated initializer Not supported by GCC 7. src/compiler/nir/nir_builder_opcodes.h:14156:118: sorry, unimplemented: non-trivial designated initializers not supported src/intel/compiler/brw_mesh.cpp:515:7: note: in expansion of macro ‘nir_store_per_primitive_output’ Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Signed-off-by: Ernst Sjöstrand <ernstp@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15360>	2022-03-14 09:56:04 +00:00
Dave Airlie	7edda218fd	intel: add some missing debug recompile info. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15315>	2022-03-12 08:12:41 +00:00
Jason Ekstrand	d65dbe8018	anv: Allow MSAA resolve with different numbers of planes The Vulkan spec for VK_KHR_depth_stencil_resolve allows a format mismatch between the primary attachment and the resolve attachment within certain limits. In particular, VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03181 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED and VkFormat of pDepthStencilResolveAttachment has a depth component, then the VkFormat of pDepthStencilAttachment must have a depth component with the same number of bits and numerical type VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03182 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED, and VkFormat of pDepthStencilResolveAttachment has a stencil component, then the VkFormat of pDepthStencilAttachment must have a stencil component with the same number of bits and numerical type So you can resolve from a depth/stencil format to a depth-only or stencil-only format so long as the number of bits matches. Unfortunately, this has never been tested because the CTS tests which purport to test this are broken and actually test with a destination combined depth/stencil format. Fixes: `5e4f9ea363` ("anv: Implement VK_KHR_depth_stencil_resolve") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15333>	2022-03-11 22:25:42 +00:00
Lionel Landwerlin	6cea8a43fa	anv: silence compiler warning Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	90000aea9b	anv: make a couple of descriptor function private Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	e12698724e	anv: rename host only descriptor internal flag We add an assert to verify that those are not bound. v2: Drop != 0 (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	87f59b18cf	anv: don't lazy allocate surface states in descriptor sets In `4001d9ce1a` we started lazily allocating surface states in the descriptor sets rather than upfront in the descriptor pool. This was to workaround vkd3d-proton allocating more than we could handle at the HW level. The issue introduced in that change is that we didn't protect the descriptor pool free list as well as the anv_state_stream which are now potentially used from different threads through the descriptor set write functions. This reverts the lazy allocation part of that change. Host only descriptor sets changes remain. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4001d9ce1a` ("anv: Handle VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE for descriptor sets") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	71cd6a7b84	anv: fix acceleration structure descriptor copies We're not supposed to have a VkWriteDescriptorSetAccelerationStructureKHR when doing a copy. We should instead get the acceleration structure object from the source descriptor. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Mike Blumenkrantz	5ab0e3f0bb	anv: fix some dynamic rasterization discard cases in pipeline construction cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	1e3e7b3a4d	anv: fix CmdSetColorWriteEnableEXT for maximum rts Fixes: `b15bfe92f7` ("anv: implement VK_EXT_color_write_enable") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	52f6978484	anv: fix xfb usage with rasterizer discard in the initial implementation, a stream like: * CmdBeginTransformFeedbackEXT * CmdSetRasterizerDiscardEnableEXT * CmdDraw * CmdEndTransformFeedbackEXT * CmdBeginTransformFeedbackEXT * CmdDraw * CmdEndTransformFeedbackEXT would never enable transform feedback, as it only checked for the change in rasterizer_discard state Fixes: `4d531c67df` ("anv: support rasterizer discard dynamic state") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15269>	2022-03-11 03:37:17 +00:00
Marcin Ślusarz	823cffbe1c	anv: include Primitive Header in mesh shader per-primitive output Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	f410c1142f	anv: set number of viewports in clip state (mesh) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	81df66bfff	intel/compiler: mark some variables as per-primitive in FS if they come from MS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	8c16ce53a9	intel/compiler: handle ViewportIndex, PrimitiveID and Layer in MUE setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	bc4f8c073a	intel/compiler: inject MUE initialization Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	333a490e32	intel/compiler: shift mesh urb read/write window when offset is too large Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Kenneth Graunke	e3a0e97300	intel: Limit Wa_1607854226 to Gfx12.0 only This workaround is needed on all Gfx12.0 parts, but doesn't appear to be necessary on XeHP. The other drivers do not appear to be applying this workaround on those parts. As further evidence, we accidentally added the 3DSTATE_BINDING_TABLE_POOL_ALLOC commands after switching back to GPGPU mode, which would be an incorrect way to implement the workaround, and things seem to be working. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Jason Ekstrand	a83c91a261	blorp: Add a binding_table_offset_to_pointer helper On Gen11+, we have a feature that requires us to shift binding table offsets by 3. This adds a helper which gives the driver a hook to do this if it so chooses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Matt Turner	8860ff3310	intel/perf: Destination array calculation into function Cuts 119 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 917511 0 0 917511 e0007 meson-generated_.._intel_perf_metrics.c.o (before) 796986 0 0 796986 c293a meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14130948 365708 210004 14706660 e067e4 iris_dri.so (before) 14009332 365708 210004 14585044 de8cd4 iris_dri.so (after) text data bss dec hex filename 8124225 214264 22820 8361309 7f955d libvulkan_intel.so (before) 8002609 214264 22820 8239693 7dba4d libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	d80d3c6760	intel/perf: Fix mistake in description string Along with fixing the grammar, this allows it to be deduplicated since the properly worded description exists in later generations' XMLs. Cuts 96 B from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 917613 0 0 917613 e006d meson-generated_.._intel_perf_metrics.c.o (before) 917511 0 0 917511 e0007 meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14131044 365708 210004 14706756 e06844 iris_dri.so (before) 14130948 365708 210004 14706660 e067e4 iris_dri.so (after) text data bss dec hex filename 8124321 214264 22820 8361405 7f95bd libvulkan_intel.so (before) 8124225 214264 22820 8361309 7f955d libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	7024b8e0eb	intel/perf: Mark intel_perf_counter_* enums as PACKED Reduces their sizes from 4 bytes to 1. Cuts 6 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 924401 0 0 924401 e1af1 meson-generated_.._intel_perf_metrics.c.o (before) 917613 0 0 917613 e006d meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14137732 365708 210004 14713444 e08264 iris_dri.so (before) 14131044 365708 210004 14706756 e06844 iris_dri.so (after) text data bss dec hex filename 8131009 214264 22820 8368093 7fafdd libvulkan_intel.so (before) 8124321 214264 22820 8361405 7f95bd libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00

1 2 3 4 5 ...

7782 Commits