AlexIndustrial/mesa

Author	SHA1	Message	Date
Nanley Chery	ea4de4ad3d	anv: Don't ambiguate for undefined layouts on TGL+ For Tiger Lake and onward, we generally don't need to ambiguate the CCS before accessing it. This is safe for two reasons: - Tiger Lake and onward treat all CCS values as legal. - We enable compression on all writable image layouts. The CCS will receive all writes and will therefore always be valid. When dealing with modifiers, we continue to allow ambiguates in some instances. Before this patch, I found ~19.5k ambiguates in Wolfenstein: Youngblood's Riverside benchmark (note that this includes manually entering the benchmark and exiting the app). With this patch, the number of ambiguates goes down to zero. Improves performance of Fallout 4 at 1080p/High settings on Arc A380 by around 22%. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	5c84b31891	anv: Move aux vars up in transition_color_buffer I'd like to reuse one of them for an assert. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Sviatoslav Peleshko	77ecf9149c	anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode Fixes: `313aeee8` ("anv: Use pending pipe control mechanism in flush_pipeline_select() ") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7816 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20124>	2022-12-03 00:10:32 +00:00
Lionel Landwerlin	4d05be49c2	anv: implement vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	675c5bd4cc	anv: refactor ray tracing dispatch Preparing for vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	6202a2c6b4	intel/rt/nir: enable the trampoline shader to load the indirect ray shader bsr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	af3f7948d1	anv: correctly predicate ray tracing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7479fe6ae0` ("anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	7d7c32de4c	anv/genxml: make gen_rt more like other genxml files The main goal is to be able to generate genX_bits.h for those structures so we can get generated field offsets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	bbbc8e7ce7	anv: use the anv_state_pool address helper more Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19983>	2022-11-25 10:29:56 +00:00
Lionel Landwerlin	9bb055ff5d	anv: generate correct addresses for state pool offsets Fixes a number of CTS patterns on DG2 : - dEQP-VK.dynamic_rendering.primary_cmd_buff.random* - dEQP-VK.draw.secondary_cmd - dEQP-VK.dynamic_rendering.secondary_cmd - dEQP-VK.geometry.secondary_cmd_buffer - dEQP-VK.multiview.secondary_cmd* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9c1c1888d9` ("intel/fs: put scratch surface in the surface state heap") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19946>	2022-11-23 14:37:19 +00:00
Yonggang Luo	40a9fc57aa	tree-wide: Use __func__ instead of __FUNCTION__ in non-gallium code Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19861>	2022-11-22 06:53:46 +00:00
Lionel Landwerlin	9c1c1888d9	intel/fs: put scratch surface in the surface state heap In `4ceaed7839` we made scratch surface state allocations part of the internal heap (mapped to STATE_BASE_ADDRESS::SurfaceStateBaseAddress) so that it doesn't uses slots in the application's expected 1M descriptors (especially with vkd3d-proton). But all our compiler code relies on BSS (STATE_BASE_ADDRESS::BindlessSurfaceStateBaseAddress). The additional issue is that there is only 26bits of surface offset available in CS instruction (CFE_STATE, 3DSTATE_VS, etc...) for scratch surfaces. So we need the drivers to put the scratch surfaces in the first chunk of STATE_BASE_ADDRESS::SurfaceStateBaseAddress (hence all the driver changes). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ceaed7839` ("anv: split internal surface states from descriptors") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7687 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727>	2022-11-19 14:58:58 +00:00
Tapani Pälli	0d85a0d7cd	anv: remove dg2 condition for Wa_22011440098 We need same workaround for MTL. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19636>	2022-11-11 10:38:24 +00:00
Tapani Pälli	ecd4517560	anv: setup stage bitmask for Wa_22011440098 Fixes: `40b66a4499` ("anv, iris: Add Wa_22011440098 for DG2") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19636>	2022-11-11 10:38:24 +00:00
Lionel Landwerlin	4ceaed7839	anv: split internal surface states from descriptors On Intel HW we use the same mechanism for internal operations surfaces as well as application surfaces (VkDescriptor). This change splits the surface pool in 2, one part dedicated to internal allocations, the other to application VkDescriptors. To do so, the STATE_BASE_ADDRESS::SurfaceStateBaseAddress points to a 4Gb area, with the following layout : - 1Gb of binding table pool - 2Gb of internal surface states - 1Gb of bindless surface states That way any entry from the binding table can refer to both internal & bindless surface states but none of the driver allocations interfere with the allocation of the application. Based off a change from Sviatoslav Peleshko. v2: Allocate image view null surface state from bindless heap (Sviatoslav) Removed debug stuff (Sviatoslav) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7110 Cc: mesa-stable Tested-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19275>	2022-11-11 10:13:27 +00:00
Jason Ekstrand	415bf88637	anv: Switch to common code for command buffer lifecycles This gets us command buffer object recycling. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18383>	2022-11-10 11:15:23 +00:00
Jason Ekstrand	402a9a36f0	anv: Rip out shadow surfaces These are only used for storage-compatible compressed surfaces on Broadwell and earlier and Stencil on Gfx7 where there isn't proper stencil sampling support. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18402>	2022-11-07 12:07:11 +00:00
Lionel Landwerlin	ba0336ab3f	anv: Reduce RHWO optimization (Wa_1508744258) Implement Wa_1508744258: Disable RHWO by setting 0x7010[14] by default except during resolve pass. Disable the RCC RHWO optimization at all times except when resolving single sampled color surfaces. v2: Move stalling to genX(cmd_buffer_apply_pipe_flushes) for clarity (Mark) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19450>	2022-11-03 10:47:59 +00:00
Marcin Ślusarz	ea7e331fb8	anv: add mesh shading tracepoints Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19344>	2022-10-27 15:03:28 +00:00
Marcin Ślusarz	2bc82581ad	anv: add support for mesh shading in INTEL_MEASURE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19344>	2022-10-27 15:03:28 +00:00
Lionel Landwerlin	b49b18f0b7	anv: reduce BT emissions & surface state writes with push descriptors Zink on Anv running Gfxbench gl_driver2 is significantly slower than Iris. The reason is simple, whereas Iris implements uniform updates using push constants and only has to emit 3DSTATE_CONSTANT_* packets, Zink uses push descriptors with a uniform buffer, which on our implementation use both push constants & binding tables. Anv ends up doing the following for each uniform update : - allocate 2 surface states : - one for the uniform buffer as the offset specify by zink - one for the descriptor set buffer - pack the 2 RENDER_SURFACE_STATE - re-emit binding tables - re-emit push constants Of all of those operations, only the last one ends up being useful in this benchmark because all the uniforms have been promoted to push constants. This change defers the 3 first operations at draw time and executes them only if the pipeline needs them. Vkoverhead before / after : descriptor_template_1ubo_push: 40670 / 85786 descriptor_template_12ubo_push: 4050 / 13820 descriptor_template_1combined_sampler_push, 34410 / 34043 descriptor_template_16combined_sampler_push, 2746 / 2711 descriptor_template_1sampled_image_push, 34765 / 34089 descriptor_template_16sampled_image_push, 2794 / 2649 descriptor_template_1texelbuffer_push, 108537 / 111342 descriptor_template_16texelbuffer_push, 20619 / 20166 descriptor_template_1ssbo_push, 41506 / 85976 descriptor_template_8ssbo_push, 6036 / 18703 descriptor_template_1image_push, 88932 / 89610 descriptor_template_16image_push, 20937 / 20959 descriptor_template_1imagebuffer_push, 108407 / 113240 descriptor_template_16imagebuffer_push, 32661 / 34651 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Lionel Landwerlin	d7f1569307	anv: limit push constant reemission Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Lionel Landwerlin	803f438d85	anv: optimize 3DSTATE_VF emission We can avoid reemitting this when the index buffer index type doesn't change. Also we don't need to update this when the pipeline changes as we do not pull any value from the pipeline. Instead rely on the dynamic state to tell if dyn->ia.primitive_restart_enable changed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Lionel Landwerlin	126f5bc15a	anv: limit calls into cmd_buffer_flush_dynamic_state Avoids a bunch of checks if we can. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Lionel Landwerlin	54bc34f70a	anv: comment out the Gfx8/9 VB cache key workaround for newer Gens This code shows up a little on profiling on Gfx12 and since it's only a gfx8/9 workaround we might as well ifdef it out. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Lionel Landwerlin	f8136ea5b6	anv: remove unused code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Tapani Pälli	0b75376e4d	anv: dynamic provoking vertex mode This affects following packets: 3DSTATE_CLIP 3DSTATE_SF 3DSTATE_STREAMOUT Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18879>	2022-10-11 16:29:05 +00:00
Tapani Pälli	1a8209218e	anv: dynamic states for depth clip and clamp This change implements 3 states in one go: - depth clamp enable - depth clip enable - depth clip negative one to one This affects following packets: 3DSTATE_CLIP 3DSTATE_VIEWPORT_STATE_POINTERS_CC 3DSTATE_RASTER v2: remove clip enable bit check from viewport emit (Lionel) v3: use helper function from runtime to get depth clip (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18879>	2022-10-11 16:29:04 +00:00
Tapani Pälli	0a6d0fed9d	anv: dynamic rasterization stream This affects following packets: 3DSTATE_STREAMOUT Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18879>	2022-10-11 16:29:04 +00:00
Tapani Pälli	cc0ada2d67	anv: dynamic state for polygon mode Remove 'polygon_mode' from pipeline and read it from dynamic state instead. This affects following packets: 3DSTATE_CLIP 3DSTATE_RASTER Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18879>	2022-10-11 16:29:04 +00:00
Rohan Garg	c0c243f1cb	anv, iris: Disable pre fetching the binding table entries on DG2 On DG2 the HW will fetch the binding entries into the cache for every single thread when a compute walker is dispatched, wiping out the advantages of the cache prefetch. The spec also advises to not do a cache prefetch when we have more than 31 binding table entries, but most real world applications will never hit that limit. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18498>	2022-10-11 15:16:09 +02:00
Lionel Landwerlin	64e8b0d255	anv: use the right dispatch size for tracing shaders We assumed the trampoline shader would always be SIMD8. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Zhang, Jianxun <jianxun.zhang@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:37 +00:00
Lionel Landwerlin	5ad803840d	anv: setup scratch space correctly for RT shaders Things are a bit confusing because we use the term "scratch" for 2 different things : * the buffer for register allocation spilling * the buffer for storing live values between splitted shaders around shader calls Here we're fixing the missing register allocation spilling buffer. v2: update comments (Caio) fix scratch bo size computation with pipeline libraries (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:37 +00:00
Jason Ekstrand	f3ddfd81b4	anv: Build BVHs on the GPU with GRL Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:37 +00:00
Jason Ekstrand	639665053f	anv/grl: Build OpenCL kernels Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:37 +00:00
Jason Ekstrand	6c76ceb613	anv: Add support for OpenCL-style kernel dispatch v2: Use brw_cs_get_dispatch_info() (Lionel) Merge barrier fixes (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:37 +00:00
Jason Ekstrand	5814436159	anv: Set up the memory-backed FIFO buffer v2: Fix incorrect goto (Caio) Comment 3DSTATE_BTD programming (Caio) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16970>	2022-09-28 05:38:36 +00:00
Tapani Pälli	f2645229c2	anv: implement Wa_14016118574 After each 3DPRIMITIVE, we need to send a dummy post sync op if point or line list was used or if had only 1 or 2 vertices per primitive. v2: add missing _3DPRIM_POINTLIST_BF (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18746>	2022-09-23 12:27:05 +00:00
Lionel Landwerlin	f9dbb65e7f	anv: add missing wokraround for texture cache invalidate Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18743>	2022-09-22 23:45:16 +00:00
Lionel Landwerlin	79c2f9e7cb	anv: trace xfb queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17467>	2022-09-21 12:38:34 +00:00
Lionel Landwerlin	b12d95f513	anv: add missing tracepoint Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17467>	2022-09-21 12:38:34 +00:00
Tapani Pälli	85fc1decf0	anv: remove primitive_topology from 3DPRIMITIVE calls Field is ignored on BDW+, 3DSTATE_VF_TOPOLOGY is used to set topology. We still want to preserve topology information in state because of other upcoming changes that require it. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18698>	2022-09-21 04:42:42 +00:00
Mike Blumenkrantz	0bf18cc483	anv: force inline more pipe flush functions yields increased ~33% draw throughput Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18637>	2022-09-20 20:53:22 +00:00
Lionel Landwerlin	39c6e4db25	anv: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT Based off a patch from zmike Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18637>	2022-09-20 20:53:22 +00:00
Lionel Landwerlin	1be09ae81a	anv: don't export gfx state flushing helper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18637>	2022-09-20 20:53:22 +00:00
Lionel Landwerlin	6aa2ddb9b6	anv: don't export flush_compute_state Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18637>	2022-09-20 20:53:22 +00:00
Illia Polishchuk	74658b01d2	driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue Intel has different Z interpolation float point rounding than other mesa gpus For example gl_Position.z = 0.0 will be interpolated to gl_FragCoord.z = 0.5 for all gpus gl_FragCoord = -0.00000001 will be interpolated to gl_FragCoord.z = 0.4999999702 for Intel and rounded to gl_FragCoord.z = 0.5 for other gpus Games with LEQUAL depth func will fail depth test on Intel and will pass it on other gpus in such case This workaround lowers translated depth range and several gl_FragCoord.z coords with extra small difference will be translated to the same UINT16\UINT24\UINT32 value of an integer depth buffer Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7199 Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18412>	2022-09-19 10:08:48 +00:00
Marcin Ślusarz	d5dedecfe7	anv: implement draw calls for EXT_mesh_shader Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18371>	2022-09-02 17:40:47 +00:00
Marcin Ślusarz	b3354afd89	anv: replace VK_SHADER_STAGE_[TASK\|MESH]_BIT_NV with VK_SHADER_STAGE_[TASK\|MESH]_BIT_EXT They have the same numerical values, so nothing changes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18371>	2022-09-02 17:40:47 +00:00
Kenneth Graunke	bc68e7b564	anv: Remove anv_batch_emit_reloc and just open-code it We don't need the relocation offsets anymore, and just want to pin the BO, and combine the address into a uint64_t. We can just open code those two things; it's actually less code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18208>	2022-09-02 09:40:46 +00:00

759 Commits