AlexIndustrial/mesa

Author	SHA1	Message	Date
Paulo Zanoni	a099d6ae4d	intel: add devinfo->has_64bit_float_via_math_pipe Unusual hardware features that require special hanlding usually get a devinfo field, so do this for MTL's unordered DF types. This will guarantee that any platform based on MTL (thus inheriting from MTL_FEATURES) will automatically be handled in these special cases. v2: s/has_unordered_64bit_float/has_64bit_float_via_math_pipe/ (Curro). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	eac00f4ec7	intel/compiler: fix intel_swsb_decode for newer platforms In the previous patch we adjusted the scoreboard pass to take into consideration a new case of unordered operations for TGL. Fix the decoding as well. v2: use intel_device_info_is_mtl() (Curro, Jordan) v3: the part where we export num_sources_from_inst() is now a separate patch (Curro). v4: Work around false positive maybe-unitialized warning since Marge uses -Werror=maybe-uninitialized (Marge). Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v3) Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	295c5f59e0	intel/compiler: export brw_num_sources_from_inst We want to call this from brw_disasm.c, so move it out to brw_eu.c since it's about to become more of a shared utility function than something specific to the EU validator. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	df50add27e	intel/compiler: avoid 64bit SEL_EXEC on MTL On MTL, instructions with DF type are unordered, executed in the math pipe. This means that they require different SWSB dependency handling, and also that in some cases such as MOVs it's generally faster to simply use 2 smaller ordered moves than a single unordered MOV. One problem we have with the current code is that generate_code() is not setting the proper SWSB dependencies for the generated DF MOVs, causing some tests to fail. One solution would be to fix generate_code() by making it set the appropriate dependencies. This was the first patch I wrote. Another solution to this problem, pointed to us by Curro, is to change required_exec_type() so we use UD instructions instead of DF, just like we do with platforms that don't have 64 bit instructions, which means there won't be anything to fix in generate_code(). The second solution is what this patch implements. This fixes at least: - dEQP-VK.subgroups.arithmetic.framebuffer.subgroupmin_double_vertex Thanks to Francisco Jerez for all the major help provided with this problem. Credits-to: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	951855c349	intel/compiler: avoid (RegDist, SBID) on DF instructions on MTL When we use this form there's no way to specify which pipe RegDist refers to, so there are a few rules to figure this out, which is what inferred_sync_pipe() implements. But for MTL there's no long pipe and the documentation does not explicitly explain what should be the inferred type for its long (DF) instructions - which are out-of-order, by the way. One way to interpret this is that such case should be avoided. So add the extra check to entirely avoid this case. Notice that this is not actually fixing any bug, since returning TGL_PIPE_LONG (what we do today) will actually make these DF instructions incompatible with every in-order instruction, so we'll never opt to use the (RegDist, SBID) form anyway. But still, it's better to have this case explicitly documented instead of having it covered by a semi coincidence. v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	16b9f87104	intel/compiler: on MTL, DF instructions run in the math pipe Adjust the scoreboard code to take that into account. Fixes at least: - dEQP-VK.glsl.builtin.precision_double.refract.compute.vec3 - dEQP-VK.glsl.builtin.precision_double.matrixcompmult.compute.mat4 v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Francisco Jerez	051887fbf3	intel/fs: Make the result of is_unordered() dependent on devinfo. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Lionel Landwerlin	d608706875	Revert "anv: compile anv_acceleration_structure.c" This reverts commit `74d0be27ae`. Also remove anv_acceleration_structure.c, it was meant to be removed earlier. There was probably a rebase issue somewhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20248>	2022-12-10 01:16:16 +00:00
Rebecca Mckeever	aa76b70751	hasvk: Delete VK_KHR_device_group provided entrypoints Delete anv_CmdDispatch, anv_CmdSetDeviceMask, and anv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:59 -06:00
Rebecca Mckeever	43f9c66224	anv: Delete VK_KHR_device_group provided entrypoints Delete anv_CmdDispatch, anv_CmdSetDeviceMask, and anv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:48 -06:00
Kenneth Graunke	8c2448d4e6	intel/compiler: Delete sampler key handling for planar format stuff i965 used these, but Gallium drivers do this lowering via a separate nir_lower_tex call from st/mesa. Vulkan drivers don't use these at all. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20223>	2022-12-09 10:18:25 +00:00
Kenneth Graunke	88918baf5c	intel/compiler: Delete key->msaa_16 None of the drivers have used this since we dropped i965, and BLORP no longer uses it as of the previous commit. We can also drop the former compressed_multisample_tex_mask (now padding) field so that things remain 64-bit aligned. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20223>	2022-12-09 10:18:25 +00:00
Kenneth Graunke	5d2a290cc7	intel/blorp: Set key->msaa_16 unconditionally on Gfx9+ This will result in us using the TXF_CMS_W message rather than the TXF_CMS message on Skylake through Tigerlake for 2/4/8x MSAA blits, which is technically slightly worse. However, it shouldn't be that much worse: the TXF_CMS message was removed altogether on Alchemist. iris and anv set key->msaa_16 unconditionally, to avoid paying the cost of shader recompiles for a miniscule gain. crocus and hasvk don't need to set it as they don't support 16x MSAA. BLORP already recompiles based on the sample count, so it could easily keep doing this for the minor benefit. But avoiding it will let us drop the entire msaa_16 key field out of the compiler, which is nice. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20223>	2022-12-09 10:18:25 +00:00
Kenneth Graunke	584e18863e	intel: Drop compressed_multisample_layout_mask from the compiler keys The compiler looks at this key field to determine whether to perform an MCS fetch for a txf_ms or samples_identical texture message, if a nir_tex_src_ms_mcs_intel source wasn't provided. If it isn't set, it instead uses constant 0 (nothing is compressed). All of the drivers (iris, crocus, anv, hasvk) unconditionally set this to ~0 because we don't want to pay for costly shader recompiles (which can cause nasty stuttering). Most textures are compressed anyway, and the hardware ignores the l2dms MCS parameter if MCS is disabled. The only user was BLORP, which sets the key field based on whether the texture's aux usage has MCS. But if it has MCS, it also does the MCS fetch itself and supplies it directly. Otherwise, it relies on the compiler to fill in the 0 value. But it could easily just provide the 0 value itself in that case and not rely on the compiler at all. With that fixed, we can just drop the key fields entirely. We leave them as padding for now to avoid repacking structures; we won't need to after the next commits anyway. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20223>	2022-12-09 10:18:25 +00:00
Jianxun Zhang	5c62f526a4	intel/common: use format struct in aux mapping Refactor aux mapping with the new format struct and helpers. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20167>	2022-12-09 09:49:42 +00:00
Jianxun Zhang	8ad9549677	intel/common: initialize format of aux mapping on GFX12 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20167>	2022-12-09 09:49:42 +00:00
Jianxun Zhang	cf3ee73f8f	intel/common: fix style of some comments in intel_aux_map.h Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20167>	2022-12-09 09:49:42 +00:00
Jianxun Zhang	d0520430aa	intel/common: Add a new struct to describe AUX mapping format The new struct and some helper functions are for further refactoring. Reworks: * Jordan: Refactor code around aux format array Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20167>	2022-12-09 09:49:42 +00:00
Jianxun Zhang	6b3740f359	intel/common: Add an enum of formats of AUX mapping The new enum allows us to support multiple formats in the future. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20167>	2022-12-09 09:49:42 +00:00
Lionel Landwerlin	90c86fe63e	intel: add MTL performance metrics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20228>	2022-12-09 09:13:02 +00:00
Väinö Mäkelä	d4bcfed422	hasvk: Allow aliasing with modifiers for WSI images Ignore ALIAS_BIT when format comes from WSI because we have the ability to bind the MEMORY_BINDING_PRIVATE from the other WSI image. This commit is the same as `f350b78b` but for hasvk. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19840>	2022-12-09 08:35:02 +00:00
Tapani Pälli	68ef0d8448	anv: emit sample mask state independent of fragment stage Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7861 Fixes: `9f6af43743` ("anv: dynamic multisample sample mask") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20221>	2022-12-09 08:00:42 +00:00
Jordan Justen	0d9be82fe6	intel/genxml: Add genX_rt_pack.h Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20225>	2022-12-09 01:43:39 +00:00
Lionel Landwerlin	b4b4294a78	intel/fs: add a saturation propagation test Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20206>	2022-12-09 00:39:05 +00:00
Oleksii Bozhenko	d5d8bb1dbb	brw: fix saturate propagation region overlap range Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/commit/947c828d5cbffe9640ac63103a6223112eeff27f Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7691 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Oleksii Bozhenko <oleksii.bozhenko@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20206>	2022-12-09 00:39:05 +00:00
Tapani Pälli	bc4b7de0d0	intel/fs: implement Wa_14017989577 The first instruction of any kernel should have non-zero emask. This restriction needs to be obeyed to avoid GPU hangs. Patch adds a function to insert dummy mov as first instruction to make sure this requirement is fulfilled. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20194>	2022-12-08 23:58:32 +00:00
Kenneth Graunke	bafbe7c23a	intel/compiler: Set NoMask on cr0 access for float controls mode This is trying to clear a bit in the control register. However, it's executing with whatever channel mask happens to be active. Typically this is the one at the start of the program, so at least some channels will be active. Typically the first channel will be active due to packed dispatch, but that's not always guaranteed. Without NoMask, the float controls writes may randomly not happen. Recent GPUs also seem to have a hang issue when the first instruction in the shader doesn't have any active channels. Having an instruction with NoMask at the start of the program works around the issue. See HSD bug 14017989577. In our case, the float controls preamble was breaking that restriction every time, causing us to run into this problem frequently. Thanks to Tapani Pälli for finding this hang issue, and Francisco Jerez and Lionel Landwerlin for helping pinpoint this issue during review of a workaround patch in !20194. Fixes GPU hangs in Elder Scrolls Online, Witcher 3, and likely more. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7639 Fixes: `9da56ffc52` ("i965/fs: add emit_shader_float_controls_execution_mode() and aux functions") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20214>	2022-12-08 09:54:09 +00:00
Väinö Mäkelä	4035853523	hasvk: Report correct multisampling limits on gfx7 Some limits reported by hasvk were too high, which caused CTS tests to fail. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19393>	2022-12-08 00:16:44 +00:00
Otavio Pontes	2e775b8bdb	anv/hasvk: Clamping Scissor Rect values in a valid range On cmd_buffer_emit_scissor(), if VkViewport height or width are set to a value lower than 1.0, y_max or x_max can be attributed negative values, causing an overflow. That leads to ScissorRectangleYMax or ScissorRectangleXMax to be set to values on an unsupported range. Clamping x_max and y_max in the valid range solves the problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7471 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20200>	2022-12-07 12:19:42 +00:00
Lionel Landwerlin	e25e17dd0c	intel/fs: clamp per vertex input accesses to patchControlPoints In a tesselation control shader where an input array is accessed using the index gl_InvocationID, we can end up accessing elements beyond the number of input vertices specified in the shader key. This happens because of the lowering in nir_lower_indirect_derefs(). This lowering will affect compact variables which happens in this case : in gl_PerVertex { vec4 gl_Position; float gl_ClipDistance[1]; } gl_in[gl_MaxPatchVertices]; The lowered code produced by NIR is somewhat ineffecient (implements a binary seach) : if (gl_InvocationID < 16) { if (gl_InvocationID < 8) { if (gl_InvocationID < 4) { vec4 vals = load_at_offset(0); value = bcsel(vals, gl_InvocationID); } else { vec4 vals = load_at_offset(4); value = bcsel(vals, gl_InvocationID - 4); } } else { if (gl_InvocationID < 12) { vec4 vals = load_at_offset(8); value = bcsel(vals, gl_InvocationID - 8); } else { vec4 vals = load_at_offset(12); value = bcsel(vals, gl_InvocationID - 12); } } } else { if (gl_InvocationID < 24) { ... } else { ... } } By default the gl_MaxPatchVertices must be set at 32 items and that's what the lowering code will use to divide the access into chunks of 4. But when running with 3 input vertices, this means we'll pull one more item than what was delivered in the shader payload. This triggers issues further down the register scheduling where the g5UD (register for the 4th item) is overwritten by a previous SEND, leading the URB read to use an invalid handle. This pass clamps any access load_per_vertex_input intrinsic vertex indice to (input_vertices - 1). Fixes issues with tests like : dEQP-VK.clipping.user_defined.clip_distance.vert_tess.* Also fixes a hang with zink/anv on : KHR-GL46.draw_elements_base_vertex_tests.AEP_shader_stages v2: Don't replace source register v3: Implement in NIR v4: Clamp per vertex array sizes in NIR (Jason) v5: Move the clamping on the intel compiler Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9749>	2022-12-07 08:16:03 +00:00
Marcin Ślusarz	7809f76fe8	intel/compiler/mesh: align payload size to the size of vec4 This reduces the number of instructions in task shaders when payload size is not aligned to vec4 and payload_in_shared WA is enabled, because nir_lower_task_shader will not need to handle the unaligned size case. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20080>	2022-12-06 16:31:11 +00:00
Nanley Chery	ea4de4ad3d	anv: Don't ambiguate for undefined layouts on TGL+ For Tiger Lake and onward, we generally don't need to ambiguate the CCS before accessing it. This is safe for two reasons: - Tiger Lake and onward treat all CCS values as legal. - We enable compression on all writable image layouts. The CCS will receive all writes and will therefore always be valid. When dealing with modifiers, we continue to allow ambiguates in some instances. Before this patch, I found ~19.5k ambiguates in Wolfenstein: Youngblood's Riverside benchmark (note that this includes manually entering the benchmark and exiting the app). With this patch, the number of ambiguates goes down to zero. Improves performance of Fallout 4 at 1080p/High settings on Arc A380 by around 22%. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	5c84b31891	anv: Move aux vars up in transition_color_buffer I'd like to reuse one of them for an assert. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	822687f4c0	intel/dev: Add a has_illegal_ccs_values flag Whether or not CCS can be used without initialization depends on the platform: - On gfx7-8, each CCS element is 1-bit and encodes "fast-cleared" or "pass-through". So, those platforms have no illegal values. - On gfx9-11, each CCS element is 2-bits and some bit combinations are invalid. - On gfx12+, each CCS element is 4-bits but they have no truly illegal values. Unused encodings are interpreted as "pass-through". Refer to the "MCS/CCS Buffers for Render Target(s)" sections of the PRMs for more info. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Nanley Chery	d307655e52	anv: Use specific flush reasons for CCS operations When INTEL_DEBUG=pc is set and a CCS operation is being performed, the driver reports that flushes are happing before and after the operation. It also reports that the operation is a fast clear, but that's not always the case. We could be resolving for example. Reporting the specific operation can help avoid confusion. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20118>	2022-12-06 00:49:17 +00:00
Lionel Landwerlin	d4cd33630a	intel: add missing restriction on fragment simd dispatch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7755 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169>	2022-12-06 00:37:50 +02:00
Lionel Landwerlin	b9403b1c47	intel: factor out dispatch PS enabling logic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169>	2022-12-06 00:37:47 +02:00
Sviatoslav Peleshko	77ecf9149c	anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode Fixes: `313aeee8` ("anv: Use pending pipe control mechanism in flush_pipeline_select() ") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7816 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20124>	2022-12-03 00:10:32 +00:00
Lionel Landwerlin	b7b91ae51e	anv: enable VK_KHR_ray_tracing_maintenance1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	d844fa4def	anv: implement new queries for VK_KHR_ray_tracing_maintenance1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	4d05be49c2	anv: implement vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	675c5bd4cc	anv: refactor ray tracing dispatch Preparing for vkCmdTraceRaysIndirect2KHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	df38426072	intel/rt/nir: add support for RayCullMaskKHR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	6202a2c6b4	intel/rt/nir: enable the trampoline shader to load the indirect ray shader bsr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	af3f7948d1	anv: correctly predicate ray tracing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7479fe6ae0` ("anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	7d7c32de4c	anv/genxml: make gen_rt more like other genxml files The main goal is to be able to generate genX_bits.h for those structures so we can get generated field offsets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011>	2022-12-02 09:28:23 +00:00
Lionel Landwerlin	8baacba4d6	hasvk: remove coarse pixel checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	2d150f3ecd	hasvk: Drop more DG2 code v2: remove unused devinfo (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	d0fea83d7b	hasvk: Rip out local memory support Things could probably be simplified further but this at least gets rid of most of the dead code and the dead flags and fields. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00
Jason Ekstrand	4256d2cbc2	hasvk: Rip out scratch surfaces These are a DG2+ thing Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852>	2022-12-02 09:18:17 +00:00

1 2 3 4 5 ...

8773 Commits