AlexIndustrial/mesa

Author	SHA1	Message	Date
Lionel Landwerlin	df059c6781	intel/clc: deal with SPIRV-Tools linker new behavior We're now required to provide all modules to link at the same SPIRV version. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	21aa1d3de1	intel/clc: fixup shared memory offsets We're running the io lowering twice, so we need to reset some fields so the offsets don't go over what is really needed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	de9c2312ea	intel/clc: compile fix Fixes: `c15bf88f01` ("intel: Add a little OpenCL C compiler binary") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	a7f264f33a	intel/clc: add option to printout kernel prog_data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	451f907d16	intel/kernel: enable linkage cap Linkage should have happened before this in intel_clc. This just silences a parser warning. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	bb4ff3e6e2	intel/kernel: enable groups caps This is roughly the same as SpvCapabilityGroupNonUniform (subgroup_basic). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Iván Briano	5afbb0e730	intel/fs: handle interpolation modes for at_sample and at_offset too Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15424>	2022-03-22 19:05:05 +00:00
Lionel Landwerlin	9ca29c687b	intel/clc: disable tool prior to Gfx12.5 platforms This tool is currently only aimed at Gfx version 12.5+ with COMPUTE_WALKER. We could make it work on earlier platforms but they require pushing gl_SubgroupInvocation and the CLC code is missing the back-end compiler set-up bits for that. v2: Commit description by Jason Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	c735c4ca85	intel/clc: specify supported extensions Having everything ever known to man is confusing our SPIRV parser. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	a29b1d5716	intel/clc: allow producing SPIRV files Useful to debug the parser. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	77e929a527	intel/clc: allow multiple CL files to be compiled together v2: use util_dynarray_append() (Jason) indentation fixes (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	c15bf88f01	intel: Add a little OpenCL C compiler binary v2: Fix up indentation (Marcin) s/gen/gfx/ (Marcin) Deal with fd closing in success/fail cases (Marcin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	ec6e247a40	intel/fs: handle inline data on OpenCL style kernels This is for Gfx12.5 with the COMPUTE_WALKER::Inline Data payload. We do this in a similar way to the compute kernels. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	4d8e788663	intel/kernel: Implement some Intel built-in functions v2: Document mangled function names (Marcin) Fixup progress & metadata (Marcin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	346a7f14fb	intel/compiler: Add code for compiling CL-style SPIR-V kernels v2: simplify INTEL_DEBUG expressions (Marcin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	d1bddfba6b	intel/nir: Add optimizations to help OpenCL-style kernels Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	4ec5da7270	intel/nir/fs: replace COMPUTE || KERNEL by gl_shader_stage_is_compute() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Ian Romanick	19330eeb1d	intel/fs: Force destination types on DP4A instructions Most of the time, this doesn't matter. On the versions with _sat, if the destination type is incorrect, the clamping will not happen correctly. Fixes the following CTS tests: dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_ss_v4i8_out32 
dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_uu_v4i8_out32 v2: Update anv-tgl-fails.txt. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Fixes: `0f809dbf40` ("intel/compiler: Basic support for DP4A instruction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15417>	2022-03-17 22:39:04 +00:00
Sagar Ghuge	2e336c602d	intel/fs: Add Wa_14014435656 For any fence greater than local scope, always set flush type to at least invalidate so that fence goes on properly. v2: Fixup condition to trigger workaround (Lionel) v3: Simplify workaround (Curro) v4: Don't drop the existing WA (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Sagar Ghuge	6031ad4bf6	intel/fs: Add Wa_22013689345 v2: Use a simpler framework (Lionel) v3: Rebase, add task/mesh (Lionel) v4: Fixup fence exec size (SIMDX -> SIMD1) v5: Fix invalidate_analysis, add finishme comment (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Ernst Sjöstrand	e5f3689cff	intel/compiler: Fix non-trivial designated initializer Not supported by GCC 7. src/compiler/nir/nir_builder_opcodes.h:14156:118: sorry, unimplemented: non-trivial designated initializers not supported src/intel/compiler/brw_mesh.cpp:515:7: note: in expansion of macro ‘nir_store_per_primitive_output’ Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Signed-off-by: Ernst Sjöstrand <ernstp@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15360>	2022-03-14 09:56:04 +00:00
Dave Airlie	7edda218fd	intel: add some missing debug recompile info. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15315>	2022-03-12 08:12:41 +00:00
Marcin Ślusarz	81df66bfff	intel/compiler: mark some variables as per-primitive in FS if they come from MS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	8c16ce53a9	intel/compiler: handle ViewportIndex, PrimitiveID and Layer in MUE setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	bc4f8c073a	intel/compiler: inject MUE initialization Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	333a490e32	intel/compiler: shift mesh urb read/write window when offset is too large Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Lionel Landwerlin	96c8880900	intel/fs: fix total_scratch computation We only have a single prog_data::total_scratch for all shader variants (SIMD 8, 16, 32). Therefore we should always max the total_scratch on top of existing variant. We probably haven't run into that issue before because we compile by increasing SIMD size and higher SIMD size is more likely to spill. But for bindless shaders with return shaders, if the last return part doesn't spill, we completely ignore the previous parts' scratch computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15193>	2022-03-02 13:13:03 +00:00
Caio Oliveira	dc77542ed2	intel/compiler: Use pass helper in brw_nir_adjust_offset_for_arrayed_indices Also change the code to preserve certain metadata: control flow is not changed so both block indices and dominance information is preserved. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15206>	2022-03-02 10:46:23 +00:00
Caio Oliveira	7460199a2f	intel/compiler: Lower Task/Mesh I/O before SIMD specific lowering These are the same for all variants, so just lower it before cloning the nir_shader for each of them. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15019>	2022-03-01 07:35:13 +00:00
Marcin Ślusarz	e5c39bc427	intel/compiler: optimize flat inputs mask calculation Don't bother looking at urb if variable is not flat. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15169>	2022-02-25 22:34:22 +00:00
Marcin Ślusarz	e2cb562dd1	intel/compiler: ignore per-primitive attrs when calculating flat input mask If we say that per-primitive attributes are flat (which is communicated by 3DSTATE_SBE.ConstantInterpolationEnable), GPU freaks out and applies it to other (non-flat) attributes. Fixes: `be89ea3231` ("intel/compiler: Handle per-primitive inputs in FS") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15169>	2022-02-25 22:34:22 +00:00
Marcin Ślusarz	f91bfc80ba	intel/compiler: remove redundant code from fs_visitor::run_* Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15079>	2022-02-22 09:09:05 +00:00
Lionel Landwerlin	2763a8af5a	anv/genxml/intel/fs: fix binding shader record entry Bit is flipped compared to all the other packets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `705395344d` ("intel/fs: Add support for compiling bindless shaders with resume shaders") Fixes: `c3ac9afca3` ("anv: Create and return ray-tracing pipeline SBT handles") Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15078>	2022-02-19 13:50:56 +00:00
Ian Romanick	a01b262990	nir: Add missing dependency on nir_opcodes.py Commit `38800b38` changed nir_opcodes.py, but that doesn't seem to have triggered nir_opt_algebraic.py. The change in `75ef5991` depends on opt_algebraic lowering 16-bit versions of slt, but if opt_algebraic is not rebuilt, this may not happen. This resulted in some people seeing assertion failures in, for example, dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step, due to the backend seeing nir_op_slt that it didn't know how to handle. v2: Add nir_opcodes.py to nir_algebraic_py so that all the per-driver algebraic passes pick up the dependency too. Rename it to nir_algebraic_depends. Suggested by Emma. Closes: #6047 Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15050>	2022-02-17 22:57:33 +00:00
Iván Briano	81f97905c3	intel/compiler: make CLUSTER_BROADCAST always deal with integers This way we don't run afoul of regioning restrictions around floating point types. Cc: 22.0 <mesa-stable> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15039>	2022-02-16 21:36:42 +00:00
Iván Briano	db48dcb4f3	intel/compiler: remove what looks like a bad rebase This bit in the compiler looks like it was added by accident on one of the latest versions of the original commit, but it clearly doesn't belong there. Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15016>	2022-02-15 01:04:47 +00:00
Marcin Ślusarz	b6557b80a5	intel/compiler: fix array & struct IO lowering in mesh shaders We really need offsets to be in dwords, not in vec4s. The bug manifests as random failure of func.mesh.clipdistance.5 crucible test, where stores to gl_MeshVerticesNV[x].gl_ClipDistance[4+n] actually write to gl_MeshVerticesNV[x].gl_ClipDistance[1+n]. Fixes: `1f438eb033` ("intel/compiler: Implement Mesh Output") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14997>	2022-02-14 19:18:23 +00:00
Timur Kristóf	0445802ab2	compiler: Extract num_mesh_vertices_per_primitive function. Prevent code duplication. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:42 +01:00
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Ian Romanick	38a94c82e6	intel/fs: Don't optimize out 1.0x and -1.0x This (sort of) matches the behavior of nir_opt_algebraic. This ensures that subnormal values are properly flushed to zero. With the aid of "nir/search: Float sources of texture instructions are float users" and "nir/search: Transitively apply is_only_used_as_float", there would have been no shader-db regressions on Intel platforms. However, those caused a significant increase in compile time. Since the instruction regressions were so small, I just dropped those commits rather than improve them. All Haswell and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20125042 -> 20125094 (<.01%) instructions in affected programs: 7184 -> 7236 (0.72%) helped: 0 HURT: 32 HURT stats (abs) min: 1 max: 4 x̄: 1.62 x̃: 2 HURT stats (rel) min: 0.11% max: 1.49% x̄: 0.85% x̃: 0.78% 95% mean confidence interval for instructions value: 1.39 1.86 95% mean confidence interval for instructions %-change: 0.74% 0.96% Instructions are HURT. total cycles in shared programs: 862745586 -> 862746551 (<.01%) cycles in affected programs: 109872 -> 110837 (0.88%) helped: 12 HURT: 23 helped stats (abs) min: 2 max: 774 x̄: 90.83 x̃: 19 helped stats (rel) min: 0.07% max: 25.23% x̄: 3.06% x̃: 0.40% HURT stats (abs) min: 2 max: 1106 x̄: 89.35 x̃: 12 HURT stats (rel) min: 0.08% max: 45.40% x̄: 3.01% x̃: 0.47% 95% mean confidence interval for cycles value: -60.09 115.23 95% mean confidence interval for cycles %-change: -2.21% 4.07% Inconclusive result (value mean confidence interval includes 0). All of the shaders hurt are in either UE4 shooter-game or shooter_demo. 
Tiger Lake Instructions in all programs: 159893213 -> 159893290 (+0.0%) SENDs in all programs: 6936431 -> 6936431 (+0.0%) Loops in all programs: 38385 -> 38385 (+0.0%) Cycles in all programs: 7019259514 -> 7019260087 (+0.0%) Spills in all programs: 101389 -> 101389 (+0.0%) Fills in all programs: 131532 -> 131532 (+0.0%) Ice Lake Instructions in all programs: 143624164 -> 143624235 (+0.0%) SENDs in all programs: 6980289 -> 6980289 (+0.0%) Loops in all programs: 38383 -> 38383 (+0.0%) Cycles in all programs: 8440082767 -> 8440083238 (+0.0%) Spills in all programs: 102246 -> 102246 (+0.0%) Fills in all programs: 131908 -> 131908 (+0.0%) Skylake Instructions in all programs: 134185424 -> 134185495 (+0.0%) SENDs in all programs: 6938790 -> 6938790 (+0.0%) Loops in all programs: 38356 -> 38356 (+0.0%) Cycles in all programs: 8222366529 -> 8222366923 (+0.0%) Spills in all programs: 98821 -> 98821 (+0.0%) Fills in all programs: 125218 -> 125218 (+0.0%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `f5dd6dfe01` ("anv: enable VK_KHR_shader_float_controls and SPV_KHR_float_controls") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Jordan Justen	e2cd0c3a3c	intel/fs: Assert that old pull-const code is not used if devinfo->has_lsc Jason changed this to use LSC in: `f5876dfdb9` ("intel/fs: Lower uniform pull constant load message to LSC dataport") Cc: 22.0 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14384>	2022-02-09 10:39:17 +00:00
Rohan Garg	03e1e19246	anv: Refactor descriptor copy Refactor descriptor copies to use the existing helper functions instead of rolling our own. In order to facilitate this, we need to store the appropriate buffer views for the relevant descriptors internally and reuse them in the helpers. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14909>	2022-02-09 09:24:37 +00:00
Lionel Landwerlin	c78be5da30	intel/fs: lower ray query intrinsics v2: Add helper for acceleration->root_node computation (Caio) v3: Update comment on "done" bit (Caio) Remove progress bool value for impl function (Caio) Don't use nir_shader_instructions_pass to search the shader (Caio) v4: Rename variable for if/else block (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	712d8fb043	intel/nir: document RT builder Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	0465714790	intel/nir/rt: add more helpers for ray queries v2: Split stack_id helper in sync/async version (Caio) Fixup a few bit field mistakes (Caio) Simplify some bitfield manipulations (Caio) v3: Remove duplicated helper (Caio) Simplify brw_nir_rt_set_dword_bit_at (Caio) Comment brw_nir_rt_query_mark_init (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	d5b994ec8a	intel/nir/rt: make RT manipulation helpers helper invocations ready Since we need to be able to perform ray queries in helper invocations, we need to have all the helpers properly tag their load/store operations so that they operate in helper lanes. v2: Switch from macros to inline functions (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	fb69fed65b	intel/nir: document committed argument Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	b0624e414f	intel/fs: make trivial shader complete tracing operations with missing shaders v2: Apply workaround only on < DG2-512-C0 & < DG2-128-B0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	35bd19f53d	intel/nir/rt: load bvh_level value off mem_hit structure Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	6d9ae6ec1e	intel: add a new intrinsic to get the shader stage from bindless shaders We'll use this to apply ray tracing operations in our trivial return shader based on the stage we're in. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00

2037 Commits