AlexIndustrial/mesa

Author	SHA1	Message	Date
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 \| #define unreachable(str) \ \| ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 \| #define unreachable() (__builtin_unreachable ()) \| ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" \| sort \| uniq \| \ while read file; \ do \ sed -e 's/$[^_]$unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00
Jordan Justen	bca1acbb42	intel/dev: Add WCL PCI IDs Tested with: commit 3a252ff9d8b6dc22b20463bfcb31a4e8992b0e8f Merge: 9800bf6fae3b 11895f375939 Author: Simona Vetter <simona.vetter@ffwll.ch> Date: Fri Jul 11 11:25:34 2025 +0200 Note that the kernel treats WCL similar to PTL, so 94de1dfd4729 ("drm/xe/ptl: Drop force_probe requirement") also removed the force_probe for WCL. Backport-to: 25.1 Ref: 3c0f211bc8fc ("drm/xe: Add Wildcat Lake device IDs to PTL list") Ref: 94de1dfd4729 ("drm/xe/ptl: Drop force_probe requirement") Ref: drm/drm-next 3a252ff9d8b6dc22b20463bfcb31a4e8992b0e8f Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Jordan Justen	8b771e8937	intel/dev: Add WCL device info Backport-to: 25.1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Matt Turner	7da88c76db	intel: Add support for BFloat16 as cooperative matrix accumulator The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' on PTL increases from 914 -> 1030. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35320>	2025-07-02 20:06:59 +00:00
Matt Turner	6842a8179f	intel: Add support for float16 as cooperative matrix accumulator The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' increases - on PTL from 787 -> 914 - on RPL from 799 -> 926 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13304 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
José Roberto de Souza	ddca50584c	intel: Return PTL stepping Without this no temporary workaround is applied to PTL as by default INTEL_STEPPING_RELEASE is returned and it is larger than any stepping. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35110>	2025-05-23 13:52:27 +00:00
Jianxun Zhang	ca092db7ce	intel/dev: Differentiate displayable PAT entry of compression (xe2) We need two PAT entries with compression for displayable and non-displayable compressed images. The current 'compressed' entry is renamed to 'scanout_compressed' for the displayable. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29928>	2025-05-16 16:03:54 -07:00
Caio Oliveira	07fa3b3785	intel: Add support for BFloat16 as cooperative matrix source Re-organize the configuration lists to make easier to include BFloat16 only for the Gfx125+ that support it, while keeping MTL supporting the "lowered" configurations from pre-Gfx125. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Tapani Pälli	765801fd9e	intel/dev: add note about PAT entries and Wa_18038669374 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34665>	2025-04-24 09:48:34 +00:00
Caio Oliveira	050acb9def	intel: Disable has_bfloat16 for MTL Not supported. Some operations do work, but proper support was removed since it also doesn't support DPAS. Fixes: `9916cc1050` ("brw: Add BRW_TYPE_BF for bfloat16") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34506>	2025-04-14 18:23:43 +00:00
Caio Oliveira	adfab666a4	intel: Add intel_device_info::has_systolic Gfx125+ has systolic, with exception for MTL and some ARL variants. Update code and tests to use it. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34506>	2025-04-14 18:23:43 +00:00
Lionel Landwerlin	bcaf08b47c	intel/dev: remove ADLN references Not used anymore, just use the existing ADL definitions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34433>	2025-04-11 13:54:35 +00:00
Jordan Justen	bc86fd5b1f	intel/dev: Stop checking hwconfig values at driver runtime We will move this check into the `intel_dev_info` tool. Unfortunately, this means we will be much less likely to notice inconsistencies, but the current strategy has proven to be far too noisy. For example, if the driver was built in debug mode, then when test suites are running thousands of tests, the current approach can lead to thousands of messages being printed. Closes: mesa/mesa#12141 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34243>	2025-03-27 14:52:49 -07:00
Caio Oliveira	9916cc1050	brw: Add BRW_TYPE_BF for bfloat16 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33664>	2025-03-25 05:23:37 +00:00
Kenneth Graunke	cdbedc9eff	intel: Move unlit centroid workaround into the elk compiler This was only needed on Sandybridge. We can delete the brw code, and replace the generic devinfo bit with a helper inside the elk compiler itself. Thanks to Iván Briano for noticing we still had dead brw code for this. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	404ed1d153	intel/dev: Set a higher minimum number of URB entries for GS We've been programming our minimum number of URB entries for geometry shaders to 2, but it appears that we should have been setting 8 on Broadwell and later. Additionally, there's a workaround on Skylake and later that requires us to add flushing (which we haven't) or use a minimum of 16 URB entries. This alone will not fix anything, as nothing reads this devinfo field presently (will be fixed in the next commit). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	dc66dee8ad	intel/dev: Rework device info macros for Gfx8+ As we added new platforms, the device info macros evolved over time Most platforms had a "FEATURES" macro, some had a "HW_INFO" macro, a few had macros for URB entries - some with min entries only, some with min and max, some including the .urb = { ... } braces, others not. Thread counts or subslice info was sometimes considered FEATURES, sometimes HW_INFO, sometimes inserted only in the final structure. FEATURES macros often inherited from an ancestor platform, but not necessarily the prior platform - many were based on GFX8_FEATURES. Many redundantly set the same feature bits as prior platforms. This patch aims to clean up the situation, so it's a little more organized, especially if you look at multiple generations. Macros are now split into several separate pieces: 1. The FEATURES macro only has architectural features, such as LSC, ray tracing support, 64-bit integers, flat CCS, and so on. Thread counts, subslice info, and URB sizes that may vary by SKU are not included here. This makes it easy for one platform to inherit the features from the previous, while not pulling in that extra data. 2. THREAD_COUNTS macros contain maximum thread counts from the 3DSTATE_VS documentation and so on. 3. URB_MIN_MAX_ENTRIES macros contain the entire URB configuration, including .urb = { ... }. 4. PAT_ENTRIES macros (on modern platforms) contains our choice of which PAT entries to use for various types of resources. 5. CONFIG macros combine all of the above into a tidy bundle for use in defining various structures, and may also include the platform macro or simulator ID for convenience. On recent platforms where hwconfig tables exist, items #2-3 could potentially be dropped and filled in from there instead. For XEHP+ where we require hwconfig, we instead have a PLACEHOLDER_THREADS_AND_URB macro that makes it clear that these values are updated from hwconfig. One nice thing is that the bits that could (or do) come from hwconfig tables are now cleanly separate from those that do not (i.e. platform feature support, PAT entry selection, and so on). This patch does not touch GFX7 or earlier macros. We could probably offer a similar treatment there, but they're generally working and not quite as complex. To verify that this commit does not have unintentional changes, I recommend running objdump -s build/src/intel/dev/libintel_dev.a.p/intel_device_info.c.o before and after this commit, and diffing the output. The devinfo structures produced are identical. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	20a229bc06	intel/dev: Set max_wm_threads to 0 in the Gfx9+ devinfo structs intel_device_info_init_common calculates this for Gfx9+ based on max_threads_per_psd and slice information. Mark it as zero in the structures to make clear that the value there isn't useful, and make it easier to diff binaries for the next commit's refactors. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	7ccc786acf	intel/dev: Set minimum HS URB entries to 0. The documentation for 3DSTATE_URB_HS has 0 as the minimum number of HS URB entries for all platforms. See BSpecs 32162, 47137, 56271 for Gfx6-11, Xe, and Xe2-3, respectively. This should silence warnings about our device info field not matching the hwconfig tables. Notably, nothing in our drivers currently uses this value so it cannot have a functional impact. Fixes: `4064b5546b` ("intel/dev: reduce warning noise from urb settings") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	7f6b1dee2c	intel: Move devinfo->has_compr4 into the elk compiler Used in exactly one place in elk. Off to live there. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	be8ec31e72	intel: Move devinfo->has_negative_rhw_bug into the elk compiler This is only needed for original 965G/GM clipper code, which only exists in the legacy compiler. Send it off to live with the elk. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	0bf779ed31	intel: Delete devinfo->has_surface_tile_offset This is used in exactly one place in crocus, which already has a comment indicating that this code is needed for original Gfx4 hardware. Just replace that with a verx10 == 40 check. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
Kenneth Graunke	7f50f1591b	intel: Delete devinfo->must_use_separate_stencil This is used by a single place in ISL only for sanity checking the decisions it has already made. The knowledge is already all centralized in ISL these days, so we don't need a device info bit. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33764>	2025-03-10 17:23:07 -07:00
José Roberto de Souza	7d4c91efef	intel/dev: Call intel_device_info_update_after_hwconfig() from common code Avoid backends duplication. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33585>	2025-02-17 20:52:31 +00:00
Valentine Burley	0d1fa0f1a3	intel/dev: Provide a toggle to avoid warnings about unsupported devices Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33282>	2025-02-05 14:01:03 +00:00
Tapani Pälli	4064b5546b	intel/dev: reduce warning noise from urb settings This sets up the min value as if stage was active, later on we set this to zero if such is not the case. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12141 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33353>	2025-02-04 09:07:48 +00:00
Sagar Ghuge	385977955b	intel: Set correct maxComputeSharedMemorySize for Xe3+ For Xe3+, set preferred SLM and SLM per threadgroup size. Bspec: 73211 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32872>	2025-01-07 07:06:09 +00:00
Jordan Justen	1027b071f9	intel/dev: Add intel_check_hwconfig_items() Rather than checking hwconfig items when using them, wait until after devinfo has been fully initialized. This includes having workarounds implemented. We can then check if the hwconfig data and final Mesa initialization agree. If the match fails, we need to investigate if Mesa or the hwconfig data is wrong. This code becomes a no-op when not on a release build. Fixes: `a4c5bfd34c` ("intel/dev: Use hwconfig for urb min/max entry values") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12141 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32359>	2024-12-10 09:01:45 +00:00
Tapani Pälli	c2b7bafd76	intel/dev: lower amount of max gs threads for Wa_18040209780 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32245>	2024-11-21 20:43:38 +00:00
Jordan Justen	efa7aa4e47	intel/dev: Add PTL PCI IDs (with FORCE_PROBE set) Ref: bspec 72574 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31838>	2024-10-26 07:39:30 +00:00
Jordan Justen	bd52bef69e	intel/dev: Add PTL device info Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31838>	2024-10-26 07:39:30 +00:00
Jordan Justen	2d15c23e4a	intel/dev: Add XE3_FEATURES macro Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31838>	2024-10-26 07:39:29 +00:00
Jordan Justen	d476badb48	intel/dev: Support Xe3 device init (for intel_device_info_test) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31838>	2024-10-26 07:39:29 +00:00
Jordan Justen	ee727d7b66	intel/dev: Add devinfo::probe_forced based on INTEL_FORCE_PROBE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31011>	2024-09-04 12:09:08 -07:00
Paulo Zanoni	0e38b794e2	intel: fix compute SLM sizes on Xe2 and newer Before the patch, intel_device_info_get_max_preferred_slm_size() returns values in kilobytes, but then intel_device_info_get_max_slm_size() is multiplying it by 1024. As a result, LNL is reporting maxComputeSharedMemorySize to be 134217728, which is 128mb. Fix this by making intel_device_info_get_max_slm_size() not multiply it by 1024. This should fix at least the following dEQP tests: dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.1 dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.128 dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.16 dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.2 dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.4 dEQP-VK.compute.pipeline.zero_initialize_workgroup_memory.max_workgroup_memory.64 Some tests were failing with: deqp-vk: ../../src/intel/common/intel_compute_slm.c:24: slm_encode_lookup: Assertion `kbytes <= table[table_len - 1].size_in_kb' failed. while other tests were triggering the OOM. v2: - Make everybody return sizes in bytes (José). v3: - Rename variable to bytes (José, Jordan). Fixes: `fd368f5521` ("anv: Set maxComputeSharedMemorySize value for Xe2 platforms") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30541>	2024-08-07 16:14:02 +00:00
José Roberto de Souza	de5d767f9a	intel/brw: Add a maximum scratch size restriction Gfx 12.5 moved scratch to a surface and SURFTYPE_SCRATCH has this pitch restriction: RENDER_SURFACE_STATE::Surface Pitch For surfaces of type SURFTYPE_SCRATCH, valid range of pitch is: [63,262143] -> [64B, 256KB] The pitch of the surface is the scratch size per thread and the surface should be large enough to accommodate every physical thread. So here adding a new field to intel_device_info, setting it in intel_device_info_init_common() so even offline tools can have it set. And finally adding a check to fail shader compilation if needed scratch is larger than supported. This issue can be reproduced in debug builds when running dEQP-VK.protected_memory.stack.stacksize_1024 on Gfx 12.5 or newer platforms. Ref: BSpec 43862 (r52666) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30271>	2024-07-22 18:17:38 +00:00
Francisco Jerez	bb2513918a	intel/dev: Add devinfo flag for TBIMR push constant workaround. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30031>	2024-07-20 01:13:19 +00:00
José Roberto de Souza	0500e35165	intel/dev: Drop writeback_incoherent from Xe2 Xe2 platforms are only supported by Xe KMD that do not support CPU WB + 0 way coherent. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	6d77dfa75d	intel/dev: Use GPU WB PAT for Xe2 writecombining So for this entry we want the CPU mapping to be WC but GPU caches can be WB. This way GPU don't need to snoop to CPU caches and at the end of workloads L3 cache is flushed, so CPU access is coherent after get the signal that workload was finished. With this the transient(XD) L3 flushes will only affect displayable buffers. Ref: Bspec 71582 (r59285) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	48da8eab55	intel/dev: Add comment documenting the PAT entries Like said in the past patch, coherency is not needed and there was a miss understating about caching used by CPU and GPU. With this new comment it much better explained. Ref: Bspec 45101 (r51017) Ref: Bspec 71582 (r59285) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	7295e09b53	intel/dev: Drop coherency from intel_device_info_pat_entry It is not used in run-time so we can drop from the struct. It might have value as PAT entries documentation but that will be done in the next patch. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	4173e0f910	intel/dev: Drop DG1 PAT entries It inherents that table from TGL. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	f9efedb1a1	intel/dev: Replace intel_device_info::apply_hwconfig by a gfx version check There is no plans to remove hwconfig from platforms 12.5 and newer so lets replace this bool by a ip version check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27897>	2024-07-03 22:17:37 +00:00
Jianxun Zhang	77c83069ad	intel/dev: Select a compressed PAT entry (xe2) Fix glxgears (LNL) glxgears: xe/iris_kmd_backend.c:81: xe_gem_create: Assertion `!"" "missing"' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28620>	2024-07-02 19:03:19 +00:00
Francisco Jerez	039f4fe25e	intel/dev: Add GRF size information to the intel_device_info struct. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29926>	2024-06-27 07:39:17 +00:00
Ian Romanick	2bbd0fd9da	intel/brw/xe2+: Add LNL cooperative matrix configurations Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28834>	2024-06-25 14:17:47 -07:00
Ian Romanick	ea6e10c0b2	intel/brw: Temporarily disable result=float16 matrix configs Even though the hardware does not naively support these configurations, there are many potential benefits to advertising them. These configurations can theoretically use half the memory bandwidth for loads and stores. For large matrices, that can be the limiting in performance. The current implementation, however, has a number of significant problems. The conversion from float16 to float32 is performed in the driver during conversion from NIR. As a result, many common usage patterns end up doing back-to-back conversions to and from float16 between matrix multiplications (when the result of one multiplication is used as the accumulator for the next). The float16 version of the matrix waste half the possible register space. Each float16 value sits alone in a dword. This is done so that the per-invocation slice of an 8x8 float16 result matrix and an 8x8 float32 result matrix will have the same number of elements. This makes it possible to do straightforward implementations of all the unary_op type conversions in NIR. It would be possible to perform N:M element type conversions in the backend using specialized NIR intrinsics. However, per #10961, this would be very, very painful. My hope is that, once a suitable resolution for that issue can be found, support for these configs can be restored. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28834>	2024-06-25 13:52:12 -07:00
Francisco Jerez	588c725f27	intel/xe2+: Enable native 64-bit integer arithmetic. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:14:01 -07:00
Jordan Justen	43f795d19f	intel/dev: If building the driver, always allow getting device info Now that we know when we are getting the devinfo as part of the build process, we can just always force the devinfo to be returned, regardless of whether INTEL_FORCE_PROBE is set. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29445>	2024-05-30 22:28:50 +00:00
Jordan Justen	fbf5ea6b44	intel/dev: Silence INTEL_FORCE_PROBE warning for intel_clc Running intel_clc as part of the build doesn't need to issue this warning. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29445>	2024-05-30 22:28:50 +00:00

1 2 3 4 5

203 Commits