diff --git a/docs/relnotes.rst b/docs/relnotes.rst index ed725ad22f5..0417b01f1d9 100644 --- a/docs/relnotes.rst +++ b/docs/relnotes.rst @@ -3,6 +3,7 @@ Release Notes The release notes summarize what's new or changed in each Mesa release. +- :doc:`24.3.0 release notes ` - :doc:`24.2.7 release notes ` - :doc:`24.2.6 release notes ` - :doc:`24.2.5 release notes ` @@ -436,6 +437,7 @@ The release notes summarize what's new or changed in each Mesa release. :maxdepth: 1 :hidden: + 24.3.0 24.2.7 24.2.6 24.2.5 diff --git a/docs/relnotes/24.3.0.rst b/docs/relnotes/24.3.0.rst new file mode 100644 index 00000000000..6c975ab20f1 --- /dev/null +++ b/docs/relnotes/24.3.0.rst @@ -0,0 +1,6156 @@ +Mesa 24.3.0 Release Notes / 2024-11-21 +====================================== + +Mesa 24.3.0 is a new development release. People who are concerned +with stability and reliability should stick with a previous release or +wait for Mesa 24.3.1. + +Mesa 24.3.0 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 24.3.0 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA checksums +------------- + +:: + + TBD. + + +New features +------------ + +- Expose Vulkan 1.3 on v3dv, both rpi4 and rpi5 +- VK_EXT_descriptor_buffer on nvk +- VK_EXT_post_depth_coverage on nvk +- VK_KHR_video_maintenance1 on radv +- VK_EXT_legacy_vertex_attributes on nvk +- GL_KHR_shader_subgroup on radeonsi +- VK_KHR_maintenance7 on nvk +- VK_KHR_dynamic_rendering_local_read on nvk +- GL_ARB_timer_query on Panfrost +- GL_EXT_disjoint_timer_query on Panfrost +- VK_KHR_pipeline_binary on RADV +- VK_KHR_compute_shader_derivatives on anv +- VK_NV_compute_shader_derivatives on nvk +- VK_KHR_compute_shader_derivatives on nvk +- VK_KHR_compute_shader_derivatives on radv +- VK_KHR_shader_relaxed_extended_instruction on anv, hasvk, hk, nvk, radv, tu, v3dv, lvp +- GL_OVR_multiview and GL_OVR_multiview2 on zink +- VK_KHR_shader_float_controls2 on radv +- VK_KHR_shader_float_controls2 on nvk +- VK_EXT_device_generated_commands on nvk, radv +- VK_EXT_host_image_copy on nvk/Turing+ +- VK_EXT_depth_clamp_control on anv, hasvk, nvk, radv +- VK_KHR_shader_quad_control on nvk +- GL_EXT_draw_buffers2 on etnaviv/HALTI5+ +- GL_ARB_draw_buffers_blend on etnaviv/HALTI5+ +- VK_KHR_fragment_shading_rate on NVK +- GL_ARB_draw_indirect on etnaviv/HALTI5+ +- VK_EXT_depth_clamp_zero_one on NVK +- GL_ARB_framebuffer_no_attachments on etnaviv + + +Bug fixes +--------- + +- anv: Use-after-free detected by AddressSanitizer while running dEQP-VK +- Confidential issue #11770 +- radv: DCC causes glitches in Red Dead Redemption 2 +- A5xx rendering issues with firefox +- [ANV][Regression] Broken rendering in Flycast + Per-Pixel Alpha Sorting +- vtn: OpTypeStruct in kernel parameters trigger assertion in glsl_types.h +- anv: Assertion failure in \`dEQP-VK.image.extended_usage_bit_compatibility.image_format_list.s8_uint_optimal_transfer_src_bit` +- radv: Resident Evil 6 Benchmark Tool has artifacts on 7900 XTX when DCC is enabled, game launched on 4K monitor without scaling and with FullHD settings +- [AMD RX 6700 XT] Artifacts while upscaling games in fullscreen mode +- Distorted pixelated graphics with Radeon RX 7900 XT with some games +- Total War Warhammer 2 Graphical Glitch +- Glitching artifacts in tile shaped patterns on 6700 XT, when using upscaled fullscreen game on labwc +- anv: Page fault when using MTL simulator in dEQP-VK.ray_tracing_pipeline.data_spill.report_intersection.float32 +- mesa_cache_db.c:316:33: error: call to undeclared function 'mremap' +- [trunk] shaders fail hard in openmw after cbfc225e2bda2c8627a4580fa3a9b63bfb7133e0 +- u_perfetto.h:33:9: error: unknown type name 'clockid_t'; did you mean 'clock_t'? +- brw_fs_opt_copy_propagation incorrectly handles size changes of uniforms +- nouveau paraview msaa corruption 23.1 bisected regression +- mesa fails to build due to missing SPV_ENV_UNIVERSAL_1_6 symbol +- Missing xshmfence dependency for X11 WSI on macOS +- [r300g] Unigine Tropics: black stripes between center of the screen and horizon +- [AMDGPU RDNA3] Black square artifacts in viewport renders in Blender +- radv: test_sm67_sample_cmp_level test in vkd3d-proton fails +- [ANV] LNL tiled corruption in background traci Blackops3-trace-dx11-1080p-high +- [anv] Possible regression in e3814dee1ac0f90771b921a4f6f5aed10f06e8d4 +- Confidential issue #12092 +- radv: significantly less performance than amdgpu-pro in the AMD FSR2 sample +- radv: wrong index value in radv_skip_graphics_pipeline_compile +- Android: IMapper4 / HIDL is being deprecated +- d3d12_video_screen.cpp undefined reference to ConvertHEVCSupportFromProfile(D3D12_VIDEO_ENCODER_PROFILE_HEVC, D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_SUPPORT_HEVC1*) +- anv: Reduce the number of partial resolves for resource transitions with CCS_E +- Silent Hill 2 Remake performance regression with f5805bcb +- Disk Cache DB file descriptors are not closed on exec +- Shader cache takes too many fds +- Shader cache takes too many fds +- Disk Cache DB file descriptors are not closed on exec +- radv: \`test_instruction_msad_dxil` from vkd3d-proton fails on hawaii +- [anv] functional regression with 02294961eeeec003fd6113c0406a80ea1a2fec72 on DG2 +- [vulkan-intel][regression] War Thunder causes GPU hang +- RFE: rusticl: out-of-order queues +- [nak][nouveau] - error[E0425]: cannot find value \`DRM_RS_IOCTL_NOUVEAU_EXEC` in this scope --> ../src/nouveau/compiler/nak/hw_runner.rs:193:17 +- [radeonsi] glLinkProgram terminates the application when Shader is passing Bindless Texture into function +- gallium: crash when using images obtained from a texture with a bindless handle +- Using DXT1 textures fails in GL ES 1, even with EXT_texture_compression_dxt1 supported +- zink: discrepancy between reported max LOD bias and the actual allowed range +- gallium: Crash when writing to writeonly image3D passed in via parameter +- vulkan/queue: heap-use-after-free when enabling address sanitizer +- r300 regression : Upside down and broken redering for Source games +- vk_texcompress_astc_finish() may crash because astc is null +- intel A770 dg2 silent hill 2 doesn't run +- radv: vkGetPipelineKeyKHR global key is bugged +- Mesa's \`intel_hang_replay` tool fails to work +- Mesa's \`intel_hang_replay` tool fails to work +- radv: fix dEQP-VK.texture.mipmap.3d.image_view_min_lod.base_level.* regressions on GFX6-8 +- ci: Uprev Piglit failed +- PanVK-v10: Missing support for vkCmdDispatchIndirect +- radv: slow packed math in nested loops due to copies +- nvk,nak: Implement VK_KHR_shader_quad_control +- [Turnip] Detroit Become Human gets stuck perpetually compiling a shader +- Faaast Penguin requires force_vk_vendor set +- [clover] linking of libMesaOpenCL.so.1.0.0 fails with LLVM/Clang 20: undefined reference to \`clang::api_notes::APINotesManager::loadCurrentModuleAPINotes(...)` +- zink missing dep on spirv_info_h? +- ANV: Deadlock(game) bugged invisible player/NPC models and missing textures +- anv: dEQP-VK.mesh_shader.ext.misc.clip_geom_provoking_last and dEQP-VK.mesh_shader.ext.misc.clip_geom_and_task_shader_provoking_last fail on MTL +- anv: dEQP-VK.mesh_shader.ext.misc.clip_geom_provoking_last and dEQP-VK.mesh_shader.ext.misc.clip_geom_and_task_shader_provoking_last fail on MTL +- anv: dEQP-VK.subgroups.clustered.compute.subgroupclusteredmul_i64vec4_requiredsubgroupsize and friends time out due to register spilling +- [clc] [FTBFS] [llvm20+] no matching function for DataLayout +- building lavapipe with LLVM-19 and running any vulkan app on a Zen4 CPU shows text on terminal.. +- PanVK-v10: Missing support for vkCmdDraw[Indexed]Indirect +- Adreno 740v3 (Quest 3) Image Corruption +- [regression] "anv: Change params of anv_can_fast_clear_color_view" breaks a bunch of Sparse tests on LNL +- Update defaults for launch crash with XESS in Satisfactory 1.0 +- [bisected][zink] dEQP-GLES3.functional.fbo.multiview.samples_1 failing after feature enabled on zink +- ACO Error: Unsupported opcode: v4 -- AMD Radeon RX 590 Series (Black Myth Wukong Benchmark Tool) +- !31075 cause graphical artifact with Warhammer 40,000: Space Marine 2 +- PanVK-v10: VulkanCTS 1.3.9.2 - 3D Copy-and-blit tests hit GPU fault on Rock5B (RK3588) +- [regression] paraview regressed on T860 +- PanVK-v10: Assert in cs_to_reg_tuple for panvk_v10_CmdWaitEvents2 +- PanVK-v10: VulkanCTS 1.3.9.2 - Segfault in panvk_v10_link_shaders for nocolor-tests on Rock5B (RK3588) +- ANV: scalar FS validation failure is always hit when playing the steam game "Black Myth: Wukong Benchmark Tool" +- PanVK-v10: VulkanCTS 1.3.9.2 - Multiple tests hit "Unknown sampler dim type"-assert on Rock5B (RK3588) +- [REG 24.1.7->24.2.1, bisected] RADV: uVkCompute's mad_throughput benchmark hangs +- [panfrost] rk3588 Ion Fury game (from gog via box64) no image +- [panfrost] rk3588 Ion Fury game (from gog via box64) no image +- [panfrost] rk3588 Ion Fury game (from gog via box64) no image +- [panfrost] rk3588 Ion Fury game (from gog via box64) no image +- [v3d] performance regression in GTK's NGL renderer from 24.1 to 24.2 +- Color attribute not restored properly when using glPushAttrib/glPopAttrib calls +- 69c772e4ea8f (egl: link with libgallium directly) breaks VAAPI - OpenGL interop on radeonsi +- Select VA-API device under Wayland in Multi-GPU setup +- nine: intel-whl: ERROR: src0 is null +- [clc] [FTBFS] [llvm20+] no matching function for GetResourcesPath +- [AMDGPU RDNA3] Viewport compositing generates severe graphical artifacts in Blender +- v3d gbm_bo_unmap() crashes when used by TigerVNC +- !30610 broke piglit program\@execute\@bswap with rusticl on radeonsi +- [Regression] [BISECTED] Genshin Impact: GPU HANG: ecode 12:1:85def5fb +- Arma Reforger causes amdgpu crash on 24.2 +- [Regression][Vulkan][TGL]Failed to render to depth image at non-zero mip level on Mesa 24.1 and later +- [ANV] Breaking Limit benchmark does not run +- [anv][dg2,mtl][bisected][regression] dEQP-VK.ray_tracing_pipeline.acceleration_structures.device_compability_khr.gpu_built.bottom failure +- Artifacts on Black Myth Wukong +- DCS world radeon rx 7900 flicker +- radv/aco: Scheduler bug in ACO somewhere in parallel-gs +- Once human black artifacts consume the screen +- [Regression] Wayland-only MESA fails to build without libx11-xcb-dev, libxcb-dri3-dev, libxcb-present-dev, etc. +- Possible race condition when generating enums.h +- Rusticl: unsigned int division results in wrong value +- a6xx spec\@arb_texture_view\@rendering-formats and spec\@arb_shader_image_load_store\@bitcast regression +- [Bisected] Chromium GPU Process Crashes with Vulkan Backend +- radv: weird dEQP-GLES31.functional.blend_equation_advanced.msaa.* failures if DRLR is enabled with RADV +- Anv/Zink ICL hang +- [anv] possible regression found on 982106e6768e6f69d71710e314325c813591452d +- xe2: Non-compiler assertion failure replaying cp2077 fossil +- iris: GfxBench5's Car Chase slow-clears LOD0 when using HIZ_CCS_WT +- Mesa doesn't build with the following config options +- Intel: Enable Compression on depth buffers (Xe2) +- mpv fails with VA-API for Radeon RX 6600, Mesa 24.3.0-devel +- Failed to cross-compile Mesa-24.2 & Rusticl with bindgen 0.65.1 and rustc 1.80.0 +- Confidential issue #11711 +- Spectacle crashes recording h264 video with 24.2-rc3 +- [nak][nouveau][nvk] FAILED: src/nouveau/compiler/nak +- u_debug_stack_test.capture_not_overwritten fails on i386 +- [bdw,bsw][bisected] \*.spec.nv_compute_shader_derivatives.execution.derivatives* +- a6xx spec\@arb_texture_view\@rendering-formats and spec\@arb_shader_image_load_store\@bitcast regression +- turnip: vulkaninfo crashed +- turnip-freereno Build error (/usr/local/include/xf86drm.h:40:10: fatal error: drm.h: No such file or directory) +- tu: compilation failing when compiling turnip with only kgsl and X11 enabled +- samplerCube constructor in fragment shader no longer converting uvec2 bindless texture handles ( segmentation fault , core dumped ) +- ci/llvm18: AMD unitialized value +- anv: gen9.5 flakiness in dEQP-VK.multiview.dynamic_rendering.depth.* +- Turnip: shaderInt8 support +- intel/brw: q2rtx RT pipeline fails EU validation +- vaapi decoding corruption with green blocks +- interpolateAt precision lowering unhandled by glsl_to_nir() +- [anv] CS2 crashes on LNL +- [anv] Dota2 does not start on LNL +- Raspberry Pi 5 Vulkan 1.3 missing API checklist +- [radeonsi][bisected][regression] glClientWaitSync() quickly times out with INT64_MAX timeout +- d3d10umd: Build regression in 24.2.0-devel +- zink/tu: glcts flake on a750 +- nouveau: advertises GL_EXT_memory_object without implementing \`*UID` callbacks +- LIBGL_DRIVERS_PATH gone +- radv: GPU hang with Tauray +- [Regression][Vulkan][TGL][Bisected]vkCmdCopyQueryPoolResults failed to write buffer with compute pipeline on Mesa 24.1 +- Worms Revolution: not rendering explosion effects +- crash on pushbuf_validate nvc0_blit do_blit_framebuffer +- ci_run_n_monitor.py won't continue enabling the jobs +- ci_run_n_monitor.py doesn't monitor manual jobs +- piglit: cl-api-build-program crashes +- nir: regression with bindless textures (bisected) +- i915g: glGenerateMipmap() fails with 2048×2048 textures +- gallium-va: VA_RC_QVBR support +- nir/glsl: vectorize_io unpredictably vectorizes mismatched types +- [radeonsi] Asterix & Obelix XXLRomastered: river misrendered (completely black) +- Build fails without Vulkan +- No dependency check for PyYAML in meson.build +- GPU Hang in Metal Gear Rising Revengeance +- VK_ERROR_DEVICE_LOST A770 DXVK Fallout 3 + + +Changes +------- + +Aaron Germuth (1): + +- Add VK_EXT_host_query_reset to guest's allowed device extensions + +Aaron Ruby (15): + +- gfxstream: Advertise VK_KHR_external_semaphore_fd always on Linux/Android guests. +- gfxstream: logging on Linux guests +- gfxstream: Fix createResource call for Linux-guest VirtGpu layer +- gfxstream: translate physicalDevices list in VkPhysicalDeviceGroupProperties to their gfxstream objects +- gfxstream: Use vk_find_struct for other struct searches in gfxstream_vk_CreateDevice +- Revert "Reland "Partial revert of aosp/2858589 to avoid Mesa layer for Android"" +- gfxstream: Add "transformList" method for VkFences, in addition to VkSemaphores +- gfxstream: Clean up VK_KHR_external_memory_fd support for Linux +- gfxstream: Revert all unnecessary gfxstream_vk_* object types and corresponding translations in functable. +- gfxstream: Remove VkImage from functable handle translation +- gfxstream: Change createResource to use stride instead of bpp +- gfxstream: Switch to using scanout path with native DRM images for Linux WSI +- gfxstream: Add linearPeerImage for Linux guests, to use for imageLayout queries +- gfxstream: Add vkCmdPipelineBarrier2 implementation in VkDecoderGlobalState +- gfxstream: ResourceTracker: Fix bug in return-code check for blob wait + +Adam Jackson (8): + +- meson: Introduce the legacy-x11 option +- glx: Move DRI2 client support behind legacy-x11 +- egl/x11: Move DRI2 client support behind legacy-x11 +- gallium/video: Move DRI2 support behind legacy-x11 +- rusticl: Add clCreateSubDevices stub +- vtn: Handle SPV_INTEL_optnone +- rusticl: Enable out-of-order execution +- glx: Fix the GLX_EXT_swap_control_tear drawable attributes + +Aditya Swarup (3): + +- iris: Disable fast clear when surface height is 16k +- anv: Disable fast clear when surface height is 16k +- anv: Use RCS engine for copying stencil resource for gfx125 + +Agate, Jesse (2): + +- amd/vpelib: Refactor structs for API change +- amd/vpelib: Refactor frontend and backend config callback + +Alan Liu (2): + +- amd/vpelib: Amend log for tone map support check +- amd/vpelib: Remove unused structs + +Alejandro Piñeiro (5): + +- v3dv/device: set value for maxInlineUniformTotalSize +- v3dv: update conformanceVersion to 1.3.8.3 +- v3dv: expose Vulkan 1.3 +- panfrost: add support for DRM_PANTHOR_DEV_QUERY_TIMESTAMP_INFO on drm-shim +- panfrost: make DRM version panfrost/panthor conditional on drm-shim + +Aleksi Sapon (16): + +- gallivm: Fix LLVMPipe codegen issues discovered on Apple Silicon +- util: const hash_table for some readonly operations +- llvmpipe: much faster sample function cache using RCU pointers +- llvmpipe: half_to_float use LLVM fpext when fp16 is available +- llvmpipe: improvements to JIT assembly dump +- llvmpipe: FlameGraph profiling support +- wsi: Metal support +- wsi: Fix Metal WSI CAMetalDrawable leak +- llvmpipe: fix quad group helper invocation masking +- llvmpipe: correctly implement output variables loads +- llvmpipe: implement alpha-to-coverage dithering +- lavapipe: enable alpha-to-coverage dithering +- llvmpipe: Fix anisotropic sampling aliasing bug +- llvmpipe: Fix pmin calculation +- softpipe: Fix anisotropic sampling aliasing bug +- llvmpipe: update traces for aniso filtering fix + +Alessandro Astone (2): + +- egl/gbm: Walk device list to initialize DRM platform +- panvk: Add missing headers to android platform + +Ali Homafar (1): + +- zink: Optimize descriptor buffers struct filling + +Alvin Wong (1): + +- hasvk: Conditionally expose VK_KHR_present_wait + +Alyssa Rosenzweig (403): + +- vk/meta: add buffer view tracking +- vk/meta: add VK_IMAGE_VIEW_CREATE_INTERNAL_MESA flag +- asahi: drop dead code +- asahi: drop unused patch index buffer lowering +- asahi: refactor prolog lowering +- asahi: drop pointless conversion +- asahi: drop #includes +- libagx: fix prefix sum kernel +- asahi: make GS inputs explicit +- asahi: add TES->compute lowering +- asahi: split up agx_nir_lower_vs_before_gs +- libagx: hoist code out of loop +- asahi: offset buffer images in software +- ail: constify more arguments +- ail: add ail_get_level_size_B helper +- libagx: add statistic increment kernel +- libagx: mask counters to 32-bit +- libagx: rework tess i/o +- hk: add Vulkan driver for Apple GPUs +- agx: fix ballot extend packing +- asahi: don't count helper invs in pipeline stats query +- nir/opt_uniform_atomics: add fs atomics predicated? flag +- nir/divergence_analysis: handle AGX stuff +- agx: add some SRs +- agx: use opt_uniform_atomics +- nir/opt_peephole_select: allow speculatable load constant +- nir: add derivative intrinsics +- nir/opt_peephole_select: allow derivatives +- nir/gather_info: handle derivative intrinsics +- nir/lower_wpos_ytransform: handle intrinsic ddx +- nir/opt_constant_folding: handle derivative intrinsics +- nir/divergence_analysis: handle derivative intrinsics +- nir/instr_set: allow derivative intrinsics +- nir/schedule: handle derivative intrinsics +- treewide: use new-style derivative builders +- vtn: use new derivative builders +- radv: use glsl function name for dFdxfine +- amd: switch to derivative intrinsics +- agx: convert to ddx intrinsics +- agx: fix clang-format +- r600: remove 64-bit ddx handling +- r600: switch to derivative intrinsics +- dxil: switch to derivative intrinsics +- broadcom: switch to derivative intrinsics +- vc4: switch to derivative intrinsics +- brw: switch to derivative intrinsics +- elk: switch to derivative intrinsics +- ir3: switch to derivative intrinsics +- nir: add nir_shader_alu_pass +- treewide: use nir_shader_alu_pass +- nir/lower_alu: use alu_pass +- nir/scale_fdiv: use alu_pass +- nir/opt_idiv_const: use alu_pass +- nir/lower_packing: use alu_pass +- nir/opt_reassociate_bfi: use alu_pass +- asahi: fix warning +- agx: factor out agx_device_key +- agx: exploit soft fault info +- hk,asahi: pass through soft fault to VBO lower +- asahi: optimize out vbo clamp with soft fault +- hk: optimize bounded loads with soft fault +- asahi: remove agx_bo::dev +- asahi: drop segment list parsing +- asahi: drop mprotect in decoding +- asahi: drop agx_bo::type +- asahi: stop fixing up USC addresses +- asahi: drop iThing logic +- asahi: always pass align parameter for BOs +- asahi: introduce agx_va data structure +- hk: fix BDA alignment +- asahi,hk: generalize bo_bind hook +- hk: fix restart index overemit +- nir,agx: add "active threads in subgroup" intrinsic +- libagx: add drawIndirectCount clone&patch kernel +- hk: implement drawIndirectCount +- hk: fix vkd3d-proton descriptor-performance +- agx: optimize popcount(ballot(true)) +- agx: handle min_lod sources +- hk: advertise shaderResourceMinLod +- hk: advertise more eds features +- agx: tweak jmp_exec_none heuristic +- agx: fuse not into if +- agx: let if-fusing opts interact +- hk: denouveau +- hk: advertise KHR_sampler_ycbcr_conversion +- agx: fix block_image_store indirect ts +- nir: model AGX explicit coordinate intrinsics +- nir: add image_block_size_per_thread_agx info +- agx: plumb imageblock stride +- agx: allow st_tile in compute shaders +- agx: handle explicit coordinate tib instructions +- agx: add a comment about an unknown bit +- asahi: pack imageblock words for compute +- asahi: switch to block based compute blitter +- agx: lower exact frcp +- elk: fix compute shader derivatives +- ntt: switch to derivative intrinsics +- r300: switch to derivative intrinsics +- pan/bi: switch to derivative intrinsics +- pan/mdg: convert to derivative intrinsics +- nak: switch to derivative intrinsics +- vtn: infer SpvCapabilityLinkage +- asahi/clc: drop caps list +- nir/divergence_analysis: handle load_agx +- nir,agx: make block image store an image() intrinsic +- agx: support bindless block image store +- agx: fix wait_pix scheduling with eMRT +- ail: swallow up formats +- ail: consider format for compressability +- ail: clarify metadata buffer calculation +- ail: add metadata width/height in tile helpers +- ail: add enums/queries for AGX compression modes +- asahi/clc: add bindless image intrinsics +- libagx: add in-place decompression kernel +- asahi,ail: pull texture view logic into ail +- asahi,hk: deduplicate txf sampler settings +- asahi: skip unmapped bo in fault dbg +- asahi: optimize aligned blits +- asahi: allow rgb9e5 rendering internally +- asahi: factor out agx_usc_shared_non_fragment +- asahi: allow compute blitter to handle rgb9e5 blits +- asahi: fix fbfetch with eMRT +- asahi: add agx_fill_decompress_push helper +- asahi: handle cross-process eMRT +- asahi: add missing upcast for eMRT MSAA +- hk: be stricter about formats in copies +- hk: rework meta for compression +- hk: prep for block based blitter +- hk: remove texel buffers from meta +- hk: fix bg key with eMRT +- hk: handle compressed eMRT +- hk: lock device-wide scratch access +- hk: drop irrelevant comments +- hk: shrink cmd bo +- hk: disable submission merging +- hk: wire up compression +- lima: set no_integers +- etnaviv: set no_integers +- freedreno/a2xx: set no_integers +- svga: set no_integers +- virgl: set no_integers +- nir: use Horner's method for atan +- nir: correct comment for atan range reduction +- nir: simplify atan range reduction +- nir: push down fabs for atan +- nir: extend copysign for no-integer hw +- nir: use copysign for atan +- nir: simplify atan range reduction fixup +- nir: negate an expression in atan +- nir: fuse ffma for atan range fixup +- nir: push up abs in atan2 calculation +- nir: drop NaN fixup for atan +- nir: CSE comparisons in atan2 +- ci: update traces +- virtio/vdrm: don't spam +- tu/virtio: silence startup spam on asahi +- zink: fix buffer view clamp overflow +- vulkan: merge driver internal bits +- vulkan: export vk_robustness_disabled +- hk: add missing dep +- ail: pull in DRM modifier helper +- hk: handle compression modifiers +- hk: disable mods on funny float formats +- hk: advertise modifiers +- hk: switch to vk_meta for buffer copies +- hk: drop some dead code +- hk: drop bad comment +- hk: quiet gcc warning +- asahi: add tib sample setter helper +- hk: fix attachmentless multisampling tile sizes +- hk: ban compression with feedback loops +- hk: lower memory model +- agx: pull out the big hammers for device-barriers +- agx: plumb COHERENT +- hk: free large_bos +- hk: don't leak NIR with VS/GS +- hk: don't leak GS rast with rast disc +- hk: don't leak GS subparts in an error path +- libagx: cleanup loop vars +- libagx: simplify tcs out calc +- libagx: drop unused arg +- asahi: fix UB affecting x86 builds +- asahi: identify factor buffer size +- asahi/decode: fix ptr casts +- asahi/virtio: fix ptr cats +- asahi/virtio: fix allocate-with-alignment +- agx: fix spilling in i686 builds +- agx: fix =spill +- agx: clarify scratch size units +- agx: allocate scratch in spilling unit test +- agx: clarify spill lowering math +- hk: fix weird formatting +- hk: quiesce gcc uninitialized var warning +- hk: fix debug cache key +- hk: fix ptr cast +- hk: shush gcc warning +- hk: fix loop submit with NONE +- hk: fix vertex clustering +- hk: fix oq availability set +- hk: fix phi ordering +- hk: drop dead +- hk: clarify spilling +- hk: mutex shader compiles when debugging +- hk: add notess perftest +- hk: add noborder perf test +- hk: soup up =perf +- hk: optimize empty tiles +- hk: optimize out empty VDM batches +- hk: assert batches are nonempty +- hk: parallelize after-graphics available sets +- hk: run opt_access +- nir: add bounds_agx opcode +- asahi: carveout lower VAs +- hk: use 0 pointer for 0-sized buffers +- agx: handle bounds_agx +- hk: clean up before lowering bounds checks +- hk: smarten bounds check lowering +- agx: optimize robust access +- agx: clean up packs from bounds check +- asahi/lower_vbo: optimize robustness2 +- asahi: fix speculation of rgb32 loads +- asahi,hk: disable rgb32 buffer textures +- asahi: optimize trivial load_invocation_id +- agx: handle hw tess eval shaders +- asahi: add tess assert +- asahi: add nosoft debug +- agx: increase lower vars to scratch thresh +- hk: fix =spill with tess +- agx: add spilling debug helpers +- agx: fix spilling around exports +- agx: add helper to pad 16-bit to 32-bit +- agx: use padding helper +- agx: factor out reserved_region +- agx: fix RA shuffling algorithm +- agx: add helper to visualize reg file +- asahi/genxml: add nicer error checking +- asahi: fix AGX_MAX_OCCLUSION_QUERIES +- asahi,hk: fix st_tile mask size +- agx: validate phi sources are defined +- agx: validate sizes are consistent in the IR +- agx: add pass manager +- agx: split up optimizer +- agx: use reserved_size +- asahi: #define the ABI +- asahi: move sample mask to r1l +- agx: quiesche assert with r1l preload +- agx: fix corner case scratch allocation +- agx: don't reserve regs if we won't use them +- agx: fix shuffling with large reserved region +- agx: print pack assertions to stderr +- agx: don't propagate constants from trivial phis +- hk: bump max input/output components +- agx: expand macro +- agx: fix backwards copyprop with phis +- agx: add unit test for tricky backwards phi case +- hk: fix tess with notess +- hk: drop dead +- hk: flush with events +- agx: gimme c23 +- agx: fix corner with uniform source lowering +- agx: use reserved_size more +- agx: commonize reg_to_ssa update +- agx: factor out insert_copy +- agx: add another RA torture mode +- agx: fix shuffling with partial clobbered vector +- agx: add reg to agx_index +- agx: reset kill bits in liveness +- agx: strengthen assert +- agx: assert a subtle invariant +- agx: produce regful SSA in RA +- agx: lower swaps late +- agx: commonize ra_class +- agx: fix shuffles of vectors +- agx: validate RA +- hk: fix shadowing +- hk: fix partial EOT program +- hk: fix ZLS across split CRs +- hk: add helper to dump draw +- agx: fix load_first_vertex +- hk: split out HK_PERFTEST +- hk: add nobarrier test +- hk: add =batch perftest +- hk: add norobust test +- hk: parametrize sysmem heap fraction +- hk: reduce heap size +- asahi: optimize !robustness with soft fault +- asahi: optimize !robust VBO +- hk: optimize !robust VBO +- hk: use vk_meta for compressed MSAA copies +- hk: fix eMRT decompress of msaa +- hk: end pregfx even if there's no gfx +- hk: fix emrt compressed msaa arrays +- hk: enable compressed MSAA +- hk: reformat driconf +- util,hk: add border colour driconf +- hk: use real opaque black with integers +- hk: drop #include +- hk: perf debug resolves +- hk: perf debug some meta +- hk: Reset rendering attachment locations in BeginCommandBuffer/Rendering +- asahi,hk: implement remapping in epilog +- hk: advertise KHR_dynamic_rendering_local_read +- asahi: rm include +- agx: inline texture states into EOT +- hk: fix store_op_none +- hk: support geom/tess subgroups +- hk: drop FSR todo +- hk: check arch in pipeline cache +- hk: fail probing silently +- hk: drop gunk +- nir: extract round component helper +- nir/opt_algebraic: optimize patterns hit with OpenCL +- util/ralloc: add total_size helper +- nir: correct comment +- nir/serialize: fix name +- asahi: split out genxml header +- asahi: fix spacing +- asahi: assume compiler is not stupid +- asahi: normalize enum formatting +- asahi: migrate f-strings +- asahi: fix indentation +- asahi: cleanup validation +- asahi: cleanup generate valid +- asahi: drop extra new line +- asahi: guard some apple code +- asahi: guard darwin stuff +- asahi: fix newlines +- asahi: add exact fields to genxml +- asahi: convert to exact +- asahi: drop trailing +- asahi: be explicit about LOD clamps +- asahi: drop zero defaults +- asahi: drop macOS defaults +- asahi: drop PBE default +- asahi: drop fragment face 2 defaults +- asahi: set exact on some fixed? unknown fields +- asahi: drop comment +- asahi: drop comment about blob weirdness +- asahi: drop dated comments +- asahi: idetnify Clamp W +- asahi: set clamp W? explicitly +- asahi: drop unused default +- asahi: be explicit about usc reg unk +- asahi: be explicit about frag properties +- asahi: automatically set default to encode to zero +- asahi: drop implied defaults +- asahi: get rid of default from the XML +- asahi: make wrapping less silly +- asahi: don't do extra runtime validation for exact +- asahi: defer fixed->float to tess +- asahi: rm tessellator branch +- asahi,agx: clang-format +- agx: split up f16 ops +- agx: special case mov_imm +- agx: drop encoding_16 +- agx: encoding_32 -> encoding +- agx: consistent ffma name +- asahi: presumably fix indirect indexed tess +- libagx: extend indirect tess to handle indexed +- hk: plumb indirect_local +- hk: add mechanism to test indirects +- hk: wire up indirect tess +- hk: drop store_op_dontcare w/a +- hk: switch to 64-bit queries +- hk: use push size macro +- hk: bump max push size +- asahi: fix no16 flag +- agx: fix uniform packing with local_load +- agx: pack agx_cf_binding +- agx: move binary_size into info +- hk: fix alpha-to-coverage with sample shading +- hk: allow tess modes in either stage +- hk: make tess partitioning dynamic +- libagx: fix missing statics +- libagx: don't rely on loop unroll in txs +- libagx: drop generated VDM tess path (for now?) +- libagx: make points mode dynamic +- libagx: always tessellate clockwise +- asahi/clc: strip nir +- asahi: drop unused +- asahi: pad blit key +- asahi: use indirect ptr helper more +- asahi: add the BO even for internal dispatches +- asahi: fix indirect buffer UAF +- asahi: fix extremely subtle UAF +- asahi: assert guard previously-subtle code +- asahi: don't leak blit shaders +- asahi: don't leak binaries +- asahi: don't leak linked shaders +- asahi: don't leak drm version +- nir,agx: lower fmin/fmax in NIR +- agx: optimize wait_pix a bit +- agx: negate iadd/imsub constants +- agx: test immediate packing opt +- agx: run algebraic later +- asahi,agx: rework constant promotion upload +- agx: don't upload constant padding at the start +- agx: fold more inots +- hk: enable constant promotion +- nir/divergence_analysis: add AGX atomics +- clc: add mesa_clc tool +- asahi/clc: ingest spir-v +- asahi: fix a2c with sample shading, harder + +Amber (4): + +- ir3: add encoding support for 64-bit atomics introduced in a7xx. +- ir3: add support for 64 bit atomics +- nir: add pass to lower atomic arithmetic to a loop with cmpxchg. +- tu, ir3: Implement VK_KHR_shader_atomic_int64 for a7xx. + +Andrew Woloszyn (2): + +- Added some fixes for zircon tests in the Android CTS. +- Fix for dynamic rendering. + +Anil Hiranniah (1): + +- panfrost: Fix a memory leak in the CSF backend + +Antonio Ospite (2): + +- android: simplify building libgallium_dri on Android +- android: fix build by removing references to dri-search-path + +Asahi Lina (9): + +- asahi: Add 1queue debug option +- asahi: Print queue ID with batch debug +- asahi: Add more batch debug +- asahi: Fix non-async flush +- asahi: Relax cross-context flush +- asahi: Update to UAPI 10011 +- asahi: Do not sync against our own queue +- agx: Fix implicit sync for virtgpu +- agx: Fix queue destroy op for virtgpu + +Assadian, Navid (8): + +- amd/vpelib: Change Max DS support to 4:1 +- amd/vpelib: Add API function to get taps +- amd/vpelib: Fix CS translation for geometric downscaling +- amd/vpelib: Fix whitepoint for geometric downscaling +- amd/vpelib: Add 420 semi-planar 12bit handling +- amd/vpelib: Document public API structures +- amd/vpelib: Add better rotation and mirror capability check +- amd/vpelib: Add mirror & rotation capability check function for vpe1.1 + +Bas Nieuwenhuizen (1): + +- radv: Disable EXT BDA capture and replay. + +Benjamin Cheng (1): + +- radv/video: add event support for VCN4 + +Benjamin Herrenschmidt (1): + +- dril: Fixup order of pixel formats in drilConfigs + +Benjamin Lee (6): + +- nvk: implement vkCmdDispatchIndirect on pre-turing +- panvk: don't dirty preloaded tiles for writeback +- panvk: fix combined image/sampler descriptor arrays +- panvk: allow resuming secondary cmdbufs with dynamic rendering +- panvk: flush sync point before executing secondary cmdbufs +- panvk: hack to improve depth clipping with small viewport depth range + +Benjamin Otte (2): + +- nvk: Don't emit critical messages during init +- pvr: Don't emit critical warning all the time + +Bernhard Rosenkränzer (1): + +- dril: Fix return types of peglInitialize and peglTerminate + +Bo Hu (2): + +- goldfish: handle null handle +- gfxstream: global_state_wrapped_decoding of vkCreateComputePipelines + +Boris Brezillon (157): + +- panvk: Adjust RGB component order for fixed-function blending +- util/hexdump: Squeeze repeated patterns instead of zero-ed chunks +- panvk: Make sure we emit blend descriptors even if no FS shader is provided +- pan/genxml: Fix Resource descriptor alignment +- pan/decode: Select the right registers when dumping local storage descriptors +- pan/decode: Interpret LOAD_MULTIPLE opcode when decoding a command stream +- pan/cs: Allow lazy root chunk allocation +- pan/cs: Provide cs_root_chunk_{gpu_addr,size}() and cs_is_empty() helpers +- pan/shader: Fix FS varyings.input_count initialization on v9+ +- pan/bi: Support native layer_id store/load on Valhall +- pan/tex: Allow format reinterpretation on v9+ +- vk/sync: Use the proper type in vk_filter_{src,dst}_access_flags2() +- nir/format_convert: Promote input to 32-bit before packing integers +- vk/format: Add missing R10G10B10A2_SINT entry in the pipe -> vk table +- vk/image: Add a vk_image_buffer_range() helper +- vk/image: Fix the view extent of uncompressed views of compressed images +- vk/meta: Let meta users pass the color attachment write mask explicitly +- panvk: Don't spam the error output if the device is not for us +- panvk: Rename the error labels in create_device() +- panvk: Add missing vk_device_finish() calls +- pan/texture: Allow uncompressed view of compressed image +- panvk: Rework the NULL test in panvk_buffer_{gpu_ptr,range}() +- panvk: Constify the shader pointers in the graphics state +- panvk: Add helpers to save/restore the compute/graphics state for meta operations +- panvk: Rework the modifier selection +- panvk: Switch to vk_meta +- panvk: Lower load_multisampled_pan +- panvk: Add support for render-pass resolve operations +- panvk: Fix depth-stencil AFBC support +- panvk: Allow compressed formats +- panvk: Allow ZS formats on 1D/3D images +- panvk: Lower var copies before lowering IOs +- panvk/ci: Enable clearing tests +- panvk: Prepare the blend logic for Valhall +- panvk: Collect allocated push sets at the command level +- panvk: Prepare panvk_cmd_pool for CSF +- panvk: Make panvk_queue_finish() a per-arch function +- panvk: Allow more descriptor sets on Valhall +- panvk: Move panvk_debug_adjust_bo_flags() to panvk_device.h +- panvk: Conditionally register an host address when tracking user memory +- panvk: Prepare the cmd_desc_state logic for Valhall +- panvk: Prepare cmd_meta for Valhall +- pan/cs: Make sure we're not passed cs_now() to asynchronous instructions +- pan/cs: Accept val=undef when cs_branch_label() condition is ALWAYS +- pan/cs: Add helpers to emit match (AKA switch-case) blocks +- pan/cs: Allow sparse register set passed to loads/stores +- pan/cs: Add an optional load/store tracker to make sure we don't omit waits +- pan/cs: Add inline specifiers to a bunch of cs\_ helpers +- pan/cs: Add a facility to restrict register access +- pan/kmod: Don't cap VM bind operations to one +- pan/desc: Add a struct for valhall/bifrost to the union in pan_tiler_context +- pan/desc: Make vertex_count a Midgard field +- panvk: Fix the varying pool label +- panvk: Kill the set_dirty() macro +- panvk: Drop the NO_MMAP flag when PANVK_DEBUG=trace +- panvk: Fix valhall_pack_buf_idx() +- panvk: Don't open-code vk_index_type_to_bytes() +- panvk: Don't make panvk_device size gen dependent +- panvk: Make sure push sets are reset when picked from the CommandPool +- panvk: Prepare the panvk_device logic for CSF +- panvk: Set the submit mode to THREADED_ON_DEMAND +- panvk: Move Cmd{Bind,Push}DescriptorSets implementation to cmd_desc_state.c +- panvk: Move cmd_bind_shaders() to panvk_vX_shader.c +- panvk: Move CmdPushConstants2KHR to panvk_vX_cmd_push_constant.c +- panvk: Be robust against allocation failures in a command buffer context +- panvk: Don't pretend we can compile for v9 +- panvk: Transition to explicit VA assignment on v10+ +- panvk: Add a read-write non-cached memory pool for CSF events +- panvk: Add a CSF backend for panvk_queue/cmd_buffer +- panvk: Advertize v10 support +- panvk/csf: Fix missing TSD pointer initialization on RUN_IDVS +- panvk/csf: Fix inverted address/destination register in CmdSetEvent2() +- pan/va: Accept DIM_SUBPASS[_MS] in valhall_tex_dimension() +- panvk: Temporarily disable VK_FORMAT_D32_SFLOAT_S8_UINT on Valhall +- panvk: Fix valhall_pack_buf_idx() for SSBO store/atomic operations +- panfrost: Emit an SSBO table on v9+ +- nir/lower_ssbo: Add an option to conditionally lower loads +- nir/lower_ssbo: Extend the load_ssbo_address intrinsic to pass an offset +- panfrost: Pass the architecture to panfrost_nir_lower_sysvals() +- pan/va: Optimize SSBO lowering on Valhall +- panvk/csf: Unconditionally clean L2 and LS caches when closing a CS +- pan/cs: Fix buffer overflow in cs_block_end() +- pan/cs: Make sure we don't overflow the chunk capacity +- pan/cs: Make sure cs_alloc_ins() is never passed num_instrs=0 +- pan/cs: Make the CS loop helpers more robust +- pan/cs: Pass the block to end to cs_block_end() +- pan/cs: Drop the cs_builder::blocks::cur field +- pan/cs: Don't use a list for our block stack +- pan/cs: Add helpers for if/else blocks +- panvk/csf: Replace a cs_while() by a cs_if() +- panvk/csf: Pass a non-zero flush-id to benefit from cache flush reduction +- panvk/csf: Fix TLS pointer copy +- panvk: Use nir_metadata_control_flow +- panvk: Lower get_ssbo_size() on Valhall +- panvk: Pass the robustness state to nir_lower_descriptors() +- panvk: Hook up descriptor array bounds checking +- vk/meta: Make sure texel is 32-bit in build_buffer_to_image_cs() +- panfrost: Don't turn 3D/cube images into 2D arrays +- pan/decode: Interpret CS_BRANCH instructions +- pan/desc: Add layer_offset field to pan_tiler_context::valhall +- pan/genxml: Fix layer_offset definition on v9+ +- panvk/csf: Fix multi-layer rendering +- panvk: Protect access to the virtual address heap +- pan/va: Fix nir_op_pack_uvec4_to_uint +- pan/genxml: Remove surface descriptors on v6/v7 +- pan/genxml: Fix surface descriptor alignment on v6 +- pan/genxml: Fix resource table alignment constraint on v9 +- pan/kmod: Try to use local storage in panthor_kmod_vm_bind() +- panvk: Don't reset errno in panvk_kmod_zalloc() +- panvk: Stop including pan_blend.h from panvk_device.h +- panvk: Use the proper alignment for texture payloads +- panvk: Check allocation failures in panvk_shader_upload() +- panvk: Report allocation failures in the CreateImageView path +- panvk: Fix FB initialization when using non-compressed views of compressed images +- panvk: Fix stencil attachment resolve on JM +- panvk/v10: Add missing cleanup_queue() in queue_finish() +- panvk: Don't call queue_finish() on non-initialized queues +- panvk: Don't leak vertex shader program descriptors +- panvk: Let panvk_priv_bo_create() return a VkResult +- panvk: Catch indirect allocation failures +- panvk: Make sure RUN_IDVS has a valid tiler descriptor +- panvk: Fix bounds checking in upload_shader_desc_info() +- panvk/csf: Fix prepare_vp() when viewport->x is negative +- panvk: Advertise KHR_create_renderpass2 support +- vk/meta: Fix buffer -> image copy using a graphics pipeline +- zink: Adjust the surface size of non-compressed views +- vk/image: Fix the extent adjustment of non-compressed views +- vk/meta: Support VkShaderExt objects to allow tracking internal shaders +- vk/meta: Allow tracking of driver-specific objects in the meta list +- vk/meta: Make some helpers public +- panvk: Add a debug flag to force image copies through the gfx pipeline +- panvk: Add an helper to create internal shaders +- panvk: Keep our blend shaders in vk_meta_device +- panvk: Keep our copy_desc shader in vk_meta_device +- panvk: Emit textures needed for FB preload at image view creation time +- panvk: Store attachment image views in the graphics state +- panvk: Implement a custom FB preload logic +- panfrost: Move pan_blitter.{c,h} to the gallium driver +- panvk: Make panvk_pool_free_mem() error proof +- panvk/csf: Fix no-fragment IDVS +- panvk/csf: Fix dirty checking in prepare_ds() +- panvk/csf: Fix cmd_emit_dcd() in the FB preload logic +- panvk/csf: Fix a buffer/stack-overflow when PANVK_DEBUG=sync +- panvk: Advertise dynamic rendering support +- panvk/csf: Fix the clear-only RUN_FRAGMENT case +- pan/util: Discard depth/stencil writes when early fragment test is forced +- panvk: Set .is_blit=true when compiling preload shaders +- panvk: Fix the ZStencil descriptor initialization in the preload logic +- panvk: Fix the hierarchy_mask selection +- panvk: Fix the maximum sampler LOD bias +- pan/va: Don't set bit 26:27 on texture instructions +- pan/va: Add support for nir_texop_lod +- pan/bi: Add support for nir_texop_lod +- pan/desc: Fix the CRC validity update when clean tiles are not written +- pan/cs: Fix lazy allocation support +- pan/bi: Expose the packed TextureOperationDescriptor in bifrost_texture_operation +- panvk/csf: Fix a wait-LS operation in finish_cs() + +Boyuan Zhang (3): + +- frontends/va: get cu_qp_delta from vaapi +- radeonsi/vcn/enc:enable hevc cu_qp_delta for vcn2+ +- radeonsi/vcn: fix loop filter across slices + +Brendan (5): + +- amd/vpelib: Create virtual stream concept +- amd/vpelib: Multiple instance support in caching framework +- amd/vpelib: Update configs to be per-pipe specific +- amd/vpelib: Visual confirm on input +- amd/vpelib: Fix duplicate config writer bug + +Caio Oliveira (91): + +- intel/brw: Simplify \@file annotations +- intel/brw: Move remaining compile stages to their own files +- intel/brw: Move and reduce scope of run_*() functions +- intel/brw: Reduce scope of some VS specific functions +- intel/brw: Reduce scope of some FS specific functions +- intel/brw: Reduce scope of some GS specific functions +- intel/brw: Reduce scope of some TCS specific functions +- intel/brw: Reduce scope of some TES specific functions +- intel/brw: Reduce scope of some MESH specific functions +- intel/brw: Move printing functions to its own file +- intel/brw: Move out of fs_visitor and rename print instructions +- intel/brw: Remove workgroup_size() helper from fs_visitor +- intel/brw: Move regalloc and scheduling functions out of fs_visitor +- intel/brw: Move calculate_cfg out of fs_visitor +- intel/brw: Move VARYING_PULL_CONSTANT_LOAD from fs_visitor to fs_builder +- intel/brw: Move interp_reg and per_primitive_reg out of fs_visitor +- intel: Let compiler set indirect_ubos_use_sampler +- intel/brw: Don't retype load_subgroup_invocation result to signed +- intel/brw: Use CSE for LOAD_SUBGROUP_INVOCATION +- meson: Update Lua dependency version to 5.3 +- intel/brw: Also return the size of the assembled shader +- intel: Add executor tool +- intel/executor: Use PRIx64 to fix building in 32-bit +- intel/brw: Use %td format for pointer difference +- meson: Only use fallback for Lua if building freedreno tools +- intel/brw: Avoid modifying the shader in assign_curb_setup if not needed +- intel/brw: Remove unused variable +- intel/brw: Use a helper for common VEC pattern +- intel/brw: Remove more uses of variable length arrays +- intel/brw: Move encoding of Gfx11 3-src inside the inst helpers +- intel/brw: Explicitly map brw_reg_file into hardware values +- intel/brw: Make BAD_FILE the zero value for brw_reg_file +- intel/brw: Remove unused ARF values and helpers +- intel/brw: Remove long register file names +- intel/brw: Pack brw_reg struct +- intel/brw: Simplify fs_inst annotation +- intel/brw: Remove special treatment for 2-src in emit() helper +- intel/brw: Use linear walk when shader requires DERIVATIVE_GROUP_LINEAR +- intel/executor: Fix SWSB for sync.nop +- compiler: Allow derivative_group to be used for all stages in shader_info +- vk/runtime: Allow "require full subgroups" for Mesh/Task +- intel/executor: Small fixes to the help message +- spirv: Rename vtn_type::deref to vtn_type::pointed +- spirv: Remove (pointed) type from vtn_pointer +- spirv: Rename vtn_pointer::ptr_type to vtn_pointer::type +- vulkan: Update XML and headers to 1.3.295 +- spirv: Update headers and metadata from latest Khronos commit +- spirv: Prefer symbols from Khronos extension for compute shader derivatives +- nir: Allow Mesh/Task to use implicit LOD when DERIVATIVE_GROUP is set +- spirv: Allow Mesh/Task to use derivative execution modes +- anv: Advertise VK_KHR_compute_shader_derivatives +- intel/executor: Fix a couple of memory leaks in the tool +- intel/brw: Only validate GRF boundary crossing restriction for GRFs +- intel/brw: Consider pipe when comparing SWSB in tests +- intel/brw: Add SWSB MATH pipe to assembler +- intel/brw: Add DUMP flag to brw_assemble +- intel/executor: Dump both pre-processed source and assembly +- intel/brw: Skip per-primitive inputs when computing flat input mask +- intel/brw: Remove Gfx8- from test_eu_validate.c +- intel/brw: Update DPAS validation tests for Xe2 +- intel/brw: Enable EU validation and compaction tests for Xe2 +- intel/brw: Enable mi_builder test for Xe2 +- spirv: Update SPIR-V grammar to use aliases +- intel/brw: Add ERROR helper variant that returns to EU validation +- intel/brw: Create a struct to hold a decoded brw_inst in eu_validation +- intel/brw: Add opcode to brw_hw_decoded_inst +- intel/brw: Add num_sources to brw_hw_decoded_inst +- intel/brw: Add has_dst to brw_hw_decoded_inst +- intel/brw: Add exec_size to brw_hw_decoded_inst +- intel/brw: Add access_mode to brw_hw_decoded_inst +- intel/brw: Add dst/srcs reg file to brw_hw_decoded_inst +- intel/brw: Add dst/srcs type to brw_hw_decoded_inst +- intel/brw: Add pred/cmod/sat to brw_hw_decoded_inst +- intel/brw: Add dst/src0 address_mode to brw_hw_decoded_inst +- intel/brw: Add abs/negate to brw_hw_decoded_inst +- intel/brw: Add dst/srcs register numbers to brw_hw_decoded_inst +- intel/brw: Add vstride/width/hstride to brw_hw_decoded_inst +- intel/brw: Remove the remaining DO_SRC macro from EU validation +- intel/compiler: Rename shader_stats structs +- intel: Add statistic for Non SSA registers after NIR to BRW +- intel/brw: Print Non-SSA regs after NIR in debug output +- intel/brw: Fix Gfx9 3-src validation to handle FIXED_GRF +- intel/brw: Add phases to backend +- intel/brw: Validate some instructions exists only up until some phases +- intel/brw: Reduce scope of has_source_and_destination_hazard +- intel/brw: Add SHADER_OPCODE_REDUCE +- intel/brw: Add SHADER_OPCODE_*_SCAN +- intel/brw: Move emit_scan/emit_scan_step near its usage +- intel/brw: Add fs_builder::BROADCAST() helper +- intel/brw: Ensure BROADCAST() value respect register alignment +- intel/brw: Add SHADER_OPCODE_VOTE_* + +Carlos Santa (2): + +- intel/hang_replay: fix the typo in the ioctl call +- intel/hang_replay: remove EXEC_OBJECT_WRITE + +Casey Bowman (5): + +- vulkan/screenshot-layer: Add Vulkan screenshot layer +- vulkan/screenshot-layer: Fix image index selection +- vulkan/screenshot-layer: Increase buffer sizes +- vulkan/screenshot-layer: Fix memory leaks +- vulkan/screenshot-layer: Remove extraneous malloc operations + +Chan, Roy (2): + +- amd/vpelib: Fix compilation warnings +- amd/vpelib: Revise the config sharing handling + +Chang, Tomson (2): + +- amd/vpelib: Add and fix collaborate sync data +- amd/vpelib: Fix color fill performance issue on VPE1.1 (#419) + +Charmaine Lee (1): + +- svga: sync up with the latest svga include files + +Chenyu Chen (1): + +- amd/vpelib: Remove unused define macro + +Chia-I Wu (32): + +- panvk: remove unused panvk_physical_device::master_fd +- panvk: refactor panthor_kmod_dev creation +- panvk: query device ids of drm device nodes +- panvk: advertise VK_EXT_physical_device_drm +- panvk: fill in VkExternalBufferProperties +- panvk: fill in VkExternalImageFormatProperties +- panvk: remove panvk_GetPhysicalDeviceImageFormatProperties +- panvk: advertise VK_KHR_external_memory_fd +- panvk: advertise VK_EXT_external_memory_dma_buf +- panvk: advertise VK_KHR_external_{fence,semaphore}_fd +- panvk: remove panvk_GetImageDrmFormatModifierPropertiesEXT +- panvk: fix VkDrmFormatModifierPropertiesListEXT query +- panvk: check VkPhysicalDeviceImageDrmFormatModifierInfoEXT +- panvk: advertise VK_KHR_image_format_list +- panvk: advertise VK_EXT_image_drm_format_modifier +- panvk: remove unused is_sampler and is_texture +- panvk: fix descriptor set layout hash +- panvk: fix texture size/levels/samples query +- panvk: fix gl_VertexIndex +- panvk: fix scissor box +- panvk: disable depth write when depth test is disabled +- panvk: remove an incorrect assert in collect_cs_deps +- Revert "panvk: Set the submit mode to THREADED_ON_DEMAND" +- panvk/jm: assert that the submit mode is not threaded +- panvk: add support for VK_KHR_timeline_semaphore +- panvk: fix truncated access flags in collect_cs_deps +- panvk: expand meta stage and access flags +- panvk: use force_fb_preload for unaligned preload +- panvk: force_fb_preload should insert a barrier +- panvk: fix missing same-subqueue wait for CmdWaitEvents2 +- panvk: fix dummy sampler handle for vs +- panvk: ensure res table is restored after meta + +Christian Gmeiner (70): + +- etnaviv: isa: left shift is 3 bit long +- dri: fix driver names +- etnaviv: Rework uniform handling for UBO addresses +- meson: Add missing inc's to idep_nir_headers +- nir_lower_mem_access_bit_sizes: Support load_kernel_input +- ci: austriancoder farm maintenance +- nak: Move bitset to compiler crate +- nak: Move cfg to compiler crate +- etnaviv: Move halti determination to drm +- etnaviv: Switch etna_compiler_create(..) to etna_core_info +- etnaviv: Extend shader structs with etna_core_info +- etnaviv: Drop halti from etna_specs +- etnaviv: isa: Add conv instruction +- etnaviv: Drop has_sin_cos_sqrt and has_sign_floor_ceil +- etnaviv: Switch to shader_core_count from etna_core_info +- etnaviv: Switch to vertex_cache_size from etna_core_info +- etnaviv: Switch to vertex_output_buffer_size from etna_core_info +- etnaviv: npu: Switch to use etna_core_info +- etnaviv: npu: Drop not used spec values +- etnaviv: Switch to num_constants from etna_core_info +- etnaviv: Switch to max_registers from etna_core_info +- etnaviv: Switch to stream_count from etna_core_info +- etnaviv: Switch to etna_core_has_feature(..) for npot_tex_any_wrap +- etnaviv: Switch to etna_core_has_feature(..) for has_halti2_instructions +- etnaviv: Drop assignment of get_timestamp +- broadcom/simulator: Add support for asahi based systems +- compiler/rust: Copy MappedInstrs from NAK +- etnaviv: Update headers from rnndb +- etnaviv: nir: Make use of nir_lower_fragcolor +- etnaviv: Report number of render targets +- etnaviv: nir: Make use of reported number of render targets +- etnaviv: Call etna_resource_level_mark_changed(..) for all rts +- etnaviv: Extend frag_rb_swap for MRTs +- etnaviv: Extend compiled_shader_state for MRTs +- etnaviv: Extend compiler for MRTs +- etnaviv: Extend etna_record_flush_resources(..) for MRTs +- etnaviv: Extend etna_update_blend_color(..) for MRTs +- etnaviv: Extend etna_update_zsa(..) for MRTs +- etnaviv: Extend etna_set_framebuffer_state(..) for MRTs +- etnaviv: Extend etna_update_ts_config(..) for MRTs +- etnaviv: Emit MRT states +- etnaviv: Emit PS_OUTPUT_REG2 if num_rt is 8 +- etnaviv: blend: Store information per render target +- etnaviv: blend: Move logicop_enable assignment +- etnaviv: blend: Add support for MRTs +- etnaviv: blt/rs: Just pass the color to the callee +- etnaviv: rs: Extend to support MRTs +- etnaviv: blt: Extend to support MRTs +- etnaviv: shader: Apply output remapping +- etnaviv: shader: Apply output saturation +- etnaviv: Relink shader when framebuffer state changed +- etnaviv: TS usage for MRT needs HALTI2 +- etnaviv: Limit max supported render targets +- etnaviv: Report correct PIPE_CAP_MAX_RENDER_TARGETS value +- etnaviv: Enable EXT_draw_buffers2 for HALTI5+ +- etnaviv: Enable ARB_draw_buffers_blend for HALTI5+ +- etnaviv: Improve split sampler check +- etnaviv: blt: Convert assert to real if +- etnaviv: Implement ARB_draw_indirect +- v3d: Add a few function traces +- vc4: Add a few function traces +- docs: Update the list of drivers with CPU tracepoints +- etnaviv: nir: Enforce stricter swizzle for virtual scalar x register +- etnaviv: nir: Switch to derivative intrinsics +- compiler/rust: Copy NirInstrPrinter from NAK +- etnaviv: isa: Add clamp0_max instruction +- etnaviv: isa: Add img_load instruction +- etnaviv: Enable ARB_framebuffer_no_attachments +- etnaviv: isa: Add img_store instruction +- etnaviv: Fix incorrect pipe_nn creation + +Colin Marc (2): + +- vulkan/video: set HEVC nuh_temporal_id_plus1 and nal_unit_type correctly +- radv/video: set TemporalId correctly + +Collabora's Gfx CI Team (10): + +- Uprev Piglit to 4a62c26721a47552a96416a134b789a813dd51a6 +- Uprev Piglit to 0453436872b6e4d502c2e87817addb95e0d77e3b +- Uprev Piglit to f11abb664bfcad09586f32f411b90331e23be2e5 +- Uprev Piglit to 1d8fab9b49cc487ebcc949119eeb166fb3fa7591 +- Uprev Piglit to 6533b66878c8cb9361c554b7ff940518ab12c816 +- Uprev Piglit to a3826de3c26a279599d15b018a9a3e75ca46f4f8 +- Uprev Piglit to 93b4bd2e0aaab1c22ae3e1a23f9e057a8f7451b2 +- Uprev Piglit to 7ce69da1199d12ed0ddaa251ed489750523798fb +- Uprev Piglit to 791e420b2628c1e35eea81b3bafdb1c904a141e8 +- Uprev Piglit to c2b31333926a6171c3c02d182b756efad7770410 + +Connor Abbott (60): + +- egl/x11: pass render fd to screen create for zink +- freedreno/ci: Combine and document failures due to test bug +- ci: Move two failing loader-related tests to all-skips.txt +- ci: Uprev VK-CTS to 1.3.9.0 +- tu: Fix off-by-one in UBO CP_LOAD_STATE size +- ir3, tu: Use a UBO for VS primitive params on a750+ +- Revert "tu/a750: Disable HW binning when there is GS" +- ir3: Fix nir_op_vec8/vec16 +- freedreno/computerator: Fix enabling wave128 +- freedreno/a7xx: Document compute dispatch tiling registers +- freedreno, tu, ir3: Enable tiled workgroup item dispatch on a7xx +- ir3: Move reconvergence backedge handling +- tu: Fix passthrough D24S8 attachments +- tu: Treat partially-bound depth/stencil attachments as passthrough +- tu: Add a750 flush workaround and re-enable UBWC for storage images +- freedreno: Rename CP_SET_CTXSWITCH_IB to CP_SET_AMBLE +- ir3: Increase compute const size on a7xx +- tu: Initial support for VK_KHR_calibrated_timestamps on a750 +- freedreno/a6xx+: Add CP_CONTEXT_SWITCH_CNTL bitfields +- ir3: Ban conversions with mismatching sizes +- Reapply "ci/lima: Temporarily disable" +- ir3, turnip: Support VK_*_compute_shader_derivatives on a7xx +- tu: Don't WFI after initializing RB_CCU_CNTL on a7xx +- tu: Split out register setting from tu6_init_hw() +- tu: Write bin size for each bin +- tu: Re-emit LRZ state before each bin +- tu: Simplify device startup CS creation +- tu: Emit CP_SET_AMBLE packets +- freedreno: Bump kernel uapi +- tu/virtio: Make virtio_simple_ioctl take a vdrm_device +- tu: Allow being preempted on a7xx +- tu: Add TU_DEBUG=hiprio +- vulkan/state: Handle NULL in DS input attachment mapping correctly +- vulkan/state: Track the input attachment count +- v3dv: Don't misuse nir_variable::data.index +- spirv: Make the default input attachment index ~0 +- nir/lower_input_attachments: Handle unscaled input attachments with no index +- ir3: Fix non-bindless s2en texture/sampler order +- tu: Support color attachment remapping +- tu: Support dynamic input attachments +- tu: Make input attachments always contain a real descriptor +- tu: Fix flushes for feedback_invalidate case +- tu: Track possible feedback loops for dynamic renderpasses +- tu: Expose VK_KHR_dynamic_rendering_local_read +- ir3, tu: Plumb through support for per-shader robustness +- tu: Implement VK_EXT_pipeline_robustness +- freedreno/fdl: Extend 2bpp UBWC special case to 1bpp +- freedreno/a6xx: Remove dead fd6_get_ubwc_blockwidth() call +- freedreno/fdl: Add UBWC config struct +- freedreno: Add default UBWC config values +- virtio/drm: Update header +- tu: Gather UBWC config +- freedreno/fdl: Add tiling/untiling implementation for a6xx/a7xx +- tu: Add a flag for cached non-coherent BOs +- tu: Implement VK_EXT_host_image_copy +- freedreno: Add compute constlen quirk for X1-85 +- tu: Implement VK_PIPELINE_CREATE_2_VIEW_INDEX_FROM_DEVICE_INDEX_BIT_KHR +- tu: Don't invalidate CS state for 3D blits +- ir3: Fix detection of nontrivial continues +- tu: Re-emit visibility stream before each render pass + +Constantine Shablia (4): + +- vk/meta: Add copy/fill/update helpers +- pan/va: Define the TEX_GRADIENT instruction +- pan/va: Lower nir_texop_txd to TEX_GRADIENT with derivs followed by TEX_SINGLE +- pan/bi: Lower nir_texop_txd to TEXC in GRDESC_DER mode followed by sampling TEXC + +Corentin Noël (2): + +- ci: Allow to pass the PIGLIT_RUNNER_OPTIONS variable +- virgl: Avoid a race condition on handle removal + +Craig Stout (18): + +- util: fix fossilize_db compile error +- util: fix 32bit compiler error in fossilize_db +- [guest] Check that the singleton VirtGpuDevice matches desired capset +- Reapply "[guest] Check that the singleton VirtGpuDevice matches desired capset" +- Fix dispatchable object layout on Fuchsia +- Exclude Android entry points on Fuchsia +- [guest] Fix Fuchsia build +- [guest] Add skeleton Fuchsia VirtGpu implementation +- [guest] Build fixes for Fuchsia +- [guest] Fuchsia: include missing Vulkan entry points +- [guest] Remove Fuchsia syslog_static dependency +- [guest] Fix Fuchsia build, add missing header +- [guest] Fuchsia: open magma device +- [guest] Fuchsia: initialize queries +- [guest] Fix Fuchsia build +- tu: add OS guards to drm_format_mod +- u_gralloc: include dep_android_mapper4 as needed +- meson: add log dependency to dep_android + +Daniel Almeida (6): + +- nouveau/headers: Run rustfmt on generated files +- nouveau/headers: Add a Rust struct for each method +- nouveau/headers: Add an nv_push crate in Rust +- util: memstream: add fflush support +- nak: memstream: move into common code +- util: u_memstream: add tests + +Daniel Schürmann (55): + +- aco/reindex_ssa: free memory of previous live variable sets +- aco: implement aco::validate_live_vars() +- aco/scheduler: enable live variables validation when ACO_DEBUG=validate-livevars is set +- aco: don't attempt to spill dead phis +- aco/cssa: fix kill flags during lowering to CSSA +- aco: change signature of get_live_changes() and get_temp_registers() +- aco/cssa: update RegisterDemand and validate live variable information +- aco/cssa: short-cut some trivial case +- aco: call nir_copy_prop() and nir_opt_dce() before instruction selection +- nir: add nir_block \*nir_src_get_block(src) helper +- nir/opt_if: optimize phis between similar IFs +- nir/opt_if: merge IFs which have phis between them +- aco/ra: add add_rename() helper +- aco/ra: avoid rename map lookups in most cases +- aco/ra: use auto for std container iterators +- aco/ra: use arena allocator for hash maps +- aco/live_var_analysis: Don't attempt to re-insert the same temporary twice into live set +- aco: introduce new Operand flag 'Clobbered' +- aco/live_var_analysis: use Clobbered flag to calculate additional operand demand +- aco: introduce Operand flag 'CopyKill' +- radv: promote VK_NV_compute_shader_derivatives -> VK_KHR_compute_shader_derivatives +- aco/live_var_analysis: don't set lateKill for p_interp_gfx11 m0 operand +- aco: simplify Definition constructors +- aco/lower_to_hw: don't allocate new temporaries +- aco/reindex_ssa: remove update_live_out parameter +- aco: remove Program::allocationId +- aco/live_var_analysis: don't test whether phis are assigned to VCC +- aco/ra: add RegisterFile::fill_killed_operands(Instruction*) helper +- aco/ra: manually fill killed operands when required +- aco/ra: remove unnecessary check for duplicate precolored operands +- aco: introduce 'isPrecolored' flag for Operand and Definition +- aco: set Precolored flag before register allocation +- aco/live_var_analysis: check for isPrecolored flag rather than isFixed +- aco/ra: make use of Precolored flag +- aco/ra: always block register file for precolored operands +- aco/live_var_analysis: check isFixed() for definitions in order to set needs_vcc +- aco/spill: fix faulty assertions +- aco/spill: use float division for score() calculation rather than integers +- nir/divergence: separately indicate whether loops have divergent continues or breaks +- nir/opt_uniform_atomics: don't preserve divergence information +- nir: remove nir_update_instr_divergence() +- nir: change signature of nir_src_is_divergent() +- treewide: use nir_src_is_divergent() rather than checking the divergence of the SSA +- nir/divergence: require nir_metadata_block_index +- nir/divergence: Introduce and set nir_def::loop_invariant +- nir: consider loop invariance in nir_src_is_divergent() +- nir/divergence: calculate divergence without requiring LCSSA form +- nir/divergence: skip expensive nir_src_is_divergent() check in most cases +- nir/divergence: Don't assume that LCSSA phis are not loop-invariant +- treewide: don't lower to LCSSA before calling nir_divergence_analysis() +- aco: Respect addressible SGPR limit in VS prologs +- aco/ssa_elimination: perform jump threading after parallelcopy insertion +- aco/ssa_elimination: remove some redundant checks during jump threading +- aco: move jump threading optimization into separate pass +- aco/README: add descriptions of recently added passes + +Daniel Stone (146): + +- build: Check for PyYAML in Meson build +- dri: Allow INVALID for modifier-less drivers +- gbm/dri: Remove erroneous assert +- ci: Create new 'performance' stage +- ci/amd: Move manual/nightly jobs to postmerge stage +- ci/intel: Move manual/nightly jobs to postmerge stage +- ci/broadcom: Move manual/nightly jobs to postmerge stage +- ci/zink: Move manual/nightly jobs to postmerge stage +- ci/llvmpipe: Move manual/nightly jobs to postmerge stage +- ci/nouveau: Move manual/nightly jobs to postmerge stage +- ci/etnaviv: Move manual/nightly jobs to postmerge stage +- ci/panfrost: Move manual/nightly jobs to postmerge stage +- ci/run_n_monitor: Add --include-stage argument +- ci/run_n_monitor: Add --exclude-stage filtering +- ci/run_n_monitor: Remove --force-manual +- ci/run_n_monitor: Add --dry-run argument +- ci: Make per-build dependencies optional +- doc/llvmpipe: Update URL to fix linkcheck +- ci/fdno: Add some flaky tests for a618 +- ci/dozen: Add a couple more skips +- ci/windows: Don't upload shader cache to artifacts +- ci/{lvp,venus}: Skip massive-alloc tests +- intel/tests: Reduce load from anv_tests +- util/tests: Reduce load from sparse-array test +- ci/angle: Clean up stray open-coded path +- ci/lava: Fix path to structured logger +- ci/lava: Make it easier to run pytest locally +- ci/lava: Don't dump venv into .venv +- ci/lava: Allow passing more args to pytest +- ci/lava: Fix pytest not passing farm value +- ci/lava: Fix fastboot tests +- ci/alpine: Fix shellcheck errors +- ci: Fix shellcheck warnings in firmware download +- ci: Bring back yamllint and shellcheck +- ci: Run pytest once for both LAVA and bin/ci +- ci: Bring back shader-db and drop LTO +- ci/intel-gpu-freq: Fix shellcheck warning +- ci/vulkan: Backport dEQP commit to make it less OOMy +- ci/lima: Temporarily disable +- ci: Add Python requirements include +- ci: Add pyutils container +- ci/python: Allow empty PYTHONPATH +- ci/lava: Use new pyutils container +- doc/llvmpipe: Fix broken link syntax +- docs/gallium: Fix broken GL extension link +- doc/linkcheck: Exclude cgit.fd.o +- doc/vc4: Remove reference to old shader-db +- ci/windows: Increase dozen-deqp fraction +- ci/svga: Add two Piglit flakes seen in the wild +- ci/svga: Only run half the Piglit tests +- ci/virgl: Special-case llvmpipe parallelisation +- ci: Avoid subshell for executing HWCI_TEST_SCRIPT +- ci/bare-metal: Don't move structured log file +- ci: Always create results dir from init +- ci/android: Use common $RESULTS_DIR for cuttlefish +- ci/deqp: Use common $RESULTS_DIR +- ci/dxvk: Use common results dir +- ci/piglit: Use common $RESULTS_DIR +- ci/trace: Move trace cleanup to Piglit runner +- ci/gtest: Use common $RESULTS_DIR +- ci/kdl: Use common $RESULTS_DIR +- ci/devcoredump: Use common $RESULTS_DIR +- ci/xorg: Capture Xorg log in results artifacts +- ci/kdl: Fix KDL install location +- ci: Upgrade ci-kdl +- ci: Use new arguments to ci-kdl to avoid child management +- ci: Make token parsing more robust +- ci: Make common rules common +- ci: Make setup-test-env safe for set -u +- ci/piglit: Make build-piglit safe for set -u +- ci/deqp: Make build-deqp safe for set -u +- ci/apitrace: Make build-apitrace safe for set -u +- ci/va: Make build-va-tools safe for set -u +- ci/crosvm: Make build-crosvm safe for set -u +- ci/wayland: Make build-wayland safe for set -u +- ci/libdrm: Make build-libdrm safe for set -u +- ci/windows: Make build-directx-headers safe for set -u +- ci/wine: Make build-wine safe for set -u +- ci/libclc: Make build-libclc safe for set -u +- ci/vulkan: Make build-vulkan-validation safe for set -u +- ci/vkd3d: Make Proton build safe for set -u +- ci/angle: Make build-angle safe for set -u +- ci/deqp-runner: Make build-deqp-runner safe for set -u +- ci/skqp: Make build-skqp safe for set -u +- ci/skqp: Reorder some things +- ci/skqp: Don't hardcode LLVM version +- ci/kernel: Make download-prebuilt-kernel safe for set -u +- ci/swrast: Fix library paths for llvmpipe/softpipe +- ci/swrast: Archive Xvfb logs +- ci/zink: Fix library path +- ci/virgl: Fix library path +- ci/{b2c,swrast,layered}: Strip unnecessary things from runtime containers +- egl/x11: Downgrade warning to info +- ci: Disable trace jobs using old Wine +- ci: Remove non-Proton Wine +- ci/vk: Strip and optimise validation layers +- ci: Strip yet more unnecessary things from the rootfs +- ci/deqp: Flatten fraction/shard sed into a single pass +- ci/deqp: Compress caselists with zstd +- ci/angle: Update ANGLE, reduce build times +- ci: Don't carry static libraries around for tests +- ci/shellcheck: Don't overwrite SCRIPTS_DIR with relative path +- ci/shellcheck: Don't exit on first failure +- ci/python: Fix section emission +- ci: Don't move results directory +- ci: Don't print structured log data URL +- ci/lava: Remove pointless messages +- ci/lava: Quieten timestamp warnings from tar +- ci/lava: Break section-header print into separate function +- ci/lava: Add section for device wait +- ci/lava: Rename lava_boot section +- ci/lava: Hide more boot details into sections +- ci/lava: Explicitly pass UTC timezone +- ci/lava: Truncate printed times +- ci/lava: Fix colour definitions +- ci/lava: Be a little less enthusiastic with bold +- ci/lava: Change default section colour to cyan +- ci/hw: Wrap pre-test setup in collapsed section +- ci/lava: Print relative timestamps in sections +- ci/lava: Coalesce post-processed job information +- ci/bare-metal: Move preparation steps into section +- ci/bare-metal: Use section for boot messages from cros-servo +- ci/bare-metal: Truncate printed times and prefixes +- ci/bare-metal: Use section for boot messages from poe-powered +- ci/crosvm: Make crosvm-runner safe for set +u +- ci/crosvm: Replace SCRIPT_DIR with SCRIPTS_DIR +- ci/traces: Use sections to quiet execution +- ci: Quieten post-test cleanup +- ci: Make section emission really quiet +- ci: Quieten test execution +- ci/lava: Source setup-test-env +- ci/mold: Use appropriate parallelism and strip binaries +- ci: Really remove Mesa drivers from rootfs +- ci/firmware: Allow calling firmware as sourced script +- ci/angle: Use relative paths +- ci/skqp: Fix relative paths +- ci: Use sections for test container builds +- ci/skqp: Set $SKQP_ARCH from build-skqp.sh +- ci/rpi: Add flakes seen in the wild +- ci/fdno: Add flake seen in the wild +- ci/lava: Remove duplicate build download +- ci/lava: Use LAVA rootfs overlays for build/per-job +- ci/lava: Provide LAVA rootfs URL directly +- ci/lava: Provide list of overlays to submitter +- ci/lava: Refactor is-rootfs-uploaded function +- build: Don't run wayland-protocols tests + +Daniel Svensson (1): + +- zink: add spirv_info_h dep to libzink. + +Danylo Piliaiev (20): + +- tu/a750: Allow mutable images to have UBWC with all compatible formats +- tu: Enable UBWC for D24S8 with USAGE_SAMPLED and formatless border color +- freedreno: Add enum for fd_lrzfc_layout::dir_track values +- tu/u_trace: Add support for u_trace inidirect params +- tu/u_trace: dispatch indirect dims and LRZ status as indirect params +- freedreno: Clarify RB_BLIT_INFO::TYPE field +- tu/a7xx: Use generic clear for LOAD_OP_CLEAR +- tu/a7xx: Use generic clear for CmdClearAttachments +- tu/a7xx: Use generic clear for image clears +- tu/a7xx: Use BLIT_EVENT_STORE_AND_CLEAR when appropriate +- tu: Fix ZPASS_DONE having stale value with generic clear+early frag test +- ir3: Respect aligment of required consts when allocating ubo,preamble +- u_trace: Fix trace_payload_as_extra_func desync between drivers +- tu/a750: Workaround GPU fault when fast-clearing R8G8 formats +- freedreno/rnndec: Consider array length when finding by reg name +- freedreno/cffdec: Add option to dump bindless descriptors +- nir/opt_16b_tex_image: Sign extension should matter for texel buffer txf +- util/vma: Fix util_vma_heap_get_max_free_continuous_size calculation +- ir3: Fix cat5 parsing with a1.x src present +- nir/nir_opt_offsets: Do not fold load/store with const offset > max + +Dave Airlie (27): + +- draw/orcjit: supply stub function for tcs coro +- llvmpipe/cs/orcjit: add stub function name for coro +- gallivm/sample: fix sampling indirect from vertex shaders +- nvc0: fix null ptr deref on fermi due to debug changes. +- anv/video: add mode costs for h264 encoding +- vulkan/video: handle KHR_video_maintenance1 lack of profile list. +- radv/video: handle inline queries for vulkan video encode. +- radv/video: add KHR_video_maintenance1 support +- radv/video: fix reporting video format props for encode. +- radv: move video decode enable test into a flag +- radv: turn video decode/encode on for VCN4 with latest fw +- llvmpipe: handle stride properly on lvp udmabuf imports +- llvmpipe: make sure to duplicate the fd handle before giving out +- vulkan/video: fix vui encoding +- vl/bitstream: use an int32_t for se encoding. +- radv/anv/video: handling encoding both sps and pps in same buffer +- radv: Fix radeon_enc_code_ue with values over 2^16 +- radv/video: fix encode reference slot counting +- radv/video/enc: report pps overrides in feedback for h265 +- radv/video: use the h264 defines for macroblock w/h +- radv/video/enc: report pictureAccessGranularity of CTB size. +- radv/video: add encode field for vcn4 +- radv/video: handle missing h265 feedback struct. +- zink: free modifiers +- zink: handle mutable check without crashing if format list not found +- zink: whitespace fixups +- va: add some h264 fields + +David Heidelberg (66): + +- freedreno: Enable Adreno 306A +- freedreno: support lua pkgconfig variants +- freedreno: drop hack for old dEQP +- freedreno: Introduce emit_blit_fini +- freedreno: introduce color format parameter to emit_blit_buffer_dst +- freedreno: Implement fast clear_buffer for Adreno 6xx and 7xx +- freedreno: drop dEQP hacks +- intel/genxml: fix length of HCP_FQM_STATE for gen20 and 125 +- llvmpipe: Silence "possibly uninitialized value" warning for ssbo_limit (cont) +- meson/addrlib: allow unintialized callbacks +- ci/alpine: use llvm variables +- ci/alpine: build test with LLVM 18 +- ci/alpine: use packaged spirv-llvm-translator-dev +- ci/alpine: enable ORCJIT +- ci/alpine: libclc on Alpine offering necessary targets spirv-mesa3d-;spirv64-mesa3d- +- freedreno: Implement get_compute_state_info for Adreno 6xx/7xx +- freedreno: update night expectations +- etnaviv: rename enums_h appropriately +- etnaviv: build dependency for the etnaviv tests +- freedreno: Convert to SPDX-License-Identifier instead of pasting whole license +- freedreno: use unicode © instead of DOS-like (C) copyright sign +- bin/gen_release_notes: adjust checksums section +- ci/freedreno: add Adreno 306/307 occasional timeout +- ci/iris: implement nightly CL testing using piglit on ADL +- panfrost: drop leftover definition after pan_nir_lower_64bit_intrin removal +- nir_lower_mem_access_bit_sizes: Assert when 0 components or bits are requested +- freedreno/ir3: Use nir_lower_mem_access_bit_sizes instead custom lowering +- ci/intel: Officially switch intel-adl-cl to pre-merge +- ci/freedreno: move disabled a530 entries back to main gitlab-ci.yml +- ci/freedreno: mark jobs to be retested with patched 6.11 kernel +- ci/freedreno: re-enable piglit tests for Adreno 530 in nightly runs +- ci/freedreno: mark a530_piglit as postmerge job +- panfrost/midgard: Implement nir_lower_mem_access_bit_sizes pass +- panfrost/midgard: Support 64bit pack/unpack +- panfrost/midgard: Lower fisnormal +- panfrost/midgard: Lower \*hadd instructions +- ci/panfrost: Implement basic nightly OpenCL testing on T860 +- ci/panfrost: switch to 6.11 with timestamp patches for traces +- ci/panfrost: update furmark checksum for T860 traces +- ci/panfrost: update rock-5b from rc5 to rc7 +- freedreno/ir3: mad.x24 is not safe to lower +- freedreno/ir3: Do not allow 16-bit mad.x24 +- pvr: unify code into one condition +- ci/piglit: keep the include_test.h used for OpenCL testing +- ci/fedora: when doing non-LTO build, we need C_ARGS too +- ci: enable IPO/LTO only for nightly builds +- ci: restrict number of maximum threads +- util: add aarch64 fast half-float to float conversion +- amd: Pass addrlib cpp args to the tests +- osmesa: Fix OSMesaGetDepthBuffer() test without depth +- ci/etnaviv: unify job naming with the rest of the CI +- ci/freedreno: switch to dash instead of underscore, same as rest of the CI +- ci/alpine: stay trendy, Alpine flies on LLVM 19 now +- build: Fix LTO with Android NDK +- ci/build: debian-build-testing should follow same limit as other jobs +- ci/fedora: add workaround for buggy GCC warning triggered by r600 +- ci/build: Do not do LTO on any pre-merge job +- freedreno/ir3: Lower fisnormal op +- ci/freedreno: Introduce OpenCL testing for Adreno 618, 660, and 750 +- freedreno: python fixes +- util: Drop 3Dnow optimisation leftovers +- util: Remove MMX/MMXext detection code +- util: Drop ancient Intel CPU detection +- util: drop XOP detection code +- llvmpipe: align with u_cpu_detect struct changes +- compiler/rust: drop duplicated bindgen check + +David Rosca (147): + +- radeonsi/vcn: Only enable filler data for CBR +- radeonsi/vcn: Bump enc FW interface version for VCN3 and VCN4 +- radeonsi/vcn: Add support for QVBR rate control mode +- radeonsi/vcn: Fix MB/CTB calculation for slice encoding +- dri: Use --whole-archive when linking libdri +- radeonsi/vcn: Allow dynamic change of QVBR quality level +- radeonsi/vcn: Add decode DPB buffers as CS dependency +- gallium: Add header_flags to pipe_h2645_enc_picture_desc +- frontends/va: Don't check header type for packed header buffers +- frontends/va: Keep track if VPS/SPS/PPS/AUD was sent +- frontends/omx: Request SPS PPS for IDR pictures +- radeonsi/vcn: Only insert headers when requested for H264/5 +- frontends/va: Fix use after free with EFC +- radeonsi: Don't allow DCC for encode in is_video_target_buffer_supported +- vulkan/format: Add VK_FORMAT_G12X4_B12X4R12X4_2PLANE_420_UNORM_3PACK16 +- radv/video: Add support for 12-bit AV1 decode +- gallium: Change pipe_h265_enc_rate_control to array +- frontends/va: Assert maximum number of temporal layers +- frontends/va: Get per temporal layer params for HEVC +- frontends/va: Parse packed header slice for HEVC TemporalId +- frontends/va: Fix leaks with multiple coded buffer segments +- util/rbsp: Fill bits again if reading more than 16 leading zeros +- frontends/va: Implement DPB management for H264/5 encode +- frontends/va: Only use parameters for first slice in H264/5 encode +- frontends/va: Parse H264 slice packed header +- frontends/va: Parse HEVC slice packed header +- frontends/omx: Adjust to encoding changes +- radeonsi: Add GPU copy path to si_video_resize_buffer +- radeonsi/vcn: Switch to app DPB management for H264 and HEVC encode +- radeonsi/vcn: Deduplicate header encoding functions +- radeonsi/vcn: Fix radeon_enc_code_ue with values over 2^16 +- radeonsi/vcn: Use pipe header params in HEVC header encoder +- radeonsi/vcn: Use pipe header params in H264 header encoder +- radeonsi: Enable transform_skip HEVC feature support +- radeonsi: Implement buf_ofst_info in si_video_resize_buffer GPU copy path +- radeonsi/vcn: Support H264 constrained intra prediction +- d3d12: Remove get_feedback_fence implementation +- frontends/va: Implement sync buffer/surface timeout for encode feedback +- pipe: Add int return value to video end_frame +- frontends/va: Check for errors from end_frame in vlVaEndPicture +- radeonsi/vcn: Implement get_feedback_fence +- pipe: Use correct struct for h264/h265 dpb entry +- frontends/va: Allocate surface buffers on demand +- frontends/va: Allow drivers to allocate and use encode DPB surface buffers +- egl/wayland: Fix flush after blit in swap buffers multi-GPU case +- frontends/va: Fix locking in vlVaDeriveImage +- frontends/va: Fix locking in vlVaQueryVideoProcPipelineCaps +- vulkan/wsi: Fix stack-use-after-scope in x11_surface_create_swapchain +- frontends/va: Parse more H264 PPS for transform_8x8_mode_flag +- radeonsi/vcn: Stop hardcoding values in VCN version overrides +- radeonsi/vcn: Add support for H264 8x8 transform on VCN5 +- radv/video: Don't use op_close in encode +- radv/video: Don't use multiple tasks for one encode operation +- ci: Disable building OMX driver +- Remove OMX +- docs: Remove OMX frontend +- glx: Destroy base screen after deinitScreen +- pipe: Add pipe_h264_nal_unit_type and pipe_h265_nal_unit_type enums +- frontends/va: Support raw packed headers for H264 and HEVC +- frontends/va: Allow multiple NALUs in coded buffer segments +- radeonsi/vcn: Support raw packed headers for H264 and HEVC +- radeonsi: Remove unused radeon_temporal.h +- radeonsi/vcn: Don't copy the pipe enc structs +- pipe: Remove video enc header_flags +- meson/megadriver: Add megadriver_libdir argument +- targets/va: Build va driver into libgallium when building with dri +- targets/vdpau: Build vdpau driver into libgallium when building with dri +- radeonsi/vce,uvd_enc: Stop using obsolete rate control params +- radeonsi/vce: Stop using base.level and base.max_references +- radeonsi/uvd_enc: Stop using base.level +- radeonsi/vcn: Stop using base.level for H264 level_idc +- d3d12: Stop using base.level for H264 level_idc +- frontends/va: Set default encoding parameters at context creation +- Revert "frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture" +- frontends/va: Create encoder at context creation +- pipe: Remove unused fields in video rate control +- frontends/va: Reset intra refresh in beginPicture +- radeonsi: Fix reported minimum width for video encode +- frontends/va: Remove fallback if app doesn't enable packed headers +- radeonsi: Disable EFC on VCN 2.2 +- radeonsi/vcn: Add rc_per_pic_ex encode command +- ac: Fix typo RENCDOE -> RENCODE +- ac: Add ac_vcn_init_enc_cmds +- ac: Add remaining VCN encode defines +- radeonsi/vcn: Use ac_vcn_enc_init_cmds and AV1 defines from ac +- ac: Add VCN IB parser +- radeonsi/vcn: Enable IB parsing with AMD_DEBUG=ib +- radeonsi/vcn: Don't reuse context with multiple VCN instances +- frontends/va: Fix AV1 packed header parsing +- pipe: Add video fence_wait +- radeonsi/uvd: Implement fence_wait +- radeonsi/vcn: Implement fence_wait +- radeonsi/vpe: Implement fence_wait +- d3d12: Implement fence_wait +- r600/uvd: Implement fence_wait +- frontends/va: Use fence_wait instead of get_*_fence +- pipe: Remove video get_*_fence +- radeonsi/vcn: Allow per-frame QP and max frame size changes +- radeonsi/vcn: Use correct initial DPB size for HEVC encode +- radeonsi/vcn: Don't hardcode nal_ref_idc +- radeonsi/vcn: Fix some small coverity issues +- radeonsi/vcn: Fix out of bounds write when invalidating QP map regions +- radeonsi/vcn: Fix out of bounds read in H264 decode +- frontends/va: Implement DPB management for AV1 +- frontends/va: Parse more AV1 frame and sequence header params +- frontends/va: Support raw packed headers for AV1 +- radeonsi/vcn: Switch to app DPB management for AV1 +- frontends/va: Remove now unused ref fields for AV1 encode +- radeonsi/vcn: Support raw packed headers for AV1 +- radeonsi/vcn: Fix coded size in sequence header on VCN5 +- radeonsi/vcn: Add support for VCN5 AV1 compound +- radeonsi/vcn: Add support for VCN5 dpb tier2 +- frontends/va: Fix parsing HEVC st_ref_pic_set +- frontends/va: Parse H264 prefix for temporal_id +- frontends/va: Calculate VBV params for other layers relative to base layer +- radeonsi/vcn: Implement per-layer rate control for HEVC +- radeonsi/vcn: Select layer for rate control +- frontends/va: Fix some small coverity issues +- frontends/va: Fix NULL check in intra refresh params +- frontends/va: Fix parsing leb128 when using more than 4 bytes +- frontends/va: Don't relock mutex in vlVaEndPicture +- pipe: Fix video enc max DPB size for H264 +- frontends/va: Evict unused surfaces from encode DPB +- radeonsi/vcn: Only enable skip mode with matching references +- frontends/va: Add support for AV1 still picture encode +- radeonsi/vcn: Add support for AV1 still picture encode +- radeonsi/uvd_enc: Implement fence_wait +- radeonsi/vce: Implement fence_wait +- radeonsi: Enable VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION on VCE/UVD +- radeonsi/uvd_enc: Allocate session buffer in VRAM +- radeonsi: Support HEVC features and block sizes for UVD +- radeonsi/vcn: Enable VCN4 AV1 encode WA +- radeonsi/vce: Remove dual instance support +- frontends/va: Remove H264 encode delayed flush +- frontends/va: Stop using PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION +- pipe: Remove PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION +- frontends/va: Move encode fence to coded buffer +- radeonsi/vcn: Don't try to override HEVC SPS conformance window +- radeonsi/vcn: Gracefully handle encode errors and report to frontend +- radeonsi/vcn: Rework encode padding and session init params +- radv/video: Fix H264 slice control +- radv/video: Fix HEVC slice control +- radv/video: Report correct encodeInputPictureGranularity +- radv/video: Avoid selecting rc layer over maximum +- radeonsi/vcn: Use correct frame context buffer for preencode on VCN5 +- frontends/va: Create surfaces with correct fourcc for RT format +- frontends/va: Use compositor blit with different number of planes + +Deborah Brouwer (15): + +- ci/lava: Detect a6xx gpu recovery failures +- ci: add firmware files to rootfs +- ci/intel: add i915/MTL firmware to rootfs +- ci: remove xfail program\@build\@include-directories +- ci: remove obsolete build-hang-detection script +- ci/deqp-runner: fix option to build from a commit +- ci/deqp-runner: print version info +- ci/deqp-runner: update instructions for uprevving +- ci/deqp-runner: uprev from 0.20.0 to 0.20.2 +- ci/deqp-runner: implement max-fails +- freedreno/ci: remove redundant skip files +- ci: simplify .baremetal-arm32-asan-test +- ci: simplify .baremetal-arm64-asan-test +- freedreno/ci: convert a630-gles-asan to toml suite +- ci/b2c: update RESULTS_DIR for .b2c-test jobs + +Denis (1): + +- freedreno: Enable A505 + +Derek Foreman (5): + +- meson: Update wayland-protocols to 1.38 +- vulkan/wsi/wayland: Use presentation timing v2 when available +- vulkan/wsi/wayland: Use fifo protocol for FIFO +- vulkan/wsi/wayland: Pace frames with commit-timing-v1 +- adv+zink/ci: Add a recent flake + +Dmitry Baryshkov (3): + +- freedreno/ir3: Treat MESA_SHADER_KERNEL in the same way as compute +- freedreno/ir3: Lower the hadd operations +- rusticl: Enable on freedreno + +Dmitry Osipenko (6): + +- util/mesa-db: Fix missing O_CLOEXEC +- util/mesa-db-multipart: Open one cache part at a time +- util/mesa-db: Open DB files during access time +- util/mesa-db: Fix crash on compacting empty DB +- util/cache_test: Add mesa-db test for adding cache entry bigger than empty cache +- ci/zink: Mark glx-multithread-clearbuffer flake on ADL + +Donald Robson (2): + +- pvr: Stop creating waits when there are no waits +- pvr: Fixed creation of waits in queue submission + +Dylan Baker (35): + +- compilers/clc: Add missing break statements. +- mesa: fix memory leak when using shader cache +- util/glsl2spirv: fixup the generated depfile when copying sources +- util/glsl2spirv: only copy the input file if we're going to change it +- tgsi_to_nir: free disk cache value if the size is wrong +- crocus: properly free resources on BO allocation failure +- crocus: check for depth+stencil before creating resource +- mesa/program: remove memcpy where src == dst +- docs: add release notes for 24.2.2 +- docs: update sha sums for 24.2.2 +- docs: update sha sums for 24.2.2 +- iris: Run checks that do not require resources before creating them +- anv: if queue is NULL in vm_bind return early +- intel: replace \`(uint64_t - uint64_t) > 0` with \`uint64_t > uint64_t` +- anv: assert we don't write past the end of an array +- anv: enforce state->cmd_buffer is never null in emit_Simpler_shader_init_fragment +- intel/elk: delete copy constructor and copy-assignment-operator +- compiler/glsl: explicitly delete copy constructor and copy-assign +- util/fossilize_db: don't use check_files_opened_successfully for foz_prepare +- anv: remove useless uint >= 0 check +- hasvk: remove useless uint >= 0 check +- intel/perf: delete dead code. +- meson: use \`add_project_arguments` instead of \`global` +- anv: enable VK_EXT_ycbcr_2plane_444_formats +- release: push 24.3 out two weeks +- anv: assert that we don't read off the end color_att array +- anv: when handling descriptor sets, layout cannot be NULL if dynamic_offets are used +- VERSION: bump for 24.3.0-rc1 release +- .pick_status.json: Update to ced2404cb433deaa84cf6cf9edce212733117c0b +- .pick_status.json: Update to 5e0b81413de588803c9a0736acd8decd40d19ab6 +- .pick_status.json: Update to b0c9789bc1ed808d29f642e9445599dc85896444 +- VERSION: bump for 24.3.0-rc2 release +- .pick_status.json: Update to b8d253b30c0a8c1878f2429f0a4df0a068bbf0aa +- .pick_status.json: Update to 8845255881f03df9a695fae164a64201c4211d35 +- .pick_status.json: Update to 9b779068c3a7a8ded00f1d2519fd01dd1d05f3d8 + +Echo J (3): + +- d3d10umd: Use pipe_resource_usage enum in translate_resource_usage() +- util: Fix the integer addition in os_time_get_absolute_timeout() +- compiler/rust: Force native build for the compiler_proc library + +Eric Engestrom (249): + +- VERSION: bump to 24.3 +- docs: reset new_features.txt +- loader: gc loader_get_extensions_name() and __DRI_DRIVER_{GET\_,}EXTENSIONS defines +- radeonsi/ci: skip timing out test +- freedreno/ci: double job timeout for a306 +- freedreno/ci: document extra variants of failing tests on a618 and a630 +- anv+zink/ci: mark some tests as fixed +- anv+zink/ci: document two tests, one failing and one crashing +- anv+zink/ci: mark a couple of tests as flaky +- venus/ci: skip timing out test +- nak: fix meson typo +- venus: initialize bitset in CreateDescriptorPool() +- v3d/ci: mark spec\@amd_performance_monitor\@vc4 tests as flaky +- meson: xcb & xcb-randr are needed by the loader whenever x11 is built +- docs: add stub header for u_format_gen.h +- v3d,v3dv: reuse dep_v3d_hw from simulator/meson.build +- v3d,v3dv: figure out whether we're using the simulator only once +- v3d,v3dv: simplify USE_V3D_SIMULATOR/using_v3d_simulator logic +- vc4,v3d,v3dv: avoid compiling in unused \`sim_file` field in the {vc4,v3d}_screen/v3dv_physical_device struct +- ci/baremetal: fix logic for retrying boot when it failed +- v3d/ci: mark spec\@amd_performance_monitor\@api as flaky on rpi4 +- v3d/ci: mark glx\@glx-visuals-depth as flaky on rpi4, just like the -stencil variant +- meson: don't select the deprecated \`swrast` option ourselves +- meson: improve wording of "incompatible llvm options" error +- ci: remove llvmpipe in the job that disables llvm +- rpi4/ci: mark fs variant of arb_texture_buffer_object as flaky too +- docs/meson: add missing newline at the begining of code-block +- bin/ci: update python-gitlab to 4.x +- ci/prepare-artifacts: use find's -exec instead of iterating over its output with xargs +- ci/prepare-artifacts: check if we're about to overwrite ci config/expectations files +- llvmpipe/ci: bump timeout for nightly job to 1h +- ci: drop dead VKD3D_CONFIG +- ci/vkd3d: rename vkd3d runner script to vkd3d-runner.sh to match the other ones +- ci/vkd3d: store vkd3d "errors" in the results file instead of spamming the job log +- ci/vkd3d: use GPU_VERSION to identify the list of failures, unifying with deqp-runner.sh +- ci/vkd3d: move test expectations file to the results folder +- ci/vkd3d: move expectation file creation logic out of the test results handling +- ci/vkd3d: be more specific for what to grep +- ci/vkd3d: rename vkd3d expected failures list to be more explicit +- ci/vkd3d: add support for \*-skips.txt list files +- ci/vkd3d: add support for \*-flakes.txt list files +- pick-ui: move NominationType values around so that None is 0 +- ci/poe-powered: rename --test-timeout to --test-timeout-minutes to be more explicit +- ci/bare-metal: rename fastboot & cros-servo TEST_PHASE_TIMEOUT to TEST_PHASE_TIMEOUT_MINUTES to be coherent +- ci/poe-powered: allow setting a different boot timeout than 5 minutes +- rpi/ci: shorten the boot timeout to 90 seconds +- docs: add release notes for 24.1.5 +- docs: update calendar for 24.1.5 +- docs: add sha256sum for 24.1.5 +- nvk/ci: bump vkcts job timeout +- meson,ci: remove dead \`kmsro` option in \`gallium-drivers` +- ci: remove dead start-x.sh script +- rpi3/ci: remove {version,render}_check from [piglit.env] +- ci/deqp-runner: fix --fraction-start in non-parallel jobs +- ci: pass MESA_SPIRV_LOG_LEVEL from job to the test +- android: fix build in multiple ways +- docs: update calendar for 24.2 +- docs: add release notes for 24.2.0 +- docs: add sha256sum for 24.2.0 +- docs: update calendar for 24.1.6 +- docs: add release notes for 24.1.6 +- docs: add sha256sum for 24.1.6 +- ci/container: define LLVM_VERSION in the alpine container job +- ci/build: reuse alpine llvm version to make sure it stays coherent +- ci/build: add comment to explain why s390x is useful +- ci/build: document \@ajax as a stakeholder for s390x +- ci/build: drop softpipe from s390x job +- nvk/ci: add vkd3d job on the ga106 +- llvmpipe/ci: mark now-skipping test as no longer failing +- nvk+zink/ci: mark \`spec\@arb_sample_shading\@samplemask .*\@0\\..* partition` as fixed +- nvk/ci: document regressions +- turnip/ci: document regressions +- turnip/ci: skip timing out tests +- turnip/ci: document flakes +- nvk/ci: mark a bunch of tests as fixed +- turnip/ci: add a750_vk flakes +- docs/envvars: fix reference +- broadcom/ci: set default TEST_PHASE_TIMEOUT_MINUTES so that jobs never timeout without uploading results +- broadcom/ci: increase the timeout of the nightly job v3d-rpi4-gl:arm32 by 10min +- ci/build: add honeykrisp (asahi vk) to several builds +- intel/ci: don't trigger anv-jsl-full & anv-tgl-full on GL changes +- nvk/ci: document CTS bug +- ci: add check for misleading indentation in ci toml files +- ci: don't run hardware tests if the toml files don't pass validation +- ci: fix toml-lint rules to not block nightly pipelines +- r300/ci: move postmerge jobs to -postmerge stage +- lavapipe/ci: move postmerge jobs to -postmerge stage +- venus/ci: move postmerge jobs to -postmerge stage +- ci/rules: make every job exist as manual in fork pipelines +- kopper: only include loader_dri3_helper.h when xcb is enabled +- zink+nvk/ci: document new variant of test failing +- zink+nvk/ci: document all the flakes seen in the last week +- turnip/ci: document all the a750 flakes seen in the last week +- vc4: Add missing libvc4_neon build dependencies +- vc4/meson: simplify neon build now that the android build system doesn't exist anymore +- ci/deqp-runner: restore CC after temporarily overriding it +- ci/deqp: simplify command to list local deqp patches +- ci/deqp-runner: be less verbose in the loop printing the deqp builds info +- ci/deqp-runner: build from git checkout even on linux +- ci/deqp-runner: add infra to apply patches +- ci/image-tags: re-generate all the images building deqp-runner +- ci/build: fix ppc64le and s390x jobs rules +- ci: take igalia farm offline +- docs: add release notes for 24.2.1 +- docs: add sha sum for 24.2.1 +- docs: update calendar for 24.2.1 +- nvk/ci: drop skip of fixed vkd3d test +- ci/vkd3d: make the job output less spammy, focus on what is useful to print +- ci/vkd3d: use upstream test-runner.sh for process isolation +- ci: run only one vkd3d test at a time +- ci/vkd3d: check for tests listed in fails but not actually failing +- v3dv/ci: fix test timeout for v3dv-rpi5-vk-full:arm64 +- etnaviv/ci: fix gc2000_piglit test timeout +- freedreno/ci: fix a630_*_full tests timeout +- nouveau/ci: fix gk20a-gles test timeout +- freedreno/ci: drop unused TEST_PHASE_TIMEOUT_MINUTES in LAVA jobs +- ci/bare-metal/poe: change the default TEST_PHASE_TIMEOUT_MINUTES to \`CI_JOB_TIMEOUT - 5min` +- broadcom/ci: drop TEST_PHASE_TIMEOUT_MINUTES that match the default value +- broadcom/ci: use TEST_SETUP_AND_UPLOAD_MARGIN_MINUTES +- etnaviv/ci: drop TEST_PHASE_TIMEOUT_MINUTES that match the default value +- freedreno/ci: drop TEST_PHASE_TIMEOUT_MINUTES that match the default value +- nouveau/ci: drop TEST_PHASE_TIMEOUT_MINUTES that match the default value +- ci/build: untangle debian-s390x from debian-ppc64el +- freedreno/ci: add a630 flake +- docs: add release notes for 24.1.7 +- docs: update calendar for 24.1.7 +- docs: add sha sum for 24.1.7 +- ci: move build jobs that are used by tests to an earlier stage +- ci: merge build-x86_64 & build-misc +- ci: do not start build-only jobs until the critical build-for-tests jobs are done +- nvk/ci: mark -dEQP-VK.drm_format_modifiers.export_import* as fixed +- radeonsi/ci: bump timeout for nightly job glcts-vangogh-valve +- Revert "freedreno/ci: drop TEST_PHASE_TIMEOUT_MINUTES that match the default value" +- freedreno/ci: fix test timeout for a306_piglit +- radeonsi/ci: mark a bunch of subgroups tests as failing +- venus/ci: drop redundant flakes definitions +- venus/ci: add flake and skip timing out test +- zink+nvk/ci: bump zink-nvk-ga106-valve timeout as more tests are being run +- zink+nvk/ci: mark a ton of tests as fixed +- zink+turnip/ci: fix .zink-turnip-valve-manual-rules +- turnip/ci: add vkd3d job on the a750 +- ci: add virgl & venus to the s390 build +- docs: Fix linkcheck failures +- zink+nvk/ci: mark test as fixed +- zink+nvk/ci: add flakes seen recently +- docs: add release notes for 24.2.3 +- docs: update calendar for 24.2.3 +- docs: add sha sum for 24.2.3 +- amd/ci: track changes to the global driver \`*-skips.txt` files +- broadcom/ci: track changes to the global driver \`*-skips.txt` files +- llvmpipe/ci: track changes to the global driver \`*-skips.txt` files +- lavapipe/ci: track changes to the global driver \`*-skips.txt` files +- intel/ci: track changes to the global driver \`*-skips.txt` files +- v3dv/ci: drop skip of fixed test +- docs/release-calendar: add 24.3 branchpoint and rcs +- zink+nvk/ci: document spec\@egl_ext_surface_compression\@create as crashing +- zink+nvk/ci: add flakes seen recently +- lavapipe/ci: document regression +- llvmpipe/ci: document regression +- radeonsi/ci: document spec\@egl_ext_surface_compression\@create as crashing +- radeonsi/ci: mark KHR-GL46.shader_image_load_store.basic-allTargets-atomic as fixed +- ci/build: move debian-clang-release to priority build-for-tests stage +- lavapipe/ci: document ray tracing regression +- mr-label-maker: add gfxstream +- ci/deqp: make sure the git commit hash of deqp is stable across rebuilds +- ci: fix windows container jobs being missing on forks +- ci/build: don't wait on the windows build anymore +- turnip/ci: mark vkd3d test_planar_video_formats as fixed +- zink+nvk/ci: mark spec\@egl 1.4\@egl-ext_egl_image_storage as fixed +- zink+nvk/ci: document more flakes seen lately +- egl: fix dri2_from_names() call +- docs: add release notes for 24.2.4 +- docs: update calendar for 24.2.4 +- docs: add sha sum for 24.2.4 +- Revert "ci/build: move debian-clang-release to priority build-for-tests stage" +- ci/build: move debian-build-testing to priority build-for-tests stage +- ci/build: mention the .build-for-tests-jobs exception +- docs: add release notes for 24.2.5 +- docs: update calendar for 24.2.5 +- docs: add sha sum for 24.2.5 +- ci: drop spec\@egl_ext_surface_compression\@create from the rest of the expected fails +- zink+nvk/ci: document regression +- zink+nvk/ci: document more flakes seen +- zink+radv/ci: document more flakes seen +- nvk/ci: document more flakes seen +- ci: run shader-db only when one of the drivers it tests is modified +- zink+nvk/ci: mark a few tests as fixed +- zink+nvk/ci: document more flakes seen +- ci/{deqp,piglit}-runner: make zstd respect FDO_CI_CONCURRENT when set, and fallback to the current "all the threads" +- ci/prepare-artifacts: use more than one thread when compressing the install tarball +- freedreno/ci: add {freedreno,turnip}{,-manual}-rules to simplify the rest of the code +- freedreno/ci: use {freedreno,turnip}{,-manual}-rules +- ci: rename "freedreno" farm to "google-freedreno" +- {freedreno,intel}/ci: rename "premerge-skips.txt" to "merge-skips.txt" to accurately reflect what they are +- {freedreno,intel}/ci: add missing tracking of \`merge-skips.txt` files +- ci: simplify ci-tron priority variable +- ci: simplify setting lava job priority +- ci: fix shader-db rules +- ci/b2c: allow B2C_DTB_URL to be set to an empty string to unset it +- ci/b2c: make initramfs & kernel urls optional +- ci: move yaml-toml-shell-test out of build-for-tests stage +- ci: move comment into the appropriate section +- zink+turnip/ci: update a750 results +- docs: update link to \`perf` wiki +- nvk/ci: add back a crash that was mistakenly removed from the expectations +- zink+nvk/ci: document ext_egl_image_storage regression in c06a55fd...2fb4aed9 (likely !31585) +- all-drivers/ci: drop duplicate flakes lines +- zink+nvk/ci: drop duplicate flakes lines +- zink+nvk/ci: add flakes seen recently +- turnip/ci: add more dEQP-VK.renderpass2.fragment_density_map.* flakes seen recently +- nvk/ci: add flakes seen recently +- lavapipe/ci: group & sort fails +- lavapipe/ci: group & sort skips +- lavapipe/ci: skip builtin ray query tests that take too long and time out +- lavapipe/ci: document regression in the commit range 765d1c47...366f63fd +- ci: move shellcheck options to .shellcheckrc +- ci: rename "merge-skips" to "slow-skips" as they're about to be used outside of merge piplines +- ci: skip slow tests on all non-"full" jobs +- ci: drop dead "load jwt in the environment just before exiting the job" code +- egl: error out during setup if the configuration is invalid +- freedreno/ci: abort a750 testing when a hang is detected +- freedreno/ci: add nightly freedreno gl testing on a750 +- ci/build: deduplicate meson build command +- ci/build: handle build timeout ourselves to give more time to nightly LTO builds +- ci/build: lower most job's timeout from 30 to 15 min +- docs: add release notes for 24.2.6 +- docs: update calendar for 24.2.6 +- docs: add sha sum for 24.2.6 +- ci: only end current section if there is a current section +- ci: initialize state_x (tracking \`set -x`) +- ci: consistently restore \`-x` after temporarily disabling it +- ci: rename state_x to previous_state_x to be clear about what it contains +- ci/b2c: properly escape double-quoted container command +- ci/b2c: make dut boot quieter on ci-tron jobs +- ci: properly quote ccache dir and path +- ci: don't replace ld with ld.gold if ld.gold doesn't exist +- ci: bump ci-templates +- amd/ci: document flakes seen lately +- nvk/ci: document flakes seen lately +- freedreno/ci: document flakes seen lately +- broadcom/ci: document flakes seen lately +- meson: drop variable initialized twice +- meson: add dependencies needed by wsi_common_x11.c even on non-drm platforms +- ci: drop unused extra args in build-vkd3d-proton.sh +- ci: replace plain \`meson` with explicit \`meson setup` +- meson: bump spirv-tools version needed to v2022.1 +- ci: move error handling functions at the end +- ci: use quiet alias for commands +- ci: raise priority of release manager pipelines + +Eric R. Smith (19): + +- panfrost: make PAN_MESA_DEBUG=dump really show all GPU memory +- panfrost: fix texture.border_clamp regression for valhall +- panfrost: use RGB1 component ordering for R5G6B5 pixel formats +- panfrost: Make the unit to use for instructions explicit +- panfrost: Update bifrost_isa.py to handle some Valhall constructs +- panfrost: refactor valhall ISA parser +- panfrost: split pseudo instructions from Bifrost and Valhall +- panfrost: fix crash when disassembling shader for trace +- panfrost: Add b2i16 conversion to table +- panfrost: move pan_arch definition out of genxml to pan_props.h +- panfrost: move lower_image_ms to after lower_atomics +- panfrost: implement image2dms support for valhall +- panfrost: add support for image2DMSArray on bifrost +- panfrost: improve swizzle inversion +- panfrost: print human readable versions of some swizzle fields +- panfrost: Add back A8_UNORM format for valhall +- panfrost: fix SAMPLE_ALPHA_TO_ONE +- panvk: Support blend shaders for alpha_to_one_enable +- panfrost: fix earlyzs settings for alpha_to_coverage + +Erico Nunes (13): + +- dri: fix sun4i-drm driver name +- lima/ppir: add folding lowering for src modifiers +- lima/ppir: add folding lowering for dest modifiers +- lima/ppir: remove nir_legacy +- lima/ppir: add support and folding for fclamp_pos +- lima/ppir: add support and folding trunc to a modifier +- lima/ppir: switch to derivative intrinsics +- docs/ci: pass -D platforms=[] to avoid dependencies +- v3dv: unconditionally expose VK_EXT_queue_family_foreign +- v3dv: match render and display device for wsi present +- lima/ci: define FDO_HTTP_CACHE_URI for caching-proxy +- lima/ci: update piglit ci expectations +- Revert "ci/lima: Temporarily disable" + +Erik Faye-Lund (54): + +- docs/panfrost: mention that Mali-G610 is conformant +- mesa/main: use extension-helper +- mesa/main: validate GL_UNSIGNED_INT_5_9_9_9_REV +- mesa/main: validate GL_UNSIGNED_INT_2_10_10_10_REV +- mesa/main: validate GL_UNSIGNED_INT_10_10_10_2 +- mesa/main: check depth/stencil formats +- mesa/main: validate desktop gl format/types +- mesa/main: validate bgr formats +- mesa/main: validate abgr format +- mesa/main: validate integer-formats +- mesa/main: validate half-float types +- mesa/main: switch from deny-list to allow-list +- mesa/main: add gles-compatible check helpers +- mesa/main: tighten glsl-version checks +- mesa/main: tighten gl-version checks +- panvk: remove out-of-date detail +- panvk: drop unused define +- panvk: error-check get_fb_descs calls +- mesa: make compressed-format emulation optional +- panfrost: unify compressed formats +- panfrost: store texfeat_bit in panfrost_format +- panfrost: check fmt.bitfeat_bit for compressed-support +- panfrost: do not re-fetch compressed formats +- panvk: check against texfeat_bit +- panvk: implement KHR_get_memory_requirements2 +- panvk: drop needless function implementation +- panvk: expose already-implemented KHR_maintenance1 +- i915: do not calculate native instructions +- mesa: remove superfluous "native" limits and stats +- mesa: clean up setup of MaxAddressRegs constant +- panvk: implement and expose KHR_maintenance2 +- panvk: expose KHR_relaxed_block_layout support +- panvk: expose KHR_bind_memory2 support +- docs/features: fix bad sorting of drivers +- panvk: expose KHR_external_semaphore and KHR_external_semaphore_capabilities +- panvk: expose KHR_external_memory and KHR_external_memory_capabilities +- panvk: expose KHR_external_fence and KHR_external_fence_capabilities +- panvk: fix broken wsi +- panvk: avoid signed integer underflow +- panvk: drop needless assert +- panvk: put conditional outside of define +- panvk: assert on missing vs +- panvk/csf: only look at fs if it's required +- panfrost: drop needless assign +- panfrost: add an assert in render-target setup +- panvk: drop duplicate dirty-test +- panvk/jm: respect depth-clip state +- panvk/csf: respect depth-clip state +- panvk: expose EXT_depth_clip_enable +- docs/features: claim vk 1.0 for panvk +- docs/features: mark off more panvk extensions +- panvk: enable KHR_16bit_storage +- docs/features: mark off missing panvk feature +- glx: avoid null-deref + +Erwin Jansen (1): + +- Enable GFXStream bazel build + +Evan (3): + +- amd/vpelib: Color Keyer Implementation +- amd/vpelib: Luma AND Color Keyer Full Support +- amd/vpelib: Input Format Adjustment + +Faith Ekstrand (309): + +- nvk: Fix indirect cbuf binds pre-Turing +- nvk: Don't advertise sparse residency on Maxwell A +- nak/sm50: Split IAdd2 into IAdd2 and IAdd2X +- nak: Implement ineg on sm50 +- nak: Add an AtomCmpSrc to AtomOp::CmpExch +- nak/sm50: Support AtomOp::CmpExch +- nak/sm50: Improve encoding of OpFFma +- nak/sm50: Various encoding fixes +- nak/sm50: Implement OpPixLd +- nak: Lower fsqrt in NIR on Maxwell A and earlier +- nak/sm50: Implement OpCCtl +- nak/nir: Split 64-bit conversions pre-Volta +- nak: Drop the nvfuzz binary +- nak: Plumb tessellation parameters through ShaderStageInfo +- nak: Rework fragment shader stage info +- nak: Move a few more things to VtgIoInfo +- nak: Move creation of nak_shader_info to ShaderBin::new() +- nvk: Reject sparse images on Maxwell A and earlier +- nil: Add a helper for populating the null descriptor +- nvk: Create the zero page earlier in device creation +- nvk: Call into NIL to populate the null descriptor +- nouveau/winsys: Uninline nouveau_ws_bo_unmap() +- nak: Pull the QMD code out into its own crate +- nak: Make ShaderBin a public struct +- nak/meson: Rename libnak_bindings_gen to _libnak_bindings_rs +- nak: Add a bare HW shader runner +- nak: Add an implementation of the ACORN random number generator +- nak: Add dst_type decorations +- nak: Add a Foldable trait +- nak: Implement Foldable for OpIAdd3 and OpIAdd3X +- nak: Initialize DEBUG on-demand +- nak: Add a framework for running hardware tests +- nak: Add some opcode tests +- nak: Implement Foldable for OpPrmt +- nak: Implement Foldable for OpShf +- nak: Fix shf.l.high on Maxwell +- nak: Add 64-bit shift helpers +- nak: Add tests for 64-bit shifts +- nak: Use .wrap for 64-bit shifts +- nak: Fix shl64 for pre-Volta +- meson/megadriver: Don't invoke the megadriver script with no drivers +- nak/spill_values: Don't assume no trivial phis +- nak: Drop the old encode_sm*.rs files +- nak/sm50: Add control-flow sync ops +- nak: Add more NIR wrappers for walking the NIR CFG +- nak/sm50: Emit sync instructions for control-flow +- nak: Disable opt_jump_thread pre-Volta +- nak: Don't emit barrier ops for shared memory pre-Volta +- nak/sm70: Implement OpAtom with Dst::None as RED +- nak/sm50: Implement OpAtom with Dst::None as RED +- nak: Use the RED path for atomics with unused destinations +- nak: IMul and IMad are variable latency pre-Volta +- nak/calc_instr_deps: Set a delay of 13 for flow ops pre-Volta +- nak: Align shader data to the min CBuf alignment +- nak: Add tests for nak_builder::isetp64 +- nak: Add tests for nak_builder::iadd64 +- nak: Add tests for OpISetP +- nak/hw_tests: Add debug printing +- nak/sm50: Set the .x bit for isetp +- nak/builder: Fix isetp64 for pre-Volta +- nvk: Squash txq results to 0 for null descriptors +- nak: Fix OpCCtl encoding on Maxwell +- nak/nir: Don't emit CCtl for release barriers on Maxwell +- nvk: Don't push bound UBOs on Volta and earlier +- nak: Implement From for Pred +- nak/sm50: OpIAdd2 only supports one negate modifier +- nak/sm50: OpLop2 does not have .X +- nak: Add tests for OpLop2 and OpLop3 +- nak/sm50: Fix the encoding of ffma with cbuf in src2 +- nak: Implement depth clip control pre-Volta +- nvk: Use nvk_buffer_addr_range() for index buffers +- nvk: Fix zero-size vertex/index buffer bindings pre-Turing +- nak/sm50: Add support for OpAL2P +- nak: Move the runner into the main nak crate +- nak: Plumb through the call/return stack size +- nvk: Take CRS into account when allocating the SLM area +- nak/nir: Fix the reverse execlist iterator +- nak/nir: Add a few more NIR helpers +- nak/nir: Rework CRS handling +- nak: Add target labels to CRS pop instructions +- nak: Add an optimization for CRS ops +- nak: OpKill also needs a delay on Maxwell +- nak/sm50: Add execution delays for more ops +- nak: SrcMod::INeg cannot be folded +- nak: Add tests for nak_builder::ineg64 +- nak/builder: Allow source modifiers in ineg64() +- nak: Lower iadd64(x, -y) to isub64(x, y) in NIR +- nak/hw_tests: Explicitly test equal cases in test_isetp64 +- nak: Add tests for OpFlo +- nak: Assert no overflow when legalizing iadd with too many ineg modifiers +- nak: Add source and destination types for carry values +- nak/hw_tests: Respect src_mod and src_swizzle in op auto-tests +- nak/hw_tests: Use better test values for iadd tests +- nak: Add tests for OpIAdd2 and OpIAdd2X +- nak/sm50: Set the not bit for OpLop2::srcs[0] +- nak: Use OpFoldData::get_u32_bnot_src() more places +- nak: Add tests for OpPopC and enable SrcMod::BNot +- nak/nir: Preserve rounding modes in split_64bit_conversions() +- nak: Rename OpFSOut to OpRegOut +- nak: pre-Volta uses r0 to output the final GS handle +- nak: Add a PredSetOp::eval() helper +- nak: Add tests for OpPSetP +- nak: Non-constant offsets are allowed on patch loads +- nak: Rework isetp64 on sm50 again +- nak: Disallow isetp.x pre-Volta +- nak: Manually flush denorms for nir_op_fquantize2f16 pre-Volta +- nak/sm50: Set f2f.high +- nvk: Default to NAK on Maxwell+ +- nvk: Disable sparse D32S8 cubes pre-Turing +- nak: Move the QMD code back into the main nak crate +- nak/qmd: Return the complete QMD dispatch size layout +- mme/fermi: Don't try to access zero-size std::vector +- nouveau/mme: Add support for MUL on Fermi +- nouveau/mme: Add support for multiplication on Fermi +- nak/hw_runner: Use byte_offset() +- nak/hw_tests: Use ptr::from_ref() and ptr::cast() +- nak: Use nak_builder::fsetp() in op_fquantize2f16 +- nvk: Fix indirect dispatch on Turing+ +- vulkan: Update XML and headers to 1.3.292 +- nvk: Move nvk_sample_location to NAK +- nak: Sample locations are byte-aligned +- nak,nir: Rework gl_SampleMaskIn[] handling +- nvk: Don't force sample shading when gl_SampleMaskIn[] is read +- nvk: Set POST_PS_INITIAL_COVERAGE +- compiler/rust: Copy the Rust NIR wrappers from NAK +- compiler/rust/nir: Implement NIR types directly +- compiler/rust/nir: Add explicit lifetimes +- nak: Switch to the common NIR wrappers +- nak: Move DstsAsSlice::is_uniform() to its own trait +- nak: Replace the guts of Srcs/DstsAsSlice with a new AsSlice trait +- nak,compiler: Move AsSlice to common code +- nvk: Require color or depth/stencil attachment support for input attachments +- nvk: Support STORAGE_READ_WITHOUT_FORMAT on buffers +- nvk: Move descrptor structs into a separate header +- nvk: s/device/dev/ in nvk_buffer_view.c +- nvk/nvkmd: Advertise the usable VA range +- nvk: Add a VK_EXT_descriptor_buffer buffer view cache +- nvk: Refactor image intrinsic lowering a bit +- nvk: Lower descriptors for VK_EXT_descriptor_buffer buffer views +- nvk: Use nvk_edb_buffer_view_descriptor for EDB descriptor set layouts +- nvk: Use the EDB buffer view path with NVK_DEBUG=edb_bview +- nvk: Implement GetDescriptorLayoutSize/BindingOffsetEXT() +- nvk: Refactor some descriptor set helpers +- nvk: Implement GetDescriptorEXT +- nvk/descriptor_table: Add support for requesting a specific index +- nvk: Implement descriptor capture/replay +- nvk: Properly indent a comment +- nvk: Rework descriptor set bindings +- nvk: Implement descriptor buffer binding +- nvk: Add support for embedded immutable samplers +- zink: Align descriptor buffers to descriptorBufferOffsetAlignment +- nvk: Advertise VK_EXT_descriptor_buffer +- nvk: Invalidate MME data for descriptor buffers +- nak: Pass -C default-linker-libraries when building tests +- ci: Add NVK to debian-arm64-build-test +- vulkan: Add null descriptor bits to vk_pipeline_robustness_state +- nvk: Plumb the whole vk_pipeline_robustness_state through to nvk_ubo/ssbo_addr_format +- nvk: Enable shader bounds checking when nullDescriptor is enabled +- nouveau/headers: Add more nv_push asserts +- nouveau/winsys: Fix an undefined shift +- nouveau/mme: Fix add64 of immediates on Fermi +- nouveau/mme: Better handle the too many immedaties case on Turing+ +- nouveau/mme: Don't allow PRMT and MTHD in the same op on Turing+ +- nouveau/mme: Disassemble MERGE more like the mme_merge() +- nouveau/mme: Add NOT and AND_NOT ops +- nouveau/mme: Allow shifting immediates on Fermi +- nouveau/mme: Use a designated initializer in mme_fermi_decode() +- nouveau/mme/tu104: Break the state simulator into a pluggable component +- nouveau/mme/fermi: Break the state simulator into a pluggable component +- nouveau/mme: Add a pluggable state interface +- nouveau/mme/tu104: Properly handle mthd increment +- nouveau/mme: Add a generic simulator function +- nvk: Reserve MME scratch area for communicating with FALCON +- nvk/mme: Add a unit test framework for driver macros +- nvk: Emit NVK_MME_SCRATCH_DRAW_BEGIN as part of flush_ia_state() +- nvk: Increase the context init size +- nvk: Move the guts of BindIndexBuffer2KHR to a macro +- nvk: Move the guts of BindVertexBuffers2 into a macro +- nvk: Move setting VB enables and strides into macros +- nvk/mme: Add helpers for masked scratch updates +- nvk: Add macros to sort out tess parameters +- nvk: Move the ANTI_ALIAS_CONTROL logic to the MME +- nvk: Stop leaking the descriptor table in-use set +- nvk: Disable conditional rendering around CopyQueryPoolResults +- nvk: Expose a new nvk_compile_nir_shader() helper +- nvk: Refactor compute shader dispatch +- nvk: Add a helper for dispatching compute shaders +- nvk: Use the new dispatch helper for queries +- nvk: Treat NULL vertex/index buffer pointers as null descriptors +- nil,nvk: Disable modifiers for B10G11R11_UFLOAT and E5B9G9R9_UFLOAT +- nir: Preserve fp_fast_math in nir_opt_vectorize() +- v3dv: Update a debug message +- vtn: Move initialization of mesa_spirv_debug to a helper +- vtn: Allow SPIR-V debug options in release builds +- vtn: Add a debug flag to dump SPIR-V assembly +- v3dv: Drop the SPIR-V dumper +- radv: Use the SPIR-V printer in spirv_to_nir +- vulkan/pipeline: Handle VIEW_INDEX_FROM_DEVICE_INDEX_BIT +- lvp: Stash the PipelineCreateFlags2KHR in the pipeline +- dzn: Use vk_pipeline_shader_stage_to_nir() +- vulkan: Take a VkPipelineCreateFlags2KHR in vk_pipeline_*shader_stage*() +- vulkan: Handle VIEW_INDEX_FROM_DEVICE_INDEX_BIT in the runtime +- nvk: Fix a typo in a comment +- nvk: Use 4 bits per value for the anti alias values +- nvk: Hash minSampleShading in nvk_hash_graphics_state() +- nvk: Advertise VK_KHR_maintenance7 +- nvk/nvkmd: Add an nvkmd_mem_init() helper +- nvk/nvkmd: Plumb map_out through to the back-end nvkmd_mem_ops::map +- nvk/nvkmd: Plumb map pointers through to back-end unmap funcs +- nvk/nvkmd: Plumb map_flags through to unmap/overmap +- nvk/nvkmd: Don't inline nvkmd_mem_[un]map() +- nvk/nvkmd: Split client and internal mappings +- nvk/nvkmd: Reference count internal mappings +- nvk: Don't do linear<->tiled copies for rendering suspend/resume +- nvk: Take depth image layer counts from the VkRenderingInfo +- nvk: Add all_linear to nvk_rendering_state +- vulkan: Add a helper for getting VkRenderingAttachmentLocaiontInfoKHR +- vulkan: Allow pColorAttachmentLocations == NULL in CmdSetRenderingAttachmentLocationsKHR() +- vulkan: Expose a vk_cmd_set_rendering_attachment_locations() helper +- nvk: Reset rendering attachment locations in BeginCommandBuffer/Rendering +- nvk: Emit SET_CT_SELECT based on the dynamic color location map +- nvk: Adverties VK_KHR_dynamic_rendering_local_read +- nak: Add a NAK_SUBGROUP_SIZE #define +- nak: Move subgroup_id and num_subgroups to lower_system_values +- nak: Shuffle gl_LocalInvocationID for quad derivatives +- nvk: Advertise VK_NV/KHR_compute_shader_derivatives +- nvk: Only advertise compute_shader_derivatives when NAK is used +- vulkan: Handle variable-length property arrays more generically +- nil: Rework calculations of the sizes of miplevels +- nil: Return a more accurate size in level_size_B +- nil: Assert array_len == 0 in Extent4D::size_B() +- nvk: Advertise VK_KHR_shader_float_controls2 +- nvk: Silence a maybe-uninitialized warning +- nvk: Add an NVK_DEBUG=gart flag +- vulkan: Add a stages field to vk_pipeline +- vulkan: Add a vk_pipeline_ops::get_shader method +- vulkan/pipeline: Patch through INDIRECT_BINDABLE_BIT +- nvk: Fill out shader push data at compile time +- nvk: Use VkShaderStageFlags for shaders_dirty +- nvk: Disable CBuf pushing when INDIRECT_BINDABLE_BIT is set +- nvk: Add an implementation of VkIndirectExecutionSetEXT +- nvk/nvkmd: Always map read/write for internal maps +- nvk/nvkmd: Add a mechanism to track and look up nvkmd_mems +- nvk: Track and dump device-generated commands if possible +- nvk: Move flush_push_descriptors to the front of flush_state() +- nvk: Export graphics state flushing helpers +- nvk: Expose an nvk_cmd_flush_cs_qmd() helper +- nvk: Implement CmdProcess/ExecuteGeneratedCommandsEXT +- nvk: Handle indirect dispatch in pipeline barriers +- nvk: Advertise VK_EXT_device_generated_commands +- nvk: Handle aspects in D32_S8_UINT copies +- nvk: Assume a GOB height of 8 for copies +- nvk,nil: Replace gob_height_is_8 and is_tiled with a new GOBType enum +- vulkan: Use SEVERITY_WARNING_BIT for vk_error*() +- nil: Add a couple Extent4D and Offset4D methods +- nil: Add tiled memcpy helpers +- vulkan: Add a comment explainint SEVERITY_WARNING_BIT for vk_error*() +- nvk/nvkmd: Stop leaking the primary FD +- nir: Add an option to lower quad vote +- nak,nvk: Implement VK_KHR_shader_quad_control +- nvk: Fix a comment in SET_VIEWPORT_CLIP_CONTROL +- vulkan/queue: Don't use vk_semaphore in threaded payload stealing +- vulkan/queue: Move has_binary_permanent_semaphore_wait into the sumbit struct +- vulkan/queue: Use _mem_signal_temp instead of signal_mem_sync +- vulkan: Add a vk_queue_submit_has_bind() helper +- vulkan/queue: Use a builder pattern for vk_queue_submit +- vulkan/queue: Split vk_queue_submit into create and submit +- vulkan/queue: Merge submits when possible +- nouveau/mme: Test ifs with immediate parameters +- nouveau/mme: Evaluate methods immediately in the Turing sim +- nouveau/mme: Turing can't have emit and state in the same op +- nil: Add a helper to get samples from a SampleLayout +- nvk: Add an nvk_cmd_emit_sample_layout() helper +- nvk: Rework setup of sample masks +- nouveau/class_parser: Add more method arrays +- nil: Add D3D sample layouts +- nvk: Support D3D sample modes +- nil: Use D3D sample modes by default +- nvk: Implement GetPhysicalDeviceFragmentShadingRatesKHR() +- nvk: Initialize VARIABLE_PIXEL_RATE_SAMPLE_ORDER +- vulkan: Add a vk_fragment_shading_rate_is_disabled() helper +- nvk: Rework setting of min_sample_shading +- nvk: Emit FSR state +- nvk: Add support for binding fragment shading rate images +- nak: Get rid of some dead code warnings +- nak: Rename SV_VERTEX_COUNT to PRIM_TYPE +- nak: Add support for gl_ShadingRateEXT +- nak: Pass a nak_compiler into sysval and attrib helpers +- nak: Add support for gl_PrimitiveShadingRateEXT +- nvk: Set VARIABLE_PIXEL_RATE_SHADING_TABLE_SELECT +- nvk: Advertise VK_KHR_fragment_shading_rate +- vulkan/queue: Check for _mem_signal_temp before we submit +- nvk: Add an nvkmd_engines bitfield to nvk_queue +- nvk: Only set texture/sampler tables and SLM for enabled engines +- nvk: Re-sort the features table +- nvk: Advertise 64-bit atomics on buffer views +- nak: Remove annotations before calc_instr_deps() +- nak: Handle annotations in legalization +- compiler/rust: Enable unit tests +- compiler/rust: Add a unit test for the memstream abstraction +- compiler/rust: Fix a bad cast in the memstream abstraction +- nvk: Only wait on the upload queue if there are commands +- nvk: Only submit the state pushbuf if there are commands +- nvk: move nvk_queue_state_update to nvk_queue_submit_exec +- nvk: Use nvk_queue_submit_simple() for nvk_queue_state_update() +- nak/bindings: Use an enum for IOCTL numbers +- nvk: Fix host copies for array images +- nil: Add a nil_sample_offset() helper +- nvk: Put a sample map in the descriptor for MSAA storage images + +Felix DeGrood (5): + +- anv: fix src_hash dumping for compute shaders +- anv/measure: ignore events from reused command buffers +- intel/measure: add nogl feature +- intel/measure: defer file open until first write +- intel/measure: increase size of filename malloc to account for \\0 + +Francisco Jerez (8): + +- intel/dev: Add devinfo flag for TBIMR push constant workaround. +- intel/brw: Implement null push constant workaround. +- anv/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. +- iris/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround. +- iris: Pin pixel hashing table BO from iris_batch submission instead of from iris_state. +- intel/brw/gfx12.5+: Fix IR of sub-dword atomic LSC operations. +- intel/brw/xe2+: Adjust performance analysis divergence weight due to EU fusion removal. +- intel/fs/xe2: Fix up subdword integer region restriction with strided byte src and packed byte dst. + +Frank Binns (9): + +- pvr: minor vulkan driver formatting fixes +- pvr: drop 'Imagination' from deviceName +- pvr: fix image size calculation when mipLevels is 1 +- pvr: fix bo refcounting for imported buffers +- pvr: fix use of uninitialised value when setting up tex info swizzle +- pvr: remove set but not used struct member +- pvr: add handling for retry error in pvr_srv_rgx_submit_transfer2() +- pvr: add handling for retry error in pvr_srv_rgx_kick_compute2() +- pvr: ensure stencil clear value fits TA_STATE_ISPA.sref field + +Friedrich Vock (5): + +- radv/rt: Only do ploc atomicCompSwap once per workgroup +- radv/rt: Don't atomicAdd local prefix sums +- aco: Fix 1D->2D dispatch conversion on vulkanCapset +- gfxstream: query non-Vulkan caps from the guest +- gfxstream/guest: add GetRingParamsFromCapset function +- gfxstream/guest: use GetRingParamsFromCapset +- gfxstream: nuke unused functions +- gfxstream: simplify VK_HOST_CONNECTION macro +- gfxstream: don't use renderControlEnc for Vulkan +- gfxstream: nuke ResourceTracker::Impl +- gfxstream: fix meson build system issues +- gfxstream/guest: pass through Android process names +- gfxstream: import Vulkan-docs from 463f8c +- gfxstream: vulkan-Docs: import the cerealgenerator +- gfxstream: vulkan-docs: track feature type +- gfxstream: vulkan-docs: add cereal generator +- gfxstream: genvk: add support for vk_gfxstream.xml +- gfxstream: vulkan-docs: don't auto-generate vk_android_native_buffer +- gfxstream: add generate-gfxstream-vulkan.sh +- gfxstream: add support for VK_KHR_maintenance5 + VK_EXT_host_image_copy +- gfxstream: use standard Vulkan headers and new auto-gen flow +- gfxstream: look for common/opengl headers +- gfxstream: cereal: [moduleType] --> [(moduleType, moduleName)] +- gfxstream: cereal: SUPPORTED_WRAPPERS --> SUPPORTED_MODULES +- gfxstream: codegen EXT_swapchain_maintenance1 + KHR_swapchain host only +- gfxstream: mesa: import Mesa +- gfxstream: mega-change to support guest Linux WSI with gfxstream +- gfxstream: nuke goldfish_vk_handlemap +- gfxstream: use absolute path for some code +- genvk: options.filename --> args.target +- gfxstream: codegen: allow generating host/guest autogen separately +- gfxstream: guest: cpp_args --> guest_cpp_args +- gfxstream: reduce use of PLATFORM_SDK_VERSION +- gfxstream: PLATFORM_SDK_VERSION --> ANDROID_API_LEVEL +- gfxstream: don't forward declare ProcessResources +- gfxstream: nuke VIRTIO_GPU flag +- gfxstream: nuke HOST_BUILD +- gfxstream: add GFXSTREAM_ENABLE_GUEST_GOLDFISH +- gfxstream: add REQUIRED_TYPES list +- gfxstream: always initialize rcEncoder and other things +- gfxstream: build the test layer via Meson +- gfxstream: generate development ICD for meson build +- gfxstream: always advertise device memory extension guest side +- gfxstream: convert some Vulkan to mesa_log(..) +- gfxstream: valgrind fix +- gfxstream: add VK_EXT_vertex_attribute_divisor +- Revert^2 "gfxstream: add VK_EXT_vertex_attribute_divisor" +- Revert "gfxstream: Add vkCmdPipelineBarrier2 implementation in VkDecoderGlobalState" +- Revert "Revert "gfxstream: Add vkCmdPipelineBarrier2 implementat..." +- gfxstream: ResourceTracker: fix mesa_loge(..) +- gfxstream: guest: add VK_EXT_color_write_enable +- gfxstream: guest: add XCB Surface support +- gfxstream: guest: support blobCmd + blobCmdSize +- gfxstream: don't define hasDedicatedImage + hasDedicatedBuffer multiple times +- Revert "Add snapshot support for vkBindImageMemory2" +- gfxstream: guest: enable virtio-gpu kumquat +- gfxstream: host + guest: bump up version of the protocol +- gfxstream: linux guest: support client-side blob allocations +- gfxstream: guest: fix useless return statement +- gfxstream: guest: simpler get param for kumquat +- gfxstream: host: introduce vkGetSemaphoreGOOGLE +- gfxstream: guest: add GFXSTREAM_ACQUIRE_SYNC protocol +- gfxstream: guest: add experimental fence passing APIs +- gfxstream: guest: mBlobId -> mAtomicId +- gfxstream: guest: start using new vkGetSemaphoreGOOGLE apis +- gfxstream: kumquat: unmap memory when needed +- gfxstream: guest: fix kumquat build +- gfxstream: vulkan: fix issue with GCC +- gfxstream: guest: nuke AndroidLock +- gfxstream: guest: nuke aemu/base/Optional.h +- gfxstream: guest: nuke unused functions +- gfxstream: guest: make sure signalSemaphoreValueCount is correct +- gfxstream: guest: vk_CmdBeginTransformFeedbackEXT fix +- gfxstream: host: don't abort when unbox fails +- gfxstream: guest: add support VK_FORMAT_A2R10G10B10_UNORM_PACK32 Wayland buffers +- gfxstream: guest: use MESA_SCOPE_TRACE +- gfxstream: guest: nuke more AEMU base stuff +- gfxstream: guest: kumquat: fixes in preparation for AOSP Kumquat testing +- gfxstream: nuke set instance for testing +- gfxstream: nuke libplatform stub +- gfxstream: guest: plumb descriptor all the way through the HostConnection +- gfxstream: connect to the specificied kumquat socket +- gfxstream: guest: add vkInitializeKumquat +- gfxstream: guest: nuke Android HealthMonitor +- gfxstream: guest: plumb the descriptor everywhere +- gfxstream: end2end: nuke RutabagaLayer, use kumquat for testing +- gfxstream: simplify GLESv1/GLESv2/EGL/Vulkan builds +- gfxstream: nuke ErrorLog.h +- gfxstream: nuke AlignedBuf.h +- gfxstream: nuke mWorkPool +- gfxstream: guest: nuke vk_enum_string_helper.h +- gfxstream: guest: nuke Fuchsia Goldfish Address Space +- gfxstream: guest: nuke unnecessary Android properties +- gfxstream: guest: nuke goldfish_address_space_android.impl +- gfxstream: guest: mesa-ify GoldfishAddressSpace +- gfxstream: guest: mesa-ify libplatform_virtgpu +- gfxstream: guest: add virtgpu suffix +- gfxstream: guest: remove EmulatorFeatureInfo dependency in libgfxstream_vulkan.so +- gfxstream: guest: simplify Vulkan sequence number logic +- gfxstream: sync command buffer api with Mesa upstream +- gfxstream: move EmulatorFeatureInfo and ExtendedRenderControl +- gfxstream: guest: remove dependency on EmulatorFeatureInfo +- gfxstream: add processPipeInit and connect to IOStream +- Reapply "gfxstream: nuke mWorkPool" +- gfxstream: guest: fix timeout issues +- gfxstream: guest: introduce Gralloc::getGrallocType() +- gfxstream: guest: remove renderControl dependency from Gralloc +- gfxstream: GrallocMinigbm uses platform_virtgpu +- gfxstream: mesa-ify libgfxstream_android +- gfxstream: delete duplicate virtgpu_drm.h +- gfxstream: guest: don't rely on HostConnection for platform helpers +- gfxstream: guest: introduce GfxStreamConnectionManager +- gfxstream: guest: vulkan: use connection manager +- gfxstream: guest: introduce QemuPipeStream stub +- gfxstream: guest: fully mesa-ify vulkan_enc +- gfxstream: guest: remove unnecessary includes in VkEncoder +- gfxstream: guest: remove unneeded dependencies in meson build +- gfxstream: guest: vulkan: use hardware/google/aemu +- gfxstream: guest: vulkan: don't assume Android +- gfxstream: guest: nuke android_compat directory +- gfxstream: move goldfish sync +- gfxstream: guest: Gralloc --> GfxStreamGralloc +- gfxstream: don't assert without host +- gfxstream: move include outside of ANDROID +- gfxstream: codegen: don't require clang-format +- gfxstream: delete unneccessary code +- gfxstream: build time guest autogeneration +- gfxstream: re-license files +- gfxstream: re-license cereal +- gfxstream: simplify codegen +- gfxstream: moar relicense +- mesa: gfxstream: nuke vk_icd_* +- mesa: add gfxstream-experimental to -Dvulkan-drivers +- gfxstream: add clang-format +- gfxstream: nuke util function +- gfxstream: use util/libsync +- util: add sync_fence_info +- gfxstream: use sync_fence_info +- gfxstream: move generate-gfxstream-vulkan.sh script +- gfxstream: update Kumquat API + +Hailin Zhang (1): + +- Vulkan: fix dstArrayElement index wrap issue + +Hans-Kristian Arntzen (4): + +- wsi/common: Do not update present mode with MESA_VK_WSI_PRESENT_MODE. +- wsi/x11: Bump maximum number of outstanding COMPLETE events. +- radv: Always make sure to write the pipeline binary key. +- vulkan/wsi/wayland: Use X11-style image count strategy when using FIFO. + +Hsieh, Mike (4): + +- amd/vpelib: Remove deprecated update_3dlut flag +- amd/vpelib: Refactor OPP registers +- amd/vpelib: Add cache mechanism for 3D Lut command +- amd/vpelib: Refactor 3D LUT code + +Hyunjun Ko (25): + +- intel/genxml: add a missing value for MFX_SURFACE_STATE +- intel/genxml: change the length of MFX_QM_STATE +- intel/genxml: update VDENC instructions +- vulkan/video: adds a bitstream writer of h264 slice header +- anv/query: handle VK_QUERY_TYPE_VIDEO_ENCODE_FEEDBACK_KHR +- anv/video: remove unnecessary macros +- anv/video: implemnt VkGetEncodedVideoSessionParametersKHR +- anv/video: initial support for h264 encoding +- anv: enable VK_KHR_video_encode_queue and VK_KHR_video_encode_h264 +- intel/genxml: Add missing fields for HCP_SLICE_STATE +- intel/genxml: fix the length of VDENC_DS_REF_SURFACE_STATE +- intel/genxml: Adds more VDENC commands +- intel/genxml: fix some length of HCP_FQM_STATE +- intel/genxml: adds a value of reference pic to HCP_SURFACE_STATE +- intel/decoder: Handle HCP_PAK_INSERT_OBJECT +- vulkan/video: adds a bitstream writer of h265 slice header +- anv/query: consider codec when querying the encoding status. +- anv/video: initial support for h265 encoding +- anv: support h265 encoding +- zink: walk the chain of resources for multi-planar formats. +- anv/video: support VK_VIDEO_ENCODE_RATE_CONTROL_MODE_DISABLED_BIT_KHR. +- anv: consider VK_VIDEO_CODEC_OPERATION_ENCODE_H264_BIT_KHR when allocating mv storgae. +- anv: support for inline query for vulkan video +- anv: support VK_IMAGE_CREATE_VIDEO_PROFILE_INDEPENDENT_BIT_KHR +- anv: enable KHR_video_maintenance1 + +Iago Toral Quiroga (22): + +- v3d: move TLB blit checks to a helper +- v3d: support blitting straight from tile buffer +- v3d: add a faster TLB blit path +- v3d: do not expose EXT_float_blend +- broadcom/compiler: emit instructions producing flags earlier +- broadcom: implement nir_op_uclz +- broadcom: drop backend implementation of nir_op_ufind_msb +- broadcom/compiler: skip small immediates optimization on vpm instructions +- broadcom/compiler: rename is_ldunif_dst to try_rf0 +- broadcom/compiler: avoid register conflict with ldunif(a) and ldvary +- broadcom/compiler: don't add const offset to unifa if it is 0 +- broadcom/compiler: don't use small immediates in geometry stages +- v3d: support shader precompile for compute shaders +- nir: make fclamp_pos_mali and fsat_signed_mali opcodes generic +- broadcom: add missing opcodes for fmov on mul alu for V3D 7.x +- broadcom: add helpers to identify availability of new unpack modifiers +- broadcom/compiler: implement NIR mali opcodes for clamping +- broadcom/compiler: generate mali opcodes for clamping on Pi5 +- broadcom: fix pairing tmu lookup with previous ldtmu +- v3dv: drop device_id field +- v3dv: rewrite device identification +- v3dv: fix leak during device initialization + +Ian Forbes (1): + +- driconf: Re-enable GLX_EXT_buffer_age on vmwgfx + +Ian Romanick (48): + +- nir/range_analysis: Fix errors in fmin and fmax tables +- nir/algebraic: Optimize some bit operation nonsense observed in some shaders +- nir/algebraic: Optimize some masking of extract_u8 operations +- intel/brw: Fix undefined shift by 64 of uint64_t in brw_compute_first_urb_slot_required +- intel/brw: Fix undefined left shift of negative value in update_uip_jip +- intel/brw: Fix undefined left shift of large UW value in brw_imm_uw +- intel/brw: Fix undefined left shift of negative value in brw_texture_offset +- intel/elk: Fix undefined shift by 64 of uint64_t in elk_compute_first_urb_slot_required +- intel/elk: Fix undefined left shift of negative value in update_uip_jip +- intel/elk: Fix undefined left shift of large UW value in elk_imm_uw +- intel/elk: Fix undefined left shift of negative value in elk_texture_offset +- intel/brw: Eliminate dead flag writes +- intel/brw: Don't propagate saturate to an instruction that writes flags +- intel/elk: Don't propagate saturate to an instruction that writes flags +- intel/brw: Small code refactor in brw_fs_opt_saturate_propagation +- intel/brw: Use def analysis for simple cases of saturate propagation +- intel/brw: Move fsat instructions closer to the source +- intel/brw: Make 16-bit ishl, ishr, and ushr SSA friendly +- intel/brw: Make ifind_msb SSA friendly +- nir/algebraic: Improve some find_lsb and ifind_msb patterns +- anv: Protect against OOB access to anv_state_pool::buckets +- anv: Larger memory pools for huge shaders +- intel/brw/xe2: Allow int64 conversions +- nir/algebraic: Remove incorrect bfi of iand pattern +- nir/divergence: resource_intel is less divergent than you thought +- intel/brw: Copy prop from raw integer moves with mismatched types +- intel/brw: Run intel_nir_lower_conversions only after brw_nir_optimize +- intel/brw: Unconditionally run optimizations after nir_opt_uniform_subgroup +- intel/brw: Relax is_partial_write check in cmod propagation +- intel/brw: Enable constant propagation for a couple more logical sends +- intel/brw: load_offset cannot be constant on this path +- nir/algebraic: Optimize more comparisons with b2f +- nir/algebraic: Recognize open-coded bitfield_reverse in XCOM 2 +- nir: Add a pass to generate BFI instructions from logical operations +- intel/brw: Use nir_opt_generate_bfi +- r600: Use nir_opt_generate_bfi to generate bitfield_select +- radeonsi: Use nir_opt_generate_bfi to generate bitfield_select +- radv: Use nir_opt_generate_bfi to generate bitfield_select +- brw/copy: Don't remove instructions w/ conditional modifier +- brw/builder: Add new style ALU3 builder +- brw/sat: Fix small typos, copy and paste, etc. +- brw/sat: Convert nearly all tests to use new style builders +- brw/algebraic: Refactor constant folding out of brw_fs_opt_algebraic +- brw/copy: Perform constant folding with constant propagation +- brw: Call brw_fs_opt_algebraic less often +- brw/emit: Add correct 3-source instruction assertions for each platform +- brw/copy: Don't copy propagate through smaller entry dest size +- brw/cse: Don't eliminate instructions that write flags + +Icenowy Zheng (10): + +- zink: reject Imagination proprietary driver w/o geometryShader +- gallivm: orcjit: keep the ownership of tm for LPJit +- gallivm: orcjit: use atexit to release LPJit singleton at exit +- util: detect LoongArch architecture +- gallivm: add LoongArch support to the mattrs setting code +- llvmpipe: add LoongArch support in ORCJIT +- zink: do not expose quad support when no geometry shader +- pvr: enlarge transfer fw_stream buffer for multicore +- pvr: add tpu_dm_global_registers feature +- pvr: emit tpu_tag_cdm_ctrl in compute stream when present + +Igor Chernyshev (4): + +- Add VK_EXT_debug_utils and VK_EXT_scalar_block_layout +- Enable VK_KHR_create_renderpass2 in Guest +- Move vkUpdateDescriptorSets to hand-written mode +- Do not copy invalid descriptor set image view handles + +Iliyan Dinev (2): + +- pvr: fix mipmap alignment for non-32bpp textures +- pvr: handle PVR_SRV_ERROR_RETRY pvr_srv_rgx_kick_render2() + +Iván Briano (13): + +- intel/rt: fix terminateOnFirstHit handling +- nir: add pass to convert ViewIndex to DeviceIndex +- anv: handle VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT +- anv: fix adding to wa_addr +- anv: be consistent about aux usage with modifiers +- anv: free shaders on rt pipeline compile error +- anv: skip rt pipeline compile if we found all shaders +- vulkan: use standard sample locations if there's no VkPipelineSampleLocationsStateCreateInfoEXT +- anv: allocate sparse descriptor buffers from the correct heap +- hasvk: fix non matching image/view format attachment resolve +- brw: fix task/mesh push constant loading +- intel/rt: fix ray_query stack address calculation +- anv: remove unused/misleading/wrong parameters from the RT trampoline + +Jami Kettunen (1): + +- nouveau/headers: Fix build without rustfmt + +Jason Macnak (49): + +- Rename imported system directory to guest +- Handle AHB and swapchain info in VkBindImageMemoryInfo +- Move platform into guest +- Move guest/shared/* to guest/* +- Add error logging to aid debugging +- Make VirtGpu* interfaces +- Wrap interactions with sync fds in an interface +- Update namespace in libandroidemu to gfxstream::guest +- Update iostream namespace to gfxstream::guest +- Enable building guest Gfxstream components on host +- Wrap AHardwareBuffer_* interactions into Gralloc interface +- Update VirtioGpuPipeStream to use VirtGpu abstraction +- Move SyncHelper to platform +- Revert "[guest] Check that the singleton VirtGpuDevice matches desired capset" +- Move info maps into ResourceTracker +- Promote testing framework to a proper platform backend +- Use libbase's property getting functions +- Add dispatch magic for Gfxstream linux ICD +- Wrap some classes in "namespace gfxstream::guest {}" +- Sort and dedup physical device extensions +- Fix "-Werror=conversion" errors for RanchuHwc +- Handle AHB R8 format conversions +- Make RutabagaLayer shared between "guest impls" +- Introduce Gfxstream Features to decouple Gfxstream from AEMU +- Ensure glProgramBinary initializes uniform and attrib info +- Add composition support to the end2end test framework +- Update emulated gralloc to use AHB format +- Rename VirtGpuBlob -> VirtGpuResource +- Explicitly specify target/bind/bpp in resource creation +- Partial revert of aosp/2858589 to avoid Mesa layer for Android +- Revert "Partial revert of aosp/2858589 to avoid Mesa layer for Android" +- Reland "Partial revert of aosp/2858589 to avoid Mesa layer for Android" +- Update vkMapMemory to not hold lock when calling into enc +- Remove extra semicolon in VkEncoder +- Support BGRA render target in CompositorVk +- Adds helper for tracking command buffers with device lost +- Adds AHB import and sampling test +- Add YUV AHB import and sample end2end tests +- Demote some logs to debug only +- Disable device groups +- Ensure mesa struct is translated to gfxstream struct on Android +- Handle DEPTH_STENCIL_ATTACHMENT_BIT in VK -> AHB usage +- Handle null handles in vkFreeCommandBuffers +- Fix vkAllocateCommandBuffers with failing allocation +- Revert "gfxstream: nuke mWorkPool" +- Handle external fences in vkGetFenceStatus() +- Add perfetto tracing to host renderer +- gfxstream: fix log levels in descriptor handling +- gfxstream: use gralloc metadata in vkGetAHBPropertiesANDROID + +Jean-Francois Thibert (4): + +- Add vkUpdateDescriptorSetWithTemplateKHR to resource tracker +- gfxstream: Fix usage of encoder after release in vkResetCommandBuffer +- gfxstream: move staging reset to end of reset command buffer +- Provided supported formats through gfxstream caps + +Jean-François Thibert (1): + +- Add depth formats support to import/export + +Jesse (1): + +- amd/vpelib: Config Writer hook and CDC refinement + +Jesse Natalie (18): + +- microsoft/clc: Split struct copies before vars_to_ssa in pre-inline optimizations +- mr-label-maker: Add d3d10umd label rules +- meson: Add an error message for llvmpipe without llvm draw support +- wgl: Add missing idep_mesautilformat +- d3d12: Don't use a vertex re-ordering GS for line primitives +- ci/windows: Use MSVC v143 build tools +- ci/windows: Update VK-GL-CTS to d48899f85b486a70d090af59a1453763458611d9 +- ci/windows: Bump image tags to rebuild deps with v143 build tools +- compiler, vk: Support subgroup size of 4 +- dzn: Drop custom subgroup size handling +- microsoft/compiler: Handle subgroup size 4 as a required wave size +- microsoft/compiler: Move nir_lower_undef_to_zero out of the optimization loop +- d3d12: Fix shader selector hash to hash array instead of pointer-to-array +- winsys/d3d12: Use the waitable object to decrease latency +- wgl: Add driconf options for controlling latency and swap interval +- winsys/d3d12: Implement the latency control option +- driconf: Disable dzn for bg3.exe +- dzn: Clean up dri options cache + +Jessica Clarke (3): + +- Revert "meson: Do not require libdrm for DRI2 on hurd" +- Revert "meson: fix with_dri2 definition for GNU Hurd" +- meson: egl: Build egl_dri2 driver even for plain DRI + +Jiale Zhao (1): + +- llvmpipe: add loongarch util_get_cpu_caps function + +Jiali (1): + +- amd/vpelib: Enhance output format capabilities + +Jianxun Zhang (8): + +- intel/common: Ensure SIMD16 for fast-clear kernel (xe2) +- intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2) +- anv: Fix assertion failures on BMG (xe2) +- iris: Fix an assertion failure with compressed format +- anv: Disable compression on legacy modifiers (xe2) +- anv: Disable legacy CCS setup in binding (xe2) +- Revert "anv: Disable PAT-based compression on depth images (xe2)" +- Revert "iris: Disable PAT-based compression on depth surfaces (xe2)" + +Job Noorman (88): + +- ir3/postsched: improve debug state dumping +- ir3/postsched: fix calculation of max_delay +- ir3/postsched: remove unused delay field +- ir3/postsched: include ss/sy delay in max_delay +- ir3/postched: don't prioritize instructions with soft delays +- ir3: fix spill/reload split src/dst regs +- ir3: update merge set affinity in shared RA +- ir3: fix clearing merge sets after shared RA +- ir3: fix wrong dstn used in postsched +- ir3: fix counting of repeated registers +- ir3: correctly count vectorized instructions for tex prefetch +- ir3: add debug option to expand rpt instructions +- ir3: print (sat) modifier of instructions +- ir3: add backend support for repeated instructions +- ir3: add builders for repeated instructions +- ir3: make RA aware of repeat groups +- ir3: add pre-RA pass to clean up repeat groups +- ir3: add post-RA pass to merge repeat groups into rptN instructions +- ir3: lower vectorized NIR instructions +- nir/opt_vectorize: move rewriting of uses to a function +- nir/opt_vectorize: prepare for multiple try_combine functions +- nir/opt_vectorize: process blocks in source-code order +- nir/opt_vectorize: add support for phi nodes +- ir3: add support for vectorized NIR phi nodes +- ir3: replace \@load_uniform by new \@load_const_ir3 intrinsic +- ir3: rename \@store_uniform_ir3 to \@store_const_ir3 +- nir/load_store_vectorize: support non-byte offset +- nir/load_store_vectorize: support stores without wrmask +- nir/load_store_vectorize: add load/store_const_ir3 +- ir3: enable load/store_const_ir3 vectorization +- ir3: add support for rpt bary.f/flat.b +- ir3: use rpt instructions for frag coord +- ir3: use correct bit size for bools in emit_alu +- ir3/ci: remove fixed tests from a307-fails +- ir3: make fullsync sync after shared writes +- ir3/legalize: handle scalar ALU WAR hazards for a0.x +- zink/ci: add a618 flake +- freedreno: don't require binning and non-binning inputs to match +- ir3: remove unused outputs for binning pass in NIR +- ir3: don't modify const state for the binning variant in ir3_cp +- ir3: make ir3_const_state less error-prone to use +- ir3: fix calling ir3_const_state_mut in the binning VS +- ir3: print predicate inversion for branches +- ir3: print block divergence info +- ir3: remove unnecessary reconvergence point in scan_clusters lowering +- ir3: preserve block divergence info in ir3_lower_subgroups +- ir3: fix reconvergence points for blocks with two terminators +- ir3: fix adding physical edges multiple times +- ir3: fix reconvergence of blocks with multiple divergent predecessors +- ir3: recalculate reconvergence after opt_jump +- ir3: use physical cfg in helper_sched +- ir3: fix recognizing const/imm registers as a0 +- ir3/legalize: don't add WAR dependencies for const/imm regs +- ir3: add assert to detect getting reg file of const/imm +- ir3/legalize: add needs_ss_war helper +- ir3/legalize: don't add (ss) for WAR hazards synced with (sy) +- ir3: add is_war_hazard_producer helper +- ir3/postsched: take WAR ss-delay into account +- ir3/legalize: resolve WAR hazards for stc +- ir3/isa: add isaspec definition for shfl +- ir3: add ir3_compiler::has_shfl for shfl support on a6xx+ +- ir3/print: add support for shfl +- ir3: make backend aware of shfl: +- ir3: add codegen for rotate +- tu: advertise VK_KHR_shader_subgroup_rotate +- ir3: disallow immediates for shfl src1 +- nir/load_store_vectorize: fix division by zero +- nir/lower_int64: add nir_intrinsic_read_invocation_cond_ir3 +- nir/lower_int64: add nir_intrinsic_rotate +- ir3: lower 64b intrinsics after generic 64b lowering +- ir3: run 64b phi lowering together with other 64b lowerings +- ir3: lower 64b SSBO accesses +- ir3: make ir3_mem_access_size_align publically available +- tu: lower 64b push constants before lowering IO +- ir3: add support for 64b reductions +- tu: advertise shaderInt64 +- nir: add shuffle_{xor,up,down}_uniform_ir3 intrinsics +- ir3: add codegen for shuffle_{xor,up,down}_uniform_ir3 +- ir3: add helper to get glsl type for nir_def +- ir3: optimize subgroup shuffles using shfl +- nir/lower_subgroups: move up some helper functions +- nir/lower_subgroups: add build_cluster_mask helper +- nir/lower_subgroups: scan/reduce for multiple ballot components +- ir3: fix physical edges of predicated branches +- freedreno,computerator: add support for local memory +- ir3: merge is_reg_gpr and reg_gpr +- ir3/ra: prevent moving source intervals for shared collects +- ir3,tu: include ir3 debug flags in shader hash key + +Jocelyn Falempe (3): + +- loader: Fix typo in __DRI_IMAGE_FORMAT_XBGR16161616 definition +- gbm/dri: Use PIPE_FORMAT_* instead of using __DRI_IMAGE_* +- gbm/dri: Fix color format for big endian. + +John Anthony (4): + +- pan/kmod: Add max_tasks_per_core to kmod props +- panvk: Add cmd_dispatch +- panvk: Pull out task axis and increment calculation for dispatch +- panvk: Add support for CmdDispatchIndirect + +Jonathan Marek (2): + +- freedreno: improve a6xx CP_SET_MARKER xml definition +- freedreno/a6xx: add missing USES_GMEM flag and BIN_RENDER_END markers + +Jordan Justen (31): + +- include/drm-uapi/README: Update README for drm uapi header files +- intel/brw/validate: Simplify grf span validation check by not using a mask +- intel/brw/validate: Update dst grf crossing check for Xe2 +- intel/brw/validate: Convert access mask to be grf based +- intel/dev: Update hwconfig => max_threads_per_psd for Xe2 +- intel/dev: Enable BMG PCI IDs (without INTEL_FORCE_PROBE) +- anv: Drop "not yet supported" warning for Xe2 +- anv: Do hasvk devices check first +- intel/dev: Add devinfo::probe_forced based on INTEL_FORCE_PROBE +- anv: Don't warn about unsupported devices if INTEL_FORCE_PROBE was used +- intel/dev: Fix warning for max_threads_per_psd when devinfo->verx10 == 120 +- intel/dev: Add 0xb640 ARL PCI id +- intel/dev: Rework DEVINFO_HWCONFIG; add DEVINFO_HWCONFIG_KV macro +- intel/dev: Simplify DEVINFO_HWCONFIG_KV by adding should_apply_hwconfig_item() +- intel/dev: Allow specifying a version when to always use hwconfig +- intel/dev: Use hwconfig for urb min/max entry values +- intel/dev: Support Xe3 device init (for intel_device_info_test) +- intel/dev: Add XE3_FEATURES macro +- intel/genxml: Start Xe3 support +- intel/isl: Build for Xe3 +- intel/shaders: Build for Xe3 +- iris: Build for Xe3 +- intel/dev: Add Xe3 support to get_l3_list() +- intel/compiler: Add compiler enum for Xe3 +- intel/dev: Add INTEL_PLATFORM_PTL platform enum +- intel/dev: Add PTL device info +- intel/dev: Add PTL PCI IDs (with FORCE_PROBE set) +- intel/brw: Allow Xe3 in brw_stage_has_packed_dispatch() +- intel/compiler: Xe2 and Xe3 use the same compaction tables +- anv: Build for Xe3 +- intel/dev: Set L3 bank count for Xe2+ from Xe KMD + +Jose Maria Casanova Crespo (5): + +- v3d: v3d_resource Use LINEAR layout for importing with INVALID modifier +- v3d: avoid load/store of tile buffer on invalidated framebuffer +- v3d: initialize job local key with the 8 color buffer available in v7.1+ +- v3dv/ci: Add missing fails on RPi4/5 for uprev VKCTS to 1.3.10.0 +- v3d: Enable Early-Z with discards when depth updates are disabled + +Joshua Ashton (2): + +- radv: Enable variableSampleLocations +- radv: Fix sample locations at 0 for X/Y + +Joshua Duong (1): + +- For goldfish pipe, compute colorBufferMemoryIndex the same as host. + +José Roberto de Souza (50): + +- intel/brw: Add a maximum scratch size restriction +- anv: Drop useless 'if (total_scratch > 0) {' block in cmd_buffer_ensure_cfe_state() +- anv: Wait for Xe exec queue to be idle before destroying it +- isl: Fix Xe2 protected mask +- anv: Propagate protected information to blorp_batch_isl_copy_usage() +- anv: Improve error message when pipeline creation fails during shader compilation +- anv: Handle internal shader compilation failure +- intel: Sync xe_drm.h +- intel/dev: Support new topology type with SIMD16 EUs +- docs: Move all ANV environment variables to a single section in envvars +- anv: Disable sparse even on Xe KMD with ANV_SPARSE +- docs/anv: Document ANV_SPARSE and ANV_SPARSE_USE_TRTT +- anv: Extend ANV_QUEUE_OVERRIDE to blit count +- intel: Remove INTEL_ENGINE_CLASS_COMPUTE and INTEL_ENGINE_CLASS_COPY parameters +- docs/anv: Remove documentation about INTEL_COMPUTE_CLASS and INTEL_COPY_CLASS +- anv: Add trivial_batch and query-pool to the error capture +- anv: Use batch_bo_pool in utrace anv_async_submit_init() calls +- anv: Nuke anv_utrace_submit::trace_bo +- intel/isl/gfx20: Alow hierarchial depth buffer write through for multi sampled surfaces +- anv/gfx20: Enable depth buffer write through for multi sampled images +- iris/gfx20: Enable depth buffer write through for multi sampled images +- anv: Nuke perf_metric +- intel/perf: Fix intel_gem.h include +- anv: Drop useless '>= 0' check over a unsigned +- anv: Add anv_device_perf_close() +- anv: Fix context id or exec queue used to open perf stream +- anv: Add warning about mismatch between query queues +- anv: Make sure all previous vm binds are done before execute perf query pool +- anv: Check if vkCreateQueryPool() is being created in a supported queue +- anv: Add documentation to some fields in anv_query_pool +- anv/query: Fix batch end value +- anv: Improve readbility of khr_perf_query_availability_offset() and khr_perf_query_data_offset() +- anv: Fix condition to clear query pool with blorp +- intel: Split anv_xe_wait_exec_queue_idle() and move part of it to common/ +- iris: Use xe_queue_get_syncobj_for_idle() +- anv: Optimize vkQueueWaitIdle() on Xe KMD +- intel/perf: Sync oa-lnl.xml +- intel/perf: Add BMG OA support +- intel/genxml: Append 'Z Async Throttle settings' to gfx125 definition of STATE_COMPUTE_MODE +- anv: Set all async compute registers in STATE_COMPUTE_MODE +- intel/genxml: Do small fixes in gfx20 definition of STATE_COMPUTE_MODE +- anv: Set recommended values for gfx20 async compute registers in STATE_COMPUTE_MODE +- iris: Program async compute registers of STATE_COMPUTE_MODE in compute engine +- intel/dev/xe: Set max_eus_per_subslice using topology query +- intel/dev: Fix max_cs_threads value on simulator +- intel/perf: Add OA support to ARL +- drm-uapi: Sync xe_drm.h +- intel/perf: Extend intel_perf_stream_set_metrics_id() to syncronize metrics id changes +- intel/perf: Add INTEL_PERF_FEATURE_METRIC_SYNC and check if KMD supports it +- anv: Enable perf metrics id set syncronization + +Juan A. Suarez Romero (32): + +- v3d: use operations to specify what to save in blitter +- v3d: add new clear blitter op +- v3d/ci: add failures from piglit all profile +- v3d/ci: add new flakes +- vc4: fix typo in simulator conditional declaration +- v3d/vc4/ci: add address sanitizer jobs +- v3d/ci: enable full run jobs for rpi5 +- broadcom/ci: annotate some of the failures +- v3d/ci: add new flakes in rpi5 +- v3d/ci: update number of rpi5 available for testing +- v3d: free dbuf +- broadcom/ci: use kernel8 for rpi5 full jobs +- v3d/ci: update expected results +- vc4/ci: update expected results +- Revert "ci: take igalia farm offline" +- vc4/ci: update expected results +- vc4: build simulator references conditionally +- v3d/v3dv: build simulator references conditionally +- v3d/ci: add new flake for rpi5 +- Revert "v3d: never replace a mapped bo" +- v3d: do not rebind a sampler view already rebound +- v3d/ci: add new flake for rpi5 +- v3d/ci: update expected results +- v3dv/ci: add new flake +- broadcom: do not include simulator if not available +- v3d/ci: add new flakes and timeouts +- vc4: handle nir_op_ult32 intrinsic +- v3d/v3dv/ci: update expected results +- v3d/ci: use special kernel for full jobs +- v3dv/ci: move asan fail to skip +- v3d: add new flake +- v3d/ci: add OpenCL failures + +Jude Shih (2): + +- amd/vpelib: Dynamic Command List Expansion +- amd/vpelib: Restructure CDC FE/BE + +Jules Blok (5): + +- vulkan: Add support for VK_EXT_depth_clamp_control +- radv: Add support for VK_EXT_depth_clamp_control +- anv: Add support for VK_EXT_depth_clamp_control +- nvk: Add support for VK_EXT_depth_clamp_control +- docs: Update status of VK_EXT_depth_clamp_control support + +Julia Zhang (1): + +- mesa/st: use drawable->ID as hash for drawable_ht + +Juston Li (5): + +- anv/android: remove unneeded ANB implicit import flags +- anv/android: handle R8G8B8X8 as R8G8B8A8 +- anv/android: refactor out u_gralloc tiling query +- anv/android: resolve ANB swapchain images on bind +- android: look for debug/vendor prefixed options + +K900 (5): + +- gbm: split the DRI backend into a separate library and unify backend handling +- meson: require libdrm when building gbm +- meson/gbm: clean up dependencies +- meson: remove dri-search-path +- meson/gbm: fix meson devenv for GBM backends + +Kai Wasserbäch (2): + +- fix(FTBFS): clover: use \`.getDataLayout()` with LLVM >= 20 +- fix(FTBFS): build: link OpenCL target with \`dep_clang` from top-level + +Karmjit Mahil (12): + +- freedreno: Enable the A735 +- tu: Set \`TU_ACCESS_CCHE_READ` for transfer ops with read access +- freedreno: Define SP_DITHER_CNTL (0xA9AC) +- tu: Implement VK_EXT_legacy_dithering +- ir3: Use \`foreach_instr_safe` in ir3_shared_folding +- pvr: Fix \`barier`->`barrier` typo +- pvr: Use a pixel size of \`0` for invalid pbe accum formats +- pvr: Regenerate all descriptor program data sections +- pvr: Handle DISABLE_PIXELMERGE +- tu: Fix push_set host memory leak on command buffer reset +- tu: Fix potential alloc of 0 size +- nir: Fix \`no_lower_set` leak on early return + +Karol Herbst (101): + +- spirv: generate info for FunctionParameterAttribute +- spirv: initial parsing of function parameter decorations +- spirv: handle function parameters passed by value +- nak: allow clippy::not_unsafe_ptr_arg_deref lints +- nak: fix clippy::if_same_then_else error +- rusticl/platform: add perf debug option +- rusticl/memory: remove stale TODOs +- rusticl/memory: add a couple of performance warnings +- rusticl: move more kernel related things into kernel.rs +- rusticl/kernel: make some types and fields private +- rusticl/kernel: move setting of nir metadata into lower_and_optimize_nir +- rusticl/program: move attribute_str to the spirv module +- rusticl: use blob for all serialization +- rusticl: remove unsued serialization helpers +- rusticl/kernel: add type for the convert_spirv_to_nir return value +- rusticl/kernel: move internal_args into NirKernelBuild +- clc: force linking of spirvs with mismatching pointer types in signatures +- rusticl/spirv: do not add additional new lines when joining logs +- rusticl: fix clippy lint having bounds defined in multiple places +- rusticl/program: protect against 0 length in slice::from_raw_parts +- rusticl/api: protect against 0 length in slice::from_raw_parts +- rusticl/spirv: protect against 0 length in slice::from_raw_parts +- nouveau: handle realloc failure inside cli_kref_set +- rusticl: bump rust req to 1.76 +- rusticl: use div_ceil +- rusticl: use next_multiple_of +- rusticl: use pointer byte offset API +- rusticl: replace some raw pointer casts +- rusticl: use ptr::from_ref and ptr::from_mut +- rusticl/icd: use ptr::addr_eq +- meson: centralize rust handling +- docs: move rust update policy +- meson: centralize bindgen version check +- meson: centralize checking for new enough meson for rust support +- nir: add load_global_size intrinsic +- rusticl/kernel: emit system values with the correct device size +- rusticl/kernel: handle load_global_size +- mesa: check for enabled extensions for \*UID enums +- nouveau/winsys: fix handling of NV_DEVICE_TYPE_IGP +- nouveau: use nv_devince_info and fill in PCI and type information +- nouveau: add nv_device_uuid +- nouveau: implement driver_uuid and device_uuid +- nvk: use nv_device_uuid +- nouveau: remove nouveau_check_for_uma +- nouveau: implement PIPE_PCI caps +- nouveau: use nv_device_info directly for dumping push buffers +- zink: lower 64 bit find_lsb, ufind_msb and bit_count +- zink: lower 8/16 bit alu ops vk spirv doesn't allow +- rusticl/kernel: properly respect device thread limits per dimension +- rusticl/device: Make supported 1Dbuffer formats a strict subset of 1D +- rusticl/memory: Fix memory unmaps after rework +- rusticl/image: take pitches into account when allocating memory for maps +- rusticl/image: properly sync mappings content for 1Dbuffer images +- rusticl/queue: add clSetCommandQueueProperty +- util/u_printf: do not double print format string with unused arugments +- rusticl/memory: fix sampler argument size check +- rusticl/kernel: move kernel size into Constant arg type +- rusticl/kernel: rename InternalKernelArg to CompiledKernelArg +- rusticl/kernel: remove CompiledKernelArg::size +- rusticl/kernel: make CompiledKernelArg::offset a u32 +- rusticl/kernel: make KernelArg::offset and KernelArg::binding proper u32 values +- rusticl/kernel: rename InternalKernelArgType to CompiledKernelArgType +- rusticl/kernel: move serialization code of KernelArgType +- rusticl/kernel: big kernel arg rework +- rusticl/kernel: add KernelArgType::is_opaque +- rusticl/kernel: move CompiledKernelArg decleration above the impl block +- rusticl/kernel: simplify adding internal kernel args +- rusticl/mem: do not check against image base alignment for 1Dbuffer images +- rusticl: do not use CL vector types in bindings and code +- ac/llvm: fix umul_high +- rusticl/device: limit CL_DEVICE_IMAGE_MAX_BUFFER_SIZE more aggressively +- vtn: ignore volatile on functions for now +- clc: add support for more image related extensions +- compiler/types: Add multisample vimage/vtexture types +- nir: Support multisampled images in lower_read_only_images_to_tex() +- vtn, nir: handle OpImageQueryLevels on images +- vtn: mark ImageMipmap as supported +- rusticl: reuse PipeContext +- zink: cache generated vendor and device name inside zink_screen +- rusticl/mesa: Return CStr for device and vendor names. +- clc: fix compilation error with llvm-20 +- rusticl/kernel: move dead_var options into constant space +- rusticl/platform: add env variable to disable kernel variants +- rusticl/mesa: add more workgroup_Size functions to NirShader +- rusticl/mesa: implement Clone for NirShader +- rusticl/kernel: move assign_locations into CompiledKernelArg +- rusticl/kernel: split up nir compilation into various steps +- rusticl/kernel: track if arg is dead inside CompiledKernelArg +- rusticl/kernel: add CompilationResult to hold compilation artifacts +- rusticl/kernel: add optimized Kernel variant +- rusticl: add nir debugging option +- rusticl: allow devices to be enabled by default +- docs: document gallium-rusticl-enable-drivers +- asahi: implement get_cl_cts_version +- rusticl: allow asahi to be enabled by default +- zink: add CL CTS result +- radeonsi: move si_compute::global_buffers to si_context +- nv/codegen: Do not use a zero immediate for tex instructions +- nvc0: return NULL instead of asserting in nvc0_resource_from_user_memory +- rusticl/kernel: fix kernel variant selection +- vtn: handle struct kernel arguments passed by value + +Kenneth Graunke (48): + +- intel/brw: Don't force g1's live range to be the entire program +- intel/brw: Delete fs_reg_alloc::discard_interference_graph() +- intel/brw: Record that SHADER_OPCODE_SCRATCH_HEADER uses g0 +- intel/brw: Record g0 as live for sends with send_ex_desc_scratch set +- intel/brw: Only force g0's liveness to be the whole program if spilling +- intel/brw: Mark all UBO access with a direct buffer index as speculative +- intel/brw: Delete the brw_fs_opt_peephole_select() pass +- intel/brw: Delete the brw_fs_opt_dead_control_flow_eliminate() pass +- intel/brw: Replace predicated break optimization with a simple peephole +- intel/brw: Set appropriate types for 16-bit sampler trailing components +- intel/brw: Drop image_{load,store}_raw_intel handling +- intel/brw: Pass opcode to brw_swsb_encode/decode +- intel/brw: Fix Xe2+ SWSB encoding/decoding for DPAS instructions +- intel: Fix bad align_offset on global_constant_uniform_block_intel +- intel/brw: Fix OOB reads when printing instructions post-reg-alloc +- intel/brw: Print blocks in brw_print_instructions_to_file() +- intel/brw: Add a file parameter to idom_tree::dump() +- intel/brw: Switch from LSC CMASK opcodes to regular LOAD/STORE +- intel/brw: Get rid of the lsc_msg_desc_wcmask helper +- intel/brw: Drop misguided sign extension attempts in extract_imm() +- intel/brw: Fix extract_imm for subregion reads of 64-bit immediates +- intel/brw: Use NUM_BRW_OPCODES in can_omit_write() check +- intel/brw: Use size_written for NoMask instructions in is_partial_write +- intel/brw: Rename lsc_aop_for_nir_intrinsic to "op" instead of "aop" +- intel/brw: Handle load/stores in lsc_op_for_nir_intrinsic() +- intel/brw: Introduce new MEMORY_*_LOGICAL opcodes +- intel/brw: Expose functions to convert LSC enums to strings +- intel/brw: Pretty-print memory logical opcodes +- intel/brw: Lower MEMORY_OPCODE_*_LOGICAL to LSC messages +- intel/brw: Lower MEMORY_OPCODE_*_LOGICAL to HDC messages +- intel/brw: Switch load_num_workgroups to the new memory intrinsic +- intel/brw: Switch to emitting MEMORY_*_LOGICAL opcodes +- intel/brw: Switch load_ubo_uniform_block_intel over to memory intrinsics +- intel/brw: Delete old-style surface and A64 message opcodes +- intel/brw: Don't include sync.nop in INTEL_DEBUG instruction counts +- nir: Don't generate single iteration loops to zero-initialize memory +- intel/brw: Fix spill/fill count for load/store_scratch in SIMD32 +- intel/brw: Delete Gfx7-8 code from emit_barrier() +- intel/brw: Make a ubld temporary in emit_barrier() +- intel/brw: Fix register and builder size in emit_barrier() for Xe2 +- intel/brw: Delete more Gfx8 code from brw_fs_combine_constants +- intel/brw: Use whole 512-bit registers in constant combining on Xe2 +- brw: Optimize 16-bit texture fetches later +- intel/brw: Allow immediates in the BFE instruction on Gfx12+ +- brw/validate: Return an error for Align16 access mode on Icelake+ +- brw/emit: Fix align16 3src subregister encodings for HF types +- intel: Set shader_spilling_rate=11 in intel_clc +- brw: Fix try_rebuild_source's ult32/ushr handling to use unsigned types + +Konrad Dybcio (1): + +- freedreno: Add initial A621 support + +Konstantin (5): + +- util: Handle more formats in get_plane_(width|height) +- vtn: Remove dead shader_call_data from all RT stages +- radv: Handle instruction encodings > 8 bytes when splitting disassembly +- radv: Handle repeated instructions when splitting disassembly +- meson: Allow building lavapipe without specifying llvmpipe + +Konstantin Seurer (28): + +- llvmpipe: Improve PIPE_QUERY_PIPELINE_STATISTICS accuracy +- lavapipe: Fix box sorting during ray traversal +- aco: print s_delay_alu INSTSKIP>3 correctly +- gallivm: Use variables for loop exec mask handling +- gallivm: Skip inactive branches +- radv: Synchronize shader dumping +- llvmpipe: Use derivative intrinsics +- nir: Introduce nir_debug_info_instr +- spirv: Emit nir_debug_info_instr for OpLine +- nir/print: Add a helper for generating debug info +- nir/opt_loop: Fix handling else-breaks in merge_terminators +- gallium,st/mesa: Add and set pipe_image_view::is_2d_view_of_3d +- lavapipe: Implement VK_EXT_image_2d_view_of_3d with sparse textures +- lavapipe: Do not adjust imageGranularity for different block sizes +- radv: Work around broken terrain in Warhammer III +- radv: Initialize sqtt state before meta state +- radv: Workaround apps using ray tracing when it is unsupported +- lavapipe: Fix report_ray_intersection affecting terminated rays +- lavapipe: Do not return in report_ray_intersection +- radv: Fix report_ray_intersection affecting terminated rays +- lavapipe: Implement clustered reductions +- lavapipe: Implement VK_KHR_shader_subgroup_rotate +- Revert "lavapipe: Do not return in report_ray_intersection" +- lavapipe: Set minAccelerationStructureScratchOffsetAlignment to 8 +- radv/meta: Do not pass NULL to vk_texcompress_astc_finish +- nir/print: Fix the alignment of 8-bit definitions +- radv: Remap 10 and 12 bit formats to 16 bit formats +- radv: Move ac_addrlib to the physical device + +Kovac, Krunoslav (4): + +- amd/vpelib: DPP starting changes +- amd/vpelib: MPC refactoring HW registers +- amd/vpelib: Refactor MPC registers +- amd/vpelib: Reuse existing float to reg format conversion + +Lars Harrison (3): + +- Provide error message on unhandled VK structs +- Always initialize gfxstream vulkan +- Fix coherent memory allocation to use device + +Lars-Ivar Hesselberg Simonsen (12): + +- panfrost: Move ForEachMacros into panfrost +- pan/genxml: Add pan_pack_nodefaults() +- panfrost: Fix near/far depth clip +- panfrost: Enable support for depth clamping +- panvk: Fix paths that assume a fragment shader +- panvk: Fix use of cs_sync64_wait in cmd_wait_event +- panvk: Map device memory for PANVK_DEBUG=trace +- panvk: Use the passed Vertex Attribute divisor +- panvk: Only set index buffer size for DrawIndexed +- panvk: Add get_tiler_idvs_flags +- panvk: Split panvk_cmd_draw into prepare_draw +- panvk: Add support for Draw[Indexed]Indirect + +Leder, Brendan Steve (4): + +- amd/vpelib: Fix output_ctx gamma curve cache bug +- amd/vpelib: Fix backend bug for multiple instances +- amd/vpelib: Generalize visual confirm handling +- amd/vpelib: Update chip headers + +Lepton Wu (1): + +- egl/android: Fix wrong pipe format for RGB_565 + +Lin, Ricky (4): + +- amd/vpelib: Added JFIF format to RGB output side +- amd/vpelib: Use VPE_IP_LEVEL_1_0 for VPE IP 6.1.3 +- amd/vpelib: Remove extra collaborate sync commands in IB +- amd/vpelib: Increase the CD field in vpe descriptor programming + +LingMan (1): + +- nak/hw_tests: Use f32::to_bits() + +Lionel Landwerlin (139): + +- anv: split instance from anv_device.c +- anv: split physical_device from anv_device.c +- anv: split events from anv_device.c +- anv: split buffer from anv_device.c +- anv: split sampler from anv_device.c +- anv: split buffer view from anv_image.c +- anv: split image view from anv_image.c +- isl: account for protection in base usage checks +- anv: properly flag image/imageviews for ISL protection +- anv: propagate protected information for blorp operations +- anv: stop using 3DSTATE_WM::ForceThreadDispatchEnable +- anv: fix check on pipeline mode to track buffer writes +- u_trace: add mako exception printouts +- u_trace: rework tracepoint argument declaration +- u_trace: remove timestamp reference in allocations +- u_trace: add support for indirect data +- anv/hasvk: add indirect tracepoint arguments +- u_trace: add csv output +- util/debug: update parse_enable_string to deal with +all/-all +- anv: better signal new frames to utrace +- vulkan/runtime: allow null/empty debug names +- anv: reuse object string for RMV token +- anv: prevent asserts with debug printf in internal shaders +- anv: remove some unused includes +- anv: reuse cs_prog_data pointer +- intel-clc: missing printf lowering +- anv: add missing MEDIA_STATE_FLUSH for internal shaders +- anv: ensure max_plane_count is at least 1 +- genxml: unify some bits between Gfx8/Gfx11/Gfx12.5 +- intel/mi_builder: enable control flow API on Gfx9+ +- anv/blorp: force CC_VIEWPORT reallocation when programming 3DSTATE_VIEWPORT_STATE_POINTERS_CC +- anv: limit some state dirtying after blorp/simpler-shaders +- brw/rt: fix ray_object_(direction|origin) for closest-hit shaders +- intel/nir: remove load_global_const_block_intel intrinsic +- vulkan/runtime: fix GetBufferMemoryRequirements2 for maintenance4 +- anv: fix extended buffer flags usages +- anv: remove unused macro +- anv: only set 3DSTATE_CLIP::MaximumVPIndex once +- anv: add assert to detect problematic instruction merges +- anv: reuse common pipeline state for compute push allocations +- anv: simplify loading driver internal constants +- anv: move lowering of descriptor intrinsics to apply_layout +- nir: remove unused intel intrinsics +- brw: remove unused prog_data field +- anv: optimize CLIP::MaximumVPIndex setting +- anv: move conditional render predicate after gfx_flush_state +- anv: don't miss workaround for indirect draws +- anv: optimize STATE_BYTE_STRIDE emission +- anv: explicitly disable BT pool allocations at device init +- anv: always use workaround_address, not workaround_bo +- anv: order data in wa_bo to leave wa_addr last +- nir/divergence: add missing load_constant_base_ptr +- brw: switch mesh/task URB fence prior to EOT to GPU +- brw: add a comment what Gfx12.5 URB fences +- anv: fix utrace compute timestamp reads on Gfx20 +- iris: fix utrace compute end timestamp reads on Gfx20 +- brw: align spilling offsets to physical register sizes +- anv: selectively disable binding table usage on Gfx20 +- brw: remove (load|store)_raw_intel +- brw: fix TGM messages to use cmask lsc opcodes +- clc: find opencl headers from the installed llvm/clang location +- brw: use a builder of the size of the physical register for uniforms +- brw: fix vecN rebuilds +- brw: fix virtual register splitting to not go below physical register size +- anv: fix missing tracking for alpha-to-coverage runtime changes +- anv: Only flush render target cache when detecting RT changes +- iris: ensure null render target for specific cases +- brw: move null_rt control up a layer +- brw: disable null_rt only if color output does not affect other outputs +- anv: add missing pipeline instance multiplier +- brw: make sampler message emission more generic +- brw: fix mask componentation for 16-bit sampler returns +- brw: remove EOT handling from sampler messages +- intel/decoder: constify functions not modifying instructions/fields +- intel/decoder: add filter feature +- intel/decoder: split state tracking handlers from printing ones +- intel/decoder: decode the 8 BLEND_STATEs +- anv: avoid setting up a null RT unless needed +- anv: implement VK_KHR_dynamic_rendering_local_read +- anv: expose VK_EXT_pipeline_protected_access +- zink: avoid host transfer usage with sparse +- blorp: convert fast clear color for unsupported formats +- anv: limit render target cache flushing due to color output remapping +- isl: fix comment typo +- isl/tests: add a space in memcpy test error printouts +- isl/tests: fixup multi-tile testing +- isl/tests: fix coordinates for comparison scan +- isl/tests: rename ytile to be consistent with other names +- isl/tests: bump coordinate types to 32bits +- isl/tests: use modulo for more variability per pixel +- isl/tests: fix inclusive coordinate checking +- isl/tests: add TileX memcpy testing +- isl/tests: rename tile_(width|height) variables +- isl/tests: split linear & tiled buffer sizes +- anv: limit 22018402687 to impacted platforms +- anv: consolidate pre/post draw workaround in helpers +- anv: optimize WA 16011107343/22018402687 +- isl: remove duplicated copy for tileX/TileY +- isl/tests: add more coordinates for full tiles testing +- isl/tests: rename span variable to xt_sub_range_alignment +- isl: Tile W memcpy support +- iris: delete stencil mapping support +- isl: add support of aux disable bit on hiz +- anv: use stage mask to deduce cs/pb-stall requirements +- brw: remove unused prototype +- brw: delay printf lowering +- brw: avoid clashing nested loop indices +- brw: remove rebuild single element special case +- brw: fix mesh fence emission +- brw/nir: rework inline_data_intel to work with compute +- anv: stop using a binding table entry for gl_NumWorkgroups +- elk: Don't apply discard_if condition opt if it can change results +- isl: fix range_B_tile end_tile_B value +- isl/tests: add range_B_tile test +- anv: allow subresource queries on non-linear images +- anv: drop non host memory types for host-transfer on non-rebar +- anv: disable pat compression for host images +- anv: wrap binding address setting +- anv: add a host map of image for host image copy usage +- anv: factor out sanitizing mmap offset code +- anv: add VK_EXT_host_image_copy support +- anv: fix binding table entry count for compute shaders +- anv: fix missing inline parameter emission +- anv: reemit push constants on pipeline changes +- anv: use UINT32_MAX to be consistent +- anv: avoid companion usage on RCS +- anv: avoid L3 fabric flush in pipeline barriers +- anv: update some of the indirect invalidations +- vulkan/runtime: fix allocation failure handling +- anv: fix even set/reset on blitter engine +- anv: add texture cache inval after binding pool update +- anv: fix indentation +- anv: move pipe control debug to anv_util.c +- anv: fix extent computation in image->image host copies +- anv: update shader descriptor resource limits +- brw: allocate physical register sizes for spilling +- anv: fix incorrect aspect flag for depth/stencil formats +- anv: fix missing push constant reallocation +- anv: prevent access to destroyed vk_sync objects post submission + +Louis-Francis Ratté-Boulianne (18): + +- gallium: add PIPE_CAP_ASTC_DECODE_MODE and expose extension +- panfrost: add support for ASTC decode mode extension +- pan/bi: implement nir_intrinsic_load_draw_id +- panfrost: use special DrawID register on CSF-based GPUs +- panfrost: properly lower DrawID sysval on v9 GPUs +- panfrost: initialize all UBO and vertex buffer descriptors +- pan/cs: only flush block instrs if there is a pending \`if` block +- pan/cs: add block to handle registers backup in exception handler +- pan/genxml: enable decoding for SET_EXCEPTION_HANDLER +- pan/genxml: small fix to FINISH_FRAGMENT tracing +- panfrost: add debug tracing for CSF context initialization +- panfrost: create a backend-specific hook for \`emit_fbds` +- panfrost: add debug flag to enable extra CS checks +- panfrost: add support for incremental rendering in CSF +- panfrost: properly update data size of AFBC-packed resources +- panfrost: disable CRC for AFBC-packed resources +- panfrost: check the right discard property when selecting CRC target +- panfrost: properly align CRC buffer size for prefetching + +Lu Yao (1): + +- ac/radeonsi: compute htile for tile mode RADEON_SURF_MODE_1D on GFX6-8 + +Lucas Fryzek (13): + +- llvmpipe: Implement EGL_ANDROID_native_fence_sync +- lavapipe: Implement VK_KHR_external_*_fd +- venus/ci: Update skip tests to prevent timeouts +- lp: only map dt buffer on import from dmabuf +- egl/surfaceless: Add error checking to create image +- llvmpipe: Add android platform integration +- vulkan/runtime: Use DMA_BUF to import anb +- lavapipe: Add android platform integration +- docs/android: Add llvmpipe instructions +- lavapipe: Add AHB extension +- drisw: Copy entire buffer ignoring damage regions +- egl/dri/wl: Move swrast damage region from put to swap +- lp: Only close udmabuf handle if its valid + +Lucas Stach (18): + +- etnaviv: drm: use COARSE clock for BO cache timing +- etnaviv: drm: use COARSE clock for timeouts when possible +- etnaviv: hwdb: add COMPUTE_ONLY cap +- etnaviv: properly set PIPE_CAP_GRAPHICS +- etnaviv: emit SAMPLER_LOG_SIZE on sampler state changes +- etnaviv: wire up missing gl_VertexID bits +- etnaviv: allow shader machine code dumps in release builds +- etnaviv: singlethread shader variant compiles when dumping shaders +- etnaviv: emit all PA shader attributes +- etnaviv: support more VS outputs on halti5 GPUs +- etnaviv: fix total varying count assertion +- etnaviv: validate number of VS outputs against GPU limit +- etnaviv: limit number of varyings to fit into VS outputs +- gbm: mark surface buffers as explicit flushed +- etnaviv: re-emit uniforms on sampler view changes when txs is used +- etnaviv: flush shader caches when UBO content is modified +- etnaviv: Update headers from rnndb +- etnaviv: set PE_COLOR_FORMAT_OVERWRITE when no color target is active + +Luigi Santivetti (4): + +- pvr: fix calculation for textures z position fractional part +- pvr: fix when to emit ppp state uniform base address +- pvr: for fragment stages only do not insert a barrier +- pvr: really free memory in subpass render init + +M Henning (15): + +- nak: Don't mix up two types of barrier +- nak: Rename num_barriers to num_control_barriers +- nouveau/codegen: Switch to derivative intrinsics +- nv50/peephole: merge/split can swap word order +- nv50_ir_from_nir: Use getFile() more often +- nv/codegen: Use nir_lower_mem_access_bit_sizes +- nv/codegen: Limit MemoryOpt to input/output/const +- nv/codegen: Vector load ssbo/global/shared/scratch +- nv/codegen: Vector store ssbo/global/shared/scratc +- nv/codegen: Modify nir pass order +- nv/codegen: Use nir_opt_load_store_vectorize +- nv/codegen: Stop overaligning var_function_temp +- nak: Phi coalescing via biased register coloring +- nvk/cmd_buffer: Pass count to set_root_array +- nvk: Fix invalidation of NVK_CBUF_TYPE_DYNAMIC_UBO + +Maaz Mombasawala (4): + +- svga: Validate surface during copy check +- svga: Ensure an active context exists for a vmw_screen +- svga: Introduce userspace managed surfaces +- svga: Support older define commands for userspace surfaces + +Mahmood - Zer0xFF (1): + +- d3d12: Fix Xbox GDK Build + +Marcin Radomski (7): + +- Add missing includes +- Enable VK_EXT_image_drm_format_modifier in cerealgenerator.py +- Enable VK_EXT_image_drm_format_modifier on Linux guests +- vkAllocateMemory: support DRM format modifiers for dmabuf memory +- vkGetPhysicalDeviceImageFormatProperties2: support drm modifiers +- vkCreateInfo: support VkImageDrmFormatModifierListCreateInfoEXT +- Add mesa3d_platforms Soong config flag + +Marek Olšák (189): + +- nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering +- nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag +- nir/opt_varyings: improve convergent input handling to fix data corruption +- amd: expose nir_io_mix_convergent_flat_with_interpolated +- ac,radeonsi: set 16-bit flags in io_options optimally +- nir/opt_vectorize_io: optionally don't vectorize IO with different types +- ac/surface/gfx12: fix setting tile_swizzle +- radeonsi/gfx12: fix a GPU hang due to an invalid packet with window rectangles +- radeonsi: ensure TC_L2_dirty is set if we don't sync after internal SSBO blits +- radeonsi: don't set TC_L2_dirty in si_query_hw_get_result_resource +- radeonsi: fix buffer coherency issues on gfx6-8,12 due to missing PFP->ME sync +- radeonsi/gfx12: fix register programming to fix GPU hangs +- radeonsi/gfx11: prepare for CWSR +- radeonsi: program OREO_MODE optimally on gfx11.x and gfx12 +- radeonsi/gfx12: fix VS output corruption with streamout +- ac/surface/gfx12: turn off HiZ for pre-production samples +- radeonsi: strengthen the condition determining OREO_MODE +- nir: add ACCESS_KEEP_SCALAR, preventing vectorization +- util: move util_lower_clearsize_to_dword here +- radeonsi: reject insert/extract opcodes in si_vectorize_callback +- radeonsi: test more alignment cases in si_test_dma_perf +- radeonsi: add correctness tests for the clear/copy_buffer compute shader +- radeonsi: minor changes at the beginning of si_compute_clear_copy_buffer +- radeonsi: implement optimized unaligned clear/copy_buffer compute shader +- radeonsi: align waves to 256B clear/copy area for the clear/copy_buffer shader +- ac/nir: import the clear/copy_buffer compute shader from radeonsi +- ac/nir: adjust performance-related decisions for clear/copy_buffer shader +- ac/nir: adjust gfx11 tuning for the compute blit +- amd: update addrlib +- ac/nir/tess: don't allocate LDS for HS inputs that are passed via VGPRs +- radeonsi: pass TCS inputs_read mask to LS output lowering on GFX9 + monolithic +- ac: fix WAVES_PER_SH value for gfx12 +- ac/nir/meta: move the "skip compute if no DCC image stores" condition to common +- ac,radeonsi: update comments related to the L2 cache, use "L2", not "TC" +- radeonsi/gfx12: remove CP DMA workarounds because CP DMA is never used on gfx12 +- radeonsi/gfx12: disallow DCC for protected content +- radeonsi/gfx11: disable RB+ when blending +- radeonsi: use better OREO_MODE programming +- radeonsi: tweak si_test_dma_perf for better experience +- radeonsi: add a new PM4 helper radeon_event_write +- radeonsi: remove SI_CONTEXT_VGT_STREAMOUT_SYNC, emit it directly +- radeonsi: add ACQUIRE_MEM, RELEASE_MEM PWS packet helpers +- radeonsi: add si_cp_acquire_mem helper and clean up its usage for gfx6-9 +- radeonsi: add gfx10+ support into si_cp_acquire_mem +- radeonsi: use the correct cs for SQTT barriers for gfx9 +- radeonsi: add si_cp_pfp_sync_me +- radeonsi: remove unused size parameter from get_cache_policy +- radeonsi: don't use get_cache_policy in si_launch_grid_internal_ssbos +- radeonsi: move get_cache_policy into si_cp_dma.c +- radeonsi: don't use SI_COHERENCY_NONE in si_compute_shorten_ubyte_buffer +- radeonsi: don't use SI_COHERENCY_CB_META +- radeonsi: don't use SI_COHERENCY_CP +- radeonsi: remove all SI_COHERENCY_* flags except SI_COHERENCY_SHADER +- radeonsi: remove enum si_coherency +- radeonsi: don't use si_get_flush_flags in si_launch_grid_internal_ssbos +- radeonsi: move enum si_cache_policy and si_get_flush_flags into si_cp_dma.c +- radeonsi: remove L2_STREAM enum +- radeonsi: replace enum si_cache_policy with cp_dma_has_L2() +- radeonsi: inline si_get_flush_flags +- radeonsi: split si_launch_grid_internal to start preparing for barrier rework +- radeonsi: move compute barrier calls up one level +- radeonsi: move barrier code out of si_launch_grid_internal_ssbos/images +- radeonsi: skip no-op si_compute_blit() +- radeonsi: only set need_check_render_feedback if binding textures for PS +- radeonsi: split image saving/binding/restoring from si_launch_grid_internal_images +- radeonsi: do init/deinit for multi-dispatch compute blits only once +- radeonsi: don't flag both VS and PS partial flush in si_barrier_before_internal_op +- radeonsi: remove CP DMA code for GDS & L2 prefetch in the clear/copy_buffer path +- radeonsi: use si_barrier_{before,after}_internal_op for CP DMA +- radeonsi: remove SI_OP_SYNC_CPDMA_BEFORE, always sync CP DMA +- radeonsi: fold si_improve_sync_flags logic into si_barrier_before_internal_op +- radeonsi: fix waiting for PS to finish if there are no CB/DB bindings on gfx6-8 +- radeonsi/gfx12: fix shader uploads via CP DMA +- ac/nir/meta: tune clear/copy_buffer performance for gfx6-10.3 +- radeonsi: fix broken tessellation in Unigine Heaven +- radeonsi: remove SI_OP_CS_IMAGE +- radeonsi: inline si_launch_grid_internal_images without the barriers +- radeonsi: move barriers out of si_launch_grid_internal_ssbos +- radeonsi: remove unnecessary barriers from clear_buffer_rmw and clear_dcc_msaa +- radeonsi: add barrier helpers for simple internal buffer ops +- radeonsi: move barriers out of si_compute_clear_copy_buffer & si_cp_dma_* +- radeonsi: move barriers out of si_clear_buffer +- radeonsi: move barriers out of si_copy_buffer +- radeonsi: move barriers out of si_compute_shorten_ubyte_buffer +- radeonsi: remove SI_OP_SKIP_CACHE_INV_BEFORE +- radeonsi: remove SI_OP_SYNC_AFTER +- radeonsi: always pass flags=0 into si_barrier_after_internal_op/simple_buffer_op +- radeonsi: always wait for idle before get_query_result_resource +- radeonsi: remove unused SI_OP_IS_NESTED +- radeonsi: remove SI_OP_SYNC_BEFORE* flags +- radeonsi: change the flags parameter of si_compute_blit to bool fail_if_slow +- radeonsi: remove unused flags and user_flags params from clear/copy functions +- radeonsi: enable conditional rendering for si_compute_shorten_ubyte_buffer +- radeonsi: replace SI_OP_CS_RENDER_COND_ENABLE with bool render_condition_enable +- radeonsi: remove SI_CP_DMA_CLEAR_METHOD, call si_cp_dma_clear_buffer directly +- radeonsi: remove unused buffer_subdata fallback from si_clear_buffer +- radeonsi: rename "cache_flush" -> "barrier" +- radeonsi: move most barrier code into a new file si_barrier.c +- radeonsi: update remaining comments related to the L2 cache, use "L2", not "TC" +- radeonsi: rename TC_L2_dirty -> L2_cache_dirty +- radeonsi: check sctx->flags inside si_emit_barrier_direct +- radeonsi: rewrite si_memory_barrier to generate fewer barrier flags +- radeonsi: rename si_context::flags -> barrier_flags +- radeonsi: move initial framebuffer barrier code into si_barrier.c +- radeonsi: restructure fb_barrier code in si_launch_grid +- radeonsi: move CB synchronization into si_fb_barrier_after_rendering +- radeonsi: move DB synchronization into si_fb_barrier_after_rendering +- radeonsi: add si_fb_barrier_before_rendering +- radeonsi: rename SI_CONTEXT_* flags to SI_BARRIER_* flags +- radeonsi: consolidate code around unsetting barrier_flags in emit_barrier +- radeonsi: remove an obsolete comment about SMEM stores +- radeonsi: skip CB_META/DB_META events when they are redundant on gfx10-11 +- radeonsi: deduplicate how GLM flush flags are set for gfx10-11 +- radeonsi: merge the CB/DB conditional blocks in gfx10_emit_barrier +- mesa,glsl,gallium: add GL_OVR_multiview +- radeonsi: check and update compute_is_busy in get_reduced_barrier_flags +- radeonsi: don't sync VS and PS if they are idle +- radeonsi: count VS/PS/CS/L2 flushes in get_reduced_barrier_flags +- radeonsi: deduplicate code emitting VGT_FLUSH/PIPELINESTAT events +- radeonsi: don't sync CS and PS before rendering if there are no FBO attachments +- radeonsi: remove CB sync after FMASK and DCC decompression +- radeonsi: wait for idle after end_query in si_test_blit_perf +- radeonsi: don't pad esgs_vertex_stride if it's 0 +- radeonsi: adjust GFX12 checks in si_compute.c +- radeonsi: clean up and make corrections to si_create_fmask_expand_cs +- radeonsi/aco: fix asm dumps to debug output via radeonsi_debug_disassembly=true +- radeonsi: move si_execute_clears barrier code into separate functions +- radeonsi: move barriers out of si_execute_clears +- radeonsi: execute clears at resource allocation using compute instead of gfx +- radeonsi: remove barriers around clears using aux_context.compute_resource_init +- radeonsi: don't use VS/PS/CS partial flushes if we use a TS event +- radeonsi: clean up set_log_context code for all aux contexts +- radeonsi: use ACO on GFX11.5 with LLVM 18 or older to work around GPU hangs +- radeonsi: don't insert any barrier after the copy for PIPE_MAP_READ +- radeonsi: rename hw_level -> view_level +- radeonsi: remove the make_texture_descriptor indirect function call +- nir/opt_shrink_vectors: shrink memory loads, not just IO +- ac: make sure VEGA20 and MI200 version ranges don't overlap with other chips +- ac/nir: set .image_dim and .image_array for all opcodes +- ac/llvm: use LLVM processor gfx942 for GFX940 when it's available +- ac/llvm: don't use the 64-bit umul_hi workaround with LLVM 19.1 +- ac/gpu_info: print 32bpp modifiers +- st/mesa: copy some TES shader info fields to TCS +- nir/opt_vectorize_io: fix skipped output vectorization if inputs were vectorized +- nir/opt_vectorize_io: fix stack buffer overflow with 16-bit output stores +- nir/opt_load_store_vectorize: add entry::num_components +- nir: add hole_size parameter into the vectorize callback +- nir: reject unsupported component counts from all vectorize callbacks +- nir/opt_load_store_vectorize: allow overfetching, merge overfetched loads +- nir/opt_load_store_vectorize: allow a 4-byte hole between 2 loads +- nir/opt_load_store_vectorize: vectorize load_smem_amd +- gallium/u_threaded: fix crash in tc_create_image_handle due to resource == NULL +- radeonsi: set the valid buffer range for bindless image buffers +- nir/opt_varyings: assign locations of no_varying IO for TCS outputs only +- nir/opt_varyings: pack TCS inputs with cross-invocation access together +- ac/nir: rewrite ac_nir_lower_ps epilog to fix dual src blending with mono PS +- nir: add shader_info::tess::tcs_same_invocation_inputs_read(_indirect) +- mesa_interface: remove unused stuff +- mesa_interface: replace opaque __DRIscreen with struct dri_screen everywhere +- mesa_interface: replace opaque __DRIcontext with struct dri_context everywhere +- mesa_interface: replace opaque __DRIdrawable with struct dri_drawable everywhere +- mesa_interface: remove redundant __DRI*Rec types +- mesa_interface: rename __DRIconfig to struct dri_config +- mesa_interface: rename __DRIimage to struct dri_image +- nir: add nir_gather_tcs_info, new gathering/analysis pass +- nir: rename load_cull_small_primitives -> triangles, add load_cull_small_lines +- nir: rename load_cull_small_primitive_precision -> triangle, add line_precision +- nir: add cull_triangles\_, cull_lines\_ prefixes to viewport_xy_scale_and_offset +- ac/nir,radeonsi: use load_cull_small_lines_enabled_amd +- ac/nir,radeonsi: use load_cull_small_line_precision_amd +- ac/nir,radeonsi: use load_cull_line_viewport_xy_scale_and_offset_amd +- radeonsi: rewrite how small prim precision is passed to culling code +- radeonsi: simplify util_rast_prim_is_lines_or_triangles +- radeonsi/gfx11: fix Z corruption for Blender +- amd: move Tonga and Iceland TC-compat HTILE workarounds to ac_gpu_info.c +- radeonsi/gfx12: fix AMD_DEBUG=nodcc not working +- glsl: lower IO in the linker if enabled, don't lower it later +- nir: rename nir_io_glsl_lower_derefs -> nir_io_has_io_intrinsics +- nir: print interp_mode better +- nir/lower_io: change INTERP_MODE_NONE to SMOOTH when NONE means SMOOTH +- nir: add nir_clear_divergence_info, use it in nir_opt_varyings +- gallium: set proper type for pipe_shader_state::ir::nir +- st/mesa: fix incorrect types of shader CSOs +- amd/ci: adjust stoney traces checksums +- glsl: fix accidentally disabling nir_opt_varyings for all drivers +- Revert "amd/ci: adjust stoney traces checksums" +- st/mesa: implement key->persample_shading for lowered IO +- nir/lower_clip_disable: handle non-scalar store intrinsics +- radeonsi: fix gl_FrontFace elimination when one side is culled + +Mark Burton (1): + +- gallivm: Fix compilation errors when using LLVM 13. + +Mark Collins (2): + +- freedreno/registers: Document TPL1_2D_SRC_CNTL register +- tu: Improve 2D buffer-to-image copies for A7XX + +Martin Krastev (12): + +- svga/ci: enable vmware farm +- svga/ci: change DNS server for vmware jobs +- svga/ci: triage unexpected piglit pass +- svga/ci: update FORCE_KERNEL_TAG +- svga/ci: triage unexpected piglit passes +- svga/ci: disable vmware-qemu-traces jobs +- svga/ci: disable vmware farm +- svga/ci: enable vmware farm +- svga/ci: triage piglit tessellation failures +- svga/ci: disable vmware farm +- svga/ci: enable vmware farm +- svga/ci: disable vmware farm + +Martin Roukala (né Peres) (9): + +- radv/ci: update the flakes lists +- zink/ci: update the flakes list for ga106 +- radv/ci: document more vkcts flakes +- zink/ci: document more flakes +- radv/ci: run vkcts-navi31-valve pre-merge +- freedreno/ci: fix the stage of the a750 jobs +- radv/ci: document more vkcts flakes +- zink/ci: document more radv flakes +- zink/ci: bump the polaris10 timeout to 30 minutes + +Mary Guillemard (61): + +- panvk: Update CI fails +- docs: Add panvk in VK_EXT_private_data to features.txt +- panvk: Implement CmdPushDescriptorSetWithTemplate2KHR +- panvk: Fix image support in vertex jobs +- panvk: Emit viewport when state is NULL +- panvk: Implement CmdBindDescriptorSets2KHR, CmdPushConstants2KHR and CmdPushDescriptorSet2KHR +- panfrost: Add B8G8R8A8_SNORM in panfrost_pipe_format +- panvk: Pass attrib_buf_idx_offset to desc_copy_info +- panvk: Fix NULL deref on model name when device isn't supported +- panvk: Skip blend descriptors when no fragment shader is present +- bi: Add bi_foreach_ssa_dest +- bi: Use bi_foreach_src in bi_foreach_ssa_src +- bi: Do not mark tex ops as skip when dest is used by control flow +- bi: Use nir_opt_load_store_vectorize +- bi: Use nir_opt_shrink_stores +- bi: Add loop_header in bi_block +- bi: Rewrite dead code elimination +- panvk/ci: Update G52 fail list to represent current state +- panvk: Properly propagate helper invocations requirement +- panvk: Expose trivial google extensions +- panvk: Fix viewport calculation +- nak: Use F2FP for nir_op_pack_half_2x16_split on SM86+ +- panvk: Close batch in CmdBeginRendering if already open +- panvk: Manually copy dynamic state in meta_gfx +- panvk: Simplify meta ctx save/restore +- panvk: Save and restore vertex buffer in gfx_meta +- panvk: Remove unused vk_vertex_input_state in graphics_save_ctx +- panvk: Extend the descriptor lowering pass to support Valhall +- panvk: Extend the shader logic to support Valhall +- nak: Fix wrong encoding for F2FP cbuf and immediate forms +- panfrost: Update panfrost_drm.h +- panfrost: Update panthor_drm.h +- pan/kmod: Add timestamp uapi support +- panfrost: Move occlusion query allocation to create_query +- panfrost: Implement timer queries +- docs: Mark timer queries as done on panfrost +- panvk: Add atomic modifiers for r32 based formats +- panvk: Reject SNORM only for color attachment and blend +- panvk: Return proper buffer format feature flags +- pan/lib: Simplify pan_blitter +- panvk: Include new failures from VKCTS 1.3.9.1 +- panvk: Ensure to clear dirty dynamic state in panvk_cmd_draw +- mr-label-maker: Add panvk +- pan/va: Define the LEA_BUFFER instruction +- panvk/ci: Update Mali-G52 status with full VKCTS +- panvk/ci: Update to run full CTS on G52 +- panvk/ci: Make panfrost-g52-vk run at premerge +- egl: Support NV_context_priority_realtime +- panfrost: Update panthor_drm.h with latest +- pan/kmod: Add priority query uapi support +- panfrost: Implement context priority on v10 +- panvk/ci: Increase panfrost-g52-vk fraction +- panvk: Cleanup rw_nc pool in panvk_device_cleanup_mempools +- hk: Implement global priority extensions +- panvk: Define primitive size for RUN_TILER/RUN_IDVS +- winsys/nouveau: Rework to use u_pipe_screen_lookup_or_create +- winsys/nouveau: Reformat to stop relying on tabs +- nir: Move atomic_op_to_alu to common code +- hk: Fill deviceUUID +- panvk: Ensure that render_info is not null in force_fb_preload +- bi: Execute nir_opt_algebraic after nir_lower_pack + +Matt Coster (3): + +- pvr: Fix ds subtile alignment NULL pointer dereference +- pvr: Fix reordering of sub-cmds when performing ds subtile alignment +- pvr: Use vk_buffer_view base + +Matt Turner (14): + +- intel/clc: Free parsed_spirv_data +- intel/clc: Free disk_cache +- intel/brw: Use REG_CLASS_COUNT +- intel/elk: Use REG_CLASS_COUNT +- docs: Drop references to LIBGL_DRIVERS_PATH +- util: Add ATTRIBUTE_OPTIMIZE(flags) +- util: Force emission of stack frame in stack unit test +- nir: Skip opt_if_merge when next_if has block ending in a jump +- nir/tests: Add tests for opt_if_merge +- anv: Set shader_spilling_rate=15 by default +- anv: Set shader_spilling_rate=11 +- anv: Align anv_descriptor_pool::host_mem +- vulkan: Avoid pointer aliasing +- nir: Get correct number of components + +Mauro Rossi (2): + +- nvk: Fix regression observed on Kepler +- android: gbm: split the DRI backend into separate library + +Maíra Canal (2): + +- v3d, vc4: remove unused functions +- v3d: Don't use performance counters names array with an older kernel + +Michael Catanzaro (1): + +- Disk cache should consistently create parent directories when needed + +Michael Cheng (2): + +- anv: Remove extra hdc_flush from Perfetto +- anv: move trace logic to batch_emit_pipe_control_write + +Michel Dänzer (10): + +- Revert "radeonsi: remove CB sync after FMASK and DCC decompression" +- radeonsi: Revert to GLSL_SAMPLER_DIM_2D in si_create_fmask_expand_cs +- util/mesa-db: Make mesa_db_lock robust against signals +- util/mesa-db: Recreate files if header load or index update fails +- util/mesa-db: Reserve hash table for total number of index entries +- util/mesa-db: Use single read for whole index +- util: Use persistent array of index entries +- util/mesa-db: Further simplify mesa_db_compact +- Revert "util/mesa-db: Further simplify mesa_db_compact" +- Revert "util: Use persistent array of index entries" + +Michel Zou (1): + +- ac/gpu_info: Fix missing prototype mingw error + +Mike Blumenkrantz (389): + +- zink: move some flags from driver_workarounds to separate struct +- zink: hash in some driver workaround flags to shader cache +- zink: add lavapipe to can_do_invalid_linear_modifier list +- zink: use local screen variable in zink_prep_fb_attachment +- zink: use GENERAL layout for depth attachments on some drivers +- zink: use blake3 instead of sha1 for program cache +- gallium: install gallium-$version.so to libdir +- ci: prune dri from LD_LIBRARY_PATH +- dril: rework config creation +- llvmpipe: handle vma allocation failure +- llvmpipe: only use vma allocations on linux +- glx: delete tests +- st/interop: return unsupported without screen hooks +- dri: fix kmsro define +- glx: tabs -> spaces +- glx: delete __GLXDRIdisplay +- glx: simplify zink init variables +- glx: rework __glXInitialize +- glx: delete dri3_display::loader_extensions +- glx: move has_multibuffer to base glx_screen struct +- glx: move driwindows_display::event_base to driwindows_context +- glx: rework/simplify drisw zink handling +- glx: move dri2Hash to base glx_display +- glx: tweak dri2 init ordering +- glx: move dri2 init checks to separate function +- glx: move dri2 loader extensions to static vtable +- glx: delete all driver-specific display handling +- glx: use base screen has_multibuffer for drisw +- glx: move drisw dri3 handling for zink out to glxInitialize +- glx: consolidate some LIBGL_KOPPER_DISABLE checks +- glx: unhackify inferring zink from dri3 +- Revert "vl/dri3: use loader's dri3 init code and delete everything else" +- loader: move some common dri3 functions out of dri3 loader +- meson: move glx subdir after gallium build +- loader: split out dri3 into subdir +- dri: declare IMAGE_DRIVER symbols PUBLIC +- glx: inline IMAGE_DRIVER functions +- dri: declare DRI_CORE functions public +- loader/dri3: inline DRI_CORE +- glx: inline DRI_CORE functions +- glx: inline createNewScreen3 +- glx: delete __DRImesaCoreExtension usage +- glx: inline DRI_SWRAST +- dri: delete __DRImesaCoreExtensionRec::createNewScreen +- glx: inline DRI_DRI2 +- dri: delete driCreateNewScreen2 +- dri: declare DRI_KOPPER as PUBLIC +- glx: use local var for LIBGL_KOPPER_DISABLE check +- glx: switch kopper check for setting up swapinterval/bufferage functions +- glx: delete DRI_KOPPER +- glx: rename driswCopySubBuffer +- drisw: declare DRI_COPY_SUB_BUFFER as PUBLIC +- glx: inline DRI_COPY_SUB_BUFFER +- dri: delete __DRI_COPY_SUB_BUFFER +- glx: delete releaseTexBuffer +- glx: delete the only users of __DRItexBufferExtension::setTexBuffer +- dri: delete __DRItexBufferExtension::setTexBuffer +- dri: declare DRI_TEX_BUFFER functions public +- glx: delete check for texbuffer extension +- glx: inline DRI_TEX_BUFFER +- loader/dri3: delete texBuffer struct member +- glx: delete DRI_TEX_BUFFER handling +- kopper: increment lastStamp during invalidate +- dri: add kopper stubs to avoid build failures +- dri: make DRI2_RENDERER_QUERY interface PUBLIC +- glx: inline __DRI2_RENDERER_QUERY +- dri: delete __DRI2_RENDERER_QUERY interface +- glx: always expose GLX_ARB_context_flush_control +- dri: make DRI_FLUSH public +- kopper: reuse dri2 invalidate +- loader/dri3: inline DRI2_FLUSH +- glx: inline DRI2_FLUSH +- loader/dri3: delete DRI2_FLUSH remnants +- dri: unify DRI2_CONFIG_QUERY interfaces +- dri: make DRI2_CONFIG_QUERY public +- dri: move swapinterval functions from loader to dri frontend +- dri: remove __DRI2configQueryExtension param from swapinterval functions +- glx: inline DRI2_CONFIG_QUERY +- loader/dri3: delete DRI2_CONFIG_QUERY remnants +- dri: make DRI2_THROTTLE public +- glx: inline DRI2_THROTTLE +- dri: delete DRI2_THROTTLE interface +- dri: make DRI_INTEROP public +- glx: inline DRI_INTEROP +- glx: delete checks for interop support +- glx: delete more DRI_INTEROP checks +- glx: call dri interop functions directly +- glx: stop binding DRI_INTEROP +- glx: expose GLX_MESA_gl_interop for drisw +- glx: simplify DRI_IMAGE checks +- dri: make DRI_IMAGE public +- gallium: move loader_dri_create_image to dri frontend +- dri: use image loader function directly in dri_create_image_with_modifiers +- loader/dri3: inline DRI_IMAGE usage +- loader/dri: delete DRI_IMAGE remnants +- glx: delete DRI_IMAGE remnants +- loader/dri3: delete loader_dri3_extensions +- glx: move up glx_driver enum declaration +- glx: include src/gallium for apple +- dri: link with libloader +- dri: only link_whole once +- kopper: check swapchain size after possible loader image resize +- egl/dri2: delete unused function +- egl: inline DRI_CORE usage +- gbm: inline DRI_CORE usage +- egl/gbm: delete remnants of DRI_CORE +- dri: publicize dri_query_compatible_render_only_device_fd +- gbm: inline DRI_MESA +- egl: inline DRI_MESA +- gbm: delete DRI_MESA remnants +- egl: delete DRI_MESA remnants +- egl: inline DRI_KOPPER +- egl/gbm: remove DRI_KOPPER remnants +- dri: make driSWRastQueryBufferAge public +- egl: inline DRI_SWRAST +- egl/gbm: rip out DRI_SWRAST +- egl: collapse driCreateNewDrawable conditional +- egl: simplify a non-swrast conditional +- egl/drm: always expose EXT_buffer_age +- gbm: inline DRI_IMAGE_DRIVER +- egl/gbm: delete DRI_IMAGE_DRIVER remnants +- dri: store dmabuf import caps to dri_screen on create +- dri: check for dmabuf_import at top of dri2_from_dma_bufs +- gbm: inline dri2_from_dma_bufs and delete now-redundant check in dmabuf import +- egl: simplify MESA_drm_image enablement +- dri: make a few more functions public +- egl: inline simple DRI_IMAGE usage +- gbm: inline simple DRI_IMAGE usage +- egl: swap DRI_IMAGE checks for dmabuf/modifier support for driver check +- egl/android: check PIPE_CAP_NATIVE_FENCE_FD directly +- egl/wayland: delete check for DRI_IMAGE blitImage in drm init +- egl: inline DRI_IMAGE compression modifier handling +- egl: replace display_name checks with swrast checks +- egl: always check dri caps for EGL_WL_bind_wayland_display support +- gbm: check for dmabuf import on device creation +- gbm: use new dmabuf import flag to replace DRI_IMAGE checks +- gbm: always pass through images to unmap +- gbm: check for compression modifiers on device create +- gbm: use driver check for dmabuf export +- gbm: delete DRI_IMAGE remnants +- egl: move some extension enablement out DRI_IMAGE conditional +- egl: delete last DRI_IMAGE check +- egl: delete DRI_IMAGE remnants +- egl: delete local_buffers +- egl/x11: replace DRI_DRI2 check with fd check +- egl: delete DRI_DRI2 remnants +- gallium: make some sw screen create functions public +- pipe-loader: fix driconf memory management +- egl: inline CONFIG_OPTIONS usage +- egl: delete DRI_CONFIG_OPTIONS remnants +- egl: further simplify driver loading +- egl: add a bool to indicate swrast but not kms_swrast +- gbm: inline DRI_FLUSH +- gbm: delete DRI_FLUSH remnants +- egl: use a non-kms_swrast check for swapbuffers DRI2_FLUSH +- egl: use a swrast check for a swapbuffers DRI2_FLUSH case +- egl: delete a DRI2_FLUSH check in dri2 swapbuffers +- egl: replace another DRI2_FLUSH check with non-kms_swrast check in dri2_wait_client +- egl: inline DRI2_FLUSH usage +- egl: replace another DRI2_FLUSH check with a non-kms_swrast check in surface query +- egl: replace a DRI2_FLUSH check with swrast check in x11_copy_buffers +- egl/drm: replace a DRI2_FLUSH check with a non-kms_swarst check +- egl/wayland: replace a couple DRI2_FLUSH checks with non-kms_swrast checks +- egl: delete DRI2_FLUSH remnants +- dri: only add DRI_BUFFER_DAMAGE if the driver supports set_damage_region +- egl: simplify DRI_BUFFER_DAMAGE checks +- dri: make set_damage_region public +- egl: add a util to get the pipe_screen +- egl: use pipe_screen check for damage region, simplify buffer_damage checks +- egl: inline DRI_BUFFER_DAMAGE +- egl: delete DRI_BUFFER_DAMAGE remnants +- egl: always advertise KHR_context_flush_control +- egl: delete DRI_FLUSH_CONTROL remnants +- egl: inline DRI2_CONFIG_QUERY +- egl: delete DRI_CONFIG_QUERY remnants +- egl: inline DRI_TEX_BUFFER +- egl: delete DRI_TEX_BUFFER remnants +- dri: make DRI2_FENCE public +- egl: delete checks for DRI2_FENCE +- egl: simplify native fence check +- egl: don't check for create_fence_fd method when importing a fence +- egl: inline DRI2_FENCE +- egl: delete DRI2_FENCE remnants +- egl: always advertise MESA_gl_interop, inline calls +- egl: delete DRI2_INTEROP remnants +- dri: make DRI2_BLOB public +- egl: inline DRI2_BLOB +- egl: delete DRI2_BLOB remnants +- egl/android: replace DRI_MUTABLE_RENDER_BUFFER_DRIVER check with non-kms_swrast check +- egl: delete DRI_MUTABLE_RENDER_BUFFER_DRIVER remnants +- egl: move multibuffers check into platform_x11 +- egl: simplify multibuffers check +- dril: always take the egl init path +- llvmpipe: init fd=-1 to avoid closing fd=0 on failure +- egl: fix zink init +- glx: replace a straggler DRI_CONFIG_OPTIONS usage +- dri: delete unused DRI_MESA queryCompatibleRenderOnlyDeviceFd method +- dri: delete __DRIdri2Extension interface +- dri: delete a couple unused functions +- dri: delete some dri_screen cruft +- dri: consolidate a bunch of dri_screen init code +- dri: delete all the extra dri2 extension handling +- egl/dri: move some screen accessors back to dri frontend +- glx: stop checking dri extensions +- glx: deduplicate and move driverName to base glx_screen struct +- glx: store the DRIscreen to the base glx_screen +- glx: set drisw glx_driver more usefully +- glx: use glx_driver to determine drisw extension enablement +- glx: add a glx_screen member to indicate GLX_EXT_texture_from_pixmap +- glx: unify extension binding +- dri: delete dri_screen::extensions accessor +- dri: delete dri_screen::extensions +- dri: fix kms_swrast screen fail +- dri: delete more dead code +- dri: pass through a type enum for creating screen instead of driver_extensions +- kopper: inline DRI_IMAGE usage +- egl: stop loading driver extensions +- glx: stop loading driver extensions +- gbm: stop loading driver extensions +- dri: massively prune the dri target +- dri: delete a bunch of unused interfaces +- egl/wayland: bail on zink init in non-sw mode if extension check fails +- egl: fix fd passing on init with zink +- zink: fix partial update handling +- egl: make dmabuf/modifier display bools a bit more precise/intelligible +- egl/glx: make detecting explicit modifier support more explicit +- dri: set __DRI_IMAGE_ERROR_BAD_PARAMETER if driver doesn't support dmabuf import +- egl: initialize dri error code variables +- kopper: reuse drisw_update_tex_buffer to delete some code +- dri: merge in loader_dri3 +- kopper: reuse dri3 functions +- dri: rename and move handle_in_fence to dri_helpers +- kopper: reuse dri_image_fence_sync +- dri: break out get_pixmap_buffer +- kopper: reuse loader_dri3_get_pixmap_buffer +- glx: always set swapInterval in drisw +- egl: check x11 multibuffer support before screen create +- dri: pass has_multibuffers through from the loader +- dri: further collapse dri_screen creation +- dri: guard dmabuf support by loader's dmabuf support +- dri: collapse some modifier/dmabuf checks +- kopper: call kopper_create_drawable directly +- kopper: delete some redundant code +- dri: rework drawable creation +- glx: simplify kopper_get_buffer_age +- glx: flatten out getBufferAge a bit +- glx: move/add __DRIdrawable struct to __GLXDRIdrawable +- glx: use __GLXDRIdrawable more broadly instead of display type drawables +- glx: unify dri bind_context +- glx: unify dri unbind context +- glx: unify dri destroy context +- glx: tweak some drisw context create code +- glx: tweak some dri3 context create code +- glx: unify dri create_context_attribs +- glx: unify renderer query hooks +- glx/dri2: strdup driver name +- glx/dri3: strdup existing driverName instead of fetching it again +- glx: unify dri get_driver_name +- glx: unify dri screen vtable +- glx: unify bind_tex_image +- glx: delete private DRIscreen pointers +- glx: move driver_configs to base glx_screen +- glx: rework screen destroy +- glx: move base screen destroy to glx_screen_cleanup +- glx/dri3: delete driScreenRenderGPU +- glx: unify more extension setup +- glx: determine is_direct_capable from driver type +- glx: inline __GLXDRIscreen onto base glx_screen +- glx: unify dri screen init +- zink: bail on choose_pdev immediately if no devices are available +- st/pbo: reject vs/fs pbo ops if rowstride < width +- device_select: reorder CreateInstance to have shorter failure paths +- device_select: shortcut EnumeratePhysicalDevice* for count-only calls +- device_select: disable reordering with zink on xwayland +- zink: delete hackaround for disabling wsi extensions +- egl: add DRI_USE_INVALIDATE with non-sw kopper +- egl: really fix kopper fd passing +- egl/wayland: split out dmabuf extension stuff for kopper +- egl: pass real value for software to dri2_setup_device() +- egl/wayland: do display gpu fd dance for non-sw zink init +- zink: don't skip cbuf store ops if resolve is set +- tc: set resolve on renderpass info if blit terminates the renderpass +- egl/android: call dri2_display_create() like every other driver +- lavapipe: VK_EXT_pipeline_robustness +- device-select: block xserver+zink reordering via xcb +- ci: add a660 flake +- dril: add zink stub +- egl: unify LIBGL_KOPPER_DRI2 checks +- egl/x11: pretend kopper is software with LIBGL_KOPPER_DRI2 +- llvmpipe: export dmabuf caps for kms_swrast +- zink: update profile with missing extensions +- revert part of 94e470a32d214052d89f519a07566d8098d05d28 +- egl: use more precise conditional for passing fd through to dri screen create +- dril: use the super fallback path for software fallback +- vk: rename DGC feature to DGC-NV +- zink: create a surface for resolve resources +- zink: use tc renderpass optimizing to inline resolves with dynamic render +- tc: break out blit enqueue to separate function +- tc: optimize out tracked winsys resolves +- dril: also create double-buffered configs in swrast fallback +- meson: delete dri3 build option +- meson: require dri3 modifiers +- ci: fix trace update script +- ci: add --pipeline-url to trace update script +- zink: delete erroneous kopper assert +- zink: set resolve layout when inferring scanout resolves +- zink: stop leaking inferred resolve surfaces +- zink: fix sparse bo deallocation +- va: flush_resource before presentation +- vl/winsys: move dri3_get_screen_for_root to common +- va/vl: add kopper support +- gallium: add PIPE_CAP_MULTIVIEW +- zink: support GL_OVR_multiview +- mesa: support OVR_multiview2 +- zink: support OVR_multiview2 +- docs: OVR_multiview features +- lavapipe: VK_EXT_pipeline_protected_access +- lavapipe: VK_KHR_global_priority +- egl: replace fd check with swrast check for dri2 +- llvmpipe: bump max point size to 256 +- egl/x11: fix dri3 connect conditional for kopper without modifiers +- lavapipe: bump mipmapPrecisionBits +- zink: add screen create to match windows LUID +- vl: rename/ifdef the x11 kopper functionality +- vl: add Windows kopper support +- va: support zink on Windows +- mesa: fix sample count handling for MSRTT +- mesa: OVR_multiview_multisampled_render_to_texture +- anv: fix video profile lists +- trace: add get_device_reset_status +- gallium: delete pipe_screen::is_compression_modifier +- egl/kopper: hook up EGL_EXT_surface_compression on wayland +- zink: hook up compression control extensions +- zink: store compression rate info for formats +- zink: implement compression control +- zink: move x8 format function to zink_format.c +- zink: fix zink_format_is_voidable_rgba_variant() +- zink: add X format variants for 30-bit formats +- zink: add some other missing X format emulation variants +- zink: add adl flake +- vk/image: fix view creation for planar video aspects +- zink: only clamp samplerview aspect for zs formats +- zink: fix planar image creation with mutable +- zink: correctly handle disjoint image creation +- zink: slightly refactor image tiling inference +- zink: swap sampler conversion conditional with yuv check +- zink: stop creating useless sampler conversions +- zink: move some image ici init up +- zink: check HAVE_LIBDRM for xf86drm.h include +- zink: only use DISJOINT for planar images +- zink: zink_modifier_prop -> zink_modifier_props +- zink: wrap all access to format_props and modifier_props +- zink: init format props dynamically +- zink: fix some indentation +- zink: delete redundant HIC usage check +- zink: add HIC back if suboptimal check fails +- zink: simplify some confusing modifier code +- zink: stop passing modifier pointers internally +- zink: rework modifier selection (again) +- zink: rework HIC image creation checks +- vk/runtime: add EXT DGC layout handling +- lavapipe: EXT DGC +- ci: bump gl cts versions +- ci: bump VVL to current week +- zink: revert compression control handling +- vl: add pipe_video_buffer::flags to sync up with pipe_resource::flags +- util/vbuf: delete/fix broken incompatible stride calc +- zink: block all 2d view creation with sparse +- zink: block dmabuf fallback into optimal tiling +- zink: assert images aren't created with dmabuf export and optimal tiling +- zink: also init format props when getting modifier props +- zink: clamp out dmabuf exports from optimal tiling images +- zink: fix external_only reporting for dmabuf formats +- zink: block srgb with winsys imports +- anv: add VK_FORMAT_G10X6_B10X6R10X6_2PLANE_420_UNORM_3PACK16 to modifier exceptions +- gallium: rework vbuf alignment pipe caps +- util/vbuf: rename/consolidate some bools +- gallium/sw: make llvmpipe the "default" sw driver +- util/framebuffer: add viewmask compare for fb equal +- iris: assert that viewmask is 0 +- gallium: delete duplicated viewmask member in draw info +- llvmpipe: expose GL multiview extensions +- zink: further improve image usage detection +- zink: rewrite the high-IQ image tiling selection loop +- device-select: only try wayland/x11 if the required vars are set +- va: fail context create if driver does not support video +- vdpau: fail context create if driver does not support video +- zink: stop leaking precompiled generated tcs +- zink: add VVL for RADV jobs + +Mike Lothian (1): + +- Revert "device_select: shortcut EnumeratePhysicalDevice* for count-only calls" + +Mitchell Kember (1): + +- [fxbug.dev] Migrate bug numbers + +Mohamed Ahmed (6): + +- nvk: Use stride in the explicit modifier case for linear images +- nil: Expose px_to_B helpers to C +- nil: Add level_layer_size_B() helper +- nvk: Add host copy functions +- nvk: Block off non-2D DRM format modifier images +- nvk: Enable VK_EXT_host_image_copy + +Nanley Chery (49): + +- anv: Drop flush from unused depth workaround +- intel/blorp: Use WA helpers for depth pipecontrol +- anv,iris: Use WriteImmediate instead of Z flush for WA +- anv: Batch MCS and CCS aux-op flushes +- anv: Add want_hiz_wt_for_image() +- iris: Add and use want_hiz_wt_for_res +- intel: Enable more LOD0 HIZ+CCS fast clears +- intel/blorp: Allow LOD0 fast-clears with HiZ WT +- intel: Adjust partial depth fast clear checks +- intel: Use a simpler workaround for HiZ WT fast-clears +- iris: Invalidate state cache for some depth fast clears +- intel: Add and use isl_get_sampler_clear_field_offset +- intel/blorp: Simplify depth clear value updates +- iris: Move a HIZ_CCS_WT fast-clear flush higher up +- intel: Move depth clear value writes to drivers +- intel/isl: Fix packing of SINT formats +- intel: Don't use HW clear color conversion on gfx11 +- intel/blorp: Don't use clear color conversion on gfx12 +- anv,iris: Skip tex invalidate for clear conversion +- anv,hasvk: Add and use set_image_clear_color() +- anv,hasvk: Always use BLORP_BATCH_NO_UPDATE_CLEAR_COLOR +- iris: Always use BLORP_BATCH_NO_UPDATE_CLEAR_COLOR +- iris: Skip some fast-clears even on color changes +- intel: Drop BLORP_BATCH_NO_UPDATE_CLEAR_COLOR +- anv: Pass the VkClearDepthStencilValue for clears +- anv: Add and use anv_image_hiz_clear_value() +- intel: Support any depth fast-clear value on Xe2 +- intel/isl: Always set EnableUnormPathInColorPipe +- anv,iris: Pack depth pixels into initialized arrays +- intel/isl: Reduce miptail slot usage to allow CCS +- intel/isl: Disable 3D Ys/Yf miptails for CCS +- anv: Change params of anv_can_fast_clear_color_view +- anv: Move and rename anv_can_fast_clear_color_view +- anv: Move exec_ccs_op and exec_mcs_op higher up +- anv: Support fast clears in anv_CmdClearColorImage +- intel: Avoid no-op calls to anv_image_clear_color +- anv: Require compression for fast-clears on gfx20+ +- anv: Use image formats when copying to/from buffers +- intel/blorp: Choose some copy formats independently +- intel/blorp: Use original surface format for some copies +- anv: Prevent clear color modifier corruption with views +- anv: Delete stale comment for BLORP clear color addr +- anv: Add an array of view formats to anv_image +- anv: Prepare dmabufs for clear color arrays +- anv: Move code out of loop in anv_CmdClearColorImage +- anv: Refactor clear color loading functions +- anv: Access more colors in fast_clear_memory_range +- anv: Load fast clear colors more often +- anv: Allow more fast clear colors for layouts + +Neha Bhende (1): + +- dri: fix macro name check to detect svga driver + +Nikita Popov (1): + +- gallium: Don't pass avx512er and avx512pf features on LLVM 19 + +Oskar Rundgren (1): + +- pvr: improve buffer copy format selection + +PODISHETTY KUMAR (1): + +- Revert "gfxstream: add VK_EXT_vertex_attribute_divisor" + +Patel, Utpal (2): + +- amd/vpelib: Add input pixel format support +- amd/vpelib: Add resource function hooks for checking support + +Patrick Lerda (8): + +- iris: fix indirect draw refcnt imbalance +- i915: fix vertex atan regression +- iris: fix iris_ensure_indirect_generation_shader() memory leak +- r600/sfn: fix class Shader object m_register_allocations memory leak +- r600/sfn: fix class Shader object last_alu_with_indirect_reg memory leak +- r600: fix spec ext_packed_depth_stencil getteximage +- glsl: fix gl_nir_validate_intrastage_interface_blocks() memory leak +- r600: fix sfn_nir_legalize_image_load_store cubearray behavior + +Paulo Zanoni (33): + +- intel/genxml: add the BLT and COMP_CTX0 versions of the TR-TT registers +- anv/trtt: make genX(init_trtt_context_state) a little more compact +- anv/trtt: check the return value of anv_trtt_init_context_state() +- anv/trtt: submit a separate batch in anv_trtt_init_context_state() +- anv/trtt: make all contexts have the same TR-TT programming +- anv/trtt: fix the process of picking device->trtt.queue +- anv/trtt: don't just crash when we can't find device->trtt.queue +- anv/xe: try harder when the vm_bind ioctl fails +- anv: don't expose the compressed memory types when DEBUG_NO_CCS +- anv: disable CCS for Source2 games on Xe2 +- intel: fix compute SLM sizes on Xe2 and newer +- anv: be consistent regarding non-render engines on i915.ko +- isl: don't assert(num_elements > (1ull << 27)) +- anv: fix compute engines when using ANV_QUEUE_OVERRIDE +- anv/trtt: mark vk_sync_get_value()'s value as defined for Valgrind +- anv: remove duplicate pipe_control workaround +- anv: remove another copy of the texture cache pipe_control workaround +- anv/trtt: set every entry to NULL when we create an L2 table +- anv/trtt: extract anv_trtt_first_bind_init() +- anv/trtt: inline anv_trtt_init_queues_state() +- anv/trtt: ensure all L3 entries are NULL-bound during init +- anv/trtt: unset trtt->l3_addr if initialization fails +- anv/trtt: fix error handling when adding binds +- anv/trtt: don't submit empty batches when there are no binds to do +- anv/trtt: decrement trtt->timeline_val when submission fails +- anv/trtt: delay batch garbage collection +- anv/trtt: remove TODO comment regarding the reloc list +- anv/trtt: convert anv_trtt_bind arrays to util_dynarray +- anv/trtt: extract anv_trtt_first_bind_init_queue() +- anv/trtt: fix the creation of sparse buffers of size 2^32 on 32bit systems +- anv/trtt: remove useless VK_RESULT checks +- brw: don't emit instruction to add zero in spilling code +- brw: add a NOP in between WHILE instructions on LNL + +Pavel Ondračka (28): + +- r300: bias presubtract fix +- ttn: switch to derivative intrinsics +- r300: fix RGB10_A2 CONSTANT_COLOR blending +- mesa: implement GL_FRAMEBUFFER_BLEND query +- r300/ci: add more flakes +- ttn: use nir_shader_get_entrypoint in nir_shader_gather_info +- ttn: unconditionally call nir_shader_gather_info +- r300: remove nir_opt_load_store_vectorize +- r300/ci: update fails list with a recent fix +- r300: move HyperZ blacklist to driconf +- r300: use fake ADD instead of MOV for negative KIL source fixup +- r300: preserve NaNs and denormals for MIN/MAX/CMP/CND +- r300: get rid of unneeded CMP before KIL +- r300: remove gl_ClipVertex early +- r300/ci: share some common variables +- r300/ci: add deqp and piglit on RV410 +- r300: add .clang-format file for the compiler +- r300: opt in to clang-format CI enforcement for the compiler +- r300/compiler: reformat using default mesa .clang-format rules +- r300/ci: update ci expectations after piglit uprev +- nir/nir_group_loads: reduce chance of max_distance check overflow +- r300/ci: add new RV410 flakes +- ci: disable ondracka farm +- Revert "ci: disable ondracka farm" +- r300: add switch to support IEEE and FF math opcodes +- r300: remove wrong Unigine Sanctuary driconf override +- r300: add driconf math mode override for Unigine Tropics and Oilrush +- r300/ci: fails update after recent piglit uprev + +Peyton Lee (1): + +- radeonsi: correct map and unmap function timing + +Pierre-Eric Pelloux-Prayer (26): + +- amd: use a valid size for ac_pm4_state allocation +- radeonsi: reject modifiers with DCC when NO_EXPORTED_DCC is used +- radeonsi: consider PIPE_BIND_LINEAR when filtering modifiers +- radeonsi: consider DBG(NO_TILING) when filtering modifiers +- egl,gbm,glx: fix log message spam +- radeonsi: don't always update shader coherency draw call counter +- gallium/ddebug: implement modifiers function +- gallium/ddebug: implement set_inlinable_constants +- gallium/ddebug: fix sampler_states handling +- gallium: add pipe_screen::get_driver_pipe_screen +- radeonsi: add si_screen helper +- kopper, dri: remove trace_screen_unwrap +- frontends/va: honor DRI_PRIME for VA_DISPLAY_WAYLAND +- radeonsi/tests: allow to run radeonsi-run-tests.py with AMD_DEBUG +- radeonsi: include the shader key in the sqtt pipeline hash +- radeonsi: check the pointer before adding an offset +- radeonsi/sqtt: don't store the offsets in the pipeline +- radeonsi/sqtt: use XXH64_update +- radeonsi/sqtt: don't leak the pipeline bo +- radeonsi/sqtt: don't store pipeline->bo in shader->bo +- radeonsi/gfx12: fill missing dcc tiling info +- radeonsi: fix radeon_canonicalize_bo_flags domain handling +- ac/surface: fix determination of gfx12_enable_dcc +- nir: skip offset=0 in nir_io_add_const_offset_to_base +- glsl: use nir_io_add_const_offset_to_base in gl_nir_opts +- radeonsi/ci: mark \*.tessellation_shader_tessellation.max_in_out_attributes as fixed + +Pohsiang (John) Hsu (1): + +- d3d12: fix incorrect memset in d3d12_video_encoder_references_manager_hevc + +Qiang Yu (29): + +- radeonsi: lower subgroup ops after wave size is known +- ac/llvm: build wqm for quad intrinsics only when fragment shader +- ac,radv,radeonsi: stop using quad vote any/all when llvm +- gallium: add caps for KHR_shader_subgroup +- mesa: add KHR_shader_subgroup extension +- mesa: implement KHR_shader_subgroup Get* values +- mesa: add spirv GroupNonUniform* capbilities support +- glsl: add KHR_shader_subgroup extensions +- glsl: add builtin variables for KHR_shader_subgroup +- glsl: remove unused builtin function macros +- glsl: use generic convertion code for some intrinsics +- glsl: add KHR_shader_subgroup_basic builtin functions +- glsl: prepare vote function args for subgroup usage +- glsl: add KHR_shader_subgroup_vote builtin functions +- glsl: prepare ballot function args for subgroup usage +- glsl: add KHR_shader_subgroup_ballot builtin functions +- glsl: add KHR_shader_subgroup_shuffle builtin functions +- glsl: add KHR_shader_subgroup_shuffle_relative builtin functions +- glsl: add KHR_shader_subgroup_arithmetic builtin functions +- glsl: add KHR_shader_subgroup_clustered builtin functions +- glsl: add KHR_shader_subgroup_quad builtin functions +- glsl: remove unused fields in gl_shader +- glsl: set shader_info.subgroup_size for KHR_shader_subgroup +- radeonsi: remove NULL check in si_determine_wave_size +- radeonsi: consider both stages to determine merged shader wave_size +- radeonsi: use wave64 for KHR_shader_subgroup enabled shader +- radeonsi: enable KHR_shader_subgroup +- nir: add skip_lower_packing_ops shader compile option +- ac: do not lower some ops in nir_lower_packing + +Rebecca Mckeever (7): + +- panvk: Move the VkEvent logic to the jm folder and make it per-arch +- panvk: Add VkEvent support to the CSF backend +- panvk/csf: Set and clear vb.dirty flag +- panvk/csf: Split tiler flags initialization into two steps +- panvk: Track VkFormats for depth and stencil attachments +- panvk: Move rendering info to panvk_rendering_state struct +- panvk/csf: Implement vkCmdExecuteCommands + +Renato Pereyra (1): + +- egl: Refactor init of CPU tracing in EGL for Android + +Rhys Perry (88): + +- nir/instr_set: combine XXH32 calls +- nir/instr_set: stop sorting phi sources +- nir/instr_set: hash tex sources commutatively +- aco/gfx11.5: workaround export priority issue +- aco: add tests for export priority issue +- aco/gfx11.5: skip dealloc_vgprs for stages with exports +- aco: add export instructions to should_form_clause +- aco: form export clauses +- aco: move s_setprio to before NGG exec initialization +- aco: consider exec empty after divergent continue then divergent break +- aco: add struct and helpers for exec potentially empty +- aco: combine DPP into v_cvt_f16_f32 +- aco: fix validation of v_s\_ opcodes +- aco: don't transform v_interp_p2_f32 with constant into fma +- docs: update ACO_DEBUG documentation for scheduler options +- docs: update ACO_DEBUG documentation for perfwarn +- nir/opt_uniform_atomics: require block index metadata +- aco: split selection_control_remove into rarely_taken and never_taken +- aco: only remove branch jumping over SMEM/barrier if it's never taken +- aco: completely skip branches if they're never taken +- aco: set prefer_remove for gfx9- too +- aco: calculate indices from dominance tree +- aco: use dominance helpers +- aco: preserve SSA in try_eliminate_scc_copy +- aco: validate temporary reachability +- aco: split insert_wait_states into two +- aco: move insert_delay_alu to after insert_NOPs +- aco: forget valu delays after certain s_waitcnt_depctr/LDSDIR +- aco: ignore exec and literals when mitigating VALUMaskWriteHazard +- aco: also consider VALU reads for VALUMaskWriteHazard +- aco: don't consider sa_sdst=0 before SALU write to fix VALUMaskWriteHazard +- aco: check SALU writing lanemask later for VALUMaskWriteHazard +- aco: preserve bitsets after a lane mask is written +- aco/tests: add more VALUMaskWriteHazard tests +- aco/ra: fix sub-dword get_reg_specified in some cases +- aco/ra: use DefInfo for get_reg_specified +- aco: have get_subdword_definition_info update DefInfo +- aco: fix printing of d16 MIMG dmask +- nir/opt_if: fix fighting between split_alu_of_phi and peel_initial_break +- nir/opt_loop: skip peeling if the break is non-trivial +- nir/opt_loop: skip peeling if the loop ends with any kind of jump +- nir/tests: add some loop peeling tests +- nir: skip opt_loop_peel_initial_break if continue block only has phis +- radv/rt: align constant data by 64 when inlining shaders +- radv/rt: don't split array/struct payload variables +- aco: do not use inline constants for 16-bit pseudo scalar trancendentals +- aco/tests: update assembler tests for llvm +- aco: workaround hazards in emit_long_jump +- nir/opt_loop: rematerialize header block derefs in their use blocks +- nir/tests: test opt_loop_peel_initial_break with derefs in header block +- aco: stop using instructions in ra_ctx::vectors +- aco: create vector affinities for phi operands +- aco: fix is_vector_intact for GFX11 BVH +- nir/divergence_analysis: disable phi undef optimization by default +- aco: ensure phis uniformized by divergence analysis are SGPR +- aco: skip uniformization of certain merge phis +- radv: fix output statistic for fragment shaders +- radv: optimize VS input load components to constants earlier +- radv: use explicitly sized types for some radv_shader_info members +- ac/llvm: cast to integer after derivative intrinsics +- aco,radv,radeonsi: add aco_shader_info::ps::has_prolog +- aco: split CounterMap off from VGPRCounterMap +- aco: minor CounterMap::operator== fix +- aco: workaround VALUReadSGPRHazard +- aco/tests: add tests for VALUReadSGPRHazard +- nir: fix shfr constant folding with zero src2 +- nir/algebraic: fix shfr optimization with zero src2 +- aco: add waitcnt build helper +- aco: fix printing of block_kind_discard_early_exit +- aco: insert NOP before dealloc_vgpr in the insert_NOPs pass +- aco: don't emit early exit over dealloc_vgprs +- aco: wait for scratch stores to complete before dealloc_vgprs +- aco: don't byte align global VMEM loads if it might be unsafe +- nir/algebraic: add bit-size check to extract_u8 pattern +- aco/gfx11: fix v1b=p_extract(src, 0, 16, 0) +- aco: check for SDWA before applying extract to lshl/cvt_f32 +- aco: disallow p_extract(,,32,) +- aco: handle SGPR limitations when applying extract +- aco: add a bit more p_extract/p_insert validation +- aco: shrink code size of some p_extract +- aco: allow applying sign-extended sel to p_extract more often +- aco: add and use apply_extract_twice helper +- aco: combine extracts with sub-dword definitions +- aco: apply extract to p_extract_vector +- aco: apply extract to v_cvt_f32_ubyte0 +- nir/algebraic: fix iabs(ishr(iabs(a), b)) optimization +- nir/algebraic: check bit sizes in lowered unpack(pack()) optimization +- nir/lcssa: fix premature exit of loop after rematerializing derefs + +Rob Clark (80): + +- freedreno/a6xx: Implement reg stomper support +- freedreno/a7xx: Fix GRAS_UNKNOWN_80F4 writes +- freedreno/cffdec: Fix a7xx CP_EVENT_WRITE decoding +- tu/drm/virtio: Add missing a7xx case +- freedreno/drm: Handle a7xx case +- freedreno: Move GENX/CALLX magic to common +- freedreno: Extract out common UBWC helper +- freedreno: Extract out shared LRZFC layout helpers +- freedreno/a6xx: Allocate lrcfc when needed for direction tracking +- freedreno/a6xx: Refactor CP_EVENT_WRITE emit +- freedreno/a6xx: Rework CCU_CNTL emit for a7xx +- freedreno/a6xx: Initial a7xx support +- tu: Fix issues with 16k (or larger) page sizes +- freedreno/drm/virtio: Fix issues with 16k (or larger) page sizes +- tu: Random non-consequential fixes +- subprojects: perfetto uprev to v47.0 +- egl: Fix surfaceless + modifiers +- gallium: Add option to not add version to libgallium filename +- freedreno/computerator: Use CHIP variant reg builders +- tu: Add helper to calculate layer address +- tu: Use CHIP variant reg builders +- freedreno/registers: Deprecate non-variant reg builders +- tu: Quiet vla warnings +- freedreno/a6xx: Fix arb_texture_view regression +- freedreno/decode: Dump register values on LRZ_CLEAR +- freedreno/a6xx: Fix LRZ +- freedreno: Re-enable LRZ for a7xx +- freedreno/a6xx: Cleanup setup_slices() +- nir/opt_loop: Don't peel initial break if loop ends in break +- freedreno/a6xx: Sysmem clear fixes +- freedreno/drm: Fix ring_heap flags +- freedreno/crashdec: Fix fault address handling +- freedreno/register: A couple pm4 updates +- freedreno/ir3: Fix GPU name in disasm test +- freedreno/ir3: Add half-FLUT cases +- freedreno/ir3: Fix cat1 parser ambiguity vs FLUTs +- freedreno/ir3: Add more cat1 float-immed cases +- freedreno/a6xx: Only emit VFD/PC_POWER_CNTL for a6xx +- freedreno/a6xx: De-open-code VFD_MODE_CNTL +- freedreno/a6xx: Cleanup WFIs around RB_CCU_CNTL +- freedreno/a6xx: Set bin size per bin +- freedreno/a6xx: Move PC_TESSFACTOR_ADDR emit +- freedreno/a6xx: Move static regs to preamble IB +- freedreno/a6xx: Move more state emit per-bin +- freedreno/a6xx: Emit CP_SET_AMBLE packets +- freedreno/drm: Add preemption support +- freedreno: Balance out u_blitter cb0 save/restore +- freedreno/a6xx: Add some missing a7xx bits +- freedreno: Assert we aren't writing to 0x0 +- freedreno/decode: Fix UBO decode on a7xx +- freedreno/ir3: Avoid draw/grid time input iteration +- freedreno/ir3: Add assert about const emit +- freedreno/ir3+tu: Convert driver-params to structs +- freedreno/ir3: Indentation fix +- freedreno/a6xx: Fix double SP_MODE_CONTROL emit +- freedreno/a6xx: Move tess-bo emit +- freedreno/ir3: Track # of app UBOs +- freedreno/ir3: Fix need_driver_params for UBO case +- freedreno: Rework indirect compute param emit +- freedreno/a6xx: Add support to load driver-params via UBO +- freedreno/a6xx: Fix color_cache_size +- freedreno/a6xx: Don't open-code INVALID_REG +- freedreno/a6xx: Add VPC hardware workaround for a750 +- freedreno/a6xx: Add missing GRAS_SU_DEPTH_CNTL +- freedreno/ir3: Create UBO variables for driver-UBOs +- nir/lower_amul: Fix ASAN error +- freedreno/a6xx: Support variable wg size +- ir3/ra: Better CL/kernel support +- freedreno/computerator: Make shader show up in devcore/etc +- freedreno/ir3: Do not propagate away a widening move +- ir3: Fix binning pass driver UBOs +- ir3: Don't lower VS driver params to UBO +- freedreno/a6xx: Don't try resolve blits +- freedreno/a6xx: Stop exposing MSAA image load/store harder +- freedreno: Fix tile-per-pipe debug overrides +- freedreno/a6xx: Fix MSAA depth on a6xx gen1 +- freedreno/ci: Garbage collect some obsolete xfails +- util/primconvert: Avoid OoB with improbable draws +- freedreno/a6xx: Don't check dst coords +- freedreno/a6xx: Random whitespace fix + +Robert Mader (5): + +- egl: Stop requiring texture_3D_image for EGL 1.5 +- perfetto: Add Panfrost data sources to system.cfg +- egl: Add more errors cases during context creation +- egl: Stop requiring EXT_create_context_robustness for EGL 1.5 +- v3d: Support SAND128 base modifier + +Rohan Garg (23): + +- anv: use the WA infrastructure when emitting WA 16013994831 +- anv: program a custom byte stride on Xe2 for indirect draws +- anv,iris: prefix the argument format with XI for a upcoming refactor +- anv: refactor indirect draw support into it's own function +- anv: dispatch indirect draws with a count buffer through the XI hardware on ARL+ +- anv: migrate indirect mesh draws to indirect draws on ARL+ +- vtn: handle SpvOpExtInstWithForwardRefsKHR for non semantic instructions +- anv: enable KHR_shader_relaxed_extended_instruction +- anv: prefetch samplers when dispatching compute shaders +- iris,anv: simplify and inline sampler count calculations +- anv: enable the VK_KHR_shader_relaxed_extended_instruction feature +- hasvk: enable VK_KHR_shader_relaxed_extended_instruction +- intel/compiler: version can never be above 11 due to the previous check +- intel/compiler: use the correct cache enum for loads and stores +- hk: enable VK_KHR_shader_relaxed_extended_instruction +- nvk: enable VK_KHR_shader_relaxed_extended_instruction +- radv: enable VK_KHR_shader_relaxed_extended_instruction +- tu: enable VK_KHR_shader_relaxed_extended_instruction +- v3dv: enable VK_KHR_shader_relaxed_extended_instruction +- lvp: enable VK_KHR_shader_relaxed_extended_instruction +- docs: update feature matrix for VK_KHR_shader_relaxed_extended_instruction +- intel/brw: lower math op regions for Xe2+ +- anv: Xe2+ doesn't need the special flush for sparse + +Rohit Athavale (1): + +- d3d12: Fix typo in copyright comments + +Roland Scheidegger (1): + +- llvmpipe: Fix type mismatch when storing residency info + +Romaric Jodin (1): + +- perfetto: fix perfetto counter following perfetto v46.0 + +Roy Chan (3): + +- amd/vpelib: Optimize the CPU usage by caching all the LUT configs +- amd/vpelib: fix zero input handling +- amd/vpelib: Add documentation + +Ruijing Dong (8): + +- radeonsi/vcn: correct a typo in a variable +- radeonsi/vcn: add new function for obu_header +- frontends/va: check av1 enc hdr metadata +- radeonsi/vcn: input av1 hdr metadata +- frontends/va: rework VAConfigAttribEncPackedHeaders query +- radeonsi/vcn: add HDR metadata obu in av1enc +- frontends/va: reset roi number +- radeonsi/vcn: qp map IB package sent by default + +Russell Greene (2): + +- wsi/wayland/perfetto: supply presentation clock to perfetto +- perfetto: fix macos compile + +Sagar Ghuge (8): + +- intel/compiler: Ray query requires write-back register +- intel/compiler: Adjust trace ray control field on Xe2 +- intel/compiler: Fix indirect offset in GS input read for Xe2+ +- anv: Reduce clear color state alignment to 64B +- iris: Reduce clear color state alignment to 64B +- intel: uncached L1 to fix memory barrier issue in RT shader +- anv: Track all the descriptor sets +- intel: Switch to COMPUTE_WALKER_BODY + +Sai Teja (2): + +- ci: Don't run vulkan jobs for gbm changes +- ci: Disable angle jobs for GL changes + +Sai Teja Pottumuttu (2): + +- docs: Add required dependencies to meson.rst +- docs: Match README installation guide with meson.rst + +Sam Lantinga (1): + +- util: Fixed crash in HEVC encoding on 32-bit systems + +Samuel Pitoiset (233): + +- radv: stop re-initializing HTILE for non-compressed to compressed transitions +- amd/drm-shim: add missing GC_11_5_0 family +- amd/drm-shim: add GFX1150 support +- ci: add GFX1150 support to radv-fossils +- radv: fix handling view index from device index +- radv: cleanup robustness with vk_pipeline_robustness_state_fill() +- radv: add a helper to set shader stage key robustness info +- radv: add support for dynamic vertex input state with DGC +- radv: allow VK_EXT_vertex_input_dynamic_state with DGC +- radv/meta: simplify radv_meta_blit2d_normal_dst() +- radv/meta: rework creating blit pipelines +- radv/meta: simplify initializing DCC comp-to-single pipelines +- radv/meta: remove unused parameter from some blit init functions +- radv/meta: create blit2d layouts on-demand +- radv/meta: create itob layouts on-demand +- radv/meta: create btoi layouts on-demand +- radv/meta: create itoi layouts on-demand +- radv/meta: create cleari layouts on-demand +- radv/meta: simplify initializing bufimage pipelines +- radv/meta: create DGC prepare pipeline on-demand +- aco: fix bogus assert in RT prolog on GFX11+ +- radv: add a small helper to determine the index type with DGC +- radv: pass a dgc_cmdbuf parameter to more dgc helpers +- radv: refactor the DGC helpers to determine cmdbuf size +- radv: add a small comment describing the DGC implementation +- radv/winsys: pad gfx and compute IBs with only one NOP +- radv: pad GFX preambles IBs with only one NOP +- radv: use a sized NOP packet for the DGC preamble +- radv/amdgpu: fix CS padding for non-GFX/COMPUTE queues +- radv: remove redundant PA_SU_PRIM_FILTER_CNTL in the GFX preamble +- radv: stop emitting DB_RENDER_OVERRIDE in the GFX preamble +- radv: stop emitting PA_SC_CLIPRECT_RULE in the GFX preamble +- radv: stop emitting DB_COUNT_CONTROL in the GFX preamble +- vk/render_pass: add Mesa-specific flag for dynamic rendering +- radv: use the Mesa-specifc dynamic rendering flag for meta operations +- radv: add radv_shader_info::ps::uses_fbfetch_output +- radv: add support for input attachment indices with DRLR +- radv: fix DRLR with subpass input attachments and feedback loops +- radv/amdgpu: do not check that a CS is aligned if no padding is added +- radv/amdgpu: assert that the DGC IB VA is correctly aligned +- radv: optimize NOPs padding with DGC +- radv/ci: update flakes lists for NAVI21/VANGOGH +- radv/ci: enable RADV_PERFTEST=transfer_queue on GFX9+ +- radv: remove RADV_DEBUG=nogsfastlaunch2 +- radv: rework computing the DGC cmdbuf layout +- ac: add ac_gfx103_get_cu_mask_ps() +- ac,radeonsi,radv: add common GFX preambles +- radv: prepare for specialized DGC shaders +- radv: add a pointer to the DGC layout in dgc_cmdbuf +- radv: specialize dispatch DGC token +- radv: specialize draw DGC token +- radv: specialize index buffer DGC token +- radv: specialize pipeline DGC token +- radv: specialize VBO DGC token +- radv: simplify allocating push constants with DGC +- radv: specialize push constant DGC token +- radv: specialize indirect command layout stride for DGC +- radv: remove useless check about non-indexed draws and DGC +- radv: move emitting the compute pipeline with DGC +- radv: stop passing the upload offset to dgc_emit_bind_pipeline() +- radv: add a helper to store data to the DGC upload space +- radv: move emitting VBOs with DGC +- radv: specialize push constant stages with DGC +- radv: allow VK_EXT_legacy_vertex_attributes with DGC +- radv: fix emitting DGC indirect draws with drawid/base_instance +- radv: return early when the vertex input state is dynamic +- radv: only initialize per-attribute vertex input state when needed +- radv: add radv_vs_input_state::attrib_index_offset +- radv: bind the static vertex input state from the pipeline +- radv: rename radv_vs_input_state to radv_vertex_input_state +- radv: pass the vertex shader to radv_write_vertex_descriptors() +- radv: pass more VBO info as parameters to the DGC prepare shader +- radv: stop storing DGC info as part of the VBO descriptors +- radv: specialize dispatch initiator for compute/task shaders with DGC +- radv: store the indirect compute pipeline VA to dgc_cmdbuf +- radv: use compute pipeline metadata for non-indirect DGC pipelines +- radv: fix missing attribute offset with dynamic VS inputs and DGC +- radv: specialize one more VBO parameter with DGC +- radv: pass use_per_attribute_vb_descs in radv_dgc_params +- radv: add a helper to write a single vertex descriptor with DGC +- radv: add a helper to get VBO info for emitting descriptors +- radv: compute rsrc3 for VBO descriptors in NIR for DGC +- radv: use radv_vbo_info for emitting descriptors with DGC +- radv: emit all VBO descriptors in the prepare DGC shader +- radv: pass vb_desc_usage_mask in radv_dgc_params +- radv: stop relying on the VS for emitting VBO descriptors in DGC +- radv: fix allocating sparse descriptor buffers in the 32-bit addr space +- radv: disable shaders linking with ESO when nextStage of VS/TES isn't present +- Revert "radv: specialize push constant stages with DGC" +- vulkan: Update XML and headers to 1.3.294 +- vulkan: skip the disk cache when disableInternalCache is true +- radv: rework helpers for serialize/deserialize shaders +- radv: store the SHA1 RT stage to the shaders cache +- radv: store the number of RT stages per pipeline to the shaders cache +- radv: store whether a RT pipeline is a library to the shaders cache +- radv: make pipeline hashing functions non-static +- radv: disable the in-memory cache when disableInternalCache is true +- radv: add initial support for pipeline binaries +- radv: add support for capturing pipeline binaries +- radv: add support for importing pipeline binaries +- radv: advertise VK_KHR_pipeline_binary +- radv: rework emitting indirect compute pipelines with DGC +- radv: cleanup some functions that emit shader user SGPRs +- radv: store the indirect descriptor sets VA to the descriptor state +- radv: rename radv_emit_descriptor_pointers() to radv_emit_descriptors_per_stage() +- radv: unify emitting non-indirect/indirect descriptor sets +- radv: make the helper that uploads indirect descriptors non-static +- radv: stop allocating upload space for indirect descriptors with DGC +- radv: use radv_upload_indirect_descriptor_sets() in DGC +- radv: update PGM register for TES+GS compiled separately with ESO +- radv: precompute more PGM registers for all stages +- radv: use radv_get_user_sgpr_loc() more +- radv,radeonsi: remove remaining occurrences of TCS epilog +- aco,radv,radeonsi: move has_epilog to the fragment shader info +- radv: merge radv_emit_epilog() with radv_emit_ps_epilog_state() +- radv/rt: skip shaders cache for pipelines created with the capture/replay flag +- radv: fix copying the pipeline binary key +- radv: compute shader hash for shaders created without internal cache +- radv: enable more properties with VK_KHR_maintenance5 +- radv: fix lowering the view index to an input varying for FS +- radv,aco: fix legacy vertex attributes when offset >= stride on GFX6-7 +- radv: fix lowering VS inputs when offset >= stride on GFX6-7 +- radv: add missing cache flushes for DRLR feedback loops +- radv: update the stencil layout for DRLR feedback loops +- radv/ci: stop skipping few tests that hang only with LLVM +- radv: stop reporting VKCTS conformance on GFX11+ +- amd/addrlib: remove bogus assert in HwlComputeSlicePipeBankXor() +- radv/ci: stop skipping dEQP-VK.api.command_buffers.many_indirect_disps_on_secondary +- zink/ci: update expected list of failures since recent piglit uprev +- radv: stop emulating GS invocations for legacy GS on RDNA1-2 +- radv: only export KHR_video_maintenance1 with KHR_video_queue +- radv: fix assigning mesh shader outputs when clip/cull distances are read in FS +- radv: emit all shader related user SGPR states in one place +- radv: make sure to re-emit shader query state when a task shader is bound +- radv: use only one user SGPR for all NGG state +- radv: rename shader_query_state to task_state +- radv: introduce dirty flags for shaders state +- aco: fix descriptor leaking when printing assembly with CLRX +- radv: move emitting some RT user SGPRs when the RT pipeline is emitted +- radv: move updating compute scratch for RT when stack size is emitted +- radv: do not keep executable info when compiling shaders for ESO +- vulkan: Update XML and headers to 1.3.296 +- radv: simplify determining conformant products +- ac/surface: add RADEON_SURF_VIEW_3D_AS_2D_ARRAY for GFX9+ +- radv: implement 2D views of 3D images using 2D_ARRAY descriptors on GFX9+ +- radv: advertise sampler2DViewOf3D +- zink/ci: update expected list of failures for VANGOGH +- ci: uprev VKCTS to 1.3.9.2 +- radv/amdgpu: do not use a constant value for the IB size in dwords +- radv/amdgpu: add assertions to check the IB size +- radv: use base mip level from vk_image_view +- radv: stop passing redundant parameters to radv_image_view_make_descriptor() +- radv: remove redundant assertions about image views +- radv: stop passing image create flags to radv_image_view_init() +- zink/ci: update expected list of failures on NAVI31 +- radv: do not expose NV DGC extensions on GFX6-7 +- radv: remove GFX6-7 DGC support completely +- radv: update image view extent for non-compressed views earlier on GFX10+ +- radv: fix image view descriptors for samplers on GFX6-8 +- radv: remove RADV_MAX_DRM_DEVICES +- radv: squash radv_get_memory_fd() with radv_GetMemoryFdKHR() +- radv: remove RADV_THREAD_TRACE_TRIGGER completely +- zink/ci: update list of expected failures for NAVI10 +- vulkan: fix merging bind sparse submits together +- radv: fix conditional rendering with DGC preprocessing on compute +- radv: fix generating the global key for pipeline binaries +- radv: move radv_compact_spi_shader_col_format() to radv_cmd_buffer.c +- radv: add a helper to bind the color output state +- radv: add PKT3_INDIRECT_BUFFER_BYTES in the DGC path +- radv: implement IB chaining for DGC when it's executed on compute +- radv/amdgpu: simplify cs_execute_ib() +- radv/amdgpu: remove unused code about external IBs in the submit path +- radv: rename 'gfx' to 'main' in the DGC path +- radv: fix returning non-zero captured address without binding +- radv: use app names instead of exec name for shader based drirc workarounds +- radv: stop recomputing the viewport xform for guarband/viewport +- radv: optimize breaking batch when CB_TARGET_MASK change +- radv: use radv_normalize_blend_factor() more +- radv do not force-disable hierarchical stencil testing +- radv: fix emitting DB_RENDER_OVERRIDE on GFX12 +- radv: track more redundant DB related registers +- radv: regroup and emit all DS related states in the same function +- ci: uprev vkd3d-proton to 65b81403435576d882d9141ae3eb4a29373fba0e +- radv: fix enabling/disabling user sample locations +- ci: uprev vkd3d-proton to 59d6d4b5ed23766e69fe252408a3401d2fd52ce8 +- radv: fix initializing the HTILE buffer on transfer queue +- radv: do not use MRT counters for images created for db capture&replay +- radv: capture shader statistics when RGP is enabled +- radv: fix emitting NGG culling state for ESO +- radv: simplify determining if a graphics pipeline uses NGG culling +- radv: do no emit PA_SC_CONSERVATIVE_RASTERIZATION_CNTL in the preamble on GFX12 +- amd: do not emit PA_SU_PRIM_FILTER_CNTL in the common GFX preamble +- ci: uprev VKCTS to 1.3.10.0 +- amd/descriptors: set fmask_tile_swizzle for TC-compat CMASK images on GFX8 +- radv: set missing FMASK surface counters for MSAA MRTs +- radv: fix considering NGG culling for depth-only rendering +- radv: remove useless check about gl_Position as PS inputs for NGGC +- radv: simplify determining if dual-source blending is enabled +- Revert "radv: advertise sampler2DViewOf3D" +- Revert "radv: implement 2D views of 3D images using 2D_ARRAY descriptors on GFX9+" +- Revert "ac/surface: add RADEON_SURF_VIEW_3D_AS_2D_ARRAY for GFX9+" +- radv: implement VK_EXT_device_generated_commands +- radv: advertise VK_EXT_device_generated_commands on GFX8+ +- radv: track more redundant raster related registers +- radv: regroup and emit all raster related states in the same function +- ac/spm: do not abort when the SPM BO is too small +- radv: resize the SPM bo when it's too small +- radv: fix wrong index in radv_skip_graphics_pipeline_compile() +- ac/nir: cull triangles/lines when all W positions are zero/NaN +- radv: fix configuring the memory violation exception for the compute stage +- docs: add missing RADV_TRAP_HANDLER +- radv: add an option to configure the trap handler exceptions +- radv/ci: skip dEQP-VK.api.command_buffers.many_indirect_disps_on_secondary +- radv: add missing L2 non-coherent image case for mipmaps with DCC/HTILE on GFX11 +- ac: add ac_gpu_info::has_trap_handler_support +- radv: check for has_trap_handler_support instead of asserting +- radv: only emit the TBA/TMA registers on GFX8 +- radv: fix dumping the faulty shader detected by the trap handler on GFX9+ +- radv,aco: save SQ_WAVE_GPR_ALLOC from the trap handler +- radv: add a struct that describes the trap handler layout +- aco: fix reading registers from the trap handler shader +- aco: add support for the trap handler shader on GFX9-GFX10.3 +- radv: set missing shader info values for the trap handler +- radv,aco: dump all SGPRS from the trap handler +- aco: fix validation for VOP1 instructions without any dest/src +- aco: clear the current wave exception in the trap handler +- aco: add support for the trap handler shader on GFX11 +- aco: use inlined constant offsets for storing SGPRs in the trap handler +- radv: cleanup tools related resources when destroying logical device +- radv: save the trap handler report in the HOME directory +- radv: mark some GFX6-7 GPUs as Vulkan 1.3 conformant +- radv: fix ignoring src stage mask when dst stage mask is BOTTOM_OF_PIPE +- radv: add a new drirc option to disable DCC for mips and enable it for RDR2 + +Satadru Pramanik (1): + +- Update lp_bld_misc.cpp to support llvm-19+. + +Scott Moreau (2): + +- egl: small refactor to avoid code duplication +- egl: hook up swap interval bits when using zink on wayland + +Semenov Herman (Семенов Герман) (2): + +- radv: fix memleaks in radv_sqtt_reloc_graphics_shaders() +- radv: fix memleaks in radv_init_shader_upload_queue() + +Serdar Kocdemir (22): + +- Reduce heap pressure on getPacketContents +- Use append to chain memory pointer info +- Add VK_EXT_debug_utils to host modules for codegen +- Add VK_EXT_validation_features to host modules +- Add support for VK_EXT_robustness2 +- Add VK_EXT_metal_objects support +- Mark VK_MVK_moltenvk extension as supported on vulkan +- Add VK_EXT_debug_report extension support +- Remove direct usages of MoltenVK library functions +- Remove empty ifdef blocks after the codegen +- Show missing extension's name in the abort message +- Handle all extensions for structure size calculation +- Add extensions used in test applications +- Add error messages for unhandled extensions +- Add VK_EXT_external_memory_metal definitions +- gfxstream: Add VkPrivateDataSlot handle type +- gfxstream: Handle tmp folder explicitly on codegen +- gfxstream: Use KHR version of the line_rasterization extension +- Update decoder.py to use try_unbox on destroy calls +- gfxstream: Check metal extension for external memory +- gfxstream: Keep VK_EXT_line_rasterization for codegen +- gfxstream: Allow VK_KHR_line_rasterization + +Sergi Blanch Torne (20): + +- ci: disable Collabora's farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- ci: disable Collabora's farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- New DUT for Alder Lake +- New testing jobs anv-adl{,-full} +- New testing jobs anv-adl-angle{,-full} +- New testing jobs intel-adl-skqp +- New testing jobs zink-anv-adl{,-full} +- Uprev Piglit to e9ab30aeaed97b69868cf4d6d6a3f70f3b53c362 +- ci: Disable gk20a-gles +- ci: disable Collabora's farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- ci: disable Collabora's farm due to maintenance +- ci: shellcheck requires better 'boolean or' syntax +- ci: run shellcheck when changes on scripts to check +- Revert "ci: disable Collabora's farm due to maintenance" +- WIP: Re-enable Comet Lake +- ci: disable Collabora's farm due to maintenance +- Nightly full job for a630-gles-asan + +Sergio Lopez (2): + +- hk: limit the number of free BOs in each cmd pool +- hk: allow overriding sysmem with an env var + +Sergiu (1): + +- Tweaking YCBCR conversion to return correct VK Format + +Shih, Jude (1): + +- amd/vpelib: Update Plane Descriptor Writer + +Sid Pranjale (3): + +- glx: add a missing nullptr check +- vulkan/util: add vk_format_has_float_depth() +- nvk: implement VK_EXT_depth_clamp_zero_one + +Sil Vilerino (49): + +- Revert "d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported" This reverts commit d6bb4ddc638f3ee37fbbe066c631dad80aaeb2d3. Fixes: d6bb4ddc638 ("d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported") +- pipe: Add PIPE_BIND_VIDEO_DECODE_DPB/PIPE_BIND_VIDEO_ENCODE_DPB +- d3d12: Implement pipe_video_codec.create_dpb_buffer for AOT resources +- d3d12: Allow passing custom pipe_resource creation template/placed resource to d3d12_video_buffer_create_impl +- d3d12: Implement pipe_video_codec.create_dpb_buffer for texture array resources +- frontend/va: VaSyncSurface encoder check for surface feedback +- d3d12: Implement get_feedback_fence +- d3d12: Video Encode H264 to use direct DPB from frontend +- d3d12: Video Encode H264 - Support direct mmco operations +- d3d12: Video Encode HEVC to use direct DPB from frontend +- d3d12: Video Encode HEVC - Use VPS information from frontend, specifically for vps_max_dec_pic_buffering_minus1 +- d3d12: Rename d3d12_video_encoder_config_dirty_flag_sequence_info to d3d12_video_encoder_config_dirty_flag_sequence_header +- pipe: Add pipe_h264_enc_pic_control.temporal_id +- d3d12: H264, HEVC, AV1 Store multiple rate control states for multiple temporal layers +- vl/vl_win32_screen_create_from_d3d12_device: Allow winsys to be injected +- vl/null_sw_winsys: Add extern declaration for C++ usage +- d3d12: Remove usage of WindowFromDC when building without d3d12 graphics +- d3d12: Video Encode HEVC - Store L0/L1 active nums at time to encode them in the headers +- d3d12: Video Encode H264 - Store L0/L1 active nums at time to encode them in the headers +- d3d12: Video Encode H264 - Workaround for D3D12 validation bug requiring pRefPicMarkingOperationsCommands for IDR frames +- pipe: Add PIPE_H26X_MAX_REFERENCES +- d3d12: Remove max_references usage from frontend to reserve barriers array +- d3d12: Fix d3d12_video_encoder_get_current_max_dpb_capacity to report actual max slots number in DPB +- d3d12: Use d3d12_video_encoder_get_current_max_dpb_capacity in tex array pool instead of separate constant +- d3d12: Use pipe_h264_enc_seq_param.max_num_ref_frames instead of pipe_video_codec.max_references in SPS +- Fix definitions of PIPE_BIND_VIDEO_DECODE/ENCODE_DPB as they conflict with PIPE_BIND_SCANOUT and PIPE_BIND_SHARED +- d3d12: Fix setting of direct_8x8_inference_flag in the SPS +- d3d12: Plumb constrained_intra_pred_flag and transform_8x8_mode_flag from pipe_h264_enc_pic_control +- d3d12: Set HEVC GOP state before calling d3d12_video_encoder_negotiate_requested_features_and_d3d12_driver_caps +- d3d12: Support for on demand vps, sps, pps, aud headers requests from frontend +- pipe: Remove renew_headers_on_idr, insert_aud_nalu unused params +- d3d12: Add HEVC support for on demand AUD NALUS +- pipe: Add HEVC encode range extension caps, SPS and PPS params +- d3d12: Add support for HEVC 4:4:4 video encode using AYUV format +- d3d12: Add support for HEVC range extension header parameters +- ci: Bump DirectX-Headers and Agility SDK dependencies to 1.614.1 +- d3d12: Support create_dpb_buffer for non-opaque allocations when decoder/encoder supports it +- pipe: Add temporal_id to pipe_h26x_enc_dpb_entry +- d3d12: Support writing H264_SEI_SCALABILITY_INFO header on demand +- d3d12: Support writing H264 temporal scalability prefix slice NAL on demand +- d3d12: Report H264/HEVC support for 4 max temporal layers +- d3d12: H264 Encode - Set SPS.gaps_in_frame_num_value_allowed_flag=1 when num_temporal_layers > 1 +- d3d12: d3d12_video_encoder_references_manager_h264 to use FrameDecodingOrderNumber as h264Pic->slice.frame_num +- d3d12: Plumb H264/HEVC temporal_id from pipe params +- d3d12: Fix HEVC chroma_format_idc, separate_colour_plane_flag and conf_win_*_offset for HEVC 444 +- d3d12: Wrap d3d12_video_encode_get_hevc_codec_support around #if VIDEO_CODEC_H265ENC +- d3d12: fix incorrect memset in d3d12_video_encoder_references_manager_hevc::get_current_frame_picture_control_data +- d3d12: Fix video encode metadata snapshot at frame submission, must include changes done by DPB manager begin_frame method +- d3d12: Improve video encode intra-refresh logging + +Simon Perretta (6): + +- pvr: Add missing feature check in compute cmd stream +- pvr: Rescope msaa_mode +- pvr: Add missing include for FALLTHROUGH +- pvr: Inline xxhash header functions +- pvr: rename pds unused reg constant +- pvr: Make Vulkan driver depend only on nir headers, not library + +SoroushIMG (4): + +- pvr: fix intermediate size when configuring mrt setup +- pvr: cleanup 2101010 int format handling +- pvr: fix msaa setup for resolve TQs +- pvr: fix texture address offset when base level >0 + +Stéphane Cerveau (2): + +- anv: inline query for vulkan video encode +- anv: check that inline query pool is VK_NULL_HANDLE + +Surafel Assefa (1): + +- vulkan: MESA_VK_ENABLE_SUBMIT_THREAD=0 disables threaded submit + +Sushma Venkatesh Reddy (4): + +- intel/clflush: Utilize clflushopt in intel_invalidate_range +- intel/brw: Fix DEBUG_OPTIMIZER +- intel/brw: Move get_nir_def() higher to avoid UNDEF +- intel/brw: Handle 16-bit sampler return payloads + +Sviatoslav Peleshko (14): + +- anv: Assert ray query BO actually exists +- brw,elk: Fix opening flags on dumping shader binaries +- anv: Release correct BO in anv_cmd_buffer_set_ray_query_buffer +- anv: Add full subgroups WA for the shaders with barriers in Breaking Limit +- brw: Fix mov cmod propagation when there's int signedness mismatch +- mesa: Reset vbo attributes after flushing them to Current in glPopAttrib +- anv: Update XeSS workaround executable names for Satisfactory 1.0 +- intel/brw: Don't apply discard_if condition opt if it can change results +- intel/brw/gfx9: Implement WaClearArfDependenciesBeforeEot +- intel/elk/chv: Implement WaClearArfDependenciesBeforeEot +- intel/brw: List all instructions that have BranchCtrl bit +- intel/brw_asm: Add BranchCtrl support +- intel/elk: List all instructions that have BranchCtrl bit +- intel/elk_asm: Add BranchCtrl support + +Tapani Pälli (34): + +- intel/dev: update mesa_defs.json from workaround database +- blorp: support new flag for setting cps dependency +- anv: fix a cmd_buffer reference in simple shader +- anv: refactor cmd_buffer_flush_gfx_runtime_state for dirty state +- anv: implement workaround for Wa_18038825448 +- intel/dev: update mesa_defs.json from workaround database +- gbm: depend on libdrm indepedent of dri2 setting +- anv: move setting 3DSTATE_CLIP::MaximumVPIndex from loop +- anv: check if RT writes are happening for HasWriteableRT +- anv: set correct miplevel for anv_image_hiz_op +- anv: change existing ICL workaround to depend on BLEND_STATE +- iris: use correct enum for aux state on depth fast clear +- mesa: check for ARB_ES3_compatibility in format checks +- anv: initialize pixel struct to zero when setting clear color +- iris: initialize pixel struct to zero when setting clear color +- glsl: set a version to standalone compiler context +- iris: fix issues with memory object updates via glBufferSubData +- blorp: assert that color depth is not 96 for Wa_16021021469 +- anv: force StackIDControl value for Wa_14021821874 +- drirc/anv: force_vk_vendor=-1 for Faaast Penguin +- anv: use mi_builder in CmdBeginTransformFeedbackEXT +- intel/genxml: introduce L3 Fabric Flush for gfx12 +- intel/ds: add L3 fabric flush support +- anv: add plumbing/support for L3 fabric flush +- iris: add plumbing/support for L3 fabric flush +- iris: add depth, DC and L3 fabric flush for aux map invalidation +- anv: add depth, DC and L3 fabric flush for aux map invalidation +- drirc/anv: force_vk_vendor=-1 for Silent Hill 2 +- intel/dev: update mesa_defs.json from workaround database +- anv: set StackIDControlOverride_RTGlobals for 2 workarounds +- mesa: fix DXT1 support with EXT_texture_compression_dxt1 +- iris: implement VF_STATISTICS emit for Wa_16012775297 +- anv: implement VF_STATISTICS emit for Wa_16012775297 +- anv/iris: add note about Wa_18039438632 for RT flush after SBA + +Tatsuyuki Ishi (2): + +- util/u_queue: Replace relative time wait hack with u_cnd_monotonic +- radv: Enable descriptorBufferCaptureReplay. + +Thomas Debesse (1): + +- meson: fix linking gallium with xcb-xfixes when using x11 + +Thomas H.P. Andersen (8): + +- nvk: EXT_post_depth_coverage +- nak: special case PhiDsts as not uniform +- docs: update for new extensions in nvk +- nvk: do not overwrite test cases +- dirconf: add a common vk_zero_vram +- nvk: handle driconf for zeroing vram +- driconf: set vk_zero_vram driconf for Path of Exile +- driconf: set vk_zero_vram driconf for X4 Foundations + +Thomas Wagner (4): + +- llvmpipe: Fix external memory object reference +- llvmpipe: Unmap DMA buffer to release all resources +- llvmpipe: properly save llvmpipe_memory_allocation in memory_object +- llvmpipe: Remove unused struct member + +Thong Thai (1): + +- ci: partially emulate cdna devices using lower image opcodes + +Timothy Arceri (48): + +- nir: create validate_tex_src_texture_deref() helper +- nir: add nir_tex_src_{sampler,texture}_deref_intrinsic +- glsl: make use of new tex src deref intrinsic +- nir: set disallow_undef_to_nan for legacy ARB asm programs +- glsl: fix glsl to nir support for lower precision builtins +- glsl: always copy bindless sampler packing constructors to a temp +- nir: make static assert more flexible +- nir/glsl: set deref cast mode during function inlining +- glsl: drop double support checks in helper +- glsl: store has implicit conversions bools in gl_shader +- glsl: apply implicit matching rules when linking +- glsl: fix function overload resolution at link time +- ci: bump piglit version +- nir/tests: make add_loop_terminators more flexible +- nir/tests: add basic terminator merge test +- ci: disable gimark trace +- mesa/glsl: add nir fields to the gl shader structs +- glsl: add gl_nir_validate_intrastage_interface_blocks() +- glsl: make gl_nir_validate_intrastage_arrays() more flexible +- glsl: add support for converting global instructions to NIR +- util/hash_table: add _mesa_string_hash_table_create() helper +- nir: add max_ifc_array_access field to vars +- nir: add nir_function_impl_clone_remap_globals() +- glsl: move _mesa_glsl_can_implicitly_convert() to linker_util.cpp +- nir: add function in/outs to variable modes +- nir: serialize nir_parameter type +- nir: store variable mode in nir_parameter +- nir: add implicit_conversion_prohibited field to nir_parameter +- glsl: add gl_nir_link_function_calls() +- glsl: add gl_nir_linker_size_arrays() +- nir: add nir_intrinsic_deref_implicit_array_length intrinsic +- nir: handle wildcard array deref +- glsl: add intrastage shader linking helpers for nir linker +- glsl: add lower_derivatives_without_layout() helper +- glsl: set blake3 hash in standalone scaffolding +- glsl: switch to a full nir based linker +- glsl: remove now unused linker code +- glsl: fix gl_{Clip,Cull}Distance error messages +- glsl/mesa: move uniform related shader structs to shader_types.h +- glsl: move resource_name_updated() to linker_util.cpp +- glsl/mesa: remove ir_uniform.h +- glsl: move error and warning helpers to util file +- glsl: move shader cache lookup call to st +- glsl: rename link_shaders() -> link_shaders_init() +- glsl: remove linker.cpp +- nir/glsl: set cast mode for image during function inlining +- nir/glsl: set deref cast mode for blocks during function inlining +- glsl/nir: fix function cloning at link time + +Timur Kristóf (22): + +- radv: Add gs/hs_inputs_read field for linked LS and ES. +- ac/nir/tess: Add tcs_inputs_read to LS output lowering. +- ac/nir/tess: Don't emit VS outputs that aren't read by TCS. +- ac/nir/esgs: Add gs_inputs_read to ES output lowering. +- ac/nir/esgs: Don't emit ES outputs that aren't read by GS. +- ac/nir: Add helper for I/O location mapping. +- ac/nir/tess: Map linked LS/HS I/O based on TCS input mask. +- ac/nir/esgs: Map linked ES/GS I/O based on GS input mask. +- ac/nir: Remove ac_nir_calc_io_offset function. +- radv: Stop assigning linked driver locations. +- nir: Remove unused nir_assign_linked_io_var_locations. +- ac/nir/tess: Simplify calculation of HS output LDS offset. +- ac/nir: Shorten the name of ac_nir_calc_io_offset_mapped. +- ac/nir/tess: Remove no_inputs_in_lds. +- ac/nir: Move varying cost functions from radeonsi to common code. +- radv: Add NIR XFB IO intrinsic info to legacy stages not just NGG. +- radv: Add new linking step and use nir_opt_varyings. +- radv: Determine linked I/O info after nir_opt_varyings. +- nir/opt_algebraic: Add various bitfield extract patterns. +- radv: Flush L2 cache for non-L2-coherent images in EndCommandBuffer. +- radv: Increase NGG culling PS param limit to 12 on GFX10. +- radv: Enable NGG culling by default on GFX10. + +Tom Cherry (1): + +- Do not pass std::shared_ptr when not transfering ownership + +Tomeu Vizoso (2): + +- etnaviv/ml: Fix includes +- etnaviv/nn: Fix use of etna_core_info + +Tomson Chang (2): + +- amd/vpelib: Fix color adjustment change detect hole +- amd/vpelib: Only update cached adjustment when it is valid + +Trigger Huang (1): + +- mesa: Fix AMD performance monitor implementation + +Utku Iseri (1): + +- st/mesa: clamp reported max lod bias + +Valentine Burley (72): + +- tu/rmv: Fix missing image bind logging for WSI images +- tu: Implement VK_EXT_device_address_binding_report +- tu/kgsl: Remove unused variable +- docs/features: Update features.txt for Turnip +- lavapipe: Block more formats from getting blit feature flags +- vulkan/format: Translate two 420_UNORM formats properly +- tu, fdl: Drop custom mapping for 1-plane YCbCr formats +- tu: Drop tu_vk_format_to_pipe_format helper function +- vulkan/wsi: Refactor can_present_on_device +- tu: Always report that we can present on kgsl +- freedreno/ci: Update expectations for Adreno 660 +- freedreno/ci: Re-enable Adreno 660 for Vulkan pre-merge jobs +- android: Extract version from llvm-project instead of hardcoding it +- tu: Remove redundant bits override from lower_tex_ycbcr +- ci: Backport two fixes tor the Vulkan CTS +- freedreno/ci: Update expectations +- freedreno/ci: Skip tests that might time out during pre-merge jobs +- freedreno/ci: Move and document last remaining failures +- freedreno/ci: Re-enable Adreno 630 for Vulkan nightly jobs +- tu: Expose VK_KHR_workgroup_memory_explicit_layout +- freedreno/ci: Wire up skips for a630_vk_full +- nvk: Advertise VK_EXT_legacy_vertex_attributes +- llvmpipe: Only use udmabuf with libdrm +- vulkan, radv: Add new common vk_format_get_plane_width/height helpers +- tu: Use vk_format_get_plane_width/height to get the plane dimensions +- tu: Use vk_format_get_plane_count for tu6_plane_count +- util: Remove Vulkan-only formats from get_plane_width/height +- tu: Fix VK_EXT_extended_dynamic_state3 feature +- tu: Simplify VK_EXT_sample_locations SampleCounts assignment +- freedreno/ci: Uprev kernel for a630 runner +- freedreno/ci: Skip timing out test on a630 +- freedreno/ci: Uprev kernel to 6.11 +- freedreno/ci: Update expectations after Piglit uprev +- turnip/ci: Remove fixed test from a660 xfails +- freedreno/ci: Document some a630 EGL flakes +- ci: Add a new build-only job without libdrm +- freedreno/devices: Fix A740v3 from Quest 3 +- freedreno/devices: Unify magic_regs for A740 and A32 +- freedreno/devices: Document an alias for FD644 +- egl: Fix missing libdrm check +- ci: Build zink and llvmpipe in debian-no-libdrm +- freedreno/ci: Document new flake +- tu: Streamline setting YCbCr feature bits +- tu: Be more consistent with using vk_format helpers +- tu: Use vk_format_get_plane_count instead of special casing +- tu: Fix conditional check in tu_image_view_init +- tu: Use vk_format_get_plane_format +- tu: Use existing helpers in tu_clear_blit +- tu: Introduce tu_aspects_to_plane helper function +- tu: Unify zeroing bufferFeatures +- tu: Set some feature bits earlier +- tu: Use v2 format feature flags +- tu: Update comments about the blob driver +- tu: Add an assert to tu6_plane_index +- util/format: Add new 10-bit P010 RGB/planar formats +- util/format: Add new 12-bit P012 RGB/planar formats +- lavapipe: Remap 10 and 12 bit formats to 16 bit formats +- nvk: Remap 10 and 12 bit formats to 16 bit formats +- hk: Remap 10 and 12 bit formats to 16 bit formats +- vulkan: Fix incorrect bpcs value for padded formats +- tu: Fix incorrect bpcs value for padded formats +- vulkan/format: Properly translate 10-bit and 12-bit formats +- vulkan/format: Update vk_format_from_pipe_format with additional formats +- ci: Add global ANGLE skips for its waiver +- freedreno/devices: Inline a690 quirk +- freedreno/devices: Document common name for a635 speedbins +- freedreno/devices: Enable 64-bit atomics on a735 and a740v3 +- freedreno/devices: Establish a7xx sub-generations +- ir3: Don't lower to LCSSA before calling nir_divergence_analysis() +- intel/ci: Fix Alder Lake's configuration +- anv/ci: Remove additive blending fails on ADL +- lavapipe: Block YCbCr formats from getting blit feature flags + +Vignesh Raman (19): + +- ci/deqp-runner: unset CC for arm32 cross-compilation +- ci: uprev deqp-runner from 0.18.0 to 0.20.0 +- ci: enable Mali V10 testing +- ci: use v6.11-rc5 kernel for Mali V10 testing +- ci: move mtl-fw.json to .gitlab-ci directory +- ci/deqp-runner: pass EXTRA_CARGO_ARGS +- ci: include exit code in test result output +- ci/lava: handle non-zero exit codes +- ci/lava: update unit tests +- ci/bare-metal: handle non-zero exit codes +- ci: add Gen10 Arm Mali firmware to rootfs +- ci/image-tags: add DEBIAN_PYUTILS_TAG +- ci/lava: set exit code in exception case +- ci: prepare-artifacts-python: copy if src dir exist +- ci: build KERNEL_IMAGE_BASE in before_script +- ci: move rules from rules-anchors to workflow +- ci/crosvm: Use default value for CROSVM_GALLIUM_DRIVER +- ci: rename FORCE_KERNEL_TAG to EXTERNAL_KERNEL_TAG +- Revert "ci: disable Collabora's farm due to maintenance" + +Visan, Tiberiu (2): + +- amd/vpelib: set the same range for clr adj +- amd/vpelib: Remove TODO comments and legacy check(#421) + +Vitaliy Triang3l Kuzmin (2): + +- r600: Add FMT_1_REVERSED to r600_formats.h +- r600: Add missing formats to r600_colorformat_endian_swap + +Vlad Schiller (3): + +- pvr: Handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO +- pvr: Handle VK_STRUCTURE_TYPE_IMAGE_FORMAT_LIST_CREATE_INFO +- pvr: Minor code refactor + +Vldly (1): + +- freedreno: Fix resource tracking on repeated map with discard + +Víctor Manuel Jáquez Leal (1): + +- frontends/va: Don't return P010/P016 as surface formats when encoding + +WANG Xuerui (2): + +- meson: Force use of LLVM ORCJIT for hosts without MCJIT support +- meson: Additionally probe -mtls-dialect=desc for TLSDESC support + +Wang, Kyle (1): + +- amd/vpelib: Add missing copyrights + +X512 (2): + +- egl/haiku: fix double free of BBitmap +- egl/haiku: fix synchronization problems, add missing header + +Yahan Zhou (30): + +- Update vk.xml and code generator +- Fix testAndroidBaselineProfile2021Support for host swiftshader +- Add support for descriptor with inline uniform block +- Implement vkQueueSubmit2 in the same way as vkQueueSubmit +- Add some vk 1.3 extension strings +- Create sync device before calling goldfish_sync_queue_work +- Parse more ext structs in vkAllocateMemory and vkCreateBuffer +- Disable compatible handle types for semaphore timeline +- Fix dEQP-VK.wsi.android.maintenance1 +- VkImageCreateInfo should ignore queeu family indices in some situation +- Use old en/decoding for vkQueueSignalReleaseImage +- Add VK_EXT_vertex_attribute_divisor support for guest ANGLE +- Declare AHB blob unsupported in Vulkan +- Add VK_EXT_fragment_density_map for Chrome +- Add snasphot support for vk coherent memory +- Snapshot vk image content in common situation +- Better track color buffer usage in Vulkan +- Make it not crash during vk snapshot load +- Fix corner cases where vk dependency graph breaks +- Handle dependency by VkMemoryDedicatedAllocateInfo +- Snapshot VkBuffer +- Add snapshot for command buffer +- Fix handle create mismatch for Vulkan snapshot +- Add snapshot support for vkBindImageMemory2 +- Fix Vulkan snapshot with -guest-angle +- Revert^2 "Add snapshot support for vkBindImageMemory2" +- [Vulkan Snapshot] tryLock when an API might create extra handles +- Signal the fence after color buffer copying +- Revert "Signal the fence after color buffer copying" +- [Vulkan Snapshot] Add dependency graphics pipeline -> render pass + +Yao Zi (1): + +- panvk: Link with --build-id explicitly + +YaoBing Xiao (1): + +- vulkan/x11: use xcb_connection_has_error to check for failue + +Yinjie Yao (2): + +- radeonsi/vcn: Rename transform_skip_disabled and remove hardcoded value for VCN5 +- radeonsi/vcn: Add support for QVBR rate control mode for VCN5 + +Yiwei Zhang (8): + +- venus: only request ring thread prio for TLS ring +- venus: clarify wsi image ownership +- anv: improve vma usage for descriptor buffer +- ci: re-enable venus ci +- venus: fix a race condition between gem close and gem handle tracking +- Revert "meson: disallow Venus debug + LTO build via GCC" +- venus: workaround cacheline overflush issue on Intel JSL +- venus: avoid over-caching sfb cmds + +Yunhyeok "Yune" Choi (1): + +- glx: Getting rid of the double assignment in \`__glXWireToEvent`. + +Zack Rusin (1): + +- svga: Redo the way generated files are handled + +Zan Dobersek (11): + +- freedreno: fix a7xx perfcntr countables +- ir3: some 8-bit subgroup intrinsics must execute as 16-bit instructions +- nir/lower_vars_to_scratch: calculate threshold-limited variable size separately +- ir3: lower relevant 8-bit ALU ops in nir_lower_bit_size +- ir3: use fully-functional dp4acc when available +- ir3: indicate possible dword straddle for any multi-component pvtmem access +- ir3: improve validation, display for ldp instructions +- tu: enable shaderInt8 support +- tu: use instance indices in RD dump filenames +- freedreno: sanitize device names for config name usage +- zink: fix bo_export caching + +Zhang He (1): + +- iris, crocus: fix a typo and break comment line correctly + +antonino (10): + +- panfrost: rename \`PAN_GPU_INDIRECTS` +- panfrost: refactor draw logic to support native draw indirect +- panfrost: implement indirect draw for CSF-based GPUs +- pan/cs: add helpers to emit contiguous csf code blocks +- pan/cs: add helpers to generate csf loop +- panfrost: implement multi draw indirect for CSF-based GPUs +- panfrost: advertise \`PIPE_CAP_MULTI_DRAW_INDIRECT` +- mesa/main: expose \`EXT_multi_draw_indirect` +- pan/cs: add support for nop instruction +- pan/genxml: fix SET_EXCEPTION_HANDLER instruction definition + +bbhtt (1): + +- pipe_loader_drm: Fix virtgpu_drm header path + +cheyang (1): + +- Android15 support gralloc IMapper5 + +chyyran (1): + +- spirv_to_dxil: add missing SPIR-V capabilities + +dawnhan (1): + +- vulkan/android: Add missing AHB formats + +itycodes (1): + +- intel: Fix a typo in intel_device_info.c:has_get_tiling + +llyyr (1): + +- vulkan/wsi/wayland: fix suboptimal flag being ignored with explicit sync + +nyanmisaka (4): + +- intel/pci_ids: Update ADL device names +- intel/pci_ids: Update RPL device names +- intel/pci_ids: Update TGL device names +- intel/pci_ids: Update DG1 device names + +quic_lkondred (1): + +- freedreno: Add support for Adreno 663 GPU + +sergiuferentz (2): + +- Adding support for VK_EXT_external_memory_host extension +- gfxstream: VulkanBatchedDescriptorSetUpdate toggled on caps on Guest + +yinjiyao (2): + +- frontends/va: check hevc enc hdr sei +- radeonsi/vcn: add HDR sei in hevc enc