AlexIndustrial/mesa

Author	SHA1	Message	Date
Timur Kristóf	1d8f46e00c	ac/nir/ngg: Carve out ac_nir_create_output_phis. We're going to want to call it from a different file too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	4bec453595	ac/nir/ngg: Use ac_nir_ngg_alloc_vertices_and_primitives in mesh shader lowering. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	02dacac869	ac/nir/ngg: Carve out ac_nir_ngg_alloc_vertices_and_primitives. We're going to want to call it from a different file too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	a3d8e6a60c	ac/nir/ngg: Remove unused vs_output struct. Forgot to remove this when I refactored the code to use ac_nir_prerast_out instead. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	23641d4032	ac/nir/ngg: Add a few comments explaining some variables. These were somewhat confusing, so let's add a few words to explain what they are exactly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	28f49bf99c	ac/nir/ngg: Remove some superfluous variables from culling code. No functional changes, just code cleanup. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	7a3d95bbe2	ac/nir/ngg: Mitigate attribute ring wait bug when primitive ID is per-primitive. There is a possibility that some waves in an NGG workgroup don't have any input vertices, only primitives. When these waves store the primitive ID as a per-primitive attribute, they will need to wait for those stores before the primitive export, because the other waves can't wait for them. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	397d45d055	ac/nir/ngg: Mitigate NGG fully culled bug when GS output is compile-time zero. This case is unlikely but possible. We forgot to handle it here, because it was originally handled by the backend compiler. On GFX10 chips that have issues with 0 vertices and primitives exported, this will always export at least 1 vertex and primitive. This could theoretically fix some hangs on Navi 10, although we are not aware of a specific issue caused by this problem. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Samuel Pitoiset	5ac72c5f56	ac/descriptors: allow to configure DCC for buffer descriptors This is not used yet. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33284>	2025-01-30 08:18:22 +00:00
Samuel Pitoiset	fe6494559d	ac,radeonsi: add SDMA DCC tiling for GFX12+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33284>	2025-01-30 08:18:22 +00:00
Marek Olšák	d0e1c508c6	ac/fake_hw_db: deobfuscate GPU name strings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:20:02 +00:00
Marek Olšák	43588be435	radeonsi: remove an incorrectly defined modifier It's missing the PACKERS field to distinguish between different layouts and it's a useless swizzle mode anyway. Fixes: `0833dd7d12` - amd/common: Add support for modifiers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9344 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:20:01 +00:00
Marek Olšák	71e95b373b	radeonsi: remove si_shader_info code that is no longer needed A lot of this info is now derived from shader variant NIR. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:19:56 +00:00
Marek Olšák	8e8eda4089	radeonsi: fix PS prolog not counting used fragcoord VGPRs correctly Using the used component count is not enough. We need to consider the component mask because any component can be disabled. This might fix tests. This removes the component counting from ac_get_fs_input_vgpr_cnt and determines the component mask where it's needed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:19:40 +00:00
Marek Olšák	b3fc49686e	ac/nir: clamp vertex color outputs in the right place Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:19:38 +00:00
Samuel Pitoiset	48d199f3dc	ac/gpu_info: add gfx12_supports_display_dcc Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33212>	2025-01-27 08:08:15 +00:00
Marek Olšák	f7e3689fe1	ac/nir: lower sample_pos to load_sample_positions_amd when frag_coord is center Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	0a228dfef2	ac/nir: compute ddx/ddy for barycentric_at_offset at the beginning of shaders to make it work after terminate (discard). Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	eddd063420	ac/nir: cosmetic stuff for ac_nir_lower_ps Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	247c0593eb	ac/nir: eliminate sample_mask_in without MSAA in ac_nir_lower_ps_early Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	e57b52ff6c	ac/nir: optimize frag_coord <-> pixel_coord in ac_nir_lower_ps_early Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	33134f9503	ac/nir: optimize barycentric_at_sample(sample_id) in ac_lower_ps_early Replace it with barycentric_sample. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	43f6b2655e	ac/nir: simplify force_*_center_interp options in ac_nir_lower_ps_early This only indicates whether MSAA is disabled. Having a separate option for each sysval is better for the PS prolog, but not for monolithic compilation. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	3bccfa72cc	ac/nir: simplify force_*_sample_interp options in ac_nir_lower_ps_early The only thing we need here is whether sample shading is enabled and how many samples. Having a separate option for each sysval is better for the PS prolog, but not for monolithic compilation. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	d7d4d56f5b	ac,aco,radeonsi: replace SampleMaskIn with 1 << SampleID if full sample shading Since the sample mask is always 1 << sample_id with full sample shading, just use that instead of loading sample_mask_in. Set it to 0 if it's a helper invocation. This removes the sample mask input VGPR. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	1026402b7c	ac/nir: clamp vertex color outputs in the right place Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	4f63b21df0	ac/nir: drop 16x EQAA support from ac_get_ps_iter_mask We don't support 16x EQAA anymore. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	51bbe2606d	ac/nir: switch passes to use nir_shader_intrinsics_pass Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	d31b3513da	ac/nir: handle FRAG_RESULT_COLOR with dual src blending in ac_nir_lower_ps_early If FRAG_RESULT_COLOR with dual_src_blend_kndex=1 is present, FRAG_RESULT_COLOR really means FRAG_RESULT_DATA0. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	38d3fc7a6a	ac/nir: return progress from ac_nir_lower_ps_late This changes the creation of barycentric coordinate variables to on-demand. There is also some reordering of export code to return progress. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	231009be65	ac/nir: return progress from ac_nir_lower_ps_early This changes the creation of barycentric coordinate variables to on-demand. Everything else is ready to return progress. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	234b416ffb	ac/nir: lower fbfetch_output in ac_nir_lower_ps_early so that we can gather shader_info after this and all system values that this adds will be gathered. shader_info won't be gathered after si_nir_lower_abi, which is why we have to lower fbfetch_output here. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	1570387aaa	ac/nir: lower barycentric_at_offset/sample in ac_nir_lower_ps_early i.e. before future linking optimizations and shader_info gathering They are lowered together because one depends on the other. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	b9b1f3a047	ac/nir: lower sample_pos in ac_nir_lower_ps_early i.e. before future linking optimizations and shader_info gathering The ac_nir_lower_ps_early call is also added for non-monolithic shaders because it's required now. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	580304350b	ac/nir: optimize front_face in ac_nir_lower_ps_early i.e. before future linking optimizations and shader_info gathering Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Saroj Kumar	433004dcff	ac/surface: fix missing NULL check in gfx12_select_swizle_mode() Add null check for surf pointer. Fixes segfault issue during start of gdm on gfx12. Signed-off-by: Saroj Kumar <saroj.kumar@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33199>	2025-01-24 21:51:50 +00:00
David Rosca	604fb68876	radeonsi: Report surface alignment for AV1 encode This is also useful for AV1 to allow applications to use container crop. Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33083>	2025-01-24 13:04:47 +00:00
Samuel Pitoiset	c039caeb53	ac/cmdbuf: program SPI_SHADER_GS_MESHLET_CTRL to 0 in the GFX12 preamble Otherwise, it's derivative group quads by default. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33185>	2025-01-24 11:08:59 +00:00
Pierre-Eric Pelloux-Prayer	6cee989915	amd: add ac_drm_device_get_cookie This returns the underlying device pointer but as an opaque uintptr_t. This will be required because libdrm_amdgpu will return the same device when called multiple times from the same process. radeonsi relies on the pointer value to identify if the device are the same and adjust the synchronisation logic based on that. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33081>	2025-01-22 14:55:56 +00:00
Samuel Pitoiset	efab1885b7	ac/sqtt: update programming SQTT on GFX12 This is pure guess but I think GFX12 now uses 48-bits VAs for configuring the SQTT buffer. This isn't yet enough to generate a capture because it's missing some info I don't know, but it's a start. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33068>	2025-01-20 23:50:10 +00:00
Ivan Avdeev	14e3231b56	radv: add a flag to indicate ray tracing support Determine whether the device has hardware raytracing support early, and then use this result where needed, instead of checking for `gfx_level` every time. This is a prerequisite for CYAN_SKILLFISH chip enablement. This chip is still GFX10, not GFX10_3, but has hardware support for accelerated `image_bvh{,64}_intersect_ray` instructions. Just checking for `gfx_level` is insufficient for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33109>	2025-01-20 08:27:11 +00:00
Samuel Pitoiset	c84b1dda0b	ac/nir: fix skipping streamout when no buffers are bound on GFX12 RadeonSI compiles shader variants with streamout disabled but RADV doesn't do that. The alternative solution is to set the streamout buffer size to 0 to indicate that streamout isn't bound. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33058>	2025-01-17 11:24:55 +00:00
David Rosca	ccb450b91c	radeonsi/uvd: Set decode target swizzle mode on GFX9 Acked-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32829>	2025-01-17 08:53:05 +00:00
Sonny Jiang	5b2de9e593	radeonsi/vcn: Add vcn_5_0_1 support Add support for AMD vcn_5_0_1 Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33004>	2025-01-16 23:41:28 +00:00
Marek Olšák	ff6e3e9f76	nir: add next_stage param to nir_slot_is_varying & nir_remove_sysval_output The result of nir_slot_is_varying depends on what the next shader stage is, and nir_remove_sysval_output uses it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32855>	2025-01-16 16:28:15 +00:00
Pierre-Eric Pelloux-Prayer	6c0e4a0ece	ac/virtio: add virtio-only AMDGPU_GEM_CREATE flag On the host side virglrenderer creates dmabuf on demand when: * cpu mapping is requested * setting up scan out * sharing buffers between guest processes On-demand dmabuf creation only works if the ctx that created the BO still exists and knows about this BO. This assumption works ok for the first 2 cases, but can break with the last one (and it does cause issues on Android). eg: * process A allocates BO and exports it as a guest dmabuf * process A closes its handle to the BO (-> detach_resource) * process B imports the guest dmabuf -> this triggers the attach_resource function in virglrenderer. If the given resource isn't a VIRGL_RESOURCE_FD_DMABUF it'll try to get one... But for this to work, process A needs to be used -> this fails because this resource was detached from it. The reason we create dmabuf on demand is to avoid hitting the number of open file descriptor limit. So to cover the 3rd case, we'll use the VIRTGPU_BLOB_FLAG_USE_SHAREABLE flag, but try to limit to as few possible buffers as possible. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:33 +00:00
Pierre-Eric Pelloux-Prayer	2269ea7e2f	ac/virtio: disable timeline syncobj support Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	dc83195175	ac/virtio: disable userptr and local buffers They're not supported yet so let's not pretend they are. In particular use_local_buffers can cause VM_ALWAYS_VALID to be used, which then prevents the creation of a dmabuf. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	22263616ed	amd: amdgpu-virtio implementation Native context support is implemented by diverting the libdrm_amdgpu functions into new functions that use virtio-gpu. VA allocations are done directly in the guest, using newly exposed libdrm_amdgpu helpers (retrieved through dlopen/dlsym). Guest <-> Host roundtrips can be expensive so we try to avoid them as much as possible. When possible we also don't wait for the host reply in case where it's not needed to get correct result. Implicit sync works because virtio-gpu commands are submitted in order to the host (there a single queue per device, shared by all the guest processes). virtio-gpu also only supports one context per file description (but multiple file descriptions per process) while amdgpu only allows one fd per process, but multiple contexts per fd. This causes synchronization problems, because virtio-gpu drops all sync primitive if they belong to the same fd/context/ring: ie the amdgpu_ctx can't be expressed in virtio-gpu terms. For now the solution is to only allocate a single amdgpu_ctx per application. Contrary to radeonsi/radv, amdgpu_virtio can use libdrm_amdgpu directly: the ones that don't rely on ioctl() are safe to use here. Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	a565f2994f	amd: move all uses of libdrm_amdgpu to ac_linux_drm This is required to implement virtio native-context. In a virtualized environment, most of the functions provided by libdrm_amdgpu will be implemented using virtio. This allows to implement efficient virtualization, by forwarding the kernel API to the host, instead of the GL/VK calls. Similarly, the raw 'fd' or 'gem_handle' arguments are replaced by opaque types. This allows to encapsulate all the needed state in the handle, and use unmodified API between baremetal and virtualized contexts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00

1 2 3 4 5 ...

3032 Commits