AlexIndustrial/mesa

Author	SHA1	Message	Date
Bas Nieuwenhuizen	76daa30e4a	radv: Use correct flush bits for flushing L2 during CB/DB flushes. Copied from radeonsi. Putting in the correct metadata flush commands for eventually not flushing L2 on CB/DB switch. Does not remove the need for V_028A90_CACHE_FLUSH_AND_INV_TS_EVENT at the moment. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-04 19:35:36 +01:00
Bas Nieuwenhuizen	1c78e4f053	radv: Allow writing 0 scissors. When rasterization is disabled we can have that few. Fixes: `76603aa90b` "radv: Drop the default viewport when 0 viewports are given." Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-04 00:14:19 +01:00
Bas Nieuwenhuizen	6a36bfc64d	radv: Implement binning on GFX9. Overall it does not really help or hurt. The deferred demo gets 1% improvement and some games a 3% decrease, so I don't think this should be enabled by default. But with the code upstream it is easier to experiment with it. v2: Remove initializing the registers from si_emit_config. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-12-31 15:07:07 +01:00
Dave Airlie	868377ab33	radv/gfx9: use a bigger hammer to flush cb/db caches. amdvlk is probably more subtle than this but it never uses the inv cb/db variants, we fail some CTS tests without this. Fixes: dEQP-VK.renderpass.dedicated_allocation.formats.d32_sfloat_s8_uint.input*. Fixes: `c2fbeb7ca0` (radv: add GFX9 cache flushing support.) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (for now :-) Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-12-29 11:43:30 +10:00
Nicolai Hähnle	97f42d11df	amd/common: sid.h cleanups Fix a bunch of labels indicating when registers were added/removed and normalize the SI-class GRBM_GFX_INDEX. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-28 09:34:43 +01:00
Samuel Pitoiset	305745457c	radv: optimize calling radv_cmd_buffer_trace_emit() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-13 11:05:38 +01:00
Dave Airlie	a639d40f13	radv: add support for local bos. (v3) This uses the new kernel interfaces for reduced cs overhead, We only set the local flag for memory allocations that don't have a dedicated allocation and ones that aren't imports. v2: add to all the internal buffer creation paths. v3: missed some command submission paths, handle 0/empty bo lists. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-26 23:59:28 +01:00
Andres Rodriguez	9f7edf4d1f	radv: don't skip PS/VS partial flush This patch helps lower high priority compute latency. Found by bisecting a perf regression on computeparticles with high priority compute queues enabled. Reverting this micro-optimization doesn't seem to have any negative effect on performance on Dota2 or ssao. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Andres Rodriguez	986c4b0bd4	radv: hardcode shader WAVE_LIMIT to the maximum value When WAVE_LIMIT is set, a submission will opt-in for SPI based resource scheduling. Because this mechanism is cooperative, we must ensure that all submissions have this field set, otherwise they will bypass resource arbitration. We always hardcode the field to its maximum value, instead of attempting to calculate an approximate usage. In testing, there were no benefits to using anything other than the maximum. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Samuel Pitoiset	94e69f4141	radv: move DB_COUNT_CONTROL initialization to si_emit_config() CLEAR_STATE will initialize DB_COUNT_CONTROL to 0 for CIK+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 10:38:11 +02:00
Dave Airlie	c8eb3558cc	radv: fix CLEAR_STATE packet length. Looking at shader traces I noticed some registers were missing, one of them was being eaten by the wrong clear state length. Fixes: `4f42ea4dc` (radv: use CLEAR_STATE for initializing some registers) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-19 23:56:48 +01:00
Samuel Pitoiset	4f42ea4dcf	radv: use CLEAR_STATE for initializing some registers Based on RadeonSI. This improves some Vulkan demos by +1% to +3%. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-12 09:17:43 +02:00
Samuel Pitoiset	c74ed3966e	radv: do not set registers for merged ES-GS on GFX9 Based on RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-12 09:17:38 +02:00
Samuel Pitoiset	1789cac6dd	radv: move the raster config emission in si_set_raster_config() Similar to RadeonSI, also only call this function for <= VI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-12 09:17:35 +02:00
Marek Olšák	76997e9133	radeonsi: shrink r600d_common.h and stop using it Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:27:05 +02:00
Samuel Pitoiset	5848565ee3	radv: emit PA_SU_POINT_{SIZE,MINMAX} in si_emit_config() These registers don't change during the lifetime of the command buffer, there is no need to re-emit them when binding a new pipeline. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-09 10:05:04 +02:00
Bas Nieuwenhuizen	d235ff6e8f	radv: Don't use a virtual function for getting the buffer virtual address. We are really not going to use a winsys which does not need to store the va, so might as well store it in a standard field. Not sure this helps perf much though, as most of the cost is in the cache miss accessing the bo anyway, which we stil need to do. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-09-20 22:04:25 +02:00
Dave Airlie	f2d0f587ca	radv: work out a base ia_multi_vgt_param. This just reduces the calculations a bit further. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	ded1dbfd96	radv: calculate non-draw related ia_multi_vgt_param bits in pipeline This moves a bunch of non-draw dependent calcs into the pipeline code, to reduce CPU overheads in the draw path. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	d2490eb2d1	radv: move calculating primgroup_size to pipeline. This moves this out of the draw paths. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	16eac0a756	radv: only calculate num_prims when required. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	1dbcfd2941	radv: realign vgt flush on hawaii workaround with radeonsi. This realigns this code with the radeonsi version and fixes the indirect case to work properly. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:14 +01:00
Samuel Pitoiset	d4d777317b	radv: move shaders related code to radv_shader.c Reduce size of radv_pipeline.c and improve code isolation. More code can probably moved but it's a start. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-08 17:17:40 +02:00
Dave Airlie	12fd0f8dc1	radv: fix predication on gfx9 When I added gfx9 I did it wrong, this fixes it. Fixes: `5247b311e9` "radv/gfx9: fix set predication packet." Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-25 00:52:32 +01:00
Dave Airlie	5247b311e9	radv/gfx9: fix set predication packet. The predication packet changed format on GFX9, update the driver. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-16 05:52:50 +10:00
Dave Airlie	9ee67467c9	radv: predicate cmask eliminate when using DCC. When using DCC some clear values don't require a cmask eliminate step. This patch adds support for black and black with alpha 1, there are other values, but I don't have access to a comprehensive list. This works by setting the cmask eliminate predicate when doing the fast clear, and later when doing the cmask elimination making sure the draws are predicated. This increases the fps on Sascha Willems deferred. Tonga: 580fps->670fps on a Tonga PRO card. Polaris 730->850fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:44:43 +01:00
Dave Airlie	a6c2001ace	radv: add support for cmd predication. This doesn't get used yet, it just adds support to various PKT3 emissions to enable it later. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 02:06:49 +01:00
Dave Airlie	6a68170c83	radv: handle primitive id input into fragment shader with no geom shader Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:30 +10:00
Dave Airlie	a563f611c3	radv: set prim_id for geometry shaders Noticed in passing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:22 +10:00
Grazvydas Ignotas	f490200973	radv: assert on CP_DMA_USE_L2 for SI The register header (and radeonsi comment) states V_411_SRC_ADDR_TC_L2 is for CIK+ only, so let's assert on earlier ASICs. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-11 14:28:08 +03:00
Dave Airlie	86eff151b1	radv: move chip_class extraction down further. This seems to matter here in a profile, without this we spend a lot more time exiting this function with no flush bits. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 10:25:20 +10:00
Dave Airlie	f0b82bc545	radv/gfx9: use correct register setting for uconfig regs Thanks to Marek for pointing this out. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 08:09:03 +10:00
Bas Nieuwenhuizen	e08f741678	radv: Add early exit for cache flushes. No sense checking each bit separately in the common case of none being set. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-06 23:23:43 +02:00
Dave Airlie	5c8f8cae3e	radv: add IA_MULTI_VGT_PARAM support for GFX9. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:55 +10:00
Dave Airlie	67655cb24f	radv: add rb+ support for GFX9 This adds some rb+ support, as on GFX9 we have to disable it as per radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:45 +10:00
Dave Airlie	c2fbeb7ca0	radv: add GFX9 cache flushing support. GFX9 needs to write event EOP to a fence buffer, allocate some space for this, and just write an ever increasing number to it, this isn't exactly what radeonsi does, but it seems to work. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:40 +10:00
Dave Airlie	87b3799493	radv: add GFX9 to initialisation cmd buffer. This just adds support for initialising some GFX9 registers, and handles the different init for the VGT reuse reg. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:35 +10:00
Dave Airlie	98f27b9cce	radv: don't setup raster_config on gfx9. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:32 +10:00
Dave Airlie	77b8aa4d95	radv: add gfx9 cp dma support. This adds support to the CP dma code for GFX9, ported from radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:29 +10:00
Dave Airlie	0063da8393	radv: add some misc gfx9 pieces. This just adds the strings and includes the gfx9 register defs in some files that we need them in. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:21 +10:00
Dave Airlie	04924c09be	radv: fix typo in comment.	2017-06-06 08:59:30 +10:00
Dave Airlie	114d29e7fe	radv: add a comment from radeonsi before cp dma function. This is just copied over. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 08:44:01 +10:00
Dave Airlie	bcae327469	radv: realign cp dma code with radeonsi This reworks this code to be like radeonsi, which will make it easier to add GFX9 support to it in the future. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-02 12:49:11 +10:00
Dave Airlie	ad61eac250	radv: factor out eop event writing code. (v2) In prep for GFX9 refactor some of the eop event writing code out. This changes behaviour, but aligns with what radeonsi does, it does double emits on CIK/VI, whereas previously it only did this on CIK. v2: bump the size checks. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-02 12:48:56 +10:00
Dave Airlie	7205431e73	radv: factor out si_emit_wait_fence code. This code was in a few places, consolidate into one. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-02 12:48:20 +10:00
Dave Airlie	2add79a732	radv: apply the tess+GS hang workaround to Polaris12 as well As I pointed out for radeonsi, and AMD confirmed, so fix this in radv as well. Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-05-07 11:17:48 +01:00
Dave Airlie	a096d8d3f7	radv: enable POLARIS12 support. This just adds the chip in the right places. We don't set the partial_vs_wave workaround, as radeonsi doesn't, but have to confirm it's not required. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-05-05 11:07:40 +10:00
Bas Nieuwenhuizen	1e1165389c	radv: Add shader prefetch. Gives me approximately a 2% perf increase in bot dota2 & talos. Having descriptors (both sets and vertex buffers) prefetched didn't help so I didn't include that. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-04-19 23:47:27 +02:00
Bas Nieuwenhuizen	a4c4efad89	radv: Rework guard band calculation. We want the guardband_x/y to be the largerst scalars such that each viewport scaled by that amount is still a subrange of [-32767, 32767]. The old code has a couple of issues: 1) It used scissor instead of viewport_scissor, potentially taking into account a viewport that is too small and therefore selecting a scale that is too large. 2) Merging the viewports isn't ideal, as for example viewports with boundaries [0,1] and [1000, 1001] would allow a guardband scale of ~30k, while their union [0, 1001] only allows a scale of ~32. The new code just determines the guardband per viewport and takes the minimum. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-04-03 23:03:46 +02:00
Dave Airlie	03a67fbbf7	radv: fix order of the guardband register emission. y is vert, x is horiz. Noticed in visual inspection compared to radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-04-02 20:17:30 +10:00

1 2

78 Commits