ff3162b074
This operation could be implemented in the TP cores, but this operation tends to be added by convertors that export to TFLite from frameworks with different channel order, and end up being no-ops. Once we move to NIR for tensor operations, we can support this operation and then remove it when we have an explicit transpose operation that is negated by a consequent transpose operation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34629>