f67986b011
Starting with MTL, there will be two GT-tiles, a render and media tile. PXP as a service for supporting workloads with protected contexts and protected buffers can be subscribed by process workloads on any tile. However, depending on the platform, only one of the tiles is used for control events pertaining to PXP operation (such as creating the arbitration session and session tear-down). PXP as a global feature is accessible via batch buffer instructions on any engine/tile and the coherency across tiles is handled implicitly by the HW. In fact, for the foreseeable future, we are expecting this single-control-tile for the PXP subsystem. In MTL, it's the standalone media tile (not the root tile) because it contains the VDBOX and KCR engine (among the assets PXP relies on for those events). Looking at the current code design, each tile is represented by the intel_gt structure while the intel_pxp structure currently hangs off the intel_gt structure. Keeping the intel_pxp structure within the intel_gt structure makes some internal functionalities more straight forward but adds code complexity to code readability and maintainibility to many external-to-pxp subsystems which may need to pick the correct intel_gt structure. An example of this would be the intel_pxp_is_active or intel_pxp_is_enabled functionality which should be viewed as a global level inquiry, not a per-gt inquiry. That said, this series promotes the intel_pxp structure into the drm_i915_private structure making it a top-level subsystem and the PXP subsystem will select the control gt internally and keep a pointer to it for internal reference. This promotion comes with two noteworthy changes: 1. Exported pxp functions that are called by external subsystems (such as intel_pxp_enabled/active) will have to check implicitly if i915->pxp is valid as that structure will not be allocated for HW that doesn't support PXP. 2. Since GT is now considered a soft-dependency of PXP we are ensuring that GT init happens before PXP init and vice versa for fini. This causes a minor ordering change whereby we previously called intel_pxp_suspend after intel_uc_suspend but now is before i915_gem_suspend_late but the change is required for correct dependency flows. Additionally, this re-order change doesn't have any impact because at that point in either case, the top level entry to i915 won't observe any PXP events (since the GPU was quiesced during suspend_prepare). Also, any PXP event doesn't really matter when we disable the PXP HW (global GT irqs are already off anyway, so even if there was a bug that generated spurious events we wouldn't see it and we would just clean it up on resume which is okay since the default fallback action for PXP would be to keep the sessions off at this suspend stage). Changes from prior revs: v11: - Reformat a comment (Tvrtko). v10: - Change the code flow for intel_pxp_init to make it more cleaner and readible with better comments explaining the difference between full-PXP-feature vs the partial-teelink inits depending on the platform. Additionally, only do the pxp allocation when we are certain the subsystem is needed. (Tvrtko). v9: - Cosmetic cleanups in supported/enabled/active. (Daniele). - Add comments for intel_pxp_init and pxp_get_ctrl_gt that explain the functional flow for when PXP is not supported but the backend-assets are needed for HuC authentication (Daniele and Tvrtko). - Fix two remaining functions that are accessible outside PXP that need to be checking pxp ptrs before using them: intel_pxp_irq_handler and intel_pxp_huc_load_and_auth (Tvrtko and Daniele). - User helper macro in pxp-debugfs (Tvrtko). v8: - Remove pxp_to_gt macro (Daniele). - Fix a bug in pxp_get_ctrl_gt for the case of MTL and we don't support GSC-FW on it. (Daniele). - Leave i915->pxp as NULL if we dont support PXP and in line with that, do additional validity check on i915->pxp for intel_pxp_is_supported/enabled/active (Daniele). - Remove unncessary include header from intel_gt_debugfs.c and check drm_minor i915->drm.primary (Daniele). - Other cosmetics / minor issues / more comments on suspend flow order change (Daniele). v7: - Drop i915_dev_to_pxp and in intel_pxp_init use 'i915->pxp' through out instead of local variable newpxp. (Rodrigo) - In the case intel_pxp_fini is called during driver unload but after i915 loading failed without pxp being allocated, check i915->pxp before referencing it. (Alan) v6: - Remove HAS_PXP macro and replace it with intel_pxp_is_supported because : [1] introduction of 'ctrl_gt' means we correct this for MTL's upcoming series now. [2] Also, this has little impact globally as its only used by PXP-internal callers at the moment. - Change intel_pxp_init/fini to take in i915 as its input to avoid ptr-to-ptr in init/fini calls.(Jani). - Remove the backpointer from pxp->i915 since we can use pxp->ctrl_gt->i915 if we need it. (Rodrigo). v5: - Switch from series to single patch (Rodrigo). - change function name from pxp_get_kcr_owner_gt to pxp_get_ctrl_gt. - Fix CI BAT failure by removing redundant call to intel_pxp_fini from driver-remove. - NOTE: remaining open still persists on using ptr-to-ptr and back-ptr. v4: - Instead of maintaining intel_pxp as an intel_gt structure member and creating a number of convoluted helpers that takes in i915 as input and redirects to the correct intel_gt or takes any intel_gt and internally replaces with the correct intel_gt, promote it to be a top-level i915 structure. v3: - Rename gt level helper functions to "intel_pxp_is_enabled/ supported/ active_on_gt" (Daniele) - Upgrade _gt_supports_pxp to replace what was intel_gtpxp_is supported as the new intel_pxp_is_supported_on_gt to check for PXP feature support vs the tee support for huc authentication. Fix pxp-debugfs-registration to use only the former to decide support. (Daniele) - Couple minor optimizations. v2: - Avoid introduction of new device info or gt variables and use existing checks / macros to differentiate the correct GT->PXP control ownership (Daniele Ceraolo Spurio) - Don't reuse the updated global-checkers for per-GT callers (such as other files within PXP) to avoid unnecessary GT-reparsing, expose a replacement helper like the prior ones. (Daniele). v1: - Add one more patch to the series for the intel_pxp suspend/resume for similar refactoring References: https://patchwork.freedesktop.org/patch/msgid/20221202011407.4068371-1-alan.previn.teres.alexis@intel.com Signed-off-by: Alan Previn <alan.previn.teres.alexis@intel.com> Reviewed-by: Daniele Ceraolo Spurio <daniele.ceraolospurio@intel.com> Acked-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com> Signed-off-by: Daniele Ceraolo Spurio <daniele.ceraolospurio@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20221208180542.998148-1-alan.previn.teres.alexis@intel.com
144 lines
2.9 KiB
C
144 lines
2.9 KiB
C
// SPDX-License-Identifier: MIT
|
|
/*
|
|
* Copyright(c) 2020, Intel Corporation. All rights reserved.
|
|
*/
|
|
|
|
#include "gt/intel_context.h"
|
|
#include "gt/intel_engine_pm.h"
|
|
#include "gt/intel_gpu_commands.h"
|
|
#include "gt/intel_ring.h"
|
|
|
|
#include "i915_trace.h"
|
|
|
|
#include "intel_pxp.h"
|
|
#include "intel_pxp_cmd.h"
|
|
#include "intel_pxp_session.h"
|
|
#include "intel_pxp_types.h"
|
|
|
|
/* stall until prior PXP and MFX/HCP/HUC objects are cmopleted */
|
|
#define MFX_WAIT_PXP (MFX_WAIT | \
|
|
MFX_WAIT_DW0_PXP_SYNC_CONTROL_FLAG | \
|
|
MFX_WAIT_DW0_MFX_SYNC_CONTROL_FLAG)
|
|
|
|
static u32 *pxp_emit_session_selection(u32 *cs, u32 idx)
|
|
{
|
|
*cs++ = MFX_WAIT_PXP;
|
|
|
|
/* pxp off */
|
|
*cs++ = MI_FLUSH_DW;
|
|
*cs++ = 0;
|
|
*cs++ = 0;
|
|
|
|
/* select session */
|
|
*cs++ = MI_SET_APPID | MI_SET_APPID_SESSION_ID(idx);
|
|
|
|
*cs++ = MFX_WAIT_PXP;
|
|
|
|
/* pxp on */
|
|
*cs++ = MI_FLUSH_DW | MI_FLUSH_DW_PROTECTED_MEM_EN |
|
|
MI_FLUSH_DW_OP_STOREDW | MI_FLUSH_DW_STORE_INDEX;
|
|
*cs++ = I915_GEM_HWS_PXP_ADDR | MI_FLUSH_DW_USE_GTT;
|
|
*cs++ = 0;
|
|
|
|
*cs++ = MFX_WAIT_PXP;
|
|
|
|
return cs;
|
|
}
|
|
|
|
static u32 *pxp_emit_inline_termination(u32 *cs)
|
|
{
|
|
/* session inline termination */
|
|
*cs++ = CRYPTO_KEY_EXCHANGE;
|
|
*cs++ = 0;
|
|
|
|
return cs;
|
|
}
|
|
|
|
static u32 *pxp_emit_session_termination(u32 *cs, u32 idx)
|
|
{
|
|
cs = pxp_emit_session_selection(cs, idx);
|
|
cs = pxp_emit_inline_termination(cs);
|
|
|
|
return cs;
|
|
}
|
|
|
|
static u32 *pxp_emit_wait(u32 *cs)
|
|
{
|
|
/* wait for cmds to go through */
|
|
*cs++ = MFX_WAIT_PXP;
|
|
*cs++ = 0;
|
|
|
|
return cs;
|
|
}
|
|
|
|
/*
|
|
* if we ever need to terminate more than one session, we can submit multiple
|
|
* selections and terminations back-to-back with a single wait at the end
|
|
*/
|
|
#define SELECTION_LEN 10
|
|
#define TERMINATION_LEN 2
|
|
#define SESSION_TERMINATION_LEN(x) ((SELECTION_LEN + TERMINATION_LEN) * (x))
|
|
#define WAIT_LEN 2
|
|
|
|
static void pxp_request_commit(struct i915_request *rq)
|
|
{
|
|
struct i915_sched_attr attr = { .priority = I915_PRIORITY_MAX };
|
|
struct intel_timeline * const tl = i915_request_timeline(rq);
|
|
|
|
lockdep_unpin_lock(&tl->mutex, rq->cookie);
|
|
|
|
trace_i915_request_add(rq);
|
|
__i915_request_commit(rq);
|
|
__i915_request_queue(rq, &attr);
|
|
|
|
mutex_unlock(&tl->mutex);
|
|
}
|
|
|
|
int intel_pxp_terminate_session(struct intel_pxp *pxp, u32 id)
|
|
{
|
|
struct i915_request *rq;
|
|
struct intel_context *ce = pxp->ce;
|
|
u32 *cs;
|
|
int err = 0;
|
|
|
|
if (!intel_pxp_is_enabled(pxp))
|
|
return 0;
|
|
|
|
rq = i915_request_create(ce);
|
|
if (IS_ERR(rq))
|
|
return PTR_ERR(rq);
|
|
|
|
if (ce->engine->emit_init_breadcrumb) {
|
|
err = ce->engine->emit_init_breadcrumb(rq);
|
|
if (err)
|
|
goto out_rq;
|
|
}
|
|
|
|
cs = intel_ring_begin(rq, SESSION_TERMINATION_LEN(1) + WAIT_LEN);
|
|
if (IS_ERR(cs)) {
|
|
err = PTR_ERR(cs);
|
|
goto out_rq;
|
|
}
|
|
|
|
cs = pxp_emit_session_termination(cs, id);
|
|
cs = pxp_emit_wait(cs);
|
|
|
|
intel_ring_advance(rq, cs);
|
|
|
|
out_rq:
|
|
i915_request_get(rq);
|
|
|
|
if (unlikely(err))
|
|
i915_request_set_error_once(rq, err);
|
|
|
|
pxp_request_commit(rq);
|
|
|
|
if (!err && i915_request_wait(rq, 0, HZ / 5) < 0)
|
|
err = -ETIME;
|
|
|
|
i915_request_put(rq);
|
|
|
|
return err;
|
|
}
|
|
|