2022-04-01 17:22:03 +03:00
.. _drm-client-usage-stats:
======================
DRM client usage stats
======================
DRM drivers can choose to export partly standardised text output via the
`fops->show_fdinfo()` as part of the driver specific file operations registered
in the `struct drm_driver` object registered with the DRM core.
2023-08-15 00:28:22 +03:00
One purpose of this output is to enable writing as generic as practically
2022-04-01 17:22:03 +03:00
feasible `top(1)` like userspace monitoring tools.
Given the differences between various DRM drivers the specification of the
output is split between common and driver specific parts. Having said that,
wherever possible effort should still be made to standardise as much as
possible.
File format specification
=========================
- File shall contain one key value pair per one line of text.
- Colon character (`:` ) must be used to delimit keys and values.
- All keys shall be prefixed with `drm-` .
- Whitespace between the delimiter and first non-whitespace character shall be
ignored when parsing.
2023-05-24 18:59:37 +03:00
- Keys are not allowed to contain whitespace characters.
2022-04-01 17:22:03 +03:00
- Numerical key value pairs can end with optional unit string.
- Data type of the value is fixed as defined in the specification.
Key types
---------
1. Mandatory, fully standardised.
2. Optional, fully standardised.
3. Driver specific.
Data types
----------
- <uint> - Unsigned integer without defining the maximum value.
2023-05-24 18:59:37 +03:00
- <keystr> - String excluding any above defined reserved characters or whitespace.
- <valstr> - String.
2022-04-01 17:22:03 +03:00
Mandatory fully standardised keys
---------------------------------
2023-05-24 18:59:37 +03:00
- drm-driver: <valstr>
2022-04-01 17:22:03 +03:00
String shall contain the name this driver registered as via the respective
`struct drm_driver` data structure.
Optional fully standardised keys
--------------------------------
2023-05-24 18:59:35 +03:00
Identification
^^^^^^^^^^^^^^
2022-04-01 17:22:03 +03:00
- drm-pdev: <aaaa:bb.cc.d>
For PCI devices this should contain the PCI slot address of the device in
question.
- drm-client-id: <uint>
Unique value relating to the open DRM file descriptor used to distinguish
duplicated and shared file descriptors. Conceptually the value should map 1:1
to the in kernel representation of `struct drm_file` instances.
Uniqueness of the value shall be either globally unique, or unique within the
scope of each device, in which case `drm-pdev` shall be present as well.
Userspace should make sure to not double account any usage statistics by using
the above described criteria in order to associate data to individual clients.
2023-05-24 18:59:35 +03:00
Utilization
^^^^^^^^^^^
2023-05-24 18:59:37 +03:00
- drm-engine-<keystr>: <uint> ns
2022-04-01 17:22:03 +03:00
GPUs usually contain multiple execution engines. Each shall be given a stable
2023-05-24 18:59:37 +03:00
and unique name (keystr), with possible values documented in the driver specific
2022-04-01 17:22:03 +03:00
documentation.
Value shall be in specified time units which the respective GPU engine spent
busy executing workloads belonging to this client.
Values are not required to be constantly monotonic if it makes the driver
implementation easier, but are required to catch up with the previously reported
larger value within a reasonable period. Upon observing a value lower than what
was previously read, userspace is expected to stay with that larger previous
value until a monotonic update is seen.
2023-05-24 18:59:37 +03:00
- drm-engine-capacity-<keystr>: <uint>
2022-04-01 17:22:03 +03:00
Engine identifier string must be the same as the one specified in the
2023-05-24 18:59:37 +03:00
drm-engine-<keystr> tag and shall contain a greater than zero number in case the
2022-04-01 17:22:03 +03:00
exported engine corresponds to a group of identical hardware engines.
In the absence of this tag parser shall assume capacity of one. Zero capacity
is not allowed.
2023-05-24 18:59:37 +03:00
- drm-cycles-<keystr>: <uint>
2022-06-09 20:42:12 +03:00
Engine identifier string must be the same as the one specified in the
2023-05-24 18:59:37 +03:00
drm-engine-<keystr> tag and shall contain the number of busy cycles for the given
2022-06-09 20:42:12 +03:00
engine.
Values are not required to be constantly monotonic if it makes the driver
implementation easier, but are required to catch up with the previously reported
larger value within a reasonable period. Upon observing a value lower than what
was previously read, userspace is expected to stay with that larger previous
value until a monotonic update is seen.
2023-05-24 18:59:37 +03:00
- drm-maxfreq-<keystr>: <uint> [Hz|MHz|KHz]
2022-06-09 20:42:12 +03:00
Engine identifier string must be the same as the one specified in the
2023-05-24 18:59:37 +03:00
drm-engine-<keystr> tag and shall contain the maximum frequency for the given
engine. Taken together with drm-cycles-<keystr>, this can be used to calculate
percentage utilization of the engine, whereas drm-engine-<keystr> only reflects
2022-06-09 20:42:12 +03:00
time active without considering what frequency the engine is operating as a
2023-08-15 00:28:22 +03:00
percentage of its maximum frequency.
2022-06-09 20:42:12 +03:00
2023-05-24 18:59:35 +03:00
Memory
^^^^^^
- drm-memory-<region>: <uint> [KiB|MiB]
Each possible memory type which can be used to store buffer objects by the
GPU in question shall be given a stable and unique name to be returned as the
string here. The name "memory" is reserved to refer to normal system memory.
Value shall reflect the amount of storage currently consumed by the buffer
objects belong to this client, in the respective memory region.
Default unit shall be bytes with optional unit specifiers of 'KiB' or 'MiB'
indicating kibi- or mebi-bytes.
- drm-shared-<region>: <uint> [KiB|MiB]
The total size of buffers that are shared with another file (ie. have more
than a single handle).
- drm-total-<region>: <uint> [KiB|MiB]
The total size of buffers that including shared and private memory.
- drm-resident-<region>: <uint> [KiB|MiB]
The total size of buffers that are resident in the specified region.
- drm-purgeable-<region>: <uint> [KiB|MiB]
The total size of buffers that are purgeable.
- drm-active-<region>: <uint> [KiB|MiB]
The total size of buffers that are active on one or more engines.
2023-05-24 18:59:32 +03:00
Implementation Details
======================
Drivers should use drm_show_fdinfo() in their `struct file_operations` , and
implement &drm_driver.show_fdinfo if they wish to provide any stats which
are not provided by drm_show_fdinfo(). But even driver specific stats should
be documented above and where possible, aligned with other drivers.
2022-04-01 17:22:05 +03:00
Driver specific implementations
2023-05-24 18:59:32 +03:00
-------------------------------
2022-04-01 17:22:05 +03:00
:ref: `i915-usage-stats`
drm/panfrost: Add fdinfo support GPU load metrics
The drm-stats fdinfo tags made available to user space are drm-engine,
drm-cycles, drm-max-freq and drm-curfreq, one per job slot.
This deviates from standard practice in other DRM drivers, where a single
set of key:value pairs is provided for the whole render engine. However,
Panfrost has separate queues for fragment and vertex/tiler jobs, so a
decision was made to calculate bus cycles and workload times separately.
Maximum operating frequency is calculated at devfreq initialisation time.
Current frequency is made available to user space because nvtop uses it
when performing engine usage calculations.
It is important to bear in mind that both GPU cycle and kernel time numbers
provided are at best rough estimations, and always reported in excess from
the actual figure because of two reasons:
- Excess time because of the delay between the end of a job processing,
the subsequent job IRQ and the actual time of the sample.
- Time spent in the engine queue waiting for the GPU to pick up the next
job.
To avoid race conditions during enablement/disabling, a reference counting
mechanism was introduced, and a job flag that tells us whether a given job
increased the refcount. This is necessary, because user space can toggle
cycle counting through a debugfs file, and a given job might have been in
flight by the time cycle counting was disabled.
The main goal of the debugfs cycle counter knob is letting tools like nvtop
or IGT's gputop switch it at any time, to avoid power waste in case no
engine usage measuring is necessary.
Also add a documentation file explaining the possible values for fdinfo's
engine keystrings and Panfrost-specific drm-curfreq-<keystr> pairs.
Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com>
Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com>
Reviewed-by: Steven Price <steven.price@arm.com>
Reviewed-by: AngeloGioacchino Del Regno <angelogioacchino.delregno@collabora.com>
Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20230929181616.2769345-3-adrian.larumbe@collabora.com
2023-09-29 21:14:28 +03:00
:ref: `panfrost-usage-stats`