23a68d14de
This Kselftest update for Linux 6.2-rc1 consists of several fixes and enhancements to existing tests and a few new tests: - adds new amd-pstate and fixes and enhances existing ones - adds new watchdog tests and enhances existing ones to improve coverage - fixes to ftrace, splice_read, rtc, and efivars tests - fixes to handle egrep obsolescence in the latest grep release - miscellaneous spelling and SPDX fixes -----BEGIN PGP SIGNATURE----- iQIzBAABCgAdFiEEPZKym/RZuOCGeA/kCwJExA0NQxwFAmOXdT4ACgkQCwJExA0N Qxy06RAAvQN6kpzCGJjLz5R9Lx3QpG4BiCO7vdZs1QTwzo8kTGQDq3JD6m+ychDx CgLnH7RrPlIYz4oExnV7JPE2tFEarV/zFh2V8LjKfGePZVtNeDASlC7F3lWYUnM/ n3+6H/JbZ1BgGE9DE5/DAOOAsN0CY2QPJWRDN1wYH7/gLXulPlSt+BV/ZFj3LG/0 Qne2SR7kc+hKPOFNl+BWKOU2a4mNOmoxaROgQraKdeQMQoTAwz/7lfylYZD9nU0r nyVxHTr0n+/XX3Q93arAS/chOyFBJrAESciUPY4E2oF97uiE0TqHdKA/qfPNRr7N wSOdWxYSuNaz0tkzO01EzeGGr+mw0WlCNoo6NzsUvqzRXDf0F0cWe32tmIZHJAzS CqxpKd6I8XPkEeyy5kL12q+akxe30zDGaKdaYGkZ7SjbwG6ygzSSW5MYfojvbtr9 Nfb6OnkPC1aZzC9jtiJO1EHd9f+PdeUVKNQsvzseT4b9xhmpxBqlrzgB5GakDoE6 uo3cXyz5gOzqJD6FT+CqKa/16NaHATw/U7/Y0gXj5ELKEmuYBmnl1T9svDnSIVfF hgS/3UkFYiw3R2oW35wv988w2JsXrkItOyNdAm47ihvAHF/uCumcSeea5k3+QYrH 7bM4PzJsMMcOhWWQ/04Q+LQCWWem/Vhk22BlIr6IiuGd6L03pc8= =4cDX -----END PGP SIGNATURE----- Merge tag 'linux-kselftest-next-6.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest Pull Kselftest updates from Shuah Khan: "Several fixes and enhancements to existing tests and a few new tests: - add new amd-pstate tests and fix and enhance existing ones - add new watchdog tests and enhance existing ones to improve coverage - fixes to ftrace, splice_read, rtc, and efivars tests - fixes to handle egrep obsolescence in the latest grep release - miscellaneous spelling and SPDX fixes" * tag 'linux-kselftest-next-6.2-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest: (24 commits) selftests/ftrace: Use long for synthetic event probe test selftests/tpm2: Split async tests call to separate shell script runner selftests: splice_read: Fix sysfs read cases selftests: ftrace: Use "grep -E" instead of "egrep" selftests: gpio: Use "grep -E" instead of "egrep" selftests: kselftest_deps: Use "grep -E" instead of "egrep" selftests/efivarfs: Add checking of the test return value cpufreq: amd-pstate: fix spdxcheck warnings for amd-pstate-ut.c selftests: rtc: skip when RTC is not present selftests/ftrace: event_triggers: wait longer for test_event_enable selftests/vDSO: Add riscv getcpu & gettimeofday test Documentation: amd-pstate: Add tbench and gitsource test introduction selftests: amd-pstate: Trigger gitsource benchmark and test cpus selftests: amd-pstate: Trigger tbench benchmark and test cpus selftests: amd-pstate: Split basic.sh into run.sh and basic.sh. selftests: amd-pstate: Rename amd-pstate-ut.sh to basic.sh. selftests/ftrace: Convert tracer tests to use 'requires' to specify program dependency selftests/ftrace: Add check for ping command for trigger tests selftests/watchdog: Fix spelling mistake "Temeprature" -> "Temperature" selftests/watchdog: add test for WDIOC_GETTEMP ...
634 lines
41 KiB
ReStructuredText
634 lines
41 KiB
ReStructuredText
.. SPDX-License-Identifier: GPL-2.0
|
|
.. include:: <isonum.txt>
|
|
|
|
===============================================
|
|
``amd-pstate`` CPU Performance Scaling Driver
|
|
===============================================
|
|
|
|
:Copyright: |copy| 2021 Advanced Micro Devices, Inc.
|
|
|
|
:Author: Huang Rui <ray.huang@amd.com>
|
|
|
|
|
|
Introduction
|
|
===================
|
|
|
|
``amd-pstate`` is the AMD CPU performance scaling driver that introduces a
|
|
new CPU frequency control mechanism on modern AMD APU and CPU series in
|
|
Linux kernel. The new mechanism is based on Collaborative Processor
|
|
Performance Control (CPPC) which provides finer grain frequency management
|
|
than legacy ACPI hardware P-States. Current AMD CPU/APU platforms are using
|
|
the ACPI P-states driver to manage CPU frequency and clocks with switching
|
|
only in 3 P-states. CPPC replaces the ACPI P-states controls and allows a
|
|
flexible, low-latency interface for the Linux kernel to directly
|
|
communicate the performance hints to hardware.
|
|
|
|
``amd-pstate`` leverages the Linux kernel governors such as ``schedutil``,
|
|
``ondemand``, etc. to manage the performance hints which are provided by
|
|
CPPC hardware functionality that internally follows the hardware
|
|
specification (for details refer to AMD64 Architecture Programmer's Manual
|
|
Volume 2: System Programming [1]_). Currently, ``amd-pstate`` supports basic
|
|
frequency control function according to kernel governors on some of the
|
|
Zen2 and Zen3 processors, and we will implement more AMD specific functions
|
|
in future after we verify them on the hardware and SBIOS.
|
|
|
|
|
|
AMD CPPC Overview
|
|
=======================
|
|
|
|
Collaborative Processor Performance Control (CPPC) interface enumerates a
|
|
continuous, abstract, and unit-less performance value in a scale that is
|
|
not tied to a specific performance state / frequency. This is an ACPI
|
|
standard [2]_ which software can specify application performance goals and
|
|
hints as a relative target to the infrastructure limits. AMD processors
|
|
provide the low latency register model (MSR) instead of an AML code
|
|
interpreter for performance adjustments. ``amd-pstate`` will initialize a
|
|
``struct cpufreq_driver`` instance, ``amd_pstate_driver``, with the callbacks
|
|
to manage each performance update behavior. ::
|
|
|
|
Highest Perf ------>+-----------------------+ +-----------------------+
|
|
| | | |
|
|
| | | |
|
|
| | Max Perf ---->| |
|
|
| | | |
|
|
| | | |
|
|
Nominal Perf ------>+-----------------------+ +-----------------------+
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | Desired Perf ---->| |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
Lowest non- | | | |
|
|
linear perf ------>+-----------------------+ +-----------------------+
|
|
| | | |
|
|
| | Lowest perf ---->| |
|
|
| | | |
|
|
Lowest perf ------>+-----------------------+ +-----------------------+
|
|
| | | |
|
|
| | | |
|
|
| | | |
|
|
0 ------>+-----------------------+ +-----------------------+
|
|
|
|
AMD P-States Performance Scale
|
|
|
|
|
|
.. _perf_cap:
|
|
|
|
AMD CPPC Performance Capability
|
|
--------------------------------
|
|
|
|
Highest Performance (RO)
|
|
.........................
|
|
|
|
This is the absolute maximum performance an individual processor may reach,
|
|
assuming ideal conditions. This performance level may not be sustainable
|
|
for long durations and may only be achievable if other platform components
|
|
are in a specific state; for example, it may require other processors to be in
|
|
an idle state. This would be equivalent to the highest frequencies
|
|
supported by the processor.
|
|
|
|
Nominal (Guaranteed) Performance (RO)
|
|
......................................
|
|
|
|
This is the maximum sustained performance level of the processor, assuming
|
|
ideal operating conditions. In the absence of an external constraint (power,
|
|
thermal, etc.), this is the performance level the processor is expected to
|
|
be able to maintain continuously. All cores/processors are expected to be
|
|
able to sustain their nominal performance state simultaneously.
|
|
|
|
Lowest non-linear Performance (RO)
|
|
...................................
|
|
|
|
This is the lowest performance level at which nonlinear power savings are
|
|
achieved, for example, due to the combined effects of voltage and frequency
|
|
scaling. Above this threshold, lower performance levels should be generally
|
|
more energy efficient than higher performance levels. This register
|
|
effectively conveys the most efficient performance level to ``amd-pstate``.
|
|
|
|
Lowest Performance (RO)
|
|
........................
|
|
|
|
This is the absolute lowest performance level of the processor. Selecting a
|
|
performance level lower than the lowest nonlinear performance level may
|
|
cause an efficiency penalty but should reduce the instantaneous power
|
|
consumption of the processor.
|
|
|
|
AMD CPPC Performance Control
|
|
------------------------------
|
|
|
|
``amd-pstate`` passes performance goals through these registers. The
|
|
register drives the behavior of the desired performance target.
|
|
|
|
Minimum requested performance (RW)
|
|
...................................
|
|
|
|
``amd-pstate`` specifies the minimum allowed performance level.
|
|
|
|
Maximum requested performance (RW)
|
|
...................................
|
|
|
|
``amd-pstate`` specifies a limit the maximum performance that is expected
|
|
to be supplied by the hardware.
|
|
|
|
Desired performance target (RW)
|
|
...................................
|
|
|
|
``amd-pstate`` specifies a desired target in the CPPC performance scale as
|
|
a relative number. This can be expressed as percentage of nominal
|
|
performance (infrastructure max). Below the nominal sustained performance
|
|
level, desired performance expresses the average performance level of the
|
|
processor subject to hardware. Above the nominal performance level,
|
|
the processor must provide at least nominal performance requested and go higher
|
|
if current operating conditions allow.
|
|
|
|
Energy Performance Preference (EPP) (RW)
|
|
.........................................
|
|
|
|
This attribute provides a hint to the hardware if software wants to bias
|
|
toward performance (0x0) or energy efficiency (0xff).
|
|
|
|
|
|
Key Governors Support
|
|
=======================
|
|
|
|
``amd-pstate`` can be used with all the (generic) scaling governors listed
|
|
by the ``scaling_available_governors`` policy attribute in ``sysfs``. Then,
|
|
it is responsible for the configuration of policy objects corresponding to
|
|
CPUs and provides the ``CPUFreq`` core (and the scaling governors attached
|
|
to the policy objects) with accurate information on the maximum and minimum
|
|
operating frequencies supported by the hardware. Users can check the
|
|
``scaling_cur_freq`` information comes from the ``CPUFreq`` core.
|
|
|
|
``amd-pstate`` mainly supports ``schedutil`` and ``ondemand`` for dynamic
|
|
frequency control. It is to fine tune the processor configuration on
|
|
``amd-pstate`` to the ``schedutil`` with CPU CFS scheduler. ``amd-pstate``
|
|
registers the adjust_perf callback to implement performance update behavior
|
|
similar to CPPC. It is initialized by ``sugov_start`` and then populates the
|
|
CPU's update_util_data pointer to assign ``sugov_update_single_perf`` as the
|
|
utilization update callback function in the CPU scheduler. The CPU scheduler
|
|
will call ``cpufreq_update_util`` and assigns the target performance according
|
|
to the ``struct sugov_cpu`` that the utilization update belongs to.
|
|
Then, ``amd-pstate`` updates the desired performance according to the CPU
|
|
scheduler assigned.
|
|
|
|
.. _processor_support:
|
|
|
|
Processor Support
|
|
=======================
|
|
|
|
The ``amd-pstate`` initialization will fail if the ``_CPC`` entry in the ACPI
|
|
SBIOS does not exist in the detected processor. It uses ``acpi_cpc_valid``
|
|
to check the existence of ``_CPC``. All Zen based processors support the legacy
|
|
ACPI hardware P-States function, so when ``amd-pstate`` fails initialization,
|
|
the kernel will fall back to initialize the ``acpi-cpufreq`` driver.
|
|
|
|
There are two types of hardware implementations for ``amd-pstate``: one is
|
|
`Full MSR Support <perf_cap_>`_ and another is `Shared Memory Support
|
|
<perf_cap_>`_. It can use the :c:macro:`X86_FEATURE_CPPC` feature flag to
|
|
indicate the different types. (For details, refer to the Processor Programming
|
|
Reference (PPR) for AMD Family 19h Model 51h, Revision A1 Processors [3]_.)
|
|
``amd-pstate`` is to register different ``static_call`` instances for different
|
|
hardware implementations.
|
|
|
|
Currently, some of the Zen2 and Zen3 processors support ``amd-pstate``. In the
|
|
future, it will be supported on more and more AMD processors.
|
|
|
|
Full MSR Support
|
|
-----------------
|
|
|
|
Some new Zen3 processors such as Cezanne provide the MSR registers directly
|
|
while the :c:macro:`X86_FEATURE_CPPC` CPU feature flag is set.
|
|
``amd-pstate`` can handle the MSR register to implement the fast switch
|
|
function in ``CPUFreq`` that can reduce the latency of frequency control in
|
|
interrupt context. The functions with a ``pstate_xxx`` prefix represent the
|
|
operations on MSR registers.
|
|
|
|
Shared Memory Support
|
|
----------------------
|
|
|
|
If the :c:macro:`X86_FEATURE_CPPC` CPU feature flag is not set, the
|
|
processor supports the shared memory solution. In this case, ``amd-pstate``
|
|
uses the ``cppc_acpi`` helper methods to implement the callback functions
|
|
that are defined on ``static_call``. The functions with the ``cppc_xxx`` prefix
|
|
represent the operations of ACPI CPPC helpers for the shared memory solution.
|
|
|
|
|
|
AMD P-States and ACPI hardware P-States always can be supported in one
|
|
processor. But AMD P-States has the higher priority and if it is enabled
|
|
with :c:macro:`MSR_AMD_CPPC_ENABLE` or ``cppc_set_enable``, it will respond
|
|
to the request from AMD P-States.
|
|
|
|
|
|
User Space Interface in ``sysfs``
|
|
==================================
|
|
|
|
``amd-pstate`` exposes several global attributes (files) in ``sysfs`` to
|
|
control its functionality at the system level. They are located in the
|
|
``/sys/devices/system/cpu/cpufreq/policyX/`` directory and affect all CPUs. ::
|
|
|
|
root@hr-test1:/home/ray# ls /sys/devices/system/cpu/cpufreq/policy0/*amd*
|
|
/sys/devices/system/cpu/cpufreq/policy0/amd_pstate_highest_perf
|
|
/sys/devices/system/cpu/cpufreq/policy0/amd_pstate_lowest_nonlinear_freq
|
|
/sys/devices/system/cpu/cpufreq/policy0/amd_pstate_max_freq
|
|
|
|
|
|
``amd_pstate_highest_perf / amd_pstate_max_freq``
|
|
|
|
Maximum CPPC performance and CPU frequency that the driver is allowed to
|
|
set, in percent of the maximum supported CPPC performance level (the highest
|
|
performance supported in `AMD CPPC Performance Capability <perf_cap_>`_).
|
|
In some ASICs, the highest CPPC performance is not the one in the ``_CPC``
|
|
table, so we need to expose it to sysfs. If boost is not active, but
|
|
still supported, this maximum frequency will be larger than the one in
|
|
``cpuinfo``.
|
|
This attribute is read-only.
|
|
|
|
``amd_pstate_lowest_nonlinear_freq``
|
|
|
|
The lowest non-linear CPPC CPU frequency that the driver is allowed to set,
|
|
in percent of the maximum supported CPPC performance level. (Please see the
|
|
lowest non-linear performance in `AMD CPPC Performance Capability
|
|
<perf_cap_>`_.)
|
|
This attribute is read-only.
|
|
|
|
Other performance and frequency values can be read back from
|
|
``/sys/devices/system/cpu/cpuX/acpi_cppc/``, see :ref:`cppc_sysfs`.
|
|
|
|
|
|
``amd-pstate`` vs ``acpi-cpufreq``
|
|
======================================
|
|
|
|
On the majority of AMD platforms supported by ``acpi-cpufreq``, the ACPI tables
|
|
provided by the platform firmware are used for CPU performance scaling, but
|
|
only provide 3 P-states on AMD processors.
|
|
However, on modern AMD APU and CPU series, hardware provides the Collaborative
|
|
Processor Performance Control according to the ACPI protocol and customizes this
|
|
for AMD platforms. That is, fine-grained and continuous frequency ranges
|
|
instead of the legacy hardware P-states. ``amd-pstate`` is the kernel
|
|
module which supports the new AMD P-States mechanism on most of the future AMD
|
|
platforms. The AMD P-States mechanism is the more performance and energy
|
|
efficiency frequency management method on AMD processors.
|
|
|
|
Kernel Module Options for ``amd-pstate``
|
|
=========================================
|
|
|
|
Passive Mode
|
|
------------
|
|
|
|
``amd_pstate=passive``
|
|
|
|
It will be enabled if the ``amd_pstate=passive`` is passed to the kernel in the command line.
|
|
In this mode, ``amd_pstate`` driver software specifies a desired QoS target in the CPPC
|
|
performance scale as a relative number. This can be expressed as percentage of nominal
|
|
performance (infrastructure max). Below the nominal sustained performance level,
|
|
desired performance expresses the average performance level of the processor subject
|
|
to the Performance Reduction Tolerance register. Above the nominal performance level,
|
|
processor must provide at least nominal performance requested and go higher if current
|
|
operating conditions allow.
|
|
|
|
|
|
``cpupower`` tool support for ``amd-pstate``
|
|
===============================================
|
|
|
|
``amd-pstate`` is supported by the ``cpupower`` tool, which can be used to dump
|
|
frequency information. Development is in progress to support more and more
|
|
operations for the new ``amd-pstate`` module with this tool. ::
|
|
|
|
root@hr-test1:/home/ray# cpupower frequency-info
|
|
analyzing CPU 0:
|
|
driver: amd-pstate
|
|
CPUs which run at the same hardware frequency: 0
|
|
CPUs which need to have their frequency coordinated by software: 0
|
|
maximum transition latency: 131 us
|
|
hardware limits: 400 MHz - 4.68 GHz
|
|
available cpufreq governors: ondemand conservative powersave userspace performance schedutil
|
|
current policy: frequency should be within 400 MHz and 4.68 GHz.
|
|
The governor "schedutil" may decide which speed to use
|
|
within this range.
|
|
current CPU frequency: Unable to call hardware
|
|
current CPU frequency: 4.02 GHz (asserted by call to kernel)
|
|
boost state support:
|
|
Supported: yes
|
|
Active: yes
|
|
AMD PSTATE Highest Performance: 166. Maximum Frequency: 4.68 GHz.
|
|
AMD PSTATE Nominal Performance: 117. Nominal Frequency: 3.30 GHz.
|
|
AMD PSTATE Lowest Non-linear Performance: 39. Lowest Non-linear Frequency: 1.10 GHz.
|
|
AMD PSTATE Lowest Performance: 15. Lowest Frequency: 400 MHz.
|
|
|
|
|
|
Diagnostics and Tuning
|
|
=======================
|
|
|
|
Trace Events
|
|
--------------
|
|
|
|
There are two static trace events that can be used for ``amd-pstate``
|
|
diagnostics. One of them is the ``cpu_frequency`` trace event generally used
|
|
by ``CPUFreq``, and the other one is the ``amd_pstate_perf`` trace event
|
|
specific to ``amd-pstate``. The following sequence of shell commands can
|
|
be used to enable them and see their output (if the kernel is
|
|
configured to support event tracing). ::
|
|
|
|
root@hr-test1:/home/ray# cd /sys/kernel/tracing/
|
|
root@hr-test1:/sys/kernel/tracing# echo 1 > events/amd_cpu/enable
|
|
root@hr-test1:/sys/kernel/tracing# cat trace
|
|
# tracer: nop
|
|
#
|
|
# entries-in-buffer/entries-written: 47827/42233061 #P:2
|
|
#
|
|
# _-----=> irqs-off
|
|
# / _----=> need-resched
|
|
# | / _---=> hardirq/softirq
|
|
# || / _--=> preempt-depth
|
|
# ||| / delay
|
|
# TASK-PID CPU# |||| TIMESTAMP FUNCTION
|
|
# | | | |||| | |
|
|
<idle>-0 [015] dN... 4995.979886: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=15 changed=false fast_switch=true
|
|
<idle>-0 [007] d.h.. 4995.979893: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=7 changed=false fast_switch=true
|
|
cat-2161 [000] d.... 4995.980841: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=0 changed=false fast_switch=true
|
|
sshd-2125 [004] d.s.. 4995.980968: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=4 changed=false fast_switch=true
|
|
<idle>-0 [007] d.s.. 4995.980968: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=7 changed=false fast_switch=true
|
|
<idle>-0 [003] d.s.. 4995.980971: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=3 changed=false fast_switch=true
|
|
<idle>-0 [011] d.s.. 4995.980996: amd_pstate_perf: amd_min_perf=85 amd_des_perf=85 amd_max_perf=166 cpu_id=11 changed=false fast_switch=true
|
|
|
|
The ``cpu_frequency`` trace event will be triggered either by the ``schedutil`` scaling
|
|
governor (for the policies it is attached to), or by the ``CPUFreq`` core (for the
|
|
policies with other scaling governors).
|
|
|
|
|
|
Tracer Tool
|
|
-------------
|
|
|
|
``amd_pstate_tracer.py`` can record and parse ``amd-pstate`` trace log, then
|
|
generate performance plots. This utility can be used to debug and tune the
|
|
performance of ``amd-pstate`` driver. The tracer tool needs to import intel
|
|
pstate tracer.
|
|
|
|
Tracer tool located in ``linux/tools/power/x86/amd_pstate_tracer``. It can be
|
|
used in two ways. If trace file is available, then directly parse the file
|
|
with command ::
|
|
|
|
./amd_pstate_trace.py [-c cpus] -t <trace_file> -n <test_name>
|
|
|
|
Or generate trace file with root privilege, then parse and plot with command ::
|
|
|
|
sudo ./amd_pstate_trace.py [-c cpus] -n <test_name> -i <interval> [-m kbytes]
|
|
|
|
The test result can be found in ``results/test_name``. Following is the example
|
|
about part of the output. ::
|
|
|
|
common_cpu common_secs common_usecs min_perf des_perf max_perf freq mperf apef tsc load duration_ms sample_num elapsed_time common_comm
|
|
CPU_005 712 116384 39 49 166 0.7565 9645075 2214891 38431470 25.1 11.646 469 2.496 kworker/5:0-40
|
|
CPU_006 712 116408 39 49 166 0.6769 8950227 1839034 37192089 24.06 11.272 470 2.496 kworker/6:0-1264
|
|
|
|
Unit Tests for amd-pstate
|
|
-------------------------
|
|
|
|
``amd-pstate-ut`` is a test module for testing the ``amd-pstate`` driver.
|
|
|
|
* It can help all users to verify their processor support (SBIOS/Firmware or Hardware).
|
|
|
|
* Kernel can have a basic function test to avoid the kernel regression during the update.
|
|
|
|
* We can introduce more functional or performance tests to align the result together, it will benefit power and performance scale optimization.
|
|
|
|
1. Test case decriptions
|
|
|
|
1). Basic tests
|
|
|
|
Test prerequisite and basic functions for the ``amd-pstate`` driver.
|
|
|
|
+---------+--------------------------------+------------------------------------------------------------------------------------+
|
|
| Index | Functions | Description |
|
|
+=========+================================+====================================================================================+
|
|
| 1 | amd_pstate_ut_acpi_cpc_valid || Check whether the _CPC object is present in SBIOS. |
|
|
| | || |
|
|
| | || The detail refer to `Processor Support <processor_support_>`_. |
|
|
+---------+--------------------------------+------------------------------------------------------------------------------------+
|
|
| 2 | amd_pstate_ut_check_enabled || Check whether AMD P-State is enabled. |
|
|
| | || |
|
|
| | || AMD P-States and ACPI hardware P-States always can be supported in one processor. |
|
|
| | | But AMD P-States has the higher priority and if it is enabled with |
|
|
| | | :c:macro:`MSR_AMD_CPPC_ENABLE` or ``cppc_set_enable``, it will respond to the |
|
|
| | | request from AMD P-States. |
|
|
+---------+--------------------------------+------------------------------------------------------------------------------------+
|
|
| 3 | amd_pstate_ut_check_perf || Check if the each performance values are reasonable. |
|
|
| | || highest_perf >= nominal_perf > lowest_nonlinear_perf > lowest_perf > 0. |
|
|
+---------+--------------------------------+------------------------------------------------------------------------------------+
|
|
| 4 | amd_pstate_ut_check_freq || Check if the each frequency values and max freq when set support boost mode |
|
|
| | | are reasonable. |
|
|
| | || max_freq >= nominal_freq > lowest_nonlinear_freq > min_freq > 0 |
|
|
| | || If boost is not active but supported, this maximum frequency will be larger than |
|
|
| | | the one in ``cpuinfo``. |
|
|
+---------+--------------------------------+------------------------------------------------------------------------------------+
|
|
|
|
2). Tbench test
|
|
|
|
Test and monitor the cpu changes when running tbench benchmark under the specified governor.
|
|
These changes include desire performance, frequency, load, performance, energy etc.
|
|
The specified governor is ondemand or schedutil.
|
|
Tbench can also be tested on the ``acpi-cpufreq`` kernel driver for comparison.
|
|
|
|
3). Gitsource test
|
|
|
|
Test and monitor the cpu changes when running gitsource benchmark under the specified governor.
|
|
These changes include desire performance, frequency, load, time, energy etc.
|
|
The specified governor is ondemand or schedutil.
|
|
Gitsource can also be tested on the ``acpi-cpufreq`` kernel driver for comparison.
|
|
|
|
#. How to execute the tests
|
|
|
|
We use test module in the kselftest frameworks to implement it.
|
|
We create ``amd-pstate-ut`` module and tie it into kselftest.(for
|
|
details refer to Linux Kernel Selftests [4]_).
|
|
|
|
1). Build
|
|
|
|
+ open the :c:macro:`CONFIG_X86_AMD_PSTATE` configuration option.
|
|
+ set the :c:macro:`CONFIG_X86_AMD_PSTATE_UT` configuration option to M.
|
|
+ make project
|
|
+ make selftest ::
|
|
|
|
$ cd linux
|
|
$ make -C tools/testing/selftests
|
|
|
|
+ make perf ::
|
|
|
|
$ cd tools/perf/
|
|
$ make
|
|
|
|
|
|
2). Installation & Steps ::
|
|
|
|
$ make -C tools/testing/selftests install INSTALL_PATH=~/kselftest
|
|
$ cp tools/perf/perf /usr/bin/perf
|
|
$ sudo ./kselftest/run_kselftest.sh -c amd-pstate
|
|
|
|
3). Specified test case ::
|
|
|
|
$ cd ~/kselftest/amd-pstate
|
|
$ sudo ./run.sh -t basic
|
|
$ sudo ./run.sh -t tbench
|
|
$ sudo ./run.sh -t tbench -m acpi-cpufreq
|
|
$ sudo ./run.sh -t gitsource
|
|
$ sudo ./run.sh -t gitsource -m acpi-cpufreq
|
|
$ ./run.sh --help
|
|
./run.sh: illegal option -- -
|
|
Usage: ./run.sh [OPTION...]
|
|
[-h <help>]
|
|
[-o <output-file-for-dump>]
|
|
[-c <all: All testing,
|
|
basic: Basic testing,
|
|
tbench: Tbench testing,
|
|
gitsource: Gitsource testing.>]
|
|
[-t <tbench time limit>]
|
|
[-p <tbench process number>]
|
|
[-l <loop times for tbench>]
|
|
[-i <amd tracer interval>]
|
|
[-m <comparative test: acpi-cpufreq>]
|
|
|
|
|
|
4). Results
|
|
|
|
+ basic
|
|
|
|
When you finish test, you will get the following log info ::
|
|
|
|
$ dmesg | grep "amd_pstate_ut" | tee log.txt
|
|
[12977.570663] amd_pstate_ut: 1 amd_pstate_ut_acpi_cpc_valid success!
|
|
[12977.570673] amd_pstate_ut: 2 amd_pstate_ut_check_enabled success!
|
|
[12977.571207] amd_pstate_ut: 3 amd_pstate_ut_check_perf success!
|
|
[12977.571212] amd_pstate_ut: 4 amd_pstate_ut_check_freq success!
|
|
|
|
+ tbench
|
|
|
|
When you finish test, you will get selftest.tbench.csv and png images.
|
|
The selftest.tbench.csv file contains the raw data and the drop of the comparative test.
|
|
The png images shows the performance, energy and performan per watt of each test.
|
|
Open selftest.tbench.csv :
|
|
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ Governor | Round | Des-perf | Freq | Load | Performance | Energy | Performance Per Watt |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ Unit | | | GHz | | MB/s | J | MB/J |
|
|
+=================================================+==============+==========+=========+==========+=============+=========+======================+
|
|
+ amd-pstate-ondemand | 1 | | | | 2504.05 | 1563.67 | 158.5378 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | 2 | | | | 2243.64 | 1430.32 | 155.2941 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | 3 | | | | 2183.88 | 1401.32 | 154.2860 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | Average | | | | 2310.52 | 1465.1 | 156.1268 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 1 | 165.329 | 1.62257 | 99.798 | 2136.54 | 1395.26 | 151.5971 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 2 | 166 | 1.49761 | 99.9993 | 2100.56 | 1380.5 | 150.6377 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 3 | 166 | 1.47806 | 99.9993 | 2084.12 | 1375.76 | 149.9737 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | Average | 165.776 | 1.53275 | 99.9322 | 2107.07 | 1383.84 | 150.7399 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 1 | | | | 2529.9 | 1564.4 | 160.0997 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 2 | | | | 2249.76 | 1432.97 | 155.4297 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 3 | | | | 2181.46 | 1406.88 | 153.5060 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | Average | | | | 2320.37 | 1468.08 | 156.4741 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 1 | | | | 2137.64 | 1385.24 | 152.7723 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 2 | | | | 2107.05 | 1372.23 | 152.0138 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 3 | | | | 2085.86 | 1365.35 | 151.2433 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | Average | | | | 2110.18 | 1374.27 | 152.0136 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand VS acpi-cpufreq-schedutil | Comprison(%) | | | | -9.0584 | -6.3899 | -2.8506 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand VS amd-pstate-schedutil | Comprison(%) | | | | 8.8053 | -5.5463 | -3.4503 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand VS amd-pstate-ondemand | Comprison(%) | | | | -0.4245 | -0.2029 | -0.2219 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil VS amd-pstate-schedutil | Comprison(%) | | | | -0.1473 | 0.6963 | -0.8378 |
|
|
+-------------------------------------------------+--------------+----------+---------+----------+-------------+---------+----------------------+
|
|
|
|
+ gitsource
|
|
|
|
When you finish test, you will get selftest.gitsource.csv and png images.
|
|
The selftest.gitsource.csv file contains the raw data and the drop of the comparative test.
|
|
The png images shows the performance, energy and performan per watt of each test.
|
|
Open selftest.gitsource.csv :
|
|
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ Governor | Round | Des-perf | Freq | Load | Time | Energy | Performance Per Watt |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ Unit | | | GHz | | s | J | 1/J |
|
|
+=================================================+==============+==========+==========+==========+=============+=========+======================+
|
|
+ amd-pstate-ondemand | 1 | 50.119 | 2.10509 | 23.3076 | 475.69 | 865.78 | 0.001155027 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | 2 | 94.8006 | 1.98771 | 56.6533 | 467.1 | 839.67 | 0.001190944 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | 3 | 76.6091 | 2.53251 | 43.7791 | 467.69 | 855.85 | 0.001168429 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand | Average | 73.8429 | 2.20844 | 41.2467 | 470.16 | 853.767 | 0.001171279 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 1 | 165.919 | 1.62319 | 98.3868 | 464.17 | 866.8 | 0.001153668 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 2 | 165.97 | 1.31309 | 99.5712 | 480.15 | 880.4 | 0.001135847 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | 3 | 165.973 | 1.28448 | 99.9252 | 481.79 | 867.02 | 0.001153375 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-schedutil | Average | 165.954 | 1.40692 | 99.2944 | 475.37 | 871.407 | 0.001147569 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 1 | | | | 2379.62 | 742.96 | 0.001345967 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 2 | | | | 441.74 | 817.49 | 0.001223256 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | 3 | | | | 455.48 | 820.01 | 0.001219497 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand | Average | | | | 425.613 | 793.487 | 0.001260260 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 1 | | | | 459.69 | 838.54 | 0.001192548 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 2 | | | | 466.55 | 830.89 | 0.001203528 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | 3 | | | | 470.38 | 837.32 | 0.001194286 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil | Average | | | | 465.54 | 835.583 | 0.001196769 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand VS acpi-cpufreq-schedutil | Comprison(%) | | | | 9.3810 | 5.3051 | -5.0379 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ amd-pstate-ondemand VS amd-pstate-schedutil | Comprison(%) | 124.7392 | -36.2934 | 140.7329 | 1.1081 | 2.0661 | -2.0242 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-ondemand VS amd-pstate-ondemand | Comprison(%) | | | | 10.4665 | 7.5968 | -7.0605 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
+ acpi-cpufreq-schedutil VS amd-pstate-schedutil | Comprison(%) | | | | 2.1115 | 4.2873 | -4.1110 |
|
|
+-------------------------------------------------+--------------+----------+----------+----------+-------------+---------+----------------------+
|
|
|
|
Reference
|
|
===========
|
|
|
|
.. [1] AMD64 Architecture Programmer's Manual Volume 2: System Programming,
|
|
https://www.amd.com/system/files/TechDocs/24593.pdf
|
|
|
|
.. [2] Advanced Configuration and Power Interface Specification,
|
|
https://uefi.org/sites/default/files/resources/ACPI_Spec_6_4_Jan22.pdf
|
|
|
|
.. [3] Processor Programming Reference (PPR) for AMD Family 19h Model 51h, Revision A1 Processors
|
|
https://www.amd.com/system/files/TechDocs/56569-A1-PUB.zip
|
|
|
|
.. [4] Linux Kernel Selftests,
|
|
https://www.kernel.org/doc/html/latest/dev-tools/kselftest.html
|