License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 15:07:57 +01:00
/* SPDX-License-Identifier: GPL-2.0 */
2014-05-08 15:21:52 -04:00
/ *
* Copyright ( C ) 2 0 1 4 S t e v e n R o s t e d t , R e d H a t I n c
* /
2023-08-06 23:59:56 +09:00
# include < l i n u x / e x p o r t . h >
2022-10-18 13:49:21 +02:00
# include < l i n u x / c f i _ t y p e s . h >
2014-05-08 15:21:52 -04:00
# include < l i n u x / l i n k a g e . h >
2022-09-15 13:11:37 +02:00
# include < a s m / a s m - o f f s e t s . h >
2014-05-08 15:21:52 -04:00
# include < a s m / p t r a c e . h >
# include < a s m / f t r a c e . h >
2018-01-11 21:46:29 +00:00
# include < a s m / n o s p e c - b r a n c h . h >
2018-01-22 22:07:46 -06:00
# include < a s m / u n w i n d _ h i n t s . h >
2019-05-07 23:25:50 +02:00
# include < a s m / f r a m e . h >
2014-05-08 15:21:52 -04:00
.code64
2020-03-25 19:45:26 +01:00
.section .text , " ax"
2014-05-08 15:21:52 -04:00
2014-11-24 18:08:48 -05:00
# ifdef C O N F I G _ F R A M E _ P O I N T E R
/* Save parent and function stack frames (rip and rbp) */
# define M C O U N T _ F R A M E _ S I Z E ( 8 + 1 6 * 2 )
# else
/* No need to save a stack frame */
2018-01-22 22:07:46 -06:00
# define M C O U N T _ F R A M E _ S I Z E 0
2014-11-24 18:08:48 -05:00
# endif / * C O N F I G _ F R A M E _ P O I N T E R * /
2014-11-24 14:26:38 -05:00
/* Size of stack used to save mcount regs in save_mcount_regs */
2020-04-01 16:50:40 +02:00
# define M C O U N T _ R E G _ S I Z E ( F R A M E _ S I Z E + M C O U N T _ F R A M E _ S I Z E )
2014-11-24 14:26:38 -05:00
2014-11-24 11:43:39 -05:00
/ *
* gcc - p g o p t i o n a d d s a c a l l t o ' m c o u n t ' i n m o s t f u n c t i o n s .
* When - m f e n t r y i s u s e d , t h e c a l l i s t o ' f e n t r y ' a n d n o t ' m c o u n t '
* and i s d o n e b e f o r e t h e f u n c t i o n ' s s t a c k f r a m e i s s e t u p .
* They b o t h r e q u i r e a s e t o f r e g s t o b e s a v e d b e f o r e c a l l i n g
* any C c o d e a n d r e s t o r e d b e f o r e r e t u r n i n g b a c k t o t h e f u n c t i o n .
*
* On b o o t u p , a l l t h e s e c a l l s a r e c o n v e r t e d i n t o n o p s . W h e n t r a c i n g
* is e n a b l e d , t h e c a l l c a n j u m p t o e i t h e r f t r a c e _ c a l l e r o r
* ftrace_ r e g s _ c a l l e r . C a l l b a c k s ( t r a c i n g f u n c t i o n s ) t h a t r e q u i r e
* ftrace_ r e g s _ c a l l e r ( l i k e k p r o b e s ) n e e d t o h a v e p t _ r e g s p a s s e d t o
* it. F o r t h i s r e a s o n , t h e s i z e o f t h e p t _ r e g s s t r u c t u r e w i l l b e
* allocated o n t h e s t a c k a n d t h e r e q u i r e d m c o u n t r e g i s t e r s w i l l
* be s a v e d i n t h e l o c a t i o n s t h a t p t _ r e g s h a s t h e m i n .
* /
2014-11-24 21:38:40 -05:00
/ *
* @added: the amount of stack added before calling this
*
* After t h i s i s c a l l e d , t h e f o l l o w i n g r e g i s t e r s c o n t a i n :
*
* % rdi - h o l d s t h e a d d r e s s t h a t c a l l e d t h e t r a m p o l i n e
* % rsi - h o l d s t h e p a r e n t f u n c t i o n ( t r a c e d f u n c t i o n ' s r e t u r n a d d r e s s )
* % rdx - h o l d s t h e o r i g i n a l % r b p
* /
2014-11-24 13:06:05 -05:00
.macro save_mcount_regs added=0
2014-11-24 18:08:48 -05:00
2018-01-22 22:07:46 -06:00
# ifdef C O N F I G _ F R A M E _ P O I N T E R
/* Save the original rbp */
2014-11-24 18:08:48 -05:00
pushq % r b p
/ *
* Stack t r a c e s w i l l s t o p a t t h e f t r a c e t r a m p o l i n e i f t h e f r a m e p o i n t e r
* is n o t s e t u p p r o p e r l y . I f f e n t r y i s u s e d , w e n e e d t o s a v e a f r a m e
* pointer f o r t h e p a r e n t a s w e l l a s t h e f u n c t i o n t r a c e d , b e c a u s e t h e
* fentry i s c a l l e d b e f o r e t h e s t a c k f r a m e i s s e t u p , w h e r e a s m c o u n t
* is c a l l e d a f t e r w a r d .
* /
ftrace/x86: Remove mcount support
There's two methods of enabling function tracing in Linux on x86. One is
with just "gcc -pg" and the other is "gcc -pg -mfentry". The former will use
calls to a special function "mcount" after the frame is set up in all C
functions. The latter will add calls to a special function called "fentry"
as the very first instruction of all C functions.
At compile time, there is a check to see if gcc supports, -mfentry, and if
it does, it will use that, because it is more versatile and less error prone
for function tracing.
Starting with v4.19, the minimum gcc supported to build the Linux kernel,
was raised to version 4.6. That also happens to be the first gcc version to
support -mfentry. Since on x86, using gcc versions from 4.6 and beyond will
unconditionally enable the -mfentry, it will no longer use mcount as the
method for inserting calls into the C functions of the kernel. This means
that there is no point in continuing to maintain mcount in x86.
Remove support for using mcount. This makes the code less complex, and will
also allow it to be simplified in the future.
Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Acked-by: Jiri Kosina <jkosina@suse.cz>
Acked-by: Josh Poimboeuf <jpoimboe@redhat.com>
Signed-off-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
2019-05-09 15:32:05 -04:00
2014-11-24 18:08:48 -05:00
/* Save the parent pointer (skip orig rbp and our return address) */
pushq \ a d d e d + 8 * 2 ( % r s p )
pushq % r b p
movq % r s p , % r b p
/* Save the return address (now skip orig rbp, rbp and parent) */
pushq \ a d d e d + 8 * 3 ( % r s p )
pushq % r b p
movq % r s p , % r b p
# endif / * C O N F I G _ F R A M E _ P O I N T E R * /
/ *
* We a d d e n o u g h s t a c k t o s a v e a l l r e g s .
* /
2020-04-01 16:50:40 +02:00
subq $ ( F R A M E _ S I Z E ) , % r s p
2014-11-24 11:30:58 -05:00
movq % r a x , R A X ( % r s p )
movq % r c x , R C X ( % r s p )
movq % r d x , R D X ( % r s p )
movq % r s i , R S I ( % r s p )
movq % r d i , R D I ( % r s p )
movq % r8 , R 8 ( % r s p )
movq % r9 , R 9 ( % r s p )
2019-11-08 13:11:39 -05:00
movq $ 0 , O R I G _ R A X ( % r s p )
2014-11-24 18:08:48 -05:00
/ *
* Save t h e o r i g i n a l R B P . E v e n t h o u g h t h e m c o u n t A B I d o e s n o t
* require t h i s , i t h e l p s o u t c a l l e r s .
* /
2018-01-22 22:07:46 -06:00
# ifdef C O N F I G _ F R A M E _ P O I N T E R
2014-11-24 18:08:48 -05:00
movq M C O U N T _ R E G _ S I Z E - 8 ( % r s p ) , % r d x
2018-01-22 22:07:46 -06:00
# else
movq % r b p , % r d x
# endif
2014-11-24 18:08:48 -05:00
movq % r d x , R B P ( % r s p )
2014-11-24 21:38:40 -05:00
/* Copy the parent address into %rsi (second parameter) */
movq M C O U N T _ R E G _ S I Z E + 8 + \ a d d e d ( % r s p ) , % r s i
2014-11-24 11:30:58 -05:00
/* Move RIP to its proper location */
2014-11-24 14:26:38 -05:00
movq M C O U N T _ R E G _ S I Z E + \ a d d e d ( % r s p ) , % r d i
2014-11-24 13:21:09 -05:00
movq % r d i , R I P ( % r s p )
2014-11-24 21:38:40 -05:00
/ *
* Now % r d i ( t h e f i r s t p a r a m e t e r ) h a s t h e r e t u r n a d d r e s s o f
* where f t r a c e _ c a l l r e t u r n s . B u t t h e c a l l b a c k s e x p e c t t h e
2014-11-24 21:00:34 -05:00
* address o f t h e c a l l i t s e l f .
2014-11-24 21:38:40 -05:00
* /
subq $ M C O U N T _ I N S N _ S I Z E , % r d i
2014-11-24 11:30:58 -05:00
.endm
2019-11-08 13:11:39 -05:00
.macro restore_mcount_regs save=0
/* ftrace_regs_caller or frame pointers require this */
movq R B P ( % r s p ) , % r b p
2014-11-24 11:30:58 -05:00
movq R 9 ( % r s p ) , % r9
movq R 8 ( % r s p ) , % r8
movq R D I ( % r s p ) , % r d i
movq R S I ( % r s p ) , % r s i
movq R D X ( % r s p ) , % r d x
movq R C X ( % r s p ) , % r c x
movq R A X ( % r s p ) , % r a x
2014-11-24 18:08:48 -05:00
2019-11-08 13:11:39 -05:00
addq $ M C O U N T _ R E G _ S I Z E - \ s a v e , % r s p
2014-11-24 18:08:48 -05:00
2014-11-24 11:30:58 -05:00
.endm
2022-10-18 13:49:21 +02:00
SYM_ T Y P E D _ F U N C _ S T A R T ( f t r a c e _ s t u b )
2022-10-22 09:55:06 +02:00
CALL_ D E P T H _ A C C O U N T
2022-10-18 13:49:21 +02:00
RET
SYM_ F U N C _ E N D ( f t r a c e _ s t u b )
2023-01-31 10:36:30 +01:00
# ifdef C O N F I G _ F U N C T I O N _ G R A P H _ T R A C E R
2022-10-18 13:49:21 +02:00
SYM_ T Y P E D _ F U N C _ S T A R T ( f t r a c e _ s t u b _ g r a p h )
2022-10-22 09:55:06 +02:00
CALL_ D E P T H _ A C C O U N T
2022-10-18 13:49:21 +02:00
RET
SYM_ F U N C _ E N D ( f t r a c e _ s t u b _ g r a p h )
2023-01-31 10:36:30 +01:00
# endif
2022-10-18 13:49:21 +02:00
2014-11-24 14:54:27 -05:00
# ifdef C O N F I G _ D Y N A M I C _ F T R A C E
2019-10-21 17:18:23 +02:00
SYM_ F U N C _ S T A R T ( _ _ f e n t r y _ _ )
2022-09-15 13:11:37 +02:00
CALL_ D E P T H _ A C C O U N T
2021-12-04 14:43:40 +01:00
RET
2019-10-21 17:18:23 +02:00
SYM_ F U N C _ E N D ( _ _ f e n t r y _ _ )
EXPORT_ S Y M B O L ( _ _ f e n t r y _ _ )
2014-11-24 14:54:27 -05:00
2019-10-11 13:51:04 +02:00
SYM_ F U N C _ S T A R T ( f t r a c e _ c a l l e r )
2014-11-24 21:38:40 -05:00
/* save_mcount_regs fills in first two parameters */
save_ m c o u n t _ r e g s
2022-09-15 13:11:37 +02:00
CALL_ D E P T H _ A C C O U N T
2020-10-27 10:55:55 -04:00
/* Stack - skipping return address of ftrace_caller */
leaq M C O U N T _ R E G _ S I Z E + 8 ( % r s p ) , % r c x
movq % r c x , R S P ( % r s p )
2019-10-11 13:50:57 +02:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ c a l l e r _ o p _ p t r , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2014-11-24 21:38:40 -05:00
/* Load the ftrace_ops into the 3rd parameter */
movq f u n c t i o n _ t r a c e _ o p ( % r i p ) , % r d x
2020-10-27 10:55:55 -04:00
/* regs go into 4th parameter */
leaq ( % r s p ) , % r c x
/* Only ops with REGS flag set should have CS register set */
movq $ 0 , C S ( % r s p )
2014-05-08 15:21:52 -04:00
2022-09-15 13:11:37 +02:00
/* Account for the function call below */
CALL_ D E P T H _ A C C O U N T
2019-10-11 13:50:57 +02:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ c a l l , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2014-05-08 15:21:52 -04:00
call f t r a c e _ s t u b
2020-10-28 17:15:27 -04:00
/* Handlers can change the RIP */
movq R I P ( % r s p ) , % r a x
movq % r a x , M C O U N T _ R E G _ S I Z E ( % r s p )
2014-11-24 11:43:39 -05:00
restore_ m c o u n t _ r e g s
ftrace/x86: Add dynamic allocated trampoline for ftrace_ops
The current method of handling multiple function callbacks is to register
a list function callback that calls all the other callbacks based on
their hash tables and compare it to the function that the callback was
called on. But this is very inefficient.
For example, if you are tracing all functions in the kernel and then
add a kprobe to a function such that the kprobe uses ftrace, the
mcount trampoline will switch from calling the function trace callback
to calling the list callback that will iterate over all registered
ftrace_ops (in this case, the function tracer and the kprobes callback).
That means for every function being traced it checks the hash of the
ftrace_ops for function tracing and kprobes, even though the kprobes
is only set at a single function. The kprobes ftrace_ops is checked
for every function being traced!
Instead of calling the list function for functions that are only being
traced by a single callback, we can call a dynamically allocated
trampoline that calls the callback directly. The function graph tracer
already uses a direct call trampoline when it is being traced by itself
but it is not dynamically allocated. It's trampoline is static in the
kernel core. The infrastructure that called the function graph trampoline
can also be used to call a dynamically allocated one.
For now, only ftrace_ops that are not dynamically allocated can have
a trampoline. That is, users such as function tracer or stack tracer.
kprobes and perf allocate their ftrace_ops, and until there's a safe
way to free the trampoline, it can not be used. The dynamically allocated
ftrace_ops may, although, use the trampoline if the kernel is not
compiled with CONFIG_PREEMPT. But that will come later.
Tested-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Tested-by: Jiri Kosina <jkosina@suse.cz>
Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
2014-07-02 23:23:31 -04:00
/ *
2016-02-16 09:43:21 +01:00
* The c o d e u p t o t h i s l a b e l i s c o p i e d i n t o t r a m p o l i n e s s o
* think t w i c e b e f o r e a d d i n g a n y n e w c o d e o r c h a n g i n g t h e
* layout h e r e .
ftrace/x86: Add dynamic allocated trampoline for ftrace_ops
The current method of handling multiple function callbacks is to register
a list function callback that calls all the other callbacks based on
their hash tables and compare it to the function that the callback was
called on. But this is very inefficient.
For example, if you are tracing all functions in the kernel and then
add a kprobe to a function such that the kprobe uses ftrace, the
mcount trampoline will switch from calling the function trace callback
to calling the list callback that will iterate over all registered
ftrace_ops (in this case, the function tracer and the kprobes callback).
That means for every function being traced it checks the hash of the
ftrace_ops for function tracing and kprobes, even though the kprobes
is only set at a single function. The kprobes ftrace_ops is checked
for every function being traced!
Instead of calling the list function for functions that are only being
traced by a single callback, we can call a dynamically allocated
trampoline that calls the callback directly. The function graph tracer
already uses a direct call trampoline when it is being traced by itself
but it is not dynamically allocated. It's trampoline is static in the
kernel core. The infrastructure that called the function graph trampoline
can also be used to call a dynamically allocated one.
For now, only ftrace_ops that are not dynamically allocated can have
a trampoline. That is, users such as function tracer or stack tracer.
kprobes and perf allocate their ftrace_ops, and until there's a safe
way to free the trampoline, it can not be used. The dynamically allocated
ftrace_ops may, although, use the trampoline if the kernel is not
compiled with CONFIG_PREEMPT. But that will come later.
Tested-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Tested-by: Jiri Kosina <jkosina@suse.cz>
Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
2014-07-02 23:23:31 -04:00
* /
x86,ftrace: Fix ftrace_regs_caller() unwind
The ftrace_regs_caller() trampoline does something 'funny' when there
is a direct-caller present. In that case it stuffs the 'direct-caller'
address on the return stack and then exits the function. This then
results in 'returning' to the direct-caller with the exact registers
we came in with -- an indirect tail-call without using a register.
This however (rightfully) confuses objtool because the function shares
a few instruction in order to have a single exit path, but the stack
layout is different for them, depending through which path we came
there.
This is currently cludged by forcing the stack state to the non-direct
case, but this generates actively wrong (ORC) unwind information for
the direct case, leading to potential broken unwinds.
Fix this issue by fully separating the exit paths. This results in
having to poke a second RET into the trampoline copy, see
ftrace_regs_caller_ret.
This brings us to a second objtool problem, in order for it to
perceive the 'jmp ftrace_epilogue' as a function exit, it needs to be
recognised as a tail call. In order to make that happen,
ftrace_epilogue needs to be the start of an STT_FUNC, so re-arrange
code to make this so.
Finally, a third issue is that objtool requires functions to exit with
the same stack layout they started with, which is obviously violated
in the direct case, employ the new HINT_RET_OFFSET to tell objtool
this is an expected exception.
Together, this results in generating correct ORC unwind information
for the ftrace_regs_caller() function and it's trampoline copies.
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Miroslav Benes <mbenes@suse.cz>
Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
Acked-by: Josh Poimboeuf <jpoimboe@redhat.com>
Link: https://lkml.kernel.org/r/20200416115118.749606694@infradead.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2020-04-01 16:53:19 +02:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ c a l l e r _ e n d , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2022-09-15 13:11:35 +02:00
RET
x86,ftrace: Fix ftrace_regs_caller() unwind
The ftrace_regs_caller() trampoline does something 'funny' when there
is a direct-caller present. In that case it stuffs the 'direct-caller'
address on the return stack and then exits the function. This then
results in 'returning' to the direct-caller with the exact registers
we came in with -- an indirect tail-call without using a register.
This however (rightfully) confuses objtool because the function shares
a few instruction in order to have a single exit path, but the stack
layout is different for them, depending through which path we came
there.
This is currently cludged by forcing the stack state to the non-direct
case, but this generates actively wrong (ORC) unwind information for
the direct case, leading to potential broken unwinds.
Fix this issue by fully separating the exit paths. This results in
having to poke a second RET into the trampoline copy, see
ftrace_regs_caller_ret.
This brings us to a second objtool problem, in order for it to
perceive the 'jmp ftrace_epilogue' as a function exit, it needs to be
recognised as a tail call. In order to make that happen,
ftrace_epilogue needs to be the start of an STT_FUNC, so re-arrange
code to make this so.
Finally, a third issue is that objtool requires functions to exit with
the same stack layout they started with, which is obviously violated
in the direct case, employ the new HINT_RET_OFFSET to tell objtool
this is an expected exception.
Together, this results in generating correct ORC unwind information
for the ftrace_regs_caller() function and it's trampoline copies.
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Miroslav Benes <mbenes@suse.cz>
Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
Acked-by: Josh Poimboeuf <jpoimboe@redhat.com>
Link: https://lkml.kernel.org/r/20200416115118.749606694@infradead.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2020-04-01 16:53:19 +02:00
SYM_ F U N C _ E N D ( f t r a c e _ c a l l e r ) ;
2022-06-03 08:04:44 -07:00
STACK_ F R A M E _ N O N _ S T A N D A R D _ F P ( f t r a c e _ c a l l e r )
x86,ftrace: Fix ftrace_regs_caller() unwind
The ftrace_regs_caller() trampoline does something 'funny' when there
is a direct-caller present. In that case it stuffs the 'direct-caller'
address on the return stack and then exits the function. This then
results in 'returning' to the direct-caller with the exact registers
we came in with -- an indirect tail-call without using a register.
This however (rightfully) confuses objtool because the function shares
a few instruction in order to have a single exit path, but the stack
layout is different for them, depending through which path we came
there.
This is currently cludged by forcing the stack state to the non-direct
case, but this generates actively wrong (ORC) unwind information for
the direct case, leading to potential broken unwinds.
Fix this issue by fully separating the exit paths. This results in
having to poke a second RET into the trampoline copy, see
ftrace_regs_caller_ret.
This brings us to a second objtool problem, in order for it to
perceive the 'jmp ftrace_epilogue' as a function exit, it needs to be
recognised as a tail call. In order to make that happen,
ftrace_epilogue needs to be the start of an STT_FUNC, so re-arrange
code to make this so.
Finally, a third issue is that objtool requires functions to exit with
the same stack layout they started with, which is obviously violated
in the direct case, employ the new HINT_RET_OFFSET to tell objtool
this is an expected exception.
Together, this results in generating correct ORC unwind information
for the ftrace_regs_caller() function and it's trampoline copies.
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Miroslav Benes <mbenes@suse.cz>
Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
Acked-by: Josh Poimboeuf <jpoimboe@redhat.com>
Link: https://lkml.kernel.org/r/20200416115118.749606694@infradead.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2020-04-01 16:53:19 +02:00
2019-10-11 13:51:04 +02:00
SYM_ F U N C _ S T A R T ( f t r a c e _ r e g s _ c a l l e r )
2014-11-24 13:06:05 -05:00
/* Save the current flags before any operations that can change them */
2014-05-08 15:21:52 -04:00
pushfq
2014-11-24 13:06:05 -05:00
/* added 8 bytes to save flags */
2014-11-24 21:38:40 -05:00
save_ m c o u n t _ r e g s 8
/* save_mcount_regs fills in first two parameters */
2022-09-15 13:11:37 +02:00
CALL_ D E P T H _ A C C O U N T
2019-10-11 13:50:57 +02:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ r e g s _ c a l l e r _ o p _ p t r , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2014-11-24 21:38:40 -05:00
/* Load the ftrace_ops into the 3rd parameter */
movq f u n c t i o n _ t r a c e _ o p ( % r i p ) , % r d x
2014-05-08 15:21:52 -04:00
/* Save the rest of pt_regs */
movq % r15 , R 1 5 ( % r s p )
movq % r14 , R 1 4 ( % r s p )
movq % r13 , R 1 3 ( % r s p )
movq % r12 , R 1 2 ( % r s p )
movq % r11 , R 1 1 ( % r s p )
movq % r10 , R 1 0 ( % r s p )
movq % r b x , R B X ( % r s p )
/* Copy saved flags */
2014-11-24 14:26:38 -05:00
movq M C O U N T _ R E G _ S I Z E ( % r s p ) , % r c x
2014-05-08 15:21:52 -04:00
movq % r c x , E F L A G S ( % r s p )
/* Kernel segments */
movq $ _ _ K E R N E L _ D S , % r c x
movq % r c x , S S ( % r s p )
movq $ _ _ K E R N E L _ C S , % r c x
movq % r c x , C S ( % r s p )
2014-11-24 13:06:05 -05:00
/* Stack - skipping return address and flags */
2014-11-24 14:26:38 -05:00
leaq M C O U N T _ R E G _ S I Z E + 8 * 2 ( % r s p ) , % r c x
2014-05-08 15:21:52 -04:00
movq % r c x , R S P ( % r s p )
2019-05-07 23:25:50 +02:00
ENCODE_ F R A M E _ P O I N T E R
2014-05-08 15:21:52 -04:00
/* regs go into 4th parameter */
leaq ( % r s p ) , % r c x
2022-09-15 13:11:37 +02:00
/* Account for the function call below */
CALL_ D E P T H _ A C C O U N T
2019-10-11 13:50:57 +02:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ r e g s _ c a l l , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2014-05-08 15:21:52 -04:00
call f t r a c e _ s t u b
/* Copy flags back to SS, to restore them */
movq E F L A G S ( % r s p ) , % r a x
2014-11-24 14:26:38 -05:00
movq % r a x , M C O U N T _ R E G _ S I Z E ( % r s p )
2014-05-08 15:21:52 -04:00
/* Handlers can change the RIP */
movq R I P ( % r s p ) , % r a x
2014-11-24 14:26:38 -05:00
movq % r a x , M C O U N T _ R E G _ S I Z E + 8 ( % r s p )
2014-05-08 15:21:52 -04:00
/* restore the rest of pt_regs */
movq R 1 5 ( % r s p ) , % r15
movq R 1 4 ( % r s p ) , % r14
movq R 1 3 ( % r s p ) , % r13
movq R 1 2 ( % r s p ) , % r12
movq R 1 0 ( % r s p ) , % r10
movq R B X ( % r s p ) , % r b x
2019-11-08 13:11:39 -05:00
movq O R I G _ R A X ( % r s p ) , % r a x
movq % r a x , M C O U N T _ R E G _ S I Z E - 8 ( % r s p )
x86,ftrace: Fix ftrace_regs_caller() unwind
The ftrace_regs_caller() trampoline does something 'funny' when there
is a direct-caller present. In that case it stuffs the 'direct-caller'
address on the return stack and then exits the function. This then
results in 'returning' to the direct-caller with the exact registers
we came in with -- an indirect tail-call without using a register.
This however (rightfully) confuses objtool because the function shares
a few instruction in order to have a single exit path, but the stack
layout is different for them, depending through which path we came
there.
This is currently cludged by forcing the stack state to the non-direct
case, but this generates actively wrong (ORC) unwind information for
the direct case, leading to potential broken unwinds.
Fix this issue by fully separating the exit paths. This results in
having to poke a second RET into the trampoline copy, see
ftrace_regs_caller_ret.
This brings us to a second objtool problem, in order for it to
perceive the 'jmp ftrace_epilogue' as a function exit, it needs to be
recognised as a tail call. In order to make that happen,
ftrace_epilogue needs to be the start of an STT_FUNC, so re-arrange
code to make this so.
Finally, a third issue is that objtool requires functions to exit with
the same stack layout they started with, which is obviously violated
in the direct case, employ the new HINT_RET_OFFSET to tell objtool
this is an expected exception.
Together, this results in generating correct ORC unwind information
for the ftrace_regs_caller() function and it's trampoline copies.
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Miroslav Benes <mbenes@suse.cz>
Reviewed-by: Alexandre Chartre <alexandre.chartre@oracle.com>
Acked-by: Josh Poimboeuf <jpoimboe@redhat.com>
Link: https://lkml.kernel.org/r/20200416115118.749606694@infradead.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2020-04-01 16:53:19 +02:00
/ *
* If O R I G _ R A X i s a n y t h i n g b u t z e r o , m a k e t h i s a c a l l t o t h a t .
* See a r c h _ f t r a c e _ s e t _ d i r e c t _ c a l l e r ( ) .
* /
2020-04-01 16:51:11 +02:00
testq % r a x , % r a x
2020-04-22 12:25:42 -04:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ r e g s _ c a l l e r _ j m p , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2020-04-22 12:25:40 -04:00
jnz 1 f
2019-11-08 13:11:39 -05:00
2020-04-22 12:25:40 -04:00
restore_ m c o u n t _ r e g s
2014-05-08 15:21:52 -04:00
/* Restore flags */
popfq
ftrace/x86: Add dynamic allocated trampoline for ftrace_ops
The current method of handling multiple function callbacks is to register
a list function callback that calls all the other callbacks based on
their hash tables and compare it to the function that the callback was
called on. But this is very inefficient.
For example, if you are tracing all functions in the kernel and then
add a kprobe to a function such that the kprobe uses ftrace, the
mcount trampoline will switch from calling the function trace callback
to calling the list callback that will iterate over all registered
ftrace_ops (in this case, the function tracer and the kprobes callback).
That means for every function being traced it checks the hash of the
ftrace_ops for function tracing and kprobes, even though the kprobes
is only set at a single function. The kprobes ftrace_ops is checked
for every function being traced!
Instead of calling the list function for functions that are only being
traced by a single callback, we can call a dynamically allocated
trampoline that calls the callback directly. The function graph tracer
already uses a direct call trampoline when it is being traced by itself
but it is not dynamically allocated. It's trampoline is static in the
kernel core. The infrastructure that called the function graph trampoline
can also be used to call a dynamically allocated one.
For now, only ftrace_ops that are not dynamically allocated can have
a trampoline. That is, users such as function tracer or stack tracer.
kprobes and perf allocate their ftrace_ops, and until there's a safe
way to free the trampoline, it can not be used. The dynamically allocated
ftrace_ops may, although, use the trampoline if the kernel is not
compiled with CONFIG_PREEMPT. But that will come later.
Tested-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Tested-by: Jiri Kosina <jkosina@suse.cz>
Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
2014-07-02 23:23:31 -04:00
/ *
2022-09-15 13:11:35 +02:00
* The t r a m p o l i n e w i l l a d d t h e r e t u r n .
ftrace/x86: Add dynamic allocated trampoline for ftrace_ops
The current method of handling multiple function callbacks is to register
a list function callback that calls all the other callbacks based on
their hash tables and compare it to the function that the callback was
called on. But this is very inefficient.
For example, if you are tracing all functions in the kernel and then
add a kprobe to a function such that the kprobe uses ftrace, the
mcount trampoline will switch from calling the function trace callback
to calling the list callback that will iterate over all registered
ftrace_ops (in this case, the function tracer and the kprobes callback).
That means for every function being traced it checks the hash of the
ftrace_ops for function tracing and kprobes, even though the kprobes
is only set at a single function. The kprobes ftrace_ops is checked
for every function being traced!
Instead of calling the list function for functions that are only being
traced by a single callback, we can call a dynamically allocated
trampoline that calls the callback directly. The function graph tracer
already uses a direct call trampoline when it is being traced by itself
but it is not dynamically allocated. It's trampoline is static in the
kernel core. The infrastructure that called the function graph trampoline
can also be used to call a dynamically allocated one.
For now, only ftrace_ops that are not dynamically allocated can have
a trampoline. That is, users such as function tracer or stack tracer.
kprobes and perf allocate their ftrace_ops, and until there's a safe
way to free the trampoline, it can not be used. The dynamically allocated
ftrace_ops may, although, use the trampoline if the kernel is not
compiled with CONFIG_PREEMPT. But that will come later.
Tested-by: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
Tested-by: Jiri Kosina <jkosina@suse.cz>
Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
2014-07-02 23:23:31 -04:00
* /
2020-04-22 12:25:41 -04:00
SYM_ I N N E R _ L A B E L ( f t r a c e _ r e g s _ c a l l e r _ e n d , S Y M _ L _ G L O B A L )
2022-03-08 16:30:41 +01:00
ANNOTATE_ N O E N D B R
2022-09-15 13:11:35 +02:00
RET
2014-06-25 11:59:45 -04:00
2020-04-22 12:25:40 -04:00
/* Swap the flags with orig_rax */
1 : movq M C O U N T _ R E G _ S I Z E ( % r s p ) , % r d i
movq % r d i , M C O U N T _ R E G _ S I Z E - 8 ( % r s p )
movq % r a x , M C O U N T _ R E G _ S I Z E ( % r s p )
restore_ m c o u n t _ r e g s 8
/* Restore flags */
popfq
2021-01-21 15:29:24 -06:00
UNWIND_ H I N T _ F U N C
2022-09-15 13:11:36 +02:00
/ *
* The a b o v e l e f t a n e x t r a r e t u r n v a l u e o n t h e s t a c k ; effectively
* doing a t a i l - c a l l w i t h o u t u s i n g a r e g i s t e r . T h i s P U S H ;RET
* pattern u n b a l a n c e s t h e R S B , i n j e c t a p o i n t l e s s C A L L t o r e b a l a n c e .
* /
ANNOTATE_ I N T R A _ F U N C T I O N _ C A L L
CALL . L d o _ r e b a l a n c e
int3
.Ldo_rebalance :
add $ 8 , % r s p
2022-09-15 13:11:37 +02:00
ALTERNATIVE _ _ s t r i n g i f y ( R E T ) , \
_ _ stringify( A N N O T A T E _ U N R E T _ S A F E ; ret; int3), \
X8 6 _ F E A T U R E _ C A L L _ D E P T H
2020-04-22 12:25:40 -04:00
2019-10-11 13:51:04 +02:00
SYM_ F U N C _ E N D ( f t r a c e _ r e g s _ c a l l e r )
2022-06-03 08:04:44 -07:00
STACK_ F R A M E _ N O N _ S T A N D A R D _ F P ( f t r a c e _ r e g s _ c a l l e r )
2014-05-08 15:21:52 -04:00
ftrace: selftest: remove broken trace_direct_tramp
The ftrace selftest code has a trace_direct_tramp() function which it
uses as a direct call trampoline. This happens to work on x86, since the
direct call's return address is in the usual place, and can be returned
to via a RET, but in general the calling convention for direct calls is
different from regular function calls, and requires a trampoline written
in assembly.
On s390, regular function calls place the return address in %r14, and an
ftrace patch-site in an instrumented function places the trampoline's
return address (which is within the instrumented function) in %r0,
preserving the original %r14 value in-place. As a regular C function
will return to the address in %r14, using a C function as the trampoline
results in the trampoline returning to the caller of the instrumented
function, skipping the body of the instrumented function.
Note that the s390 issue is not detcted by the ftrace selftest code, as
the instrumented function is trivial, and returning back into the caller
happens to be equivalent.
On arm64, regular function calls place the return address in x30, and
an ftrace patch-site in an instrumented function saves this into r9
and places the trampoline's return address (within the instrumented
function) in x30. A regular C function will return to the address in
x30, but will not restore x9 into x30. Consequently, using a C function
as the trampoline results in returning to the trampoline's return
address having corrupted x30, such that when the instrumented function
returns, it will return back into itself.
To avoid future issues in this area, remove the trace_direct_tramp()
function, and require that each architecture with direct calls provides
a stub trampoline, named ftrace_stub_direct_tramp. This can be written
to handle the architecture's trampoline calling convention, and in
future could be used elsewhere (e.g. in the ftrace ops sample, to
measure the overhead of direct calls), so we may as well always build it
in.
Link: https://lkml.kernel.org/r/20230321140424.345218-8-revest@chromium.org
Signed-off-by: Mark Rutland <mark.rutland@arm.com>
Cc: Li Huafei <lihuafei1@huawei.com>
Cc: Xu Kuohai <xukuohai@huawei.com>
Signed-off-by: Florent Revest <revest@chromium.org>
Acked-by: Jiri Olsa <jolsa@kernel.org>
Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
2023-03-21 15:04:24 +01:00
SYM_ F U N C _ S T A R T ( f t r a c e _ s t u b _ d i r e c t _ t r a m p )
CALL_ D E P T H _ A C C O U N T
RET
SYM_ F U N C _ E N D ( f t r a c e _ s t u b _ d i r e c t _ t r a m p )
2014-05-08 15:21:52 -04:00
# else / * ! C O N F I G _ D Y N A M I C _ F T R A C E * /
2019-10-21 17:18:23 +02:00
SYM_ F U N C _ S T A R T ( _ _ f e n t r y _ _ )
2022-09-15 13:11:37 +02:00
CALL_ D E P T H _ A C C O U N T
2014-05-08 15:21:52 -04:00
cmpq $ f t r a c e _ s t u b , f t r a c e _ t r a c e _ f u n c t i o n
jnz t r a c e
2021-12-04 14:43:40 +01:00
RET
2014-05-08 15:21:52 -04:00
trace :
2014-11-24 21:38:40 -05:00
/* save_mcount_regs fills in first two parameters */
save_ m c o u n t _ r e g s
2014-05-08 15:21:52 -04:00
2015-11-17 09:43:24 +09:00
/ *
* When D Y N A M I C _ F T R A C E i s n o t d e f i n e d , A R C H _ S U P P O R T S _ F T R A C E _ O P S i s n o t
* set ( s e e i n c l u d e / a s m / f t r a c e . h a n d i n c l u d e / l i n u x / f t r a c e . h ) . O n l y t h e
* ip a n d p a r e n t i p a r e u s e d a n d t h e l i s t f u n c t i o n i s c a l l e d w h e n
* function t r a c i n g i s e n a b l e d .
* /
2018-01-11 21:46:29 +00:00
movq f t r a c e _ t r a c e _ f u n c t i o n , % r8
2020-04-22 17:16:40 +02:00
CALL_ N O S P E C r8
2014-11-24 11:43:39 -05:00
restore_ m c o u n t _ r e g s
2014-05-08 15:21:52 -04:00
2021-10-08 11:13:31 +02:00
jmp f t r a c e _ s t u b
2019-10-21 17:18:23 +02:00
SYM_ F U N C _ E N D ( _ _ f e n t r y _ _ )
EXPORT_ S Y M B O L ( _ _ f e n t r y _ _ )
2022-06-03 08:04:44 -07:00
STACK_ F R A M E _ N O N _ S T A N D A R D _ F P ( _ _ f e n t r y _ _ )
2014-05-08 15:21:52 -04:00
# endif / * C O N F I G _ D Y N A M I C _ F T R A C E * /
# ifdef C O N F I G _ F U N C T I O N _ G R A P H _ T R A C E R
2022-06-03 08:04:44 -07:00
SYM_ C O D E _ S T A R T ( r e t u r n _ t o _ h a n d l e r )
2023-03-01 07:13:12 -08:00
UNWIND_ H I N T _ U N D E F I N E D
2022-06-03 08:04:44 -07:00
ANNOTATE_ N O E N D B R
2023-04-08 05:42:20 -07:00
subq $ 2 4 , % r s p
2014-05-08 15:21:52 -04:00
/* Save the return values */
movq % r a x , ( % r s p )
movq % r d x , 8 ( % r s p )
2023-04-08 05:42:20 -07:00
movq % r b p , 1 6 ( % r s p )
movq % r s p , % r d i
2014-05-08 15:21:52 -04:00
call f t r a c e _ r e t u r n _ t o _ h a n d l e r
movq % r a x , % r d i
movq 8 ( % r s p ) , % r d x
movq ( % r s p ) , % r a x
2022-03-08 16:30:31 +01:00
2023-04-08 05:42:20 -07:00
addq $ 2 4 , % r s p
2022-03-08 16:30:31 +01:00
/ *
* Jump b a c k t o t h e o l d r e t u r n a d d r e s s . T h i s c a n n o t b e J M P _ N O S P E C r d i
* since I B T w o u l d d e m a n d t h a t c o n t a i n E N D B R , w h i c h s i m p l y i s n ' t s o f o r
* return a d d r e s s e s . U s e a r e t p o l i n e h e r e t o k e e p t h e R S B b a l a n c e d .
* /
ANNOTATE_ I N T R A _ F U N C T I O N _ C A L L
call . L d o _ r o p
int3
.Ldo_rop :
mov % r d i , ( % r s p )
2022-09-15 13:11:37 +02:00
ALTERNATIVE _ _ s t r i n g i f y ( R E T ) , \
_ _ stringify( A N N O T A T E _ U N R E T _ S A F E ; ret; int3), \
X8 6 _ F E A T U R E _ C A L L _ D E P T H
2022-06-03 08:04:44 -07:00
SYM_ C O D E _ E N D ( r e t u r n _ t o _ h a n d l e r )
2014-05-08 15:21:52 -04:00
# endif