License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 17:07:57 +03:00
// SPDX-License-Identifier: GPL-2.0
2012-08-06 08:41:20 +04:00
# include <fcntl.h>
# include <stdio.h>
# include <errno.h>
2019-07-04 17:21:24 +03:00
# include <stdlib.h>
2012-08-06 08:41:20 +04:00
# include <string.h>
# include <unistd.h>
# include <inttypes.h>
2019-09-10 18:17:19 +03:00
# include "dso.h"
2019-01-27 15:42:37 +03:00
# include "map.h"
2019-11-26 04:24:10 +03:00
# include "maps.h"
2012-08-06 08:41:20 +04:00
# include "symbol.h"
2019-08-30 16:26:37 +03:00
# include "symsrc.h"
2021-02-04 00:15:37 +03:00
# include "demangle-ocaml.h"
2015-11-30 12:02:20 +03:00
# include "demangle-java.h"
2016-07-09 10:20:00 +03:00
# include "demangle-rust.h"
2014-09-30 00:07:28 +04:00
# include "machine.h"
2014-04-17 19:27:01 +04:00
# include "vdso.h"
2012-08-06 08:41:20 +04:00
# include "debug.h"
2019-09-24 21:14:12 +03:00
# include "util/copyfile.h"
tools perf: Move from sane_ctype.h obtained from git to the Linux's original
We got the sane_ctype.h headers from git and kept using it so far, but
since that code originally came from the kernel sources to the git
sources, perhaps its better to just use the one in the kernel, so that
we can leverage tools/perf/check_headers.sh to be notified when our copy
gets out of sync, i.e. when fixes or goodies are added to the code we've
copied.
This will help with things like tools/lib/string.c where we want to have
more things in common with the kernel, such as strim(), skip_spaces(),
etc so as to go on removing the things that we have in tools/perf/util/
and instead using the code in the kernel, indirectly and removing things
like EXPORT_SYMBOL(), etc, getting notified when fixes and improvements
are made to the original code.
Hopefully this also should help with reducing the difference of code
hosted in tools/ to the one in the kernel proper.
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lkml.kernel.org/n/tip-7k9868l713wqtgo01xxygn12@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-06-25 23:27:31 +03:00
# include <linux/ctype.h>
2019-09-03 16:56:06 +03:00
# include <linux/kernel.h>
2019-07-04 17:32:27 +03:00
# include <linux/zalloc.h>
2017-04-17 22:10:49 +03:00
# include <symbol/kallsyms.h>
2019-09-03 16:56:06 +03:00
# include <internal/lib.h>
2012-08-06 08:41:20 +04:00
2015-02-19 03:33:37 +03:00
# ifndef EM_AARCH64
# define EM_AARCH64 183 /* ARM 64 bit */
# endif
2019-02-04 21:48:03 +03:00
# ifndef ELF32_ST_VISIBILITY
# define ELF32_ST_VISIBILITY(o) ((o) & 0x03)
# endif
/* For ELF64 the definitions are the same. */
# ifndef ELF64_ST_VISIBILITY
# define ELF64_ST_VISIBILITY(o) ELF32_ST_VISIBILITY (o)
# endif
/* How to extract information held in the st_other field. */
# ifndef GELF_ST_VISIBILITY
# define GELF_ST_VISIBILITY(val) ELF64_ST_VISIBILITY (val)
# endif
2016-07-12 17:04:13 +03:00
typedef Elf64_Nhdr GElf_Nhdr ;
2015-02-19 03:33:37 +03:00
2019-08-30 16:01:50 +03:00
# ifndef DMGL_PARAMS
# define DMGL_NO_OPTS 0 /* For readability... */
# define DMGL_PARAMS (1 << 0) /* Include function args */
# define DMGL_ANSI (1 << 1) /* Include const, volatile, etc */
# endif
2020-08-21 19:52:36 +03:00
# ifdef HAVE_LIBBFD_SUPPORT
# define PACKAGE 'perf'
# include <bfd.h>
# else
2014-11-24 23:10:52 +03:00
# ifdef HAVE_CPLUS_DEMANGLE_SUPPORT
extern char * cplus_demangle ( const char * , int ) ;
static inline char * bfd_demangle ( void __maybe_unused * v , const char * c , int i )
{
return cplus_demangle ( c , i ) ;
}
# else
# ifdef NO_DEMANGLE
static inline char * bfd_demangle ( void __maybe_unused * v ,
const char __maybe_unused * c ,
int __maybe_unused i )
{
return NULL ;
}
2020-08-21 19:52:36 +03:00
# endif
2014-11-24 23:10:52 +03:00
# endif
# endif
2013-09-30 14:07:11 +04:00
# ifndef HAVE_ELF_GETPHDRNUM_SUPPORT
2015-09-17 17:30:20 +03:00
static int elf_getphdrnum ( Elf * elf , size_t * dst )
2013-09-13 17:49:30 +04:00
{
GElf_Ehdr gehdr ;
GElf_Ehdr * ehdr ;
ehdr = gelf_getehdr ( elf , & gehdr ) ;
if ( ! ehdr )
return - 1 ;
* dst = ehdr - > e_phnum ;
return 0 ;
}
# endif
2016-07-05 01:35:47 +03:00
# ifndef HAVE_ELF_GETSHDRSTRNDX_SUPPORT
static int elf_getshdrstrndx ( Elf * elf __maybe_unused , size_t * dst __maybe_unused )
{
pr_err ( " %s: update your libelf to > 0.140, this one lacks elf_getshdrstrndx(). \n " , __func__ ) ;
return - 1 ;
}
# endif
2012-08-06 08:41:20 +04:00
# ifndef NT_GNU_BUILD_ID
# define NT_GNU_BUILD_ID 3
# endif
/**
* elf_symtab__for_each_symbol - iterate thru all the symbols
*
* @ syms : struct elf_symtab instance to iterate
* @ idx : uint32_t idx
* @ sym : GElf_Sym iterator
*/
# define elf_symtab__for_each_symbol(syms, nr_syms, idx, sym) \
for ( idx = 0 , gelf_getsym ( syms , idx , & sym ) ; \
idx < nr_syms ; \
idx + + , gelf_getsym ( syms , idx , & sym ) )
static inline uint8_t elf_sym__type ( const GElf_Sym * sym )
{
return GELF_ST_TYPE ( sym - > st_info ) ;
}
2019-01-28 16:35:26 +03:00
static inline uint8_t elf_sym__visibility ( const GElf_Sym * sym )
{
return GELF_ST_VISIBILITY ( sym - > st_other ) ;
}
2015-02-10 03:29:37 +03:00
# ifndef STT_GNU_IFUNC
# define STT_GNU_IFUNC 10
# endif
2012-08-06 08:41:20 +04:00
static inline int elf_sym__is_function ( const GElf_Sym * sym )
{
2014-07-14 14:02:33 +04:00
return ( elf_sym__type ( sym ) = = STT_FUNC | |
elf_sym__type ( sym ) = = STT_GNU_IFUNC ) & &
2012-08-06 08:41:20 +04:00
sym - > st_name ! = 0 & &
sym - > st_shndx ! = SHN_UNDEF ;
}
static inline bool elf_sym__is_object ( const GElf_Sym * sym )
{
return elf_sym__type ( sym ) = = STT_OBJECT & &
sym - > st_name ! = 0 & &
sym - > st_shndx ! = SHN_UNDEF ;
}
static inline int elf_sym__is_label ( const GElf_Sym * sym )
{
return elf_sym__type ( sym ) = = STT_NOTYPE & &
sym - > st_name ! = 0 & &
sym - > st_shndx ! = SHN_UNDEF & &
2019-01-28 16:35:26 +03:00
sym - > st_shndx ! = SHN_ABS & &
elf_sym__visibility ( sym ) ! = STV_HIDDEN & &
elf_sym__visibility ( sym ) ! = STV_INTERNAL ;
2012-08-06 08:41:20 +04:00
}
2018-04-26 22:52:34 +03:00
static bool elf_sym__filter ( GElf_Sym * sym )
2012-08-06 08:41:20 +04:00
{
2018-04-26 22:52:34 +03:00
return elf_sym__is_function ( sym ) | | elf_sym__is_object ( sym ) ;
2012-08-06 08:41:20 +04:00
}
static inline const char * elf_sym__name ( const GElf_Sym * sym ,
const Elf_Data * symstrs )
{
return symstrs - > d_buf + sym - > st_name ;
}
static inline const char * elf_sec__name ( const GElf_Shdr * shdr ,
const Elf_Data * secstrs )
{
return secstrs - > d_buf + shdr - > sh_name ;
}
static inline int elf_sec__is_text ( const GElf_Shdr * shdr ,
const Elf_Data * secstrs )
{
return strstr ( elf_sec__name ( shdr , secstrs ) , " text " ) ! = NULL ;
}
static inline bool elf_sec__is_data ( const GElf_Shdr * shdr ,
const Elf_Data * secstrs )
{
return strstr ( elf_sec__name ( shdr , secstrs ) , " data " ) ! = NULL ;
}
2018-04-26 22:52:34 +03:00
static bool elf_sec__filter ( GElf_Shdr * shdr , Elf_Data * secstrs )
2012-08-06 08:41:20 +04:00
{
2018-04-26 22:52:34 +03:00
return elf_sec__is_text ( shdr , secstrs ) | |
elf_sec__is_data ( shdr , secstrs ) ;
2012-08-06 08:41:20 +04:00
}
static size_t elf_addr_to_index ( Elf * elf , GElf_Addr addr )
{
Elf_Scn * sec = NULL ;
GElf_Shdr shdr ;
size_t cnt = 1 ;
while ( ( sec = elf_nextscn ( elf , sec ) ) ! = NULL ) {
gelf_getshdr ( sec , & shdr ) ;
if ( ( addr > = shdr . sh_addr ) & &
( addr < ( shdr . sh_addr + shdr . sh_size ) ) )
return cnt ;
+ + cnt ;
}
return - 1 ;
}
2014-01-16 13:39:49 +04:00
Elf_Scn * elf_section_by_name ( Elf * elf , GElf_Ehdr * ep ,
GElf_Shdr * shp , const char * name , size_t * idx )
2012-08-06 08:41:20 +04:00
{
Elf_Scn * sec = NULL ;
size_t cnt = 1 ;
2012-08-11 02:22:55 +04:00
/* Elf is corrupted/truncated, avoid calling elf_strptr. */
if ( ! elf_rawdata ( elf_getscn ( elf , ep - > e_shstrndx ) , NULL ) )
return NULL ;
2012-08-06 08:41:20 +04:00
while ( ( sec = elf_nextscn ( elf , sec ) ) ! = NULL ) {
char * str ;
gelf_getshdr ( sec , shp ) ;
str = elf_strptr ( elf , ep - > e_shstrndx , shp - > sh_name ) ;
2014-03-02 17:32:07 +04:00
if ( str & & ! strcmp ( name , str ) ) {
2012-08-06 08:41:20 +04:00
if ( idx )
* idx = cnt ;
2014-03-02 17:32:07 +04:00
return sec ;
2012-08-06 08:41:20 +04:00
}
+ + cnt ;
}
2014-03-02 17:32:07 +04:00
return NULL ;
2012-08-06 08:41:20 +04:00
}
2022-12-15 22:28:11 +03:00
bool filename__has_section ( const char * filename , const char * sec )
{
int fd ;
Elf * elf ;
GElf_Ehdr ehdr ;
GElf_Shdr shdr ;
bool found = false ;
fd = open ( filename , O_RDONLY ) ;
if ( fd < 0 )
return false ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL )
goto out ;
if ( gelf_getehdr ( elf , & ehdr ) = = NULL )
goto elf_out ;
found = ! ! elf_section_by_name ( elf , & ehdr , & shdr , sec , NULL ) ;
elf_out :
elf_end ( elf ) ;
out :
close ( fd ) ;
return found ;
}
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
static int elf_read_program_header ( Elf * elf , u64 vaddr , GElf_Phdr * phdr )
{
size_t i , phdrnum ;
u64 sz ;
if ( elf_getphdrnum ( elf , & phdrnum ) )
return - 1 ;
for ( i = 0 ; i < phdrnum ; i + + ) {
if ( gelf_getphdr ( elf , i , phdr ) = = NULL )
return - 1 ;
if ( phdr - > p_type ! = PT_LOAD )
continue ;
sz = max ( phdr - > p_memsz , phdr - > p_filesz ) ;
if ( ! sz )
continue ;
if ( vaddr > = phdr - > p_vaddr & & ( vaddr < phdr - > p_vaddr + sz ) )
return 0 ;
}
/* Not found any valid program header */
return - 1 ;
}
2016-08-30 14:41:02 +03:00
static bool want_demangle ( bool is_kernel_sym )
{
return is_kernel_sym ? symbol_conf . demangle_kernel : symbol_conf . demangle ;
}
static char * demangle_sym ( struct dso * dso , int kmodule , const char * elf_name )
{
2017-02-17 11:17:38 +03:00
int demangle_flags = verbose > 0 ? ( DMGL_PARAMS | DMGL_ANSI ) : DMGL_NO_OPTS ;
2016-08-30 14:41:02 +03:00
char * demangled = NULL ;
/*
* We need to figure out if the object was created from C + + sources
* DWARF DW_compile_unit has this , but we don ' t always have access
* to it . . .
*/
if ( ! want_demangle ( dso - > kernel | | kmodule ) )
return demangled ;
demangled = bfd_demangle ( NULL , elf_name , demangle_flags ) ;
2021-02-04 00:15:37 +03:00
if ( demangled = = NULL ) {
demangled = ocaml_demangle_sym ( elf_name ) ;
if ( demangled = = NULL ) {
demangled = java_demangle_sym ( elf_name , JAVA_DEMANGLE_NORET ) ;
}
}
2016-08-30 14:41:02 +03:00
else if ( rust_is_mangled ( demangled ) )
/*
* Input to Rust demangling is the BFD - demangled
* name which it Rust - demangles in place .
*/
rust_demangle_sym ( demangled ) ;
return demangled ;
}
2023-01-20 15:34:55 +03:00
struct rel_info {
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
u32 nr_entries ;
u32 * sorted ;
2023-01-20 15:34:55 +03:00
bool is_rela ;
Elf_Data * reldata ;
GElf_Rela rela ;
GElf_Rel rel ;
} ;
static u32 get_rel_symidx ( struct rel_info * ri , u32 idx )
{
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
idx = ri - > sorted ? ri - > sorted [ idx ] : idx ;
2023-01-20 15:34:55 +03:00
if ( ri - > is_rela ) {
gelf_getrela ( ri - > reldata , idx , & ri - > rela ) ;
return GELF_R_SYM ( ri - > rela . r_info ) ;
}
gelf_getrel ( ri - > reldata , idx , & ri - > rel ) ;
return GELF_R_SYM ( ri - > rel . r_info ) ;
}
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
static u64 get_rel_offset ( struct rel_info * ri , u32 x )
{
if ( ri - > is_rela ) {
GElf_Rela rela ;
gelf_getrela ( ri - > reldata , x , & rela ) ;
return rela . r_offset ;
} else {
GElf_Rel rel ;
gelf_getrel ( ri - > reldata , x , & rel ) ;
return rel . r_offset ;
}
}
static int rel_cmp ( const void * a , const void * b , void * r )
{
struct rel_info * ri = r ;
u64 a_offset = get_rel_offset ( ri , * ( const u32 * ) a ) ;
u64 b_offset = get_rel_offset ( ri , * ( const u32 * ) b ) ;
return a_offset < b_offset ? - 1 : ( a_offset > b_offset ? 1 : 0 ) ;
}
static int sort_rel ( struct rel_info * ri )
{
size_t sz = sizeof ( ri - > sorted [ 0 ] ) ;
u32 i ;
ri - > sorted = calloc ( ri - > nr_entries , sz ) ;
if ( ! ri - > sorted )
return - 1 ;
for ( i = 0 ; i < ri - > nr_entries ; i + + )
ri - > sorted [ i ] = i ;
qsort_r ( ri - > sorted , ri - > nr_entries , sz , rel_cmp , ri ) ;
return 0 ;
}
perf symbols: Add support for IFUNC symbols for x86_64
For x86_64, the GNU linker is putting IFUNC information in the relocation
addend, so use it to try to find a symbol for plt entries that refer to
IFUNCs.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.016 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
21860.073683659: tr strt 0 [unknown] => 561e212c42be main+0x0
21860.073683659: tr end call 561e212c42c6 main+0x8 => 561e212c4110 fn4@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42cb main+0xd
21860.073683661: tr end call 561e212c42cb main+0xd => 561e212c40f0 fn1@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42d0 main+0x12
21860.073683661: tr end call 561e212c42d0 main+0x12 => 561e212c40d0 offset_0x10d0@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42d5 main+0x17
21860.073698451: tr end call 561e212c42d5 main+0x17 => 561e212c4120 fn2@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42da main+0x1c
21860.073698451: tr end call 561e212c42da main+0x1c => 561e212c4100 fn3@plt+0x0
21860.073698452: tr strt 0 [unknown] => 561e212c42df main+0x21
21860.073698452: tr end return 561e212c42e5 main+0x27 => 7fb51cc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
21860.073683659: tr strt 0 [unknown] => 561e212c42be main+0x0
21860.073683659: tr end call 561e212c42c6 main+0x8 => 561e212c4110 fn4@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42cb main+0xd
21860.073683661: tr end call 561e212c42cb main+0xd => 561e212c40f0 fn1@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42d0 main+0x12
21860.073683661: tr end call 561e212c42d0 main+0x12 => 561e212c40d0 thing_ifunc@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42d5 main+0x17
21860.073698451: tr end call 561e212c42d5 main+0x17 => 561e212c4120 fn2@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42da main+0x1c
21860.073698451: tr end call 561e212c42da main+0x1c => 561e212c4100 fn3@plt+0x0
21860.073698452: tr strt 0 [unknown] => 561e212c42df main+0x21
21860.073698452: tr end return 561e212c42e5 main+0x27 => 7fb51cc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-6-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:21 +03:00
/*
* For x86_64 , the GNU linker is putting IFUNC information in the relocation
* addend .
*/
static bool addend_may_be_ifunc ( GElf_Ehdr * ehdr , struct rel_info * ri )
{
return ehdr - > e_machine = = EM_X86_64 & & ri - > is_rela & &
GELF_R_TYPE ( ri - > rela . r_info ) = = R_X86_64_IRELATIVE ;
}
static bool get_ifunc_name ( Elf * elf , struct dso * dso , GElf_Ehdr * ehdr ,
struct rel_info * ri , char * buf , size_t buf_sz )
{
u64 addr = ri - > rela . r_addend ;
struct symbol * sym ;
GElf_Phdr phdr ;
if ( ! addend_may_be_ifunc ( ehdr , ri ) )
return false ;
if ( elf_read_program_header ( elf , addr , & phdr ) )
return false ;
addr - = phdr . p_vaddr - phdr . p_offset ;
sym = dso__find_symbol_nocache ( dso , addr ) ;
/* Expecting the address to be an IFUNC or IFUNC alias */
if ( ! sym | | sym - > start ! = addr | | ( sym - > type ! = STT_GNU_IFUNC & & ! sym - > ifunc_alias ) )
return false ;
snprintf ( buf , buf_sz , " %s@plt " , sym - > name ) ;
return true ;
}
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
static void exit_rel ( struct rel_info * ri )
{
free ( ri - > sorted ) ;
}
2023-01-20 15:34:49 +03:00
static bool get_plt_sizes ( struct dso * dso , GElf_Ehdr * ehdr , GElf_Shdr * shdr_plt ,
2023-01-20 15:34:48 +03:00
u64 * plt_header_size , u64 * plt_entry_size )
{
switch ( ehdr - > e_machine ) {
case EM_ARM :
* plt_header_size = 20 ;
* plt_entry_size = 12 ;
2023-01-20 15:34:49 +03:00
return true ;
2023-01-20 15:34:48 +03:00
case EM_AARCH64 :
* plt_header_size = 32 ;
* plt_entry_size = 16 ;
2023-01-20 15:34:49 +03:00
return true ;
2023-01-20 15:34:48 +03:00
case EM_SPARC :
* plt_header_size = 48 ;
* plt_entry_size = 12 ;
2023-01-20 15:34:49 +03:00
return true ;
2023-01-20 15:34:48 +03:00
case EM_SPARCV9 :
* plt_header_size = 128 ;
* plt_entry_size = 32 ;
2023-01-20 15:34:49 +03:00
return true ;
perf symbols: Correct plt entry sizes for x86
In 32-bit executables the .plt entry size can be set to 4 when it is really
16. In fact the only sizes used for x86 (32 or 64 bit) are 8 or 16, so
check for those and, if not, use the alignment to choose which it is.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -m32 -Wall -Wextra -shared -o libtstpltlib32.so tstpltlib.c
$ gcc -m32 -Wall -Wextra -o tstplt32 tstplt.c -L . -ltstpltlib32 -Wl,-rpath=$(pwd)
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt32' ./tstplt32
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.011 MB perf.data ]
$ readelf -SW tstplt32 | grep 'plt\|Name'
[Nr] Name Type Addr Off Size ES Flg Lk Inf Al
[10] .rel.plt REL 0000041c 00041c 000028 08 AI 5 22 4
[12] .plt PROGBITS 00001030 001030 000060 04 AX 0 0 16 <- ES is 0x04, should be 0x10
[13] .plt.got PROGBITS 00001090 001090 000008 08 AX 0 0 8
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 [unknown]
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 [unknown]
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 fn4@plt+0x0
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 fn1@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 fn2@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 fn3@plt+0x0
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-2-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:17 +03:00
case EM_386 :
case EM_X86_64 :
* plt_entry_size = shdr_plt - > sh_entsize ;
/* Size is 8 or 16, if not, assume alignment indicates size */
if ( * plt_entry_size ! = 8 & & * plt_entry_size ! = 16 )
* plt_entry_size = shdr_plt - > sh_addralign = = 8 ? 8 : 16 ;
* plt_header_size = * plt_entry_size ;
break ;
2023-01-20 15:34:48 +03:00
default : /* FIXME: s390/alpha/mips/parisc/poperpc/sh/xtensa need to be checked */
* plt_header_size = shdr_plt - > sh_entsize ;
* plt_entry_size = shdr_plt - > sh_entsize ;
perf symbols: Correct plt entry sizes for x86
In 32-bit executables the .plt entry size can be set to 4 when it is really
16. In fact the only sizes used for x86 (32 or 64 bit) are 8 or 16, so
check for those and, if not, use the alignment to choose which it is.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -m32 -Wall -Wextra -shared -o libtstpltlib32.so tstpltlib.c
$ gcc -m32 -Wall -Wextra -o tstplt32 tstplt.c -L . -ltstpltlib32 -Wl,-rpath=$(pwd)
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt32' ./tstplt32
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.011 MB perf.data ]
$ readelf -SW tstplt32 | grep 'plt\|Name'
[Nr] Name Type Addr Off Size ES Flg Lk Inf Al
[10] .rel.plt REL 0000041c 00041c 000028 08 AI 5 22 4
[12] .plt PROGBITS 00001030 001030 000060 04 AX 0 0 16 <- ES is 0x04, should be 0x10
[13] .plt.got PROGBITS 00001090 001090 000008 08 AX 0 0 8
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 [unknown]
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 [unknown]
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 fn4@plt+0x0
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 fn1@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 fn2@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 fn3@plt+0x0
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-2-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:17 +03:00
break ;
2023-01-20 15:34:48 +03:00
}
perf symbols: Correct plt entry sizes for x86
In 32-bit executables the .plt entry size can be set to 4 when it is really
16. In fact the only sizes used for x86 (32 or 64 bit) are 8 or 16, so
check for those and, if not, use the alignment to choose which it is.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -m32 -Wall -Wextra -shared -o libtstpltlib32.so tstpltlib.c
$ gcc -m32 -Wall -Wextra -o tstplt32 tstplt.c -L . -ltstpltlib32 -Wl,-rpath=$(pwd)
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt32' ./tstplt32
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.011 MB perf.data ]
$ readelf -SW tstplt32 | grep 'plt\|Name'
[Nr] Name Type Addr Off Size ES Flg Lk Inf Al
[10] .rel.plt REL 0000041c 00041c 000028 08 AI 5 22 4
[12] .plt PROGBITS 00001030 001030 000060 04 AX 0 0 16 <- ES is 0x04, should be 0x10
[13] .plt.got PROGBITS 00001090 001090 000008 08 AX 0 0 8
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 [unknown]
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 [unknown]
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 [unknown]
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
17894.383903029: tr strt 0 [unknown] => 565b81cd main+0x0
17894.383903029: tr end call 565b81d4 main+0x7 => 565b80d0 __x86.get_pc_thunk.bx+0x0
17894.383903031: tr strt 0 [unknown] => 565b81d9 main+0xc
17894.383903031: tr end call 565b81df main+0x12 => 565b8070 fn4@plt+0x0
17894.383903032: tr strt 0 [unknown] => 565b81e4 main+0x17
17894.383903032: tr end call 565b81e4 main+0x17 => 565b8050 fn1@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81e9 main+0x1c
17894.383903033: tr end call 565b81e9 main+0x1c => 565b8080 fn2@plt+0x0
17894.383903033: tr strt 0 [unknown] => 565b81ee main+0x21
17894.383903033: tr end call 565b81ee main+0x21 => 565b8060 fn3@plt+0x0
17894.383903237: tr strt 0 [unknown] => 565b81f3 main+0x26
17894.383903237: tr end return 565b81fc main+0x2f => f7c21519 [unknown]
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-2-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:17 +03:00
if ( * plt_entry_size )
return true ;
pr_debug ( " Missing PLT entry size for %s \n " , dso - > long_name ) ;
return false ;
2023-01-20 15:34:48 +03:00
}
perf symbols: Add support for x86 .plt.sec
The section .plt.sec was originally added for MPX and was first called
.plt.bnd. While MPX has been deprecated, .plt.sec is now also used for
IBT. On x86_64, IBT may be enabled by default, but can be switched off
using gcc option -fcf-protection=none, or switched on by -z ibt or -z
ibtplt. On 32-bit, option -z ibt or -z ibtplt will enable IBT.
With .plt.sec, calls are made into .plt.sec instead of .plt, so it makes
more sense to put the symbols there instead of .plt. A notable
difference is that .plt.sec does not have a header entry.
For x86, when synthesizing symbols for plt, use offset and entry size of
.plt.sec instead of .plt when there is a .plt.sec section.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -z ibt -o tstplt tstplt.c -L . -ltstpltlib -Wl,-rpath=$(pwd)
$ readelf -SW tstplt | grep 'plt\|Name'
[Nr] Name Type Address Off Size ES Flg Lk Inf Al
[11] .rela.plt RELA 0000000000000698 000698 000060 18 AI 6 24 8
[13] .plt PROGBITS 0000000000001020 001020 000050 10 AX 0 0 16
[14] .plt.got PROGBITS 0000000000001070 001070 000010 10 AX 0 0 16
[15] .plt.sec PROGBITS 0000000000001080 001080 000040 10 AX 0 0 16
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt' ./tstplt
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.015 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 [unknown]
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 [unknown]
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 fn4@plt+0x0
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 fn1@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 fn2@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 fn3@plt+0x0
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-3-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:18 +03:00
static bool machine_is_x86 ( GElf_Half e_machine )
{
return e_machine = = EM_386 | | e_machine = = EM_X86_64 ;
}
2012-08-06 08:41:20 +04:00
/*
* We need to check if we have a . dynsym , so that we can handle the
* . plt , synthesizing its symbols , that aren ' t on the symtabs ( be it
* . dynsym or . symtab ) .
* And always look at the original dso , not at debuginfo packages , that
* have the PLT data stripped out ( shdr_rel_plt . sh_type = = SHT_NOBITS ) .
*/
2018-04-26 22:52:34 +03:00
int dso__synthesize_plt_symbols ( struct dso * dso , struct symsrc * ss )
2012-08-06 08:41:20 +04:00
{
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
uint32_t idx ;
2012-08-06 08:41:20 +04:00
GElf_Sym sym ;
perf symbols: Fix plt entry calculation for ARM and AARCH64
On x86, the plt header size is as same as the plt entry size, and can be
identified from shdr's sh_entsize of the plt.
But we can't assume that the sh_entsize of the plt shdr is always the
plt entry size in all architecture, and the plt header size may be not
as same as the plt entry size in some architecure.
On ARM, the plt header size is 20 bytes and the plt entry size is 12
bytes (don't consider the FOUR_WORD_PLT case) that refer to the binutils
implementation. The plt section is as follows:
Disassembly of section .plt:
000004a0 <__cxa_finalize@plt-0x14>:
4a0: e52de004 push {lr} ; (str lr, [sp, #-4]!)
4a4: e59fe004 ldr lr, [pc, #4] ; 4b0 <_init+0x1c>
4a8: e08fe00e add lr, pc, lr
4ac: e5bef008 ldr pc, [lr, #8]!
4b0: 00008424 .word 0x00008424
000004b4 <__cxa_finalize@plt>:
4b4: e28fc600 add ip, pc, #0, 12
4b8: e28cca08 add ip, ip, #8, 20 ; 0x8000
4bc: e5bcf424 ldr pc, [ip, #1060]! ; 0x424
000004c0 <printf@plt>:
4c0: e28fc600 add ip, pc, #0, 12
4c4: e28cca08 add ip, ip, #8, 20 ; 0x8000
4c8: e5bcf41c ldr pc, [ip, #1052]! ; 0x41c
On AARCH64, the plt header size is 32 bytes and the plt entry size is 16
bytes. The plt section is as follows:
Disassembly of section .plt:
0000000000000560 <__cxa_finalize@plt-0x20>:
560: a9bf7bf0 stp x16, x30, [sp,#-16]!
564: 90000090 adrp x16, 10000 <__FRAME_END__+0xf8a8>
568: f944be11 ldr x17, [x16,#2424]
56c: 9125e210 add x16, x16, #0x978
570: d61f0220 br x17
574: d503201f nop
578: d503201f nop
57c: d503201f nop
0000000000000580 <__cxa_finalize@plt>:
580: 90000090 adrp x16, 10000 <__FRAME_END__+0xf8a8>
584: f944c211 ldr x17, [x16,#2432]
588: 91260210 add x16, x16, #0x980
58c: d61f0220 br x17
0000000000000590 <__gmon_start__@plt>:
590: 90000090 adrp x16, 10000 <__FRAME_END__+0xf8a8>
594: f944c611 ldr x17, [x16,#2440]
598: 91262210 add x16, x16, #0x988
59c: d61f0220 br x17
NOTES:
In addition to ARM and AARCH64, other architectures, such as
s390/alpha/mips/parisc/poperpc/sh/sparc/xtensa also need to consider
this issue.
Signed-off-by: Li Bin <huawei.libin@huawei.com>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexis Berlemont <alexis.berlemont@gmail.com>
Cc: David Tolnay <dtolnay@gmail.com>
Cc: Hanjun Guo <guohanjun@huawei.com>
Cc: Hemant Kumar <hemant@linux.vnet.ibm.com>
Cc: Masami Hiramatsu <mhiramat@kernel.org>
Cc: Milian Wolff <milian.wolff@kdab.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Wang Nan <wangnan0@huawei.com>
Cc: zhangmengting@huawei.com
Link: http://lkml.kernel.org/r/1496622849-21877-1-git-send-email-huawei.libin@huawei.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2017-06-05 03:34:09 +03:00
u64 plt_offset , plt_header_size , plt_entry_size ;
perf symbols: Add support for x86 .plt.sec
The section .plt.sec was originally added for MPX and was first called
.plt.bnd. While MPX has been deprecated, .plt.sec is now also used for
IBT. On x86_64, IBT may be enabled by default, but can be switched off
using gcc option -fcf-protection=none, or switched on by -z ibt or -z
ibtplt. On 32-bit, option -z ibt or -z ibtplt will enable IBT.
With .plt.sec, calls are made into .plt.sec instead of .plt, so it makes
more sense to put the symbols there instead of .plt. A notable
difference is that .plt.sec does not have a header entry.
For x86, when synthesizing symbols for plt, use offset and entry size of
.plt.sec instead of .plt when there is a .plt.sec section.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -z ibt -o tstplt tstplt.c -L . -ltstpltlib -Wl,-rpath=$(pwd)
$ readelf -SW tstplt | grep 'plt\|Name'
[Nr] Name Type Address Off Size ES Flg Lk Inf Al
[11] .rela.plt RELA 0000000000000698 000698 000060 18 AI 6 24 8
[13] .plt PROGBITS 0000000000001020 001020 000050 10 AX 0 0 16
[14] .plt.got PROGBITS 0000000000001070 001070 000010 10 AX 0 0 16
[15] .plt.sec PROGBITS 0000000000001080 001080 000040 10 AX 0 0 16
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt' ./tstplt
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.015 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 [unknown]
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 [unknown]
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 fn4@plt+0x0
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 fn1@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 fn2@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 fn3@plt+0x0
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-3-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:18 +03:00
GElf_Shdr shdr_plt , plt_sec_shdr ;
struct symbol * f , * plt_sym ;
2012-08-06 08:41:20 +04:00
GElf_Shdr shdr_rel_plt , shdr_dynsym ;
2023-01-20 15:34:55 +03:00
Elf_Data * syms , * symstrs ;
2012-08-06 08:41:20 +04:00
Elf_Scn * scn_plt_rel , * scn_symstrs , * scn_dynsym ;
size_t dynsym_idx ;
GElf_Ehdr ehdr ;
char sympltname [ 1024 ] ;
Elf * elf ;
2023-01-20 15:34:55 +03:00
int nr = 0 , err = - 1 ;
struct rel_info ri = { . is_rela = false } ;
2012-08-06 08:41:20 +04:00
2012-08-11 02:22:59 +04:00
elf = ss - > elf ;
ehdr = ss - > ehdr ;
2012-08-06 08:41:20 +04:00
perf symbols: Add symbol for .plt header
perf expands the _init symbol over .plt because there are no PLT symbols
at that point, but then dso__synthesize_plt_symbols() creates them.
Fix by truncating the previous symbol and inserting a symbol for .plt
header.
Example:
Before:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 191028
Problems creating module maps, continuing anyway...
Testing /usr/bin/uname
Overlapping symbols:
2000-25f0 g _init
2040-2050 g free@plt
test child finished with -1
---- end ----
Symbols: FAILED!
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp1.txt
After:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 194291
Testing /usr/bin/uname
test child finished with 0
---- end ----
Symbols: Ok
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp2.txt
$ diff /tmp/cmp1.txt /tmp/cmp2.txt
4,5c4
< test child forked, pid 191031
< Problems creating module maps, continuing anyway...
---
> test child forked, pid 194296
9c8,9
< 2000-25f0 g _init
---
> 2000-2030 g _init
> 2030-2040 g .plt
100,103c100
< Overlapping symbols:
< 2000-25f0 g _init
< 2040-2050 g free@plt
< test child finished with -1
---
> test child finished with 0
105c102
< Symbols: FAILED!
---
> Symbols: Ok
$
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lore.kernel.org/r/20230120123456.12449-8-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-20 15:34:53 +03:00
if ( ! elf_section_by_name ( elf , & ehdr , & shdr_plt , " .plt " , NULL ) )
return 0 ;
/*
* A symbol from a previous section ( e . g . . init ) can have been expanded
* by symbols__fixup_end ( ) to overlap . plt . Truncate it before adding
* a symbol for . plt header .
*/
f = dso__find_symbol_nocache ( dso , shdr_plt . sh_offset ) ;
if ( f & & f - > start < shdr_plt . sh_offset & & f - > end > shdr_plt . sh_offset )
f - > end = shdr_plt . sh_offset ;
if ( ! get_plt_sizes ( dso , & ehdr , & shdr_plt , & plt_header_size , & plt_entry_size ) )
return 0 ;
/* Add a symbol for .plt header */
perf symbols: Add support for x86 .plt.sec
The section .plt.sec was originally added for MPX and was first called
.plt.bnd. While MPX has been deprecated, .plt.sec is now also used for
IBT. On x86_64, IBT may be enabled by default, but can be switched off
using gcc option -fcf-protection=none, or switched on by -z ibt or -z
ibtplt. On 32-bit, option -z ibt or -z ibtplt will enable IBT.
With .plt.sec, calls are made into .plt.sec instead of .plt, so it makes
more sense to put the symbols there instead of .plt. A notable
difference is that .plt.sec does not have a header entry.
For x86, when synthesizing symbols for plt, use offset and entry size of
.plt.sec instead of .plt when there is a .plt.sec section.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -z ibt -o tstplt tstplt.c -L . -ltstpltlib -Wl,-rpath=$(pwd)
$ readelf -SW tstplt | grep 'plt\|Name'
[Nr] Name Type Address Off Size ES Flg Lk Inf Al
[11] .rela.plt RELA 0000000000000698 000698 000060 18 AI 6 24 8
[13] .plt PROGBITS 0000000000001020 001020 000050 10 AX 0 0 16
[14] .plt.got PROGBITS 0000000000001070 001070 000010 10 AX 0 0 16
[15] .plt.sec PROGBITS 0000000000001080 001080 000040 10 AX 0 0 16
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt' ./tstplt
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.015 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 [unknown]
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 [unknown]
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 fn4@plt+0x0
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 fn1@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 fn2@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 fn3@plt+0x0
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-3-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:18 +03:00
plt_sym = symbol__new ( shdr_plt . sh_offset , plt_header_size , STB_GLOBAL , STT_FUNC , " .plt " ) ;
if ( ! plt_sym )
perf symbols: Add symbol for .plt header
perf expands the _init symbol over .plt because there are no PLT symbols
at that point, but then dso__synthesize_plt_symbols() creates them.
Fix by truncating the previous symbol and inserting a symbol for .plt
header.
Example:
Before:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 191028
Problems creating module maps, continuing anyway...
Testing /usr/bin/uname
Overlapping symbols:
2000-25f0 g _init
2040-2050 g free@plt
test child finished with -1
---- end ----
Symbols: FAILED!
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp1.txt
After:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 194291
Testing /usr/bin/uname
test child finished with 0
---- end ----
Symbols: Ok
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp2.txt
$ diff /tmp/cmp1.txt /tmp/cmp2.txt
4,5c4
< test child forked, pid 191031
< Problems creating module maps, continuing anyway...
---
> test child forked, pid 194296
9c8,9
< 2000-25f0 g _init
---
> 2000-2030 g _init
> 2030-2040 g .plt
100,103c100
< Overlapping symbols:
< 2000-25f0 g _init
< 2040-2050 g free@plt
< test child finished with -1
---
> test child finished with 0
105c102
< Symbols: FAILED!
---
> Symbols: Ok
$
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lore.kernel.org/r/20230120123456.12449-8-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-20 15:34:53 +03:00
goto out_elf_end ;
perf symbols: Add support for x86 .plt.sec
The section .plt.sec was originally added for MPX and was first called
.plt.bnd. While MPX has been deprecated, .plt.sec is now also used for
IBT. On x86_64, IBT may be enabled by default, but can be switched off
using gcc option -fcf-protection=none, or switched on by -z ibt or -z
ibtplt. On 32-bit, option -z ibt or -z ibtplt will enable IBT.
With .plt.sec, calls are made into .plt.sec instead of .plt, so it makes
more sense to put the symbols there instead of .plt. A notable
difference is that .plt.sec does not have a header entry.
For x86, when synthesizing symbols for plt, use offset and entry size of
.plt.sec instead of .plt when there is a .plt.sec section.
Example on Ubuntu 22.04 gcc 11.3:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstplt.c
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -z ibt -o tstplt tstplt.c -L . -ltstpltlib -Wl,-rpath=$(pwd)
$ readelf -SW tstplt | grep 'plt\|Name'
[Nr] Name Type Address Off Size ES Flg Lk Inf Al
[11] .rela.plt RELA 0000000000000698 000698 000060 18 AI 6 24 8
[13] .plt PROGBITS 0000000000001020 001020 000050 10 AX 0 0 16
[14] .plt.got PROGBITS 0000000000001070 001070 000010 10 AX 0 0 16
[15] .plt.sec PROGBITS 0000000000001080 001080 000040 10 AX 0 0 16
$ perf record -e intel_pt//u --filter 'filter main @ ./tstplt' ./tstplt
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.015 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 [unknown]
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 [unknown]
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 [unknown]
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
38970.522546686: tr strt 0 [unknown] => 55fc222a81a9 main+0x0
38970.522546686: tr end call 55fc222a81b1 main+0x8 => 55fc222a80a0 fn4@plt+0x0
38970.522546687: tr strt 0 [unknown] => 55fc222a81b6 main+0xd
38970.522546687: tr end call 55fc222a81b6 main+0xd => 55fc222a8080 fn1@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81bb main+0x12
38970.522546688: tr end call 55fc222a81bb main+0x12 => 55fc222a80b0 fn2@plt+0x0
38970.522546688: tr strt 0 [unknown] => 55fc222a81c0 main+0x17
38970.522546688: tr end call 55fc222a81c0 main+0x17 => 55fc222a8090 fn3@plt+0x0
38970.522546689: tr strt 0 [unknown] => 55fc222a81c5 main+0x1c
38970.522546894: tr end return 55fc222a81cb main+0x22 => 7f3a4dc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-3-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:18 +03:00
symbols__insert ( & dso - > symbols , plt_sym ) ;
/* Only x86 has .plt.sec */
if ( machine_is_x86 ( ehdr . e_machine ) & &
elf_section_by_name ( elf , & ehdr , & plt_sec_shdr , " .plt.sec " , NULL ) ) {
if ( ! get_plt_sizes ( dso , & ehdr , & plt_sec_shdr , & plt_header_size , & plt_entry_size ) )
return 0 ;
/* Extend .plt symbol to entire .plt */
plt_sym - > end = plt_sym - > start + shdr_plt . sh_size ;
/* Use .plt.sec offset */
plt_offset = plt_sec_shdr . sh_offset ;
} else {
plt_offset = shdr_plt . sh_offset + plt_header_size ;
}
perf symbols: Add symbol for .plt header
perf expands the _init symbol over .plt because there are no PLT symbols
at that point, but then dso__synthesize_plt_symbols() creates them.
Fix by truncating the previous symbol and inserting a symbol for .plt
header.
Example:
Before:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 191028
Problems creating module maps, continuing anyway...
Testing /usr/bin/uname
Overlapping symbols:
2000-25f0 g _init
2040-2050 g free@plt
test child finished with -1
---- end ----
Symbols: FAILED!
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp1.txt
After:
$ perf test --dso `which uname` -v Symbols
74: Symbols :
--- start ---
test child forked, pid 194291
Testing /usr/bin/uname
test child finished with 0
---- end ----
Symbols: Ok
$ perf test --dso `which uname` -vv Symbols 2>/tmp/cmp2.txt
$ diff /tmp/cmp1.txt /tmp/cmp2.txt
4,5c4
< test child forked, pid 191031
< Problems creating module maps, continuing anyway...
---
> test child forked, pid 194296
9c8,9
< 2000-25f0 g _init
---
> 2000-2030 g _init
> 2030-2040 g .plt
100,103c100
< Overlapping symbols:
< 2000-25f0 g _init
< 2040-2050 g free@plt
< test child finished with -1
---
> test child finished with 0
105c102
< Symbols: FAILED!
---
> Symbols: Ok
$
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lore.kernel.org/r/20230120123456.12449-8-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-20 15:34:53 +03:00
2012-08-11 02:22:59 +04:00
scn_dynsym = ss - > dynsym ;
shdr_dynsym = ss - > dynshdr ;
dynsym_idx = ss - > dynsym_idx ;
2012-08-06 08:41:20 +04:00
if ( scn_dynsym = = NULL )
2023-01-20 15:34:51 +03:00
return 0 ;
2012-08-06 08:41:20 +04:00
scn_plt_rel = elf_section_by_name ( elf , & ehdr , & shdr_rel_plt ,
" .rela.plt " , NULL ) ;
if ( scn_plt_rel = = NULL ) {
scn_plt_rel = elf_section_by_name ( elf , & ehdr , & shdr_rel_plt ,
" .rel.plt " , NULL ) ;
if ( scn_plt_rel = = NULL )
2023-01-20 15:34:51 +03:00
return 0 ;
2012-08-06 08:41:20 +04:00
}
2023-01-20 15:34:56 +03:00
if ( shdr_rel_plt . sh_type ! = SHT_RELA & &
shdr_rel_plt . sh_type ! = SHT_REL )
return 0 ;
2012-08-06 08:41:20 +04:00
if ( shdr_rel_plt . sh_link ! = dynsym_idx )
goto out_elf_end ;
/*
* Fetch the relocation section to find the idxes to the GOT
* and the symbols in the . dynsym they refer to .
*/
2023-01-20 15:34:55 +03:00
ri . reldata = elf_getdata ( scn_plt_rel , NULL ) ;
if ( ! ri . reldata )
2012-08-06 08:41:20 +04:00
goto out_elf_end ;
syms = elf_getdata ( scn_dynsym , NULL ) ;
if ( syms = = NULL )
goto out_elf_end ;
scn_symstrs = elf_getscn ( elf , shdr_dynsym . sh_link ) ;
if ( scn_symstrs = = NULL )
goto out_elf_end ;
symstrs = elf_getdata ( scn_symstrs , NULL ) ;
if ( symstrs = = NULL )
goto out_elf_end ;
2012-08-11 02:22:51 +04:00
if ( symstrs - > d_size = = 0 )
goto out_elf_end ;
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
ri . nr_entries = shdr_rel_plt . sh_size / shdr_rel_plt . sh_entsize ;
2012-08-06 08:41:20 +04:00
2023-01-20 15:34:55 +03:00
ri . is_rela = shdr_rel_plt . sh_type = = SHT_RELA ;
2012-08-06 08:41:20 +04:00
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
/*
* x86 doesn ' t insert IFUNC relocations in . plt order , so sort to get
* back in order .
*/
if ( machine_is_x86 ( ehdr . e_machine ) & & sort_rel ( & ri ) )
goto out_elf_end ;
for ( idx = 0 ; idx < ri . nr_entries ; idx + + ) {
2023-01-20 15:34:56 +03:00
const char * elf_name = NULL ;
char * demangled = NULL ;
gelf_getsym ( syms , get_rel_symidx ( & ri , idx ) , & sym ) ;
elf_name = elf_sym__name ( & sym , symstrs ) ;
demangled = demangle_sym ( dso , 0 , elf_name ) ;
if ( demangled )
elf_name = demangled ;
if ( * elf_name )
snprintf ( sympltname , sizeof ( sympltname ) , " %s@plt " , elf_name ) ;
perf symbols: Add support for IFUNC symbols for x86_64
For x86_64, the GNU linker is putting IFUNC information in the relocation
addend, so use it to try to find a symbol for plt entries that refer to
IFUNCs.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.016 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
21860.073683659: tr strt 0 [unknown] => 561e212c42be main+0x0
21860.073683659: tr end call 561e212c42c6 main+0x8 => 561e212c4110 fn4@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42cb main+0xd
21860.073683661: tr end call 561e212c42cb main+0xd => 561e212c40f0 fn1@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42d0 main+0x12
21860.073683661: tr end call 561e212c42d0 main+0x12 => 561e212c40d0 offset_0x10d0@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42d5 main+0x17
21860.073698451: tr end call 561e212c42d5 main+0x17 => 561e212c4120 fn2@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42da main+0x1c
21860.073698451: tr end call 561e212c42da main+0x1c => 561e212c4100 fn3@plt+0x0
21860.073698452: tr strt 0 [unknown] => 561e212c42df main+0x21
21860.073698452: tr end return 561e212c42e5 main+0x27 => 7fb51cc29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
21860.073683659: tr strt 0 [unknown] => 561e212c42be main+0x0
21860.073683659: tr end call 561e212c42c6 main+0x8 => 561e212c4110 fn4@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42cb main+0xd
21860.073683661: tr end call 561e212c42cb main+0xd => 561e212c40f0 fn1@plt+0x0
21860.073683661: tr strt 0 [unknown] => 561e212c42d0 main+0x12
21860.073683661: tr end call 561e212c42d0 main+0x12 => 561e212c40d0 thing_ifunc@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42d5 main+0x17
21860.073698451: tr end call 561e212c42d5 main+0x17 => 561e212c4120 fn2@plt+0x0
21860.073698451: tr strt 0 [unknown] => 561e212c42da main+0x1c
21860.073698451: tr end call 561e212c42da main+0x1c => 561e212c4100 fn3@plt+0x0
21860.073698452: tr strt 0 [unknown] => 561e212c42df main+0x21
21860.073698452: tr end return 561e212c42e5 main+0x27 => 7fb51cc29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-6-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:21 +03:00
else if ( ! get_ifunc_name ( elf , dso , & ehdr , & ri , sympltname , sizeof ( sympltname ) ) )
2023-01-20 15:34:56 +03:00
snprintf ( sympltname , sizeof ( sympltname ) ,
" offset_%# " PRIx64 " @plt " , plt_offset ) ;
free ( demangled ) ;
f = symbol__new ( plt_offset , plt_entry_size , STB_GLOBAL , STT_FUNC , sympltname ) ;
if ( ! f )
goto out_elf_end ;
plt_offset + = plt_entry_size ;
symbols__insert ( & dso - > symbols , f ) ;
+ + nr ;
2012-08-06 08:41:20 +04:00
}
err = 0 ;
out_elf_end :
perf symbols: Sort plt relocations for x86
For x86, with the addition of IFUNCs, relocation information becomes
disordered with respect to plt. Correct that by sorting the relocations by
offset.
Example:
Before:
$ cat tstpltlib.c
void fn1(void) {}
void fn2(void) {}
void fn3(void) {}
void fn4(void) {}
$ cat tstpltifunc.c
#include <stdio.h>
void thing1(void)
{
printf("thing1\n");
}
void thing2(void)
{
printf("thing2\n");
}
typedef void (*thing_fn_t)(void);
thing_fn_t thing_ifunc(void)
{
int x;
if (x & 1)
return thing2;
return thing1;
}
void thing(void) __attribute__ ((ifunc ("thing_ifunc")));
void fn1(void);
void fn2(void);
void fn3(void);
void fn4(void);
int main()
{
fn4();
fn1();
thing();
fn2();
fn3();
return 0;
}
$ gcc --version
gcc (Ubuntu 11.3.0-1ubuntu1~22.04) 11.3.0
Copyright (C) 2021 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ gcc -Wall -Wextra -shared -o libtstpltlib.so tstpltlib.c
$ gcc -Wall -Wextra -Wno-uninitialized -o tstpltifunc tstpltifunc.c -L . -ltstpltlib -Wl,-rpath="$(pwd)"
$ readelf -rW tstpltifunc | grep -A99 plt
Relocation section '.rela.plt' at offset 0x738 contains 8 entries:
Offset Info Type Symbol's Value Symbol's Name + Addend
0000000000003f98 0000000300000007 R_X86_64_JUMP_SLOT 0000000000000000 puts@GLIBC_2.2.5 + 0
0000000000003fa8 0000000400000007 R_X86_64_JUMP_SLOT 0000000000000000 __stack_chk_fail@GLIBC_2.4 + 0
0000000000003fb0 0000000500000007 R_X86_64_JUMP_SLOT 0000000000000000 fn1 + 0
0000000000003fb8 0000000600000007 R_X86_64_JUMP_SLOT 0000000000000000 fn3 + 0
0000000000003fc0 0000000800000007 R_X86_64_JUMP_SLOT 0000000000000000 fn4 + 0
0000000000003fc8 0000000900000007 R_X86_64_JUMP_SLOT 0000000000000000 fn2 + 0
0000000000003fd0 0000000b00000007 R_X86_64_JUMP_SLOT 0000000000000000 getrandom@GLIBC_2.25 + 0
0000000000003fa0 0000000000000025 R_X86_64_IRELATIVE 125d
$ perf record -e intel_pt//u --filter 'filter main @ ./tstpltifunc' ./tstpltifunc
thing2
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.029 MB perf.data ]
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn2@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn3@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 __stack_chk_fail@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 getrandom@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn4@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
After:
$ perf script --itrace=be --ns -F+flags,-event,+addr,-period,-comm,-tid,-cpu,-dso
20417.302513948: tr strt 0 [unknown] => 5629a74892be main+0x0
20417.302513948: tr end call 5629a74892c6 main+0x8 => 5629a7489110 fn4@plt+0x0
20417.302513949: tr strt 0 [unknown] => 5629a74892cb main+0xd
20417.302513949: tr end call 5629a74892cb main+0xd => 5629a74890f0 fn1@plt+0x0
20417.302513950: tr strt 0 [unknown] => 5629a74892d0 main+0x12
20417.302513950: tr end call 5629a74892d0 main+0x12 => 5629a74890d0 offset_0x10d0@plt+0x0
20417.302528114: tr strt 0 [unknown] => 5629a74892d5 main+0x17
20417.302528114: tr end call 5629a74892d5 main+0x17 => 5629a7489120 fn2@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892da main+0x1c
20417.302528115: tr end call 5629a74892da main+0x1c => 5629a7489100 fn3@plt+0x0
20417.302528115: tr strt 0 [unknown] => 5629a74892df main+0x21
20417.302528115: tr end return 5629a74892e5 main+0x27 => 7ff14da29d90 __libc_start_call_main+0x80
Reviewed-by: Namhyung Kim <namhyung@kernel.org>
Signed-off-by: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Link: https://lore.kernel.org/r/20230131131625.6964-4-adrian.hunter@intel.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-01-31 16:16:19 +03:00
exit_rel ( & ri ) ;
2012-08-06 08:41:20 +04:00
if ( err = = 0 )
return nr ;
pr_debug ( " %s: problems reading %s PLT info. \n " ,
__func__ , dso - > long_name ) ;
return 0 ;
}
2017-08-07 00:24:34 +03:00
char * dso__demangle_sym ( struct dso * dso , int kmodule , const char * elf_name )
2017-03-25 23:34:26 +03:00
{
return demangle_sym ( dso , kmodule , elf_name ) ;
}
2012-08-06 08:41:20 +04:00
/*
* Align offset to 4 bytes as needed for note name and descriptor data .
*/
# define NOTE_ALIGN(n) (((n) + 3) & -4U)
static int elf_read_build_id ( Elf * elf , void * bf , size_t size )
{
int err = - 1 ;
GElf_Ehdr ehdr ;
GElf_Shdr shdr ;
Elf_Data * data ;
Elf_Scn * sec ;
Elf_Kind ek ;
void * ptr ;
if ( size < BUILD_ID_SIZE )
goto out ;
ek = elf_kind ( elf ) ;
if ( ek ! = ELF_K_ELF )
goto out ;
if ( gelf_getehdr ( elf , & ehdr ) = = NULL ) {
pr_err ( " %s: cannot get elf header. \n " , __func__ ) ;
goto out ;
}
/*
* Check following sections for notes :
* ' . note . gnu . build - id '
* ' . notes '
* ' . note ' ( VDSO specific )
*/
do {
sec = elf_section_by_name ( elf , & ehdr , & shdr ,
" .note.gnu.build-id " , NULL ) ;
if ( sec )
break ;
sec = elf_section_by_name ( elf , & ehdr , & shdr ,
" .notes " , NULL ) ;
if ( sec )
break ;
sec = elf_section_by_name ( elf , & ehdr , & shdr ,
" .note " , NULL ) ;
if ( sec )
break ;
return err ;
} while ( 0 ) ;
data = elf_getdata ( sec , NULL ) ;
if ( data = = NULL )
goto out ;
ptr = data - > d_buf ;
while ( ptr < ( data - > d_buf + data - > d_size ) ) {
GElf_Nhdr * nhdr = ptr ;
size_t namesz = NOTE_ALIGN ( nhdr - > n_namesz ) ,
descsz = NOTE_ALIGN ( nhdr - > n_descsz ) ;
const char * name ;
ptr + = sizeof ( * nhdr ) ;
name = ptr ;
ptr + = namesz ;
if ( nhdr - > n_type = = NT_GNU_BUILD_ID & &
nhdr - > n_namesz = = sizeof ( " GNU " ) ) {
if ( memcmp ( name , " GNU " , sizeof ( " GNU " ) ) = = 0 ) {
size_t sz = min ( size , descsz ) ;
memcpy ( bf , ptr , sz ) ;
memset ( bf + sz , 0 , size - sz ) ;
err = descsz ;
break ;
}
}
ptr + = descsz ;
}
out :
return err ;
}
2020-08-21 19:52:36 +03:00
# ifdef HAVE_LIBBFD_BUILDID_SUPPORT
2020-12-14 13:54:48 +03:00
static int read_build_id ( const char * filename , struct build_id * bid )
2020-08-21 19:52:36 +03:00
{
2020-10-13 22:24:34 +03:00
size_t size = sizeof ( bid - > data ) ;
2020-08-21 19:52:36 +03:00
int err = - 1 ;
bfd * abfd ;
abfd = bfd_openr ( filename , NULL ) ;
if ( ! abfd )
return - 1 ;
if ( ! bfd_check_format ( abfd , bfd_object ) ) {
pr_debug2 ( " %s: cannot read %s bfd file. \n " , __func__ , filename ) ;
goto out_close ;
}
if ( ! abfd - > build_id | | abfd - > build_id - > size > size )
goto out_close ;
2020-10-13 22:24:34 +03:00
memcpy ( bid - > data , abfd - > build_id - > data , abfd - > build_id - > size ) ;
memset ( bid - > data + abfd - > build_id - > size , 0 , size - abfd - > build_id - > size ) ;
err = bid - > size = abfd - > build_id - > size ;
2020-08-21 19:52:36 +03:00
out_close :
bfd_close ( abfd ) ;
return err ;
}
# else // HAVE_LIBBFD_BUILDID_SUPPORT
2020-12-14 13:54:48 +03:00
static int read_build_id ( const char * filename , struct build_id * bid )
2012-08-06 08:41:20 +04:00
{
2020-10-13 22:24:34 +03:00
size_t size = sizeof ( bid - > data ) ;
2012-08-06 08:41:20 +04:00
int fd , err = - 1 ;
Elf * elf ;
if ( size < BUILD_ID_SIZE )
goto out ;
fd = open ( filename , O_RDONLY ) ;
if ( fd < 0 )
goto out ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL ) {
pr_debug2 ( " %s: cannot read %s ELF file. \n " , __func__ , filename ) ;
goto out_close ;
}
2020-10-13 22:24:34 +03:00
err = elf_read_build_id ( elf , bid - > data , size ) ;
if ( err > 0 )
bid - > size = err ;
2012-08-06 08:41:20 +04:00
elf_end ( elf ) ;
out_close :
close ( fd ) ;
out :
return err ;
}
2020-08-21 19:52:36 +03:00
# endif // HAVE_LIBBFD_BUILDID_SUPPORT
2020-12-14 13:54:48 +03:00
int filename__read_build_id ( const char * filename , struct build_id * bid )
{
struct kmod_path m = { . name = NULL , } ;
char path [ PATH_MAX ] ;
int err ;
if ( ! filename )
return - EFAULT ;
err = kmod_path__parse ( & m , filename ) ;
if ( err )
return - 1 ;
if ( m . comp ) {
int error = 0 , fd ;
fd = filename__decompress ( filename , path , sizeof ( path ) , m . comp , & error ) ;
if ( fd < 0 ) {
pr_debug ( " Failed to decompress (error %d) %s \n " ,
error , filename ) ;
return - 1 ;
}
close ( fd ) ;
filename = path ;
}
err = read_build_id ( filename , bid ) ;
if ( m . comp )
unlink ( filename ) ;
return err ;
}
2020-10-13 22:24:35 +03:00
int sysfs__read_build_id ( const char * filename , struct build_id * bid )
2012-08-06 08:41:20 +04:00
{
2020-10-13 22:24:35 +03:00
size_t size = sizeof ( bid - > data ) ;
2012-08-06 08:41:20 +04:00
int fd , err = - 1 ;
fd = open ( filename , O_RDONLY ) ;
if ( fd < 0 )
goto out ;
while ( 1 ) {
char bf [ BUFSIZ ] ;
GElf_Nhdr nhdr ;
size_t namesz , descsz ;
if ( read ( fd , & nhdr , sizeof ( nhdr ) ) ! = sizeof ( nhdr ) )
break ;
namesz = NOTE_ALIGN ( nhdr . n_namesz ) ;
descsz = NOTE_ALIGN ( nhdr . n_descsz ) ;
if ( nhdr . n_type = = NT_GNU_BUILD_ID & &
nhdr . n_namesz = = sizeof ( " GNU " ) ) {
if ( read ( fd , bf , namesz ) ! = ( ssize_t ) namesz )
break ;
if ( memcmp ( bf , " GNU " , sizeof ( " GNU " ) ) = = 0 ) {
size_t sz = min ( descsz , size ) ;
2020-10-13 22:24:35 +03:00
if ( read ( fd , bid - > data , sz ) = = ( ssize_t ) sz ) {
memset ( bid - > data + sz , 0 , size - sz ) ;
bid - > size = sz ;
2012-08-06 08:41:20 +04:00
err = 0 ;
break ;
}
} else if ( read ( fd , bf , descsz ) ! = ( ssize_t ) descsz )
break ;
} else {
int n = namesz + descsz ;
2017-01-03 21:19:21 +03:00
if ( n > ( int ) sizeof ( bf ) ) {
n = sizeof ( bf ) ;
pr_debug ( " %s: truncating reading of build id in sysfs file %s: n_namesz=%u, n_descsz=%u. \n " ,
__func__ , filename , nhdr . n_namesz , nhdr . n_descsz ) ;
}
2012-08-06 08:41:20 +04:00
if ( read ( fd , bf , n ) ! = n )
break ;
}
}
close ( fd ) ;
out :
return err ;
}
2020-08-21 19:52:36 +03:00
# ifdef HAVE_LIBBFD_SUPPORT
int filename__read_debuglink ( const char * filename , char * debuglink ,
size_t size )
{
int err = - 1 ;
asection * section ;
bfd * abfd ;
abfd = bfd_openr ( filename , NULL ) ;
if ( ! abfd )
return - 1 ;
if ( ! bfd_check_format ( abfd , bfd_object ) ) {
pr_debug2 ( " %s: cannot read %s bfd file. \n " , __func__ , filename ) ;
goto out_close ;
}
section = bfd_get_section_by_name ( abfd , " .gnu_debuglink " ) ;
if ( ! section )
goto out_close ;
if ( section - > size > size )
goto out_close ;
if ( ! bfd_get_section_contents ( abfd , section , debuglink , 0 ,
section - > size ) )
goto out_close ;
err = 0 ;
out_close :
bfd_close ( abfd ) ;
return err ;
}
# else
2012-08-06 08:41:20 +04:00
int filename__read_debuglink ( const char * filename , char * debuglink ,
size_t size )
{
int fd , err = - 1 ;
Elf * elf ;
GElf_Ehdr ehdr ;
GElf_Shdr shdr ;
Elf_Data * data ;
Elf_Scn * sec ;
Elf_Kind ek ;
fd = open ( filename , O_RDONLY ) ;
if ( fd < 0 )
goto out ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL ) {
pr_debug2 ( " %s: cannot read %s ELF file. \n " , __func__ , filename ) ;
goto out_close ;
}
ek = elf_kind ( elf ) ;
if ( ek ! = ELF_K_ELF )
2013-10-11 04:27:57 +04:00
goto out_elf_end ;
2012-08-06 08:41:20 +04:00
if ( gelf_getehdr ( elf , & ehdr ) = = NULL ) {
pr_err ( " %s: cannot get elf header. \n " , __func__ ) ;
2013-10-11 04:27:57 +04:00
goto out_elf_end ;
2012-08-06 08:41:20 +04:00
}
sec = elf_section_by_name ( elf , & ehdr , & shdr ,
" .gnu_debuglink " , NULL ) ;
if ( sec = = NULL )
2013-10-11 04:27:57 +04:00
goto out_elf_end ;
2012-08-06 08:41:20 +04:00
data = elf_getdata ( sec , NULL ) ;
if ( data = = NULL )
2013-10-11 04:27:57 +04:00
goto out_elf_end ;
2012-08-06 08:41:20 +04:00
/* the start of this section is a zero-terminated string */
strncpy ( debuglink , data - > d_buf , size ) ;
2014-02-20 05:32:55 +04:00
err = 0 ;
2013-10-11 04:27:57 +04:00
out_elf_end :
2012-08-06 08:41:20 +04:00
elf_end ( elf ) ;
out_close :
close ( fd ) ;
out :
return err ;
}
2020-08-21 19:52:36 +03:00
# endif
2012-08-06 08:41:20 +04:00
static int dso__swap_init ( struct dso * dso , unsigned char eidata )
{
static unsigned int const endian = 1 ;
dso - > needs_swap = DSO_SWAP__NO ;
switch ( eidata ) {
case ELFDATA2LSB :
/* We are big endian, DSO is little endian. */
if ( * ( unsigned char const * ) & endian ! = 1 )
dso - > needs_swap = DSO_SWAP__YES ;
break ;
case ELFDATA2MSB :
/* We are little endian, DSO is big endian. */
if ( * ( unsigned char const * ) & endian ! = 0 )
dso - > needs_swap = DSO_SWAP__YES ;
break ;
default :
pr_err ( " unrecognized DSO data encoding %d \n " , eidata ) ;
return - EINVAL ;
}
return 0 ;
}
2012-08-11 02:23:02 +04:00
bool symsrc__possibly_runtime ( struct symsrc * ss )
{
return ss - > dynsym | | ss - > opdsec ;
}
2012-08-11 02:23:00 +04:00
bool symsrc__has_symtab ( struct symsrc * ss )
{
return ss - > symtab ! = NULL ;
}
2012-08-11 02:22:57 +04:00
void symsrc__destroy ( struct symsrc * ss )
{
2013-12-27 23:55:14 +04:00
zfree ( & ss - > name ) ;
2012-08-11 02:22:57 +04:00
elf_end ( ss - > elf ) ;
close ( ss - > fd ) ;
}
2020-03-06 04:57:58 +03:00
bool elf__needs_adjust_symbols ( GElf_Ehdr ehdr )
2015-04-28 15:05:35 +03:00
{
2020-03-06 04:57:58 +03:00
/*
* Usually vmlinux is an ELF file with type ET_EXEC for most
* architectures ; except Arm64 kernel is linked with option
* ' - share ' , so need to check type ET_DYN .
*/
return ehdr . e_type = = ET_EXEC | | ehdr . e_type = = ET_REL | |
ehdr . e_type = = ET_DYN ;
2015-04-28 15:05:35 +03:00
}
2012-08-11 02:22:57 +04:00
int symsrc__init ( struct symsrc * ss , struct dso * dso , const char * name ,
enum dso_binary_type type )
2012-08-06 08:41:20 +04:00
{
GElf_Ehdr ehdr ;
Elf * elf ;
2012-08-11 02:22:57 +04:00
int fd ;
2015-03-24 17:49:02 +03:00
if ( dso__needs_decompress ( dso ) ) {
2017-06-08 10:31:03 +03:00
fd = dso__decompress_kmodule_fd ( dso , name ) ;
2015-03-24 17:49:02 +03:00
if ( fd < 0 )
return - 1 ;
2017-06-08 10:31:08 +03:00
type = dso - > symtab_type ;
2015-03-24 17:49:02 +03:00
} else {
2014-11-04 04:14:27 +03:00
fd = open ( name , O_RDONLY ) ;
2015-03-24 17:49:02 +03:00
if ( fd < 0 ) {
dso - > load_errno = errno ;
return - 1 ;
}
}
2012-08-06 08:41:20 +04:00
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL ) {
pr_debug ( " %s: cannot read %s ELF file. \n " , __func__ , name ) ;
2015-03-24 17:49:02 +03:00
dso - > load_errno = DSO_LOAD_ERRNO__INVALID_ELF ;
2012-08-06 08:41:20 +04:00
goto out_close ;
}
if ( gelf_getehdr ( elf , & ehdr ) = = NULL ) {
2015-03-24 17:49:02 +03:00
dso - > load_errno = DSO_LOAD_ERRNO__INVALID_ELF ;
2012-08-06 08:41:20 +04:00
pr_debug ( " %s: cannot get elf header. \n " , __func__ ) ;
goto out_elf_end ;
}
2015-03-24 17:49:02 +03:00
if ( dso__swap_init ( dso , ehdr . e_ident [ EI_DATA ] ) ) {
dso - > load_errno = DSO_LOAD_ERRNO__INTERNAL_ERROR ;
2012-08-06 08:41:20 +04:00
goto out_elf_end ;
2015-03-24 17:49:02 +03:00
}
2012-08-06 08:41:20 +04:00
/* Always reject images with a mismatched build-id: */
2016-08-25 19:24:42 +03:00
if ( dso - > has_build_id & & ! symbol_conf . ignore_vmlinux_buildid ) {
2012-08-06 08:41:20 +04:00
u8 build_id [ BUILD_ID_SIZE ] ;
2020-10-13 22:24:38 +03:00
struct build_id bid ;
int size ;
2012-08-06 08:41:20 +04:00
2020-10-13 22:24:38 +03:00
size = elf_read_build_id ( elf , build_id , BUILD_ID_SIZE ) ;
if ( size < = 0 ) {
2015-03-24 17:49:02 +03:00
dso - > load_errno = DSO_LOAD_ERRNO__CANNOT_READ_BUILDID ;
2012-08-06 08:41:20 +04:00
goto out_elf_end ;
2015-03-24 17:49:02 +03:00
}
2012-08-06 08:41:20 +04:00
2020-10-13 22:24:38 +03:00
build_id__init ( & bid , build_id , size ) ;
if ( ! dso__build_id_equal ( dso , & bid ) ) {
2015-04-24 22:44:46 +03:00
pr_debug ( " %s: build id mismatch for %s. \n " , __func__ , name ) ;
2015-03-24 17:49:02 +03:00
dso - > load_errno = DSO_LOAD_ERRNO__MISMATCHING_BUILDID ;
2012-08-06 08:41:20 +04:00
goto out_elf_end ;
2015-03-24 17:49:02 +03:00
}
2012-08-06 08:41:20 +04:00
}
2014-07-14 14:02:41 +04:00
ss - > is_64_bit = ( gelf_getclass ( elf ) = = ELFCLASS64 ) ;
2012-08-11 02:22:57 +04:00
ss - > symtab = elf_section_by_name ( elf , & ehdr , & ss - > symshdr , " .symtab " ,
NULL ) ;
if ( ss - > symshdr . sh_type ! = SHT_SYMTAB )
ss - > symtab = NULL ;
ss - > dynsym_idx = 0 ;
ss - > dynsym = elf_section_by_name ( elf , & ehdr , & ss - > dynshdr , " .dynsym " ,
& ss - > dynsym_idx ) ;
if ( ss - > dynshdr . sh_type ! = SHT_DYNSYM )
ss - > dynsym = NULL ;
ss - > opdidx = 0 ;
ss - > opdsec = elf_section_by_name ( elf , & ehdr , & ss - > opdshdr , " .opd " ,
& ss - > opdidx ) ;
if ( ss - > opdshdr . sh_type ! = SHT_PROGBITS )
ss - > opdsec = NULL ;
2020-08-08 15:21:54 +03:00
if ( dso - > kernel = = DSO_SPACE__USER )
2016-04-07 13:24:31 +03:00
ss - > adjust_symbols = true ;
else
2015-04-28 15:05:35 +03:00
ss - > adjust_symbols = elf__needs_adjust_symbols ( ehdr ) ;
2012-08-11 02:22:57 +04:00
ss - > name = strdup ( name ) ;
2015-03-24 17:49:02 +03:00
if ( ! ss - > name ) {
dso - > load_errno = errno ;
2012-08-11 02:22:57 +04:00
goto out_elf_end ;
2015-03-24 17:49:02 +03:00
}
2012-08-11 02:22:57 +04:00
ss - > elf = elf ;
ss - > fd = fd ;
ss - > ehdr = ehdr ;
ss - > type = type ;
return 0 ;
out_elf_end :
elf_end ( elf ) ;
out_close :
close ( fd ) ;
2019-05-30 12:38:01 +03:00
return - 1 ;
2012-08-11 02:22:57 +04:00
}
2013-08-07 15:38:47 +04:00
/**
* ref_reloc_sym_not_found - has kernel relocation symbol been found .
* @ kmap : kernel maps and relocation reference symbol
*
* This function returns % true if we are dealing with the kernel maps and the
* relocation reference symbol has not yet been found . Otherwise % false is
* returned .
*/
static bool ref_reloc_sym_not_found ( struct kmap * kmap )
{
return kmap & & kmap - > ref_reloc_sym & & kmap - > ref_reloc_sym - > name & &
! kmap - > ref_reloc_sym - > unrelocated_addr ;
}
/**
* ref_reloc - kernel relocation offset .
* @ kmap : kernel maps and relocation reference symbol
*
* This function returns the offset of kernel addresses as determined by using
* the relocation reference symbol i . e . if the kernel has not been relocated
* then the return value is zero .
*/
static u64 ref_reloc ( struct kmap * kmap )
{
if ( kmap & & kmap - > ref_reloc_sym & &
kmap - > ref_reloc_sym - > unrelocated_addr )
return kmap - > ref_reloc_sym - > addr -
kmap - > ref_reloc_sym - > unrelocated_addr ;
return 0 ;
}
2016-04-12 12:10:50 +03:00
void __weak arch__sym_update ( struct symbol * s __maybe_unused ,
GElf_Sym * sym __maybe_unused ) { }
2015-04-28 15:05:38 +03:00
2018-04-27 21:15:24 +03:00
static int dso__process_kernel_symbol ( struct dso * dso , struct map * map ,
GElf_Sym * sym , GElf_Shdr * shdr ,
2019-11-26 03:58:33 +03:00
struct maps * kmaps , struct kmap * kmap ,
2018-04-27 21:15:24 +03:00
struct dso * * curr_dsop , struct map * * curr_mapp ,
const char * section_name ,
bool adjust_kernel_syms , bool kmodule , bool * remap_kernel )
{
struct dso * curr_dso = * curr_dsop ;
struct map * curr_map ;
char dso_name [ PATH_MAX ] ;
/* Adjust symbol to map to file offset */
if ( adjust_kernel_syms )
sym - > st_value - = shdr - > sh_addr - shdr - > sh_offset ;
if ( strcmp ( section_name , ( curr_dso - > short_name + dso - > short_name_len ) ) = = 0 )
return 0 ;
if ( strcmp ( section_name , " .text " ) = = 0 ) {
/*
* The initial kernel mapping is based on
* kallsyms and identity maps . Overwrite it to
* map to the kernel dso .
*/
2020-08-08 15:26:56 +03:00
if ( * remap_kernel & & dso - > kernel & & ! kmodule ) {
2018-04-27 21:15:24 +03:00
* remap_kernel = false ;
map - > start = shdr - > sh_addr + ref_reloc ( kmap ) ;
map - > end = map - > start + shdr - > sh_size ;
map - > pgoff = shdr - > sh_offset ;
map - > map_ip = map__map_ip ;
map - > unmap_ip = map__unmap_ip ;
/* Ensure maps are correctly ordered */
if ( kmaps ) {
map__get ( map ) ;
2019-11-26 03:58:33 +03:00
maps__remove ( kmaps , map ) ;
maps__insert ( kmaps , map ) ;
2018-04-27 21:15:24 +03:00
map__put ( map ) ;
}
}
/*
* The initial module mapping is based on
* / proc / modules mapped to offset zero .
* Overwrite it to map to the module dso .
*/
if ( * remap_kernel & & kmodule ) {
* remap_kernel = false ;
map - > pgoff = shdr - > sh_offset ;
}
* curr_mapp = map ;
* curr_dsop = dso ;
return 0 ;
}
if ( ! kmap )
return 0 ;
snprintf ( dso_name , sizeof ( dso_name ) , " %s%s " , dso - > short_name , section_name ) ;
2019-11-26 03:58:33 +03:00
curr_map = maps__find_by_name ( kmaps , dso_name ) ;
2018-04-27 21:15:24 +03:00
if ( curr_map = = NULL ) {
u64 start = sym - > st_value ;
if ( kmodule )
start + = map - > start + shdr - > sh_offset ;
curr_dso = dso__new ( dso_name ) ;
if ( curr_dso = = NULL )
return - 1 ;
curr_dso - > kernel = dso - > kernel ;
curr_dso - > long_name = dso - > long_name ;
curr_dso - > long_name_len = dso - > long_name_len ;
curr_map = map__new2 ( start , curr_dso ) ;
dso__put ( curr_dso ) ;
if ( curr_map = = NULL )
return - 1 ;
perf map: Set kmap->kmaps backpointer for main kernel map chunks
When a map is create to represent the main kernel area (vmlinux) with
map__new2() we allocate an extra area to store a pointer to the 'struct
maps' for the kernel maps, so that we can access that struct when
loading ELF files or kallsyms, as we will need to split it in multiple
maps, one per kernel module or ELF section (such as ".init.text").
So when map->dso->kernel is non-zero, it is expected that
map__kmap(map)->kmaps to be set to the tree of kernel maps (modules,
chunks of the main kernel, bpf progs put in place via
PERF_RECORD_KSYMBOL, the main kernel).
This was not the case when we were splitting the main kernel into chunks
for its ELF sections, which ended up making 'perf report --children'
processing a perf.data file with callchains to trip on
__map__is_kernel(), when we press ENTER to see the popup menu for main
histogram entries that starts at a symbol in the ".init.text" ELF
section, e.g.:
- 8.83% 0.00% swapper [kernel.vmlinux].init.text [k] start_kernel
start_kernel
cpu_startup_entry
do_idle
cpuidle_enter
cpuidle_enter_state
intel_idle
Fix it.
Reviewed-by: Jiri Olsa <jolsa@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lore.kernel.org/lkml/20191218190120.GB13282@kernel.org/
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-12-18 21:23:14 +03:00
if ( curr_dso - > kernel )
map__kmap ( curr_map ) - > kmaps = kmaps ;
2018-04-27 21:15:24 +03:00
if ( adjust_kernel_syms ) {
curr_map - > start = shdr - > sh_addr + ref_reloc ( kmap ) ;
curr_map - > end = curr_map - > start + shdr - > sh_size ;
curr_map - > pgoff = shdr - > sh_offset ;
} else {
curr_map - > map_ip = curr_map - > unmap_ip = identity__map_ip ;
}
curr_dso - > symtab_type = dso - > symtab_type ;
2019-11-26 03:58:33 +03:00
maps__insert ( kmaps , curr_map ) ;
2018-04-27 21:15:24 +03:00
/*
2021-03-23 19:09:15 +03:00
* Add it before we drop the reference to curr_map , i . e . while
2018-04-27 21:15:24 +03:00
* we still are sure to have a reference to this DSO via
* * curr_map - > dso .
*/
2019-11-04 22:09:48 +03:00
dsos__add ( & kmaps - > machine - > dsos , curr_dso ) ;
2018-04-27 21:15:24 +03:00
/* kmaps already got it */
map__put ( curr_map ) ;
dso__set_loaded ( curr_dso ) ;
* curr_mapp = curr_map ;
* curr_dsop = curr_dso ;
} else
* curr_dsop = curr_map - > dso ;
return 0 ;
}
2021-07-03 18:35:27 +03:00
static int
dso__load_sym_internal ( struct dso * dso , struct map * map , struct symsrc * syms_ss ,
struct symsrc * runtime_ss , int kmodule , int dynsym )
2012-08-11 02:22:57 +04:00
{
struct kmap * kmap = dso - > kernel ? map__kmap ( map ) : NULL ;
2019-11-26 03:58:33 +03:00
struct maps * kmaps = kmap ? map__kmaps ( map ) : NULL ;
2012-08-11 02:22:57 +04:00
struct map * curr_map = map ;
struct dso * curr_dso = dso ;
perf top: Fix overflow in elf_sec__is_text()
ASan reports a heap-buffer-overflow in elf_sec__is_text when using perf-top.
The bug is caused by the fact that secstrs is built from runtime_ss, while
shdr is built from syms_ss if shdr.sh_type != SHT_NOBITS. Therefore, they
point to two different ELF files.
This patch renames secstrs to secstrs_run and adds secstrs_sym, so that
the correct secstrs is chosen depending on shdr.sh_type.
$ ASAN_OPTIONS=abort_on_error=1:disable_coredump=0:unmap_shadow_on_exit=1 ./perf top
=================================================================
==363148==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x61300009add6 at pc 0x00000049875c bp 0x7f4f56446440 sp 0x7f4f56445bf0
READ of size 1 at 0x61300009add6 thread T6
#0 0x49875b in StrstrCheck(void*, char*, char const*, char const*) (/home/user/linux/tools/perf/perf+0x49875b)
#1 0x4d13a2 in strstr (/home/user/linux/tools/perf/perf+0x4d13a2)
#2 0xacae36 in elf_sec__is_text /home/user/linux/tools/perf/util/symbol-elf.c:176:9
#3 0xac3ec9 in elf_sec__filter /home/user/linux/tools/perf/util/symbol-elf.c:187:9
#4 0xac2c3d in dso__load_sym /home/user/linux/tools/perf/util/symbol-elf.c:1254:20
#5 0x883981 in dso__load /home/user/linux/tools/perf/util/symbol.c:1897:9
#6 0x8e6248 in map__load /home/user/linux/tools/perf/util/map.c:332:7
#7 0x8e66e5 in map__find_symbol /home/user/linux/tools/perf/util/map.c:366:6
#8 0x7f8278 in machine__resolve /home/user/linux/tools/perf/util/event.c:707:13
#9 0x5f3d1a in perf_event__process_sample /home/user/linux/tools/perf/builtin-top.c:773:6
#10 0x5f30e4 in deliver_event /home/user/linux/tools/perf/builtin-top.c:1197:3
#11 0x908a72 in do_flush /home/user/linux/tools/perf/util/ordered-events.c:244:9
#12 0x905fae in __ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:323:8
#13 0x9058db in ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:341:9
#14 0x5f19b1 in process_thread /home/user/linux/tools/perf/builtin-top.c:1109:7
#15 0x7f4f6a21a298 in start_thread /usr/src/debug/glibc-2.33-16.fc34.x86_64/nptl/pthread_create.c:481:8
#16 0x7f4f697d0352 in clone ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
0x61300009add6 is located 10 bytes to the right of 332-byte region [0x61300009ac80,0x61300009adcc)
allocated by thread T6 here:
#0 0x4f3f7f in malloc (/home/user/linux/tools/perf/perf+0x4f3f7f)
#1 0x7f4f6a0a88d9 (/lib64/libelf.so.1+0xa8d9)
Thread T6 created by T0 here:
#0 0x464856 in pthread_create (/home/user/linux/tools/perf/perf+0x464856)
#1 0x5f06e0 in __cmd_top /home/user/linux/tools/perf/builtin-top.c:1309:6
#2 0x5ef19f in cmd_top /home/user/linux/tools/perf/builtin-top.c:1762:11
#3 0x7b28c0 in run_builtin /home/user/linux/tools/perf/perf.c:313:11
#4 0x7b119f in handle_internal_command /home/user/linux/tools/perf/perf.c:365:8
#5 0x7b2423 in run_argv /home/user/linux/tools/perf/perf.c:409:2
#6 0x7b0c19 in main /home/user/linux/tools/perf/perf.c:539:3
#7 0x7f4f696f7b74 in __libc_start_main /usr/src/debug/glibc-2.33-16.fc34.x86_64/csu/../csu/libc-start.c:332:16
SUMMARY: AddressSanitizer: heap-buffer-overflow (/home/user/linux/tools/perf/perf+0x49875b) in StrstrCheck(void*, char*, char const*, char const*)
Shadow bytes around the buggy address:
0x0c268000b560: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b570: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b580: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b590: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
=>0x0c268000b5b0: 00 00 00 00 00 00 00 00 00 04[fa]fa fa fa fa fa
0x0c268000b5c0: fa fa fa fa fa fa fa fa 00 00 00 00 00 00 00 00
0x0c268000b5d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5f0: 07 fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b600: fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd
Shadow byte legend (one shadow byte represents 8 application bytes):
Addressable: 00
Partially addressable: 01 02 03 04 05 06 07
Heap left redzone: fa
Freed heap region: fd
Stack left redzone: f1
Stack mid redzone: f2
Stack right redzone: f3
Stack after return: f5
Stack use after scope: f8
Global redzone: f9
Global init order: f6
Poisoned by user: f7
Container overflow: fc
Array cookie: ac
Intra object redzone: bb
ASan internal: fe
Left alloca redzone: ca
Right alloca redzone: cb
Shadow gap: cc
==363148==ABORTING
Suggested-by: Jiri Slaby <jirislaby@kernel.org>
Signed-off-by: Riccardo Mancini <rickyman7@gmail.com>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Fabian Hemmer <copy@copy.sh>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Jiri Slaby <jirislaby@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Remi Bernon <rbernon@codeweavers.com>
Link: http://lore.kernel.org/lkml/20210621222108.196219-1-rickyman7@gmail.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-06-22 01:21:08 +03:00
Elf_Data * symstrs , * secstrs , * secstrs_run , * secstrs_sym ;
2012-08-11 02:22:57 +04:00
uint32_t nr_syms ;
int err = - 1 ;
uint32_t idx ;
GElf_Ehdr ehdr ;
2012-08-11 02:23:01 +04:00
GElf_Shdr shdr ;
2016-02-26 12:31:49 +03:00
GElf_Shdr tshdr ;
2012-08-11 02:22:57 +04:00
Elf_Data * syms , * opddata = NULL ;
GElf_Sym sym ;
2012-08-11 02:23:01 +04:00
Elf_Scn * sec , * sec_strndx ;
2012-08-11 02:22:57 +04:00
Elf * elf ;
int nr = 0 ;
2013-08-07 15:38:47 +04:00
bool remap_kernel = false , adjust_kernel_syms = false ;
2012-08-11 02:22:57 +04:00
2015-04-07 11:22:45 +03:00
if ( kmap & & ! kmaps )
return - 1 ;
2012-08-11 02:23:01 +04:00
elf = syms_ss - > elf ;
ehdr = syms_ss - > ehdr ;
2021-07-03 18:35:27 +03:00
if ( dynsym ) {
sec = syms_ss - > dynsym ;
shdr = syms_ss - > dynshdr ;
} else {
sec = syms_ss - > symtab ;
shdr = syms_ss - > symshdr ;
}
2012-08-11 02:22:57 +04:00
2016-08-13 04:55:33 +03:00
if ( elf_section_by_name ( runtime_ss - > elf , & runtime_ss - > ehdr , & tshdr ,
" .text " , NULL ) )
2016-02-26 12:31:49 +03:00
dso - > text_offset = tshdr . sh_addr - tshdr . sh_offset ;
2012-08-11 02:23:01 +04:00
if ( runtime_ss - > opdsec )
opddata = elf_rawdata ( runtime_ss - > opdsec , NULL ) ;
2012-08-06 08:41:20 +04:00
syms = elf_getdata ( sec , NULL ) ;
if ( syms = = NULL )
goto out_elf_end ;
sec = elf_getscn ( elf , shdr . sh_link ) ;
if ( sec = = NULL )
goto out_elf_end ;
symstrs = elf_getdata ( sec , NULL ) ;
if ( symstrs = = NULL )
goto out_elf_end ;
2014-07-31 10:00:46 +04:00
sec_strndx = elf_getscn ( runtime_ss - > elf , runtime_ss - > ehdr . e_shstrndx ) ;
2012-08-06 08:41:20 +04:00
if ( sec_strndx = = NULL )
goto out_elf_end ;
perf top: Fix overflow in elf_sec__is_text()
ASan reports a heap-buffer-overflow in elf_sec__is_text when using perf-top.
The bug is caused by the fact that secstrs is built from runtime_ss, while
shdr is built from syms_ss if shdr.sh_type != SHT_NOBITS. Therefore, they
point to two different ELF files.
This patch renames secstrs to secstrs_run and adds secstrs_sym, so that
the correct secstrs is chosen depending on shdr.sh_type.
$ ASAN_OPTIONS=abort_on_error=1:disable_coredump=0:unmap_shadow_on_exit=1 ./perf top
=================================================================
==363148==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x61300009add6 at pc 0x00000049875c bp 0x7f4f56446440 sp 0x7f4f56445bf0
READ of size 1 at 0x61300009add6 thread T6
#0 0x49875b in StrstrCheck(void*, char*, char const*, char const*) (/home/user/linux/tools/perf/perf+0x49875b)
#1 0x4d13a2 in strstr (/home/user/linux/tools/perf/perf+0x4d13a2)
#2 0xacae36 in elf_sec__is_text /home/user/linux/tools/perf/util/symbol-elf.c:176:9
#3 0xac3ec9 in elf_sec__filter /home/user/linux/tools/perf/util/symbol-elf.c:187:9
#4 0xac2c3d in dso__load_sym /home/user/linux/tools/perf/util/symbol-elf.c:1254:20
#5 0x883981 in dso__load /home/user/linux/tools/perf/util/symbol.c:1897:9
#6 0x8e6248 in map__load /home/user/linux/tools/perf/util/map.c:332:7
#7 0x8e66e5 in map__find_symbol /home/user/linux/tools/perf/util/map.c:366:6
#8 0x7f8278 in machine__resolve /home/user/linux/tools/perf/util/event.c:707:13
#9 0x5f3d1a in perf_event__process_sample /home/user/linux/tools/perf/builtin-top.c:773:6
#10 0x5f30e4 in deliver_event /home/user/linux/tools/perf/builtin-top.c:1197:3
#11 0x908a72 in do_flush /home/user/linux/tools/perf/util/ordered-events.c:244:9
#12 0x905fae in __ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:323:8
#13 0x9058db in ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:341:9
#14 0x5f19b1 in process_thread /home/user/linux/tools/perf/builtin-top.c:1109:7
#15 0x7f4f6a21a298 in start_thread /usr/src/debug/glibc-2.33-16.fc34.x86_64/nptl/pthread_create.c:481:8
#16 0x7f4f697d0352 in clone ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
0x61300009add6 is located 10 bytes to the right of 332-byte region [0x61300009ac80,0x61300009adcc)
allocated by thread T6 here:
#0 0x4f3f7f in malloc (/home/user/linux/tools/perf/perf+0x4f3f7f)
#1 0x7f4f6a0a88d9 (/lib64/libelf.so.1+0xa8d9)
Thread T6 created by T0 here:
#0 0x464856 in pthread_create (/home/user/linux/tools/perf/perf+0x464856)
#1 0x5f06e0 in __cmd_top /home/user/linux/tools/perf/builtin-top.c:1309:6
#2 0x5ef19f in cmd_top /home/user/linux/tools/perf/builtin-top.c:1762:11
#3 0x7b28c0 in run_builtin /home/user/linux/tools/perf/perf.c:313:11
#4 0x7b119f in handle_internal_command /home/user/linux/tools/perf/perf.c:365:8
#5 0x7b2423 in run_argv /home/user/linux/tools/perf/perf.c:409:2
#6 0x7b0c19 in main /home/user/linux/tools/perf/perf.c:539:3
#7 0x7f4f696f7b74 in __libc_start_main /usr/src/debug/glibc-2.33-16.fc34.x86_64/csu/../csu/libc-start.c:332:16
SUMMARY: AddressSanitizer: heap-buffer-overflow (/home/user/linux/tools/perf/perf+0x49875b) in StrstrCheck(void*, char*, char const*, char const*)
Shadow bytes around the buggy address:
0x0c268000b560: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b570: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b580: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b590: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
=>0x0c268000b5b0: 00 00 00 00 00 00 00 00 00 04[fa]fa fa fa fa fa
0x0c268000b5c0: fa fa fa fa fa fa fa fa 00 00 00 00 00 00 00 00
0x0c268000b5d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5f0: 07 fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b600: fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd
Shadow byte legend (one shadow byte represents 8 application bytes):
Addressable: 00
Partially addressable: 01 02 03 04 05 06 07
Heap left redzone: fa
Freed heap region: fd
Stack left redzone: f1
Stack mid redzone: f2
Stack right redzone: f3
Stack after return: f5
Stack use after scope: f8
Global redzone: f9
Global init order: f6
Poisoned by user: f7
Container overflow: fc
Array cookie: ac
Intra object redzone: bb
ASan internal: fe
Left alloca redzone: ca
Right alloca redzone: cb
Shadow gap: cc
==363148==ABORTING
Suggested-by: Jiri Slaby <jirislaby@kernel.org>
Signed-off-by: Riccardo Mancini <rickyman7@gmail.com>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Fabian Hemmer <copy@copy.sh>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Jiri Slaby <jirislaby@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Remi Bernon <rbernon@codeweavers.com>
Link: http://lore.kernel.org/lkml/20210621222108.196219-1-rickyman7@gmail.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-06-22 01:21:08 +03:00
secstrs_run = elf_getdata ( sec_strndx , NULL ) ;
if ( secstrs_run = = NULL )
goto out_elf_end ;
sec_strndx = elf_getscn ( elf , ehdr . e_shstrndx ) ;
if ( sec_strndx = = NULL )
goto out_elf_end ;
secstrs_sym = elf_getdata ( sec_strndx , NULL ) ;
if ( secstrs_sym = = NULL )
2012-08-06 08:41:20 +04:00
goto out_elf_end ;
nr_syms = shdr . sh_size / shdr . sh_entsize ;
memset ( & sym , 0 , sizeof ( sym ) ) ;
2013-08-07 15:38:47 +04:00
/*
* The kernel relocation symbol is needed in advance in order to adjust
* kernel maps correctly .
*/
if ( ref_reloc_sym_not_found ( kmap ) ) {
elf_symtab__for_each_symbol ( syms , nr_syms , idx , sym ) {
const char * elf_name = elf_sym__name ( & sym , symstrs ) ;
if ( strcmp ( elf_name , kmap - > ref_reloc_sym - > name ) )
continue ;
kmap - > ref_reloc_sym - > unrelocated_addr = sym . st_value ;
2014-01-29 18:14:36 +04:00
map - > reloc = kmap - > ref_reloc_sym - > addr -
kmap - > ref_reloc_sym - > unrelocated_addr ;
2013-08-07 15:38:47 +04:00
break ;
}
}
2015-08-14 15:50:06 +03:00
/*
* Handle any relocation of vdso necessary because older kernels
* attempted to prelink vdso to its virtual address .
*/
2016-02-26 12:31:49 +03:00
if ( dso__is_vdso ( dso ) )
map - > reloc = map - > start - dso - > text_offset ;
2015-08-14 15:50:06 +03:00
2013-08-07 15:38:47 +04:00
dso - > adjust_symbols = runtime_ss - > adjust_symbols | | ref_reloc ( kmap ) ;
/*
2018-04-26 18:36:37 +03:00
* Initial kernel and module mappings do not map to the dso .
* Flag the fixups .
2013-08-07 15:38:47 +04:00
*/
2020-08-08 15:26:56 +03:00
if ( dso - > kernel ) {
2013-08-07 15:38:47 +04:00
remap_kernel = true ;
adjust_kernel_syms = dso - > adjust_symbols ;
}
2012-08-06 08:41:20 +04:00
elf_symtab__for_each_symbol ( syms , nr_syms , idx , sym ) {
struct symbol * f ;
const char * elf_name = elf_sym__name ( & sym , symstrs ) ;
char * demangled = NULL ;
int is_label = elf_sym__is_label ( & sym ) ;
const char * section_name ;
2012-08-11 02:23:01 +04:00
bool used_opd = false ;
2012-08-06 08:41:20 +04:00
2018-04-26 22:52:34 +03:00
if ( ! is_label & & ! elf_sym__filter ( & sym ) )
2012-08-06 08:41:20 +04:00
continue ;
/* Reject ARM ELF "mapping symbols": these aren't unique and
* don ' t identify functions , so will confuse the profile
* output : */
2015-01-27 09:34:01 +03:00
if ( ehdr . e_machine = = EM_ARM | | ehdr . e_machine = = EM_AARCH64 ) {
if ( elf_name [ 0 ] = = ' $ ' & & strchr ( " adtx " , elf_name [ 1 ] )
& & ( elf_name [ 2 ] = = ' \0 ' | | elf_name [ 2 ] = = ' . ' ) )
2012-08-06 08:41:20 +04:00
continue ;
}
2012-08-11 02:23:01 +04:00
if ( runtime_ss - > opdsec & & sym . st_shndx = = runtime_ss - > opdidx ) {
u32 offset = sym . st_value - syms_ss - > opdshdr . sh_addr ;
2012-08-06 08:41:20 +04:00
u64 * opd = opddata - > d_buf + offset ;
sym . st_value = DSO__SWAP ( dso , u64 , * opd ) ;
2012-08-11 02:23:01 +04:00
sym . st_shndx = elf_addr_to_index ( runtime_ss - > elf ,
sym . st_value ) ;
used_opd = true ;
2012-08-06 08:41:20 +04:00
}
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
2012-11-21 16:49:44 +04:00
/*
* When loading symbols in a data mapping , ABS symbols ( which
* has a value of SHN_ABS in its st_shndx ) failed at
* elf_getscn ( ) . And it marks the loading as a failure so
* already loaded symbols cannot be fixed up .
*
* I ' m not sure what should be done . Just ignore them for now .
* - Namhyung Kim
*/
if ( sym . st_shndx = = SHN_ABS )
continue ;
2012-08-06 08:41:20 +04:00
2021-02-17 15:21:25 +03:00
sec = elf_getscn ( syms_ss - > elf , sym . st_shndx ) ;
2012-08-06 08:41:20 +04:00
if ( ! sec )
goto out_elf_end ;
gelf_getshdr ( sec , & shdr ) ;
2022-07-24 09:00:13 +03:00
/*
* If the attribute bit SHF_ALLOC is not set , the section
* doesn ' t occupy memory during process execution .
* E . g . " .gnu.warning.* " section is used by linker to generate
* warnings when calling deprecated functions , the symbols in
* the section aren ' t loaded to memory during process execution ,
* so skip them .
*/
if ( ! ( shdr . sh_flags & SHF_ALLOC ) )
continue ;
perf top: Fix overflow in elf_sec__is_text()
ASan reports a heap-buffer-overflow in elf_sec__is_text when using perf-top.
The bug is caused by the fact that secstrs is built from runtime_ss, while
shdr is built from syms_ss if shdr.sh_type != SHT_NOBITS. Therefore, they
point to two different ELF files.
This patch renames secstrs to secstrs_run and adds secstrs_sym, so that
the correct secstrs is chosen depending on shdr.sh_type.
$ ASAN_OPTIONS=abort_on_error=1:disable_coredump=0:unmap_shadow_on_exit=1 ./perf top
=================================================================
==363148==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x61300009add6 at pc 0x00000049875c bp 0x7f4f56446440 sp 0x7f4f56445bf0
READ of size 1 at 0x61300009add6 thread T6
#0 0x49875b in StrstrCheck(void*, char*, char const*, char const*) (/home/user/linux/tools/perf/perf+0x49875b)
#1 0x4d13a2 in strstr (/home/user/linux/tools/perf/perf+0x4d13a2)
#2 0xacae36 in elf_sec__is_text /home/user/linux/tools/perf/util/symbol-elf.c:176:9
#3 0xac3ec9 in elf_sec__filter /home/user/linux/tools/perf/util/symbol-elf.c:187:9
#4 0xac2c3d in dso__load_sym /home/user/linux/tools/perf/util/symbol-elf.c:1254:20
#5 0x883981 in dso__load /home/user/linux/tools/perf/util/symbol.c:1897:9
#6 0x8e6248 in map__load /home/user/linux/tools/perf/util/map.c:332:7
#7 0x8e66e5 in map__find_symbol /home/user/linux/tools/perf/util/map.c:366:6
#8 0x7f8278 in machine__resolve /home/user/linux/tools/perf/util/event.c:707:13
#9 0x5f3d1a in perf_event__process_sample /home/user/linux/tools/perf/builtin-top.c:773:6
#10 0x5f30e4 in deliver_event /home/user/linux/tools/perf/builtin-top.c:1197:3
#11 0x908a72 in do_flush /home/user/linux/tools/perf/util/ordered-events.c:244:9
#12 0x905fae in __ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:323:8
#13 0x9058db in ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:341:9
#14 0x5f19b1 in process_thread /home/user/linux/tools/perf/builtin-top.c:1109:7
#15 0x7f4f6a21a298 in start_thread /usr/src/debug/glibc-2.33-16.fc34.x86_64/nptl/pthread_create.c:481:8
#16 0x7f4f697d0352 in clone ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
0x61300009add6 is located 10 bytes to the right of 332-byte region [0x61300009ac80,0x61300009adcc)
allocated by thread T6 here:
#0 0x4f3f7f in malloc (/home/user/linux/tools/perf/perf+0x4f3f7f)
#1 0x7f4f6a0a88d9 (/lib64/libelf.so.1+0xa8d9)
Thread T6 created by T0 here:
#0 0x464856 in pthread_create (/home/user/linux/tools/perf/perf+0x464856)
#1 0x5f06e0 in __cmd_top /home/user/linux/tools/perf/builtin-top.c:1309:6
#2 0x5ef19f in cmd_top /home/user/linux/tools/perf/builtin-top.c:1762:11
#3 0x7b28c0 in run_builtin /home/user/linux/tools/perf/perf.c:313:11
#4 0x7b119f in handle_internal_command /home/user/linux/tools/perf/perf.c:365:8
#5 0x7b2423 in run_argv /home/user/linux/tools/perf/perf.c:409:2
#6 0x7b0c19 in main /home/user/linux/tools/perf/perf.c:539:3
#7 0x7f4f696f7b74 in __libc_start_main /usr/src/debug/glibc-2.33-16.fc34.x86_64/csu/../csu/libc-start.c:332:16
SUMMARY: AddressSanitizer: heap-buffer-overflow (/home/user/linux/tools/perf/perf+0x49875b) in StrstrCheck(void*, char*, char const*, char const*)
Shadow bytes around the buggy address:
0x0c268000b560: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b570: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b580: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b590: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
=>0x0c268000b5b0: 00 00 00 00 00 00 00 00 00 04[fa]fa fa fa fa fa
0x0c268000b5c0: fa fa fa fa fa fa fa fa 00 00 00 00 00 00 00 00
0x0c268000b5d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5f0: 07 fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b600: fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd
Shadow byte legend (one shadow byte represents 8 application bytes):
Addressable: 00
Partially addressable: 01 02 03 04 05 06 07
Heap left redzone: fa
Freed heap region: fd
Stack left redzone: f1
Stack mid redzone: f2
Stack right redzone: f3
Stack after return: f5
Stack use after scope: f8
Global redzone: f9
Global init order: f6
Poisoned by user: f7
Container overflow: fc
Array cookie: ac
Intra object redzone: bb
ASan internal: fe
Left alloca redzone: ca
Right alloca redzone: cb
Shadow gap: cc
==363148==ABORTING
Suggested-by: Jiri Slaby <jirislaby@kernel.org>
Signed-off-by: Riccardo Mancini <rickyman7@gmail.com>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Fabian Hemmer <copy@copy.sh>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Jiri Slaby <jirislaby@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Remi Bernon <rbernon@codeweavers.com>
Link: http://lore.kernel.org/lkml/20210621222108.196219-1-rickyman7@gmail.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-06-22 01:21:08 +03:00
secstrs = secstrs_sym ;
2021-02-17 15:21:25 +03:00
/*
* We have to fallback to runtime when syms ' section header has
* NOBITS set . NOBITS results in file offset ( sh_offset ) not
* being incremented . So sh_offset used below has different
* values for syms ( invalid ) and runtime ( valid ) .
*/
if ( shdr . sh_type = = SHT_NOBITS ) {
sec = elf_getscn ( runtime_ss - > elf , sym . st_shndx ) ;
if ( ! sec )
goto out_elf_end ;
gelf_getshdr ( sec , & shdr ) ;
perf top: Fix overflow in elf_sec__is_text()
ASan reports a heap-buffer-overflow in elf_sec__is_text when using perf-top.
The bug is caused by the fact that secstrs is built from runtime_ss, while
shdr is built from syms_ss if shdr.sh_type != SHT_NOBITS. Therefore, they
point to two different ELF files.
This patch renames secstrs to secstrs_run and adds secstrs_sym, so that
the correct secstrs is chosen depending on shdr.sh_type.
$ ASAN_OPTIONS=abort_on_error=1:disable_coredump=0:unmap_shadow_on_exit=1 ./perf top
=================================================================
==363148==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x61300009add6 at pc 0x00000049875c bp 0x7f4f56446440 sp 0x7f4f56445bf0
READ of size 1 at 0x61300009add6 thread T6
#0 0x49875b in StrstrCheck(void*, char*, char const*, char const*) (/home/user/linux/tools/perf/perf+0x49875b)
#1 0x4d13a2 in strstr (/home/user/linux/tools/perf/perf+0x4d13a2)
#2 0xacae36 in elf_sec__is_text /home/user/linux/tools/perf/util/symbol-elf.c:176:9
#3 0xac3ec9 in elf_sec__filter /home/user/linux/tools/perf/util/symbol-elf.c:187:9
#4 0xac2c3d in dso__load_sym /home/user/linux/tools/perf/util/symbol-elf.c:1254:20
#5 0x883981 in dso__load /home/user/linux/tools/perf/util/symbol.c:1897:9
#6 0x8e6248 in map__load /home/user/linux/tools/perf/util/map.c:332:7
#7 0x8e66e5 in map__find_symbol /home/user/linux/tools/perf/util/map.c:366:6
#8 0x7f8278 in machine__resolve /home/user/linux/tools/perf/util/event.c:707:13
#9 0x5f3d1a in perf_event__process_sample /home/user/linux/tools/perf/builtin-top.c:773:6
#10 0x5f30e4 in deliver_event /home/user/linux/tools/perf/builtin-top.c:1197:3
#11 0x908a72 in do_flush /home/user/linux/tools/perf/util/ordered-events.c:244:9
#12 0x905fae in __ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:323:8
#13 0x9058db in ordered_events__flush /home/user/linux/tools/perf/util/ordered-events.c:341:9
#14 0x5f19b1 in process_thread /home/user/linux/tools/perf/builtin-top.c:1109:7
#15 0x7f4f6a21a298 in start_thread /usr/src/debug/glibc-2.33-16.fc34.x86_64/nptl/pthread_create.c:481:8
#16 0x7f4f697d0352 in clone ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
0x61300009add6 is located 10 bytes to the right of 332-byte region [0x61300009ac80,0x61300009adcc)
allocated by thread T6 here:
#0 0x4f3f7f in malloc (/home/user/linux/tools/perf/perf+0x4f3f7f)
#1 0x7f4f6a0a88d9 (/lib64/libelf.so.1+0xa8d9)
Thread T6 created by T0 here:
#0 0x464856 in pthread_create (/home/user/linux/tools/perf/perf+0x464856)
#1 0x5f06e0 in __cmd_top /home/user/linux/tools/perf/builtin-top.c:1309:6
#2 0x5ef19f in cmd_top /home/user/linux/tools/perf/builtin-top.c:1762:11
#3 0x7b28c0 in run_builtin /home/user/linux/tools/perf/perf.c:313:11
#4 0x7b119f in handle_internal_command /home/user/linux/tools/perf/perf.c:365:8
#5 0x7b2423 in run_argv /home/user/linux/tools/perf/perf.c:409:2
#6 0x7b0c19 in main /home/user/linux/tools/perf/perf.c:539:3
#7 0x7f4f696f7b74 in __libc_start_main /usr/src/debug/glibc-2.33-16.fc34.x86_64/csu/../csu/libc-start.c:332:16
SUMMARY: AddressSanitizer: heap-buffer-overflow (/home/user/linux/tools/perf/perf+0x49875b) in StrstrCheck(void*, char*, char const*, char const*)
Shadow bytes around the buggy address:
0x0c268000b560: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b570: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b580: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b590: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
=>0x0c268000b5b0: 00 00 00 00 00 00 00 00 00 04[fa]fa fa fa fa fa
0x0c268000b5c0: fa fa fa fa fa fa fa fa 00 00 00 00 00 00 00 00
0x0c268000b5d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
0x0c268000b5f0: 07 fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
0x0c268000b600: fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd fd
Shadow byte legend (one shadow byte represents 8 application bytes):
Addressable: 00
Partially addressable: 01 02 03 04 05 06 07
Heap left redzone: fa
Freed heap region: fd
Stack left redzone: f1
Stack mid redzone: f2
Stack right redzone: f3
Stack after return: f5
Stack use after scope: f8
Global redzone: f9
Global init order: f6
Poisoned by user: f7
Container overflow: fc
Array cookie: ac
Intra object redzone: bb
ASan internal: fe
Left alloca redzone: ca
Right alloca redzone: cb
Shadow gap: cc
==363148==ABORTING
Suggested-by: Jiri Slaby <jirislaby@kernel.org>
Signed-off-by: Riccardo Mancini <rickyman7@gmail.com>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Fabian Hemmer <copy@copy.sh>
Cc: Ian Rogers <irogers@google.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Jiri Slaby <jirislaby@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Remi Bernon <rbernon@codeweavers.com>
Link: http://lore.kernel.org/lkml/20210621222108.196219-1-rickyman7@gmail.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2021-06-22 01:21:08 +03:00
secstrs = secstrs_run ;
2021-02-17 15:21:25 +03:00
}
2018-04-26 22:52:34 +03:00
if ( is_label & & ! elf_sec__filter ( & shdr , secstrs ) )
2012-08-06 08:41:20 +04:00
continue ;
section_name = elf_sec__name ( & shdr , secstrs ) ;
/* On ARM, symbols for thumb functions have 1 added to
* the symbol address as a flag - remove it */
if ( ( ehdr . e_machine = = EM_ARM ) & &
2018-04-26 18:45:17 +03:00
( GELF_ST_TYPE ( sym . st_info ) = = STT_FUNC ) & &
2012-08-06 08:41:20 +04:00
( sym . st_value & 1 ) )
- - sym . st_value ;
2020-08-08 15:26:56 +03:00
if ( dso - > kernel ) {
2018-04-27 21:15:24 +03:00
if ( dso__process_kernel_symbol ( dso , map , & sym , & shdr , kmaps , kmap , & curr_dso , & curr_map ,
section_name , adjust_kernel_syms , kmodule , & remap_kernel ) )
goto out_elf_end ;
2018-04-27 16:53:14 +03:00
} else if ( ( used_opd & & runtime_ss - > adjust_symbols ) | |
( ! used_opd & & syms_ss - > adjust_symbols ) ) {
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
GElf_Phdr phdr ;
2022-11-23 13:18:16 +03:00
if ( elf_read_program_header ( runtime_ss - > elf ,
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
( u64 ) sym . st_value , & phdr ) ) {
2022-07-31 19:49:23 +03:00
pr_debug4 ( " %s: failed to find program header for "
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
" symbol: %s st_value: %# " PRIx64 " \n " ,
__func__ , elf_name , ( u64 ) sym . st_value ) ;
2022-07-31 19:49:23 +03:00
pr_debug4 ( " %s: adjusting symbol: st_value: %# " PRIx64 " "
" sh_addr: %# " PRIx64 " sh_offset: %# " PRIx64 " \n " ,
__func__ , ( u64 ) sym . st_value , ( u64 ) shdr . sh_addr ,
( u64 ) shdr . sh_offset ) ;
/*
* Fail to find program header , let ' s rollback
* to use shdr . sh_addr and shdr . sh_offset to
* calibrate symbol ' s file address , though this
* is not necessary for normal C ELF file , we
* still need to handle java JIT symbols in this
* case .
*/
sym . st_value - = shdr . sh_addr - shdr . sh_offset ;
} else {
pr_debug4 ( " %s: adjusting symbol: st_value: %# " PRIx64 " "
" p_vaddr: %# " PRIx64 " p_offset: %# " PRIx64 " \n " ,
__func__ , ( u64 ) sym . st_value , ( u64 ) phdr . p_vaddr ,
( u64 ) phdr . p_offset ) ;
sym . st_value - = phdr . p_vaddr - phdr . p_offset ;
perf symbol: Correct address for bss symbols
When using 'perf mem' and 'perf c2c', an issue is observed that tool
reports the wrong offset for global data symbols. This is a common
issue on both x86 and Arm64 platforms.
Let's see an example, for a test program, below is the disassembly for
its .bss section which is dumped with objdump:
...
Disassembly of section .bss:
0000000000004040 <completed.0>:
...
0000000000004080 <buf1>:
...
00000000000040c0 <buf2>:
...
0000000000004100 <thread>:
...
First we used 'perf mem record' to run the test program and then used
'perf --debug verbose=4 mem report' to observe what's the symbol info
for 'buf1' and 'buf2' structures.
# ./perf mem record -e ldlat-loads,ldlat-stores -- false_sharing.exe 8
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf2 0x30a8-0x30e8
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 sh_addr: 0x4040 sh_offset: 0x3028
symbol__new: buf1 0x3068-0x30a8
...
The perf tool relies on libelf to parse symbols, in executable and
shared object files, 'st_value' holds a virtual address; 'sh_addr' is
the address at which section's first byte should reside in memory, and
'sh_offset' is the byte offset from the beginning of the file to the
first byte in the section. The perf tool uses below formula to convert
a symbol's memory address to a file address:
file_address = st_value - sh_addr + sh_offset
^
` Memory address
We can see the final adjusted address ranges for buf1 and buf2 are
[0x30a8-0x30e8) and [0x3068-0x30a8) respectively, apparently this is
incorrect, in the code, the structure for 'buf1' and 'buf2' specifies
compiler attribute with 64-byte alignment.
The problem happens for 'sh_offset', libelf returns it as 0x3028 which
is not 64-byte aligned, combining with disassembly, it's likely libelf
doesn't respect the alignment for .bss section, therefore, it doesn't
return the aligned value for 'sh_offset'.
Suggested by Fangrui Song, ELF file contains program header which
contains PT_LOAD segments, the fields p_vaddr and p_offset in PT_LOAD
segments contain the execution info. A better choice for converting
memory address to file address is using the formula:
file_address = st_value - p_vaddr + p_offset
This patch introduces elf_read_program_header() which returns the
program header based on the passed 'st_value', then it uses the formula
above to calculate the symbol file address; and the debugging log is
updated respectively.
After applying the change:
# ./perf --debug verbose=4 mem report
...
dso__load_sym_internal: adjusting symbol: st_value: 0x40c0 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf2 0x30c0-0x3100
...
dso__load_sym_internal: adjusting symbol: st_value: 0x4080 p_vaddr: 0x3d28 p_offset: 0x2d28
symbol__new: buf1 0x3080-0x30c0
...
Fixes: f17e04afaff84b5c ("perf report: Fix ELF symbol parsing")
Reported-by: Chang Rui <changruinj@gmail.com>
Suggested-by: Fangrui Song <maskray@google.com>
Signed-off-by: Leo Yan <leo.yan@linaro.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220724060013.171050-2-leo.yan@linaro.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2022-07-24 09:00:12 +03:00
}
2012-08-06 08:41:20 +04:00
}
2018-04-27 21:15:24 +03:00
2016-08-30 14:41:02 +03:00
demangled = demangle_sym ( dso , kmodule , elf_name ) ;
if ( demangled ! = NULL )
elf_name = demangled ;
2016-07-09 10:20:00 +03:00
2012-08-06 08:41:20 +04:00
f = symbol__new ( sym . st_value , sym . st_size ,
2018-04-26 17:09:10 +03:00
GELF_ST_BIND ( sym . st_info ) ,
GELF_ST_TYPE ( sym . st_info ) , elf_name ) ;
2012-08-06 08:41:20 +04:00
free ( demangled ) ;
if ( ! f )
goto out_elf_end ;
2016-04-12 12:10:50 +03:00
arch__sym_update ( f , & sym ) ;
2018-04-26 22:52:34 +03:00
__symbols__insert ( & curr_dso - > symbols , f , dso - > kernel ) ;
2016-09-02 01:25:52 +03:00
nr + + ;
2012-08-06 08:41:20 +04:00
}
/*
* For misannotated , zeroed , ASM function sizes .
*/
if ( nr > 0 ) {
2022-04-16 03:40:46 +03:00
symbols__fixup_end ( & dso - > symbols , false ) ;
2018-04-26 22:52:34 +03:00
symbols__fixup_duplicate ( & dso - > symbols ) ;
2012-08-06 08:41:20 +04:00
if ( kmap ) {
/*
* We need to fixup this here too because we create new
* maps here , for things like vsyscall sections .
*/
2019-11-26 03:58:33 +03:00
maps__fixup_end ( kmaps ) ;
2012-08-06 08:41:20 +04:00
}
}
err = nr ;
out_elf_end :
return err ;
}
2021-07-03 18:35:27 +03:00
int dso__load_sym ( struct dso * dso , struct map * map , struct symsrc * syms_ss ,
struct symsrc * runtime_ss , int kmodule )
{
int nr = 0 ;
int err = - 1 ;
dso - > symtab_type = syms_ss - > type ;
dso - > is_64_bit = syms_ss - > is_64_bit ;
dso - > rel = syms_ss - > ehdr . e_type = = ET_REL ;
/*
* Modules may already have symbols from kallsyms , but those symbols
* have the wrong values for the dso maps , so remove them .
*/
if ( kmodule & & syms_ss - > symtab )
symbols__delete ( & dso - > symbols ) ;
if ( ! syms_ss - > symtab ) {
/*
* If the vmlinux is stripped , fail so we will fall back
* to using kallsyms . The vmlinux runtime symbols aren ' t
* of much use .
*/
if ( dso - > kernel )
return err ;
} else {
err = dso__load_sym_internal ( dso , map , syms_ss , runtime_ss ,
kmodule , 0 ) ;
if ( err < 0 )
return err ;
nr = err ;
}
if ( syms_ss - > dynsym ) {
err = dso__load_sym_internal ( dso , map , syms_ss , runtime_ss ,
kmodule , 1 ) ;
if ( err < 0 )
return err ;
err + = nr ;
}
return err ;
}
2013-08-07 15:38:51 +04:00
static int elf_read_maps ( Elf * elf , bool exe , mapfn_t mapfn , void * data )
{
GElf_Phdr phdr ;
size_t i , phdrnum ;
int err ;
u64 sz ;
if ( elf_getphdrnum ( elf , & phdrnum ) )
return - 1 ;
for ( i = 0 ; i < phdrnum ; i + + ) {
if ( gelf_getphdr ( elf , i , & phdr ) = = NULL )
return - 1 ;
if ( phdr . p_type ! = PT_LOAD )
continue ;
if ( exe ) {
if ( ! ( phdr . p_flags & PF_X ) )
continue ;
} else {
if ( ! ( phdr . p_flags & PF_R ) )
continue ;
}
sz = min ( phdr . p_memsz , phdr . p_filesz ) ;
if ( ! sz )
continue ;
err = mapfn ( phdr . p_vaddr , sz , phdr . p_offset , data ) ;
if ( err )
return err ;
}
return 0 ;
}
int file__read_maps ( int fd , bool exe , mapfn_t mapfn , void * data ,
bool * is_64_bit )
{
int err ;
Elf * elf ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL )
return - 1 ;
if ( is_64_bit )
* is_64_bit = ( gelf_getclass ( elf ) = = ELFCLASS64 ) ;
err = elf_read_maps ( elf , exe , mapfn , data ) ;
elf_end ( elf ) ;
return err ;
}
2014-07-22 17:17:59 +04:00
enum dso_type dso__type_fd ( int fd )
{
enum dso_type dso_type = DSO__TYPE_UNKNOWN ;
GElf_Ehdr ehdr ;
Elf_Kind ek ;
Elf * elf ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( elf = = NULL )
goto out ;
ek = elf_kind ( elf ) ;
if ( ek ! = ELF_K_ELF )
goto out_end ;
if ( gelf_getclass ( elf ) = = ELFCLASS64 ) {
dso_type = DSO__TYPE_64BIT ;
goto out_end ;
}
if ( gelf_getehdr ( elf , & ehdr ) = = NULL )
goto out_end ;
if ( ehdr . e_machine = = EM_X86_64 )
dso_type = DSO__TYPE_X32BIT ;
else
dso_type = DSO__TYPE_32BIT ;
out_end :
elf_end ( elf ) ;
out :
return dso_type ;
}
2013-10-09 16:01:12 +04:00
static int copy_bytes ( int from , off_t from_offs , int to , off_t to_offs , u64 len )
{
ssize_t r ;
size_t n ;
int err = - 1 ;
char * buf = malloc ( page_size ) ;
if ( buf = = NULL )
return - 1 ;
if ( lseek ( to , to_offs , SEEK_SET ) ! = to_offs )
goto out ;
if ( lseek ( from , from_offs , SEEK_SET ) ! = from_offs )
goto out ;
while ( len ) {
n = page_size ;
if ( len < n )
n = len ;
/* Use read because mmap won't work on proc files */
r = read ( from , buf , n ) ;
if ( r < 0 )
goto out ;
if ( ! r )
break ;
n = r ;
r = write ( to , buf , n ) ;
if ( r < 0 )
goto out ;
if ( ( size_t ) r ! = n )
goto out ;
len - = n ;
}
err = 0 ;
out :
free ( buf ) ;
return err ;
}
struct kcore {
int fd ;
int elfclass ;
Elf * elf ;
GElf_Ehdr ehdr ;
} ;
static int kcore__open ( struct kcore * kcore , const char * filename )
{
GElf_Ehdr * ehdr ;
kcore - > fd = open ( filename , O_RDONLY ) ;
if ( kcore - > fd = = - 1 )
return - 1 ;
kcore - > elf = elf_begin ( kcore - > fd , ELF_C_READ , NULL ) ;
if ( ! kcore - > elf )
goto out_close ;
kcore - > elfclass = gelf_getclass ( kcore - > elf ) ;
if ( kcore - > elfclass = = ELFCLASSNONE )
goto out_end ;
ehdr = gelf_getehdr ( kcore - > elf , & kcore - > ehdr ) ;
if ( ! ehdr )
goto out_end ;
return 0 ;
out_end :
elf_end ( kcore - > elf ) ;
out_close :
close ( kcore - > fd ) ;
return - 1 ;
}
static int kcore__init ( struct kcore * kcore , char * filename , int elfclass ,
bool temp )
{
kcore - > elfclass = elfclass ;
if ( temp )
kcore - > fd = mkstemp ( filename ) ;
else
kcore - > fd = open ( filename , O_WRONLY | O_CREAT | O_EXCL , 0400 ) ;
if ( kcore - > fd = = - 1 )
return - 1 ;
kcore - > elf = elf_begin ( kcore - > fd , ELF_C_WRITE , NULL ) ;
if ( ! kcore - > elf )
goto out_close ;
if ( ! gelf_newehdr ( kcore - > elf , elfclass ) )
goto out_end ;
2015-09-24 13:05:22 +03:00
memset ( & kcore - > ehdr , 0 , sizeof ( GElf_Ehdr ) ) ;
2013-10-09 16:01:12 +04:00
return 0 ;
out_end :
elf_end ( kcore - > elf ) ;
out_close :
close ( kcore - > fd ) ;
unlink ( filename ) ;
return - 1 ;
}
static void kcore__close ( struct kcore * kcore )
{
elf_end ( kcore - > elf ) ;
close ( kcore - > fd ) ;
}
static int kcore__copy_hdr ( struct kcore * from , struct kcore * to , size_t count )
{
GElf_Ehdr * ehdr = & to - > ehdr ;
GElf_Ehdr * kehdr = & from - > ehdr ;
memcpy ( ehdr - > e_ident , kehdr - > e_ident , EI_NIDENT ) ;
ehdr - > e_type = kehdr - > e_type ;
ehdr - > e_machine = kehdr - > e_machine ;
ehdr - > e_version = kehdr - > e_version ;
ehdr - > e_entry = 0 ;
ehdr - > e_shoff = 0 ;
ehdr - > e_flags = kehdr - > e_flags ;
ehdr - > e_phnum = count ;
ehdr - > e_shentsize = 0 ;
ehdr - > e_shnum = 0 ;
ehdr - > e_shstrndx = 0 ;
if ( from - > elfclass = = ELFCLASS32 ) {
ehdr - > e_phoff = sizeof ( Elf32_Ehdr ) ;
ehdr - > e_ehsize = sizeof ( Elf32_Ehdr ) ;
ehdr - > e_phentsize = sizeof ( Elf32_Phdr ) ;
} else {
ehdr - > e_phoff = sizeof ( Elf64_Ehdr ) ;
ehdr - > e_ehsize = sizeof ( Elf64_Ehdr ) ;
ehdr - > e_phentsize = sizeof ( Elf64_Phdr ) ;
}
if ( ! gelf_update_ehdr ( to - > elf , ehdr ) )
return - 1 ;
if ( ! gelf_newphdr ( to - > elf , count ) )
return - 1 ;
return 0 ;
}
static int kcore__add_phdr ( struct kcore * kcore , int idx , off_t offset ,
u64 addr , u64 len )
{
2015-09-24 13:05:22 +03:00
GElf_Phdr phdr = {
. p_type = PT_LOAD ,
. p_flags = PF_R | PF_W | PF_X ,
. p_offset = offset ,
. p_vaddr = addr ,
. p_paddr = 0 ,
. p_filesz = len ,
. p_memsz = len ,
. p_align = page_size ,
} ;
if ( ! gelf_update_phdr ( kcore - > elf , idx , & phdr ) )
2013-10-09 16:01:12 +04:00
return - 1 ;
return 0 ;
}
static off_t kcore__write ( struct kcore * kcore )
{
return elf_update ( kcore - > elf , ELF_C_WRITE ) ;
}
2013-10-14 17:57:29 +04:00
struct phdr_data {
off_t offset ;
2018-05-22 13:54:41 +03:00
off_t rel ;
2013-10-14 17:57:29 +04:00
u64 addr ;
u64 len ;
2018-05-22 13:54:38 +03:00
struct list_head node ;
2018-05-22 13:54:45 +03:00
struct phdr_data * remaps ;
2013-10-14 17:57:29 +04:00
} ;
2018-05-22 13:54:44 +03:00
struct sym_data {
u64 addr ;
struct list_head node ;
} ;
2013-10-14 17:57:29 +04:00
struct kcore_copy_info {
u64 stext ;
u64 etext ;
u64 first_symbol ;
u64 last_symbol ;
u64 first_module ;
2020-05-12 15:19:16 +03:00
u64 first_module_symbol ;
2013-10-14 17:57:29 +04:00
u64 last_module_symbol ;
2018-05-22 13:54:39 +03:00
size_t phnum ;
2018-05-22 13:54:38 +03:00
struct list_head phdrs ;
2018-05-22 13:54:44 +03:00
struct list_head syms ;
2013-10-14 17:57:29 +04:00
} ;
2018-05-22 13:54:41 +03:00
# define kcore_copy__for_each_phdr(k, p) \
list_for_each_entry ( ( p ) , & ( k ) - > phdrs , node )
2018-05-22 13:54:43 +03:00
static struct phdr_data * phdr_data__new ( u64 addr , u64 len , off_t offset )
{
struct phdr_data * p = zalloc ( sizeof ( * p ) ) ;
if ( p ) {
p - > addr = addr ;
p - > len = len ;
p - > offset = offset ;
}
return p ;
}
static struct phdr_data * kcore_copy_info__addnew ( struct kcore_copy_info * kci ,
u64 addr , u64 len ,
off_t offset )
{
struct phdr_data * p = phdr_data__new ( addr , len , offset ) ;
if ( p )
list_add_tail ( & p - > node , & kci - > phdrs ) ;
return p ;
}
static void kcore_copy__free_phdrs ( struct kcore_copy_info * kci )
{
struct phdr_data * p , * tmp ;
list_for_each_entry_safe ( p , tmp , & kci - > phdrs , node ) {
2019-07-04 18:13:46 +03:00
list_del_init ( & p - > node ) ;
2018-05-22 13:54:43 +03:00
free ( p ) ;
}
}
2018-05-22 13:54:44 +03:00
static struct sym_data * kcore_copy__new_sym ( struct kcore_copy_info * kci ,
u64 addr )
{
struct sym_data * s = zalloc ( sizeof ( * s ) ) ;
if ( s ) {
s - > addr = addr ;
list_add_tail ( & s - > node , & kci - > syms ) ;
}
return s ;
}
static void kcore_copy__free_syms ( struct kcore_copy_info * kci )
{
struct sym_data * s , * tmp ;
list_for_each_entry_safe ( s , tmp , & kci - > syms , node ) {
2019-07-04 18:13:46 +03:00
list_del_init ( & s - > node ) ;
2018-05-22 13:54:44 +03:00
free ( s ) ;
}
}
2013-10-14 17:57:29 +04:00
static int kcore_copy__process_kallsyms ( void * arg , const char * name , char type ,
u64 start )
{
struct kcore_copy_info * kci = arg ;
2018-04-25 23:16:31 +03:00
if ( ! kallsyms__is_function ( type ) )
2013-10-14 17:57:29 +04:00
return 0 ;
if ( strchr ( name , ' [ ' ) ) {
2020-05-12 15:19:16 +03:00
if ( ! kci - > first_module_symbol | | start < kci - > first_module_symbol )
kci - > first_module_symbol = start ;
2013-10-14 17:57:29 +04:00
if ( start > kci - > last_module_symbol )
kci - > last_module_symbol = start ;
return 0 ;
}
if ( ! kci - > first_symbol | | start < kci - > first_symbol )
kci - > first_symbol = start ;
if ( ! kci - > last_symbol | | start > kci - > last_symbol )
kci - > last_symbol = start ;
if ( ! strcmp ( name , " _stext " ) ) {
kci - > stext = start ;
return 0 ;
}
if ( ! strcmp ( name , " _etext " ) ) {
kci - > etext = start ;
return 0 ;
}
2018-05-22 13:54:44 +03:00
if ( is_entry_trampoline ( name ) & & ! kcore_copy__new_sym ( kci , start ) )
return - 1 ;
2013-10-14 17:57:29 +04:00
return 0 ;
}
static int kcore_copy__parse_kallsyms ( struct kcore_copy_info * kci ,
const char * dir )
{
char kallsyms_filename [ PATH_MAX ] ;
scnprintf ( kallsyms_filename , PATH_MAX , " %s/kallsyms " , dir ) ;
if ( symbol__restricted_filename ( kallsyms_filename , " /proc/kallsyms " ) )
return - 1 ;
if ( kallsyms__parse ( kallsyms_filename , kci ,
kcore_copy__process_kallsyms ) < 0 )
return - 1 ;
return 0 ;
}
static int kcore_copy__process_modules ( void * arg ,
const char * name __maybe_unused ,
2017-08-03 16:49:02 +03:00
u64 start , u64 size __maybe_unused )
2013-10-14 17:57:29 +04:00
{
struct kcore_copy_info * kci = arg ;
if ( ! kci - > first_module | | start < kci - > first_module )
kci - > first_module = start ;
return 0 ;
}
static int kcore_copy__parse_modules ( struct kcore_copy_info * kci ,
const char * dir )
{
char modules_filename [ PATH_MAX ] ;
scnprintf ( modules_filename , PATH_MAX , " %s/modules " , dir ) ;
if ( symbol__restricted_filename ( modules_filename , " /proc/modules " ) )
return - 1 ;
if ( modules__parse ( modules_filename , kci ,
kcore_copy__process_modules ) < 0 )
return - 1 ;
return 0 ;
}
2018-05-22 13:54:43 +03:00
static int kcore_copy__map ( struct kcore_copy_info * kci , u64 start , u64 end ,
u64 pgoff , u64 s , u64 e )
2013-10-14 17:57:29 +04:00
{
2018-05-22 13:54:43 +03:00
u64 len , offset ;
if ( s < start | | s > = end )
return 0 ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:43 +03:00
offset = ( s - start ) + pgoff ;
len = e < end ? e - s : end - s ;
return kcore_copy_info__addnew ( kci , s , len , offset ) ? 0 : - 1 ;
2013-10-14 17:57:29 +04:00
}
static int kcore_copy__read_map ( u64 start , u64 len , u64 pgoff , void * data )
{
struct kcore_copy_info * kci = data ;
u64 end = start + len ;
2018-05-22 13:54:44 +03:00
struct sym_data * sdat ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:43 +03:00
if ( kcore_copy__map ( kci , start , end , pgoff , kci - > stext , kci - > etext ) )
return - 1 ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:43 +03:00
if ( kcore_copy__map ( kci , start , end , pgoff , kci - > first_module ,
kci - > last_module_symbol ) )
return - 1 ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:44 +03:00
list_for_each_entry ( sdat , & kci - > syms , node ) {
u64 s = round_down ( sdat - > addr , page_size ) ;
if ( kcore_copy__map ( kci , start , end , pgoff , s , s + len ) )
return - 1 ;
}
2013-10-14 17:57:29 +04:00
return 0 ;
}
static int kcore_copy__read_maps ( struct kcore_copy_info * kci , Elf * elf )
{
if ( elf_read_maps ( elf , true , kcore_copy__read_map , kci ) < 0 )
return - 1 ;
return 0 ;
}
2018-05-22 13:54:45 +03:00
static void kcore_copy__find_remaps ( struct kcore_copy_info * kci )
{
struct phdr_data * p , * k = NULL ;
u64 kend ;
if ( ! kci - > stext )
return ;
/* Find phdr that corresponds to the kernel map (contains stext) */
kcore_copy__for_each_phdr ( kci , p ) {
u64 pend = p - > addr + p - > len - 1 ;
if ( p - > addr < = kci - > stext & & pend > = kci - > stext ) {
k = p ;
break ;
}
}
if ( ! k )
return ;
kend = k - > offset + k - > len ;
/* Find phdrs that remap the kernel */
kcore_copy__for_each_phdr ( kci , p ) {
u64 pend = p - > offset + p - > len ;
if ( p = = k )
continue ;
if ( p - > offset > = k - > offset & & pend < = kend )
p - > remaps = k ;
}
}
2018-05-22 13:54:41 +03:00
static void kcore_copy__layout ( struct kcore_copy_info * kci )
{
struct phdr_data * p ;
off_t rel = 0 ;
2018-05-22 13:54:45 +03:00
kcore_copy__find_remaps ( kci ) ;
2018-05-22 13:54:41 +03:00
kcore_copy__for_each_phdr ( kci , p ) {
2018-05-22 13:54:45 +03:00
if ( ! p - > remaps ) {
p - > rel = rel ;
rel + = p - > len ;
}
2018-05-22 13:54:41 +03:00
kci - > phnum + = 1 ;
}
2018-05-22 13:54:45 +03:00
kcore_copy__for_each_phdr ( kci , p ) {
struct phdr_data * k = p - > remaps ;
if ( k )
p - > rel = p - > offset - k - > offset + k - > rel ;
}
2018-05-22 13:54:41 +03:00
}
2013-10-14 17:57:29 +04:00
static int kcore_copy__calc_maps ( struct kcore_copy_info * kci , const char * dir ,
Elf * elf )
{
if ( kcore_copy__parse_kallsyms ( kci , dir ) )
return - 1 ;
if ( kcore_copy__parse_modules ( kci , dir ) )
return - 1 ;
if ( kci - > stext )
kci - > stext = round_down ( kci - > stext , page_size ) ;
else
kci - > stext = round_down ( kci - > first_symbol , page_size ) ;
if ( kci - > etext ) {
kci - > etext = round_up ( kci - > etext , page_size ) ;
} else if ( kci - > last_symbol ) {
kci - > etext = round_up ( kci - > last_symbol , page_size ) ;
kci - > etext + = page_size ;
}
2020-05-12 15:19:16 +03:00
if ( kci - > first_module_symbol & &
( ! kci - > first_module | | kci - > first_module_symbol < kci - > first_module ) )
kci - > first_module = kci - > first_module_symbol ;
2013-10-14 17:57:29 +04:00
kci - > first_module = round_down ( kci - > first_module , page_size ) ;
if ( kci - > last_module_symbol ) {
kci - > last_module_symbol = round_up ( kci - > last_module_symbol ,
page_size ) ;
kci - > last_module_symbol + = page_size ;
}
if ( ! kci - > stext | | ! kci - > etext )
return - 1 ;
if ( kci - > first_module & & ! kci - > last_module_symbol )
return - 1 ;
2018-05-22 13:54:41 +03:00
if ( kcore_copy__read_maps ( kci , elf ) )
return - 1 ;
kcore_copy__layout ( kci ) ;
return 0 ;
2013-10-14 17:57:29 +04:00
}
static int kcore_copy__copy_file ( const char * from_dir , const char * to_dir ,
const char * name )
{
char from_filename [ PATH_MAX ] ;
char to_filename [ PATH_MAX ] ;
scnprintf ( from_filename , PATH_MAX , " %s/%s " , from_dir , name ) ;
scnprintf ( to_filename , PATH_MAX , " %s/%s " , to_dir , name ) ;
return copyfile_mode ( from_filename , to_filename , 0400 ) ;
}
static int kcore_copy__unlink ( const char * dir , const char * name )
{
char filename [ PATH_MAX ] ;
scnprintf ( filename , PATH_MAX , " %s/%s " , dir , name ) ;
return unlink ( filename ) ;
}
static int kcore_copy__compare_fds ( int from , int to )
{
char * buf_from ;
char * buf_to ;
ssize_t ret ;
size_t len ;
int err = - 1 ;
buf_from = malloc ( page_size ) ;
buf_to = malloc ( page_size ) ;
if ( ! buf_from | | ! buf_to )
goto out ;
while ( 1 ) {
/* Use read because mmap won't work on proc files */
ret = read ( from , buf_from , page_size ) ;
if ( ret < 0 )
goto out ;
if ( ! ret )
break ;
len = ret ;
if ( readn ( to , buf_to , len ) ! = ( int ) len )
goto out ;
if ( memcmp ( buf_from , buf_to , len ) )
goto out ;
}
err = 0 ;
out :
free ( buf_to ) ;
free ( buf_from ) ;
return err ;
}
static int kcore_copy__compare_files ( const char * from_filename ,
const char * to_filename )
{
int from , to , err = - 1 ;
from = open ( from_filename , O_RDONLY ) ;
if ( from < 0 )
return - 1 ;
to = open ( to_filename , O_RDONLY ) ;
if ( to < 0 )
goto out_close_from ;
err = kcore_copy__compare_fds ( from , to ) ;
close ( to ) ;
out_close_from :
close ( from ) ;
return err ;
}
static int kcore_copy__compare_file ( const char * from_dir , const char * to_dir ,
const char * name )
{
char from_filename [ PATH_MAX ] ;
char to_filename [ PATH_MAX ] ;
scnprintf ( from_filename , PATH_MAX , " %s/%s " , from_dir , name ) ;
scnprintf ( to_filename , PATH_MAX , " %s/%s " , to_dir , name ) ;
return kcore_copy__compare_files ( from_filename , to_filename ) ;
}
/**
* kcore_copy - copy kallsyms , modules and kcore from one directory to another .
* @ from_dir : from directory
* @ to_dir : to directory
*
* This function copies kallsyms , modules and kcore files from one directory to
* another . kallsyms and modules are copied entirely . Only code segments are
* copied from kcore . It is assumed that two segments suffice : one for the
* kernel proper and one for all the modules . The code segments are determined
* from kallsyms and modules files . The kernel map starts at _stext or the
* lowest function symbol , and ends at _etext or the highest function symbol .
* The module map starts at the lowest module address and ends at the highest
* module symbol . Start addresses are rounded down to the nearest page . End
* addresses are rounded up to the nearest page . An extra page is added to the
* highest kernel symbol and highest module symbol to , hopefully , encompass that
* symbol too . Because it contains only code sections , the resulting kcore is
* unusual . One significant peculiarity is that the mapping ( start - > pgoff )
* is not the same for the kernel map and the modules map . That happens because
* the data is copied adjacently whereas the original kcore has gaps . Finally ,
2022-09-14 15:24:29 +03:00
* kallsyms file is compared with its copy to check that modules have not been
* loaded or unloaded while the copies were taking place .
2013-10-14 17:57:29 +04:00
*
* Return : % 0 on success , % - 1 on failure .
*/
int kcore_copy ( const char * from_dir , const char * to_dir )
{
struct kcore kcore ;
struct kcore extract ;
int idx = 0 , err = - 1 ;
2018-05-22 13:54:42 +03:00
off_t offset , sz ;
2013-10-14 17:57:29 +04:00
struct kcore_copy_info kci = { . stext = 0 , } ;
char kcore_filename [ PATH_MAX ] ;
char extract_filename [ PATH_MAX ] ;
2018-05-22 13:54:42 +03:00
struct phdr_data * p ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:38 +03:00
INIT_LIST_HEAD ( & kci . phdrs ) ;
2018-05-22 13:54:44 +03:00
INIT_LIST_HEAD ( & kci . syms ) ;
2018-05-22 13:54:38 +03:00
2013-10-14 17:57:29 +04:00
if ( kcore_copy__copy_file ( from_dir , to_dir , " kallsyms " ) )
return - 1 ;
if ( kcore_copy__copy_file ( from_dir , to_dir , " modules " ) )
goto out_unlink_kallsyms ;
scnprintf ( kcore_filename , PATH_MAX , " %s/kcore " , from_dir ) ;
scnprintf ( extract_filename , PATH_MAX , " %s/kcore " , to_dir ) ;
if ( kcore__open ( & kcore , kcore_filename ) )
goto out_unlink_modules ;
if ( kcore_copy__calc_maps ( & kci , from_dir , kcore . elf ) )
goto out_kcore_close ;
if ( kcore__init ( & extract , extract_filename , kcore . elfclass , false ) )
goto out_kcore_close ;
2018-05-22 13:54:39 +03:00
if ( kcore__copy_hdr ( & kcore , & extract , kci . phnum ) )
2013-10-14 17:57:29 +04:00
goto out_extract_close ;
2018-05-22 13:54:40 +03:00
offset = gelf_fsize ( extract . elf , ELF_T_EHDR , 1 , EV_CURRENT ) +
gelf_fsize ( extract . elf , ELF_T_PHDR , kci . phnum , EV_CURRENT ) ;
offset = round_up ( offset , page_size ) ;
2018-05-22 13:54:42 +03:00
kcore_copy__for_each_phdr ( & kci , p ) {
off_t offs = p - > rel + offset ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:42 +03:00
if ( kcore__add_phdr ( & extract , idx + + , offs , p - > addr , p - > len ) )
2013-10-14 17:57:29 +04:00
goto out_extract_close ;
}
sz = kcore__write ( & extract ) ;
if ( sz < 0 | | sz > offset )
goto out_extract_close ;
2018-05-22 13:54:42 +03:00
kcore_copy__for_each_phdr ( & kci , p ) {
off_t offs = p - > rel + offset ;
2013-10-14 17:57:29 +04:00
2018-05-22 13:54:45 +03:00
if ( p - > remaps )
continue ;
2018-05-22 13:54:42 +03:00
if ( copy_bytes ( kcore . fd , p - > offset , extract . fd , offs , p - > len ) )
goto out_extract_close ;
}
2013-10-14 17:57:29 +04:00
if ( kcore_copy__compare_file ( from_dir , to_dir , " kallsyms " ) )
goto out_extract_close ;
err = 0 ;
out_extract_close :
kcore__close ( & extract ) ;
if ( err )
unlink ( extract_filename ) ;
out_kcore_close :
kcore__close ( & kcore ) ;
out_unlink_modules :
if ( err )
kcore_copy__unlink ( to_dir , " modules " ) ;
out_unlink_kallsyms :
if ( err )
kcore_copy__unlink ( to_dir , " kallsyms " ) ;
2018-05-22 13:54:43 +03:00
kcore_copy__free_phdrs ( & kci ) ;
2018-05-22 13:54:44 +03:00
kcore_copy__free_syms ( & kci ) ;
2018-05-22 13:54:43 +03:00
2013-10-14 17:57:29 +04:00
return err ;
}
2013-10-09 16:01:12 +04:00
int kcore_extract__create ( struct kcore_extract * kce )
{
struct kcore kcore ;
struct kcore extract ;
size_t count = 1 ;
int idx = 0 , err = - 1 ;
off_t offset = page_size , sz ;
if ( kcore__open ( & kcore , kce - > kcore_filename ) )
return - 1 ;
strcpy ( kce - > extract_filename , PERF_KCORE_EXTRACT ) ;
if ( kcore__init ( & extract , kce - > extract_filename , kcore . elfclass , true ) )
goto out_kcore_close ;
if ( kcore__copy_hdr ( & kcore , & extract , count ) )
goto out_extract_close ;
if ( kcore__add_phdr ( & extract , idx , offset , kce - > addr , kce - > len ) )
goto out_extract_close ;
sz = kcore__write ( & extract ) ;
if ( sz < 0 | | sz > offset )
goto out_extract_close ;
if ( copy_bytes ( kcore . fd , kce - > offs , extract . fd , offset , kce - > len ) )
goto out_extract_close ;
err = 0 ;
out_extract_close :
kcore__close ( & extract ) ;
if ( err )
unlink ( kce - > extract_filename ) ;
out_kcore_close :
kcore__close ( & kcore ) ;
return err ;
}
void kcore_extract__delete ( struct kcore_extract * kce )
{
unlink ( kce - > extract_filename ) ;
}
2016-07-12 18:19:09 +03:00
# ifdef HAVE_GELF_GETNOTE_SUPPORT
2018-08-20 07:42:50 +03:00
static void sdt_adjust_loc ( struct sdt_note * tmp , GElf_Addr base_off )
{
if ( ! base_off )
return ;
if ( tmp - > bit32 )
tmp - > addr . a32 [ SDT_NOTE_IDX_LOC ] =
tmp - > addr . a32 [ SDT_NOTE_IDX_LOC ] + base_off -
tmp - > addr . a32 [ SDT_NOTE_IDX_BASE ] ;
else
tmp - > addr . a64 [ SDT_NOTE_IDX_LOC ] =
tmp - > addr . a64 [ SDT_NOTE_IDX_LOC ] + base_off -
tmp - > addr . a64 [ SDT_NOTE_IDX_BASE ] ;
}
static void sdt_adjust_refctr ( struct sdt_note * tmp , GElf_Addr base_addr ,
GElf_Addr base_off )
{
if ( ! base_off )
return ;
if ( tmp - > bit32 & & tmp - > addr . a32 [ SDT_NOTE_IDX_REFCTR ] )
tmp - > addr . a32 [ SDT_NOTE_IDX_REFCTR ] - = ( base_addr - base_off ) ;
else if ( tmp - > addr . a64 [ SDT_NOTE_IDX_REFCTR ] )
tmp - > addr . a64 [ SDT_NOTE_IDX_REFCTR ] - = ( base_addr - base_off ) ;
}
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
/**
* populate_sdt_note : Parse raw data and identify SDT note
* @ elf : elf of the opened file
* @ data : raw data of a section with description offset applied
* @ len : note description size
* @ type : type of the note
* @ sdt_notes : List to add the SDT note
*
* Responsible for parsing the @ data in section . note . stapsdt in @ elf and
* if its an SDT note , it appends to @ sdt_notes list .
*/
static int populate_sdt_note ( Elf * * elf , const char * data , size_t len ,
struct list_head * sdt_notes )
{
2016-12-14 03:07:31 +03:00
const char * provider , * name , * args ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
struct sdt_note * tmp = NULL ;
GElf_Ehdr ehdr ;
GElf_Shdr shdr ;
int ret = - EINVAL ;
union {
Elf64_Addr a64 [ NR_ADDR ] ;
Elf32_Addr a32 [ NR_ADDR ] ;
} buf ;
Elf_Data dst = {
. d_buf = & buf , . d_type = ELF_T_ADDR , . d_version = EV_CURRENT ,
. d_size = gelf_fsize ( ( * elf ) , ELF_T_ADDR , NR_ADDR , EV_CURRENT ) ,
. d_off = 0 , . d_align = 0
} ;
Elf_Data src = {
. d_buf = ( void * ) data , . d_type = ELF_T_ADDR ,
. d_version = EV_CURRENT , . d_size = dst . d_size , . d_off = 0 ,
. d_align = 0
} ;
tmp = ( struct sdt_note * ) calloc ( 1 , sizeof ( struct sdt_note ) ) ;
if ( ! tmp ) {
ret = - ENOMEM ;
goto out_err ;
}
INIT_LIST_HEAD ( & tmp - > note_list ) ;
if ( len < dst . d_size + 3 )
goto out_free_note ;
/* Translation from file representation to memory representation */
if ( gelf_xlatetom ( * elf , & dst , & src ,
elf_getident ( * elf , NULL ) [ EI_DATA ] ) = = NULL ) {
pr_err ( " gelf_xlatetom : %s \n " , elf_errmsg ( - 1 ) ) ;
goto out_free_note ;
}
/* Populate the fields of sdt_note */
provider = data + dst . d_size ;
name = ( const char * ) memchr ( provider , ' \0 ' , data + len - provider ) ;
if ( name + + = = NULL )
goto out_free_note ;
tmp - > provider = strdup ( provider ) ;
if ( ! tmp - > provider ) {
ret = - ENOMEM ;
goto out_free_note ;
}
tmp - > name = strdup ( name ) ;
if ( ! tmp - > name ) {
ret = - ENOMEM ;
goto out_free_prov ;
}
2016-12-14 03:07:31 +03:00
args = memchr ( name , ' \0 ' , data + len - name ) ;
/*
* There is no argument if :
* - We reached the end of the note ;
* - There is not enough room to hold a potential string ;
* - The argument string is empty or just contains ' : ' .
*/
if ( args = = NULL | | data + len - args < 2 | |
args [ 1 ] = = ' : ' | | args [ 1 ] = = ' \0 ' )
tmp - > args = NULL ;
else {
tmp - > args = strdup ( + + args ) ;
if ( ! tmp - > args ) {
ret = - ENOMEM ;
goto out_free_name ;
}
}
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
if ( gelf_getclass ( * elf ) = = ELFCLASS32 ) {
memcpy ( & tmp - > addr , & buf , 3 * sizeof ( Elf32_Addr ) ) ;
tmp - > bit32 = true ;
} else {
memcpy ( & tmp - > addr , & buf , 3 * sizeof ( Elf64_Addr ) ) ;
tmp - > bit32 = false ;
}
if ( ! gelf_getehdr ( * elf , & ehdr ) ) {
pr_debug ( " %s : cannot get elf header. \n " , __func__ ) ;
ret = - EBADF ;
2016-12-14 03:07:31 +03:00
goto out_free_args ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
}
/* Adjust the prelink effect :
* Find out the . stapsdt . base section .
* This scn will help us to handle prelinking ( if present ) .
* Compare the retrieved file offset of the base section with the
* base address in the description of the SDT note . If its different ,
* then accordingly , adjust the note location .
*/
2018-08-20 07:42:50 +03:00
if ( elf_section_by_name ( * elf , & ehdr , & shdr , SDT_BASE_SCN , NULL ) )
sdt_adjust_loc ( tmp , shdr . sh_offset ) ;
/* Adjust reference counter offset */
if ( elf_section_by_name ( * elf , & ehdr , & shdr , SDT_PROBES_SCN , NULL ) )
sdt_adjust_refctr ( tmp , shdr . sh_addr , shdr . sh_offset ) ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
list_add_tail ( & tmp - > note_list , sdt_notes ) ;
return 0 ;
2016-12-14 03:07:31 +03:00
out_free_args :
2019-07-04 18:06:20 +03:00
zfree ( & tmp - > args ) ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
out_free_name :
2019-07-04 18:06:20 +03:00
zfree ( & tmp - > name ) ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
out_free_prov :
2019-07-04 18:06:20 +03:00
zfree ( & tmp - > provider ) ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
out_free_note :
free ( tmp ) ;
out_err :
return ret ;
}
/**
* construct_sdt_notes_list : constructs a list of SDT notes
* @ elf : elf to look into
* @ sdt_notes : empty list_head
*
* Scans the sections in ' elf ' for the section
* . note . stapsdt . It , then calls populate_sdt_note to find
* out the SDT events and populates the ' sdt_notes ' .
*/
static int construct_sdt_notes_list ( Elf * elf , struct list_head * sdt_notes )
{
GElf_Ehdr ehdr ;
Elf_Scn * scn = NULL ;
Elf_Data * data ;
GElf_Shdr shdr ;
size_t shstrndx , next ;
GElf_Nhdr nhdr ;
size_t name_off , desc_off , offset ;
int ret = 0 ;
if ( gelf_getehdr ( elf , & ehdr ) = = NULL ) {
ret = - EBADF ;
goto out_ret ;
}
if ( elf_getshdrstrndx ( elf , & shstrndx ) ! = 0 ) {
ret = - EBADF ;
goto out_ret ;
}
/* Look for the required section */
scn = elf_section_by_name ( elf , & ehdr , & shdr , SDT_NOTE_SCN , NULL ) ;
if ( ! scn ) {
ret = - ENOENT ;
goto out_ret ;
}
if ( ( shdr . sh_type ! = SHT_NOTE ) | | ( shdr . sh_flags & SHF_ALLOC ) ) {
ret = - ENOENT ;
goto out_ret ;
}
data = elf_getdata ( scn , NULL ) ;
/* Get the SDT notes */
for ( offset = 0 ; ( next = gelf_getnote ( data , offset , & nhdr , & name_off ,
& desc_off ) ) > 0 ; offset = next ) {
if ( nhdr . n_namesz = = sizeof ( SDT_NOTE_NAME ) & &
! memcmp ( data - > d_buf + name_off , SDT_NOTE_NAME ,
sizeof ( SDT_NOTE_NAME ) ) ) {
/* Check the type of the note */
if ( nhdr . n_type ! = SDT_NOTE_TYPE )
goto out_ret ;
ret = populate_sdt_note ( & elf , ( ( data - > d_buf ) + desc_off ) ,
nhdr . n_descsz , sdt_notes ) ;
if ( ret < 0 )
goto out_ret ;
}
}
if ( list_empty ( sdt_notes ) )
ret = - ENOENT ;
out_ret :
return ret ;
}
/**
* get_sdt_note_list : Wrapper to construct a list of sdt notes
* @ head : empty list_head
* @ target : file to find SDT notes from
*
* This opens the file , initializes
* the ELF and then calls construct_sdt_notes_list .
*/
int get_sdt_note_list ( struct list_head * head , const char * target )
{
Elf * elf ;
int fd , ret ;
fd = open ( target , O_RDONLY ) ;
if ( fd < 0 )
return - EBADF ;
elf = elf_begin ( fd , PERF_ELF_C_READ_MMAP , NULL ) ;
if ( ! elf ) {
ret = - EBADF ;
goto out_close ;
}
ret = construct_sdt_notes_list ( elf , head ) ;
elf_end ( elf ) ;
out_close :
close ( fd ) ;
return ret ;
}
/**
* cleanup_sdt_note_list : free the sdt notes ' list
* @ sdt_notes : sdt notes ' list
*
* Free up the SDT notes in @ sdt_notes .
* Returns the number of SDT notes free ' d .
*/
int cleanup_sdt_note_list ( struct list_head * sdt_notes )
{
struct sdt_note * tmp , * pos ;
int nr_free = 0 ;
list_for_each_entry_safe ( pos , tmp , sdt_notes , note_list ) {
2019-07-04 18:13:46 +03:00
list_del_init ( & pos - > note_list ) ;
2021-06-03 01:08:33 +03:00
zfree ( & pos - > args ) ;
2019-07-04 18:06:20 +03:00
zfree ( & pos - > name ) ;
zfree ( & pos - > provider ) ;
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
free ( pos ) ;
nr_free + + ;
}
return nr_free ;
}
/**
* sdt_notes__get_count : Counts the number of sdt events
* @ start : list_head to sdt_notes list
*
* Returns the number of SDT notes in a list
*/
int sdt_notes__get_count ( struct list_head * start )
{
struct sdt_note * sdt_ptr ;
int count = 0 ;
list_for_each_entry ( sdt_ptr , start , note_list )
count + + ;
return count ;
}
2016-07-12 18:19:09 +03:00
# endif
perf sdt: ELF support for SDT
This patch serves the initial support to identify and list SDT events in
binaries. When programs containing SDT markers are compiled, gcc with
the help of assembler directives identifies them and places them in the
section ".note.stapsdt".
To find these markers from the binaries, one needs to traverse through
this section and parse the relevant details like the name, type and
location of the marker. Also, the original location could be skewed due
to the effect of prelinking. If that is the case, the locations need to
be adjusted.
The functions in this patch open a given ELF, find out the SDT section,
parse the relevant details, adjust the location (if necessary) and
populate them in a list.
A typical note entry in ".note.stapsdt" section is as follows :
|--nhdr.n_namesz--|
------------------------------------
| nhdr | "stapsdt" |
----- |----------------------------------|
| | <location> <base_address> |
| | <semaphore> |
nhdr.n_descsize | "provider_name" "note_name" |
| | <args> |
----- |----------------------------------|
| nhdr | "stapsdt" |
|...
The above shows an excerpt from the section ".note.stapsdt". 'nhdr' is
a structure which has the note name size (n_namesz), note description
size (n_desc_sz) and note type (n_type).
So, in order to parse the note note info, we need nhdr to tell us where
to start from. As can be seen from <sys/sdt.h>, the name of the SDT
notes given is "stapsdt". But this is not the identifier of the note.
After that, we go to description of the note to find out its location, the
address of the ".stapsdt.base" section and the semaphore address.
Then, we find the provider name and the SDT marker name and then follow the
arguments.
Signed-off-by: Hemant Kumar <hemant@linux.vnet.ibm.com>
Reviewed-by: Masami Hiramatsu <mhiramat@kernel.org>
Acked-by: Namhyung Kim <namhyung@kernel.org>
Cc: Ananth N Mavinakayanahalli <ananth@linux.vnet.ibm.com>
Cc: Brendan Gregg <brendan.d.gregg@gmail.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: http://lkml.kernel.org/r/146736022628.27797.1201368329092908163.stgit@devbox
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-07-01 11:03:46 +03:00
2012-08-06 08:41:20 +04:00
void symbol__elf_init ( void )
{
elf_version ( EV_CURRENT ) ;
}