License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 17:07:57 +03:00
// SPDX-License-Identifier: GPL-2.0
2011-01-23 01:37:02 +03:00
# include <inttypes.h>
2010-03-26 01:58:58 +03:00
# include <limits.h>
2022-02-11 13:34:01 +03:00
# include <stdio.h>
2009-08-12 13:07:25 +04:00
# include <stdlib.h>
# include <string.h>
2022-02-11 13:34:01 +03:00
# include <linux/string.h>
# include <linux/zalloc.h>
2016-09-12 22:47:57 +03:00
# include <uapi/linux/mman.h> /* To get things like MAP_HUGETLB even on older libc headers */
2022-02-11 13:34:01 +03:00
# include "debug.h"
2019-08-30 17:11:01 +03:00
# include "dso.h"
2010-03-26 01:58:58 +03:00
# include "map.h"
2022-02-11 13:34:01 +03:00
# include "namespaces.h"
# include "srcline.h"
# include "symbol.h"
2012-07-21 03:25:47 +04:00
# include "thread.h"
2012-09-10 20:50:19 +04:00
# include "vdso.h"
2015-05-22 19:45:24 +03:00
2014-05-20 13:48:50 +04:00
static inline int is_android_lib ( const char * filename )
{
2020-03-09 22:53:41 +03:00
return strstarts ( filename , " /data/app-lib/ " ) | |
strstarts ( filename , " /system/lib/ " ) ;
2014-05-20 13:48:50 +04:00
}
static inline bool replace_android_lib ( const char * filename , char * newfilename )
{
const char * libname ;
char * app_abi ;
size_t app_abi_length , new_length ;
size_t lib_length = 0 ;
libname = strrchr ( filename , ' / ' ) ;
if ( libname )
lib_length = strlen ( libname ) ;
app_abi = getenv ( " APP_ABI " ) ;
if ( ! app_abi )
return false ;
app_abi_length = strlen ( app_abi ) ;
2020-03-09 22:53:41 +03:00
if ( strstarts ( filename , " /data/app-lib/ " ) ) {
2014-05-20 13:48:50 +04:00
char * apk_path ;
if ( ! app_abi_length )
return false ;
new_length = 7 + app_abi_length + lib_length ;
apk_path = getenv ( " APK_PATH " ) ;
if ( apk_path ) {
new_length + = strlen ( apk_path ) + 1 ;
if ( new_length > PATH_MAX )
return false ;
snprintf ( newfilename , new_length ,
" %s/libs/%s/%s " , apk_path , app_abi , libname ) ;
} else {
if ( new_length > PATH_MAX )
return false ;
snprintf ( newfilename , new_length ,
" libs/%s/%s " , app_abi , libname ) ;
}
return true ;
}
2020-03-09 22:53:41 +03:00
if ( strstarts ( filename , " /system/lib/ " ) ) {
2014-05-20 13:48:50 +04:00
char * ndk , * app ;
const char * arch ;
2021-03-05 16:02:09 +03:00
int ndk_length , app_length ;
2014-05-20 13:48:50 +04:00
ndk = getenv ( " NDK_ROOT " ) ;
app = getenv ( " APP_PLATFORM " ) ;
if ( ! ( ndk & & app ) )
return false ;
ndk_length = strlen ( ndk ) ;
app_length = strlen ( app ) ;
if ( ! ( ndk_length & & app_length & & app_abi_length ) )
return false ;
arch = ! strncmp ( app_abi , " arm " , 3 ) ? " arm " :
! strncmp ( app_abi , " mips " , 4 ) ? " mips " :
! strncmp ( app_abi , " x86 " , 3 ) ? " x86 " : NULL ;
if ( ! arch )
return false ;
new_length = 27 + ndk_length +
app_length + lib_length
+ strlen ( arch ) ;
if ( new_length > PATH_MAX )
return false ;
snprintf ( newfilename , new_length ,
2021-03-05 16:02:09 +03:00
" %.*s/platforms/%.*s/arch-%s/usr/lib/%s " ,
ndk_length , ndk , app_length , app , arch , libname ) ;
2014-05-20 13:48:50 +04:00
return true ;
}
return false ;
}
2018-04-26 22:52:34 +03:00
void map__init ( struct map * map , u64 start , u64 end , u64 pgoff , struct dso * dso )
2009-10-30 21:28:23 +03:00
{
2023-04-19 18:24:12 +03:00
map__set_start ( map , start ) ;
map__set_end ( map , end ) ;
map__set_pgoff ( map , pgoff ) ;
map__set_reloc ( map , 0 ) ;
map__set_dso ( map , dso__get ( dso ) ) ;
map__set_map_ip ( map , map__dso_map_ip ) ;
map__set_unmap_ip ( map , map__dso_unmap_ip ) ;
map__set_erange_warned ( map , false ) ;
2023-04-19 16:57:57 +03:00
refcount_set ( map__refcnt ( map ) , 1 ) ;
2009-10-30 21:28:23 +03:00
}
2014-07-22 17:17:53 +04:00
struct map * map__new ( struct machine * machine , u64 start , u64 len ,
2019-11-19 18:40:29 +03:00
u64 pgoff , struct dso_id * id ,
2020-12-14 13:54:49 +03:00
u32 prot , u32 flags , struct build_id * bid ,
char * filename , struct thread * thread )
2009-08-12 13:07:25 +04:00
{
2023-04-19 18:57:53 +03:00
struct map * result ;
RC_STRUCT ( map ) * map ;
2017-07-06 04:48:09 +03:00
struct nsinfo * nsi = NULL ;
struct nsinfo * nnsi ;
2009-08-12 13:07:25 +04:00
2023-04-19 18:57:53 +03:00
map = malloc ( sizeof ( * map ) ) ;
if ( ADD_RC_CHK ( result , map ) ) {
2009-08-12 13:07:25 +04:00
char newfilename [ PATH_MAX ] ;
2022-03-04 12:09:56 +03:00
struct dso * dso , * header_bid_dso ;
2014-05-20 13:48:50 +04:00
int anon , no_dso , vdso , android ;
2009-08-12 13:07:25 +04:00
2014-05-20 13:48:50 +04:00
android = is_android_lib ( filename ) ;
perf inject: Do not load map/dso when injecting build-id
No need to load symbols in a DSO when injecting build-id. I guess the
reason was to check the DSO is a special file like anon files. Use some
helper functions in map.c to check them before reading build-id. Also
pass sample event's cpumode to a new build-id event.
It brought a speedup in the benchmark of 25 -> 21 msec on my laptop.
Also the memory usage (Max RSS) went down by ~200 KB.
# Running 'internals/inject-build-id' benchmark:
Average build-id injection took: 21.389 msec (+- 0.138 msec)
Average time per event: 2.097 usec (+- 0.014 usec)
Average memory usage: 8225 KB (+- 0 KB)
Committer notes:
Before:
$ perf stat -r5 perf bench internals inject-build-id > /dev/null
Performance counter stats for 'perf bench internals inject-build-id' (5 runs):
4,020.56 msec task-clock:u # 1.271 CPUs utilized ( +- 0.74% )
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
123,354 page-faults:u # 0.031 M/sec ( +- 0.81% )
7,119,951,568 cycles:u # 1.771 GHz ( +- 1.74% ) (83.27%)
230,086,969 stalled-cycles-frontend:u # 3.23% frontend cycles idle ( +- 1.97% ) (83.41%)
1,168,298,765 stalled-cycles-backend:u # 16.41% backend cycles idle ( +- 1.13% ) (83.44%)
11,173,083,669 instructions:u # 1.57 insn per cycle
# 0.10 stalled cycles per insn ( +- 1.58% ) (83.31%)
2,413,908,936 branches:u # 600.392 M/sec ( +- 1.69% ) (83.26%)
46,576,289 branch-misses:u # 1.93% of all branches ( +- 2.20% ) (83.31%)
3.1638 +- 0.0309 seconds time elapsed ( +- 0.98% )
$
After:
$ perf stat -r5 perf bench internals inject-build-id > /dev/null
Performance counter stats for 'perf bench internals inject-build-id' (5 runs):
2,379.94 msec task-clock:u # 1.473 CPUs utilized ( +- 0.18% )
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
62,584 page-faults:u # 0.026 M/sec ( +- 0.07% )
2,372,389,668 cycles:u # 0.997 GHz ( +- 0.29% ) (83.14%)
106,937,862 stalled-cycles-frontend:u # 4.51% frontend cycles idle ( +- 4.89% ) (83.20%)
581,697,915 stalled-cycles-backend:u # 24.52% backend cycles idle ( +- 0.71% ) (83.47%)
3,659,692,199 instructions:u # 1.54 insn per cycle
# 0.16 stalled cycles per insn ( +- 0.10% ) (83.63%)
791,372,961 branches:u # 332.518 M/sec ( +- 0.27% ) (83.39%)
10,648,083 branch-misses:u # 1.35% of all branches ( +- 0.22% ) (83.16%)
1.61570 +- 0.00172 seconds time elapsed ( +- 0.11% )
$
Signed-off-by: Namhyung Kim <namhyung@kernel.org>
Original-patch-by: Stephane Eranian <eranian@google.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Link: https://lore.kernel.org/r/20201012070214.2074921-5-namhyung@kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2020-10-12 10:02:12 +03:00
anon = is_anon_memory ( filename ) | | flags & MAP_HUGETLB ;
2012-09-10 20:50:19 +04:00
vdso = is_vdso_map ( filename ) ;
2011-08-24 17:18:34 +04:00
no_dso = is_no_dso_memory ( filename ) ;
2014-05-19 23:13:49 +04:00
map - > prot = prot ;
map - > flags = flags ;
2023-06-09 02:28:00 +03:00
nsi = nsinfo__get ( thread__nsinfo ( thread ) ) ;
2013-08-21 14:10:25 +04:00
2018-04-26 22:08:38 +03:00
if ( ( anon | | no_dso ) & & nsi & & ( prot & PROT_EXEC ) ) {
2017-07-06 04:48:09 +03:00
snprintf ( newfilename , sizeof ( newfilename ) ,
2022-02-11 13:34:06 +03:00
" /tmp/perf-%d.map " , nsinfo__pid ( nsi ) ) ;
2009-08-12 13:07:25 +04:00
filename = newfilename ;
}
2014-05-20 13:48:50 +04:00
if ( android ) {
if ( replace_android_lib ( filename , newfilename ) )
filename = newfilename ;
}
2012-09-10 20:50:19 +04:00
if ( vdso ) {
2017-07-06 04:48:09 +03:00
/* The vdso maps are always on the host and not the
* container . Ensure that we don ' t use setns to look
* them up .
*/
nnsi = nsinfo__copy ( nsi ) ;
if ( nnsi ) {
nsinfo__put ( nsi ) ;
2022-02-11 13:34:06 +03:00
nsinfo__clear_need_setns ( nnsi ) ;
2017-07-06 04:48:09 +03:00
nsi = nnsi ;
}
2012-09-10 20:50:19 +04:00
pgoff = 0 ;
perf machine: Fix up vdso methods names
To make it consistent with the other dso lifetime routines.
For instance:
struct dso *vdso__new(struct machine *machine, const char *short_name,
const char *long_name)
Becomes:
struct dso *machine__addnew_vdso(struct machine *machine, const
char *short_name, const char *long_name)
Because:
1) There is no 'struct vdso' for us to have vdso__ prefixed routines.
2) Because it will not really just create a new instance of 'struct
dso', it'll call dso__new() but it will also insert it into the
DSO's list/rbtree, and we have a method name for that: 'addnew',
just like we have dsos__addnew().
3) So it is really a 'struct machine' operation, it is the first
argument, etc.
This way the place where this is used gets consistent:
if (vdso) {
pgoff = 0;
- dso = vdso__dso_findnew(machine, thread);
+ dso = machine__findnew_vdso(machine, thread);
} else
dso = machine__findnew_dso(machine, filename);
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: http://lkml.kernel.org/n/tip-r3w3tvh8exm9xfz3p4tz9qbz@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-05-29 17:54:08 +03:00
dso = machine__findnew_vdso ( machine , thread ) ;
2012-09-10 20:50:19 +04:00
} else
perf dso: Move dso_id from 'struct map' to 'struct dso'
And take it into account when looking up DSOs when we have the dso_id
fields obtained from somewhere, like from PERF_RECORD_MMAP2 records.
Instances of struct map pointing to the same DSO pathname but with
anything in dso_id different are in fact different DSOs, so better have
different 'struct dso' instances to reflect that. At some point we may
want to get copies of the contents of the different objects if we want
to do correct annotation or other analysis.
With this we get 'struct map' 24 bytes leaner:
$ pahole -C map ~/bin/perf
struct map {
union {
struct rb_node rb_node __attribute__((__aligned__(8))); /* 0 24 */
struct list_head node; /* 0 16 */
} __attribute__((__aligned__(8))); /* 0 24 */
u64 start; /* 24 8 */
u64 end; /* 32 8 */
_Bool erange_warned:1; /* 40: 0 1 */
_Bool priv:1; /* 40: 1 1 */
/* XXX 6 bits hole, try to pack */
/* XXX 3 bytes hole, try to pack */
u32 prot; /* 44 4 */
u64 pgoff; /* 48 8 */
u64 reloc; /* 56 8 */
/* --- cacheline 1 boundary (64 bytes) --- */
u64 (*map_ip)(struct map *, u64); /* 64 8 */
u64 (*unmap_ip)(struct map *, u64); /* 72 8 */
struct dso * dso; /* 80 8 */
refcount_t refcnt; /* 88 4 */
u32 flags; /* 92 4 */
/* size: 96, cachelines: 2, members: 13 */
/* sum members: 92, holes: 1, sum holes: 3 */
/* sum bitfield members: 2 bits, bit holes: 1, sum bit holes: 6 bits */
/* forced alignments: 1 */
/* last cacheline: 32 bytes */
} __attribute__((__aligned__(8)));
$
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Link: https://lkml.kernel.org/n/tip-g4hxxmraplo7wfjmk384mfsb@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2019-11-20 00:44:22 +03:00
dso = machine__findnew_dso_id ( machine , filename , id ) ;
2012-09-10 20:50:19 +04:00
2009-10-30 21:28:23 +03:00
if ( dso = = NULL )
2009-08-12 13:07:25 +04:00
goto out_delete ;
2023-04-19 18:57:53 +03:00
map__init ( result , start , start + len , pgoff , dso ) ;
2009-10-30 21:28:23 +03:00
2011-08-24 17:18:34 +04:00
if ( anon | | no_dso ) {
2013-01-25 04:59:59 +04:00
map - > map_ip = map - > unmap_ip = identity__map_ip ;
2011-08-24 17:18:34 +04:00
/*
* Set memory without DSO as loaded . All map__find_ *
* functions still return NULL , and we avoid the
* unnecessary map__load warning .
*/
2018-04-26 22:08:38 +03:00
if ( ! ( prot & PROT_EXEC ) )
2018-04-26 22:52:34 +03:00
dso__set_loaded ( dso ) ;
2010-02-03 21:52:02 +03:00
}
2022-08-26 19:42:38 +03:00
mutex_lock ( & dso - > lock ) ;
nsinfo__put ( dso - > nsinfo ) ;
2017-07-06 04:48:09 +03:00
dso - > nsinfo = nsi ;
2022-08-26 19:42:38 +03:00
mutex_unlock ( & dso - > lock ) ;
2020-12-14 13:54:49 +03:00
2022-03-04 12:09:56 +03:00
if ( build_id__is_defined ( bid ) ) {
2020-12-14 13:54:49 +03:00
dso__set_build_id ( dso , bid ) ;
2022-03-04 12:09:56 +03:00
} else {
/*
* If the mmap event had no build ID , search for an existing dso from the
* build ID header by name . Otherwise only the dso loaded at the time of
* reading the header will have the build ID set and all future mmaps will
* have it missing .
*/
down_read ( & machine - > dsos . lock ) ;
header_bid_dso = __dsos__find ( & machine - > dsos , filename , false ) ;
up_read ( & machine - > dsos . lock ) ;
if ( header_bid_dso & & header_bid_dso - > header_build_id ) {
dso__set_build_id ( dso , & header_bid_dso - > bid ) ;
dso - > header_build_id = 1 ;
}
}
2015-06-02 17:53:26 +03:00
dso__put ( dso ) ;
2009-08-12 13:07:25 +04:00
}
2023-04-19 18:57:53 +03:00
return result ;
2009-08-12 13:07:25 +04:00
out_delete :
2017-07-06 04:48:09 +03:00
nsinfo__put ( nsi ) ;
2023-04-19 18:57:53 +03:00
RC_CHK_FREE ( result ) ;
2009-08-12 13:07:25 +04:00
return NULL ;
}
2012-08-06 08:41:20 +04:00
/*
* Constructor variant for modules ( where we know from / proc / modules where
* they are loaded ) and for vmlinux , where only after we load all the
* symbols we ' ll know where it starts and ends .
*/
2018-04-26 22:52:34 +03:00
struct map * map__new2 ( u64 start , struct dso * dso )
2012-08-06 08:41:20 +04:00
{
2023-04-19 18:57:53 +03:00
struct map * result ;
RC_STRUCT ( map ) * map ;
map = calloc ( 1 , sizeof ( * map ) + ( dso - > kernel ? sizeof ( struct kmap ) : 0 ) ) ;
if ( ADD_RC_CHK ( result , map ) ) {
2012-08-06 08:41:20 +04:00
/*
* - > end will be filled after we load all the symbols
*/
2023-04-19 18:57:53 +03:00
map__init ( result , start , 0 , 0 , dso ) ;
2012-08-06 08:41:20 +04:00
}
2023-04-19 18:57:53 +03:00
return result ;
2012-08-06 08:41:20 +04:00
}
2015-07-23 17:06:16 +03:00
bool __map__is_kernel ( const struct map * map )
{
2023-03-21 00:22:35 +03:00
if ( ! map__dso ( map ) - > kernel )
2019-11-04 22:31:33 +03:00
return false ;
perf maps: Add functions to access maps
Introduce functions to access struct maps. These functions reduce the
number of places reference counting is necessary. While tidying APIs do
some small const-ification, in particlar to unwind_libunwind_ops.
Committer notes:
Fixed up tools/perf/util/unwind-libunwind.c:
- return ops->get_entries(cb, arg, thread, data, max_stack);
+ return ops->get_entries(cb, arg, thread, data, max_stack, best_effort);
Signed-off-by: Ian Rogers <irogers@google.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexey Bayduraev <alexey.v.bayduraev@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
Cc: Darren Hart <dvhart@infradead.org>
Cc: Davidlohr Bueso <dave@stgolabs.net>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Eric Dumazet <edumazet@google.com>
Cc: German Gomez <german.gomez@arm.com>
Cc: Hao Luo <haoluo@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: James Clark <james.clark@arm.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Kajol Jain <kjain@linux.ibm.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Madhavan Srinivasan <maddy@linux.ibm.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Masami Hiramatsu <mhiramat@kernel.org>
Cc: Miaoqian Lin <linmq006@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Cc: Shunsuke Nakamura <nakamura.shun@fujitsu.com>
Cc: Song Liu <song@kernel.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Stephen Brennan <stephen.s.brennan@oracle.com>
Cc: Steven Rostedt (VMware) <rostedt@goodmis.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Thomas Richter <tmricht@linux.ibm.com>
Cc: Yury Norov <yury.norov@gmail.com>
Link: https://lore.kernel.org/r/20230320212248.1175731-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-03-21 00:22:34 +03:00
return machine__kernel_map ( maps__machine ( map__kmaps ( ( struct map * ) map ) ) ) = = map ;
2015-07-23 17:06:16 +03:00
}
2018-05-22 13:54:35 +03:00
bool __map__is_extra_kernel_map ( const struct map * map )
{
struct kmap * kmap = __map__kmap ( ( struct map * ) map ) ;
return kmap & & kmap - > name [ 0 ] ;
}
2019-04-16 19:01:22 +03:00
bool __map__is_bpf_prog ( const struct map * map )
{
const char * name ;
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( map ) ;
2019-04-16 19:01:22 +03:00
2023-03-21 00:22:35 +03:00
if ( dso - > binary_type = = DSO_BINARY_TYPE__BPF_PROG_INFO )
2019-04-16 19:01:22 +03:00
return true ;
/*
* If PERF_RECORD_BPF_EVENT is not included , the dso will not have
* type of DSO_BINARY_TYPE__BPF_PROG_INFO . In such cases , we can
* guess the type based on name .
*/
2023-03-21 00:22:35 +03:00
name = dso - > short_name ;
2019-04-16 19:01:22 +03:00
return name & & ( strstr ( name , " bpf_prog_ " ) = = name ) ;
}
2020-08-27 00:30:17 +03:00
bool __map__is_bpf_image ( const struct map * map )
{
const char * name ;
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( map ) ;
2020-08-27 00:30:17 +03:00
2023-03-21 00:22:35 +03:00
if ( dso - > binary_type = = DSO_BINARY_TYPE__BPF_IMAGE )
2020-08-27 00:30:17 +03:00
return true ;
/*
* If PERF_RECORD_KSYMBOL is not included , the dso will not have
* type of DSO_BINARY_TYPE__BPF_IMAGE . In such cases , we can
* guess the type based on name .
*/
2023-03-21 00:22:35 +03:00
name = dso - > short_name ;
2020-08-27 00:30:17 +03:00
return name & & is_bpf_image ( name ) ;
}
2020-05-12 15:19:19 +03:00
bool __map__is_ool ( const struct map * map )
{
2023-03-21 00:22:35 +03:00
const struct dso * dso = map__dso ( map ) ;
return dso & & dso - > binary_type = = DSO_BINARY_TYPE__OOL ;
2020-05-12 15:19:19 +03:00
}
2018-04-23 23:13:49 +03:00
bool map__has_symbols ( const struct map * map )
{
2023-03-21 00:22:35 +03:00
return dso__has_symbols ( map__dso ( map ) ) ;
2018-04-23 23:13:49 +03:00
}
2015-06-02 17:53:26 +03:00
static void map__exit ( struct map * map )
2009-11-21 01:51:27 +03:00
{
2023-04-19 16:57:57 +03:00
BUG_ON ( refcount_read ( map__refcnt ( map ) ) ! = 0 ) ;
2023-04-19 18:57:53 +03:00
dso__zput ( RC_CHK_ACCESS ( map ) - > dso ) ;
2015-06-02 17:53:26 +03:00
}
void map__delete ( struct map * map )
{
map__exit ( map ) ;
2023-04-19 18:57:53 +03:00
RC_CHK_FREE ( map ) ;
2009-11-21 01:51:27 +03:00
}
2015-05-25 22:59:56 +03:00
void map__put ( struct map * map )
{
2023-04-19 16:57:57 +03:00
if ( map & & refcount_dec_and_test ( map__refcnt ( map ) ) )
2015-05-25 22:59:56 +03:00
map__delete ( map ) ;
2023-04-19 18:57:53 +03:00
else
RC_CHK_PUT ( map ) ;
2015-05-25 22:59:56 +03:00
}
2013-01-25 04:59:59 +04:00
void map__fixup_start ( struct map * map )
2009-11-21 01:51:27 +03:00
{
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( map ) ;
struct rb_root_cached * symbols = & dso - > symbols ;
2018-12-06 22:18:17 +03:00
struct rb_node * nd = rb_first_cached ( symbols ) ;
2023-03-21 00:22:35 +03:00
2009-11-21 01:51:27 +03:00
if ( nd ! = NULL ) {
struct symbol * sym = rb_entry ( nd , struct symbol , rb_node ) ;
2023-03-21 00:22:35 +03:00
2023-04-19 18:24:12 +03:00
map__set_start ( map , sym - > start ) ;
2009-11-21 01:51:27 +03:00
}
}
2013-01-25 04:59:59 +04:00
void map__fixup_end ( struct map * map )
2009-11-21 01:51:27 +03:00
{
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( map ) ;
struct rb_root_cached * symbols = & dso - > symbols ;
2018-12-06 22:18:17 +03:00
struct rb_node * nd = rb_last ( & symbols - > rb_root ) ;
2023-03-21 00:22:35 +03:00
2009-11-21 01:51:27 +03:00
if ( nd ! = NULL ) {
struct symbol * sym = rb_entry ( nd , struct symbol , rb_node ) ;
2023-04-19 18:24:12 +03:00
map__set_end ( map , sym - > end ) ;
2009-11-21 01:51:27 +03:00
}
}
2009-10-30 21:28:25 +03:00
# define DSO__DELETED "(deleted)"
2016-09-02 01:25:52 +03:00
int map__load ( struct map * map )
2009-10-29 02:51:21 +03:00
{
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( map ) ;
const char * name = dso - > long_name ;
2009-12-15 18:32:33 +03:00
int nr ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
2023-03-21 00:22:35 +03:00
if ( dso__loaded ( dso ) )
2009-12-15 18:32:33 +03:00
return 0 ;
2023-03-21 00:22:35 +03:00
nr = dso__load ( dso , map ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
if ( nr < 0 ) {
2023-03-21 00:22:35 +03:00
if ( dso - > has_build_id ) {
2016-05-11 16:51:59 +03:00
char sbuild_id [ SBUILD_ID_SIZE ] ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
2023-03-21 00:22:35 +03:00
build_id__sprintf ( & dso - > bid , sbuild_id ) ;
2018-09-04 16:43:07 +03:00
pr_debug ( " %s with build id %s not found " , name , sbuild_id ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
} else
2018-09-04 16:43:07 +03:00
pr_debug ( " Failed to open %s " , name ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
2018-09-04 16:43:07 +03:00
pr_debug ( " , continuing without symbols \n " ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
return - 1 ;
} else if ( nr = = 0 ) {
2013-09-30 14:07:11 +04:00
# ifdef HAVE_LIBELF_SUPPORT
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
const size_t len = strlen ( name ) ;
const size_t real_len = len - sizeof ( DSO__DELETED ) ;
if ( len > sizeof ( DSO__DELETED ) & &
strcmp ( name + real_len + 1 , DSO__DELETED ) = = 0 ) {
2018-09-04 16:43:07 +03:00
pr_debug ( " %.*s was updated (is prelink enabled?). "
2011-10-19 04:44:45 +04:00
" Restart the long running apps that use it! \n " ,
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
( int ) real_len , name ) ;
} else {
2018-09-04 16:43:07 +03:00
pr_debug ( " no symbols found in %s, maybe install a debug package? \n " , name ) ;
2009-10-29 02:51:21 +03:00
}
2012-08-06 08:41:21 +04:00
# endif
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
return - 1 ;
2009-10-29 02:51:21 +03:00
}
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
return 0 ;
}
2016-09-02 01:25:52 +03:00
struct symbol * map__find_symbol ( struct map * map , u64 addr )
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
{
2016-09-02 01:25:52 +03:00
if ( map__load ( map ) < 0 )
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
return NULL ;
2023-03-21 00:22:35 +03:00
return dso__find_symbol ( map__dso ( map ) , addr ) ;
2009-10-29 02:51:21 +03:00
}
2016-09-02 01:25:52 +03:00
struct symbol * map__find_symbol_by_name ( struct map * map , const char * name )
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
{
2023-03-21 00:22:35 +03:00
struct dso * dso ;
2016-09-02 01:25:52 +03:00
if ( map__load ( map ) < 0 )
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
return NULL ;
2023-03-21 00:22:35 +03:00
dso = map__dso ( map ) ;
if ( ! dso__sorted_by_name ( dso ) )
dso__sort_by_name ( dso ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
2023-03-21 00:22:35 +03:00
return dso__find_symbol_by_name ( dso , name ) ;
perf symbols: Allow lookups by symbol name too
Configurable via symbol_conf.sort_by_name, so that the cost of an
extra rb_node on all 'struct symbol' instances is not paid by tools
that only want to decode addresses.
How to use it:
symbol_conf.sort_by_name = true;
symbol_init(&symbol_conf);
struct map *map = map_groups__find_by_name(kmaps, MAP__VARIABLE, "[kernel.kallsyms]");
if (map == NULL) {
pr_err("couldn't find map!\n");
kernel_maps__fprintf(stdout);
} else {
struct symbol *sym = map__find_symbol_by_name(map, sym_filter, NULL);
if (sym == NULL)
pr_err("couldn't find symbol %s!\n", sym_filter);
else
pr_info("symbol %s: %#Lx-%#Lx \n", sym_filter, sym->start, sym->end);
}
Looking over the vmlinux/kallsyms is common enough that I'll add a
variable to the upcoming struct perf_session to avoid the need to
use map_groups__find_by_name to get the main vmlinux/kallsyms map.
The above example looks on the 'variable' symtab, but it is just
like that for the functions one.
Also the sort operation is done when we first use
map__find_symbol_by_name, in a lazy way.
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Frédéric Weisbecker <fweisbec@gmail.com>
Cc: Masami Hiramatsu <mhiramat@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Paul Mackerras <paulus@samba.org>
LKML-Reference: <1260564622-12392-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2009-12-11 23:50:22 +03:00
}
2015-08-18 21:19:50 +03:00
struct map * map__clone ( struct map * from )
2009-08-12 13:07:25 +04:00
{
2023-04-19 18:57:53 +03:00
struct map * result ;
RC_STRUCT ( map ) * map ;
size_t size = sizeof ( RC_STRUCT ( map ) ) ;
2023-03-21 00:22:35 +03:00
struct dso * dso = map__dso ( from ) ;
2015-08-18 21:19:50 +03:00
2023-03-21 00:22:35 +03:00
if ( dso & & dso - > kernel )
2020-02-10 17:32:17 +03:00
size + = sizeof ( struct kmap ) ;
2023-04-19 18:57:53 +03:00
map = memdup ( RC_CHK_ACCESS ( from ) , size ) ;
if ( ADD_RC_CHK ( result , map ) ) {
2017-02-21 18:34:59 +03:00
refcount_set ( & map - > refcnt , 1 ) ;
2023-04-04 23:59:49 +03:00
map - > dso = dso__get ( dso ) ;
2015-08-18 21:19:50 +03:00
}
2023-04-19 18:57:53 +03:00
return result ;
2009-08-12 13:07:25 +04:00
}
2013-01-25 04:59:59 +04:00
size_t map__fprintf ( struct map * map , FILE * fp )
2009-08-12 13:07:25 +04:00
{
2023-03-21 00:22:35 +03:00
const struct dso * dso = map__dso ( map ) ;
2011-01-23 01:37:02 +03:00
return fprintf ( fp , " % " PRIx64 " -% " PRIx64 " % " PRIx64 " %s \n " ,
2023-04-04 23:59:46 +03:00
map__start ( map ) , map__end ( map ) , map__pgoff ( map ) , dso - > name ) ;
2009-08-12 13:07:25 +04:00
}
perf annotate: Fix it for non-prelinked *.so
The problem was we were incorrectly calculating objdump
addresses for sym->start and sym->end, look:
For simple ET_DYN type DSO (*.so) with one function, objdump -dS
output is something like this:
000004ac <my_strlen>:
int my_strlen(const char *s)
4ac: 55 push %ebp
4ad: 89 e5 mov %esp,%ebp
4af: 83 ec 10 sub $0x10,%esp
{
i.e. we have relative-to-dso-mapping IPs (=RIP) there.
For ET_EXEC type and probably for prelinked libs as well (sorry
can't test - I don't use prelink) objdump outputs absolute IPs,
e.g.
08048604 <zz_strlen>:
extern "C"
int zz_strlen(const char *s)
8048604: 55 push %ebp
8048605: 89 e5 mov %esp,%ebp
8048607: 83 ec 10 sub $0x10,%esp
{
So, if sym->start is always relative to dso mapping(*), we'll
have to unmap it for ET_EXEC like cases, and leave as is for
ET_DYN cases.
(*) and it is - we've explicitely made it relative. Look for
adjust_symbols handling in dso__load_sym()
Previously we were always unmapping sym->start and for ET_DYN
dsos resulting addresses were wrong, and so objdump output was
empty.
The end result was that perf annotate output for symbols from
non-prelinked *.so had always 0.00% percents only, which is
wrong.
To fix it, let's introduce a helper for converting rip to
objdump address, and also let's document what map_ip() and
unmap_ip() do -- I had to study sources for several hours to
understand it.
Signed-off-by: Kirill Smelkov <kirr@landau.phys.spbu.ru>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
LKML-Reference: <1265223128-11786-8-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-03 21:52:07 +03:00
2023-04-24 08:51:07 +03:00
static bool prefer_dso_long_name ( const struct dso * dso , bool print_off )
{
return dso - > long_name & &
( symbol_conf . show_kernel_path | |
( print_off & & ( dso - > name [ 0 ] = = ' [ ' | | dso__is_kcore ( dso ) ) ) ) ;
}
static size_t __map__fprintf_dsoname ( struct map * map , bool print_off , FILE * fp )
2012-01-30 08:42:57 +04:00
{
2019-05-08 16:20:05 +03:00
char buf [ symbol_conf . pad_output_len_dso + 1 ] ;
2012-08-27 11:38:26 +04:00
const char * dsoname = " [unknown] " ;
2023-03-21 00:22:35 +03:00
const struct dso * dso = map ? map__dso ( map ) : NULL ;
2012-01-30 08:42:57 +04:00
2023-03-21 00:22:35 +03:00
if ( dso ) {
2023-04-24 08:51:07 +03:00
if ( prefer_dso_long_name ( dso , print_off ) )
2023-03-21 00:22:35 +03:00
dsoname = dso - > long_name ;
2017-02-13 23:11:03 +03:00
else
2023-03-21 00:22:35 +03:00
dsoname = dso - > name ;
2012-08-27 11:38:26 +04:00
}
2012-01-30 08:42:57 +04:00
2019-05-08 16:20:05 +03:00
if ( symbol_conf . pad_output_len_dso ) {
scnprintf_pad ( buf , symbol_conf . pad_output_len_dso , " %s " , dsoname ) ;
dsoname = buf ;
}
2012-01-30 08:42:57 +04:00
return fprintf ( fp , " %s " , dsoname ) ;
}
2023-04-24 08:51:07 +03:00
size_t map__fprintf_dsoname ( struct map * map , FILE * fp )
{
return __map__fprintf_dsoname ( map , false , fp ) ;
}
2023-04-18 06:18:24 +03:00
size_t map__fprintf_dsoname_dsoff ( struct map * map , bool print_off , u64 addr , FILE * fp )
{
2023-04-24 08:51:07 +03:00
const struct dso * dso = map ? map__dso ( map ) : NULL ;
2023-04-18 06:18:24 +03:00
int printed = 0 ;
2023-04-24 08:51:07 +03:00
if ( print_off & & ( ! dso | | ! dso__is_object_file ( dso ) ) )
print_off = false ;
2023-04-18 06:18:24 +03:00
printed + = fprintf ( fp , " ( " ) ;
2023-04-24 08:51:07 +03:00
printed + = __map__fprintf_dsoname ( map , print_off , fp ) ;
if ( print_off )
2023-04-18 06:18:24 +03:00
printed + = fprintf ( fp , " +0x% " PRIx64 , addr ) ;
printed + = fprintf ( fp , " ) " ) ;
return printed ;
}
2018-05-28 17:05:20 +03:00
char * map__srcline ( struct map * map , u64 addr , struct symbol * sym )
{
if ( map = = NULL )
return SRCLINE_UNKNOWN ;
2023-03-21 00:22:35 +03:00
return get_srcline ( map__dso ( map ) , map__rip_2objdump ( map , addr ) , sym , true , true , addr ) ;
2018-05-28 17:05:20 +03:00
}
2013-12-06 11:42:57 +04:00
int map__fprintf_srcline ( struct map * map , u64 addr , const char * prefix ,
FILE * fp )
{
2023-03-21 00:22:35 +03:00
const struct dso * dso = map ? map__dso ( map ) : NULL ;
2013-12-06 11:42:57 +04:00
int ret = 0 ;
2023-03-21 00:22:35 +03:00
if ( dso ) {
2018-05-28 17:05:20 +03:00
char * srcline = map__srcline ( map , addr , NULL ) ;
2020-02-23 22:34:49 +03:00
if ( strncmp ( srcline , SRCLINE_UNKNOWN , strlen ( SRCLINE_UNKNOWN ) ) ! = 0 )
2013-12-06 11:42:57 +04:00
ret = fprintf ( fp , " %s%s " , prefix , srcline ) ;
2023-06-09 02:28:22 +03:00
zfree_srcline ( & srcline ) ;
2013-12-06 11:42:57 +04:00
}
return ret ;
}
2018-12-04 03:18:48 +03:00
void srccode_state_free ( struct srccode_state * state )
{
zfree ( & state - > srcfile ) ;
state - > line = 0 ;
}
2013-10-14 14:43:44 +04:00
/**
* map__rip_2objdump - convert symbol start address to objdump address .
* @ map : memory map
* @ rip : symbol start address
*
perf annotate: Fix it for non-prelinked *.so
The problem was we were incorrectly calculating objdump
addresses for sym->start and sym->end, look:
For simple ET_DYN type DSO (*.so) with one function, objdump -dS
output is something like this:
000004ac <my_strlen>:
int my_strlen(const char *s)
4ac: 55 push %ebp
4ad: 89 e5 mov %esp,%ebp
4af: 83 ec 10 sub $0x10,%esp
{
i.e. we have relative-to-dso-mapping IPs (=RIP) there.
For ET_EXEC type and probably for prelinked libs as well (sorry
can't test - I don't use prelink) objdump outputs absolute IPs,
e.g.
08048604 <zz_strlen>:
extern "C"
int zz_strlen(const char *s)
8048604: 55 push %ebp
8048605: 89 e5 mov %esp,%ebp
8048607: 83 ec 10 sub $0x10,%esp
{
So, if sym->start is always relative to dso mapping(*), we'll
have to unmap it for ET_EXEC like cases, and leave as is for
ET_DYN cases.
(*) and it is - we've explicitely made it relative. Look for
adjust_symbols handling in dso__load_sym()
Previously we were always unmapping sym->start and for ET_DYN
dsos resulting addresses were wrong, and so objdump output was
empty.
The end result was that perf annotate output for symbols from
non-prelinked *.so had always 0.00% percents only, which is
wrong.
To fix it, let's introduce a helper for converting rip to
objdump address, and also let's document what map_ip() and
unmap_ip() do -- I had to study sources for several hours to
understand it.
Signed-off-by: Kirill Smelkov <kirr@landau.phys.spbu.ru>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
LKML-Reference: <1265223128-11786-8-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-03 21:52:07 +03:00
* objdump wants / reports absolute IPs for ET_EXEC , and RIPs for ET_DYN .
2013-08-07 15:38:50 +04:00
* map - > dso - > adjust_symbols = = 1 for ET_EXEC - like cases except ET_REL which is
* relative to section start .
2013-10-14 14:43:44 +04:00
*
* Return : Address suitable for passing to " objdump --start-address= "
perf annotate: Fix it for non-prelinked *.so
The problem was we were incorrectly calculating objdump
addresses for sym->start and sym->end, look:
For simple ET_DYN type DSO (*.so) with one function, objdump -dS
output is something like this:
000004ac <my_strlen>:
int my_strlen(const char *s)
4ac: 55 push %ebp
4ad: 89 e5 mov %esp,%ebp
4af: 83 ec 10 sub $0x10,%esp
{
i.e. we have relative-to-dso-mapping IPs (=RIP) there.
For ET_EXEC type and probably for prelinked libs as well (sorry
can't test - I don't use prelink) objdump outputs absolute IPs,
e.g.
08048604 <zz_strlen>:
extern "C"
int zz_strlen(const char *s)
8048604: 55 push %ebp
8048605: 89 e5 mov %esp,%ebp
8048607: 83 ec 10 sub $0x10,%esp
{
So, if sym->start is always relative to dso mapping(*), we'll
have to unmap it for ET_EXEC like cases, and leave as is for
ET_DYN cases.
(*) and it is - we've explicitely made it relative. Look for
adjust_symbols handling in dso__load_sym()
Previously we were always unmapping sym->start and for ET_DYN
dsos resulting addresses were wrong, and so objdump output was
empty.
The end result was that perf annotate output for symbols from
non-prelinked *.so had always 0.00% percents only, which is
wrong.
To fix it, let's introduce a helper for converting rip to
objdump address, and also let's document what map_ip() and
unmap_ip() do -- I had to study sources for several hours to
understand it.
Signed-off-by: Kirill Smelkov <kirr@landau.phys.spbu.ru>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
LKML-Reference: <1265223128-11786-8-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-03 21:52:07 +03:00
*/
u64 map__rip_2objdump ( struct map * map , u64 rip )
{
2018-06-05 10:30:00 +03:00
struct kmap * kmap = __map__kmap ( map ) ;
2023-03-21 00:22:35 +03:00
const struct dso * dso = map__dso ( map ) ;
2018-06-05 10:30:00 +03:00
/*
* vmlinux does not have program headers for PTI entry trampolines and
* kcore may not either . However the trampoline object code is on the
* main kernel map , so just use that instead .
*/
perf maps: Add functions to access maps
Introduce functions to access struct maps. These functions reduce the
number of places reference counting is necessary. While tidying APIs do
some small const-ification, in particlar to unwind_libunwind_ops.
Committer notes:
Fixed up tools/perf/util/unwind-libunwind.c:
- return ops->get_entries(cb, arg, thread, data, max_stack);
+ return ops->get_entries(cb, arg, thread, data, max_stack, best_effort);
Signed-off-by: Ian Rogers <irogers@google.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexey Bayduraev <alexey.v.bayduraev@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
Cc: Darren Hart <dvhart@infradead.org>
Cc: Davidlohr Bueso <dave@stgolabs.net>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Eric Dumazet <edumazet@google.com>
Cc: German Gomez <german.gomez@arm.com>
Cc: Hao Luo <haoluo@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: James Clark <james.clark@arm.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Kajol Jain <kjain@linux.ibm.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Madhavan Srinivasan <maddy@linux.ibm.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Masami Hiramatsu <mhiramat@kernel.org>
Cc: Miaoqian Lin <linmq006@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Cc: Shunsuke Nakamura <nakamura.shun@fujitsu.com>
Cc: Song Liu <song@kernel.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Stephen Brennan <stephen.s.brennan@oracle.com>
Cc: Steven Rostedt (VMware) <rostedt@goodmis.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Thomas Richter <tmricht@linux.ibm.com>
Cc: Yury Norov <yury.norov@gmail.com>
Link: https://lore.kernel.org/r/20230320212248.1175731-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-03-21 00:22:34 +03:00
if ( kmap & & is_entry_trampoline ( kmap - > name ) & & kmap - > kmaps ) {
struct machine * machine = maps__machine ( kmap - > kmaps ) ;
2018-06-05 10:30:00 +03:00
perf maps: Add functions to access maps
Introduce functions to access struct maps. These functions reduce the
number of places reference counting is necessary. While tidying APIs do
some small const-ification, in particlar to unwind_libunwind_ops.
Committer notes:
Fixed up tools/perf/util/unwind-libunwind.c:
- return ops->get_entries(cb, arg, thread, data, max_stack);
+ return ops->get_entries(cb, arg, thread, data, max_stack, best_effort);
Signed-off-by: Ian Rogers <irogers@google.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com>
Cc: Alexey Bayduraev <alexey.v.bayduraev@linux.intel.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
Cc: Darren Hart <dvhart@infradead.org>
Cc: Davidlohr Bueso <dave@stgolabs.net>
Cc: Dmitriy Vyukov <dvyukov@google.com>
Cc: Eric Dumazet <edumazet@google.com>
Cc: German Gomez <german.gomez@arm.com>
Cc: Hao Luo <haoluo@google.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: James Clark <james.clark@arm.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Kajol Jain <kjain@linux.ibm.com>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Leo Yan <leo.yan@linaro.org>
Cc: Madhavan Srinivasan <maddy@linux.ibm.com>
Cc: Mark Rutland <mark.rutland@arm.com>
Cc: Masami Hiramatsu <mhiramat@kernel.org>
Cc: Miaoqian Lin <linmq006@gmail.com>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Riccardo Mancini <rickyman7@gmail.com>
Cc: Shunsuke Nakamura <nakamura.shun@fujitsu.com>
Cc: Song Liu <song@kernel.org>
Cc: Stephane Eranian <eranian@google.com>
Cc: Stephen Brennan <stephen.s.brennan@oracle.com>
Cc: Steven Rostedt (VMware) <rostedt@goodmis.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Thomas Richter <tmricht@linux.ibm.com>
Cc: Yury Norov <yury.norov@gmail.com>
Link: https://lore.kernel.org/r/20230320212248.1175731-2-irogers@google.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2023-03-21 00:22:34 +03:00
if ( machine ) {
struct map * kernel_map = machine__kernel_map ( machine ) ;
if ( kernel_map )
map = kernel_map ;
}
2018-06-05 10:30:00 +03:00
}
2023-03-21 00:22:35 +03:00
if ( ! dso - > adjust_symbols )
2013-08-07 15:38:50 +04:00
return rip ;
2023-03-21 00:22:35 +03:00
if ( dso - > rel )
2023-04-04 23:59:46 +03:00
return rip - map__pgoff ( map ) ;
2013-08-07 15:38:50 +04:00
2016-04-07 13:24:30 +03:00
/*
* kernel modules also have DSO_TYPE_USER in dso - > kernel ,
* but all kernel modules are ET_REL , so won ' t get here .
*/
2023-03-21 00:22:35 +03:00
if ( dso - > kernel = = DSO_SPACE__USER )
return rip + dso - > text_offset ;
2016-04-07 13:24:30 +03:00
2023-04-04 23:59:46 +03:00
return map__unmap_ip ( map , rip ) - map__reloc ( map ) ;
perf annotate: Fix it for non-prelinked *.so
The problem was we were incorrectly calculating objdump
addresses for sym->start and sym->end, look:
For simple ET_DYN type DSO (*.so) with one function, objdump -dS
output is something like this:
000004ac <my_strlen>:
int my_strlen(const char *s)
4ac: 55 push %ebp
4ad: 89 e5 mov %esp,%ebp
4af: 83 ec 10 sub $0x10,%esp
{
i.e. we have relative-to-dso-mapping IPs (=RIP) there.
For ET_EXEC type and probably for prelinked libs as well (sorry
can't test - I don't use prelink) objdump outputs absolute IPs,
e.g.
08048604 <zz_strlen>:
extern "C"
int zz_strlen(const char *s)
8048604: 55 push %ebp
8048605: 89 e5 mov %esp,%ebp
8048607: 83 ec 10 sub $0x10,%esp
{
So, if sym->start is always relative to dso mapping(*), we'll
have to unmap it for ET_EXEC like cases, and leave as is for
ET_DYN cases.
(*) and it is - we've explicitely made it relative. Look for
adjust_symbols handling in dso__load_sym()
Previously we were always unmapping sym->start and for ET_DYN
dsos resulting addresses were wrong, and so objdump output was
empty.
The end result was that perf annotate output for symbols from
non-prelinked *.so had always 0.00% percents only, which is
wrong.
To fix it, let's introduce a helper for converting rip to
objdump address, and also let's document what map_ip() and
unmap_ip() do -- I had to study sources for several hours to
understand it.
Signed-off-by: Kirill Smelkov <kirr@landau.phys.spbu.ru>
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
LKML-Reference: <1265223128-11786-8-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-03 21:52:07 +03:00
}
perf top: Fix annotate for userspace
First, for programs and prelinked libraries, annotate code was
fooled by objdump output IPs (src->eip in the code) being
wrongly converted to absolute IPs. In such case there were no
conversion needed, but in
src->eip = strtoull(src->line, NULL, 16);
src->eip = map->unmap_ip(map, src->eip); // = eip + map->start - map->pgoff
we were reading absolute address from objdump (e.g. 8048604) and
then almost doubling it, because eip & map->start are
approximately close for small programs.
Needless to say, that later, in record_precise_ip() there was no
matching with real runtime IPs.
And second, like with `perf annotate` the problem with
non-prelinked *.so was that we were doing rip -> objdump address
conversion wrong.
Also, because unlike `perf annotate`, `perf top` code does
annotation based on absolute IPs for performance reasons(*), new
helper for mapping objdump addresse to IP is introduced.
(*) we get samples info in absolute IPs, and since we do lots of
hit-testing on absolute IPs at runtime in record_precise_ip(), it's
better to convert objdump addresses to IPs once and do no conversion
at runtime.
I also had to fix how objdump output is parsed (with hardcoded
8/16 characters format, which was inappropriate for ET_DYN dsos
with small addresses like '4ac')
Also note, that not all objdump output lines has associtated
IPs, e.g. look at source lines here:
000004ac <my_strlen>:
extern "C"
int my_strlen(const char *s)
4ac: 55 push %ebp
4ad: 89 e5 mov %esp,%ebp
4af: 83 ec 10 sub $0x10,%esp
{
int len = 0;
4b2: c7 45 fc 00 00 00 00 movl $0x0,-0x4(%ebp)
4b9: eb 08 jmp 4c3 <my_strlen+0x17>
while (*s) {
++len;
4bb: 83 45 fc 01 addl $0x1,-0x4(%ebp)
++s;
4bf: 83 45 08 01 addl $0x1,0x8(%ebp)
So we mark them with eip=0, and ignore such lines in annotate
lookup code.
Signed-off-by: Kirill Smelkov <kirr@landau.phys.spbu.ru>
[ Note: one hunk of this patch was applied by Mike in 57d8188 ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
LKML-Reference: <1265550376-12665-1-git-send-email-acme@infradead.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
2010-02-07 16:46:15 +03:00
2013-10-14 14:43:44 +04:00
/**
* map__objdump_2mem - convert objdump address to a memory address .
* @ map : memory map
* @ ip : objdump address
*
* Closely related to map__rip_2objdump ( ) , this function takes an address from
* objdump and converts it to a memory address . Note this assumes that @ map
* contains the address . To be sure the result is valid , check it forwards
2023-04-04 23:59:44 +03:00
* e . g . map__rip_2objdump ( map__map_ip ( map , map__objdump_2mem ( map , ip ) ) ) = = ip
2013-10-14 14:43:44 +04:00
*
* Return : Memory address .
*/
u64 map__objdump_2mem ( struct map * map , u64 ip )
{
2023-03-21 00:22:35 +03:00
const struct dso * dso = map__dso ( map ) ;
if ( ! dso - > adjust_symbols )
2023-04-04 23:59:44 +03:00
return map__unmap_ip ( map , ip ) ;
2013-10-14 14:43:44 +04:00
2023-03-21 00:22:35 +03:00
if ( dso - > rel )
2023-04-04 23:59:46 +03:00
return map__unmap_ip ( map , ip + map__pgoff ( map ) ) ;
2013-10-14 14:43:44 +04:00
2016-04-07 13:24:30 +03:00
/*
* kernel modules also have DSO_TYPE_USER in dso - > kernel ,
* but all kernel modules are ET_REL , so won ' t get here .
*/
2023-03-21 00:22:35 +03:00
if ( dso - > kernel = = DSO_SPACE__USER )
2023-04-04 23:59:44 +03:00
return map__unmap_ip ( map , ip - dso - > text_offset ) ;
2016-04-07 13:24:30 +03:00
2023-04-04 23:59:46 +03:00
return ip + map__reloc ( map ) ;
2013-10-14 14:43:44 +04:00
}
2022-02-11 13:34:03 +03:00
bool map__contains_symbol ( const struct map * map , const struct symbol * sym )
2018-09-07 11:51:16 +03:00
{
2023-04-04 23:59:44 +03:00
u64 ip = map__unmap_ip ( map , sym - > start ) ;
2018-09-07 11:51:16 +03:00
2023-03-21 00:22:36 +03:00
return ip > = map__start ( map ) & & ip < map__end ( map ) ;
2018-09-07 11:51:16 +03:00
}
2018-05-22 13:54:35 +03:00
struct kmap * __map__kmap ( struct map * map )
2015-04-07 11:22:45 +03:00
{
2023-03-21 00:22:35 +03:00
const struct dso * dso = map__dso ( map ) ;
if ( ! dso | | ! dso - > kernel )
2015-04-07 11:22:45 +03:00
return NULL ;
2023-04-19 18:57:53 +03:00
return ( struct kmap * ) ( & RC_CHK_ACCESS ( map ) [ 1 ] ) ;
2015-04-07 11:22:45 +03:00
}
2018-05-22 13:54:35 +03:00
struct kmap * map__kmap ( struct map * map )
{
struct kmap * kmap = __map__kmap ( map ) ;
if ( ! kmap )
pr_err ( " Internal error: map__kmap with a non-kernel map \n " ) ;
return kmap ;
}
2019-11-26 03:58:33 +03:00
struct maps * map__kmaps ( struct map * map )
2015-04-07 11:22:45 +03:00
{
struct kmap * kmap = map__kmap ( map ) ;
if ( ! kmap | | ! kmap - > kmaps ) {
pr_err ( " Internal error: map__kmaps with a non-kernel map \n " ) ;
return NULL ;
}
return kmap - > kmaps ;
}
2022-02-11 13:34:02 +03:00
2023-04-04 23:59:43 +03:00
u64 map__dso_map_ip ( const struct map * map , u64 ip )
2022-02-11 13:34:02 +03:00
{
2023-04-04 23:59:46 +03:00
return ip - map__start ( map ) + map__pgoff ( map ) ;
2022-02-11 13:34:02 +03:00
}
2023-04-04 23:59:43 +03:00
u64 map__dso_unmap_ip ( const struct map * map , u64 ip )
2022-02-11 13:34:02 +03:00
{
2023-04-04 23:59:46 +03:00
return ip + map__start ( map ) - map__pgoff ( map ) ;
2022-02-11 13:34:02 +03:00
}
u64 identity__map_ip ( const struct map * map __maybe_unused , u64 ip )
{
return ip ;
}