2013-10-02 13:49:08 +04:00
i n c l u d e . . / s c r i p t s / M a k e f i l e . i n c l u d e
# The default target of this Makefile is...
all :
2016-03-18 19:57:20 +03:00
i n c l u d e . . / s c r i p t s / u t i l i t i e s . m a k
2013-10-02 13:49:08 +04:00
# Define V to have a more verbose compile.
#
2014-02-19 19:52:54 +04:00
# Define VF to have a more verbose feature check output.
#
2013-10-02 13:49:08 +04:00
# Define O to save output files in a separate directory.
#
# Define ARCH as name of target architecture if you want cross-builds.
#
# Define CROSS_COMPILE as prefix name of compiler if you want cross-builds.
#
# Define NO_LIBPERL to disable perl script extension.
#
# Define NO_LIBPYTHON to disable python script extension.
#
# Define PYTHON to point to the python binary if the default
# `python' is not correct; for example: PYTHON=python2
#
# Define PYTHON_CONFIG to point to the python-config binary if
# the default `$(PYTHON)-config' is not correct.
#
# Define ASCIIDOC8 if you want to format documentation with AsciiDoc 8
#
# Define DOCBOOK_XSL_172 if you want to format man pages with DocBook XSL v1.72.
#
# Define LDFLAGS=-static to build a static binary.
#
# Define EXTRA_CFLAGS=-m64 or EXTRA_CFLAGS=-m32 as appropriate for cross-builds.
#
# Define NO_DWARF if you do not want debug-info analysis feature at all.
#
# Define WERROR=0 to disable treating any warnings as errors.
#
# Define NO_NEWT if you do not want TUI support. (deprecated)
#
# Define NO_SLANG if you do not want TUI support.
#
# Define NO_GTK2 if you do not want GTK+ GUI support.
#
# Define NO_DEMANGLE if you do not want C++ symbol demangling.
#
# Define NO_LIBELF if you do not want libelf dependency (e.g. cross-builds)
#
# Define NO_LIBUNWIND if you do not want libunwind dependency for dwarf
# backtrace post unwind.
#
# Define NO_BACKTRACE if you do not want stack backtrace debug feature
#
# Define NO_LIBNUMA if you do not want numa perf benchmark
#
# Define NO_LIBAUDIT if you do not want libaudit support
#
# Define NO_LIBBIONIC if you do not want bionic support
2014-02-19 19:52:57 +04:00
#
2015-11-30 12:02:21 +03:00
# Define NO_LIBCRYPTO if you do not want libcrypto (openssl) support
# used for generating build-ids for ELFs generated by jitdump.
#
2014-02-19 19:52:57 +04:00
# Define NO_LIBDW_DWARF_UNWIND if you do not want libdw support
# for dwarf backtrace post unwind.
2014-10-24 01:16:03 +04:00
#
# Define NO_PERF_READ_VDSO32 if you do not want to build perf-read-vdso32
# for reading the 32-bit compatibility VDSO in 64-bit mode
#
# Define NO_PERF_READ_VDSOX32 if you do not want to build perf-read-vdsox32
# for reading the x32 mode 32-bit compatibility VDSO in 64-bit mode
perf tools: Add gzip decompression support for kernel module
Now my Archlinux box shows module symbols correctly.
Before:
$ perf report --stdio
Failed to open /tmp/perf-3477.map, continuing without symbols
no symbols found in /usr/bin/date, maybe install a debug package?
No kallsyms or vmlinux with build-id 7b4ea0a49ae2111925857099aaf05c3246ff33e0 was found
[drm] with build id 7b4ea0a49ae2111925857099aaf05c3246ff33e0 not found, continuing without symbols
No kallsyms or vmlinux with build-id edd931629094b660ca9dec09a1b635c8d87aa2ee was found
[jbd2] with build id edd931629094b660ca9dec09a1b635c8d87aa2ee not found, continuing without symbols
No kallsyms or vmlinux with build-id a7b1eada671c34933e5610bb920b2ca4945a82c3 was found
[ext4] with build id a7b1eada671c34933e5610bb920b2ca4945a82c3 not found, continuing without symbols
No kallsyms or vmlinux with build-id d69511fa3e5840e770336ef45b06c83fef8d74e3 was found
[scsi_mod] with build id d69511fa3e5840e770336ef45b06c83fef8d74e3 not found, continuing without symbols
No kallsyms or vmlinux with build-id af0430af13461af058770ee9b87afc07922c2e77 was found
[libata] with build id af0430af13461af058770ee9b87afc07922c2e77 not found, continuing without symbols
No kallsyms or vmlinux with build-id aaeedff8160ce631a5f0333591c6ff291201d29f was found
[libahci] with build id aaeedff8160ce631a5f0333591c6ff291201d29f not found, continuing without symbols
No kallsyms or vmlinux with build-id c57907712becaf662dc4981824bb372c0441d605 was found
[mac80211] with build id c57907712becaf662dc4981824bb372c0441d605 not found, continuing without symbols
No kallsyms or vmlinux with build-id e0589077cc0ec8c3e4c40eb9f2d9e69d236bee8f was found
[iwldvm] with build id e0589077cc0ec8c3e4c40eb9f2d9e69d236bee8f not found, continuing without symbols
No kallsyms or vmlinux with build-id 2d86086bf136bf374a2f029cf85a48194f9b950b was found
[cfg80211] with build id 2d86086bf136bf374a2f029cf85a48194f9b950b not found, continuing without symbols
No kallsyms or vmlinux with build-id 4493c48599bdb3d91d0f8db5150e0be33fdd9221 was found
[iwlwifi] with build id 4493c48599bdb3d91d0f8db5150e0be33fdd9221 not found, continuing without symbols
...
#
# Overhead Command Shared Object Symbol
# ........ ............... ....................... ........................................................
#
0.03% swapper [ext4] [k] 0x000000000000fe2e
0.03% swapper [kernel.kallsyms] [k] account_entity_enqueue
0.03% swapper [ext4] [k] 0x000000000000fc2b
0.03% irq/50-iwlwifi [iwlwifi] [k] 0x000000000000200b
0.03% swapper [kernel.kallsyms] [k] ktime_add_safe
0.03% swapper [kernel.kallsyms] [k] elv_completed_request
0.03% swapper [libata] [k] 0x0000000000003997
0.03% swapper [libahci] [k] 0x0000000000001f25
0.03% swapper [kernel.kallsyms] [k] rb_next
0.03% swapper [kernel.kallsyms] [k] blk_finish_request
0.03% swapper [ext4] [k] 0x0000000000010248
0.00% perf [kernel.kallsyms] [k] native_write_msr_safe
After:
$ perf report --stdio
Failed to open /tmp/perf-3477.map, continuing without symbols
no symbols found in /usr/bin/tr, maybe install a debug package?
...
#
# Overhead Command Shared Object Symbol
# ........ ............... ........................... ......................................................
#
0.04% kworker/u16:3 [ext4] [k] ext4_read_block_bitmap
0.03% kworker/u16:0 [mac80211] [k] ieee80211_sta_reset_beacon_monitor
0.02% irq/50-iwlwifi [mac80211] [k] ieee80211_get_bssid
0.02% firefox [e1000e] [k] __ew32_prepare
0.02% swapper [libahci] [k] ahci_handle_port_interrupt
0.02% emacs libglib-2.0.so.0.4000.0 [.] g_mutex_unlock
0.02% swapper [e1000e] [k] e1000_clean_tx_irq
0.02% dwm [kernel.kallsyms] [k] __schedule
0.02% gnome-terminal- [vdso] [.] __vdso_clock_gettime
0.02% swapper [e1000e] [k] e1000_alloc_rx_buffers
0.02% irq/50-iwlwifi [mac80211] [k] ieee80211_rx
0.01% firefox [vdso] [.] __vdso_gettimeofday
0.01% irq/50-iwlwifi [iwlwifi] [k] iwl_pcie_rxq_restock.part.13
Signed-off-by: Namhyung Kim <namhyung@kernel.org>
Acked-by: Jiri Olsa <jolsa@redhat.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Ingo Molnar <mingo@kernel.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Namhyung Kim <namhyung.kim@lge.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Stephane Eranian <eranian@google.com>
Link: http://lkml.kernel.org/r/87h9yexshi.fsf@sejong.aot.lge.com
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2014-10-31 10:51:38 +03:00
#
# Define NO_ZLIB if you do not want to support compressed kernel modules
2015-02-21 01:16:58 +03:00
#
2015-03-28 13:30:30 +03:00
# Define LIBBABELTRACE if you DO want libbabeltrace support
2015-02-21 01:16:58 +03:00
# for CTF data format.
2015-01-29 15:29:39 +03:00
#
# Define NO_LZMA if you do not want to support compressed (xz) kernel modules
2015-04-30 17:37:27 +03:00
#
# Define NO_AUXTRACE if you do not want AUX area tracing support
2015-10-14 15:41:12 +03:00
#
# Define NO_LIBBPF if you do not want BPF support
2016-01-15 07:00:18 +03:00
#
2016-07-12 13:05:56 +03:00
# Define NO_SDT if you do not want to define SDT event in perf tools,
# note that it doesn't disable SDT scanning support.
#
2016-01-15 07:00:18 +03:00
# Define FEATURES_DUMP to provide features detection dump file
# and bypass the feature detection
2016-11-02 16:35:49 +03:00
#
# Define NO_JVMTI if you do not want jvmti agent built
2016-11-26 10:03:33 +03:00
#
# Define LIBCLANGLLVM if you DO want builtin clang and llvm support.
# When selected, pass LLVM_CONFIG=/path/to/llvm-config to `make' if
# llvm-config is not in $PATH.
2014-02-19 19:52:57 +04:00
2015-08-13 10:14:55 +03:00
# As per kernel Makefile, avoid funny character set dependencies
u n export LC_ALL
LC_COLLATE = C
LC_NUMERIC = C
export LC_COLLATE LC_NUMERIC
2013-10-02 13:49:08 +04:00
i f e q ( $( srctree ) , )
2016-11-22 11:30:26 +03:00
srctree := $( patsubst %/,%,$( dir $( CURDIR) ) )
2013-10-02 13:49:08 +04:00
srctree := $( patsubst %/,%,$( dir $( srctree) ) )
#$(info Determined 'srctree' to be $(srctree))
e n d i f
i f n e q ( $( objtree ) , )
#$(info Determined 'objtree' to be $(objtree))
e n d i f
i f n e q ( $( OUTPUT ) , )
#$(info Determined 'OUTPUT' to be $(OUTPUT))
2014-12-29 19:42:46 +03:00
# Adding $(OUTPUT) as a directory to look for source files,
# because use generated output files as sources dependency
# for flex/bison parsers.
VPATH += $( OUTPUT)
export VPATH
2013-10-02 13:49:08 +04:00
e n d i f
2015-01-12 01:59:55 +03:00
i f e q ( $( V ) , 1 )
Q =
e l s e
Q = @
e n d i f
2015-01-10 23:43:32 +03:00
# Do not use make's built-in rules
# (this improves performance and avoids hard-to-debug behaviour);
MAKEFLAGS += -r
2015-07-14 12:05:20 +03:00
# Makefiles suck: This macro sets a default value of $(2) for the
# variable named by $(1), unless the variable has been set by
# environment or command line. This is necessary for CC and AR
# because make sets default values, so the simpler ?= approach
# won't work as expected.
d e f i n e a l l o w - o v e r r i d e
$( if $( or $( findstring environment,$( origin $( 1) ) ) ,\
$( findstring command line,$( origin $( 1) ) ) ) ,,\
$( eval $( 1) = $( 2) ) )
e n d e f
# Allow setting CC and AR and LD, or setting CROSS_COMPILE as a prefix.
$( call allow -override ,CC ,$ ( CROSS_COMPILE ) gcc )
$( call allow -override ,AR ,$ ( CROSS_COMPILE ) ar )
$( call allow -override ,LD ,$ ( CROSS_COMPILE ) ld )
2016-11-26 10:03:33 +03:00
$( call allow -override ,CXX ,$ ( CROSS_COMPILE ) g ++)
2015-07-14 12:05:20 +03:00
2016-02-09 16:33:38 +03:00
LD += $( EXTRA_LDFLAGS)
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
HOSTCC ?= gcc
HOSTLD ?= ld
HOSTAR ?= ar
2014-01-15 14:44:08 +04:00
PKG_CONFIG = $( CROSS_COMPILE) pkg-config
2016-11-26 10:03:33 +03:00
LLVM_CONFIG ?= llvm-config
2013-10-02 13:49:08 +04:00
RM = rm -f
perf trace: Add 'trace' alias to 'perf trace'
Make 'perf trace' more accessible by aliasing it to just 'trace':
[root@zoo linux]# trace --duration 15 -a -e futex sleep 1
110.092 (16.188 ms): libvirtd/1166 futex(uaddr: 0x185b344, op: WAIT|PRIV, val: 174293 ) = 0
110.101 (15.903 ms): libvirtd/1171 futex(uaddr: 0x185b3dc, op: WAIT|PRIV, val: 139265 ) = 0
111.594 (15.776 ms): libvirtd/1165 futex(uaddr: 0x185b344, op: WAIT|PRIV, val: 174295 ) = 0
111.610 (15.969 ms): libvirtd/1169 futex(uaddr: 0x185b3dc, op: WAIT|PRIV, val: 139267 ) = 0
113.556 (16.216 ms): libvirtd/1168 futex(uaddr: 0x185b3dc, op: WAIT|PRIV, val: 139269 ) = 0
291.265 (199.508 ms): chromium-brows/15830 futex(uaddr: 0x7fff2986bcb4, op: WAIT_BITSET|PRIV|CLKRT, val: 1, utime: 0x7fff2986bab0, val3: 4294967295) = -1 ETIMEDOUT Connection timed out
360.354 (69.053 ms): chromium-brows/15830 futex(uaddr: 0x7fff2986bcb4, op: WAIT_BITSET|PRIV|CLKRT, val: 1, utime: 0x7fff2986bab0, val3: 4294967295) = -1 ETIMEDOUT Connection timed out
[root@zoo linux]#
I.e. looking for futex calls that take at least 15ms, system wide, during a one
second window. Now to get callchains into 'trace' to figure out what are those
locks :-)
Requested-by: Ingo Molnar <mingo@kernel.org>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Stephane Eranian <eranian@google.com>
Link: http://lkml.kernel.org/n/tip-ch4smqz8b5fmgrte7c5e4fuw@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2013-09-24 18:56:36 +04:00
LN = ln -f
2013-10-02 13:49:08 +04:00
MKDIR = mkdir
FIND = find
INSTALL = install
FLEX = flex
BISON = bison
STRIP = strip
2015-08-13 10:14:55 +03:00
AWK = awk
2013-10-02 13:49:08 +04:00
2016-08-01 21:02:33 +03:00
# include Makefile.config by default and rule out
2013-10-02 13:49:08 +04:00
# non-config cases
config := 1
2016-12-04 23:42:57 +03:00
NON_CONFIG_TARGETS := clean TAGS tags cscope help install-doc install-man install-html install-info install-pdf doc man html info pdf
2013-10-02 13:49:08 +04:00
i f d e f M A K E C M D G O A L S
i f e q ( $( filter -out $ ( NON_CONFIG_TARGETS ) ,$ ( MAKECMDGOALS ) ) , )
config := 0
e n d i f
e n d i f
2016-12-06 16:18:49 +03:00
# The fixdep build - we force fixdep tool to be built as
# the first target in the separate make session not to be
# disturbed by any parallel make jobs. Once fixdep is done
# we issue the requested build with FIXDEP=1 variable.
#
# The fixdep build is disabled for $(NON_CONFIG_TARGETS)
# targets, because it's not necessary.
i f d e f F I X D E P
force_fixdep := 0
e l s e
force_fixdep := $( config)
e n d i f
export srctree OUTPUT RM CC CXX LD AR CFLAGS CXXFLAGS V BISON FLEX AWK
export HOSTCC HOSTLD HOSTAR
i n c l u d e $( srctree ) / t o o l s / b u i l d / M a k e f i l e . i n c l u d e
i f e q ( $( force_fixdep ) , 1 )
goals := $( filter-out all sub-make, $( MAKECMDGOALS) )
$(goals) all : sub -make
sub-make : fixdep
2016-12-06 16:18:51 +03:00
@./check-headers.sh
2016-12-06 16:18:49 +03:00
$( Q) $( MAKE) FIXDEP = 1 -f Makefile.perf $( goals)
e l s e # force_fixdep
2016-12-06 16:18:50 +03:00
LIB_DIR = $( srctree) /tools/lib/api/
TRACE_EVENT_DIR = $( srctree) /tools/lib/traceevent/
BPF_DIR = $( srctree) /tools/lib/bpf/
SUBCMD_DIR = $( srctree) /tools/lib/subcmd/
2016-01-25 12:55:51 +03:00
# Set FEATURE_TESTS to 'all' so all possible feature checkers are executed.
# Without this setting the output feature dump file misses some features, for
# example, liberty. Select all checkers so we won't get an incomplete feature
# dump file.
2013-10-02 13:49:08 +04:00
i f e q ( $( config ) , 1 )
2016-01-25 12:55:51 +03:00
i f d e f M A K E C M D G O A L S
i f e q ( $( filter feature -dump ,$ ( MAKECMDGOALS ) ) , f e a t u r e - d u m p )
FEATURE_TESTS := all
e n d i f
e n d i f
2016-08-01 21:02:33 +03:00
i n c l u d e M a k e f i l e . c o n f i g
2013-10-02 13:49:08 +04:00
e n d i f
perf tools: Build syscall table .c header from kernel's syscall_64.tbl
We used libaudit to map ids to syscall names and vice-versa, but that
imposes a delay in supporting new syscalls, having to wait for libaudit
to get those new syscalls on its tables.
To remove that delay, for x86_64 initially, grab a copy of
arch/x86/entry/syscalls/syscall_64.tbl and use it to generate those
tables.
Syscalls currently not available in audit-libs:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
Error: Invalid syscall copy_file_range, membarrier, mlock2, pread64, pwrite64, timerfd_create, userfaultfd
Hint: try 'perf list syscalls:sys_enter_*'
Hint: and: 'man syscalls'
#
With this patch:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
8505.733 ( 0.010 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 36
8506.688 ( 0.005 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 40
30023.097 ( 0.025 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63ae382000, count: 4096, pos: 529592320) = 4096
31268.712 ( 0.028 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afd8b000, count: 4096, pos: 2314133504) = 4096
31268.854 ( 0.016 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afda2000, count: 4096, pos: 2314137600) = 4096
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Wang Nan <wangnan0@huawei.com>
Link: http://lkml.kernel.org/n/tip-51xfjbxevdsucmnbc4ka5r88@git.kernel.org
[ Added make dep for 'prepare' in 'LIBPERF_IN', fix by Wang Nan to fix parallell build ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-04-05 01:05:36 +03:00
i f e q ( $( config ) , 0 )
i n c l u d e $( srctree ) / t o o l s / s c r i p t s / M a k e f i l e . a r c h
- i n c l u d e a r c h / $( ARCH ) / M a k e f i l e
e n d i f
2016-01-15 07:00:18 +03:00
# The FEATURE_DUMP_EXPORT holds location of the actual
# FEATURE_DUMP file to be used to bypass feature detection
# (for bpf or any other subproject)
i f e q ( $( FEATURES_DUMP ) , )
FEATURE_DUMP_EXPORT := $( realpath $( OUTPUT) FEATURE-DUMP)
e l s e
FEATURE_DUMP_EXPORT := $( FEATURES_DUMP)
e n d i f
2013-12-03 17:09:21 +04:00
export prefix bindir sharedir sysconfdir DESTDIR
2013-10-02 13:49:08 +04:00
# sparse is architecture-neutral, which means that we need to tell it
# explicitly what architecture to check for. Fix this up for yours..
SPARSE_FLAGS = -D__BIG_ENDIAN__ -D__powerpc__
# Guard against environment variables
PYRF_OBJS =
SCRIPT_SH =
SCRIPT_SH += perf-archive.sh
2014-07-31 10:01:12 +04:00
SCRIPT_SH += perf-with-kcore.sh
2013-10-02 13:49:08 +04:00
grep-libs = $( filter -l%,$( 1) )
strip-libs = $( filter-out -l%,$( 1) )
i f n e q ( $( OUTPUT ) , )
TE_PATH = $( OUTPUT)
2015-10-14 15:41:12 +03:00
BPF_PATH = $( OUTPUT)
2015-12-15 18:39:39 +03:00
SUBCMD_PATH = $( OUTPUT)
2013-10-02 13:49:08 +04:00
i f n e q ( $( subdir ) , )
2015-12-14 07:18:06 +03:00
API_PATH = $( OUTPUT) /../lib/api/
2013-10-02 13:49:08 +04:00
e l s e
2015-12-14 07:18:06 +03:00
API_PATH = $( OUTPUT)
2013-10-02 13:49:08 +04:00
e n d i f
e l s e
TE_PATH = $( TRACE_EVENT_DIR)
2015-12-14 07:18:06 +03:00
API_PATH = $( LIB_DIR)
2015-10-14 15:41:12 +03:00
BPF_PATH = $( BPF_DIR)
2015-12-15 18:39:39 +03:00
SUBCMD_PATH = $( SUBCMD_DIR)
2013-10-02 13:49:08 +04:00
e n d i f
LIBTRACEEVENT = $( TE_PATH) libtraceevent.a
export LIBTRACEEVENT
2015-05-28 16:28:54 +03:00
LIBTRACEEVENT_DYNAMIC_LIST = $( TE_PATH) libtraceevent-dynamic-list
2015-06-17 17:46:29 +03:00
LIBTRACEEVENT_DYNAMIC_LIST_LDFLAGS = -Xlinker --dynamic-list= $( LIBTRACEEVENT_DYNAMIC_LIST)
2015-05-28 16:28:54 +03:00
2015-12-14 07:18:06 +03:00
LIBAPI = $( API_PATH) libapi.a
2015-01-10 22:53:13 +03:00
export LIBAPI
2013-10-02 13:49:08 +04:00
2015-10-14 15:41:12 +03:00
LIBBPF = $( BPF_PATH) libbpf.a
2015-12-15 18:39:39 +03:00
LIBSUBCMD = $( SUBCMD_PATH) libsubcmd.a
2013-10-02 13:49:08 +04:00
# python extension build directories
PYTHON_EXTBUILD := $( OUTPUT) python_ext_build/
PYTHON_EXTBUILD_LIB := $( PYTHON_EXTBUILD) lib/
PYTHON_EXTBUILD_TMP := $( PYTHON_EXTBUILD) tmp/
export PYTHON_EXTBUILD_LIB PYTHON_EXTBUILD_TMP
2013-10-09 13:49:26 +04:00
python-clean := $( call QUIET_CLEAN, python) $( RM) -r $( PYTHON_EXTBUILD) $( OUTPUT) python/perf.so
2013-10-02 13:49:08 +04:00
PYTHON_EXT_SRCS := $( shell grep -v ^\# util/python-ext-sources)
2015-01-10 22:53:13 +03:00
PYTHON_EXT_DEPS := util/python-ext-sources util/setup.py $( LIBTRACEEVENT) $( LIBAPI)
2013-10-02 13:49:08 +04:00
SCRIPTS = $( patsubst %.sh,%,$( SCRIPT_SH) )
PROGRAMS += $( OUTPUT) perf
2014-10-24 01:16:03 +04:00
i f n d e f N O _ P E R F _ R E A D _ V D S O 3 2
PROGRAMS += $( OUTPUT) perf-read-vdso32
e n d i f
i f n d e f N O _ P E R F _ R E A D _ V D S O X 3 2
PROGRAMS += $( OUTPUT) perf-read-vdsox32
e n d i f
2016-11-02 16:35:49 +03:00
LIBJVMTI = libperf-jvmti.so
i f n d e f N O _ J V M T I
PROGRAMS += $( OUTPUT) $( LIBJVMTI)
e n d i f
2013-10-02 13:49:08 +04:00
# what 'all' will build and 'install' will install, in perfexecdir
ALL_PROGRAMS = $( PROGRAMS) $( SCRIPTS)
# what 'all' will build but not install in perfexecdir
OTHER_PROGRAMS = $( OUTPUT) perf
# Set paths to tools early so that they can be used for version tests.
i f n d e f S H E L L _ P A T H
SHELL_PATH = /bin/sh
e n d i f
i f n d e f P E R L _ P A T H
PERL_PATH = /usr/bin/perl
e n d i f
export PERL_PATH
LIB_FILE = $( OUTPUT) libperf.a
2015-12-15 18:39:39 +03:00
PERFLIBS = $( LIB_FILE) $( LIBAPI) $( LIBTRACEEVENT) $( LIBSUBCMD)
2015-10-14 15:41:12 +03:00
i f n d e f N O _ L I B B P F
PERFLIBS += $( LIBBPF)
e n d i f
2013-10-02 13:49:08 +04:00
# We choose to avoid "if .. else if .. else .. endif endif"
# because maintaining the nesting to match is a pain. If
# we had "elif" things would have been much nicer...
i f n e q ( $( OUTPUT ) , )
CFLAGS += -I$( OUTPUT)
e n d i f
i f n d e f N O _ G T K 2
2013-09-13 10:27:43 +04:00
ALL_PROGRAMS += $( OUTPUT) libperf-gtk.so
2014-12-30 02:34:23 +03:00
GTK_IN := $( OUTPUT) gtk-in.o
2013-10-02 13:49:08 +04:00
e n d i f
i f d e f A S C I I D O C 8
export ASCIIDOC8
e n d i f
LIBS = -Wl,--whole-archive $( PERFLIBS) -Wl,--no-whole-archive -Wl,--start-group $( EXTLIBS) -Wl,--end-group
2016-11-26 10:03:33 +03:00
i f e q ( $( USE_CLANG ) , 1 )
CLANGLIBS_LIST = AST Basic CodeGen Driver Frontend Lex Tooling Edit Sema Analysis Parse Serialization
LIBCLANG = $( foreach l,$( CLANGLIBS_LIST) ,$( wildcard $( shell $( LLVM_CONFIG) --libdir) /libclang$( l) .a) )
LIBS += -Wl,--start-group $( LIBCLANG) -Wl,--end-group
e n d i f
i f e q ( $( USE_LLVM ) , 1 )
LIBLLVM = $( shell $( LLVM_CONFIG) --libs all) $( shell $( LLVM_CONFIG) --system-libs)
LIBS += -L$( shell $( LLVM_CONFIG) --libdir) $( LIBLLVM)
e n d i f
i f e q ( $( USE_CXX ) , 1 )
LIBS += -lstdc++
e n d i f
2013-10-02 13:49:08 +04:00
export INSTALL SHELL_PATH
### Build rules
SHELL = $( SHELL_PATH)
all : shell_compatibility_test $( ALL_PROGRAMS ) $( LANG_BINDINGS ) $( OTHER_PROGRAMS )
2016-12-04 23:42:55 +03:00
$(OUTPUT)python/perf.so : $( PYTHON_EXT_SRCS ) $( PYTHON_EXT_DEPS ) $( LIBTRACEEVENT_DYNAMIC_LIST )
$( QUIET_GEN) LDSHARED = " $( CC) -pthread -shared " \
CFLAGS = '$(CFLAGS)' LDFLAGS = '$(LDFLAGS) $(LIBTRACEEVENT_DYNAMIC_LIST_LDFLAGS)' \
$( PYTHON_WORD) util/setup.py \
--quiet build_ext; \
mkdir -p $( OUTPUT) python && \
cp $( PYTHON_EXTBUILD_LIB) perf.so $( OUTPUT) python/
2013-10-02 13:49:08 +04:00
please_set_SHELL_PATH_to_a_more_modern_shell :
2015-01-12 01:59:55 +03:00
$( Q) $$ ( :)
2013-10-02 13:49:08 +04:00
shell_compatibility_test : please_set_SHELL_PATH_to_a_more_modern_shell
strip : $( PROGRAMS ) $( OUTPUT ) perf
$( STRIP) $( STRIP_OPTS) $( PROGRAMS) $( OUTPUT) perf
2014-12-29 15:52:36 +03:00
PERF_IN := $( OUTPUT) perf-in.o
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
JEVENTS := $( OUTPUT) pmu-events/jevents
JEVENTS_IN := $( OUTPUT) pmu-events/jevents-in.o
PMU_EVENTS_IN := $( OUTPUT) pmu-events/pmu-events-in.o
export JEVENTS
build := -f $( srctree) /tools/build/Makefile.build dir = . obj
2015-09-23 13:34:01 +03:00
$(PERF_IN) : prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) $( build) = perf
2014-12-29 15:52:36 +03:00
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
$(JEVENTS_IN) : FORCE
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = pmu-events obj = jevents
$(JEVENTS) : $( JEVENTS_IN )
$( QUIET_LINK) $( HOSTCC) $( JEVENTS_IN) -o $@
$(PMU_EVENTS_IN) : $( JEVENTS ) FORCE
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = pmu-events obj = pmu-events
$(OUTPUT)perf : $( PERFLIBS ) $( PERF_IN ) $( PMU_EVENTS_IN ) $( LIBTRACEEVENT_DYNAMIC_LIST )
2015-06-17 17:46:29 +03:00
$( QUIET_LINK) $( CC) $( CFLAGS) $( LDFLAGS) $( LIBTRACEEVENT_DYNAMIC_LIST_LDFLAGS) \
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
$( PERF_IN) $( PMU_EVENTS_IN) $( LIBS) -o $@
2013-10-02 13:49:08 +04:00
2016-12-06 16:18:49 +03:00
$(GTK_IN) : FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) $( build) = gtk
2013-09-13 10:27:43 +04:00
2014-12-30 02:34:23 +03:00
$(OUTPUT)libperf-gtk.so : $( GTK_IN ) $( PERFLIBS )
2014-04-27 13:12:21 +04:00
$( QUIET_LINK) $( CC) -o $@ -shared $( LDFLAGS) $( filter %.o,$^) $( GTK_LIBS)
2013-09-13 10:27:43 +04:00
2013-10-02 13:49:08 +04:00
$(OUTPUT)common-cmds.h : util /generate -cmdlist .sh command -list .txt
$(OUTPUT)common-cmds.h : $( wildcard Documentation /perf -*.txt )
$( QUIET_GEN) . util/generate-cmdlist.sh > $@ + && mv $@ + $@
$(SCRIPTS) : % : %.sh
$( QUIET_GEN) $( INSTALL) '$@.sh' '$(OUTPUT)$@'
2016-12-06 16:18:48 +03:00
$(OUTPUT)PERF-VERSION-FILE : ../../.git /HEAD
$( Q) $( SHELL_PATH) util/PERF-VERSION-GEN $( OUTPUT)
$( Q) touch $( OUTPUT) PERF-VERSION-FILE
2013-10-02 13:49:08 +04:00
# These can record PERF_VERSION
2014-12-30 19:09:15 +03:00
p e r f . s p e c $( SCRIPTS ) \
2013-10-02 13:49:08 +04:00
: $( OUTPUT) PERF-VERSION-FILE
.SUFFIXES :
2013-10-04 14:14:59 +04:00
#
# If a target does not match any of the later rules then prefix it by $(OUTPUT)
# This makes targets like 'make O=/tmp/perf perf.o' work in a natural way.
#
i f n e q ( $( OUTPUT ) , )
%.o : $( OUTPUT ) %.o
@echo " # Redirected target $@ => $( OUTPUT) $@ "
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
pmu-events/%.o : $( OUTPUT ) pmu -events /%.o
@echo " # Redirected target $@ => $( OUTPUT) $@ "
2013-10-04 14:14:59 +04:00
util/%.o : $( OUTPUT ) util /%.o
2013-10-10 09:42:56 +04:00
@echo " # Redirected target $@ => $( OUTPUT) $@ "
2013-10-04 14:14:59 +04:00
bench/%.o : $( OUTPUT ) bench /%.o
2013-10-10 09:42:56 +04:00
@echo " # Redirected target $@ => $( OUTPUT) $@ "
2013-10-04 14:14:59 +04:00
tests/%.o : $( OUTPUT ) tests /%.o
2013-10-10 09:42:56 +04:00
@echo " # Redirected target $@ => $( OUTPUT) $@ "
2013-10-04 14:14:59 +04:00
e n d i f
2013-10-02 13:49:08 +04:00
# These two need to be here so that when O= is not used they take precedence
# over the general rule for .o
2014-12-30 20:44:38 +03:00
# get relative building directory (to $(OUTPUT))
# and '.' if it's $(OUTPUT) itself
__build-dir = $( subst $( OUTPUT) ,,$( dir $@ ) )
build-dir = $( if $( __build-dir) ,$( __build-dir) ,.)
2016-12-06 16:18:49 +03:00
prepare : $( OUTPUT ) PERF -VERSION -FILE $( OUTPUT ) common -cmds .h archheaders
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%.o : %.c prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%.i : %.c prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%.s : %.c prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%-bison.o : %.c prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%-flex.o : %.c prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%.o : %.S prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2014-12-30 20:44:38 +03:00
2015-09-23 13:34:01 +03:00
$(OUTPUT)%.i : %.S prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = $( build-dir) $@
2013-10-02 13:49:08 +04:00
$(OUTPUT)perf-% : %.o $( PERFLIBS )
$( QUIET_LINK) $( CC) $( CFLAGS) -o $@ $( LDFLAGS) $( filter %.o,$^) $( LIBS)
2014-10-24 01:16:03 +04:00
i f n d e f N O _ P E R F _ R E A D _ V D S O 3 2
$(OUTPUT)perf-read-vdso32 : perf -read -vdso .c util /find -vdso -map .c
$( QUIET_CC) $( CC) -m32 $( filter -static,$( LDFLAGS) ) -Wall -Werror -o $@ perf-read-vdso.c
e n d i f
i f n d e f N O _ P E R F _ R E A D _ V D S O X 3 2
$(OUTPUT)perf-read-vdsox32 : perf -read -vdso .c util /find -vdso -map .c
$( QUIET_CC) $( CC) -mx32 $( filter -static,$( LDFLAGS) ) -Wall -Werror -o $@ perf-read-vdso.c
e n d i f
2016-11-02 16:35:49 +03:00
i f n d e f N O _ J V M T I
LIBJVMTI_IN := $( OUTPUT) jvmti/jvmti-in.o
$(LIBJVMTI_IN) : FORCE
$( Q) $( MAKE) -f $( srctree) /tools/build/Makefile.build dir = jvmti obj = jvmti
$(OUTPUT)$(LIBJVMTI) : $( LIBJVMTI_IN )
$( QUIET_LINK) $( CC) -shared -Wl,-soname -Wl,$( LIBJVMTI) -o $@ $< -lelf -lrt
e n d i f
2014-12-30 18:51:35 +03:00
$(patsubst perf-%,%.o,$(PROGRAMS)) : $( wildcard */*.h )
2013-10-02 13:49:08 +04:00
2014-12-29 19:42:46 +03:00
LIBPERF_IN := $( OUTPUT) libperf-in.o
2016-12-06 16:18:49 +03:00
$(LIBPERF_IN) : prepare FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) $( build) = libperf
2014-12-29 19:42:46 +03:00
2014-12-30 18:51:35 +03:00
$(LIB_FILE) : $( LIBPERF_IN )
2014-12-29 19:42:46 +03:00
$( QUIET_AR) $( RM) $@ && $( AR) rcs $@ $( LIBPERF_IN) $( LIB_OBJS)
2013-10-02 13:49:08 +04:00
2013-12-03 17:09:21 +04:00
LIBTRACEEVENT_FLAGS += plugin_dir = $( plugindir_SQ)
2016-12-06 16:18:49 +03:00
$(LIBTRACEEVENT) : FORCE
2015-05-28 16:28:54 +03:00
$( Q) $( MAKE) -C $( TRACE_EVENT_DIR) $( LIBTRACEEVENT_FLAGS) O = $( OUTPUT) $( OUTPUT) libtraceevent.a
2016-12-06 16:18:49 +03:00
libtraceevent_plugins : FORCE
2015-05-28 16:28:54 +03:00
$( Q) $( MAKE) -C $( TRACE_EVENT_DIR) $( LIBTRACEEVENT_FLAGS) O = $( OUTPUT) plugins
$(LIBTRACEEVENT_DYNAMIC_LIST) : libtraceevent_plugins
$( Q) $( MAKE) -C $( TRACE_EVENT_DIR) $( LIBTRACEEVENT_FLAGS) O = $( OUTPUT) $( OUTPUT) libtraceevent-dynamic-list
2013-10-02 13:49:08 +04:00
$(LIBTRACEEVENT)-clean :
2013-10-09 13:49:26 +04:00
$( call QUIET_CLEAN, libtraceevent)
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -C $( TRACE_EVENT_DIR) O = $( OUTPUT) clean >/dev/null
2013-10-02 13:49:08 +04:00
2015-11-26 16:54:04 +03:00
install-traceevent-plugins : libtraceevent_plugins
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -C $( TRACE_EVENT_DIR) $( LIBTRACEEVENT_FLAGS) O = $( OUTPUT) install_plugins
2013-12-03 17:09:21 +04:00
2016-12-06 16:18:49 +03:00
$(LIBAPI) : FORCE
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -C $( LIB_DIR) O = $( OUTPUT) $( OUTPUT) libapi.a
2013-10-02 13:49:08 +04:00
2015-01-10 22:53:13 +03:00
$(LIBAPI)-clean :
$( call QUIET_CLEAN, libapi)
2015-01-12 01:59:55 +03:00
$( Q) $( MAKE) -C $( LIB_DIR) O = $( OUTPUT) clean >/dev/null
2013-10-02 13:49:08 +04:00
2016-12-06 16:18:49 +03:00
$(LIBBPF) : FORCE
2016-01-15 07:00:18 +03:00
$( Q) $( MAKE) -C $( BPF_DIR) O = $( OUTPUT) $( OUTPUT) libbpf.a FEATURES_DUMP = $( FEATURE_DUMP_EXPORT)
2015-10-14 15:41:12 +03:00
$(LIBBPF)-clean :
$( call QUIET_CLEAN, libbpf)
$( Q) $( MAKE) -C $( BPF_DIR) O = $( OUTPUT) clean >/dev/null
2016-12-06 16:18:49 +03:00
$(LIBSUBCMD) : FORCE
2015-12-15 18:39:39 +03:00
$( Q) $( MAKE) -C $( SUBCMD_DIR) O = $( OUTPUT) $( OUTPUT) libsubcmd.a
$(LIBSUBCMD)-clean :
$( call QUIET_CLEAN, libsubcmd)
$( Q) $( MAKE) -C $( SUBCMD_DIR) O = $( OUTPUT) clean
2013-10-02 13:49:08 +04:00
help :
@echo 'Perf make targets:'
@echo ' doc - make *all* documentation (see below)'
@echo ' man - make manpage documentation (access with man <foo>)'
@echo ' html - make html documentation'
@echo ' info - make GNU info documentation (access with info <foo>)'
@echo ' pdf - make pdf documentation'
@echo ' TAGS - use etags to make tag information for source browsing'
@echo ' tags - use ctags to make tag information for source browsing'
@echo ' cscope - use cscope to make interactive browsing database'
@echo ''
@echo 'Perf install targets:'
@echo ' NOTE: documentation build requires asciidoc, xmlto packages to be installed'
2014-06-02 20:44:34 +04:00
@echo ' HINT: use "prefix" or "DESTDIR" to install to a particular'
@echo ' path like "make prefix=/usr/local install install-doc"'
2013-10-02 13:49:08 +04:00
@echo ' install - install compiled binaries'
@echo ' install-doc - install *all* documentation'
@echo ' install-man - install manpage documentation'
@echo ' install-html - install html documentation'
@echo ' install-info - install GNU info documentation'
@echo ' install-pdf - install pdf documentation'
@echo ''
@echo ' quick-install-doc - alias for quick-install-man'
@echo ' quick-install-man - install the documentation quickly'
@echo ' quick-install-html - install the html documentation quickly'
@echo ''
@echo 'Perf maintainer targets:'
@echo ' clean - clean all binary objects and build output'
DOC_TARGETS := doc man html info pdf
INSTALL_DOC_TARGETS := $( patsubst %,install-%,$( DOC_TARGETS) ) try-install-man
INSTALL_DOC_TARGETS += quick-install-doc quick-install-man quick-install-html
# 'make doc' should call 'make -C Documentation all'
$(DOC_TARGETS) :
$( QUIET_SUBDIR0) Documentation $( QUIET_SUBDIR1) $( @:doc= all)
2016-01-07 12:14:06 +03:00
TAG_FOLDERS = . ../lib ../include
2014-05-28 12:19:18 +04:00
TAG_FILES = ../../include/uapi/linux/perf_event.h
2013-10-02 13:49:08 +04:00
TAGS :
2014-06-04 16:43:58 +04:00
$( QUIET_GEN) $( RM) TAGS; \
2014-05-28 12:19:18 +04:00
$( FIND) $( TAG_FOLDERS) -name '*.[hcS]' -print | xargs etags -a $( TAG_FILES)
2013-10-02 13:49:08 +04:00
tags :
2014-06-04 16:43:58 +04:00
$( QUIET_GEN) $( RM) tags; \
2014-05-28 12:19:18 +04:00
$( FIND) $( TAG_FOLDERS) -name '*.[hcS]' -print | xargs ctags -a $( TAG_FILES)
2013-10-02 13:49:08 +04:00
cscope :
2014-06-04 16:43:58 +04:00
$( QUIET_GEN) $( RM) cscope*; \
2014-05-28 12:19:18 +04:00
$( FIND) $( TAG_FOLDERS) -name '*.[hcS]' -print | xargs cscope -b $( TAG_FILES)
2013-10-02 13:49:08 +04:00
### Testing rules
# GNU make supports exporting all variables by "export" without parameters.
# However, the environment gets quite big, and some programs have problems
# with that.
check : $( OUTPUT ) common -cmds .h
if sparse; \
then \
for i in *.c */*.c; \
do \
sparse $( CFLAGS) $( SPARSE_FLAGS) $$ i || exit; \
done ; \
else \
exit 1; \
fi
### Installation rules
2016-12-04 23:42:54 +03:00
i f n d e f N O _ G T K 2
install-gtk : $( OUTPUT ) libperf -gtk .so
$( call QUIET_INSTALL, 'GTK UI' ) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(libdir_SQ)' ; \
$( INSTALL) $( OUTPUT) libperf-gtk.so '$(DESTDIR_SQ)$(libdir_SQ)'
e l s e
2013-09-13 10:27:43 +04:00
install-gtk :
2016-12-04 23:42:54 +03:00
e n d i f
2013-09-13 10:27:43 +04:00
2015-05-18 21:37:27 +03:00
install-tools : all install -gtk
2013-10-09 13:49:29 +04:00
$( call QUIET_INSTALL, binaries) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(bindir_SQ)' ; \
$( INSTALL) $( OUTPUT) perf '$(DESTDIR_SQ)$(bindir_SQ)' ; \
$( LN) '$(DESTDIR_SQ)$(bindir_SQ)/perf' '$(DESTDIR_SQ)$(bindir_SQ)/trace'
2014-10-24 01:16:03 +04:00
i f n d e f N O _ P E R F _ R E A D _ V D S O 3 2
$( call QUIET_INSTALL, perf-read-vdso32) \
$( INSTALL) $( OUTPUT) perf-read-vdso32 '$(DESTDIR_SQ)$(bindir_SQ)' ;
e n d i f
i f n d e f N O _ P E R F _ R E A D _ V D S O X 3 2
$( call QUIET_INSTALL, perf-read-vdsox32) \
$( INSTALL) $( OUTPUT) perf-read-vdsox32 '$(DESTDIR_SQ)$(bindir_SQ)' ;
2016-11-02 16:35:49 +03:00
e n d i f
i f n d e f N O _ J V M T I
$( call QUIET_INSTALL, $( LIBJVMTI) ) \
$( INSTALL) $( OUTPUT) $( LIBJVMTI) '$(DESTDIR_SQ)$(libdir_SQ)' ;
2014-10-24 01:16:03 +04:00
e n d i f
2013-10-09 13:49:29 +04:00
$( call QUIET_INSTALL, libexec) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)'
$( call QUIET_INSTALL, perf-archive) \
$( INSTALL) $( OUTPUT) perf-archive -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)'
2014-07-31 10:01:12 +04:00
$( call QUIET_INSTALL, perf-with-kcore) \
$( INSTALL) $( OUTPUT) perf-with-kcore -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)'
perf trace: Support 'strace' syscall event groups
I.e.:
$ cat ~/share/perf-core/strace/groups/file
access
chmod
creat
execve
faccessat
getcwd
lstat
mkdir
open
openat
quotactl
readlink
rename
rmdir
stat
statfs
symlink
unlink
$
Then, on a quiet desktop, try running this and then moving your mouse to
see the deluge of mouse related activity:
# perf probe 'vfs_getname=getname_flags:72 pathname=filename:string'
Added new event:
probe:vfs_getname (on getname_flags:72 with pathname=filename:string)
You can now use it in all perf tools, such as:
perf record -e probe:vfs_getname -aR sleep 1
#
# trace --ev probe:vfs_getname --filter-pids 2232 -e file
0.042 (0.042 ms): mousetweaks/2235 open(filename: 0x14e3910, mode: 438 ) ...
0.042 ( ): probe:vfs_getname:(ffffffff812230bc) pathname="/home/acme/.icons/Adwaita/cursors/xterm")
0.100 (0.100 ms): mousetweaks/2235 ... [continued]: open()) = -1 ENOENT No such file or directory
0.142 (0.018 ms): mousetweaks/2235 open(filename: 0x14c3c10, mode: 438 ) ...
0.142 ( ): probe:vfs_getname:(ffffffff812230bc) pathname="/home/acme/.icons/Adwaita/index.theme")
0.192 (0.069 ms): mousetweaks/2235 ... [continued]: open()) = -1 ENOENT No such file or directory
0.230 (0.017 ms): mousetweaks/2235 open(filename: 0x14c3c10, mode: 438 ) ...
0.230 ( ): probe:vfs_getname:(ffffffff812230bc) pathname="/usr/share/icons/Adwaita/cursors/xterm")
0.253 (0.041 ms): mousetweaks/2235 ... [continued]: open()) = 14
0.459 (0.008 ms): mousetweaks/2235 open(filename: 0x14e3910, mode: 438 ) ...
0.459 ( ): probe:vfs_getname:(ffffffff812230bc) pathname="/home/acme/.icons/Adwaita/cursors/left_side")
0.468 (0.017 ms): mousetweaks/2235 ... [continued]: open()) = -1 ENOENT No such file or directory
Need to combine that raw_syscalls:sys_enter(open) + probe:vfs_getname +
raw_syscalls:sys_exit(open) sequence...
Now, if you're bored, please write some more syscall groups, like the ones
in 'strace' and send it our way :-)
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Borislav Petkov <bp@suse.de>
Cc: David Ahern <dsahern@gmail.com>
Cc: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Jiri Olsa <jolsa@redhat.com>
Cc: Milian Wolff <mail@milianw.de>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Stephane Eranian <eranian@google.com>
Link: http://lkml.kernel.org/n/tip-a42xklu59lcbxp7bbnic74a8@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2015-07-20 18:02:09 +03:00
i f n d e f N O _ L I B A U D I T
$( call QUIET_INSTALL, strace/groups) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(STRACE_GROUPS_INSTDIR_SQ)' ; \
$( INSTALL) trace/strace/groups/* -t '$(DESTDIR_SQ)$(STRACE_GROUPS_INSTDIR_SQ)'
e n d i f
2013-10-02 13:49:08 +04:00
i f n d e f N O _ L I B P E R L
2013-10-09 13:49:29 +04:00
$( call QUIET_INSTALL, perl-scripts) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/perl/Perf-Trace-Util/lib/Perf/Trace' ; \
$( INSTALL) scripts/perl/Perf-Trace-Util/lib/Perf/Trace/* -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/perl/Perf-Trace-Util/lib/Perf/Trace' ; \
$( INSTALL) scripts/perl/*.pl -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/perl' ; \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/perl/bin' ; \
$( INSTALL) scripts/perl/bin/* -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/perl/bin'
2013-10-02 13:49:08 +04:00
e n d i f
i f n d e f N O _ L I B P Y T H O N
2013-10-09 13:49:29 +04:00
$( call QUIET_INSTALL, python-scripts) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/python/Perf-Trace-Util/lib/Perf/Trace' ; \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/python/bin' ; \
$( INSTALL) scripts/python/Perf-Trace-Util/lib/Perf/Trace/* -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/python/Perf-Trace-Util/lib/Perf/Trace' ; \
$( INSTALL) scripts/python/*.py -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/python' ; \
$( INSTALL) scripts/python/bin/* -t '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/scripts/python/bin'
2013-10-02 13:49:08 +04:00
e n d i f
2013-11-17 20:13:27 +04:00
$( call QUIET_INSTALL, perf_completion-script) \
2013-10-09 13:49:29 +04:00
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(sysconfdir_SQ)/bash_completion.d' ; \
2013-11-17 20:13:27 +04:00
$( INSTALL) perf-completion.sh '$(DESTDIR_SQ)$(sysconfdir_SQ)/bash_completion.d/perf'
2016-01-07 14:41:53 +03:00
$( call QUIET_INSTALL, perf-tip) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(tip_instdir_SQ)' ; \
$( INSTALL) Documentation/tips.txt -t '$(DESTDIR_SQ)$(tip_instdir_SQ)'
2015-05-18 21:37:27 +03:00
install-tests : all install -gtk
2013-10-09 13:49:29 +04:00
$( call QUIET_INSTALL, tests) \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/tests' ; \
$( INSTALL) tests/attr.py '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/tests' ; \
$( INSTALL) -d -m 755 '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/tests/attr' ; \
$( INSTALL) tests/attr/* '$(DESTDIR_SQ)$(perfexec_instdir_SQ)/tests/attr'
2013-10-02 13:49:08 +04:00
2015-05-18 21:37:27 +03:00
install-bin : install -tools install -tests
2013-12-03 17:09:21 +04:00
install : install -bin try -install -man install -traceevent -plugins
2013-10-02 13:49:08 +04:00
install-python_ext :
$( PYTHON_WORD) util/setup.py --quiet install --root= '/$(DESTDIR_SQ)'
# 'make install-doc' should call 'make -C Documentation install'
$(INSTALL_DOC_TARGETS) :
$( QUIET_SUBDIR0) Documentation $( QUIET_SUBDIR1) $( @:-doc= )
### Cleaning rules
#
2016-08-01 21:02:33 +03:00
# This is here, not in Makefile.config, because Makefile.config does
2013-10-02 13:49:08 +04:00
# not get included for the clean target:
#
config-clean :
2013-10-09 13:49:26 +04:00
$( call QUIET_CLEAN, config)
2015-12-14 07:18:05 +03:00
$( Q) $( MAKE) -C $( srctree) /tools/build/feature/ $( if $( OUTPUT) ,OUTPUT= $( OUTPUT) feature/,) clean >/dev/null
2013-10-02 13:49:08 +04:00
perf tools: Build syscall table .c header from kernel's syscall_64.tbl
We used libaudit to map ids to syscall names and vice-versa, but that
imposes a delay in supporting new syscalls, having to wait for libaudit
to get those new syscalls on its tables.
To remove that delay, for x86_64 initially, grab a copy of
arch/x86/entry/syscalls/syscall_64.tbl and use it to generate those
tables.
Syscalls currently not available in audit-libs:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
Error: Invalid syscall copy_file_range, membarrier, mlock2, pread64, pwrite64, timerfd_create, userfaultfd
Hint: try 'perf list syscalls:sys_enter_*'
Hint: and: 'man syscalls'
#
With this patch:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
8505.733 ( 0.010 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 36
8506.688 ( 0.005 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 40
30023.097 ( 0.025 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63ae382000, count: 4096, pos: 529592320) = 4096
31268.712 ( 0.028 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afd8b000, count: 4096, pos: 2314133504) = 4096
31268.854 ( 0.016 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afda2000, count: 4096, pos: 2314137600) = 4096
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Wang Nan <wangnan0@huawei.com>
Link: http://lkml.kernel.org/n/tip-51xfjbxevdsucmnbc4ka5r88@git.kernel.org
[ Added make dep for 'prepare' in 'LIBPERF_IN', fix by Wang Nan to fix parallell build ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-04-05 01:05:36 +03:00
clean :: $( LIBTRACEEVENT ) -clean $( LIBAPI ) -clean $( LIBBPF ) -clean $( LIBSUBCMD ) -clean config -clean
2014-12-30 18:51:35 +03:00
$( call QUIET_CLEAN, core-objs) $( RM) $( LIB_FILE) $( OUTPUT) perf-archive $( OUTPUT) perf-with-kcore $( LANG_BINDINGS)
2015-12-14 07:18:05 +03:00
$( Q) find $( if $( OUTPUT) ,$( OUTPUT) ,.) -name '*.o' -delete -o -name '\.*.cmd' -delete -o -name '\.*.d' -delete
2015-07-01 14:54:42 +03:00
$( Q) $( RM) $( OUTPUT) .config-detected
2016-11-02 16:35:49 +03:00
$( call QUIET_CLEAN, core-progs) $( RM) $( ALL_PROGRAMS) perf perf-read-vdso32 perf-read-vdsox32 $( OUTPUT) pmu-events/jevents $( OUTPUT) $( LIBJVMTI) .so
2015-08-13 10:14:55 +03:00
$( call QUIET_CLEAN, core-gen) $( RM) *.spec *.pyc *.pyo */*.pyc */*.pyo $( OUTPUT) common-cmds.h TAGS tags cscope* $( OUTPUT) PERF-VERSION-FILE $( OUTPUT) FEATURE-DUMP $( OUTPUT) util/*-bison* $( OUTPUT) util/*-flex* \
2015-12-14 07:18:05 +03:00
$( OUTPUT) util/intel-pt-decoder/inat-tables.c $( OUTPUT) fixdep \
perf jevents: Program to convert JSON file
This is a modified version of an earlier patch by Andi Kleen.
We expect architectures to create JSON files describing the performance
monitoring (PMU) events that each CPU model/family of the architecture
supports.
Following is an example of the JSON file entry for an x86 event:
[
...
{
"EventCode": "0x00",
"UMask": "0x01",
"EventName": "INST_RETIRED.ANY",
"BriefDescription": "Instructions retired from execution.",
"PublicDescription": "Instructions retired from execution.",
"Counter": "Fixed counter 1",
"CounterHTOff": "Fixed counter 1",
"SampleAfterValue": "2000003",
"SampleAfterValue": "2000003",
"MSRIndex": "0",
"MSRValue": "0",
"TakenAlone": "0",
"CounterMask": "0",
"Invert": "0",
"AnyThread": "0",
"EdgeDetect": "0",
"PEBS": "0",
"PRECISE_STORE": "0",
"Errata": "null",
"Offcore": "0"
},
...
]
All the PMU events supported by a CPU model/family must be grouped into
"topics" such as "Pipelining", "Floating-point", "Virtual-memory" etc.
All events belonging to a topic must be placed in a separate JSON file
(eg: "Pipelining.json") and all the topic JSON files for a CPU model must
be in a separate directory.
Eg: for the CPU model "Silvermont_core":
$ ls tools/perf/pmu-events/arch/x86/Silvermont_core
Floating-point.json
Memory.json
Other.json
Pipelining.json
Virtualmemory.json
Finally, to allow multiple CPU models to share a single set of JSON files,
architectures must provide a mapping between a model and its set of events:
$ grep Silvermont tools/perf/pmu-events/arch/x86/mapfile.csv
GenuineIntel-6-4D,V13,Silvermont_core,core
GenuineIntel-6-4C,V13,Silvermont_core,core
which maps each CPU, identified by [vendor, family, model, version, type]
to a directory of JSON files. Thus two (or more) CPU models support the
set of PMU events listed in the directory.
tools/perf/pmu-events/arch/x86/Silvermont_core/
Given this organization of files, the program, jevents:
- locates all JSON files for each CPU-model of the architecture,
- parses all JSON files for the CPU-model and generates a C-style
"PMU-events table" (pmu-events.c) for the model
- locates a mapfile for the architecture
- builds a global table, mapping each model of CPU to the corresponding
PMU-events table.
The 'pmu-events.c' is generated when building perf and added to libperf.a.
The global table pmu_events_map[] table in this pmu-events.c will be used
in perf in a follow-on patch.
If the architecture does not have any JSON files or there is an error in
processing them, an empty mapping file is created. This would allow the
build of perf to proceed even if we are not able to provide aliases for
events.
The parser for JSON files allows parsing Intel style JSON event files. This
allows to use an Intel event list directly with perf. The Intel event lists
can be quite large and are too big to store in unswappable kernel memory.
The conversion from JSON to C-style is straight forward. The parser knows
(very little) Intel specific information, and can be easily extended to
handle fields for other CPUs.
The parser code is partially shared with an independent parsing library,
which is 2-clause BSD licensed. To avoid any conflicts I marked those
files as BSD licensed too. As part of perf they become GPLv2.
Committer notes:
Fixes:
1) Limit maxfds to 512 to avoid nftd() segfaulting on alloca() with a
big rlim_max, as in docker containers - acme
2) Make jevents a hostprog, supporting cross compilation - jolsa
3) Use HOSTCC for jevents final step - acme
4) Define _GNU_SOURCE for asprintf, as we can't use CC's EXTRA_CFLAGS,
that has to have --sysroot on the Android NDK 24 - acme
5) Removed $(srctree)/tools/perf/pmu-events/pmu-events.c from the
'clean' target, it is generated on $(OUTPUT)pmu-events/pmu-events.c,
which is already taken care of in the original patch - acme
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Signed-off-by: Jiri Olsa <jolsa@redhat.com>
Signed-off-by: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Acked-by: Ingo Molnar <mingo@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
Cc: linuxppc-dev@lists.ozlabs.org
Link: http://lkml.kernel.org/r/1473978296-20712-3-git-send-email-sukadev@linux.vnet.ibm.com
Link: http://lkml.kernel.org/r/20160927141846.GA6589@krava
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-09-19 23:39:33 +03:00
$( OUTPUT) tests/llvm-src-{ base,kbuild,prologue,relocation} .c \
$( OUTPUT) pmu-events/pmu-events.c
2013-12-09 20:14:23 +04:00
$( QUIET_SUBDIR0) Documentation $( QUIET_SUBDIR1) clean
2013-10-02 13:49:08 +04:00
$( python-clean)
2016-01-15 07:00:17 +03:00
#
# To provide FEATURE-DUMP into $(FEATURE_DUMP_COPY)
# file if defined, with no further action.
feature-dump :
i f d e f F E A T U R E _ D U M P _ C O P Y
@cp $( OUTPUT) FEATURE-DUMP $( FEATURE_DUMP_COPY)
@echo " FEATURE-DUMP file copied into $( FEATURE_DUMP_COPY) "
e l s e
@echo " FEATURE-DUMP file available in $( OUTPUT) FEATURE-DUMP "
e n d i f
2013-10-02 13:49:08 +04:00
#
# Trick: if ../../.git does not exist - we are building out of tree for example,
# then force version regeneration:
#
i f e q ( $( wildcard ../../.git /HEAD ) , )
GIT-HEAD-PHONY = ../../.git/HEAD
e l s e
GIT-HEAD-PHONY =
e n d i f
2014-12-29 15:52:36 +03:00
FORCE :
2013-09-13 10:27:43 +04:00
.PHONY : all install clean config -clean strip install -gtk
2013-10-02 13:49:08 +04:00
.PHONY : shell_compatibility_test please_set_SHELL_PATH_to_a_more_modern_shell
2015-09-23 13:34:01 +03:00
.PHONY : $( GIT -HEAD -PHONY ) TAGS tags cscope FORCE prepare
perf tools: Build syscall table .c header from kernel's syscall_64.tbl
We used libaudit to map ids to syscall names and vice-versa, but that
imposes a delay in supporting new syscalls, having to wait for libaudit
to get those new syscalls on its tables.
To remove that delay, for x86_64 initially, grab a copy of
arch/x86/entry/syscalls/syscall_64.tbl and use it to generate those
tables.
Syscalls currently not available in audit-libs:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
Error: Invalid syscall copy_file_range, membarrier, mlock2, pread64, pwrite64, timerfd_create, userfaultfd
Hint: try 'perf list syscalls:sys_enter_*'
Hint: and: 'man syscalls'
#
With this patch:
# trace -e copy_file_range,membarrier,mlock2,pread64,pwrite64,timerfd_create,userfaultfd
8505.733 ( 0.010 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 36
8506.688 ( 0.005 ms): gnome-shell/2519 timerfd_create(flags: 524288) = 40
30023.097 ( 0.025 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63ae382000, count: 4096, pos: 529592320) = 4096
31268.712 ( 0.028 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afd8b000, count: 4096, pos: 2314133504) = 4096
31268.854 ( 0.016 ms): qemu-system-x8/24629 pwrite64(fd: 18, buf: 0x7f63afda2000, count: 4096, pos: 2314137600) = 4096
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: David Ahern <dsahern@gmail.com>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Namhyung Kim <namhyung@kernel.org>
Cc: Wang Nan <wangnan0@huawei.com>
Link: http://lkml.kernel.org/n/tip-51xfjbxevdsucmnbc4ka5r88@git.kernel.org
[ Added make dep for 'prepare' in 'LIBPERF_IN', fix by Wang Nan to fix parallell build ]
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
2016-04-05 01:05:36 +03:00
.PHONY : libtraceevent_plugins archheaders
2013-10-02 13:49:08 +04:00
2016-12-06 16:18:49 +03:00
e n d i f # force_fixdep