1Compile-time stack metadata validation
2======================================
3
4
5Overview
6--------
7
8The kernel CONFIG_STACK_VALIDATION option enables a host tool named
9objtool which runs at compile time.  It has a "check" subcommand which
10analyzes every .o file and ensures the validity of its stack metadata.
11It enforces a set of rules on asm code and C inline assembly code so
12that stack traces can be reliable.
13
14For each function, it recursively follows all possible code paths and
15validates the correct frame pointer state at each instruction.
16
17It also follows code paths involving special sections, like
18.altinstructions, __jump_table, and __ex_table, which can add
19alternative execution paths to a given instruction (or set of
20instructions).  Similarly, it knows how to follow switch statements, for
21which gcc sometimes uses jump tables.
22
23(Objtool also has an 'orc generate' subcommand which generates debuginfo
24for the ORC unwinder.  See Documentation/x86/orc-unwinder.txt in the
25kernel tree for more details.)
26
27
28Why do we need stack metadata validation?
29-----------------------------------------
30
31Here are some of the benefits of validating stack metadata:
32
33a) More reliable stack traces for frame pointer enabled kernels
34
35   Frame pointers are used for debugging purposes.  They allow runtime
36   code and debug tools to be able to walk the stack to determine the
37   chain of function call sites that led to the currently executing
38   code.
39
40   For some architectures, frame pointers are enabled by
41   CONFIG_FRAME_POINTER.  For some other architectures they may be
42   required by the ABI (sometimes referred to as "backchain pointers").
43
44   For C code, gcc automatically generates instructions for setting up
45   frame pointers when the -fno-omit-frame-pointer option is used.
46
47   But for asm code, the frame setup instructions have to be written by
48   hand, which most people don't do.  So the end result is that
49   CONFIG_FRAME_POINTER is honored for C code but not for most asm code.
50
51   For stack traces based on frame pointers to be reliable, all
52   functions which call other functions must first create a stack frame
53   and update the frame pointer.  If a first function doesn't properly
54   create a stack frame before calling a second function, the *caller*
55   of the first function will be skipped on the stack trace.
56
57   For example, consider the following example backtrace with frame
58   pointers enabled:
59
60     [<ffffffff81812584>] dump_stack+0x4b/0x63
61     [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
62     [<ffffffff8127f568>] seq_read+0x108/0x3e0
63     [<ffffffff812cce62>] proc_reg_read+0x42/0x70
64     [<ffffffff81256197>] __vfs_read+0x37/0x100
65     [<ffffffff81256b16>] vfs_read+0x86/0x130
66     [<ffffffff81257898>] SyS_read+0x58/0xd0
67     [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
68
69   It correctly shows that the caller of cmdline_proc_show() is
70   seq_read().
71
72   If we remove the frame pointer logic from cmdline_proc_show() by
73   replacing the frame pointer related instructions with nops, here's
74   what it looks like instead:
75
76     [<ffffffff81812584>] dump_stack+0x4b/0x63
77     [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
78     [<ffffffff812cce62>] proc_reg_read+0x42/0x70
79     [<ffffffff81256197>] __vfs_read+0x37/0x100
80     [<ffffffff81256b16>] vfs_read+0x86/0x130
81     [<ffffffff81257898>] SyS_read+0x58/0xd0
82     [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
83
84   Notice that cmdline_proc_show()'s caller, seq_read(), has been
85   skipped.  Instead the stack trace seems to show that
86   cmdline_proc_show() was called by proc_reg_read().
87
88   The benefit of objtool here is that because it ensures that *all*
89   functions honor CONFIG_FRAME_POINTER, no functions will ever[*] be
90   skipped on a stack trace.
91
92   [*] unless an interrupt or exception has occurred at the very
93       beginning of a function before the stack frame has been created,
94       or at the very end of the function after the stack frame has been
95       destroyed.  This is an inherent limitation of frame pointers.
96
97b) ORC (Oops Rewind Capability) unwind table generation
98
99   An alternative to frame pointers and DWARF, ORC unwind data can be
100   used to walk the stack.  Unlike frame pointers, ORC data is out of
101   band.  So it doesn't affect runtime performance and it can be
102   reliable even when interrupts or exceptions are involved.
103
104   For more details, see Documentation/x86/orc-unwinder.txt.
105
106c) Higher live patching compatibility rate
107
108   Livepatch has an optional "consistency model", which is needed for
109   more complex patches.  In order for the consistency model to work,
110   stack traces need to be reliable (or an unreliable condition needs to
111   be detectable).  Objtool makes that possible.
112
113   For more details, see the livepatch documentation in the Linux kernel
114   source tree at Documentation/livepatch/livepatch.txt.
115
116Rules
117-----
118
119To achieve the validation, objtool enforces the following rules:
120
1211. Each callable function must be annotated as such with the ELF
122   function type.  In asm code, this is typically done using the
123   ENTRY/ENDPROC macros.  If objtool finds a return instruction
124   outside of a function, it flags an error since that usually indicates
125   callable code which should be annotated accordingly.
126
127   This rule is needed so that objtool can properly identify each
128   callable function in order to analyze its stack metadata.
129
1302. Conversely, each section of code which is *not* callable should *not*
131   be annotated as an ELF function.  The ENDPROC macro shouldn't be used
132   in this case.
133
134   This rule is needed so that objtool can ignore non-callable code.
135   Such code doesn't have to follow any of the other rules.
136
1373. Each callable function which calls another function must have the
138   correct frame pointer logic, if required by CONFIG_FRAME_POINTER or
139   the architecture's back chain rules.  This can by done in asm code
140   with the FRAME_BEGIN/FRAME_END macros.
141
142   This rule ensures that frame pointer based stack traces will work as
143   designed.  If function A doesn't create a stack frame before calling
144   function B, the _caller_ of function A will be skipped on the stack
145   trace.
146
1474. Dynamic jumps and jumps to undefined symbols are only allowed if:
148
149   a) the jump is part of a switch statement; or
150
151   b) the jump matches sibling call semantics and the frame pointer has
152      the same value it had on function entry.
153
154   This rule is needed so that objtool can reliably analyze all of a
155   function's code paths.  If a function jumps to code in another file,
156   and it's not a sibling call, objtool has no way to follow the jump
157   because it only analyzes a single file at a time.
158
1595. A callable function may not execute kernel entry/exit instructions.
160   The only code which needs such instructions is kernel entry code,
161   which shouldn't be be in callable functions anyway.
162
163   This rule is just a sanity check to ensure that callable functions
164   return normally.
165
166
167Objtool warnings
168----------------
169
170For asm files, if you're getting an error which doesn't make sense,
171first make sure that the affected code follows the above rules.
172
173For C files, the common culprits are inline asm statements and calls to
174"noreturn" functions.  See below for more details.
175
176Another possible cause for errors in C code is if the Makefile removes
177-fno-omit-frame-pointer or adds -fomit-frame-pointer to the gcc options.
178
179Here are some examples of common warnings reported by objtool, what
180they mean, and suggestions for how to fix them.
181
182
1831. file.o: warning: objtool: func()+0x128: call without frame pointer save/setup
184
185   The func() function made a function call without first saving and/or
186   updating the frame pointer, and CONFIG_FRAME_POINTER is enabled.
187
188   If the error is for an asm file, and func() is indeed a callable
189   function, add proper frame pointer logic using the FRAME_BEGIN and
190   FRAME_END macros.  Otherwise, if it's not a callable function, remove
191   its ELF function annotation by changing ENDPROC to END, and instead
192   use the manual unwind hint macros in asm/unwind_hints.h.
193
194   If it's a GCC-compiled .c file, the error may be because the function
195   uses an inline asm() statement which has a "call" instruction.  An
196   asm() statement with a call instruction must declare the use of the
197   stack pointer in its output operand.  On x86_64, this means adding
198   the ASM_CALL_CONSTRAINT as an output constraint:
199
200     asm volatile("call func" : ASM_CALL_CONSTRAINT);
201
202   Otherwise the stack frame may not get created before the call.
203
204
2052. file.o: warning: objtool: .text+0x53: unreachable instruction
206
207   Objtool couldn't find a code path to reach the instruction.
208
209   If the error is for an asm file, and the instruction is inside (or
210   reachable from) a callable function, the function should be annotated
211   with the ENTRY/ENDPROC macros (ENDPROC is the important one).
212   Otherwise, the code should probably be annotated with the unwind hint
213   macros in asm/unwind_hints.h so objtool and the unwinder can know the
214   stack state associated with the code.
215
216   If you're 100% sure the code won't affect stack traces, or if you're
217   a just a bad person, you can tell objtool to ignore it.  See the
218   "Adding exceptions" section below.
219
220   If it's not actually in a callable function (e.g. kernel entry code),
221   change ENDPROC to END.
222
223
2244. file.o: warning: objtool: func(): can't find starting instruction
225   or
226   file.o: warning: objtool: func()+0x11dd: can't decode instruction
227
228   Does the file have data in a text section?  If so, that can confuse
229   objtool's instruction decoder.  Move the data to a more appropriate
230   section like .data or .rodata.
231
232
2335. file.o: warning: objtool: func()+0x6: unsupported instruction in callable function
234
235   This is a kernel entry/exit instruction like sysenter or iret.  Such
236   instructions aren't allowed in a callable function, and are most
237   likely part of the kernel entry code.  They should usually not have
238   the callable function annotation (ENDPROC) and should always be
239   annotated with the unwind hint macros in asm/unwind_hints.h.
240
241
2426. file.o: warning: objtool: func()+0x26: sibling call from callable instruction with modified stack frame
243
244   This is a dynamic jump or a jump to an undefined symbol.  Objtool
245   assumed it's a sibling call and detected that the frame pointer
246   wasn't first restored to its original state.
247
248   If it's not really a sibling call, you may need to move the
249   destination code to the local file.
250
251   If the instruction is not actually in a callable function (e.g.
252   kernel entry code), change ENDPROC to END and annotate manually with
253   the unwind hint macros in asm/unwind_hints.h.
254
255
2567. file: warning: objtool: func()+0x5c: stack state mismatch
257
258   The instruction's frame pointer state is inconsistent, depending on
259   which execution path was taken to reach the instruction.
260
261   Make sure that, when CONFIG_FRAME_POINTER is enabled, the function
262   pushes and sets up the frame pointer (for x86_64, this means rbp) at
263   the beginning of the function and pops it at the end of the function.
264   Also make sure that no other code in the function touches the frame
265   pointer.
266
267   Another possibility is that the code has some asm or inline asm which
268   does some unusual things to the stack or the frame pointer.  In such
269   cases it's probably appropriate to use the unwind hint macros in
270   asm/unwind_hints.h.
271
272
2738. file.o: warning: objtool: funcA() falls through to next function funcB()
274
275   This means that funcA() doesn't end with a return instruction or an
276   unconditional jump, and that objtool has determined that the function
277   can fall through into the next function.  There could be different
278   reasons for this:
279
280   1) funcA()'s last instruction is a call to a "noreturn" function like
281      panic().  In this case the noreturn function needs to be added to
282      objtool's hard-coded global_noreturns array.  Feel free to bug the
283      objtool maintainer, or you can submit a patch.
284
285   2) funcA() uses the unreachable() annotation in a section of code
286      that is actually reachable.
287
288   3) If funcA() calls an inline function, the object code for funcA()
289      might be corrupt due to a gcc bug.  For more details, see:
290      https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70646
291
292
293If the error doesn't seem to make sense, it could be a bug in objtool.
294Feel free to ask the objtool maintainer for help.
295
296
297Adding exceptions
298-----------------
299
300If you _really_ need objtool to ignore something, and are 100% sure
301that it won't affect kernel stack traces, you can tell objtool to
302ignore it:
303
304- To skip validation of a function, use the STACK_FRAME_NON_STANDARD
305  macro.
306
307- To skip validation of a file, add
308
309    OBJECT_FILES_NON_STANDARD_filename.o := n
310
311  to the Makefile.
312
313- To skip validation of a directory, add
314
315    OBJECT_FILES_NON_STANDARD := y
316
317  to the Makefile.
318