Linux Audio

Check our new training course

Loading...
Note: File does not exist in v3.1.
  1Compile-time stack metadata validation
  2======================================
  3
  4
  5Overview
  6--------
  7
  8The kernel CONFIG_STACK_VALIDATION option enables a host tool named
  9objtool which runs at compile time.  It has a "check" subcommand which
 10analyzes every .o file and ensures the validity of its stack metadata.
 11It enforces a set of rules on asm code and C inline assembly code so
 12that stack traces can be reliable.
 13
 14Currently it only checks frame pointer usage, but there are plans to add
 15CFI validation for C files and CFI generation for asm files.
 16
 17For each function, it recursively follows all possible code paths and
 18validates the correct frame pointer state at each instruction.
 19
 20It also follows code paths involving special sections, like
 21.altinstructions, __jump_table, and __ex_table, which can add
 22alternative execution paths to a given instruction (or set of
 23instructions).  Similarly, it knows how to follow switch statements, for
 24which gcc sometimes uses jump tables.
 25
 26
 27Why do we need stack metadata validation?
 28-----------------------------------------
 29
 30Here are some of the benefits of validating stack metadata:
 31
 32a) More reliable stack traces for frame pointer enabled kernels
 33
 34   Frame pointers are used for debugging purposes.  They allow runtime
 35   code and debug tools to be able to walk the stack to determine the
 36   chain of function call sites that led to the currently executing
 37   code.
 38
 39   For some architectures, frame pointers are enabled by
 40   CONFIG_FRAME_POINTER.  For some other architectures they may be
 41   required by the ABI (sometimes referred to as "backchain pointers").
 42
 43   For C code, gcc automatically generates instructions for setting up
 44   frame pointers when the -fno-omit-frame-pointer option is used.
 45
 46   But for asm code, the frame setup instructions have to be written by
 47   hand, which most people don't do.  So the end result is that
 48   CONFIG_FRAME_POINTER is honored for C code but not for most asm code.
 49
 50   For stack traces based on frame pointers to be reliable, all
 51   functions which call other functions must first create a stack frame
 52   and update the frame pointer.  If a first function doesn't properly
 53   create a stack frame before calling a second function, the *caller*
 54   of the first function will be skipped on the stack trace.
 55
 56   For example, consider the following example backtrace with frame
 57   pointers enabled:
 58
 59     [<ffffffff81812584>] dump_stack+0x4b/0x63
 60     [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
 61     [<ffffffff8127f568>] seq_read+0x108/0x3e0
 62     [<ffffffff812cce62>] proc_reg_read+0x42/0x70
 63     [<ffffffff81256197>] __vfs_read+0x37/0x100
 64     [<ffffffff81256b16>] vfs_read+0x86/0x130
 65     [<ffffffff81257898>] SyS_read+0x58/0xd0
 66     [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
 67
 68   It correctly shows that the caller of cmdline_proc_show() is
 69   seq_read().
 70
 71   If we remove the frame pointer logic from cmdline_proc_show() by
 72   replacing the frame pointer related instructions with nops, here's
 73   what it looks like instead:
 74
 75     [<ffffffff81812584>] dump_stack+0x4b/0x63
 76     [<ffffffff812d6dc2>] cmdline_proc_show+0x12/0x30
 77     [<ffffffff812cce62>] proc_reg_read+0x42/0x70
 78     [<ffffffff81256197>] __vfs_read+0x37/0x100
 79     [<ffffffff81256b16>] vfs_read+0x86/0x130
 80     [<ffffffff81257898>] SyS_read+0x58/0xd0
 81     [<ffffffff8181c1f2>] entry_SYSCALL_64_fastpath+0x12/0x76
 82
 83   Notice that cmdline_proc_show()'s caller, seq_read(), has been
 84   skipped.  Instead the stack trace seems to show that
 85   cmdline_proc_show() was called by proc_reg_read().
 86
 87   The benefit of objtool here is that because it ensures that *all*
 88   functions honor CONFIG_FRAME_POINTER, no functions will ever[*] be
 89   skipped on a stack trace.
 90
 91   [*] unless an interrupt or exception has occurred at the very
 92       beginning of a function before the stack frame has been created,
 93       or at the very end of the function after the stack frame has been
 94       destroyed.  This is an inherent limitation of frame pointers.
 95
 96b) 100% reliable stack traces for DWARF enabled kernels
 97
 98   (NOTE: This is not yet implemented)
 99
100   As an alternative to frame pointers, DWARF Call Frame Information
101   (CFI) metadata can be used to walk the stack.  Unlike frame pointers,
102   CFI metadata is out of band.  So it doesn't affect runtime
103   performance and it can be reliable even when interrupts or exceptions
104   are involved.
105
106   For C code, gcc automatically generates DWARF CFI metadata.  But for
107   asm code, generating CFI is a tedious manual approach which requires
108   manually placed .cfi assembler macros to be scattered throughout the
109   code.  It's clumsy and very easy to get wrong, and it makes the real
110   code harder to read.
111
112   Stacktool will improve this situation in several ways.  For code
113   which already has CFI annotations, it will validate them.  For code
114   which doesn't have CFI annotations, it will generate them.  So an
115   architecture can opt to strip out all the manual .cfi annotations
116   from their asm code and have objtool generate them instead.
117
118   We might also add a runtime stack validation debug option where we
119   periodically walk the stack from schedule() and/or an NMI to ensure
120   that the stack metadata is sane and that we reach the bottom of the
121   stack.
122
123   So the benefit of objtool here will be that external tooling should
124   always show perfect stack traces.  And the same will be true for
125   kernel warning/oops traces if the architecture has a runtime DWARF
126   unwinder.
127
128c) Higher live patching compatibility rate
129
130   (NOTE: This is not yet implemented)
131
132   Currently with CONFIG_LIVEPATCH there's a basic live patching
133   framework which is safe for roughly 85-90% of "security" fixes.  But
134   patches can't have complex features like function dependency or
135   prototype changes, or data structure changes.
136
137   There's a strong need to support patches which have the more complex
138   features so that the patch compatibility rate for security fixes can
139   eventually approach something resembling 100%.  To achieve that, a
140   "consistency model" is needed, which allows tasks to be safely
141   transitioned from an unpatched state to a patched state.
142
143   One of the key requirements of the currently proposed livepatch
144   consistency model [*] is that it needs to walk the stack of each
145   sleeping task to determine if it can be transitioned to the patched
146   state.  If objtool can ensure that stack traces are reliable, this
147   consistency model can be used and the live patching compatibility
148   rate can be improved significantly.
149
150   [*] https://lkml.kernel.org/r/cover.1423499826.git.jpoimboe@redhat.com
151
152
153Rules
154-----
155
156To achieve the validation, objtool enforces the following rules:
157
1581. Each callable function must be annotated as such with the ELF
159   function type.  In asm code, this is typically done using the
160   ENTRY/ENDPROC macros.  If objtool finds a return instruction
161   outside of a function, it flags an error since that usually indicates
162   callable code which should be annotated accordingly.
163
164   This rule is needed so that objtool can properly identify each
165   callable function in order to analyze its stack metadata.
166
1672. Conversely, each section of code which is *not* callable should *not*
168   be annotated as an ELF function.  The ENDPROC macro shouldn't be used
169   in this case.
170
171   This rule is needed so that objtool can ignore non-callable code.
172   Such code doesn't have to follow any of the other rules.
173
1743. Each callable function which calls another function must have the
175   correct frame pointer logic, if required by CONFIG_FRAME_POINTER or
176   the architecture's back chain rules.  This can by done in asm code
177   with the FRAME_BEGIN/FRAME_END macros.
178
179   This rule ensures that frame pointer based stack traces will work as
180   designed.  If function A doesn't create a stack frame before calling
181   function B, the _caller_ of function A will be skipped on the stack
182   trace.
183
1844. Dynamic jumps and jumps to undefined symbols are only allowed if:
185
186   a) the jump is part of a switch statement; or
187
188   b) the jump matches sibling call semantics and the frame pointer has
189      the same value it had on function entry.
190
191   This rule is needed so that objtool can reliably analyze all of a
192   function's code paths.  If a function jumps to code in another file,
193   and it's not a sibling call, objtool has no way to follow the jump
194   because it only analyzes a single file at a time.
195
1965. A callable function may not execute kernel entry/exit instructions.
197   The only code which needs such instructions is kernel entry code,
198   which shouldn't be be in callable functions anyway.
199
200   This rule is just a sanity check to ensure that callable functions
201   return normally.
202
203
204Errors in .S files
205------------------
206
207If you're getting an error in a compiled .S file which you don't
208understand, first make sure that the affected code follows the above
209rules.
210
211Here are some examples of common warnings reported by objtool, what
212they mean, and suggestions for how to fix them.
213
214
2151. asm_file.o: warning: objtool: func()+0x128: call without frame pointer save/setup
216
217   The func() function made a function call without first saving and/or
218   updating the frame pointer.
219
220   If func() is indeed a callable function, add proper frame pointer
221   logic using the FRAME_BEGIN and FRAME_END macros.  Otherwise, remove
222   its ELF function annotation by changing ENDPROC to END.
223
224   If you're getting this error in a .c file, see the "Errors in .c
225   files" section.
226
227
2282. asm_file.o: warning: objtool: .text+0x53: return instruction outside of a callable function
229
230   A return instruction was detected, but objtool couldn't find a way
231   for a callable function to reach the instruction.
232
233   If the return instruction is inside (or reachable from) a callable
234   function, the function needs to be annotated with the ENTRY/ENDPROC
235   macros.
236
237   If you _really_ need a return instruction outside of a function, and
238   are 100% sure that it won't affect stack traces, you can tell
239   objtool to ignore it.  See the "Adding exceptions" section below.
240
241
2423. asm_file.o: warning: objtool: func()+0x9: function has unreachable instruction
243
244   The instruction lives inside of a callable function, but there's no
245   possible control flow path from the beginning of the function to the
246   instruction.
247
248   If the instruction is actually needed, and it's actually in a
249   callable function, ensure that its function is properly annotated
250   with ENTRY/ENDPROC.
251
252   If it's not actually in a callable function (e.g. kernel entry code),
253   change ENDPROC to END.
254
255
2564. asm_file.o: warning: objtool: func(): can't find starting instruction
257   or
258   asm_file.o: warning: objtool: func()+0x11dd: can't decode instruction
259
260   Did you put data in a text section?  If so, that can confuse
261   objtool's instruction decoder.  Move the data to a more appropriate
262   section like .data or .rodata.
263
264
2655. asm_file.o: warning: objtool: func()+0x6: kernel entry/exit from callable instruction
266
267   This is a kernel entry/exit instruction like sysenter or sysret.
268   Such instructions aren't allowed in a callable function, and are most
269   likely part of the kernel entry code.
270
271   If the instruction isn't actually in a callable function, change
272   ENDPROC to END.
273
274
2756. asm_file.o: warning: objtool: func()+0x26: sibling call from callable instruction with changed frame pointer
276
277   This is a dynamic jump or a jump to an undefined symbol.  Stacktool
278   assumed it's a sibling call and detected that the frame pointer
279   wasn't first restored to its original state.
280
281   If it's not really a sibling call, you may need to move the
282   destination code to the local file.
283
284   If the instruction is not actually in a callable function (e.g.
285   kernel entry code), change ENDPROC to END.
286
287
2887. asm_file: warning: objtool: func()+0x5c: frame pointer state mismatch
289
290   The instruction's frame pointer state is inconsistent, depending on
291   which execution path was taken to reach the instruction.
292
293   Make sure the function pushes and sets up the frame pointer (for
294   x86_64, this means rbp) at the beginning of the function and pops it
295   at the end of the function.  Also make sure that no other code in the
296   function touches the frame pointer.
297
298
299Errors in .c files
300------------------
301
3021. c_file.o: warning: objtool: funcA() falls through to next function funcB()
303
304   This means that funcA() doesn't end with a return instruction or an
305   unconditional jump, and that objtool has determined that the function
306   can fall through into the next function.  There could be different
307   reasons for this:
308
309   1) funcA()'s last instruction is a call to a "noreturn" function like
310      panic().  In this case the noreturn function needs to be added to
311      objtool's hard-coded global_noreturns array.  Feel free to bug the
312      objtool maintainer, or you can submit a patch.
313
314   2) funcA() uses the unreachable() annotation in a section of code
315      that is actually reachable.
316
317   3) If funcA() calls an inline function, the object code for funcA()
318      might be corrupt due to a gcc bug.  For more details, see:
319      https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70646
320
3212. If you're getting any other objtool error in a compiled .c file, it
322   may be because the file uses an asm() statement which has a "call"
323   instruction.  An asm() statement with a call instruction must declare
324   the use of the stack pointer in its output operand.  For example, on
325   x86_64:
326
327     register void *__sp asm("rsp");
328     asm volatile("call func" : "+r" (__sp));
329
330   Otherwise the stack frame may not get created before the call.
331
3323. Another possible cause for errors in C code is if the Makefile removes
333   -fno-omit-frame-pointer or adds -fomit-frame-pointer to the gcc options.
334
335Also see the above section for .S file errors for more information what
336the individual error messages mean.
337
338If the error doesn't seem to make sense, it could be a bug in objtool.
339Feel free to ask the objtool maintainer for help.
340
341
342Adding exceptions
343-----------------
344
345If you _really_ need objtool to ignore something, and are 100% sure
346that it won't affect kernel stack traces, you can tell objtool to
347ignore it:
348
349- To skip validation of a function, use the STACK_FRAME_NON_STANDARD
350  macro.
351
352- To skip validation of a file, add
353
354    OBJECT_FILES_NON_STANDARD_filename.o := n
355
356  to the Makefile.
357
358- To skip validation of a directory, add
359
360    OBJECT_FILES_NON_STANDARD := y
361
362  to the Makefile.