chromium/docs/linux/debugging_minidump.md

# Linux Minidump Code Lab

[TOC]

## About Minidumps

Minidump is a file format for storing parts of a program's state for later
inspection. [Microsoft's
Documentation](https://docs.microsoft.com/en-us/windows/desktop/api/minidumpapiset/)
defines the format though the [Rust
Documentation](https://docs.rs/minidump/latest/minidump/format/index.html)
is sometimes easier to navigate. The minidump implementation and tools used by
Chrome are
[Breakpad](https://chromium.googlesource.com/breakpad/breakpad/+/refs/heads/main/README.md)
and
[Crashpad](https://chromium.googlesource.com/crashpad/crashpad/+/main/README.md).
However, the tools of interest here are from the Breakpad project.

## Create a Minidump

When Chrome crashes it writes out a minidump file. The minidump file is written
under the application product directory. On Linux this is
`<XDG_CONFIG_HOME>/<app-name>/Crash Reports`. The default for `XDG_CONFIG_HOME`
is `~.config`. Common `<app-name>`s are `chromium`, `google-chrome`,
`google-chrome-beta`, and `google-chrome-unstable`. A typical example is
`~.config/google-chrome/Crash Reports`. When a minidump is uploaded it will be
moved between the `new`, `pending`, and `completed` subdirectories. The minidump
file is named something like `<uuid>.dmp`. If the minidump is uploaded to the
crash reporting system, the `<uuid>.meta` file will contain the crash report id.
Those with access can find the uploaded report at `go/crash/<report-id>`, where
the minidump file will be available with a name like
`upload_file_minidump-<report-id>.dmp`.

To create a minidump, you can use a local build of Chromium or a release
version of Chrome. Run the browser with the environment variable
`CHROME_HEADLESS=1`, which enables crash dumping but prevents crash dumps from
being uploaded and deleted. Something like `$ env CHROME_HEADLESS=1
./out/debug/chrome-wrapper` or `$ env CHROME_HEADLESS=1
/opt/google/chrome/google-chrome`. Navigate to `chrome://crash` to trigger a
crash in the renderer process or reproduce your current crash bug. A crash dump
file should appear in the `Crash Reports` directory.

## Inspect the Minidump

To get an idea about what is in a minidump file, install the Okteta hex editor
and add the [Minidump Structure
Definition](https://github.com/bungeman/structures/tree/main/okteta-minidump).
Open the minidump previously created and explore the information it contains.

One quirk to notice is that there is a `ThreadListStream` which contains
`MINIDUMP_THREAD`s which contain a `MINIDUMP_THREAD_CONTEXT` and an
`ExceptionStream` which also contains a `MINIDUMP_THREAD_CONTEXT`. The thread
list contains the thread contexts as they existed when the crash reporter was
running. The exception's thread context is the state of the crashing thread at
the time that it crashed, which is generally the most interesting thread
context. When using the Breakpad tools for Linux (like `minidump_stackwalk` and
`minidump-2-core`) the thread context from the exception record is used in place
of the thread context associated with the corresponding thread.

Each `MINIDUMP_THREAD` contains a `StackMemoryRva` which is a reference to to a
copy of the stack on that thread at the time the crash handler was running.
Parsing a stack usefully requires additional debug information.
`minidump_stackwalk` or a debugger may be used to parse the stack memory to
create a usable trace.

## Get the Tools

From a Chromium checkout `ninja -C out/release minidump-2-core
minidump_stackwalk dump_syms`. From a [Breakpad
checkout](https://chromium.googlesource.com/breakpad/breakpad/) `make`. It can
be useful to use Breakpad directly on machines where one does not already have
a Chromium checkout.

When working at this level, one will also want to have `readelf` and
`objdump` available, which are available from most distributions.

## Get Executables and Symbols

In addition to the minidump, you will need the exact executables of Chromium or
Chrome which produced the minidump and those executable's symbols. If the
minidump was created locally, you already have the executables. Symbols for
Google Chrome's official builds are available from
`https://edgedl.me.gvt1.com/chrome/linux/symbols/google-chrome-debug-info-linux64-${VERSION}.zip`
where `${VERSION}` is any version of Google Chrome that has recently been served
to Stable, Beta, or Unstable (Dev) channels on Linux, like `114.0.5696.0`. Those
with access can find both executables and symbols for unreleased builds at
`go/chrome-symbols`.

For symbols outside of Chrome (like when the crash is happening in a shared
library) then symbols for the files of interest must be found. If the minidump
was created locally then install the symbol packages from your distribution.  If
not, you will need to track down the exact symbol files, which can be an
interesting exercise. For some distributions using the
[debuginfod](https://sourceware.org/elfutils/Debuginfod.html) system can be
quite helpful.

To ensure the correct binaries and debug symbols are used, the minidump contains
the build-id for each loaded module in the `ModuleListStream` in the
`CvRecordRva`'s `Signature`. This build-id is matched against a note
section of type `NT_GNU_BUILD_ID`, usually named `.note.gnu.build-id` in the
executable and symbol files. This note can be inspected with `readelf -n
<file>` like `readelf -n chrome` or `readelf -n chrome.debug` and looking for
the `.note.gnu.build-id` section. `readelf` reports the `Build ID` as the flat
bytes in the note, but Breakpad binaries like `stackwalk_minidimp` and
`dump_syms` will report and expect this truncated to a formatted Type 2 GUID
(without dashes). This means `readelf` will output a `<build-id>` like
33221100554477668899AABBCCDDEEFFXXXXXXXX but crashpad binaries will expect and
report this as a `<build-uuid>` of 00112233445566778899AABBCCDDEEFF.

The `.gnu_debuglink` section states which debug symbol file to use with a
striped binary. For example `readelf --string-dump=.gnu_debuglink chrome`
produces `chrome.debug`. This can be helpful to know for libraries with
interesting debug symbol setup, like libc.so.6.

## Create Symbolized Stack

Given a minidump with the name `mini.dmp`

`minidump_stackwalk mini.dmp > mini.stackwalk.nosym`

This will produce a mostly unsymbolized summary of the crash. To symbolize, look
toward the bottom of the output for `WARNING: No symbols, <file>, <build-uuid>`.
For each `<file>` which is of interest, `mkdir -p symbols/<file>/<build-uuid>` then
`dump_syms <file> <directory-with-file.debug> >
symbols/<file>/<build-uuid>/<file>.sym`. Ensure this output `<file>.sym`
contains the expected `<build-uuid>`. Then re-run `minidump_stackwalk` but
with the symbols directory, like `minidump_stackwalk mini.dmp symbols/ >
mini.stackwalk`.

The output of `minidump_stackwalk` is often quite useful and enough to track
down many issues. However, it does not fully use all of the information from
DWARF, so it is possible sometimes to get much better stack traces from a full
debugger like gdb. This is particularly true when functions have been
aggressively inlined.

## Create Core File

`minidump-2-core mini.dmp > mini.core`

## Loading into GDB

This works best if the binaries, symbols, and core files are all in different
directories to prevent gdb from automatically loading them into the wrong
locations. This is also generally necessary when using a system installed
version of Chrome. For full details see [the gdb
manual](https://sourceware.org/gdb/onlinedocs/gdb/Separate-Debug-Files.html).
The easiest way is to rename and move the .debug files to a directory structure
like `<debugdir>/debug/.build-id/nn/nnnnnnnn.debug` where `nn` are the first
two hex characters of the `build-id`, and `nnnnnnnn` are the rest of the hex
characters of the `build-id`. Note that this `build-id` is exactly what is
reported by `readelf -n <binary> | grep "Build ID"` and not the `build-uuid`
used by Breakpad. Then in gdb use `show debug-file-directory` to get the
`<previous-directories>` and `set debug-file-directory
<previous-directories>:<debugdir>/debug`.

The `offset`s used here are the offsets of the corresponding module from the
output of `minidump_stackwalk` or (equivalently) the value of
`ModuleListStream::Modules[]::BaseOfImage` from the minidump file (which can be
read with the structure definition).

```
$ gdb
(gdb) file <executable>
(gdb) show debug-file-directory
<previous directories>
(gdb) set debug-file-directory <previous directories>:<debugdir>/debug
(gdb) symbol-file <executable> -o <executable-offset>
(gdb) core-file <mini.core>
```

Running the commands in this order avoids needing to load the symbols twice and
maps the `<executable>` to the expected location.

To add an additional shared library it is possible to
`(gdb) add-symbol-file <shared-library> -o <shared-library-offset>`

Source paths in Chrome builds are relative to the `out/<build>` directory. If you
have a Chromium checkout at or around when the Chrome build was created, it can
be added to the debugger search path, like

```
(gdb) directory <path-to-chromium>/chromium/src/out/<build>/
```