intel_error_decode
Decode Intel machine check errors
SYNOPSIS
intel_error_decode [-c configfile] [-f filename] [-V]
PARAMETERS
-c configfile
Specifies an alternate configuration file instead of the default /etc/mcelog/mcelog.conf. This allows for customized error decoding based on different hardware configurations or error reporting requirements.
-f filename
Reads machine check events from the specified file instead of the default /dev/mcelog. Useful for analyzing logs or historical error data.
-V
Displays the version number of the intel_error_decode tool and exits.
DESCRIPTION
The `intel_error_decode` command is a Linux utility used to decode Machine Check Architecture (MCA) error records from Intel processors. These error records, often logged when a system encounters hardware issues like memory errors, cache failures, or bus problems, are typically represented in a hexadecimal or binary format that's difficult for humans to interpret. This tool translates these raw error codes into a more understandable form, providing details about the type of error, the component involved, and potentially even the location of the fault. This helps system administrators and developers diagnose and troubleshoot hardware problems, leading to faster problem resolution and improved system stability. It's an invaluable tool for anyone dealing with server systems or high-performance computing environments where hardware reliability is critical. The command typically analyzes the /dev/mcelog device, which receives machine check events from the kernel. Proper decoding requires that the system have the `mcelog` package installed and configured.
CAVEATS
The effectiveness of `intel_error_decode` relies on an up-to-date `mcelog` configuration file. Incorrect or outdated configuration files can lead to inaccurate or incomplete error decoding. It is processor architecture dependent and supports intel processors.
REQUIREMENTS
The command requires the mcelog package to be installed to properly collect and decode hardware errors. The mcelog service must be running to properly log these errors to /dev/mcelog.
USAGE EXAMPLE
To decode errors directly from the /dev/mcelog device: intel_error_decode
To decode errors from a log file: intel_error_decode -f /var/log/mcelog
SEE ALSO
mcelog(8)