enca
TLDR
Detect encoding of file
SYNOPSIS
enca [options] [files...]
DESCRIPTION
enca (Extremely Naive Charset Analyser) detects character encodings of text files using language-based heuristics. It can identify various encodings including legacy charsets for Central/Eastern European languages.
The tool works best with language hints, as many encodings are ambiguous without context. It can also convert files between encodings.
enca is useful for handling files with unknown or legacy encodings, particularly for Slavic and other non-Western European languages.
PARAMETERS
FILES
Files to analyze.-L LANGUAGE
Hint language for detection.-x ENCODING
Convert to specified encoding.-d
Show detailed detection info.-g, --guess
Output best guess only.-i, --info
Show available encodings.--help
Display help information.
CAVEATS
Detection is heuristic, not deterministic. Short files may be ambiguous. Works best with specific language hints. Some encodings indistinguishable.
HISTORY
enca was developed for handling the encoding diversity in Central/Eastern European computing, where many incompatible character sets were historically used for the same languages.


