codespell
Spellchecker for source code.
TLDR
Check for typos in all text files in the current directory, recursively
Correct all typos found in-place
Skip files with names that match the specified pattern (accepts a comma-separated list of patterns using wildcards)
Use a custom dictionary file when checking (--dictionary can be used multiple times)
Do not check words that are listed in the specified file
Do not check the specified words
Print 3 lines of context around, before or after each match
Check file names for typos, in addition to file contents
SYNOPSIS
codespell [OPTIONS] [file1 file2 ... fileN]
DESCRIPTION
codespell is designed to find and fix common misspellings in text files. It is designed primarily for checking misspelled words in source code, but it can be used with other files as well.
usage: codespell [-h] [--version] [-d] [-c] [-w] [-D DICTIONARY]
[--builtin BUILTIN-LIST] [--ignore-regex IGNORE_REGEX] [-I FILE] [-L WORDS] [-r REGEX] [-s] [--count] [-S SKIP] [-x FILE] [-i INTERACTIVE] [-q QUIET_LEVEL] [-e] [-f] [-H] [-A LINES] [-B LINES] [-C LINES] [--config CONFIG] [files ...]
positional arguments:
- files
-
files or directories to check
optional arguments:
- -h, --help
-
show this help message and exit
- --version
-
show program's version number and exit
- -d, --disable-colors
-
disable colors, even when printing to terminal (always set for Windows)
- -c, --enable-colors
-
enable colors, even when not printing to terminal
- -w, --write-changes
-
write changes in place if possible
- -D DICTIONARY, --dictionary DICTIONARY
-
custom dictionary file that contains spelling corrections. If this flag is not specified or equals "-" then the default dictionary is used. This option can be specified multiple times.
- --builtin BUILTIN-LIST
-
comma-separated list of builtin dictionaries to include (when "-D -" or no "-D" is passed). Current options are: - 'clear' for unambiguous errors - 'rare' for rare but valid words - 'informal' for informal words - 'usage' for recommended terms - 'code' for words common to code and/or mathematics - 'names' for valid proper names that might be typos - 'en-GB_to_en-US' for corrections from en-GB to en-US The default is 'clear,rare'.
- --ignore-regex IGNORE_REGEX
-
regular expression which is used to find patterns to ignore by treating as whitespace. When writing regexes, consider ensuring there are boundary non-word chars, e.g., "\Wmatch\W". Defaults to empty/disabled.
- -I FILE, --ignore-words FILE
-
file that contains words which will be ignored by codespell. File must contain 1 word per line. Words are case sensitive based on how they are written in the dictionary file
- -L WORDS, --ignore-words-list WORDS
-
comma separated list of words to be ignored by codespell. Words are case sensitive based on how they are written in the dictionary file
- -r REGEX, --regex REGEX
-
regular expression which is used to find words. By default any alphanumeric character, the underscore, the hyphen, and the apostrophe is used to build words. This option cannot be specified together with --writechanges.
- -s, --summary
-
print summary of fixes
- --count
-
print the number of errors as the last line of stderr
- -S SKIP, --skip SKIP
-
comma-separated list of files to skip. It accepts globs as well. E.g.: if you want codespell to skip .eps and .txt files, you'd give "*.eps,*.txt" to this option.
- -x FILE, --exclude-file FILE
-
FILE with lines that should not be checked for errors or changed
- -i INTERACTIVE, --interactive INTERACTIVE
-
set interactive mode when writing changes: - 0: no interactivity. - 1: ask for confirmation. - 2: ask user to choose one fix when more than one is
- available.
-
- 3: both 1 and 2
- -q QUIET_LEVEL, --quiet-level QUIET_LEVEL
-
bitmask that allows suppressing messages: - 0: print all messages. - 1: disable warnings about wrong encoding. - 2: disable warnings about binary files. - 4: omit warnings about automatic fixes that were
- disabled in the dictionary.
-
- 8: don't print anything for non-automatic fixes. - 16: don't print the list of fixed files. As usual with bitmasks, these levels can be combined; e.g. use 3 for levels 1+2, 7 for 1+2+4, 23 for 1+2+4+16, etc. The default mask is 2.
- -e, --hard-encoding-detection
-
use chardet to detect the encoding of each file. This can slow down codespell, but is more reliable in detecting encodings other than utf-8, iso8859-1, and ascii.
- -f, --check-filenames
-
check file names as well
- -H, --check-hidden
-
check hidden files and directories (those starting with ".") as well.
- -A LINES, --after-context LINES
-
print LINES of trailing context
- -B LINES, --before-context LINES
-
print LINES of leading context
- -C LINES, --context LINES
-
print LINES of surrounding context
- --config CONFIG
-
path to config file.
SEE ALSO
https://github.com/codespell-project/codespell
AUTHOR
Lucas De Marchi <lucas.de.marchi@gmail.com>