LinuxCommandLibrary

dvc-fetch

download tracked files to local cache

TLDR

Fetch all tracked data
$ dvc fetch
copy
Fetch specific file
$ dvc fetch [data.csv.dvc]
copy
Fetch from specific remote
$ dvc fetch -r [remote_name]
copy
Fetch all revisions
$ dvc fetch --all-commits
copy
Fetch with jobs
$ dvc fetch -j [4]
copy

SYNOPSIS

dvc fetch [options] [targets...]

DESCRIPTION

dvc fetch downloads DVC-tracked data from remote storage into the local DVC cache without affecting the workspace. This is useful for preparing data for later use without immediately populating the working directory.
The two-stage process (fetch then checkout) provides control over disk space and timing. Fetching brings data into cache (usually .dvc/cache) where it's stored efficiently with deduplication. A subsequent dvc checkout creates links or copies from cache to workspace when you actually need the files.
This separation is particularly valuable in CI/CD pipelines or when switching between branches: you can fetch all necessary data upfront, then quickly checkout different versions as needed without repeated downloads. The --all-commits, --all-branches, and --all-tags options help prefetch data for multiple revisions in bulk.

PARAMETERS

-r, --remote name

Remote storage name.
-j, --jobs number
Parallel download jobs.
--all-commits
Fetch for all Git commits.
--all-branches
Fetch for all branches.
--all-tags
Fetch for all tags.
-R, --recursive
Fetch recursively.

SEE ALSO

> TERMINAL_GEAR

Curated for the Linux community

Copied to clipboard