Bug Report
Description
the --pull option is not working anymore since Friday. We observed on several project, when data are not present and must be downloaded (fresh remote instances). We got this message: ERROR: failed to reproduce '[...].parquet.dvc': missing data 'source': [...]
Solution: replacing the option by a previous dvc pull command resolves the issue (but download all the data).
Note: data is on AWS S3
Reproduce
I wish i could do that easily. Do you have a model for reproducible example with S3 based cache files?
Expected
--pull should download necessary files. it's not.
Environment information
Output of dvc doctor:
DVC version: 3.66.1 (pip)
-------------------------
Platform: Python 3.12.9 on Linux-6.17.0-14-generic-x86_64-with-glibc2.39
Subprojects:
dvc_data = 3.18.2
dvc_objects = 5.2.0
dvc_render = 1.0.2
dvc_task = 0.40.2
scmrepo = 3.6.1
Supports:
http (aiohttp = 3.13.3, aiohttp-retry = 2.9.1),
https (aiohttp = 3.13.3, aiohttp-retry = 2.9.1),
s3 (s3fs = 2026.1.0)
Config:
Global: /home/pakite/.config/dvc
System: /etc/xdg/dvc
Cache types: hardlink, symlink
Cache directory: ext4 on /dev/mapper/ubuntu--vg-ubuntu--lv
Caches: local
Remotes: s3
Workspace directory: ext4 on /dev/mapper/ubuntu--vg-ubuntu--lv
Repo: dvc, git
Repo.site_cache_dir: /var/tmp/dvc/repo/2b81ae817a1361d5858ada7fb57a7577
Additional Information (if any):
Bug Report
Description
the
--pulloption is not working anymore since Friday. We observed on several project, when data are not present and must be downloaded (fresh remote instances). We got this message:ERROR: failed to reproduce '[...].parquet.dvc': missing data 'source': [...]Solution: replacing the option by a previous
dvc pullcommand resolves the issue (but download all the data).Note: data is on AWS S3
Reproduce
I wish i could do that easily. Do you have a model for reproducible example with S3 based cache files?
Expected
--pullshould download necessary files. it's not.Environment information
Output of
dvc doctor:Additional Information (if any):