Skip to content

repro: --pull option stopped working since last Friday #10991

@eng-ts

Description

@eng-ts

Bug Report

Description

the --pull option is not working anymore since Friday. We observed on several project, when data are not present and must be downloaded (fresh remote instances). We got this message: ERROR: failed to reproduce '[...].parquet.dvc': missing data 'source': [...]

Solution: replacing the option by a previous dvc pull command resolves the issue (but download all the data).
Note: data is on AWS S3

Reproduce

I wish i could do that easily. Do you have a model for reproducible example with S3 based cache files?

Expected

--pull should download necessary files. it's not.

Environment information

Output of dvc doctor:

DVC version: 3.66.1 (pip)
-------------------------
Platform: Python 3.12.9 on Linux-6.17.0-14-generic-x86_64-with-glibc2.39
Subprojects:
        dvc_data = 3.18.2
        dvc_objects = 5.2.0
        dvc_render = 1.0.2
        dvc_task = 0.40.2
        scmrepo = 3.6.1
Supports:
        http (aiohttp = 3.13.3, aiohttp-retry = 2.9.1),
        https (aiohttp = 3.13.3, aiohttp-retry = 2.9.1),
        s3 (s3fs = 2026.1.0)
Config:
        Global: /home/pakite/.config/dvc
        System: /etc/xdg/dvc
Cache types: hardlink, symlink
Cache directory: ext4 on /dev/mapper/ubuntu--vg-ubuntu--lv
Caches: local
Remotes: s3
Workspace directory: ext4 on /dev/mapper/ubuntu--vg-ubuntu--lv
Repo: dvc, git
Repo.site_cache_dir: /var/tmp/dvc/repo/2b81ae817a1361d5858ada7fb57a7577

Additional Information (if any):

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions