🦉 Data Versioning and ML Experiments

ai data-science data-version-control developer-tools machine-learning reproducibility unstructured-data
5 Open Issues Need Help Last updated: Sep 14, 2025

Open Issues Need Help

View All on GitHub
help wanted p1-important performance research

🦉 Data Versioning and ML Experiments

Python
#ai#data-science#data-version-control#developer-tools#machine-learning#reproducibility#unstructured-data

AI Summary: The `dvc exp show` command currently only processes the last `--drop` argument provided, preventing users from simultaneously removing multiple specified columns or patterns. The request is to enhance this functionality, allowing users to provide multiple `--drop` arguments on the command line, which would then all be applied to filter the experiment table output.

Complexity: 2/5
enhancement help wanted good first issue p3-nice-to-have A: experiments

🦉 Data Versioning and ML Experiments

Python
#ai#data-science#data-version-control#developer-tools#machine-learning#reproducibility#unstructured-data
help wanted p2-medium A: data-sync blocked

🦉 Data Versioning and ML Experiments

Python
#ai#data-science#data-version-control#developer-tools#machine-learning#reproducibility#unstructured-data
bug help wanted triage A: experiments

🦉 Data Versioning and ML Experiments

Python
#ai#data-science#data-version-control#developer-tools#machine-learning#reproducibility#unstructured-data

AI Summary: Enhance the `dvc data status` command by adding a `--remote` flag to specify the remote for the `--not-in-remote` check and an `-e` flag to return a non-zero exit code if data is missing from the specified remote. This improves the workflow for verifying data consistency in remote storage within Github Actions.

Complexity: 3/5
good first issue p3-nice-to-have A: cli A: status A: data-management

🦉 Data Versioning and ML Experiments

Python
#ai#data-science#data-version-control#developer-tools#machine-learning#reproducibility#unstructured-data