MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

15 stars 25 forks 15 watchers Python BSD 2-Clause "Simplified" License
python
10 Open Issues Need Help Last updated: Aug 28, 2025

Open Issues Need Help

View All on GitHub
enhancement good first issue medium Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python
good first issue medium Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Implement NLP model failure analysis in the `failprint` MLOps tool. This involves creating a text preprocessing pipeline for embedding generation, implementing embedding-based clustering for similar failure patterns, adding support for transformer model outputs, integrating with popular NLP libraries (transformers, spaCy, sentence-transformers), adding text-specific drift detection, and creating NLP-focused failure segmentation (by text length, complexity, domain, etc.). The goal is to extend the tool's capabilities to handle unstructured text data and provide root cause analysis for NLP model failures.

Complexity: 4/5
enhancement good first issue medium Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Implement a command-line interface (CLI) for the `failprint` Python package using the `argparse` module. The CLI should allow users to specify input data files (features, true labels, predicted labels), output format, and optionally clustering, via command-line arguments. The CLI should then call the existing `analyze()` function and either print the report to the terminal or write it to a file.

Complexity: 3/5
enhancement good first issue medium Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Resolve a license conflict in the failprint project by choosing a single license (either BSD-2-Clause or MIT) and updating the `pyproject.toml`, `README.md`, and `LICENSE` files to reflect the chosen license consistently.

Complexity: 2/5
good first issue Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Improve the `failprint` tool's segmentation logic to handle continuous numerical features more effectively. The current implementation creates a segment for each unique value, leading to excessive noise. The task involves implementing automatic binning (using `pd.cut` or `pd.qcut`) for numerical columns with high cardinality, allowing the analysis to identify problematic ranges instead of individual values.

Complexity: 3/5
enhancement good first issue core-logic

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Add a License section to the failprint project's README file, specifying the license type (which is already present in a separate license file).

Complexity: 1/5
documentation good first issue easy Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Enhance the documentation for the `etsi-failprint` project by creating a comprehensive getting started guide (docs/getting_started.md), updating the README with links to examples, and adding code examples and explanations. The goal is to make the project more accessible to beginners, including clear instructions for setup, running tests, interpreting reports, and troubleshooting common errors.

Complexity: 3/5
documentation good first issue easy Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python

AI Summary: Integrate a Streamlit dashboard into the failprint MLOps tool to visualize failure patterns from ML model predictions. This involves creating an interactive UI allowing users to upload data, explore misclassified clusters, view feature distributions, and analyze class imbalance and drift using charts and plots (e.g., Plotly or Matplotlib). The existing `analyze()` function's logic should be reused.

Complexity: 4/5
enhancement good first issue medium Gssoc'25

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python
#python