etsi-ai/etsi-failprint

AI Summary: Implement NLP model failure analysis in the `failprint` MLOps tool. This involves creating a text preprocessing pipeline for embedding generation, implementing embedding-based clustering for similar failure patterns, adding support for transformer model outputs, integrating with popular NLP libraries (transformers, spaCy, sentence-transformers), adding text-specific drift detection, and creating NLP-focused failure segmentation (by text length, complexity, domain, etc.). The goal is to extend the tool's capabilities to handle unstructured text data and provide root cause analysis for NLP model failures.

Complexity: 4/5

enhancement good first issue medium Gssoc'25

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

Add a CLI (Command Line Interface) for failprint to analyze failures via terminal commands 6 months ago

AI Summary: Implement a command-line interface (CLI) for the `failprint` Python package using the `argparse` module. The CLI should allow users to specify input data files (features, true labels, predicted labels), output format, and optionally clustering, via command-line arguments. The CLI should then call the existing `analyze()` function and either print the report to the terminal or write it to a file.

Complexity: 3/5

enhancement good first issue medium Gssoc'25

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

[BUG] Resolve critical license conflict between pyproject.toml and documentation 6 months ago

AI Summary: Resolve a license conflict in the failprint project by choosing a single license (either BSD-2-Clause or MIT) and updating the `pyproject.toml`, `README.md`, and `LICENSE` files to reflect the chosen license consistently.

Complexity: 2/5

good first issue Gssoc'25

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

[ENHANCEMENT] Improve core segmentation logic with automatic binning for numerical features 6 months ago

AI Summary: Improve the `failprint` tool's segmentation logic to handle continuous numerical features more effectively. The current implementation creates a segment for each unique value, leading to excessive noise. The task involves implementing automatic binning (using `pd.cut` or `pd.qcut`) for numerical columns with high cardinality, allowing the analysis to identify problematic ranges instead of individual values.

Complexity: 3/5

enhancement good first issue core-logic

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

Add License Section in README 7 months ago

AI Summary: Add a License section to the failprint project's README file, specifying the license type (which is already present in a separate license file).

Complexity: 1/5

documentation good first issue easy Gssoc'25

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

Expanded documentation and tutorials 7 months ago

AI Summary: Enhance the documentation for the `etsi-failprint` project by creating a comprehensive getting started guide (docs/getting_started.md), updating the README with links to examples, and adding code examples and explanations. The goal is to make the project more accessible to beginners, including clear instructions for setup, running tests, interpreting reports, and troubleshooting common errors.

Complexity: 3/5

documentation good first issue easy Gssoc'25

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

Add Interactive Dashboard for Failure Pattern Visualization (Streamlit Integration) 7 months ago

AI Summary: Integrate a Streamlit dashboard into the failprint MLOps tool to visualize failure patterns from ML model predictions. This involves creating an interactive UI allowing users to upload data, explore misclassified clusters, view feature distributions, and analyze class imbalance and drift using charts and plots (e.g., Plotly or Matplotlib). The existing `analyze()` function's logic should be reused.

Complexity: 4/5

enhancement good first issue medium Gssoc'25

etsi-ai/etsi-failprint

15

MLOps-first diagnostic tool for automatic root cause analysis of ML model failures.

Python

#python

Open Issues Need Help