5 Open Issues Need Help Last updated: Jul 11, 2025

Open Issues Need Help

Nettoyer les valeurs manquantes 3 months ago

AI Summary: Create a Python function `clean_data(df)` that cleans a Pandas DataFrame. For numerical columns, replace NaN values with the column's mean. For text columns ('name', 'education', 'city', 'notes'), replace NaN and empty strings with 'Unknown'.

Complexity: 2/5

good first issue

promodevAI/data-pipeline

Lire les données depuis raw_data.csv 3 months ago

AI Summary: Create a Python function `load_data(path)` in `src/load.py` that uses pandas to load a CSV file (provided as DataSet.csv) and returns the data as a pandas DataFrame.

Complexity: 1/5

good first issue

promodevAI/data-pipeline

Créer un script principal main.py 3 months ago

AI Summary: Create a main.py script that orchestrates the execution of functions from other tickets within a data pipeline. This script should load, clean, transform data, and display the results, including a preview of the DataFrame using .head().

Complexity: 3/5

good first issue

promodevAI/data-pipeline

Compter les genres 3 months ago

AI Summary: Create a Python function `count_gender(df)` within the `src/summary.py` file that counts the number of rows in a Pandas DataFrame (df) where the 'gender' column is either 'Male' or 'Female'. The function should then print this count.

Complexity: 2/5

good first issue

promodevAI/data-pipeline

Afficher des statistiques 3 months ago

AI Summary: Create a Python function `show_stats(df)` within the `src/summary.py` file of the data-pipeline project. This function should take a Pandas DataFrame (df) as input and display descriptive statistics using the `.describe()` and `.info()` methods.

Complexity: 2/5

good first issue

promodevAI/data-pipeline