Open Issues Need Help
AI Summary: Document the hyperparameters used in the `ritme` project for both model training and feature engineering. This involves describing each hyperparameter's meaning and purpose, drawing information from the provided `run_config_whparams.json` file and expanding to include undocumented feature engineering options like `data_selection` and `data_aggregation`.
Target-driven optimization of feature representation and model selection for next-generation sequencing data
AI Summary: Expand the hyperparameter search space for the RandomForest model in the `ritme` project to be more comparable to the existing XGBoost search space, specifically increasing the range of `n_estimators`. This involves modifying the `model_hyperparameters` section within the JSON configuration file used by `ritme`'s model selection process.
AI Summary: Expand the hyperparameter search space for the RandomForest model within the `ritme` framework to better handle low-variance target variables (e.g., log-transformed data), improving the model's ability to predict beyond the mean in such scenarios. This involves modifying the existing configuration file or code to include a wider range of hyperparameter values for RandomForest.
AI Summary: Modify the `ritme` software to use a more unique prefix (other than "F") to identify microbial features in the input data, preventing conflicts with metadata columns that might start with "F". This involves changing the code that parses column names to identify features.
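To illustrate the prefix conflict described in the last issue, here is a minimal Python sketch. It is not taken from the `ritme` code base: the helper functions, the column names, and the replacement prefix `feat__` are all hypothetical and chosen only to show why a single-letter prefix is ambiguous.

```python
def select_feature_columns(columns, prefix):
    """Return the column names that start with the given prefix."""
    return [name for name in columns if name.startswith(prefix)]


def rename_prefix(columns, feature_names, old_prefix, new_prefix):
    """Swap old_prefix for new_prefix, but only on known feature columns."""
    features = set(feature_names)
    return [
        new_prefix + name[len(old_prefix):] if name in features else name
        for name in columns
    ]


# Hypothetical input: three microbial features plus metadata columns,
# two of which happen to start with "F".
columns = ["F0", "F1", "F2", "Farm_id", "Fiber_intake", "age"]
true_features = ["F0", "F1", "F2"]

# Selecting on "F" also picks up the metadata columns.
with_short_prefix = select_feature_columns(columns, "F")

# Assumed fix: rename features to a longer prefix ("feat__" is illustrative).
renamed = rename_prefix(columns, true_features, "F", "feat__")
with_long_prefix = select_feature_columns(renamed, "feat__")

print(with_short_prefix)
print(with_long_prefix)
```

With the short prefix, `Farm_id` and `Fiber_intake` are selected alongside the real features; with the longer prefix, only the three renamed feature columns match. Any prefix unlikely to occur in metadata column names would serve the same purpose.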