Data cleaning and validation functions for names, languages, identifiers, etc.

1 Open Issue Need Help Last updated: Jul 20, 2025

Open Issues Need Help

View All on GitHub

AI Summary: The task involves analyzing a list of frequently occurring organization name tokens from the OpenSanctions dataset to identify potential symbol groups or group expansions for improved data cleaning and validation within the `rigour` Python library. This requires examining the tokens, categorizing them (descriptive, geographical, symbolic), and proposing additions to the library's existing symbol mappings.

Complexity: 4/5
help wanted

Data cleaning and validation functions for names, languages, identifiers, etc.

Python