Contextual understanding and compliance requirements need to be observed when applying Sensitive-lexicon. Direct string matching may lead to misclassification, e.g. 'computer' is filtered because it contains 'computing'. Developers should adjust the thesaurus with business requirements and introduce NLP techniques to understand the context if necessary.
Legal compliance is critical and definitions of sensitive terms vary from region to region. It is important to confirm compliance with local regulations before use, especially for multinational business. The project explicitly advises users to assess the suitability of the thesaurus on their own, and does not assume any legal responsibility for its use. In practice, it is important to balance the strictness of the review with the user experience.
This answer comes from the articleSensitive-lexicon: a continuously updated thesaurus of Chinese sensitive wordsThe