The following proactive strategies can be adopted to address the problem of delayed thesaurus updates relying on community contributions:
- Establishment of an automated local update mechanism: Automatically pull the latest commits of a project via a Git Hook or a timed task (e.g. daily early morning) and trigger the service to reload the thesaurus.
- Constructing a Business Supplement Thesaurus: in
sensitive-lexicon.txt
Based on this, a separate business sensitive words file is maintained and the two types of lexicons are combined. - Participation in community contributions: Encourage teams to submit Pull Requests directly when they find new sensitive words to accelerate the official thesaurus iterations.
It is also recommended to regularly compare the local and master thesaurus difference records to avoid missing critical updates.
This answer comes from the articleSensitive-lexicon: a continuously updated thesaurus of Chinese sensitive wordsThe