Data Security Protection Program
WeClone has built-in multi-layer protection mechanisms. Users can take the following measures to strengthen protection further:
- Sensitive word filtering: Add custom keywords (e.g. bank card numbers, addresses) to blocked_words.json, which supports regular expression patterns. Example: "\d{4}-\d{4}-\d{4}" matches hyphen-separated groups of four digits, as found in credit card numbers. Inside the JSON file each backslash must be escaped as \\.
- Local processing: Always run the data preprocessing scripts in an offline environment. The airgap tool is recommended for creating an isolated environment. Installation command: uv pip install airgap
- Encrypted storage: Use sqlcipher to encrypt the CSV file exported by WeChat (password of at least 12 characters), and delete the original file immediately after decryption.
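The sensitive word filtering step can be illustrated with a minimal Python sketch. It is not WeClone's own implementation: the text does not give the schema of blocked_words.json, so the sketch assumes a flat JSON list of pattern strings, and the helper names load_patterns and redact and the "[REDACTED]" mask are hypothetical.

```python
import json
import re
from pathlib import Path


def load_patterns(path):
    """Compile every entry of the JSON file as a regular expression.

    Assumption: the file holds a flat list of strings, for example
    ["\\d{4}-\\d{4}-\\d{4}", "bank card"]. The real layout of WeClone's
    blocked_words.json may differ.
    """
    raw = json.loads(Path(path).read_text(encoding="utf-8"))
    return [re.compile(entry) for entry in raw]


def redact(text, patterns, mask="[REDACTED]"):
    """Replace each match of each pattern with the mask string."""
    for pattern in patterns:
        text = pattern.sub(mask, text)
    return text
```

A typical use would be to call load_patterns("blocked_words.json") once and then pass every chat message through redact before it reaches the training data. Plain keywords work as patterns too, as long as they contain no regex metacharacters.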
Enterprise users can also: 1) enable Docker container isolation, 2) add differential privacy noise, and 3) implement permission-based access control for the model. It is also recommended to review the access logs in data/audit_log.txt periodically.
This answer comes from the article "WeClone: training digital doppelgangers with WeChat chats and voices".