Data Caching and Lifecycle Management
SiteMCP has designed a complete cache management solution to balance data freshness with access efficiency:
- storage strategy: The default is to use
~/.cache/sitemcpdirectory, support for the use of the--cache-dirCustomized Paths - Cache Control: Provided
--no-cacheOption to disable caching for ad hoc data needs - Updating mechanism: Repeated crawling of the same URL triggers an automatic cache update to ensure data currency.
Cached files are stored using gzip compression, which takes up an average of only 15-30KB of space per page. For large document sites (e.g., Vite official documents), it is recommended to periodically perform arm -rf ~/.cache/sitemcpCleaning up outdated data. This system allows the AI assistant to quickly access historical data while maintaining access to new content.
This answer comes from the articleSiteMCP: Crawling website content and turning it into MCP servicesThe































