Data cleansing is a built-in feature of Data Formulator. The workflow has three steps:
Step 1: Data import
- Supports uploading structured data such as CSV and Excel files
- Can also photograph or upload images/PDFs containing forms directly
Step 2: AI Preprocessing
- Automatically recognizes data patterns (e.g., date and currency formats)
- Flags suspected outliers (highlighted in red)
- Generates a data quality report (with metrics such as the percentage of missing values)
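The kind of checks described in Step 2 can be illustrated with a minimal sketch. This is not Data Formulator's actual implementation: the function names `missing_percentages` and `flag_outliers` are hypothetical, and the interquartile-range (IQR) rule is just one common way to flag suspected outliers, assumed here for illustration.

```python
import statistics


def missing_percentages(rows, columns):
    """Return the percentage of missing (None or empty-string) values per column."""
    total = len(rows)
    report = {}
    for col in columns:
        missing = sum(1 for row in rows if row.get(col) in (None, ""))
        report[col] = 100.0 * missing / total if total else 0.0
    return report


def flag_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR].

    Assumption: the IQR rule stands in for whatever detection the tool uses.
    None entries are skipped; fewer than 4 numeric values yields no flags.
    """
    numeric = [v for v in values if v is not None]
    if len(numeric) < 4:
        return []
    q1, _, q3 = statistics.quantiles(numeric, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [
        i for i, v in enumerate(values)
        if v is not None and (v < low or v > high)
    ]
```

For example, `flag_outliers([10, 12, 11, 13, 12, 500])` would flag the last value, which a GUI could then highlight for review.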
Step 3: Interactive corrections
- Fix problems with natural language commands (e.g., "standardize all dates to YYYY-MM-DD format")
- Or drag and drop directly in the GUI to adjust the data distribution
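To make the example command concrete, here is a minimal sketch of what "standardize all dates to YYYY-MM-DD format" might amount to once it is turned into code. The function `standardize_date` and the `CANDIDATE_FORMATS` list are assumptions for illustration, not the code the tool generates. Ambiguous layouts such as 03/05/2024 are deliberately left out, because they cannot be resolved without knowing the source locale.

```python
from datetime import datetime

# Hypothetical list of input layouts to try, in order.
CANDIDATE_FORMATS = ("%Y-%m-%d", "%Y/%m/%d", "%d.%m.%Y", "%B %d, %Y")


def standardize_date(text):
    """Return the date as YYYY-MM-DD, or None if no candidate format matches."""
    cleaned = text.strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(cleaned, fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue  # try the next layout
    return None
```

Returning `None` for unparseable values, rather than guessing, keeps those cells visible as problems that still need manual attention.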
Typical case: a retailer used the tool to process scanned paper reports. Work that originally required 2 days of manual entry and cleaning was shortened to 15 minutes, and the accuracy rate increased from 78% to 95%.
This answer comes from the article "Data Formulator: an AI-driven data visualization tool".































