No. The standout feature of WhiteLightning is that no real training data has to be prepared. The tool automatically generates synthetic training data, covering both regular samples and edge cases, using a large language model such as Grok-3-beta or GPT-4o-mini. By default it generates 50 samples per class, and the user can adjust that number via parameters. This zero-data approach significantly lowers the barrier to entry and is particularly well suited to rapid prototyping or to scenarios where data is hard to obtain.
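To make the idea concrete, here is a minimal Python sketch of per-class synthetic data generation. It is an illustration only, not WhiteLightning's actual code or API: the function names, the `edge_case_ratio` parameter, and the prompt wording are all assumptions, and `stub_llm` is a hypothetical stand-in for a real call to a model such as GPT-4o-mini. Only the default of 50 samples per class comes from the description above.

```python
from typing import Callable, Dict, List

DEFAULT_SAMPLES_PER_CLASS = 50  # default mentioned in the answer above


def stub_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM call; returns canned text."""
    return f"synthetic text for: {prompt}"


def build_synthetic_dataset(
    labels: List[str],
    samples_per_class: int = DEFAULT_SAMPLES_PER_CLASS,
    edge_case_ratio: float = 0.2,  # assumed share of edge cases, not from the source
    llm: Callable[[str], str] = stub_llm,
) -> List[Dict[str, str]]:
    """Ask the LLM for `samples_per_class` examples of every label."""
    dataset: List[Dict[str, str]] = []
    for label in labels:
        n_edge = int(samples_per_class * edge_case_ratio)
        for i in range(samples_per_class):
            # The first n_edge requests ask for edge cases, the rest for regular samples.
            kind = "edge case" if i < n_edge else "regular"
            prompt = f"Write one {kind} example of class '{label}' (#{i})"
            dataset.append({"text": llm(prompt), "label": label, "kind": kind})
    return dataset


if __name__ == "__main__":
    data = build_synthetic_dataset(["positive", "negative"])
    print(len(data), "samples generated")
```

Passing a different `samples_per_class` mirrors the adjustable sample count described above; swapping `stub_llm` for a real client function would be the only change needed to produce actual text.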
This answer comes from the article "WhiteLightning: an open source tool for generating lightweight offline text classification models in one click".