Data validation
From data to informed decision information. The power of data validation.

Data validation
‘Data validation: mapping data quality with the aim of getting a grip on the data, its application and associated insights’
Witteveen+Bos is a reliable partner in the water sector. We provide engineering and consultancy services, including data validation. This enables our clients to get a grip on their data, its uses and associated insights.
Data validation is essential for managers to keep a grip on the quality of incoming measurement data. The Witteveen+Bos data validation toolbox Dataprofeet is widely used in the water sector, among others, for monitoring time series in water purification, surface water or groundwater. Thanks to the Dataprofeet toolbox, water managers can anticipate the quality and reliability of data. The toolbox validates data based on various parameters, providing informed decision information.
Validation service
The toolbox can run on-premise, is offered as a cloud service or can be run as a consultancy project, with data validation reports prepared by us at fixed intervals with recommendations for optimising the measurement system. Using various statistical tests, a model has been developed to enrich data with quality labels based on historical data and streaming data. The validations are flexible and can be configured based on the specific needs of each measurement site or measurement network.
Application
By enriching time series data from the source with validation labels, administrators are able to keep a grip on the entire data processing process. With data validation, they can:
- Select different versions of the dataset based on enriched data with quality labels for training and testing Machine Learning or AI models. This provides flexibility.
- Gain insight into data quality, based on quality labels. This allows administrators to always have a uniform insight into deviations from expected values. Every validation test has a description with an expected cause, which helps in resolving deviations. Additionally, quality labels provide insight into data availability and trends over time.
- Manage, for example in terms of sensor checks. Based on the quality labels, it is possible to assess whether a sensor is still working properly or requires maintenance.
- Making choices in process control: automatic control based on high-quality data enriched with quality labels.
Insight
If data validation is structurally applied to source data, data validation services (such as the Dataprofeet, ‘Data prohet’) offer added value for water managers. Routinely automated tests provide clarity through quality labels. This ensures transparency and overview of data quality. This method of implementation offers reproducibility of results and guarantees knowledge assurance and consistency in data management. Based on the result of data validation (a dataset enriched with quality labels), it is possible to be validation-driven instead of source-driven.
The Dataprofeet ensures that water managers can always rely on high-quality and reliable information they need for their decisions.
More information
