Test 03: Cleaners
Purpose
Verify that all input signals have appropriate data cleaning rules applied before model training and testing. This ensures data quality and consistency throughout the pipeline.
Required Inputs
From configs/env.sh:
WORK_DIR: Output directory for resultsMODEL_PATH: Path to the modelREPOSITORY_PATH: Path to the data repository
How to Run
From your TestKit folder, execute:
What This Test Does
- Checks that every input signal has a defined cleaner rule
- Flags any signals missing cleaner definitions
- Fails the test if any required cleaner is missing
Output Location
- Main log:
WORK_DIR/03.test_cleaners.log(lists signals missing cleaners)
How to Interpret Results
- If the log lists missing cleaners, update your configuration to define them
- The test passes only if all signals have cleaner rules