CODE_DIR: Your ETL Workspace
The CODE_DIR is the primary location for all code specific to your ETL process. It typically contains two main Python files: a helper library for parsing, and load.py, the entry point for the entire loading process.

Key Directories and Files
- configs: This folder stores configurations specific to your ETL. While it might be empty, it commonly contains settings for unit and signal mapping.
  - rep.signals: This file is created for you as a template. You can override or add new signal definitions that are specific to your repository. New signals should use an ID of 3000 or greater. The file format is consistent with the global signal format.
  - map_units_stats.cfg: Used if you are performing signal mapping and unit conversions. This file contains the configuration for those conversions.
  - Additional configuration files can be added as needed. For more details, see unit_conversion.
- signal_processings: This folder contains the custom logic for processing different signals and data types.
  - XXXX.py: Each file here is a Python script with specific processing logic. For example, labs.py processes all signals tagged "labs" (unless a more specific file exists). The system uses a hierarchical search: for a signal tagged "labs,cbc,Hemoglobin", it first looks for Hemoglobin.py, then cbc.py, then labs.py. This structure promotes code reuse.
  - If no suitable logic is found, a template Python file with the signal name and instructions is created automatically. The template shows the expected input dataframe (df) and the required output format, guiding you to write the appropriate processing code.
- tests: An optional and rarely used folder for adding extra tests specific to your ETL process. It follows the same format as the tests folder in ETL_INFRA_DIR, and its contents are merged with the global tests during execution.
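
The hierarchical search described for signal_processings can be illustrated with a small sketch. This is not the infrastructure's actual implementation. The function name resolve_processing_file and the assumption that tags are ordered from most general to most specific are hypothetical, chosen only to mirror the "labs,cbc,Hemoglobin" example above.

```python
from pathlib import Path
from typing import Optional
import tempfile


def resolve_processing_file(signal_dir: Path, tags: str) -> Optional[Path]:
    """Return the most specific processing script for a comma-separated tag string.

    Hypothetical helper: assumes tags run from general to specific
    (e.g. "labs,cbc,Hemoglobin"), so candidates are tried in reverse order.
    """
    names = [t.strip() for t in tags.split(",") if t.strip()]
    for name in reversed(names):
        candidate = signal_dir / f"{name}.py"
        if candidate.is_file():
            return candidate
    # No match: per the text above, the real system would create a template here.
    return None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        folder = Path(tmp)
        # Only labs.py and cbc.py exist, so Hemoglobin falls back to cbc.py.
        (folder / "labs.py").write_text("# generic labs logic\n")
        (folder / "cbc.py").write_text("# CBC-specific logic\n")
        match = resolve_processing_file(folder, "labs,cbc,Hemoglobin")
        print(match.name if match else "no match")
```

Because the lookup falls back from specific to general, a single labs.py can serve every lab signal until one of them needs its own file.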