dfxpy is an open-source Python package that reduces repetitive data preprocessing work in machine learning projects. It automates common steps such as missing-value imputation, duplicate removal, categorical encoding, feature/target splitting, scaling, leakage detection, and statistical profiling. The goal is faster preprocessing, better diagnostics, and more reproducible workflows.
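To make the listed steps concrete, here is a minimal sketch of several of them written by hand with only the Python standard library. It does not use dfxpy, whose API is not described here. The function name `preprocess`, the sample rows, and the column names are all hypothetical, and the leakage check (flagging a feature that is identical to the target) is a deliberately naive stand-in for whatever dfxpy actually does.

```python
from statistics import mean, median, pstdev


def preprocess(rows, target):
    """Hand-written illustration of the steps above; NOT dfxpy's API."""
    # 1. Duplicate removal: keep the first occurrence of each identical row.
    seen, unique = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))

    # 2. Missing-value imputation: replace None with the column median.
    features = [c for c in unique[0] if c != target]
    for col in features:
        fill = median(r[col] for r in unique if r[col] is not None)
        for r in unique:
            if r[col] is None:
                r[col] = fill

    # 3. Feature/target split.
    X = [[r[c] for c in features] for r in unique]
    y = [r[target] for r in unique]

    # 4. Naive leakage check: flag features whose values equal the target.
    leaky = [c for i, c in enumerate(features) if [x[i] for x in X] == y]

    # 5. Standard scaling: zero mean, unit population variance per column.
    for i in range(len(features)):
        col = [x[i] for x in X]
        mu, sigma = mean(col), pstdev(col)
        for x in X:
            x[i] = (x[i] - mu) / sigma if sigma else 0.0

    return X, y, features, leaky


# Hypothetical sample data: one duplicate row, one missing value,
# and one column ("label_copy") that leaks the target.
rows = [
    {"age": 20, "income": 1.0, "label_copy": 0, "label": 0},
    {"age": 20, "income": 1.0, "label_copy": 0, "label": 0},
    {"age": None, "income": 3.0, "label_copy": 1, "label": 1},
    {"age": 40, "income": 5.0, "label_copy": 1, "label": 1},
]
X, y, features, leaky = preprocess(rows, target="label")
```

Each numbered comment corresponds to a step that typically has to be rewritten for every new dataset. That repetition is the work a package like dfxpy is meant to remove.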
Use Case
