Sebastian Jäger
Data Quality
From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictive Performance
We develop and evaluate an application-agnostic ML-based data cleaning approach using well-established imputation techniques for automated detection and cleaning of erroneous values. To improve the degree of automation, we combine imputation techniques with conformal prediction (CP), a model-agnostic and distribution-free method to quantify and calibrate the uncertainty of ML models.
Sebastian Jäger, Felix Bießmann
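The idea of combining an imputation model with conformal prediction can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a single numeric column predicted from one other column, uses a plain least-squares line as a stand-in for the imputation model, and applies split conformal prediction on absolute residuals. All names (`fit_line`, `conformal_radius`, `clean_column`) and the synthetic data are hypothetical and chosen for illustration only.

```python
import math
import random
import statistics


def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b (stand-in for any imputation model)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return a, my - a * mx


def conformal_radius(residuals, alpha):
    """Split-conformal quantile of the absolute calibration residuals."""
    n = len(residuals)
    k = math.ceil((n + 1) * (1 - alpha))  # rank of the conformal quantile
    if k > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(residuals)[k - 1]


def clean_column(xs, ys, model, radius):
    """Flag values outside the prediction interval; replace them with the imputed value."""
    a, b = model
    cleaned, flagged = [], []
    for i, (x, y) in enumerate(zip(xs, ys)):
        pred = a * x + b
        if abs(y - pred) > radius:
            flagged.append(i)
            cleaned.append(pred)
        else:
            cleaned.append(y)
    return cleaned, flagged


# Synthetic "clean" data (assumption): y is roughly 2x + 1 with small noise.
rng = random.Random(0)
train_x = [rng.uniform(0, 10) for _ in range(200)]
train_y = [2 * x + 1 + rng.gauss(0, 0.5) for x in train_x]
calib_x = [rng.uniform(0, 10) for _ in range(200)]
calib_y = [2 * x + 1 + rng.gauss(0, 0.5) for x in calib_x]

model = fit_line(train_x, train_y)
a, b = model
calib_residuals = [abs(y - (a * x + b)) for x, y in zip(calib_x, calib_y)]
radius = conformal_radius(calib_residuals, alpha=0.05)

# The second value is an injected error that the interval should catch.
test_x = [1.0, 4.0, 7.0]
test_y = [3.0, 50.0, 15.0]
cleaned, flagged = clean_column(test_x, test_y, model, radius)
print(flagged, cleaned)
```

The calibration step is what removes the need for a hand-tuned outlier threshold: the interval width is derived from held-out residuals and a chosen error rate `alpha`. Categorical columns would need prediction sets instead of intervals, which this sketch does not cover.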
A Benchmark for Data Imputation Methods
Comparison of data imputation methods on a wide range of datasets, missingness patterns, and missingness fractions.
Sebastian Jäger, Arndt Allhorn, Felix Bießmann