Paper Number

ECIS2026-1299

Paper Type

CRP

Abstract

Information Systems research increasingly relies on machine learning (ML) to predict outcomes in complex sociotechnical systems, yet predictive models are not designed to identify causal effects. This limitation is particularly critical in building retrofits, where unbiased estimates of energy savings are essential for climate policy and investment decisions. Because retrofit adoption is shaped by household and building characteristics that also affect energy consumption, predictive ML can yield biased effect estimates. This paper systematically benchmarks leading causal ML estimators, including metalearners (S-, T- and X-Learners) and DoubleML across multiple retrofit interventions. To enable this comparison, we construct a physically grounded simulation in which true treatment effects and realistic adoption biases are known. Results show that DoubleML achieves the lowest estimation errors, particularly for complex envelope retrofits. These findings demonstrate that orthogonalising the treatment assignment improves causal effect estimation and provides a methodological foundation for large-scale energy retrofit and policy evaluation.

Share

COinS
 
Jun 14th, 12:00 AM

Predicting Heterogeneous Treatment Effects Of Building Energy Saving Retrofits Using Causal Machine Learning

Information Systems research increasingly relies on machine learning (ML) to predict outcomes in complex sociotechnical systems, yet predictive models are not designed to identify causal effects. This limitation is particularly critical in building retrofits, where unbiased estimates of energy savings are essential for climate policy and investment decisions. Because retrofit adoption is shaped by household and building characteristics that also affect energy consumption, predictive ML can yield biased effect estimates. This paper systematically benchmarks leading causal ML estimators, including metalearners (S-, T- and X-Learners) and DoubleML across multiple retrofit interventions. To enable this comparison, we construct a physically grounded simulation in which true treatment effects and realistic adoption biases are known. Results show that DoubleML achieves the lowest estimation errors, particularly for complex envelope retrofits. These findings demonstrate that orthogonalising the treatment assignment improves causal effect estimation and provides a methodological foundation for large-scale energy retrofit and policy evaluation.

When commenting on articles, please be friendly, welcoming, respectful and abide by the AIS eLibrary Discussion Thread Code of Conduct posted here.