Home Price Index: A Machine Learning Methodology

Joseph Barr, Eden A. Ellis, Antonio Kassab, Christian L. Redfearn, Narayanan Nani Srinivasan, Kurtis B. Voris

    Research output: Contribution to journalArticlepeer-review


    Estimating house prices is essential for homeowners and investors alike with both needing to understand the value of their asset, and to understand real estate assets as part of an overall portfolios. Commonly-used indices like the National Association of Realtors (NAR) median home price index, or the celebrated Case-Shiller Home Price Index are reported exclusively over a large geographic areas, i.e., a metropolitan, whereby home price dynamics are lost. In this paper, we propose a improved method to capture price dynamics over time at the most granular level possible a single home. Using over 16 years of home sale data, from the year 2000 to 2016, we estimate home price index for each house. Once home price dynamics is captured, its possible to aggregate price dynamics to construct a price index over geographies of any kind, e.g., ZIP code. This particular index relies on a so-called `gradient boosted' model, a methodology framework relying on multiple calibration parameters and heavily dependent on sampling techniques. We demonstrate that this approach offers several strengths compared to the commonly reported indices, the `median sale' and `repeat sales' indices.
    Original languageEnglish
    Pages (from-to)111-133
    Number of pages23
    JournalInternational Journal of Semantic Computing
    Issue number1
    Publication statusPublished - 2017


    Dive into the research topics of 'Home Price Index: A Machine Learning Methodology'. Together they form a unique fingerprint.

    Cite this