Glossary term

Hedonic Regression

Updated 2026-05-01 Editorially reviewed

Hedonic regression is a statistical method that prices a home by treating each attribute — sqft, beds, baths, lot, age, location — as an independent variable with its own dollar coefficient. The predicted value is the sum of those coefficients plus a market intercept. It is the engine behind most modern AVMs, including Zestimate. It differs from sales-comparison: appraisers hand-pick 3–6 comps and adjust them; hedonic models fit one equation across hundreds of sales.

The model in one equation

A simple hedonic regression looks like this:

Price = β0 + β1·sqft + β2·beds + β3·baths + β4·lot
      + β5·age + β6·location_score + ε

Each β (beta) is the marginal price of one unit of that attribute, holding everything else constant. Fit on a few hundred recent sales in a market, the model produces estimates like:

Plug in the target's attributes, sum, and you have a value estimate. Real production AVMs use 50–200 features and non-linear extensions (splines, interactions, Gradient-Boosted Trees), but the spine is hedonic.

Why it works for real estate

Real estate is the canonical hedonic-pricing problem. Sherwin Rosen's 1974 paper "Hedonic Prices and Implicit Markets" formalised the idea: a home is a bundle of characteristics, not a single good, and the market reveals the implicit price of each characteristic through sales data. Three reasons it dominates modern AVMs:

  1. Scales to large datasets. Sales-comparison logic needs you to hand-pick comps; hedonic models fit on thousands of sales at once and extract the average attribute price.
  2. Handles atypical homes. When the target is a 5-bed home in a neighbourhood of 3-bed homes, sales-comparison runs out of comparables. Hedonic regression extrapolates using the bedroom coefficient.
  3. Produces confidence intervals. The regression's residual variance gives you a defensible error band. AVMs that don't show a confidence band are usually using simpler methods that can't.

Hedonic regression vs sales-comparison approach

The two approaches answer the same question different ways:

Dimension Hedonic regression Sales-comparison
Data scale 100–10,000 sales 3–8 hand-picked comps
Adjustment logic Coefficient per attribute Line-item per comp
Best for Typical homes in active markets Atypical homes, expert review
Transparency Black box without methodology page Every adjustment visible
USPAP-compliant No (statistical) Yes (when done by appraiser)
Used by Zillow, Redfin, HouseCanary, Twellie Licensed appraisers, USPAP

A modern AVM typically blends both: a hedonic / GBT model for the central tendency, plus an explicit comparable-sales view in the report. The line-by-line adjustments are what makes the math auditable — see comp adjustment factors explained for the full breakdown.

Where hedonic models break

Three failure modes worth knowing:

  1. Multicollinearity. Sqft and bed count are correlated; the model can't cleanly separate the two coefficients. Solution: regularisation (Ridge / Lasso) or feature selection.
  2. Non-linearity. Adding a 4th bedroom to a 1,200 ft² house doesn't add the same value as adding a 4th bedroom to a 3,000 ft² house. Linear hedonic regression misses this; GBT and spline models capture it.
  3. Sparse markets. With under ~50 sales in a 12-month window, the regression coefficients are noisy. This is why rural and thin-market valuations have wider confidence bands.

The third one is structural — no model fixes a data problem. In sparse markets the appraiser's hand-picked comparables (with explicit adjustments) often beat the AVM. See our AVM vs appraisal vs Zestimate guide for when to use each.

How to spot hedonic-driven AVMs in the wild

When you read an AVM's methodology page, the giveaway is language about "feature coefficients," "hedonic price index," or "attribute-level pricing." All three of these mean hedonic regression at the core. Zillow's Zestimate methodology, the S&P CoreLogic Case-Shiller index, and most academic real-estate price indices are hedonic-based.

The Federal Housing Finance Agency (FHFA) House Price Index uses a repeat-sales method instead, which avoids hedonic estimation by tracking the same homes over time. Different tradeoffs: more robust to attribute estimation error, but blind to home improvements between sales.

What this means for your offer

You don't need to do the regression yourself. What you need to know:

Frequently asked questions

Is hedonic regression the same thing as an AVM?
Not exactly — hedonic regression is a method, and an AVM is a productized system that uses one or more methods. Most modern AVMs (Zillow Zestimate, Redfin Estimate, HouseCanary) use hedonic regression as one input, often blended with Gradient-Boosted Trees, repeat-sales indices, and machine-learning ensembles. Pure linear hedonic regression has been mostly replaced by non-linear ML in production AVMs, but the conceptual frame — pricing each attribute independently — still drives the architecture.
Why don't licensed appraisers just use hedonic regression?
USPAP requires the appraiser to use the sales-comparison approach for residential, not statistical inference. Beyond compliance, hedonic regression is a bad fit for individual transactions: a single property's value depends on idiosyncratic factors a regression averages out. Appraisers can defend each comp choice and each line-item adjustment in writing; a hedonic coefficient is harder to defend in a tax appeal or divorce proceeding. The methods complement each other — appraisers handle individual files, AVMs handle scale.
How many sales does a hedonic model need to be accurate?
There's no clean threshold, but rough guidance from real-estate econometrics: at least 50 recent sales in a tight geography for the coefficients to be meaningfully estimated, and ideally 200+ for the attribute-level coefficients to be stable. Below 50 sales the standard errors on each coefficient explode. This is why AVM accuracy degrades sharply in rural and unique-stock markets — there's just not enough data to fit a clean model, and the AVM falls back to broader regional comparables that don't reflect the micro-market.

Related reading

Apply this to a real property

Pull a Twellie report on the next address.

Analyze a property — $50