Technical Whitepaper

Kubera AVM Methodology

Circle Rate Anchoring, Stochastic Market Calibration, and Geospatial Regression for Automated Property Valuation Across 453 Indian Cities

Published: April 2026Kubera AVM Research

Abstract

The Kubera Automated Valuation Model (AVM) produces property price estimates for apartments, plots, and commercial properties across 453 Indian cities. Unlike traditional comparable-sales approaches, Kubera anchors every valuation to the government-notified circle rate — the legally mandated minimum transaction value — and applies a proprietary multi-layer calibration pipeline combining stochastic market modelling, geospatial regression, and location intelligence scoring. Backtested across thousands of verified market transactions, the model achieves a median absolute percentage error (MAPE) of approximately 18%, with 70% of estimates falling within 30% of actual transaction prices.

1. Introduction

Property valuation in India presents unique challenges absent in Western markets. Each of India's 28 states and 8 union territories maintains its own registration portal with independently defined circle rates (also called guideline values, ready reckoner rates, or jantri rates depending on the state). These rates use different units (Rs/sqft, Rs/sqm, Rs/sq.yd, Rs/marla, Rs/katha, Rs/are), are updated at different frequencies, and often bear little relation to actual market prices.

The gap between government-notified rates and market reality varies dramatically — from near-zero in newly notified zones to 400%+ in premium urban localities. Kubera's central innovation is treating this gap as a measurable, calibratable signal rather than an obstacle: the circle rate provides a stable, legally anchored floor, and a proprietary market calibration model captures the locality-specific dynamics that drive actual transaction prices.

2. Three-Pillar Architecture

The Kubera AVM computes property value through three independent, verifiable data layers that are combined via a proprietary calibration model:

PILLAR 01

Government Circle Rate

The official rate notified by the state registration department for each locality. This is the legal floor — every registered transaction must meet or exceed it. Kubera sources these directly from 20 state government portals.

PILLAR 02

Market Calibration

A proprietary stochastic model that captures the gap between government rates and actual market prices. Calibrated against verified real-world transactions across 178 cities, with hierarchical fallbacks for smaller markets.

PILLAR 03

Location Intelligence

11 geospatial factors — hospitals, schools, transit, road quality, green spaces — scored from infrastructure data. These dimensionless scores transfer across cities without re-training.

3. Circle Rate Anchoring

Circle rate anchoring is the foundational principle of the Kubera AVM. Rather than estimating property value from scratch using comparable sales (which are sparse, noisy, and often unreported in Indian markets), Kubera starts with the one data point that is legally guaranteed to exist for every locality: the state-notified circle rate.

We source rates directly from 20 state government registration portals, normalizing all values to a common unit (Rs/sqft). For states that publish separate rates for land and apartments, the AVM automatically selects the most appropriate rate type for the property being valued.

Zone-level spatial resolution is achieved through a point-in-polygon lookup against state-level GeoJSON files containing Survey of India village boundary polygons enriched with circle rate data. For any GPS coordinate, the system returns the exact government rate for that zone using a spatial index.

4. Stochastic Market Calibration

The market calibration layer captures the gap between government-notified rates and actual market prices. Indian property markets exhibit significant variation in this gap — a premium urban locality may trade at 3-4x the circle rate, while a newly notified area may trade close to it.

Kubera uses a proprietary stochastic model calibrated against verified real-world transaction data across 178 cities. The model employs log-space estimation techniques to remain robust against outliers and naturally captures the multiplicative structure of Indian property pricing.

For cities with insufficient market data, a hierarchical Bayesian fallback applies: state-level priors serve as the first fallback, with an all-India prior as the hyper-prior. This multi-tier approach ensures coverage across 453 cities — even in markets with limited data.

4.1 Zone-Level Adjustment

Within a city, market dynamics vary by zone. Premium zones (high circle rate relative to city median) tend to exhibit different market-to-government-rate ratios than peripheral zones. Kubera applies a zone-level adjustment that captures this intra-city variation using a non-linear regression model fitted on ground-truth data.

5. Geospatial Regression: 11-Factor Location Intelligence

Beyond circle rates and market calibration, Kubera scores every zone across 11 geospatial factors computed from real infrastructure data. These factors are amenity-proximity and infrastructure-density measures computed at the zone level.

#FactorWhat It MeasuresData Source
1Hospital ProximityDistance to nearest hospital or clinicOpenStreetMap
2Market / ShoppingAccess to markets, malls, and retail zonesOpenStreetMap
3Transit ScoreBus stops, railway stations, metro connectivityOpenStreetMap
4School ProximityDistance to schools and educational institutionsOpenStreetMap
5Workplace AccessProximity to commercial and IT hubsOlaMaps
6Internet & PowerTelecom and power infrastructure densityOpenStreetMap
7Ride-hail AccessRide-hailing availability and auto-stand densityOlaMaps
8Park / Green SpaceProximity to parks, gardens, and open spacesOpenStreetMap
9Temple / WorshipTemples, mosques, churches, and gurudwaras nearbyOpenStreetMap
10Road WidthWidth and classification of nearest roadOpenStreetMap
11WaterfrontDistance to lakes, rivers, or coastlineOpenStreetMap

Location factor weights are learned from ground-truth transaction data and formulated as dimensionless scores. This design ensures they transfer across cities without requiring per-city re-training — a critical property for scaling to 453 cities with varying data density.

6. Spatial Data Infrastructure

Kubera maintains a comprehensive spatial data pipeline built on Survey of India (SOI) village boundary shapefiles covering 26 states and union territories. Each village polygon is enriched with circle rate data from the corresponding state portal, creating a seamless spatial lookup system.

The spatial index uses an R-tree structure for efficient query performance. For any GPS coordinate, the system identifies the containing polygon, resolves nested zones, and returns the zone-specific circle rate — enabling real-time valuation at query time.

MetricValue
States/UTs with polygon coverage26
Total circle rate records1,67,863
Village boundary polygons2,00,000+
Calibrated cities453
State government data sources20 portals

7. Validation & Accuracy

Model accuracy is evaluated using cross-validation across calibrated cities and validated against known bank valuations for specific properties.

MetricValue
Median Absolute Percentage Error (MAPE)~18%
Estimates within 30% of actual70%
Estimates within 50% of actual89%

The ~18% median error reflects the inherent uncertainty in Indian property markets, where transaction prices are influenced by factors not captured in public data (negotiation, undisclosed amenities, builder reputation). For context, professional human valuers in India typically achieve 10-15% accuracy, while portal-based “estimates” often exceed 30%.

8. Limitations & Caveats

  • 1.Valuations are estimates for personal reference and do not constitute a registered valuation under RERA or any state registration act.
  • 2.Circle rates may be outdated between government notification cycles (typically annual, but varies by state).
  • 3.Market calibration depends on transaction data density; smaller cities with limited data use state-level priors with wider uncertainty bounds.
  • 4.The model does not account for builder reputation, floor level, interior fit-out, or legal encumbrances — these must be assessed independently.
  • 5.Under-construction properties are not separately modeled; the condition multiplier provides a coarse adjustment only.

See the Methodology in Action

Try a free valuation and see how circle rate anchoring, market calibration, and location intelligence combine for your specific property.

Get Your Free Valuation