What is Real Estate Data & How to Use It

What is Real Estate Data?

Real Estate Data refers to information that is provided about properties, including residential real estate, commercial real estate, industrial real estate and land, which helps investors, owners, renters, companies, and people make informed real estate decisions.

An overview of the Real Estate market

The real estate market is one of the largest markets in the world, with a current market size of $8.9 billion in 2018. As such, it comes with no surprise that in the last couple of years, global metropoles have experienced a hefty increase in real estate prices, especially in residential real estate. Housing space is becoming an intensively scarce resource in many cities as construction work of new buildings cannot keep up with the demand. All these factors make it all the more valuable for the different stakeholders to place their investment, selling or renting decisions on sound data about the real estate market.

The real estate market can be divided into four categories:

Residential Real estate

This category ranges from mini-apartments to high-value homes. In between, it includes single-family homes, condominiums, co-ops, townhouses, and lofts. New construction homes, as well as resale homes belong to the residential property market.

Commercial real estate

As the name suggests, commercial real estate refers to properties whose main purpose is to produce income for the owner and/or the renter. Examples are shopping centers, malls, offices, hotels, and medical institutions. Apartments can be included in either the category of residential or commercial real estate because their owners and investors often pursue a commercial interest by holding those apartments.

Industrial real estate

Industrial company’s properties, such as buildings, warehouses, production facilities or logistics centers belong to the category of industrial real estate. This includes basically every building used by companies for research, design, production, and distribution of physical goods.


Ranches, farms and vacant land is the last category of real estate. Vacant land that is under-developed can be used for infrastructure development projects and become residential real estate at a later time. Some real estate investors are buying vacant land under the assumption that it will become residential real estate soon, which multiplies the value of the estate significantly.

Who uses Real Estate data?

Real estate data is probably one of the widely used data in the world. Anyone that is looking to buy, sell or rent a property is reliant on specific data about the property. This starts with any person that is looking for a place to live. These private households use marketplaces and brokers to compare the available properties in their preferred living area and even place their offer for the property using the marketplaces. Real estate investors and developers are collaborating with real estate data providers that offer property data solutions to gain quick actionable insights on where and when to acquire the next promising properties. Portfolio holders need to constantly monitor and predict future market developments about real estate to profit from future trends in the real estate market or by divesting less valuable properties.

What are common Real Estate data attributes?

There are dozens of relevant data attributes for real estate available and it differs on the use case what data attributes are most relevant. Traditionally, data attributes belong to either one of the two categories:

Land & Ownership attributes

  • Geolocation (address)
  • Ownership (name, last ownership change)
  • Site coverage - plot density
  • Site area
  • Local authority
  • Tenure
  • Flood risk
  • Flood defense
  • Property type (house, apartment, industrial)
  • Financing (outstanding mortgages)
  • Taxes

Real Estate market data attributes

  • Price /sqft –Rent
  • Price /sqft – Sales
  • Price development
  • Median Discount
  • Gross Yield
  • Price Asked
  • Price Paid
  • Rent Asking
  • Condition of Property
  • Days on Market – Rent
  • Days on Market – Sales
  • Local Housing Allowance
  • Property Size Square footage
  • Number and Type of Rooms
  • Rent Listings
  • Sales Listings
  • Sales History & Transactions
  • Comparables – Rent
  • Comparables – Sales

How to use real estate data?

There is an upcoming trend the last couple of years of a data driven real estate market. In addition to using traditional data about the real estate market, real estate investment firms increasingly rely on alternative data or non-traditional data for real estate to improve their investment performance.. This also includes demographic data specific to the surrounding area of the property, that is used in the context of real estate to indicate future price developments.

Demographic data attributes relevant for Real Estate

  • Median income within the area (postal code, street)
  • Unemployment rate in the area
  • Level of homelessness
  • Level of education (% of inhabitants with a graduate college degree, high school degree)
  • Median age of the inhabitants
  • Population size and density in the surrounding area of the property

Non-traditional data / Alternative data relevant for Real Estate

  • Wi-fi consumption/usage
  • Building energy consumption
  • In-building mobility, calculated by frequency of elevator movement
  • Number of coffee shops, restaurants, supermarkets within a small radius of the property
  • Reviews and price levels of local restaurants and stores
  • Semantic data of reviews for nearby shops
  • Crime rates in the area
  • Number of swimming pools in the area

Artificial intelligence and machine learning have tremendous potential in real estate, given the sheer amount of available data. Data scientists and data analysts are using all these different data sets to make sense of the real estate market. The data scientists leverage traditional data in combination with non-traditional real estate data to build highly complex models about the real estate market. These models allow real estate investment firms to make smarter investment decisions. Two properties that look very similar using traditional real estate metrics can look substantially different using non-traditional metrics. Non-traditional metrics account for up to 60% of the predictive power of the real estate value.

How is Real Estate data usually collected?

Data about real estate is typically collected from multiple public and private sources. Still to this data, large amounts of data sets reside in non-digital paper files which reduces the amount of publicly available data sets for transaction purposes. Real estate brokers are usually the ones that are directly sitting at the source of the data. Internet listings service companies, rental sites, vacation rental platforms or transactional marketplaces all provide transactional data for consumers and investors likewise.

Professional property data providers search the Internet daily and compile their database from these public advertisement sources. Other data sources are government statistics providers, company filings from real estate companies or land registries that provide information about land & ownership data.

How to assess the quality of Real Estate data?

Only accurate data is valuable data. Quality of data is an indisputable issue in almost all data categories, including real estate data. In our B2B contact data guide, we already discussed the main error types (invalid, duplicate, and missing data) that diminish the data quality.

Real estate data, additionally, poses the challenge that changes in the data occur on a regular basis due to the often limited time between a property is available on the market and when it is taken off the market. Data providers for real estate should always have multiple sources for the same data to assure high quality and accuracy of the data. Furthermore, data providers need to cleanse outliers that are obviously not accurate as well as typos from real estate agents (which happens more often than you would think). Another specific challenge for real estate data is the standardization of the property address format. Having the same address format in place (country, city, zip code, street, number) allows users of real estate data, to put the data to use more efficiently.

What to look out for when buying from a Real Estate database?

In the past, data was sold in bulk and then had to be integrated with the other applications of a company. This, however, created a lot of struggle for the industry because, as mentioned already, property data changes quickly and needs to be updated regularly. Investment firms and real estate developers don’t have the time to manually update the data sets. They just want to put the data to work and make smart decisions. This prompted a shift in the real estate data industry. Nowadays, most real estate data vendors offer cloud solutions that integrate immediately upon request via a real estate data API into the existing systems. Many data providers also offer ready-to-use SaaS solutions that are used by professionals to manipulate and dissect the data in all sorts of ways.

How is Real Estate data typically priced?

There are two prevailing pricing models in the real estate data industry. Many large data vendors offer advanced SaaS solutions, that includes software for analytics, insights and even predictions. This software is typically priced as a subscription on a monthly basis per user. Other data providers offer data buyers their data in bulk, and are priced accordingly per data point. These bulks are especially helpful for very detailed data requests, for example: “All addresses in Los Angeles that have a swimming pool.”

What to ask Real Estate data providers?

  • How do you source your data?
  • How do you assure the quality of the data?
  • How regularly is the data updated?
  • How is your data priced?
  • How does your prediction model work for future prices? (for Real Estate investors)

Explore vendors in this category

Logo of Reonomy data provider.


Reonomy leverages big data, partnerships and machine learning to connect the fragmented, disparate world of commercial real estate. By pr...

Logo of QueXopa data provider.


QueXopa® is an Alternative Data provider that finds, sources and aggregates the data that fuels decision makers in all areas of government, ...

Logo of Vendigi data provider.


#Home by Vendigi provides audience data for all things home buyers, home sellers and home ownership. Discover a spectrum of audiences spanni...