DoRA: Domain-Based Self-Supervised Learning Framework for Low-Resource Real Estate Appraisal (CIKM 2023)
Official code of the paper DoRA: Domain-Based Self-Supervised Learning Framework for Low-Resource Real Estate Appraisal. Paper link: http://arxiv.org/abs/2309.00855.
DoRA is pre-trained with an intra-sample geographic prediction as the pretext task based on the metadata of the real estate for equipping the real estate representations with prior domain knowledge. And, inter-sample contrastive learning is employed to generalize the representations to be robust for limited transactions of downstream tasks.
python DoRA.py {building_type} {num_shot}
{building_type} = 'building', 'apartment', 'house'
{num_shot} = 1, 5, ...
| Feature | Feature Type (#class) | Example |
|---|---|---|
| City Name | Category (21) | Taipei city |
| Town Name | Category (350) | Ren’ai township |
| Parking spot | Category (2) | True/False (If the house includes a parking spot.) |
| Studio | Binary | True/False (If the area of the estate is smaller than 8 square meters.) |
| Details building type | Category (5) | Residential building (11 floors and above), Mansion (10 floors and below) |
| Main Purpose | Category (1622) | Electromechanical equipment space |
| Building materials | Category (220) | Rebar, Wood |
| Management organization | Binary | True/False |
| Type of parking space | Category (3) | Flat parking spot, Automated parking spot |
| Elevator | Binary | True/False |
| First-floor index | Binary | True/False (If the house is located on the first floor.) |
| Shop index | Binary | True/False (If the house is for shop use.) |
| Housing type | Category (3) | Building, Apartment, House |
| Village name | Category (4650) | Zhongshan village |
| Land use | Category (19) | Residential zone, Forestry land, Mining land |
| Land Use Designation | Category (16) | Type A building land, Class B building site |
| Land transfer area | Numerical | 30 |
| Building transfer area | Numerical | 50 |
| Number of bedrooms | Numerical | 2 |
| Number of living rooms | Numerical | 1 |
| Number of bathroom | Numerical | 3 |
| Number of total rooms | Numerical | 5 |
| Parking area | Numerical | 5 |
| Main building area | Numerical | 100 |
| Ancillary building area | Numerical | 10 |
| Balcony area | Numerical | 5 |
| House age | Numerical | 10 years |
| Number of land transaction | Numerical | 1 |
| Number of building transaction | Numerical | 1 |
| Number of parking space transactions | Numerical | 2 |
| Building area without parking area | Numerical | 45 |
| Single floor area | Numerical | 20 |
| Floor area ratio (FAR) | Numerical | 10 (Derived by dividing the total area of the building by the total area of the parcel.) |
| Estate floor | Numerical | 5 |
| Total floor | Numerical | 10 |
| Latitude | Numerical | Horizontal lines that measure distance north or south of the equator |
| Longitude | Numerical | Vertical lines that measure east or west of the meridian in Greenwich, England. |
| Building coverage ratio | Numerical | 9 |
| Park count flat | Numerical | 0 |
Generally, real estate with many YIMBY facilities often has a higher price since it implies the quality of living and the degree of transportation convenience. On the contrary, real estate with many NIMBY facilities may be likely to have a lower price since it indicates there may be some pollutant issues that cause a negative impact on living
| Feature | Feature Type (#class) | Example |
|---|---|---|
| YIMBY_10 | Numerical | 2 |
| YIMBY_50 | Numerical | 3 |
| YIMBY_100 | Numerical | 3 |
| YIMBY_250 | Numerical | 6 |
| YIMBY_500 | Numerical | 7 |
| YIMBY_1000 | Numerical | 13 |
| YIMBY_5000 | Numerical | 28 |
| YIMBY_10000 | Numerical | 52 |
| NIMBY_10 | Numerical | 0 |
| NIMBY_50 | Numerical | 0 |
| NIMBY_100 | Numerical | 0 |
| NIMBY_250 | Numerical | 1 |
| NIMBY_500 | Numerical | 1 |
| NIMBY_1000 | Numerical | 2 |
| NIMBY_5000 | Numerical | 3 |
| NIMBY_10000 | Numerical | 6 |
| Feature | Feature Type (#class) | Example |
|---|---|---|
| Land area per town | Numerical | 23.13 (km2) |
| Population density per town | Numerical | 23835 (#people/km2) |
| House price index per quarter | Numerical | 110 |
| Unemployment rate per quarter | Numerical | 5% |
| Economic growth rate per quarter | Numerical | 3% |
| Lending rate per quarter | Numerical | 1.9% |
| Land transaction count per quarter | Numerical | 163796 |
| Average land price index per quarter | Numerical | 101 |
| Steel price index per quarter | Numerical | 1071 |
