The inspiration for this project was a curiosity-driven deep dive into AirBnB listing data. The goal was to identify market opportunities by finding hosts who might benefit from a cleaning and maintenance service.
The CSV files used in this project can be found at the following link: Google Drive.
This project focuses on analyzing AirBnB listings in London, UK. After performing exploratory data analysis on the listings, the goal was to identify potential leads for a hypothetical new cleaning and maintenance service.
This project explores key questions to extract meaningful insights from the dataset:
1️⃣ What are the 10 most reviewed listings?
2️⃣ What are the top 5 most expensive neighbourhoods?
3️⃣ Are any hosts fully booked for March 2025, and which hosts have the least number of available dates in March 2025?
4️⃣ Who are the hosts with the least listings?
5️⃣ Identify some hosts which may benefit from a new cleaning service provider.
- Database: SQLite3
- Language: SQL
✅ Relational database creation
✅ Table creation and data import
✅ Exploratory data analysis using SQL
✅ Basic report generation for insights
-
Download Data
- Get the relevant CSV files from this Google Drive link.
-
Set Up SQLite3
- If SQLite3 is not installed, download it from SQLite Official Website.
- Verify installation:
sqlite3 --version
-
Create the Database
- Open SQLite3 and create a new database:
sqlite3 airbnb_london.db
- Open SQLite3 and create a new database:
-
Import the CSV Data into SQLite
- Open SQLite shell and run:
.mode csv .import 'calendar.csv' calendar .import 'listings.csv' listings .import 'listings-2.csv' listings_2 .import 'neighbourhoods.csv' neighbourhoods .import 'reviews.csv' reviews .import 'reviews-2.csv' reviews_2
- Open SQLite shell and run:
-
Verify Data
- Open SQLite and explore the data:
SELECT * FROM listings LIMIT 5;
- Open SQLite and explore the data:
-
Run SQL Queries
- Open SQLite and explore the data:
sqlite3 airbnb_london.db
- Example query:
SELECT host_id, host_name, AVG(review_scores_cleanliness) AS avg_cleanliness FROM listings GROUP BY host_id ORDER BY avg_cleanliness DESC;
- Open SQLite and explore the data:
This project serves as a template for:
- Importing web-scraped data into a relational database
- Performing exploratory analysis on structured data
- Extracting actionable insights from raw datasets
- No contribution guidelines at the moment.
- No specific license.