professional
VivienTech Scrappers
Thu 05 Jan, 2023
-Sat 10 Feb, 2024
( 13 months)
Basic scrapping for real estate agencies data collection.
Technologies
Python
Scrapy
SQLite
Streamlit
Docker
Docker Compose
Roles
- WebBackendAPIDBDevOps
VivienTech Scrappers is a set of custom scrappers to extract information from various online sources in order to compile an index of real estate agencies present in different areas of Spain.
Vivientech Scrappers is a collection of projects focused on extracting and managing real-estate agency data from multiple web sources. The project includes two main parts:
- Agencies Indexing: a pipeline that goes through configurations (a Google Sheets sheet) and runs the corresponding scrapers to populate a SQLite database with agency information. At the end, it publishes a copy of the database to a Google Sheets sheet for easy access and review.
- Admin Panel: a web application (admin interface) to visualize KPIs, explore the database data, and view the agencies on a map. (built with Streamlit)
Key features:
- Modular architecture: each scraper is encapsulated and can be enabled/disabled via the
configsheet in Google Sheets. - Lightweight persistence: uses SQLite for local storage and periodic copy to Google Sheets.
- Flexible execution: supports single run, looped execution, and scheduled runs.
- Admin panel for data querying and monitoring (in
admin-panelfolder).


