VivienTech Scrappers
professional

VivienTech Scrappers

Thu 05 Jan, 2023

-

Sat 10 Feb, 2024

( 13 months)

Basic scrapping for real estate agencies data collection.

Technologies

    Python

    Scrapy

    SQLite

    Streamlit

    Docker

    Docker Compose

Roles

    WebBackendAPIDBDevOps

VivienTech Scrappers is a set of custom scrappers to extract information from various online sources in order to compile an index of real estate agencies present in different areas of Spain.

python
scrapy
sqlite
streamlit
docker

Vivientech Scrappers is a collection of projects focused on extracting and managing real-estate agency data from multiple web sources. The project includes two main parts:

  • Agencies Indexing: a pipeline that goes through configurations (a Google Sheets sheet) and runs the corresponding scrapers to populate a SQLite database with agency information. At the end, it publishes a copy of the database to a Google Sheets sheet for easy access and review.
  • Admin Panel: a web application (admin interface) to visualize KPIs, explore the database data, and view the agencies on a map. (built with Streamlit)

Key features:

  • Modular architecture: each scraper is encapsulated and can be enabled/disabled via the config sheet in Google Sheets.
  • Lightweight persistence: uses SQLite for local storage and periodic copy to Google Sheets.
  • Flexible execution: supports single run, looped execution, and scheduled runs.
  • Admin panel for data querying and monitoring (in admin-panel folder).