Skip to content

Academic Inquiry: Spatial Analysis with DuckDB for Engineering Thesis #564

@2xqqjnq

Description

@2xqqjnq

Dear DuckDB Team,

We are two engineering students, BAIDA Brahim and BOUCHANTIYA Mohamed,
currently completing our final year project (PFE) for the Engineering
Degree in Geomatics and Topographic Sciences at the Institut
Agronomique et Vétérinaire Hassan II in Rabat, Morocco.

Our PFE is entitled "Exploration of DuckDB Spatial for Analytics on
Massive Geospatial Data".
As part of this work, we are focusing on a benchmarking study between
PostGIS and DuckDB Spatial.

Specifically, we are comparing the performance of both engines on the
spatial analysis of a large building dataset across Morocco (~3GB,
approximately 24 million geometries).

During our experiments, we noticed an important difference:

In PostGIS, after validating the geometries, we obtain approximately
24 million valid geometries.

In DuckDB, after validation, we obtain approximately 12 million valid
geometries.

Our investigations suggest that PostGIS may be slightly tolerant
towards minor polygon closure errors, while DuckDB seems to enforce
stricter validation rules.

➔ We would greatly appreciate if you could clarify:

  • Is DuckDB Spatial deliberately stricter in its validation of geometries?
  • Which validation standard or model is DuckDB following internally
    for spatial data?

Moreover, we are very interested in better understanding the internal
architecture of DuckDB regarding the processing of massive spatial
datasets, particularly:

How DuckDB manages spatial indexing and optimization.

How DuckDB achieves efficient handling of very large spatial files.

Any official documentation, technical notes, or research articles that
could enrich our thesis would be highly appreciated.

Thank you very much for your time and support.
We remain available for any further information or collaboration if needed.

Sincerely,
BAIDA Brahim & BOUCHANTIYA Mohamed
Institut Agronomique et Vétérinaire Hassan II
Rabat, Morocco

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions