-
Notifications
You must be signed in to change notification settings - Fork 0
Data
A thorough description of all the fields in the ATOM can be found in Documento de especificación sobre el formato de sindicación y reutilización de datos sobre licitacionesAbre nueva ventana .
- The same contract (identified through its id, e.g., https://contrataciondelestado.es/sindicacion/PlataformasAgregadasSinMenores/1974509) can show up multiple times across different files (ATOM and zip)
-
Sometimes the same field shows up twice (error?). While this might be fine in some situations since it will be handled as a multivalue field, it is not if the offending column was already meant to be multivalued. The reason is that we would be getting, for a particular row (contract) and column (field), a list with nested lists, which is something that cannot be handled by the parquet format. At the time of writing this, the workaround is to convert every element in the nested list (scalar elements and
lists) to strings. -
Some (ATOM) files flag as
deletedentries (contracts) that are nowhere to be found (see e.g., https://contrataciondelestado.es/sindicacion/PlataformasAgregadasSinMenores/1955775 in PlataformasAgregadasSinMenores_20220104_030016_1.atom). These events are silently ignored by the implementation.