Skip to content
Manuel A. Vázquez edited this page Dec 9, 2022 · 1 revision

Description

A thorough description of all the fields in the ATOM can be found in Documento de especificación sobre el formato de sindicación y reutilización de datos sobre licitacionesAbre nueva ventana .

Things to take into account

Issues

  • Sometimes the same field shows up twice (error?). While this might be fine in some situations since it will be handled as a multivalue field, it is not if the offending column was already meant to be multivalued. The reason is that we would be getting, for a particular row (contract) and column (field), a list with nested lists, which is something that cannot be handled by the parquet format. At the time of writing this, the workaround is to convert every element in the nested list (scalar elements and lists) to strings.

  • Some (ATOM) files flag as deleted entries (contracts) that are nowhere to be found (see e.g., https://contrataciondelestado.es/sindicacion/PlataformasAgregadasSinMenores/1955775 in PlataformasAgregadasSinMenores_20220104_030016_1.atom). These events are silently ignored by the implementation.

Clone this wiki locally