Skip to content

Comapring JP2 subdirectory names: DB (public.manuscripts_itemimage.image) vs Archetype3 cloud folders #65

@zixben

Description

@zixben

Summary

Comparing the distinct subfolder names after jp2/ stored in the DB paths (public.manuscripts_itemimage.image) with the folders that currently exist in the Archetype3 cloud folder (where transferred images live). Identifying missing and extra.

DB extraction query

SELECT DISTINCT
       substring(image FROM 'jp2/([^/]+)')
         AS after_jp2
FROM public.manuscripts_itemimage
WHERE image LIKE '%jp2/%'
  AND image ~ 'jp2/[^/]+';

Data (current snapshot)

DB subfolders (distinct)

  • 06W03156P
  • 15_1_18
  • 4w108287_89_90_91_92p
  • BLno1
  • C418080
  • C418254
  • C7443_C7444_C7445
  • Durham01
  • Durham02
  • Durham03
  • Durham04
  • Durham_Scottish_Charters
  • Holyrood_1
  • Holyrood_2
  • Holyrood_3
  • Holyrood_4
  • Jul_23_2010_2
  • MS_187_06
  • Melrose_charters_GD_55
  • NRSGD45
  • NRS_GD_90
  • Reading
  • Rievaulx_Abbey_C
  • SC_499_14
  • admin-upload

Archetype3 cloud folders provided

  • 06W03156P/
  • 080508/
  • 100709_01/
  • 15.1.18/
  • 4w108287-89-90-91-92p/
  • BLno1/
  • C418080/
  • C418254/
  • C7443-C7444-C7445/
  • Donnelly Durham/
  • Durham Scottish Charters/
  • Durham01/
  • Durham03/
  • Durham04/
  • Holyrood 1/
  • Holyrood 2/
  • Holyrood 3/
  • Holyrood 4/
  • Jul 22 2010/
  • Jul 23 2010/
  • Jul 23 2010 - 2/
  • Melrose charters GD 55/
  • MS-187-06/
  • New BL charters/
  • NRS GD 90/
  • NRSGD45/
  • Reading/
  • Rievaulx Abbey C/
  • SC-499-14/
  • test/

Comparison (based on current lists)

Present (DB ↔ Cloud) but naming differs

These names differ only by separators (underscore/dot/hyphen/space):

  • 15_1_1815.1.18/
  • 4w108287_89_90_91_92p4w108287-89-90-91-92p/
  • C7443_C7444_C7445C7443-C7444-C7445/
  • Durham_Scottish_ChartersDurham Scottish Charters/
  • Holyrood_1Holyrood 1/
  • Holyrood_2Holyrood 2/
  • Holyrood_3Holyrood 3/
  • Holyrood_4Holyrood 4/
  • Jul_23_2010_2Jul 23 2010 - 2/
  • MS_187_06MS-187-06/
  • Melrose_charters_GD_55Melrose charters GD 55/
  • NRS_GD_90NRS GD 90/
  • Rievaulx_Abbey_CRievaulx Abbey C/
  • SC_499_14SC-499-14/

Present with same naming

  • 06W03156P06W03156P/
  • BLno1BLno1/
  • C418080C418080/
  • C418254C418254/
  • Durham01Durham01/
  • Durham03Durham03/
  • Durham04Durham04/
  • NRSGD45NRSGD45/
  • ReadingReading/

In DB but not found in cloud list (potentially missing transfers)

  • Durham02
  • admin-upload

In cloud but not found in DB list (potentially extra / new / unrelated)

  • 080508/
  • 100709_01/
  • Donnelly Durham/
  • Jul 22 2010/
  • Jul 23 2010/
  • New BL charters/
  • test/

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

In progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions