Summary
Comparing the distinct subfolder names after jp2/ stored in the DB paths (public.manuscripts_itemimage.image) with the folders that currently exist in the Archetype3 cloud folder (where transferred images live). Identifying missing and extra.
DB extraction query
SELECT DISTINCT
substring(image FROM 'jp2/([^/]+)')
AS after_jp2
FROM public.manuscripts_itemimage
WHERE image LIKE '%jp2/%'
AND image ~ 'jp2/[^/]+';
Data (current snapshot)
DB subfolders (distinct)
- 06W03156P
- 15_1_18
- 4w108287_89_90_91_92p
- BLno1
- C418080
- C418254
- C7443_C7444_C7445
- Durham01
- Durham02
- Durham03
- Durham04
- Durham_Scottish_Charters
- Holyrood_1
- Holyrood_2
- Holyrood_3
- Holyrood_4
- Jul_23_2010_2
- MS_187_06
- Melrose_charters_GD_55
- NRSGD45
- NRS_GD_90
- Reading
- Rievaulx_Abbey_C
- SC_499_14
- admin-upload
Archetype3 cloud folders provided
- 06W03156P/
- 080508/
- 100709_01/
- 15.1.18/
- 4w108287-89-90-91-92p/
- BLno1/
- C418080/
- C418254/
- C7443-C7444-C7445/
- Donnelly Durham/
- Durham Scottish Charters/
- Durham01/
- Durham03/
- Durham04/
- Holyrood 1/
- Holyrood 2/
- Holyrood 3/
- Holyrood 4/
- Jul 22 2010/
- Jul 23 2010/
- Jul 23 2010 - 2/
- Melrose charters GD 55/
- MS-187-06/
- New BL charters/
- NRS GD 90/
- NRSGD45/
- Reading/
- Rievaulx Abbey C/
- SC-499-14/
- test/
Comparison (based on current lists)
Present (DB ↔ Cloud) but naming differs
These names differ only by separators (underscore/dot/hyphen/space):
15_1_18 ↔ 15.1.18/
4w108287_89_90_91_92p ↔ 4w108287-89-90-91-92p/
C7443_C7444_C7445 ↔ C7443-C7444-C7445/
Durham_Scottish_Charters ↔ Durham Scottish Charters/
Holyrood_1 ↔ Holyrood 1/
Holyrood_2 ↔ Holyrood 2/
Holyrood_3 ↔ Holyrood 3/
Holyrood_4 ↔ Holyrood 4/
Jul_23_2010_2 ↔ Jul 23 2010 - 2/
MS_187_06 ↔ MS-187-06/
Melrose_charters_GD_55 ↔ Melrose charters GD 55/
NRS_GD_90 ↔ NRS GD 90/
Rievaulx_Abbey_C ↔ Rievaulx Abbey C/
SC_499_14 ↔ SC-499-14/
Present with same naming
06W03156P ↔ 06W03156P/
BLno1 ↔ BLno1/
C418080 ↔ C418080/
C418254 ↔ C418254/
Durham01 ↔ Durham01/
Durham03 ↔ Durham03/
Durham04 ↔ Durham04/
NRSGD45 ↔ NRSGD45/
Reading ↔ Reading/
In DB but not found in cloud list (potentially missing transfers)
In cloud but not found in DB list (potentially extra / new / unrelated)
080508/
100709_01/
Donnelly Durham/
Jul 22 2010/
Jul 23 2010/
New BL charters/
test/
Summary
Comparing the distinct subfolder names after
jp2/stored in the DB paths (public.manuscripts_itemimage.image) with the folders that currently exist in the Archetype3 cloud folder (where transferred images live). Identifying missing and extra.DB extraction query
Data (current snapshot)
DB subfolders (distinct)
Archetype3 cloud folders provided
Comparison (based on current lists)
Present (DB ↔ Cloud) but naming differs
These names differ only by separators (underscore/dot/hyphen/space):
15_1_18↔15.1.18/4w108287_89_90_91_92p↔4w108287-89-90-91-92p/C7443_C7444_C7445↔C7443-C7444-C7445/Durham_Scottish_Charters↔Durham Scottish Charters/Holyrood_1↔Holyrood 1/Holyrood_2↔Holyrood 2/Holyrood_3↔Holyrood 3/Holyrood_4↔Holyrood 4/Jul_23_2010_2↔Jul 23 2010 - 2/MS_187_06↔MS-187-06/Melrose_charters_GD_55↔Melrose charters GD 55/NRS_GD_90↔NRS GD 90/Rievaulx_Abbey_C↔Rievaulx Abbey C/SC_499_14↔SC-499-14/Present with same naming
06W03156P↔06W03156P/BLno1↔BLno1/C418080↔C418080/C418254↔C418254/Durham01↔Durham01/Durham03↔Durham03/Durham04↔Durham04/NRSGD45↔NRSGD45/Reading↔Reading/In DB but not found in cloud list (potentially missing transfers)
Durham02admin-uploadIn cloud but not found in DB list (potentially extra / new / unrelated)
080508/100709_01/Donnelly Durham/Jul 22 2010/Jul 23 2010/New BL charters/test/