Description
I am opening a space to review the lessons which use the Old Bailey Online's website, in light of the recent changes to the Old Bailey's API.
Although the old version of the website will still be accessible until August 2024 at https://www.dhi.ac.uk/oldbaileyonline, we want to update lessons which are affected by this change, so they remain usable in the future.
I'll then open a single issue for each of the lessons which do need to be updated, and link to them below.
I have counted 10 lessons which refer to the Old Bailey more or less extensively:
- EN, ES, FR, PT
Changes needed: MINOR
In the ‘Python List’ section, we need to change the URL inside a code block from http://www.oldbaileyonline.org/print.jsp?div=t17800628-33
to https://www.oldbaileyonline.org/record/t17800628-33
.
(I’m not sure what the print
component adds to the first URL and whether it is needed in the update too.)
Then, perhaps we need to update the list of words received in the output below, but only if they've changed with the new URL.
- EN, ES, FR, PT
Changes needed: MINOR
Where the lesson says: ‘and the Old Bailey uses this format’, the format needs to be updated to reflect the current one. The example URL should be changed as well.
Actually, the current example URL is http://www.oldbaileyonline.org/browse.jsp?ref=OA16780417
, which doesn’t show any results on the obsolete Old Bailey site. Using name
instead of ref
does work, though (https://www.dhi.ac.uk/oldbaileyonline/browse.jsp?name=OA16780417
).
The corresponding URL on the new website is https://www.oldbaileyonline.org/record/OA16780417
.
- EN, ES, PT
Changes needed: NONE
Although this lesson refers to the Old Bailey, it uses a file which is already available in the lesson's assets directory, so I think it can remain as is
- EN, ES, PT
Changes needed: MINOR
The URL http://www.oldbaileyonline.org/browse.jsp?id=t17800628-33&div=t17800628-33
appears twice and needs to be changed to https://www.oldbaileyonline.org/record/t17800628-33
. (Perhaps this URL would also need the &div=
component? I don’t know how to recreate this in the new format.)
- EN, ES, FR, PT
Changes needed: MINOR
The URL http://www.oldbaileyonline.org/browse.jsp?id=t17800628-33&div=t17800628-33
needs to be changed to https://www.oldbaileyonline.org/record/t17800628-33
. (Again, perhaps it needs the &div=
component?)
- EN, ES, FR, PT
Changes needed: NONE
This lesson teaches how to create matrices with data from the Old Bailey, but never refers directly to the site
- EN, ES, FR, PT
Changes needed: NONE
This lesson only shows a screenshot of the Old Bailey website and its html code. Although we could update the images to show its modern look and html code, it’s not really necessary for the lesson.
- EN, ES, FR, PT
Changes needed: MAJOR
Many URLs need to be updated:
http://oldbaileyonline.org/static/Project.jsp
-> unsure
.
https://www.oldbaileyonline.org/search.jsp? form=searchHomePage&_divs_fulltext=arsenic&kwparse=and&_persNames_surname=&_persNames_given=&_persNames_alias=&_offences_offenceCategory_offenceSubcategory=&_verdicts_verdictCategory_verdictSubcategory=&_punishments_punishmentCategory_punishmentSubcategory=&_divs_div0Type_div1Type=&fromMonth=&fromYear=&toMonth=&toYear=&ref=&submit.x=0&submit.y=0
-> unsure
. We can probably recreate it by using the Advanced Search functionality in the new website with the same parameters, though.
http://www.oldbaileyonline.org/browse.jsp?id=t17800628-33&div=t17800628-33
-> https://www.oldbaileyonline.org/record/t17800628-33
(Bowsey trial).
We must check that the 'little bit of HTML markup' is still correct.
Also, after ‘By studying the URL we can learn a few things’, these ‘few things’ have to be reviewed to ensure they are still correct.
[On a different note, this lesson uses Komodo Edit, which we've encountered issues with in other lessons.]
- EN, ES, PT
Changes needed: MAJOR
See Issue #3134
- EN
Changes needed: MAJOR
Changes are needed from 'Downloading trials' onwards:
http://www.oldbaileyonline.org/obapi/ob?term0=fromdate_18300114&term1=todate_18391216&count=10&start=211&return=zip
-> unsure
.
Careful changes will be needed to the script which allows you to download more than 10 entries at once, and to the accompanying description.
Where it says ‘a file that looks like this:’ (wget1830s.txt
), I expect it will look different now due to the changed URLs.
After ‘Here’s a snippet from one trial:’, we might need to update it slightly. The XML markup found on the current website for https://www.oldbaileyonline.org/record/t18300114-2
is ever so slightly different. However, I think it will perhaps still work as intended? This will be discovered if the command:
cd ../../baileycode/
python save-trialtxts-by-category.py
still runs the script as desired. If so, then no further changes are needed after this step.