MyAV·¶

XClose

MyAV·¶ Global

Home
Menu

MyAV·¶ and U of T develop software to machine-read medieval Latin texts

28 April 2021

MyAV·¶ and University of Toronto researchers launch free-to-use AI model for the automated transcription of handwritten Latin manuscripts

Screenshot of Transkribus in use

Researchers behind the  at MyAV·¶ and the  (DEEDS) Project at the University of Toronto (U of T) have collaborated to develop a machine-reading model that uses artificial intelligence to transcribe medieval Latin texts.

The model, which uses the , was officially launched and made available for free public use at a last month. The joint event was chaired by the project lead, Professor Philip Schofield (MyAV·¶ Laws), and attended by 180 people.

Following the successful launch, the MyAV·¶ and U of T teams held a joint virtual training session to demonstrate how to use the new model. The event, training session and collaboration were funded by the MyAV·¶-U of T Strategic Partner Funds, which supports joint collaborative projects between the two universities.

Teaming up with researchers at U of T

Before joining forces with the DEEDS team at U of T led by Professor Michael Gervers, Professor Schofield and the Bentham Project were involved in the initial development of the Transkribus software for six years between 2013 and 2019.

The more recent collaboration with U of T focuses specifically on teaching the software to read medieval Latin manuscripts. Over four to five months, the two teams fed the machine, line by line, around 300 pages of handwritten Latin text and the model is now capable of reading over 90 percent of the characters correctly.

Professor Schofield said: “One of the most important breakthroughs was teaching the computer to recognise and expand abbreviations. It was truly a joint effort, with Ariella Elema on the Toronto side providing the accurate transcripts of the Latin material and Chris Riley our side doing layout analysis, which ensures the transcript matches each line of text in an image.”

While the MyAV·¶-U of T model has primarily been trained for the automated transcription of 13th-century English charter documents, Professor Schofield’s vision is to eventually combine the machine-reading of manuscripts with crowdsourcing to accelerate the Transcribe Bentham initiative.

Transcribe Bentham was launched in 2010 and engages volunteers to transcribe original correspondence written by Jeremy Bentham over his lifetime (1748-1832). Over 26,000 pages have been transcribed to date.

More information


For the latest news about MyAV·¶â€™s international activity, partnerships and opportunities, subscribe to the bimonthly Global Update newsletter.