OpenITI release 2021.2.5

1 minute read

The KITAB team has released a new version (2021.2.5) of the OpenITI corpus at Zenodo. The release is open access. It is our fifth release (second release in ...

Using the Many to Spot the Few

3 minute read

At present, the OpenITI/KITAB corpus comprises 10,243 text files, 6,268 of which are unique titles.

OpenITI release 2021.1.4

1 minute read

The KITAB team has released a new version (2021.1.4) of the OpenITI corpus at Zenodo. The release is open access and freely available. It is our fourth relea...

OpenITI release 2020.2.3

1 minute read

A new version (version 2020.2.3) of the OpenITI corpus is available at Zenodo, an Open Science platform that supports open access. This is the third release ...

Preserving Pre-Modern Terminologies

11 minute read

To categorise things is a fundamental human and scholarly instinct and activity. And yet it is one not without obstacles, for we soon learn that the world is...

OpenITI, OCR, and Textual Criticism

5 minute read

In previous posts, other members of the KITAB team have talked about building the OpenITI corpus of Arabic and Persian sources. Many members of the team are ...