Using the Many to Spot the Few

3 minute read

At present, the OpenITI/KITAB corpus comprises 10,243 text files, 6,268 of which are unique titles.

The New OpenITI Metadata Search

2 minute read

The OpenITI corpus was designed in a way that makes it easy for scripts to access, identify and analyse the texts in the corpus. As a human reader, it was un...