Using the Many to Spot the Few

At present, the OpenITI/KITAB corpus comprises 10,243 text files representing 6,268 unique titles.

The New OpenITI Metadata Search

The OpenITI corpus was designed so that scripts can easily access, identify and analyse its texts. As a human reader, it was un...