Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
OpenITI release 2023.1.8
The 8th version (2023.1.8) of the OpenITI corpus is now available at Zenodo. The release is open access and is also accessible through our GitHub repository....
Post 8: Bibliography
Antrim, Zayde, ‘Nostalgia for the Future: A Comparison between the Introductions to Ibn ʿAsākir’s Taʾrīkh Madīnat Dimashq and al-Khaṭīb al-Baghdādī’s Taʾrīkh...
Post 7: People, Connections and Memory
Image yourself as a learned bookseller of the twelfth century. You have just been called in to assess the estate of a wealthy, prominent scholar who has died...
Post 6: Searches for References to written materials outside of Isnads
As noted in the last post, we struggled to verify book citations in the TMD, both within and outside of isnāds. We believe that our struggles reflect the cha...
Post 5: Ibn ‘Asakir’s Citation of Author Names in Isnads
We continue our investigation of Ibn ʿAsākir’s citations to address our third question about his working methods. When author names appear within his isnāds,...
Post 4: Ibn ‘Asakir’s Transmission Terms in Isnads
Our previous blog post featured a deep dive into the pool of informants whom Ibn ʿAsākir cites frequently. Now we turn to the big picture of how he says he a...
Post 3: Ibn ‘Asakir’s Direct Informants
Ibn ʿAsākir names many persons from whom he acquired information for the TMD. What can our data tell us about them?
Post 2: Ibn ‘Asakir and His History of Damascus, the Data Set
Digital humanists often say they would like to read more work in progress. Our blog posts represent such work. We worked intensively over months to create an...
Post 1: Introducing Ibn ‘Asakir and His History of Damascus
The OpenITI corpus contains more than 11,000 works and now exceeds 2 billion words in size. Many of the corpus’s works are extraordinarily large, surpassing ...
Arabic Pasts 2023: Programme
We are pleased to announce the programme for this year’s Arabic Pasts, running from Thursday 5th until Friday 6th of October 2023. We have yet another exciti...
Call For Papers : Arabic Pasts 2023
Arabic Pasts: Histories and Historiographies
OpenITI release 2022.2.7
The KITAB team has released a new version (2022.2.7) of the OpenITI corpus at Zenodo. The release is open access. It is our seventh release (second release i...
OpenITI release 2022.1.6
The KITAB team has released a new version (2022.1.6) of the OpenITI corpus at Zenodo. The release is open access. It is our fifth release (second release in ...
OpenITI and the Fihrist: Analysis
This is the third blog in a short series of blogs on the overlap between the OpenITI corpus and Ibn al-Nadim’s Fihrist. Please refer to the first part for a ...
OpenITI and the Fihrist: Methodology
This is the second blog in a short series of blogs on the overlap between the OpenITI corpus and Ibn al-Nadim’s Fihrist. Please refer to the first part for a...
OpenITI and the Fihrist
The corpus of texts the KITAB project uses as the basis for its research is a subsection of the OpenITI corpus. It contains Arabic-language texts of the firs...
Lecture Announcement: SHARIAsource Lab Workshop : Ibn ʿAsākir and His History of Damascus: Named Entity Recognition and Text Reuse, Sarah Bowen Savant (Harvard Law School)
On Tuesday September 27, 2022 at 12:00-1:00PM US EST at Lewis 214, KITAB’s Sarah Bowen Savant, will lead a seminar on research in progress that uses the Open...
Arabic Pasts 2022: Programme
We are pleased to announce the programme for this year’s Arabic Pasts. We have yet another exciting series of papers covering a range of topics and periods. ...
A Ramble Through the Cluster Data, Part 2: Quantifying and Visualising Clusters.
In part 1, I introduced you to the cluster data set, a second passim data set that is slightly different from the pairwise data set that the KITAB team use i...
A Close and Distant Reading of Writerly Practices: Sarah Bowen Savant’s Inaugural Lecture
On Thursday 5th of May 2022 Sarah Bowen Savant gave her inaugural lecture as full professor at the AKU-ISMC.
A Ramble Through the Cluster Data, Part 1: From Pairs to Clusters.
It should be no surprise to any reader of this blog that the KITAB project is primarily interested in studying Arabic text reuse. A large number of posts her...
Call for Papers: A Workshop on Citation (25th-26th July 2022)
Modeling Attribution and Acknowledgement in the Digital Humanities: Citation Practices and the Pre-Modern Arabic Book.
Call for papers: Arabic Pasts 2022
Arabic Pasts: Histories and Historiographies
Oh Brethren, Where Are Ye? How to search for words and phrases in the OpenITI corpus, demonstrated with the phrase ‘Ikhwan al-Safa’
The OpenITI corpus is designed to facilitate many different forms of computational analysis. Within the KITAB project we spend the bulk of our time fine-tuni...
New KITAB visualizations
Much of our work at KITAB involves comparing books in order to understand their relationships. Our main tool for this is the passim software, which detects p...
Some Suggestions on Using OpenITI Corpus to Present Enhanced Digital Versions of Large Collections: The Case of al-Dhari‘a Ila Tasanif al-Shi‘a
Tagging the structure of the texts in OpenITI corpus is an important step towards the ultimate goal of the KITAB projectStudying the Arabic textual tradition...
Dispatches from al-Tabari 8: The Afterlife of al-Tabari in Quotations
In this series of blog posts, we have argued for the imperative to rethink writerly culture in ways that allow for a more meaningful exploration of al-Tabari...
Dispatches from al-Tabari 7: Text Reuse Alignments
Post 7: Text Reuse Alignments
OpenITI release 2021.2.5
The KITAB team has released a new version (2021.2.5) of the OpenITI corpus at Zenodo. The release is open access. It is our fifth release (second release in ...
Dispatches from al-Tabari 6: Sources Common to All of al-Tabari’s Works
The argument we are advancing in these blog posts is that when al-Tabari (d. 310/923) created his Taʾrikh al-rusul wa-l-muluk, Jamiʿ al-bayan ʿan taʾwil ay a...
Dispatches from al-Tabari 5: Reconstructing al-Tabari’s Notebooks
In our previous blog post, we argued that al-Tabari (d. 310/923) had to hand an extensive written collection consisting of sets of well-written notes.
Dispatches from al-Tabari 4: The Form of al-Tabari’s Sources: His Probable Notebooks
In the preceding posts, we showed that al-Tabari (d. 310/923) used the phrases ‘he told me’ and ‘he told us’ (haddathani/haddathana) in the Taʾrikh al-rusul ...
Dispatches from al-Tabari 3: How Many People Did al-Tabari Talk To?
This is the third in a series of blog posts examining al-Tabari’s (d. 310/923) citations in his Taʾrikh al-rusul wa-l-muluk, Jāmiʿ al-bayān ʿan taʾwīl āy al-...
Dispatches from al-Tabari 2: Show Me the Data!
You have now entered the weeds.
Dispatches from al-Tabari 1: Al-Tabari’s Direct Informants: Work on a New Data Set
In a series of eight blog posts, we share some results of experimental work on citations in three works by Muhammad b. Jarir al-Tabari (d. 310/923). These ar...
Arabic Pasts 2021 Programme
Arabic Pasts 2021 is happening from 7th - 9th of October.
Using the Many to Spot the Few
At present, the OpenITI/KITAB corpus comprises 10,243 text files, 6,268 of which are unique titles.
A Token Frequency Counter For OpenITI Texts
One of the participants in our KITAB user group asked for an easy way to find out which are the most frequently used words in a text.
From Networks to Named Entities and Back Again: Exploring Isnad Networks
From Networks to Named Entities and Back Again: Exploring Isnad Networks
First Five Hundred Years of the Arabic Book: The Native Origin of the Authors
Quantitative and macroanalytic approaches
Can Digital Humanities Be Informed by Bioinformatics? Visualising Passim Data for Multiple-Book Relationships
As KITAB’s research has shown, passim is an incredibly powerful tool for answering a variety of questions about book history and history in general. The algo...
Call for Papers – Arabic Pasts: Histories and Historiographies (Annual Workshop)
This annual exploratory and informal workshop offers the opportunity to reflect on history writing in Arabic. We encourage contributions focused on methodolo...
OpenITI release 2021.1.4
The KITAB team has released a new version (2021.1.4) of the OpenITI corpus at Zenodo. The release is open access and freely available. It is our fourth relea...
Diversifying the OpenITI corpus, One Text at a Time
The vast majority of texts in the OpenITI corpus were sourced from three major collections of digital texts originally prepared by organisations based in the...
Tracing the origins of a historical fragment focused on the Samanids
At the Arabic Pasts conference this year, Hugh Kennedy and I presented a paper in the panel dedicated to the Invisible East programme, chaired by the program...
Al-Maktaba al-Shamila: a short history
(This is the first blog post in a longer series of posts about the sources of OpenITI.)
KITAB postdoc Gowaart Van Den Bossche wins BRAIS-De Gruyter dissertation prize – 2020
The British Association for Islamic Studies (BRAIS) and De Gruyter have announced the outcome of the fifth (2020) round of the BRAIS–De Gruyter Prize in the ...
OpenITI release 2020.2.3
A new version (version 2020.2.3) of the OpenITI corpus is available at Zenodo, an Open Science platform that supports open access. This is the third release ...
Mapping Who’s Who in Isnads – First Steps
One of the major challenges for those working with historical Arabic texts lies in names, and in the variety of ways that authors might refer to the same per...
Between Manuscripts and Digital Texts: Commentaries on Hadith Raʾs al-Jalut
For us as digital historians and corpus curators, faced with the complex history of reception and transmission as well as the distinct approach to learning a...
Arabic Pasts: Histories and Historiographies Research workshop (October 22-24, 2020 London)
This annual exploratory and informal workshop offers the opportunity to reflect on history writing in Arabic. This year the event will be held online to allo...
Adventures in Alignments: Training an Algorithm to Recognise Text Reuse
Text reuse is the term that we use to describe cases where one book shares verbatim material with another. Text reuse can be studied manually through the rea...
Preserving Pre-Modern Terminologies
To categorise things is a fundamental human and scholarly instinct and activity. And yet it is one not without obstacles, for we soon learn that the world is...
Call for Participation in KITAB (Knowledge, Information Technology, and the Arabic Book)
The KITAB project is seeking researchers who are interested in collaborating to advance their own, distinct research projects. The aim is to build a small gr...
OpenITI, OCR, and Textual Criticism
In previous posts, other members of the KITAB team have talked about building the OpenITI corpus of Arabic and Persian sources. Many members of the team are ...
Algorithmic Reading of Shiʿi Hadith Collections: Direct Borrowing and Common Sources
It is not accidental that a large number of books in the OpenITI corpus belong to one important genre, prophetic Hadith – the sayings of the Prophet Muhammad...
New Release of Our Open Access Arabic Corpus, OpenITI, version 2020.1.2
A new version of the corpus used by the KITAB team is now available to download at Zenodo, an Open Science platform that supports open access. This is the se...
Tagging the Structure of Texts in the OPENITI Corpus
With currently more than 7,000 titles, collected from a number of huge digital Arabic libraries (al-Jamiʿ al-Kabir, al-Maktaba al-Shamila, Shia Online, etc.)...
Contagion in the Corpus: The Black Death and Where to Find It
“How can I bear to pair fair words in rhyme
The New OpenITI Metadata Search
The OpenITI corpus was designed in a way that makes it easy for scripts to access, identify and analyse the texts in the corpus. As a human reader, it was un...
Tracking Traditions: Identifying Isnads in the OpenITI Corpus
Due to its size and coverage, the OpenITI corpus is useful for a wide variety of research purposes. In particular, it represents an excellent opportunity to ...
When al-Tabari is Not (Just) al-Tabari: The Challenges Posed by Composite Editions in the OpenITI Corpus
In the past few months the KITAB team members have been closely studying the issue of versioning and composite editions in the OpenITI corpus. The problem of...
On Commentaries, Digressions, Transtextualities, and Rabbit Holes
Running the passim algorithm on the OpenITI corpus allows us to identify a vast number of instances of text reuse, but the quality of these results from a hi...
Judging the Differences between Arabic Text Versions Mathematically
Researchers working on historical Arabic texts have long known about transmission practices that resulted in considerable differences between what were osten...
A New Application that Helps You Find Texts in the OpenITI Corpus
The Open Islamicate Texts Initiative (OpenITI) is a multi-institutional effort to construct the first open-access machine-actionable scholarly corpus of prem...
First Open Access Release of Our Arabic Corpus
Scholars working in Arabic can now download the entire corpus used by the KITAB team through Zenodo, an Open Science platform that supports open access.
Arabic Pasts – 2018
The ‘Arabic Pasts: Histories and Historiography’ workshop was held in the new Aga Khan Centre in London on the 12th and 13th of October and featured papers t...
A First Look at KITAB’s Data
The digital revolution is arriving rather late to Middle Eastern studies, but it is coming fast.
Detecting What Authors Took from Earlier Works
With text reuse detection, we rely on the power, speed and memory of a computer to find common passages between texts.
Happy News from the ERC…..and Some Details
The European Research Council has awarded KITAB a five-year, €2 million grant that will enable us to make major progress on our research agenda.
A Tale of 3 “Versions”
Measuring variation in the early tradition