| Literature DB >> 35264762 |
Yannis Assael1, Thea Sommerschield2,3, Brendan Shillingford4, Mahyar Bordbar4, John Pavlopoulos5, Marita Chatzipanagiotou5, Ion Androutsopoulos5, Jonathan Prag6, Nando de Freitas4.
Abstract
Ancient history relies on disciplines such as epigraphy-the study of inscribed texts known as inscriptions-for evidence of the thought, language, society and history of past civilizations1. However, over the centuries, many inscriptions have been damaged to the point of illegibility, transported far from their original location and their date of writing is steeped in uncertainty. Here we present Ithaca, a deep neural network for the textual restoration, geographical attribution and chronological attribution of ancient Greek inscriptions. Ithaca is designed to assist and expand the historian's workflow. The architecture of Ithaca focuses on collaboration, decision support and interpretability. While Ithaca alone achieves 62% accuracy when restoring damaged texts, the use of Ithaca by historians improved their accuracy from 25% to 72%, confirming the synergistic effect of this research tool. Ithaca can attribute inscriptions to their original location with an accuracy of 71% and can date them to less than 30 years of their ground-truth ranges, redating key texts of Classical Athens and contributing to topical debates in ancient history. This research shows how models such as Ithaca can unlock the cooperative potential between artificial intelligence and historians, transformationally impacting the way that we study and write about one of the most important periods in human history.Entities:
Mesh:
Year: 2022 PMID: 35264762 PMCID: PMC8907065 DOI: 10.1038/s41586-022-04448-z
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 69.504
Fig. 1Restoration of a damaged inscription.
This inscription (Inscriptiones Graecae, volume 1, edition 3, document 4, face B (IG I3 4B)) records a decree concerning the Acropolis of Athens and dates to 485/4 bc. Marsyas, Epigraphic Museum, WikiMedia CC BY 2.5.
Fig. 2Ithaca’s architecture processing the phrase ‘δήμο το αθηναίων’ (‘the people of Athens’).
The first three characters of the phrase were hidden and their restoration is proposed. In tandem, Ithaca also predicts the inscription’s region and date.
Fig. 3Ithaca’s outputs.
a, Restoration predictions for six missing characters (dashes) in an Athenian inscription (IG II² 116). The top restoration, in green, is correct (συμμαχία, ‘alliance’). Note how the following hypotheses (ἐκκλησία, ‘assembly’; and προξενία, ‘treaty between state and foreigner’), highlighted in red, typically occur in Athenian political decrees[23], revealing Ithaca’s receptivity to context. b, Geographical attribution of an inscription from Amorgos (IG XII 7, 2). Ithaca’s top prediction is correct, and the closest predictions are neighbouring regions. c, Date distribution for an inscription from Delos (IG XI 4, 579). The ground-truth date interval 300–250 bc is shown in grey; Ithaca’s predicted distribution is shown in yellow and has a mean at 273 bc (green). Ithaca’s predictions show a higher confidence for the interval’s higher date margin, therefore potentially narrowing the broad ground-truth dating bracket. d, Chronological attribution saliency map for an Athenian inscription (IG I³ 371). The colour intensity illustrates the importance of each input. Ithaca focuses on the personal name (Νικίας, ‘Nikias’) and the Greek commanders’ rank (στρατεγοίς, ‘generals’). Nikias had a key role in the great Athenian expedition to Sicily[24–26], the historical event to which this very inscription pertains. Ithaca dates the inscription to 413 bc, matching the exact range proposed by historians (414–413 bc).
Extended Data Fig. 5Restoration and geographical attribution saliency maps.
(a) The decree (IG II² 116) from the Acropolis of Athens recording an alliance between the Athenians and the Thessalian federation (360/1 bc). At each step of the restoration of the missing word “alliance” (συμμαχία), Ithaca is clearly attending to the contextually important words “Athenians” (ʻAθηναίων) and “Thessalians” (Θετταλων). (b) The manumission inscription (BCH 66/67 (1942/3) 82,9) is correctly attributed to the Delphi region (left), and the generated saliency map (right) highlights words correlated to high accuracy predictions from the word statistics table.
Experimental results
| Restoration | Region | Date | ||||
|---|---|---|---|---|---|---|
| Method | CER (%) | Top 1 (%) | Top 20 (%) | Top 1 (%) | Top 3 (%) | Years |
| Ancient historian and Ithaca | ||||||
| Ithaca | 26.3 | 61.8 | 78.3 | |||
| Pythia | 47.0 | 32.6 | 53.9 | |||
| Ancient historian | 59.6 | 25.3 | ||||
| Onomastics | 21.2 | 26.5 | 144.4 | |||
Evaluating methods for text restoration, geographical attribution (region) and chronological attribution (date) on I.PHI’s test set of n = 7,811 inscriptions. For ‘CER’ and ‘years’, lower scores are better. For ‘top 1’, ‘top 3’ and ‘top 20’, higher scores are better. For each metric, the best performing method is in bold.
Extended Data Fig. 1Raw and processed PHI inscription, text and metadata.
A fragmentary early-fifth century sacrificial calendar from the Acropolis of Athens (IG I3 234), face A, lines 10-23. In (a) transcription of the inscription text as it currently appears in PHI; (b) the same text’s processed rendition in I.PHI; (c) the unprocessed metadata of this inscription as it currently appears in the PHI dataset; (d) the processed metadata rendition in I.PHI.
Extended Data Fig. 2Geographical distribution of Greek inscriptions in I.PHI.
Each red circle represents a region across the ancient Mediterranean world (84 in total), the circle size is directly proportional to the number of inscriptions found in that region (total inscriptions in I.PHI n = 78,608).
Dataset statistics for the size of the I.PHI corpus
Dataset statistics for the size of the I.PHI corpus
Extended Data Fig. 3Comparison between Ithaca and the onomastics baseline’s chronological predictions.
The box plot shows the median and the mean distance between the predicted date and the ground-truth time interval, measured in years using the chronological distance metric (see Methods). In this plot, the bounds of the boxes are defined by the first and the third quartiles, and the whiskers by the minimum and maximum values. Ithaca’s mean distance is 2.2x lower than that of the onomastics baseline. Ithaca’s average prediction loss was 29.3 years from the ground-truth interval, while the median prediction loss was only 3 years. The onomastics baseline consists of n = 142 attributions provided by the human annotators.
Extended Data Fig. 4Restoration performance comparison.
(a) The original inscription (IG II² 116) has 378 missing characters. (b) The restorations of the missing characters proposed in the authoritative edition by Rhodes - Osborne 2003 for this text[88], and which we use as ground truths in our evaluation. (c) Pythia’s restoration shows 74 mismatches with the Rhodes-Osborne edition, while (d) Ithaca’s shows only 45. Correct restorations are highlighted in green, incorrect ones in red.
Word statistics for geographical attribution
Word statistics for geographical attribution
To discover underlying patterns in Ithaca’s predictions, we compute statistics to track the words that appear most frequently (“frequency”) in texts whose region Ithaca predicts correctly (“accuracy”). For each word of the test set, we compute an average accuracy, and a frequency of appearance. This visualization is intended to evaluate whether the occurrence of particular words could be correlated to the model’s geographical attributions.
Downdating Athenian decrees with Ithaca
Downdating Athenian decrees with Ithaca
List of disputed Classical Athenian decrees (including their IG[3] edition number), their dates as listed in PHI (which follow the conventional dates proposed by Meiggs - Lewis 1969103 and correspond to the dates in the IG[3] editions of the decrees) based on the conventional ‘three-bar-sigma’ dating criterion, and their recent dating re-evaluations[28]. Ithaca’s prediction mean is listed in column 5. The last two columns represent the distance (in years) of the PHI dates and Ithaca’s predictions from the recent dating re-evaluations. The colour intensity reflects the distance in years, with stronger intensity reflecting a farther distance. As can be seen, Ithaca’s predictions result in an average distance of 5 years, which is 22 years closer to the re-evaluated dates, compared to PHI’s conventional dates.
PHI IDs of the inscriptions excluded from training: 10, 11, 14, 17, 18, 19, 27, 28, 32, 34, 37, 39, 40, 41, 42, 46, 1682; additional PHI IDs for new editions, newly discovered or published sections and doubles of the decrees: 293752, 294468, 229647, 291317, 232697, 293754, 1675, 1676, 1677, 1678, 1679, 1680, 1681, 291118, 292366, 291960, 346490, 292187, 291318, 291321, 292189, 293756, 232710, 291322, 293327, 292194.
Extended Data Fig. 6PHI vs. Ithaca’s dating distance in years for disputed Athenian decrees.
The box plot shows the median and the mean of the distribution, the bounds of the boxes are defined by the first and the third quartiles and the whiskers by the minimum and maximum values of n = 21 inscriptions. Ithaca’s chronological predictions (average distance of 5 years from the modern “lower” ground truth) compared to PHI meta-data for time intervals (older estimates, average distance of 27 years from the modern ground truth). Lower distance in years is better. Exploiting the features of our full dataset, Ithaca’s predictions are better and closer to modern re-evaluations compared to the original PHI ground-truth dates. The latter reflect the dates assigned by the published editions which PHI is reporting, and which almost all reflect the old three-bar sigma dating. We refer the reader to Extended Data Table 3 for detailed results.
Extended Data Fig. 7Chalcis decree (IG I3 40).
The inscription records an oath of allegiance sworn by the city of Chalcis to Athens. It has been traditionally dated to 446/5 bc based on the 3-bar sigma criterion[28], but was more recently redated to 424/3 bc[99]. Photograph by kind concession of the Acropolis Museum. Acrop. 6509 © Acropolis Museum (photo: Socratis Mavrommatis).