AUTHOR=Palmero Aprosio Alessio, Tonelli Sara, Menini Stefano, Moretti Giovanni TITLE=Using Semantic Linking to Understand Persons’ Networks Extracted from Text JOURNAL=Frontiers in Digital Humanities VOLUME=4 YEAR=2017 URL=https://www.frontiersin.org/journals/digital-humanities/articles/10.3389/fdigh.2017.00022 DOI=10.3389/fdigh.2017.00022 ISSN=2297-2668 ABSTRACT=
In this work, we describe a methodology for interpreting large persons’ networks extracted from text by classifying cliques using the DBpedia ontology. The approach combines NLP, Semantic Web technologies, and network analysis. The classification methodology, which starts from single nodes and then generalizes to cliques, performs well and can also handle nodes that are not linked to Wikipedia. The manually developed gold standard used for evaluation shows that, in most cases, groups of co-occurring entities share a category that can be assigned automatically. This holds for both languages considered in this study. The outcome of this work may help enhance the readability of large networks and provide an additional semantic layer on top of cliques, which would greatly help humanities scholars dealing with large amounts of textual data that need to be interpreted or categorized. Furthermore, it represents an unsupervised approach to automatically extending DBpedia starting from a corpus.
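To make the node-to-clique idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes a toy co-occurrence graph, a hand-written slice of a class hierarchy that stands in for the DBpedia ontology, and a hypothetical voting rule: a clique receives the most specific class shared by at least `min_share` of its linked nodes. The node names, the `PARENT` table, and the `min_share` and `min_size` parameters are all illustrative assumptions. Nodes without a class (not linked to Wikipedia) do not vote, but they are covered by the label of the clique they belong to.

```python
from collections import Counter

# Illustrative slice of a class hierarchy (child -> parent); an assumption,
# standing in for the DBpedia ontology rather than reproducing it.
PARENT = {
    "Politician": "Person",
    "Scientist": "Person",
    "Artist": "Person",
    "MusicalArtist": "Artist",
    "Person": None,
}

# Hypothetical co-occurrence edges between persons mentioned in a corpus.
EDGES = [
    ("a", "b"), ("a", "c"), ("b", "c"),  # clique {a, b, c}
    ("c", "d"),                          # bridge between the two groups
    ("d", "e"), ("d", "f"), ("e", "f"),  # clique {d, e, f}
]

# Class assigned to each single node; None marks a node with no Wikipedia link.
NODE_CLASS = {
    "a": "Politician", "b": "Politician", "c": None,
    "d": "MusicalArtist", "e": "Artist", "f": "Scientist",
}


def ancestors(cls):
    """Return cls followed by its superclasses, most specific first."""
    chain = []
    while cls is not None:
        chain.append(cls)
        cls = PARENT.get(cls)
    return chain


def build_adjacency(edges):
    """Turn an undirected edge list into a node -> neighbour-set mapping."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def maximal_cliques(adj):
    """Bron-Kerbosch without pivoting: yield each maximal clique as a frozenset."""
    def expand(r, p, x):
        if not p and not x:
            yield frozenset(r)
            return
        for v in list(p):
            yield from expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}
    yield from expand(set(), set(adj), set())


def classify_clique(clique, node_class, min_share=0.6):
    """Most specific class covering at least min_share of the linked nodes."""
    linked = [node_class[n] for n in clique if node_class.get(n)]
    if not linked:
        return None
    # Each linked node votes for its own class and for every superclass.
    counts = Counter(c for cls in linked for c in ancestors(cls))
    eligible = [c for c, k in counts.items() if k / len(linked) >= min_share]
    # A longer ancestor chain means a deeper, more specific class.
    return max(eligible, key=lambda c: (len(ancestors(c)), counts[c], c),
               default=None)


def label_cliques(edges, node_class, min_size=3, min_share=0.6):
    """Label every maximal clique that has at least min_size members."""
    adj = build_adjacency(edges)
    return {
        tuple(sorted(c)): classify_clique(c, node_class, min_share)
        for c in maximal_cliques(adj)
        if len(c) >= min_size
    }


if __name__ == "__main__":
    for members, label in sorted(label_cliques(EDGES, NODE_CLASS).items()):
        print(members, "->", label)
```

In this toy setup, the unlinked node `c` ends up in a clique labelled from its two linked neighbours, and the mixed clique falls back to a broader class once no specific class reaches the threshold. A lower `min_share` favours specific labels, a higher one favours general ones; which trade-off suits a real corpus is an empirical question that this sketch does not answer.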