Different solutions are offered today for modelling multilingual terminological data. In this article, we focus on the description of two approaches: on the one hand, the model proposed in the context of ISO TC 37/SC 3, based on the adoption of the Terminological Markup Framework/TermBase eXchange standards; on the other hand, the ‘Lemon’ model and, more generally, the Web Ontology Language adopted in the framework of the Semantic Web. The aim of this study is to propose a contrastive multilevel analysis of these two paradigms, with the ultimate goal of highlighting their divergences and convergences. The terminological case study chosen to test the two approaches is the pathology known as ‘body dysmorphic disorder’, which poses challenges in terms of conceptual and linguistic representation. Starting from the phenomena of reconceptualization and denominative variation of this disorder, we will show that the two models are based on diverging but complementary the...
This study investigates preverbation in Lithuanian, in particular its function and its effects within the verbal system. A well-known phenomenon, widely discussed in the literature, is the presence in the verbal system of many languages of lexeme pairs in which a durative, atelic verb is opposed to a telic verb indicating inchoativity or, more often, the complete exhaustion of the process. Specifically, one of the two lexemes, usually the telic one, is derived from the other by adding an affix or a preposition. Consider, for example, the English opposition eat/eat up (“to eat”/“to finish eating”) or wear/wear up (“to wear a garment”/“to wear out”), or the German pairs enden/beenden (“to end”/“to put an end to”) or schlafen/einschlafen (“to sleep”/“to fall asleep”). This phenomenon is most evident in the Slavic languages: in these, the processes for forming lexical pairs opposed in actional value have ...
The article analyses impersonal structures, especially those with experiencer predicates that denote physical processes, e.g. “gelti”, “skaudėti” or “sopėti”, “mausti”, “peršėti”, “troškinti”, “dusinti”, “pykinti”, “niežėti”. This small group of verbs features non-canonical marking: the body parts that experience pain are expressed by the accusative rather than the nominative, as might be expected. The principle is clear: the accusative, i.e. the prototypical case of the object, is naturally used to mark a non-agentive, inactive object that does not control the verbal process. In structures such as “kaip man skauda galvą”, the accusative indicates the body part directly involved in the verbal process, while the dative marks a subject that has a peripheral connection with the process and carries possessive meaning (dativus sympatheticus). The data highlighted in the analysis do not contradict the results of various studies of Indo-European languages and support those scholars who think that syntactic compositi...
In this paper we present a first version of LexO, a collaborative editor of multilingual lexica and termino-ontological resources. It is based on the lemon model and aims to support lexicographers and terminologists in their work. Although the development of LexO is still ongoing, the editor is already being used within two research projects in the field of Computational Linguistics applied to the Humanities: DiTMAO and Totus Mundus. This made it possible to test the functionalities of LexO and to demonstrate its high degree of flexibility with respect to the different extensions of the lemon model required by the scholars involved.
This article introduces the DIATERM model, devoted to representing the diachronic evolution of concepts and terms in a given domain, according to Semantic Web standards and Linked Data technologies. The approach adopted for the representation of temporal information is based on the reification of N-ary relationships. DIATERM is articulated on three levels: textual, terminological and conceptual. Each level can be affected, more or less simultaneously, by change. The use of SWRL rules makes it possible to assign temporal information automatically, thus facilitating the construction of the terminological resource and highlighting any inconsistencies. Two examples of querying and visualizing diachronic terminological resources will be illustrated. The first example is taken from the resource dedicated to the astronomical terminology introduced by Christopher Clavius in his Commentary on Sacrobosco’s Tractatus de Sphaera. The second example is taken from the electronic lexicon of Fer...
Proceedings of the Third AIUCD Annual Conference on Humanities and Their Methods in the Digital Ecosystem - AIUCD '14, 2015
This paper describes the full procedure adopted in the context of the Clavius on the Web project, which aims to help Web users appraise the importance of specific manuscripts by going beyond their digital reproduction. The proposed approach is based on the multilayered explication of linguistic, lexical and semantic data representing the innermost nature of the analyzed manuscripts. The final purpose of the project is to gather and display the results of the three layers of analysis through interactive visualization techniques and to export them as Linked Data. All the analyses rely on the XML/TEI encoding of the text, followed by a CTS-based tokenization. As a working example for this paper, the analysis of a portion of a manuscript provided by the Historical Archives of the Pontifical Gregorian University will be illustrated. The text is a letter written in Latin and sent by Botvitus Nericius to Christophorus Clavius in 1598 from Madrid.
English. In this work, we present an experiment in the modeling of a diachronic termino-ontological resource named CLAVIUS through both the N-ary relations model and the 4D-fluents approach. Some of the salient differences between these two models are discussed. The overall objective of this research is to illustrate the main advantages and disadvantages of adopting a given model to build diachronic resources. Italian. In this work, we illustrate an experiment in modeling a diachronic termino-ontological resource (CLAVIUS) according to two approaches, the N-ary one and the 4D-fluents one. The salient differences between the two approaches will be presented and discussed. The general objective of the research introduced here is to show the main advantages and disadvantages that adopting a given model may entail in the modeling of diachronic resources. Introduction. Pànta rei is the famous expression attributed by Plato to Heraclitus. Everything is subject to ...
Abstract. In the context of the digitization of manuscripts, transcription and annotation are often distinct, sequential steps. This can lead to difficulties in improving the transcribed text once annotations have already been defined. In order to avoid this, we devised an approach which merges the two steps into a single process. Text Encoder and Annotator (TEA) is a prototype application embracing this concept. TEA is based on a lightweight language syntax which annotates text using Semantic Web technologies. Our approach is currently being developed within the Clavius on the Web project, devoted to studying the manuscripts of Christophorus Clavius, an influential 16th-century mathematician and astronomer.
The aim of this article is to present a model for representing in an explicit and formal way the diachronic evolution of concepts and terms in a given domain, so that this formalization can be machine-actionable. The approach we propose here is based on Semantic Web technologies in order to guarantee interoperability and reuse across scientific communities of diachronic terminological resources, which can thus be easily accessed, interconnected and mutually enriched. More specifically, the dynamic evolution of terms and concepts was represented in OWL using the N-ary relations mechanism. In addition, a set of SWRL rules was defined in order to automatically identify the evolution of the concepts evoked within a text, as well as the terms representing these concepts. Our model was adopted to formally represent diachronic aspects of Saussure’s terminology as they emerge from his works. An example will be provided to highlight the potential of such a knowledge structurat...
In this work, we describe the modelling of a diachronic termino-ontological resource, named CLAVIUS, representing the evolution of astronomical concepts and theories from antiquity until the dawn of the modern age. The resource was built by means of existing tools that allow scholars to formalize knowledge even if they are not familiar with the models and languages underlying the representation. More specifically, two tools were used: Protégé, a free open-source ontology editor which supports OWL (and OWL 2), and Chronos, a plug-in for Protégé that manages temporal aspects. A rough evaluation of the resource is provided by means of a controlled natural language interface, which enables scholars to answer a set of salient queries defined by our domain expert.
In the last few years the number of manuscripts digitized and made available on the Web has been constantly increasing. However, there is still a considerable lack of results concerning both the explicitation of their content and the tools developed to make it available. The objective of the Clavius on the Web project is to develop a Web platform exposing a selection of Christophorus Clavius’s letters along with three different levels of analysis: linguistic, lexical and semantic. The multilayered annotation of the corpus involves an XML-TEI encoding followed by a tokenization step in which each token is univocally identified through a CTS URN notation and then associated with a part of speech and a lemma. The text is lexically and semantically annotated on the basis of a lexicon and a domain ontology, the former structuring the most relevant terms occurring in the text and the latter representing the domain entities of interest (e.g. people, places, etc.). Moreover, each entity is connecte...
In the framework of the Italian project ‘For a digital edition of Ferdinand de Saussure's manuscripts’, an electronic thesaurus of Saussure’s terminology is being built, which includes new terms extracted from recently found manuscripts. The lexical model on which it is grounded is a customized version of the SIMPLE model. In this paper, an overview of the customization process is provided, with a special focus on the steps taken for designing a domain-specific ontology as well as on the creation of additional semantic relations and features. Lexical entries are illustrated and the potential of a structured organization of semantic knowledge for gaining a wider understanding of the overall domain terminology is highlighted.
The poster is related to “Languages and Cultures of Ancient Italy. Historical Linguistics and Digital Models” (https://www.prin-italia-antica.unifi.it), a project funded by the Italian Ministry of University and Research in 2020 and involving the Universities of Venice and Florence, and the Institute for Computational Linguistics - Italian National Council of Research. The project will investigate the cultures of ancient Italy on the basis of their linguistic documentation, which consists solely of epigraphic evidence, by means of computational tools specifically developed for this purpose.
Poster presented at Epigraphy.info Workshop V (Leuven, November 3rd-6th, 2020)
Papers by Silvia Piccini