Wordnet Research Papers

Most research in text classification to date has used a bag of words representation in which each feature corresponds to a single word. This paper examines some alternative ways to represent text based on syntactic and semantic... more

Bookmark
Download
- by Sam Scott
- •
- 17
  Natural Language Processing, Representations, Learning, Text Classification

Identifying semantic expressions (so-called concept strings (CSs)) in multilingual corpora is an important NLP task, as it allows web search engines to deﬁne and perform semantic queries over large collection of documents. Existing web... more

A number of Learning Management Systems (LMSs) exist on the market today. A subset of a LMS is the component in which student assessment is managed. In some forms of assessment, such as open questions, the LMS is incapable of evaluating... more

Bookmark
Download
- by Maiga Chang
- •
- 9
  Information Retrieval, Natural Language Processing, Data Mining, Semantics

This study adopts a lexicon-based approach to address violence on social media. It uses FrameNet 1.7 (fn) and WordNet 3.1 (wn) to build a hierarchical domain-specific language resource of violence. The proposed lexicon tethers fn’s... more

This paper is a contribution to the discussion on compiling computational lexical resources from conventional dictionaries. It describes the theoretical as well as practical problems that are encountered when reusing a conventional... more

Bookmark
Download
- by Henrik Lorentzen and +1
  Lars Trap-Jensen
- •
- 9
  Cognitive Science, Computational Linguistics, Lexical Semantics, Language Resources

Bookmark
Download
- by Lynne Cameron
- •
- 63
  Psychology, Cognitive Psychology, Cognitive Science, Social Psychology

In education, the use of electronic (E) examination systems is not a novel idea, as E-examination systems have been used to conduct objective assessments for the last few years. This research deals with randomly designed E-examinations... more

This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the... more

WordNet is an online lexical resource which expresses unique concepts in a language. English WordNet is the first WordNet which was developed at Princeton University. Over a period of time, many language WordNets were developed by various... more

Bookmark
Download
- by Hanumant Redkar and +1
  Diptesh Kanojia
- •
- 5
  Artificial Intelligence, Knowledge Representation, Database Management Systems, Wordnet

This paper describes the process of creation and review of a new lexico-semantic resource for the classical studies: AncientGreekWordNet. The candidate sets of synonyms (synsets) are extracted from Greek-English dictionaries, on the... more

The paper presents the latest release of the Polish WordNet, namely plWord-Net 4.1. The most significant developments since 3.0 version include new relations for nouns and verbs, mapping semantic role-relations from the va-lency lexicon... more

Sea la oración Tengo dos sobres de sobra. Para comprenderla, un hablante de español deberá llevar a cabo un buen número de procesos lingüísticos, por fortuna sin ser muy consciente de ello. Sólo para dar con el signiÞ cado correcto de... more

In this paper an attempt is made to study and analyze the prototype model for developing a WordNet for Dogri language, and describes its specific characteristics and properties to develop WordNet. In terms of morphological and syntactic... more

Bookmark
Download
- by Rakesh Goswami
- •
- 5
  Information Retrieval, NLP, Wordnet, Stemmer

Genel anlamda sözlüklerin içeriği üç temel bileşenden oluşur: tanımlanan sözlükbirimler, tanımlar ve sözlükbirimler arasındaki sözcüksel-anlamsal ilişkiler. Bunlardan üçüncüsü, sözlüklerin kullanıma yönelik boyutunun ötesinde, söz konusu... more

Genel anlamda sözlüklerin içeriği üç temel bileşenden oluşur: tanımlanan sözlükbirimler, tanımlar ve sözlükbirimler arasındaki sözcüksel-anlamsal ilişkiler. Bunlardan üçüncüsü, sözlüklerin kullanıma yönelik boyutunun ötesinde, söz konusu dilin sözlükçesinin önemli bir bölümüdür. Geleneksel çerçevede yapısal anlam ilişkileri ulamında yer alan sözcüksel-anlamsal ilişkiler Princeton Üniversitesi'nde C.Felbaum tarafından geliştirilen WordNet projesi ile yeni bir görünüm kazanmıştır. WordNet, sözcüklerin başta eş-anlam kümeleri (synset) olmak üzere karşıt anlamlılık, sözcük ailesi, parça-bütün ilişkisi ve alt/üst anlamlılık ilişkileri içinde sunulduğu bir elektronik sözlük projesidir. Dünyada tek dilli ve çok dilli olmak üzere 75 civarında sözcük ağı projesi bulunmasına karşın, bazı yarıda kalmış girişimler ve Türkçe sözcük ağının yazılımsal araçlarının geliştirilmesine yönelik çalışmalar dışında Türkçe için böyle bir çalışma yapılmamıştır. Günümüzde bilgisayar teknolojisinin doğal dil çalışmalarındaki en etkili kullanım alanlarından biri olan sözcük ağları, bilgiişlem aşamasından önce ilgili dilin sözcüklerine ilişkin yapılandırılmış bir sözlüksel veri tabanını gerektirmektedir. Eş anlamlılık, karşıt anlamlılık ve alt/üst anlamlılığı içeren dizisel ilişkilerin yanı sıra, eş-dizimlilik ve anlam ezgisi gibi dizimsel ilişkiler de sözlüksel veri tabanlarını oluşturur. Bu çalışmada Türkçenin sözcük ağının oluşturabilmesi için gerekli olan sözlüksel veri tabanının alt/üst anlamlılık bileşeni üzerinde durulmakta ve Güncel Türkçe Sözlük'te (GTS) tanım içeriği olarak sunulan sıralıdüzen ilişkilerinin genel görünümünün belirlenmesi amaçlanmaktadır. Bu çerçevede ad, eylem ve sıfat türü sözcükler için sunulan sıralıdüzen ilişkileri örneklerle ve WordNet ile karşılaştırılarak incelenmekte ve sözlüğün bu açıdan yapısının betimlemesi yapılmaktadır. Ön bulgu olarak, genel kullanıma yönelik bir sözlük olduğu için GTS'de sözcüksel-anlamsal ilişkilerin belirli bir düzen içinde yer almadığı söylenebilir. Sözcük türlerinin her biri için bu açıdan farklı sorunlar söz konusudur. Bir bölümü sözlük yapısının yeniden düzenlenmesi ile aşılabilecek bu sorunların bir bölümü ise daha geniş çerçeveli çalışmaların yapılmasını gerektirmektedir. Örneğin pek çok ad türü sözcüğün uzak üst anlamlı ile tanımlanmış olması sözlük yapısını ilgilendirirken, sıfat ve eylem türü sözcüklerin üst anlamlıları konusunda hem dilbilimsel hem de varlıkbilimsel belirlemelerin yapılması gerekmektedir. Bunun yanı sıra, aynı üst anlamlı sözcük altında toplanan sözcüklerin bir eş-alt anlamlı kümesi oluşturduğu düşünüldüğünde söz konusu ilişkinin sözcük ağının oluşturulmasındaki önemi daha da belirginleşecektir.

Bookmark
Download
- by Soner Akşehirli
- •
- 4
  Lexicology, Lexicography, Semantic relations, Wordnet

Bookmark
Download
- by Shirley N . Dita
- •
- 2
  Wordnet, Tagalog

Despite being a popular language in the world, the Bengali language lacks in having a good wordnet. This restricts us to do NLP related research work in Bengali. Most of the today’s wordnets are developed by following expand... more

Bookmark
Download
- by Tahsin Hassan Rahit
- •
- 6
  Bioinformatics, Genetics, Medical Informatics, Machine Learning

A new approach to numerically measure the semantic distances between lexical units (words and collocations) based on the geometric analogies and analytical calculations, is put forward. Having considered the cases of equal and different... more

This thesis is written as part of the preliminary research for a proposed project at the Centre for Text Technology at the North-West University in Potchefstroom, North-West Province, South Africa. In this work a methodology for... more

This paper will focus on recent and near-term future developments at FrameNet (FN) and the interoperability issues they raise. We begin by discussing the current state of the Berkeley FN database including major changes in the data format... more

Bookmark
Download
- by Collin Baker
- •
- 7
  Cognitive Science, Crowdsourcing, Wordnet, Corpus

Various websites are available as source of microblogs. This is due to nature of microblogs on which people post real time messages about their attitudes on a various topics, talk about present issues, criticize, and articulate positive... more

Iconicity is a pervasive phenomenon in language that defies the Saussurean dictum of the arbitrariness of the linguistic sign, not only occurring in phenomena like sound symbolism and ideophones (e.g. Dingemanse 2012), gesture and sign... more

Iconicity is a pervasive phenomenon in language that defies the Saussurean dictum of the arbitrariness of the linguistic sign, not only occurring in phenomena like sound symbolism and ideophones (e.g. Dingemanse 2012), gesture and sign language (e.g. Herlofsky 2005), but also syntax (e.g. Haiman 1985; Van Langendonck 2007). Iconicity is best viewed as a unified notion that manifests itself very differently in different circumstances, some being highly schematical or semiotically general, others related to lower-level cultural customs. Chinese data is particularly revealing in case of the latter because of its logographic nature, but also displays cross-linguistic characteristics in case of the former. In this paper we will use the semantic domain of meteorological expressions in Chinese, with data based on dictionaries like Handian (Handian 漢典 2004) and WordNet (Hsieh & Huang 2009), to illustrate the interplay of iconic patterns on the two different levels: general conception and culture/language specific. We chose ‘weather’ as a domain because it constitutes a highly salient phenomenon that occurs across different languages and cultures (Eriksen, Kittilä & Kolehmainen 2010) and because it provided both ‘normal vocabulary’ as well as ideophones. Thus we can address two main questions in the call for papers.

As for the first question, “whether iconicity is culture-specific or semiotically general”, Chinese displays general types of iconicity found in cross-linguistic typological research, e.g. serial verbs as an iconic mapping of logical-temporal order, or a quite large inventory of ideophones, with many high-iconic imagic (in Peirce’s terminology) mappings between form and meaning. However, their usage of characters as a writing system displays many iconic properties that are absent in e.g. Latin based scripts, as has been long acknowledged in the traditional character classification (liu shu 六書), which includes a category for iconic characters that combined with other characters become indexes. A lexical field analysis of weather expressions shows that the basic level items essentially stem from five different iconic semantic radicals: imagic ones like rain (雨), sun (日), thunder (畾), cloud (云); and an indexical one wind (風). It is curious that the phonological form of most weather expressions is symbolic, displaying almost no iconicity, while the writing system does, e.g. xue 雪 ‘snow’ has a ‘rain’ radical which indicates a form of precipitation.

The second question, “whether iconicity can be combined within or across modalities” can be discussed from the perspective of weather-related Chinese ideophones. On the one hand, with ‘modalities’ referring to the senses, they seem to display a high flexibility concerning cross-modal synaesthesia, e.g. xilihuala 唏哩嘩啦 ‘to rain abundantly’, which depicts both hearing and movement (cf. Van Hoey's (2016) spinning top model). This lexical item has an imagic motivation in its phonological form. On the other hand, when ‘modalities’ refers to spoken vs. written language, ideophones like linlin 淋淋 ‘soaked’ show a different kind of iconicity – indexal, as seen in the semantic radical water (氵). It is mainly this interplay between (virtual) referent, phonological form and written image that is of interest to the topic of iconicity, since sometimes everything is linked through the phenomenon, but in other cases only one of the spoken or written form.
.

References:
Dingemanse, Mark. 2012. Advances in the cross-linguistic study of ideophones. Language and Linguistics Compass 6(10). 654–672.
Eriksen, Pål, Seppo Kittilä & Leena Kolehmainen. 2010. Linguistics of weather: cross-linguistic patterns of meteorological expressions. Studies in Language 34(3). 565–601.
Haiman, John (ed.). 1985. Iconicity in syntax: proceedings of a Symposium on Iconicity in Syntax, Stanford, June 24 - [2]6, 1983. (Typological Studies in Language 6). Amsterdam: Benjamins.
Handian 漢典. 2004. Handian 漢典 [Chinese dictionary]. http://www.zdic.net/.
Herlofsky, William J. 2005. Now you see it, now you don’t: Imagic diagrams in the spatial mapping of signed (JSL) discourse. In Costantino Maeder, Olga Fischer & William J. Herlofsky (eds.), Outside-in, inside-out, 323–348. (Iconicity in Language and Literature 4). Amsterdam ; Philadelphia: J. Benjamins Pub.
Hsieh Shukai 謝樹凱 & Huang Churen 黃居仁. 2009. Chinese WordNet (Zhongwen cihui wanglu 中文詞義網路). http://lope.linguistics.ntu.edu.tw/cwn/ .
Van Hoey, Thomas. 2016. Ideophones in Premodern Chinese: Revisiting Dingemanse’s implicational hierarchy (poster). Mimetics in Japanese and other languages in the world (日本語と世界諸言語のオノマトペ). Tachikawa: NINJAL.
Van Langendonck, Willy. 2007. Iconicity. In Dirk Geeraerts & Hubert Cuyckens (eds.), The Oxford handbook of cognitive linguistics, 394–418. (Oxford Handbooks). Oxford ; New York: Oxford University Press.

Bookmark
Download
- by Thomas Van Hoey
- •
- 11
  Cognitive Linguistics, Lexical Semantics, Ideophones, Onomatopoeia

This paper describes the process of creation and review of a new lexico-semantic resource for the classical studies: AncientGreekWord-Net. The candidate sets of synonyms (synsets) are extracted from Greek-English dictionaries, on the... more

Bookmark
Download
- by Yuri Bizzoni and +1
  Federico Boschetti
- •
- 6
  Languages and Linguistics, Multilingualism, Linguistics, Classical philology

In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more

Bookmark
Download
- by Allan Borra
- •
- 2
  Wordnet, Tagalog

We report on our ongoing effort towards developing VietWordNet, a WordNet for the Vietnamese language. We present the methodology we used, the lexical resources we employed, and the computing tools we designed to help acquiring and... more

Bookmark
Download
- by Son Dao
- •
- 6
  Ontology, Languages and Linguistics, Computational Linguistics, Linguistics

WordNet is a hierarchical information base in any language. A WordNet is implemented using indexed file system.Even though there are many languages in which we have good wordnets, Malayalam is not having an effi cient wordnet. This... more

Question Answering Systems, unlike search engines, are providing answers to the users' questions in succinct form which requires the prior knowledge of the expectation of the user. Question classification module of a Question Answering... more

Bookmark
Download
- by Santosh Ray
- •
- 12
  Cognitive Science, Wordnet, Question Answering System, Wikipedia

This paper presents Hydra for Web – a web interface for wordnets (and lexical-semantic databases with similar relational structure). Hydra for web is built on top of Hydra – an open source tool for wordnet development – and is a single... more

Sentiment classification is an ongoing field and interesting area of research because of its application in various fields collecting review from people about products and social and political events through the web. Currently, Sentiment... more

Lexical databases are invaluable sources of knowledge about words and their meanings, with numerous applications in areas like NLP, IR, and AI. We propose a methodology for the automatic construction of a large-scale multilingual lexical... more

Bookmark
Download
- by Gerard de Melo
- •
- 4
  Multilingualism, Taxonomy, Lexical Semantics, Wordnet

This paper presents the work in progress toward the creation of a family of WordNets for Sanskrit, Ancient Greek, and Latin. Building on previous attempts in the field, we elaborate these efforts bridging together WordNet relational... more

Bookmark
Download
- by Chiara Zanchi
- •
- 11
  Languages and Linguistics, Semantics, Metaphor, Cognitive Linguistics

Abstract In this paper we present a set of tools that will help developers of wordnets not only to increase the number of synsets but also to ensure their quality, thus preventing it to become obsolete too soon. We discuss where the... more

Bookmark
Download
- by Miljana Mladenovic and +1
  Jelena Mitrović
- •
- 5
  Computational Linguistics, Lexical Semantics, Wordnet, Research tools

WordNet is a crucial resource that aids in several Natural Language Processing (NLP) tasks. The WordNet development activity for 18 Indian languages has been initiated in INDIA by the IndoWordNet 1 consortium using the expansion approach... more

Capturing the sentiments and the emotional states enclosed in textual information is a critical task which embraces a wide range of web-oriented activities such as detecting the sentiments associated to the product reviews, developing... more

Bookmark
Download
- by Negin Ilkhanipour
- •
- 6
  Semantics, Persian Language, Modality, Translatology

This paper describes WordNet design and development, discussing its origins, the objectives it initially intended to reach and the subsequent use to which it has been put, the factor that has determined its structure and success. The... more

Bookmark
Download
- by Miguel Marzal and +2
  Jorge Morato
  Juan Llorens
- •
- 2
  Natural Language Processing, Wordnet

In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more

Bookmark
Download
- by Shirley N . Dita
- •
- 2
  Wordnet, Tagalog

This study seeks to provide a methodology for building and developing an Arabic WordNet for lexicography purposes. The study aims to set a clear vision of the mechanisms for building an Arabic WordNet, taking into account the nature of... more

Abstract The goal of this paper is to point out the importance of crowdsourcing and to present some of the most successful projects that are functioning on the basis of this management model that originated in the business world, but it... more

Bookmark
Download
- by Jelena Mitrović
- •
- 5
  Natural Language Processing, Crowdsourcing, Wordnet, Mechanical Turk

The paper motivates a strategy for identification and annotation of derivational relations in the Bul- garian wordnet that aims at coping with the com- plex morphology of the language in an elegant way. Our method involves transfer of the... more

Bookmark
Download
- by Ekaterina Tarpomanova and +1
  Tsvetana Dimitrova
- •
- 2
  Wordnet, Derivational Morphology

In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more

Bookmark
Download
- by Shirley N . Dita
- •
- 2
  Wordnet, Tagalog

This paper describes our work in integrating three different lexical resources: FrameNet, VerbNet, and WordNet, into a unified, richer knowledge-base, to the end of enabling more robust semantic parsing. The construction of each of these... more

Bookmark
Download
- by Rada Mihalcea
- •
- 3
  Wordnet, Knowledge base, FrameNet

In this paper we present a set of tools that will help developers of wordnets not only to increase the number of synsets but also to ensure their quality, thus preventing it to become obsolete too soon. We discuss where the dangers lay... more

Bookmark
Download
- by Jelena Mitrović and +1
  Miljana Mladenovic
- •
- 6
  Natural Language Processing, Computational Linguistics, Lexical Semantics, Wordnet

Benyújtva: 2008. március 3.; elfogadva: 2008. március 3.

Bookmark
Download
- by Gábor Prószéky
- •
- 3
  Ontology, Hungarian, Wordnet

In this paper we highlight the main challenges in building a lexical database for Kurdish, a resource-scarce and diverse language. We also report on our effort in building the first prototype of KurdNet – the Kurdish WordNet– along with a... more

Bookmark
Download
- by Purya Aliabadi
- •
- 6
  Computer Science, Information Retrieval, Kurdish Language, Wordnet

One has to mention the "eXtended WordNet" project, developed in 2003 at the University of Dallas (http://xwn.hlt.utdallas.edu/index.html). This project enhances WordNet 2.0 with a logical representation and a syntactic analysis of the... more

Bookmark
Download
- by Lucie Barque
- •
- 2
  Metonymy, Wordnet

—Old fashioned way of correcting and grading exam papers is the most stressful job of a teacher, which is why they hate it so much [13]. The monotonous task of checking several papers causes the faculties to lose interest and also make... more

Bookmark
Download
- by Mayeesha Mariam and +1
  Ali Ahmed
- •
- 6
  Computer Science, Computational Linguistics, Lexical Semantics, Automatic Text Summarization

Topic Based Vector Space Model (TVSM) proposed a new vector space that its dimensions is composed of topics. Every term and document is represented by vectors inside this vector space. By using topics as dimensions TVSM tries to overcome... more

Bookmark
Download
- by Adi Wibowo
- •
- 3
  Information Retrieval, Wordnet, Vector Space Model

We discuss a method to enhance the accuracy of a subset of the Ancient Greek WordNet based on the Homeric lexicon and the related conceptual network, by using multilingual semantic spaces built from aligned corpora.

Bookmark
Download
- by Yuri Bizzoni and +2
  Federico Boschetti
  Marianne Reboul
- •
- 3
  Computational Linguistics, Ancient Greek Language, Wordnet

In a conventional CAT (Computer Assisted Translation) system a human translator post-edits an automatically generated target language text using the keyboard. In this paper we extend a CAT system with speech input by which the translator... more

Bookmark
Download
- by aniruddha tammewar
- •
- 9
  Computational Linguistics, Machine Translation, NLP, Hindi

We present path2vec, a new approach for learning graph embeddings that relies on structural measures of pairwise node similarities. The model learns representations for nodes in a dense space that approximate a given user-defined graph... more

Wordnet

Log In