Wordnet
5,445 Followers
Recent papers in Wordnet
Most research in text classification to date has used a bag of words representation in which each feature corresponds to a single word. This paper examines some alternative ways to represent text based on syntactic and semantic... more
Identifying semantic expressions (so-called concept strings (CSs)) in multilingual corpora is an important NLP task, as it allows web search engines to define and perform semantic queries over large collection of documents. Existing web... more
A number of Learning Management Systems (LMSs) exist on the market today. A subset of a LMS is the component in which student assessment is managed. In some forms of assessment, such as open questions, the LMS is incapable of evaluating... more
This study adopts a lexicon-based approach to address violence on social media. It uses FrameNet 1.7 (fn) and WordNet 3.1 (wn) to build a hierarchical domain-specific language resource of violence. The proposed lexicon tethers fn’s... more
In education, the use of electronic (E) examination systems is not a novel idea, as E-examination systems have been used to conduct objective assessments for the last few years. This research deals with randomly designed E-examinations... more
This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the... more
This paper describes the process of creation and review of a new lexico-semantic resource for the classical studies: AncientGreekWordNet. The candidate sets of synonyms (synsets) are extracted from Greek-English dictionaries, on the... more
The paper presents the latest release of the Polish WordNet, namely plWord-Net 4.1. The most significant developments since 3.0 version include new relations for nouns and verbs, mapping semantic role-relations from the va-lency lexicon... more
Sea la oración Tengo dos sobres de sobra. Para comprenderla, un hablante de español deberá llevar a cabo un buen número de procesos lingüísticos, por fortuna sin ser muy consciente de ello. Sólo para dar con el signiÞ cado correcto de... more
In this paper an attempt is made to study and analyze the prototype model for developing a WordNet for Dogri language, and describes its specific characteristics and properties to develop WordNet. In terms of morphological and syntactic... more
Genel anlamda sözlüklerin içeriği üç temel bileşenden oluşur: tanımlanan sözlükbirimler, tanımlar ve sözlükbirimler arasındaki sözcüksel-anlamsal ilişkiler. Bunlardan üçüncüsü, sözlüklerin kullanıma yönelik boyutunun ötesinde, söz konusu... more
Despite being a popular language in the world, the Bengali language lacks in having a good wordnet. This restricts us to do NLP related research work in Bengali. Most of the today’s wordnets are developed by following expand... more
A new approach to numerically measure the semantic distances between lexical units (words and collocations) based on the geometric analogies and analytical calculations, is put forward. Having considered the cases of equal and different... more
This thesis is written as part of the preliminary research for a proposed project at the Centre for Text Technology at the North-West University in Potchefstroom, North-West Province, South Africa. In this work a methodology for... more
This paper will focus on recent and near-term future developments at FrameNet (FN) and the interoperability issues they raise. We begin by discussing the current state of the Berkeley FN database including major changes in the data format... more
Iconicity is a pervasive phenomenon in language that defies the Saussurean dictum of the arbitrariness of the linguistic sign, not only occurring in phenomena like sound symbolism and ideophones (e.g. Dingemanse 2012), gesture and sign... more
In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more
We report on our ongoing effort towards developing VietWordNet, a WordNet for the Vietnamese language. We present the methodology we used, the lexical resources we employed, and the computing tools we designed to help acquiring and... more
WordNet is a hierarchical information base in any language. A WordNet is implemented using indexed file system.Even though there are many languages in which we have good wordnets, Malayalam is not having an effi cient wordnet. This... more
Question Answering Systems, unlike search engines, are providing answers to the users' questions in succinct form which requires the prior knowledge of the expectation of the user. Question classification module of a Question Answering... more
This paper presents Hydra for Web – a web interface for wordnets (and lexical-semantic databases with similar relational structure). Hydra for web is built on top of Hydra – an open source tool for wordnet development – and is a single... more
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields collecting review from people about products and social and political events through the web. Currently, Sentiment... more
Lexical databases are invaluable sources of knowledge about words and their meanings, with numerous applications in areas like NLP, IR, and AI. We propose a methodology for the automatic construction of a large-scale multilingual lexical... more
This paper presents the work in progress toward the creation of a family of WordNets for Sanskrit, Ancient Greek, and Latin. Building on previous attempts in the field, we elaborate these efforts bridging together WordNet relational... more
WordNet is a crucial resource that aids in several Natural Language Processing (NLP) tasks. The WordNet development activity for 18 Indian languages has been initiated in INDIA by the IndoWordNet 1 consortium using the expansion approach... more
Capturing the sentiments and the emotional states enclosed in textual information is a critical task which embraces a wide range of web-oriented activities such as detecting the sentiments associated to the product reviews, developing... more
In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more
This study seeks to provide a methodology for building and developing an Arabic WordNet for lexicography purposes. The study aims to set a clear vision of the mechanisms for building an Arabic WordNet, taking into account the nature of... more
Abstract The goal of this paper is to point out the importance of crowdsourcing and to present some of the most successful projects that are functioning on the basis of this management model that originated in the business world, but it... more
In this paper, we introduce the Filipino wordnet project (FilWordNet). Filipino is the national language of the Philippines spoken by some 90 million people as their first or second language. However, it has historically had a limited... more
This paper describes our work in integrating three different lexical resources: FrameNet, VerbNet, and WordNet, into a unified, richer knowledge-base, to the end of enabling more robust semantic parsing. The construction of each of these... more
Benyújtva: 2008. március 3.; elfogadva: 2008. március 3.
In this paper we highlight the main challenges in building a lexical database for Kurdish, a resource-scarce and diverse language. We also report on our effort in building the first prototype of KurdNet – the Kurdish WordNet– along with a... more
One has to mention the "eXtended WordNet" project, developed in 2003 at the University of Dallas (http://xwn.hlt.utdallas.edu/index.html). This project enhances WordNet 2.0 with a logical representation and a syntactic analysis of the... more
Topic Based Vector Space Model (TVSM) proposed a new vector space that its dimensions is composed of topics. Every term and document is represented by vectors inside this vector space. By using topics as dimensions TVSM tries to overcome... more
In a conventional CAT (Computer Assisted Translation) system a human translator post-edits an automatically generated target language text using the keyboard. In this paper we extend a CAT system with speech input by which the translator... more