Text Mining
Recent papers in Text Mining
Natural Language Processing is a programmed approach to analyzing text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and built software that will analyze,... more
In recent years, organizations have come across complex databases due to the development of technology, the growth of databases, and the widespread use of information technologies. If the data found in line... more
Automatic understanding of domain-specific texts in order to extract useful relationships for later use is a nontrivial task. One such relationship would be between railroad accidents' causes and their corresponding descriptions in... more
The increasing number of digitized texts presently available, notably on the Web, has created an acute need for text mining techniques. Clustering systems are used more and more often in text mining, especially to analyze texts and to... more
In this paper, we analyze the relation between stock price returns and headline news. Headline news is a very important source of information in asset management and is sent in large quantities every day. We study the effect of more... more
Co-clustering has been defined as a way to organize simultaneously subsets of instances and subsets of features in order to improve the clustering of both of them. In previous work [1], we proposed an efficient co-similarity measure... more
The blogosphere is the name associated with the universe of all blog sites. A blog is a website that allows people to write about topics they want to share with others. The ease and simplicity of creating blog posts and their free form and... more
Analysis and modeling of crime text report data has important applications, including refinement of crime classifications, clustering of documents, and feature extraction for spatiotemporal forecasts. Having better neural network... more
The objective of this research is to compare the movement of stock return literature on companies in the tourism sector and other businesses. This study also comprehensively analyzed the relationship between contexts built on the literature.... more
Textual analysis has become a most important task due to the rapid increase in the number of texts that are continuously generated in several forms, such as posts and chats in social media, emails, articles, and news. The... more
Collecting and analyzing online tourist reviews on destinations is important for sustainable tourism. These analyses can give insight into the extent to which natural and cultural assets in the destination are protected. These... more
The DC Universe is a fictional universe containing a collection of superheroes and supervillains based on characters that appear in comic books by DC Comics. DC Comics itself is the largest and oldest comic book publisher that... more
In this paper, we propose a new approach for evaluating research projects and programs. According to our approach, the improvement might be achieved by adopting a results-based and a project portfolio approach, and assuring a research and... more
Named Entity Recognition (NER) has become one of the major issues in Information Retrieval (IR), knowledge extraction, and document classification. This paper addresses a particular case of NER, acronym expansion (or definition) when... more
This report provides a description of the Open Boek intelligent retrieval system version 3, and of its care and feeding. It combines the user manual and the administration guide. Finally, it provides detailed descriptions of the scripts... more
Enhancing educational corporations is a truly challenging mission due to the highly competitive nature of the business. Currently, there is an emerging trend within organizations to capitalize on their internal resources. This paper... more
Bibliometric studies have the particularity of analyzing the performance of a research field/area. The present study, following this premise, will map the works of a Postgraduate program at a Brazilian Federal University. For this... more
This paper introduces the concept of a classification tool for Web pages called WebClassify, which uses a modified naive Bayesian algorithm with a multinomial model to classify pages into various categories. The tool starts the classification... more
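For readers unfamiliar with the multinomial model mentioned above, here is a minimal sketch of a standard multinomial naive Bayes classifier with add-one (Laplace) smoothing. It is not WebClassify's modified algorithm, which the excerpt does not describe; the function names `train` and `classify` and the toy categories are assumptions made only for illustration.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Collect counts from docs, a list of (tokens, label) pairs."""
    class_counts = Counter()            # number of documents per label
    word_counts = defaultdict(Counter)  # token frequencies per label
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify(tokens, model):
    """Return the label with the highest log posterior for the tokens."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        # Log prior: share of training documents carrying this label.
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokens:
            if token not in vocab:
                continue  # ignore tokens never seen in training
            # Add-one smoothing keeps unseen (token, label) pairs non-zero.
            likelihood = (word_counts[label][token] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data, not taken from the paper.
model = train([
    (["goal", "match", "team"], "sports"),
    (["election", "vote", "party"], "politics"),
])
predicted = classify(["vote", "team", "election"], model)
```

Working in log space avoids numeric underflow when pages contain many tokens.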
The spread of English as the Lingua Franca of international communication has given rise to meaningful language contact phenomena in the world's languages, such as loanwords and pseudo-loanwords, namely, words from one language... more
In this paper, we establish Fog Index (FI) as a text filter to locate the sentences in texts that contain connected biomedical concepts of interest. To do so, we have used 24 random papers each containing four pairs of connected concepts.... more
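As a rough illustration of the Fog Index used as a sentence filter above, the sketch below computes the standard Gunning Fog formula, 0.4 × (words per sentence + 100 × complex words / words), where a complex word has three or more syllables. The syllable counter is a crude vowel-group heuristic, and the helper names are assumptions; the excerpt does not say how the authors computed the index or which scores they treated as interesting, so the sketch only scores sentences.

```python
import re


def count_syllables(word):
    """Estimate syllables by counting vowel groups (heuristic, an assumption)."""
    lowered = word.lower()
    count = len(re.findall(r"[aeiouy]+", lowered))
    # Treat a trailing silent "e" as non-syllabic, except in "-le" endings.
    if lowered.endswith("e") and not lowered.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def fog_index(text):
    """Gunning Fog Index of a text; returns 0.0 for empty input."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        return 0.0
    complex_words = [w for w in words if count_syllables(w) >= 3]
    return 0.4 * (len(words) / len(sentences)
                  + 100 * len(complex_words) / len(words))


def score_sentences(text):
    """Return (fog_index, sentence) pairs, one per sentence in the text."""
    sentences = [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]
    return [(fog_index(s + "."), s) for s in sentences]


# Hypothetical example text, not drawn from the 24 papers in the study.
scores = score_sentences("The cat sat. Interleukin regulates inflammation.")
```

A filter would then keep sentences whose score falls on the chosen side of a threshold; that threshold is a tuning choice the excerpt does not specify.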
Social media data is increasingly used to gain insights into trends in mental health, but prior studies aimed at confirming a link between online expression of suicidal ideation on social media and actual suicide deaths have been... more
This study used the data of questionnaires conducted by a local TV program on child rearing to analyze the worries and needs of parents raising children in Yamanashi prefecture. Data from questionnaires conducted by the program for every... more
Most company products and services depend on user experience. Collecting feedback from each user is difficult, but social platforms reduce this difficulty because people provide this information in the form of comments. This paper works in this domain... more
With the growth of distance education (EAD), online discussion forums have become an important instrument in the teaching-learning process. However, as forum discussions grow, it becomes difficult for... more
An important task in text mining is finding the dominant topics (and their associated documents) in a collection of documents. While traditional clustering techniques, e.g., hierarchical clustering and K-means, are often used for this... more
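Beta-endorphin is a neurotransmitter involved in functions such as enhancement of the immune system, deceleration of cancer cell growth, and induction of euphoria and relaxation. It is an opioid-like neuropeptide synthesized in neurons of... more
We present a proposed model for designing mobile learning strategies, developed in 2012 by researchers from the Proventus group at the Universidad de La Sabana, within the framework of a research project that has... more
In the finance domain, it is indispensable to obtain and analyze information as quickly as possible. Analyst reports are one of the important sources of information in asset management, and they include a large amount of text. However,... more
Information about women of the Five Dynasties is found either in historical works compiled by later generations or in epitaphs written by contemporaries. This paper attempts to place the available sources back into the social and cultural structures in which their authors lived, to analyze the characteristics of the messages the sources carry, and to explore the structural relationships and period significance behind the differences among individual sources. The aim is to clarify the impressions and expectations that people of the time, and later historians, held of women, and to look through the texts to grasp how women of the Five Dynasties period actually conducted themselves.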
BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS... more
The Experience of the Golf Practice Class with Lodging and the Change of Life Skills through It. The purpose of this study is to examine the experience of the golf practice class with lodging and the longitudinal change of life skills... more
This study empirically evaluates the effectiveness of different feature types for the classification of the first language of an author. In particular, it examines the utility of psycholinguistic features, extracted by the Linguistic... more
Analyzing stock market trends and sentiment is an interdisciplinary area of research being undertaken by many disciplines such as Finance, Computer Science, Statistics, and Economics. It has been well established that real time news plays... more
This paper describes a text mining tool that performs two tasks, namely document clustering and text summarization. These tasks have, of course, their corresponding counterpart in "conventional" data mining. However, the textual,... more
A fully automated, self-driving car can perceive its environment, determine the optimal route, and drive unaided by human intervention for the entire journey. Connected autonomous vehicles (CAVs) have the potential to drastically reduce... more
Metric and kernel learning are important in several machine learning applications. However, most existing metric learning algorithms are limited to learning metrics over low-dimensional data, while existing kernel learning algorithms are... more
User satisfaction with educational institutions, health services, or other businesses is often measured using questionnaires with a graded scoring system. Generally, the many comments, pieces of advice, and suggestions that accompany the questionnaire are not... more
The Indonesian tourism industry continues to develop and has become the core of the nation's economy. Indonesia is known for its wealth of natural beauty, which can be used as a potential for tourism business. Garut is a city in Indonesia... more
In the present state of the digital world, computers do not understand ordinary human language. This is a great barrier between humans and digital systems. Hence, researchers found an advanced technology that provides... more
Text mining is the process of discovering information in large text collections, and automatically identifying interesting patterns and relationships in textual data. It is a relatively new research area, which has recently raised much... more