Information Extraction
Recent papers in Information Extraction
The increasing popularity of the social networking service, Twitter, has made it more involved in day-to-day communications, strengthening social relationships and information dissemination. Conversations on Twitter are now being explored... more
Natural Language Processing is a programmed approach to analyzing text that is based on both a set of theories and a set of technologies. This forum aims to bring together researchers who have designed and built software that will analyze,... more
Use of InSAR techniques in the study of unstable slopes has been suggested in recent works. However, in the case of mass movements, which typically occur in high-relief terrain and are of limited areal extent, the detection of ground... more
In this article we describe the joint effort of experts in linguistics, information extraction and risk assessment to integrate EventSpotter, an automatic event extraction engine, into ADAC, an automated early warning system. By detecting... more
It is a fascinating subject to explore how well we can understand the processes of life on the basis of fundamental laws of physics. It is emphasised that viewing biological processes as manipulation of information extracts their... more
In this paper, a novel approach for building synopses is proposed by using a service and message-oriented architecture. The SaintEtiQ summarization system initially designed for very large stored databases, by its intrinsic features, is... more
Investors, before making an investment decision, rely on many sources of information, including the Web, which in current times has become an important means of mass production and dissemination of information for the financial/stock... more
Official travel warnings published regularly on the internet by the ministries for foreign affairs of France, Germany, and the UK provide a useful resource for assessing the risks associated with travelling to some countries. The shallow... more
The evaluation of the quality of ontological classification is an important part of semantic web technology. Because this area is under constant development, it requires improvement and standardisation. This paper discusses existing... more
Periodicity mining is used for predicting trends in time series data. Discovering the rate at which the time series is periodic has always been an obstacle for fully automated periodicity mining. Existing periodicity mining algorithms... more
This paper presents the insights gained from the use of data mining and multivariate statistical techniques to identify important factors associated with a country's competitiveness and the development of knowledge discovery in databases... more
Malicious executables, programs designed to infiltrate or damage a computer system without the owner's consent, have become a serious threat to the security of computer systems. There is an urgent need for effective techniques to... more
The first objective of this study is to consider ground surface displacements that occurred due to Philippine Sea Plate movements in the area adjacent to Huatung Valley in Taiwan. In order to clarify the activity of crustal deformation in... more
Most information extraction systems focus on the textual content of the documents. They treat documents as sequences of words, disregarding the physical and typographical layout of the information. While this strategy helps in focusing... more
The rapidly increasing use of large-scale data on the Web has made named entity disambiguation one of the main challenges for research in Information Extraction and the development of the Semantic Web. This paper presents a novel method for... more
Interest in information extraction from the biomedical literature is motivated by the need to speed up the creation of structured databases representing the latest scientific knowledge about specific objects, such as proteins and genes.... more
An automatic method for generating assembly instructions using CAD files is presented in this paper. Algorithms for extracting geometrical information of objects stored in a non-proprietary format, ISO-10303, STEP-CAD data file are... more
A ground-level semantic map is obtained by a mobile robot equipped with an omnidirectional camera, differential GPS and a laser range finder. The mobile robot uses a virtual sensor for building detection (based on omnidirectional images)... more
The paper provides an OWL ontology for legal cases with an instantiation of the legal case Popov v. Hayashi. The ontology makes explicit the conceptual knowledge of the legal case domain, supports reasoning about the domain, and can be... more
This paper illustrates a system designed to automatically extract semantic annotations of the normative modifications present in legal texts. The work relies on a deep parsing approach. The problem of semantically annotating legal texts... more
This paper presents a set of tools that were developed in order to facilitate and speed up the process of building information extraction and retrieval systems for documents that exhibit a set of predefined characteristics. Specifically,... more
The health care industry is currently undergoing a huge expansion in several respects. Advances in Clinical Informatics (CI) are an important part of this expansion process. One of the goals of CI is to apply Information Technology for... more
This paper presents a system for automatically extracting linguistic data from digitized linguistic documents using a combination of existing software packages and custom scripts. The system is designed to leverage existing resources in... more
Due to the continuous and rampant increase in the size of domain specific data sources, there is a real and sustained need for fast processing in time-sensitive applications, such as medical record information extraction at the point of... more
A huge number of informal messages are posted every day in social network sites, blogs and discussion forums. Emotions seem to be frequently important in these texts for expressing friendship, showing social support or as part of online... more
An overview of channel coding techniques for data hiding in still images is presented. Use of codes is helpful in reducing the bit error probability of the decoded hidden information, thus increasing the reliability of the system. First,... more
A distance-based outlier detection method that finds the top outliers in an unlabeled data set and provides a subset of it, called outlier detection solving set, that can be used to predict the outlierness of new unseen objects, is... more
Our KNOWITALL system aims to automate the tedious process of extracting large collections of facts (e.g., names of scientists or politicians) from the Web in an autonomous, domain-independent, and scalable manner. In its first major run,... more
Because of the increasing complexity of products and the design process, as well as the popularity of computer-aided documentation tools, the number of electronic and textual design documents being generated has exploded. The availability... more
A number of Learning Management Systems (LMSs) exist on the market today. One part of an LMS is the component in which student assessment is managed. In some forms of assessment, such as open questions, the LMS is incapable of evaluating... more
The paper presents some contemporary approaches to spatial environmental data analysis. The main topics are concentrated on the decision-oriented problems of environmental spatial data mining and modeling: valorization and... more
Information extraction from high spatial resolution imagery is sometimes hampered by the limited number of spectral channels available from these systems. Standard supervised classification algorithms found in commercial software packages... more
To map water levels related to large floods, we propose using geographical information systems to manage the vast amount of information extracted from aerial photographs. Our approach is divided into three parts: (1) segmentation of the... more
Digital social platforms increase the importance of images in daily life. Hence the privacy of content plays an important role once content goes live, so data owners add a signature in the form of a watermark to claim ownership. This work has... more
Most of the existing data mining approaches to time series prediction use as training data an embed of the most recent values of the time series, following the traditional linear auto-regressive methodologies. However, in many time series... more
One of the most accurate methods in Question Answering (QA) uses off-line information extraction to find answers for frequently asked questions. It requires automatic extraction from text of all relation instances for relations that users... more
Satire is an attractive subject in deception detection research: it is a type of deception that intentionally incorporates cues revealing its own deceptiveness. Whereas other types of fabrications aim to instill a false sense of truth in... more
Information Extraction (IE) is an important research field within the Artificial Intelligence community, for it tries to extract relevant information out of vast amounts of data. In this paper, we propose an extraction method that... more
The problem of constructing a tight isothetic outer (or inner) polygon covering an arbitrarily shaped 2D object on a background grid, is addressed in this paper, and a novel algorithm is proposed. Such covers have many applications to... more
Background: Persistent high levels of aggressive, oppositional and impulsive behaviours, in the early lives of children, are significant risk factors for adolescent and adult antisocial behaviour and criminal activity. If the disruptive... more