Database Integration
48 Followers
Recent papers in Database Integration
The inaugural version of the InGaP database (Integrative Gene and Protein expression database; http://www.kazusa.or.jp/ingap/index.html) is a comprehensive database of gene/protein expression profiles of 127 mKIAA genes/proteins related... more
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization, and insertion into a data warehouse. In this paper, we delve into the... more
constraint information about the real-world entities, are used to derive the missing eztended key attribute values of a tuple.
Resolving domain incompatibility among independently developed databases often involves uncertain information. DeMichiel (1989) showed that uncertain information can be generated by the mapping of conflicting attributes to a common... more
A federated database system (FDBS) is a collection of cooperating database systems that are autonomous and possibly heterogeneous. In this paper, we define a reference architecture for distributed database management systems from system... more
This study developed and implemented an improved mediator wrapper approach to addressing the challenges of integration of semantic heterogeneous databases. It employed Local as View (LaV) paradigm of database integration so as to reduce... more
Database security concerns the use of a broad range of information security controls to protect databases against compromises of their confidentiality, integrity and availability. Data integrity, in turn, is a fundamental component of... more
Claims and disputes on time extension for delays are common in construction projects. Various methods and divergent approaches are used by different parties in assessing delays. Disputes then often arise on (a) eligibility of a delay... more
In the last decades, the use of ontologies in information systems has become more and more popular in various fields, such as web technologies, database integration, multi agent systems, natural language processing, etc. Artificial... more
A database integrating 90 years of empirical studies reporting intercorrelations among rated job performance dimensions was used to test the hypothesis of a general factor in job performance. After controlling for halo error and 3 other... more
Wireless local area network (WLAN) has been widely used in many sectors. The popularity gained is due to many reasons, such as ease of installation, installation flexibility, mobility, reduced cost-of-ownership, and scalability. However,... more
Every scientist knows that research results are only as good as the data upon which the conclusions were formed. However, most scientists receive no training in methods for achieving, assessing, or controlling the quality of research... more
One of the fundamental principles of the database approach is that a database allows a nonredundant, unified representation of all data managed in an organization. This is achieved only when methodologies are available to support... more
State-of-the-art OLTP systems execute distributed transactions using XA-2PC protocol, a presumed-abort variant of the Two-Phase Commit (2PC) protocol. While the XA specification provides for the Read-Only and 1PC optimizations of 2PC, it... more
Inductive databases integrate database querying with database mining. In this article, we present an inductive database system that does not rely on a new data mining query language, but on plain SQL. We propose an intuitive and elegant... more
A federated database system (FDBS) is a collection of cooperating database systems that are autonomous and possibly heterogeneous. In this paper, we define a reference architecture for distributed database management systems from system... more
Corporate databases are potentially rich sources of new and valuable knowledge. Various approaches to "discovering" or "mining" such knowledge have been proposed. We identify an important and previously ignored discovery task, data... more
We present a data model for the initial implementation of MetPetDB, a geochemical database specific to metamorphic rock samples. The database is designed around the concept of preservation of spatial relationships, at all scales, of... more
We address the problem of providing integrated access to diverse and dynamic information sources. We explain how this problem di ers from the traditional database integration problem and we focus on one aspect of the information... more
Database integration provides integrated access to multiple data sources. Database integration has two main activities: schema integration (forming a global view of the data contents available in the sources) and data integration... more
Abstract. Russian social science studying the public opinion has accumulated considerable datasets of research data obtained over the past 25 years. In theory, the quantitative data growth is to be transferred into the emergence of... more
A key problem in providing 'enterprise-wide' information is the integration of databases that have been independently developed. An important requirement is to accommodate heterogeneity and maintain the autonomy of component databases.... more
x Our comments about the paper by Leeflang and Wittink Internat. J. Res. Marketing, 17 2000 105 comprise of two components: first, we address two issues on which we disagree with Leeflang and Wittink: soft versus hard data, and... more
The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation... more
S. Janssen). a v a i l a b l e a t w w w . s c i e n c e d i r e c t . c o m journal homepage: www.elsevier.com/locate/envsci 1462-9011/$ -see front matter #
Schema integration is important in two contexts, logical database design (in centralized DBMS) and global schema design (in distributed DBMS). Performing an integration on real-life schemas without a tool can be very difficult, tedious... more
Mycobacterium tuberculosis spoligotypes that may derive from mixed strain infections are revealed by a novel computational approach a b s t r a c t Global control of tuberculosis is increasingly dependent on rapid and accurate genetic... more
The Gene Ontology (GO) project (http://www. geneontology.org/) provides structured, controlled vocabularies and classi®cations that cover several domains of molecular and cellular biology and are freely available for community use in the... more
Ontology is presently an emerging research topic in the field of artificial intelligence, semantic web, and natural language processing, software engineering, and information architecture etc. Manual Ontology building is essentially a... more
Biological data sources are known for its heterogeneous in many aspects. These aspects include data formats, physical location as well as its query capabilities. These data sources need to be integrated so that researchers can easily... more
The actual value of the Deep Web comes from integrating the data its applications provide. Such applications offer human-oriented search forms as their entry points, and there exists a number of tools that are used to fill them in and... more
We provide a methodology for the creation of ontological partitions in biomedicine and we test the methodology via an application to the phenomenon of blood pressure. An ontology of blood pressure must do justice to the complex networks... more
I1 High-throughput next generation sequencing technologies foster new computing techniques in bioinformatics Mary Qu Yang et al. S1 Prediction of DNA-binding residues from protein sequence information using random forests Liangjiang Wang... more
Background: Fungi secrete various proteins that have diverse functions. Prediction of secretory proteins using only one program is unsatisfactory. To enhance prediction accuracy, we constructed Fungal Secretome Database (FSD).
This paper describes the objectives and goals of the FEMUS project, a joint research project of the database research groups at EPF Lausanne and ETH Zürich. It presents an overview on FEMUS and the first results on comparison between ERC+... more
The paper presents the results of a trilateral research project carried out jointly by German, Israeli, and Palestinian institutions. The overall objective of the project was to develop and adapt models and tools for resource-preserving... more
Database integration is currently solved only for the case of simple structures. Semantics is mainly neglected. It is known but often neglected that database integration cannot be automated. System integration is far more difficult. Both... more
As mobile ad hoc networks (MANETs) are becoming popular for a variety of applications, so are the issues surrounding corresponding implementations. In this paper, a healthcare application is developed for an environment where normal... more
Today, databases provide the best technique for storing and retrieving data, but they suffer from the absence of a semantic perspective, which is needed to reach global goals such as the semantic web and data integration. Using ontologies... more
The cell cycle is one of the biological processes most frequently investigated in systems biology studies and it involves the knowledge of a large number of genes and networks of protein interactions. A deep knowledge of the molecular... more