Versioning
Recent papers in Versioning
Cloud computing offers a powerful abstraction that provides a scalable, virtualized infrastructure as a service where the complexity of fine-grained resource management is hidden from the end-user. Running data analytics applications in... more
Databases and documents are commonly kept separate within organizations, managed by Database Management Systems (DBMSs) and Information Retrieval Systems (IRSs), respectively. This separation has... more
The present paper aims to shed light on the experiences of racism faced by second-generation black immigrants in Canada and explore their quest for identity in the context of a multicultural Canada. David Chariandy's novel Brother,... more
In unstructured distributed P2P systems there is no logical structure to control the peers joining and leaving the network, which can occur at any time due to mobility. Thus, consistent data exchange and data availability are very... more
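As a purely illustrative aside (not taken from the paper above), one common way to keep replicated data consistent among peers that come and go freely is to tag each replica with a version vector; the hypothetical sketch below shows how comparing two vectors distinguishes ordered updates from concurrent ones.

    # Minimal version-vector sketch (illustrative only): each replica keeps a
    # counter per peer; comparing vectors tells whether one update happened
    # before another or whether the two updates are concurrent.
    def dominates(a, b):
        """True if vector a reflects every event that vector b reflects."""
        keys = set(a) | set(b)
        return all(a.get(k, 0) >= b.get(k, 0) for k in keys)

    def compare(a, b):
        if dominates(a, b) and dominates(b, a):
            return "equal"
        if dominates(a, b):
            return "a-after-b"
        if dominates(b, a):
            return "b-after-a"
        return "concurrent"          # conflicting replicas must be reconciled

    # Peer 1 and peer 2 update the same object independently:
    v1 = {"peer1": 2, "peer2": 0}
    v2 = {"peer1": 1, "peer2": 1}
    print(compare(v1, v2))           # -> "concurrent"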
This paper considers a two-stage development problem for information goods with costless quality degradation. In our model, a seller of information goods faces customers that are heterogeneous with regard to both the marginal willingness... more
It is commonly believed that piracy of information goods leads to lower profits, which translate to lower incentives to invest in innovation, and eventually to lower quality products. Manufacturers, policy-makers, and researchers, all... more
With falling storage costs, it becomes more and more feasible, and popular, to retain past versions of documents and data. While being able to undo changes is valuable in itself, retained versions become even more valuable if the data is queryable. Nowadays, there are two... more
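To make the idea of queryable version histories concrete, here is a minimal sketch that answers an "as of" query over retained versions; it assumes a simple per-key list of timestamped values and is not a description of any system from the abstract.

    # Toy versioned store: keep every (timestamp, value) pair per key and
    # answer "what did this key hold at time ts?" queries.
    class VersionedStore:
        def __init__(self):
            self._versions = {}                      # key -> list of (ts, value)

        def put(self, key, value, ts):
            self._versions.setdefault(key, []).append((ts, value))

        def get_as_of(self, key, ts):
            """Return the latest value of `key` written at or before `ts`."""
            best = None
            for vts, value in sorted(self._versions.get(key, [])):
                if vts <= ts:
                    best = value
            return best

    store = VersionedStore()
    store.put("doc", "draft", ts=1)
    store.put("doc", "final", ts=5)
    print(store.get_as_of("doc", 3))                 # -> "draft"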
The maintenance of materialized views in large-scale environments composed of numerous information sources (ISs), such as in the WWW, is complicated by ISs not only continuously modifying their contents but also their capabilities... more
The trajectory data warehouse (TDW) view definitions are constructed from heterogeneous mobile information source schemas that are increasingly independent. In fact, these sources frequently change their content due to perpetual transactions... more
File-sharing semantics are used by file systems to share data among concurrent client processes in a consistent manner. Session semantics is a widely used file-sharing semantics in Distributed File Systems (DFSs). The main... more
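For readers unfamiliar with session semantics, the toy sketch below shows its defining property: a client's writes become visible to other clients only when the writing session is closed. This is an assumption-laden illustration, not the DFS mechanism from the paper.

    # Toy session-semantics model: open() hands out a private copy, write()
    # changes only that copy, and close() publishes it for everyone else.
    class SessionFS:
        def __init__(self):
            self.files = {}                              # committed contents

        def open(self, name):
            return {"name": name, "data": self.files.get(name, "")}

        def write(self, session, data):
            session["data"] = data                       # private copy only

        def close(self, session):
            self.files[session["name"]] = session["data"]   # publish on close

    fs = SessionFS()
    a = fs.open("notes.txt")
    fs.write(a, "hello from A")
    print(fs.open("notes.txt")["data"])   # "" - A has not closed yet
    fs.close(a)
    print(fs.open("notes.txt")["data"])   # "hello from A"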
Hadoop Distributed File System (HDFS) is the core component of the Apache Hadoop project. In HDFS, the computation is carried out in the nodes where the relevant data is stored. Hadoop also implements a parallel computational paradigm known as... more
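The MapReduce style that HDFS enables can be sketched with a word count written in the Hadoop Streaming manner; this is only an illustration (in real use, mapper and reducer would be separate scripts passed to the streaming jar), not code from the paper.

    from itertools import groupby

    def mapper(lines):
        # Emit (word, 1) for every word; in Hadoop the input split lives on
        # the same node that runs this function.
        for line in lines:
            for word in line.split():
                yield word, 1

    def reducer(pairs):
        # Hadoop delivers mapper output to reducers sorted and grouped by key.
        for word, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
            yield word, sum(count for _, count in group)

    if __name__ == "__main__":
        text = ["versioning in hdfs", "hdfs stores blocks"]
        print(dict(reducer(mapper(text))))   # {'blocks': 1, 'hdfs': 2, ...}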
Background: Provenance is a critical ingredient for establishing trust of published scientific content. This is true whether we are considering a data set, a computational workflow, a peer-reviewed publication or a simple scientific claim... more
Taking as its reference the editorial model being developed for the poetry of Pedro Homem de Mello (1904-1984), this work reflects on the problem of the critical attribution of authority in cases where there are documented... more
Immeasurable thanks go to the Almighty God for granting me the grace and direction to complete this thesis. My gratitude goes to my parents, Mr. and Mrs. Daniel Ani, and my siblings, my family members, friends and my dear Sarah Garba... more
User-Defined Functions (UDF) allow application programmers to specify analysis operations on data, while leaving the data management tasks to the system. This general approach enables numerous custom analysis functions and is at the heart... more
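As a small, hedged illustration of the UDF idea, the snippet below registers a custom function with a database engine and lets the engine handle storage and scanning; SQLite's Python driver is used only because it is readily available, since the abstract does not name a specific system.

    import sqlite3

    def normalize(s):
        # Custom analysis logic the engine itself does not provide.
        return s.strip().lower()

    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE docs (title TEXT)")
    conn.execute("INSERT INTO docs VALUES ('  Versioning Survey  ')")
    conn.create_function("normalize", 1, normalize)       # register the UDF

    # The query calls the UDF; storage and iteration stay with the engine.
    print(conn.execute("SELECT normalize(title) FROM docs").fetchone()[0])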
The ubiquity of Big Data has greatly influenced the direction and the development of storage technologies. To meet the needs of storing and analyzing Big Data, researchers and administrators have turned to parallel and distributed storage... more
In recent years, the amount of data generated by information systems has exploded. It is not only the quantities of information, now measured in exabytes, but also the variety of these data, which are more and more structurally... more
Modern-day systems are facing an avalanche of data, and they are being forced to handle more and more data-intensive use cases. These data come in many forms and shapes: sensors (RFID, Near Field Communication, weather sensors),... more
Web repositories are large-scale warehouses of data downloaded from the Web, needed by applications that summarize that data to produce results that help people use information. Time is a central dimension of Web data, because the Web is... more
A massive volume of data is currently being produced by the most varied kinds of data sources. Easy access to these data opens up new opportunities; however, choosing which data sources are most suitable for... more
Emergency management is a complex task, since it involves communication and collaboration among many different organizations and their systems. Data integration and system interoperability are among the biggest challenges in this area. As... more
As data volumes increase at a high speed in more and more application fields of science, engineering, information services, etc., the challenges posed by data-intensive computing gain an increasing importance. The emergence of highly... more
The use of ontology is widely spread among software engineering groups as a way to represent, structure, share and reuse knowledge. As projects progress, the ontological understanding of the domain may change and evolve. New domain concepts... more
With the emergence of Cloud Computing, the amount of data generated in different fields such as physics, medical, social networks, etc. is growing exponentially. This increase in the volume of data and their large scale make the problem... more
In this era of developing technologies, one of the most promising is cloud computing, which has been in use for years by individuals and large enterprises to provide different kinds of services to the world. Cloud computing... more
Record deduplication (RD) aims to identify instances that represent the same real-world entity in data repositories. In the government setting, the RD process makes it easier to identify irregularities and... more
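A minimal sketch of record deduplication follows; the records, threshold, and similarity measure are assumptions for illustration, not the paper's method. Pairs of records whose string similarity exceeds a threshold are flagged as candidate duplicates of the same real-world entity.

    from difflib import SequenceMatcher
    from itertools import combinations

    records = [
        {"id": 1, "name": "Maria da Silva"},
        {"id": 2, "name": "Maria Silva"},
        {"id": 3, "name": "Joao Souza"},
    ]

    def similarity(a, b):
        return SequenceMatcher(None, a.lower(), b.lower()).ratio()

    THRESHOLD = 0.8
    duplicates = [
        (r1["id"], r2["id"])
        for r1, r2 in combinations(records, 2)
        if similarity(r1["name"], r2["name"]) >= THRESHOLD
    ]
    print(duplicates)    # [(1, 2)] - records 1 and 2 likely denote one entity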
Hardware transactional memory (HTM) systems have been studied extensively along the dimensions of speculative versioning and contention management policies. The relative performance of several design policies has been discussed at length... more
The production and availability of unstructured information on the Web grows daily. This abundance of unstructured information poses a major challenge for the acquisition of knowledge that can be processed by... more
Ontologies evolve continuously throughout their lifecycle to respond to different change requirements. Several problems emanate from ontology evolution: capturing change requirements, change representation, change impact analysis and... more
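To make "change representation" concrete, here is a deliberately minimal sketch that treats each ontology version as a set of triples and reports what was added and removed between versions; this is an assumption for illustration, not the paper's change model.

    # Two ontology versions as sets of (subject, predicate, object) triples.
    v1 = {
        ("Car", "subClassOf", "Vehicle"),
        ("Car", "hasPart", "Engine"),
    }
    v2 = {
        ("Car", "subClassOf", "Vehicle"),
        ("Car", "hasPart", "Engine"),
        ("ElectricCar", "subClassOf", "Car"),
    }

    added, removed = v2 - v1, v1 - v2
    print("added:", added)      # the new ElectricCar axiom
    print("removed:", removed)  # empty set in this example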
To distribute software, commercial vendors of proprietary software have the opportunity to use a dual licensing (DL) strategy, i.e., to provide their software under two different licensing terms (proprietary and open source). We... more
Resource management is a key factor in the performance and efficient utilization of cloud systems, and many research works have proposed efficient policies to optimize such systems. However, these policies have traditionally managed the... more
The NSDL Metadata Registry is designed to provide humans and machines with the means to discover, create, access and manage metadata schemes, schemas, application profiles, crosswalks and concept mappings. This paper describes the general... more
Almost Home is an enthralling examination of the pursuit of belonging, the contradictions of black kinship and the contestations of colonial institutions. Ruma Chopra exposes the fraught nature of black mobility and freedom as it sits in... more
Quality of data plays a very important role in any scientific research. In this paper we present some of the challenges that we face in managing and maintaining data quality for a terabyte scale biometrics repository. We have developed a... more
Product Data Management (PDM) and Software Configuration Management (SCM) are the disciplines of building and controlling the evolution of complex artifacts, either physical or software. Surprisingly, these two fields have evolved... more
Data warehouse systems integrate data from heterogeneous sources. These sources are autonomous in nature and change independently of a data warehouse. Owing to changes in data sources, the content and the schema of a data warehouse may... more
In WebDAV: Next-Generation Collaborative Web Authoring, Lisa Dusseault thoroughly describes the WebDAV protocol and the rationale behind the current version (see Y. Goland et al., HTTP Extensions for Distributed Authoring WebDAV... more
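To give a flavour of the protocol the book covers, the snippet below issues a raw WebDAV PROPFIND request from Python; the URL and credentials are placeholders, and this is only an illustration, not material from the review.

    import requests

    # PROPFIND lists a collection's members and their properties; Depth: 1
    # asks for the collection itself plus its immediate children.
    response = requests.request(
        "PROPFIND",
        "https://dav.example.org/docs/",
        headers={"Depth": "1", "Content-Type": "application/xml"},
        auth=("user", "password"),
    )
    print(response.status_code)     # 207 Multi-Status on success
    print(response.text[:200])      # XML describing each resource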
This paper addresses the design and implementation of an adaptive document version management scheme. Existing schemes typically assume: (i) a priori expectations for how versions will be manipulated and (ii) fixed priorities between... more
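One way to picture what a version-management scheme has to trade off is the mix of full copies and deltas. The sketch below is a toy under assumed policies, not the adaptive scheme the paper proposes: it keeps every third version whole and stores the rest as line-level forward deltas replayed from the nearest full copy.

    def make_delta(old, new):
        # Record only the lines of `new` that differ from `old`, plus its length.
        changes = [(i, line) for i, line in enumerate(new)
                   if i >= len(old) or old[i] != line]
        return changes, len(new)

    def apply_delta(old, delta):
        changes, length = delta
        new = list(old)[:length] + [""] * max(0, length - len(old))
        for i, line in changes:
            new[i] = line
        return new[:length]

    class VersionStore:
        def __init__(self, full_every=3):
            self.full_every, self.entries = full_every, []

        def commit(self, lines):
            if len(self.entries) % self.full_every == 0:
                self.entries.append(("full", list(lines)))
            else:
                prev = self.version(len(self.entries) - 1)
                self.entries.append(("delta", make_delta(prev, lines)))

        def version(self, i):
            base = i - i % self.full_every        # nearest preceding full copy
            lines = list(self.entries[base][1])
            for j in range(base + 1, i + 1):
                lines = apply_delta(lines, self.entries[j][1])
            return lines

    store = VersionStore()
    store.commit(["a", "b", "c"])
    store.commit(["a", "B", "c"])
    store.commit(["a", "B", "c", "d"])
    print(store.version(2))                       # ['a', 'B', 'c', 'd']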
In the rapidly evolving Cloud market, the amount of data being generated is growing continuously and as a consequence storage as a service plays an increasingly important role. In this paper, we describe and compare two new approaches,... more
Active Storage provides an opportunity for reducing the bandwidth requirements between the storage and compute elements of current supercomputing systems, and leveraging the processing power of the storage nodes used by some modern file... more
The capability of taking snapshots is approaching ubiquity as a feature of file systems and data storage arrays. Here, we present an approach to structuring and managing snapshots in a storage space that provides for rapid creation and... more
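As a rough mental model of how such snapshot facilities behave (a toy with invented names, not the structure presented in the paper), the sketch below makes snapshot creation a constant-time capture of the current block map, while later writes leave captured maps untouched.

    class Volume:
        def __init__(self):
            self.blocks = {}                  # block number -> data
            self.snapshots = {}               # snapshot name -> frozen block map

        def write(self, block_no, data):
            # Replace the map instead of mutating it, so any snapshot that
            # captured the old map continues to see the old contents.
            self.blocks = dict(self.blocks)
            self.blocks[block_no] = data

        def snapshot(self, name):
            self.snapshots[name] = self.blocks    # O(1): capture a reference

        def read(self, block_no, snapshot=None):
            source = self.snapshots[snapshot] if snapshot else self.blocks
            return source.get(block_no)

    vol = Volume()
    vol.write(0, "v1")
    vol.snapshot("before-upgrade")
    vol.write(0, "v2")
    print(vol.read(0))                            # "v2"
    print(vol.read(0, snapshot="before-upgrade")) # "v1"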
The judicial system is made up of countless documents related to legal proceedings. These documents may contain relevant information to support decision-making in future cases. However, collecting this... more
In this paper we discuss several features of XP we have used in developing curricula and courses at Duke University and the University of Northern Iowa. We also discuss those practices of XP that we teach as part of the design and... more
We present NeST, a flexible software-only storage appliance designed to meet the storage needs of the Grid. NeST has three key features that make it well-suited for deployment in a Grid environment. First, NeST provides a generic data... more