Currently a large number of companies offer content analysis and social media data mining services for the purposes of opinion analysis and reputation management. A high percentage of small and medium-sized enterprises (SMEs) offer solutions specific to one sector or industrial domain. However, acquiring the basic technology needed to offer such services is too complex and represents too high an additional cost for their limited resources. The goal of the European project OpeNER is the reuse and development of components and resources for language processing that provide the necessary technology for industrial and/or academic use. Currently there are many companies offering Content Analytics and Social Internet Mining services for the purposes of Opinion Mining and Reputation Management. A high percentage of Small and Medium Enterprises (SMEs) are active offering niche solutions to sp...
Abstract. Currently there are only a few available language resources for French. Additionally, there is a lack of available language models for tasks such as Named Entity Recognition and Classification (NERC), which makes it difficult to build natural language processing systems for this language. This paper presents a new publicly available supervised Apache OpenNLP NERC model that has been trained and tested under a maximum entropy approach. This new model achieves state-of-the-art results for French when compared with other systems. Finally, we have also extended the Apache OpenNLP libraries to support a part-of-speech feature extraction component, which has been used for our experiments.
Market uptake of the results from security research projects has been an area of concern for a long time. When the ASGARD project was conceived back in 2014, the main goal of the project was “to support LEA Technological Autonomy, by building a sustainable, long-lasting community formed by LEAs, Researchers, and Industry that will create (at little or no cost to LEAs), maintain and evolve a best of class tool set for the extraction, fusion, exchange and analysis of Big Data including cyber-offenses data for forensic investigation”. In this chapter we describe how the project was designed with the aim of improving the efficiency of multidisciplinary collaboration in security research projects. Open-source model concepts and principles were adapted to the needs of security research projects. Fluid, frequent and fruitful collaboration during short full-development cycles and face-to-face “Hackathon”-like events are the backbone of the new approach implemented in ASGARD.
The Internet presents a problem for the protection of intellectual property. Those who create content must be adequately compensated for the use of their works. Rights agencies that monitor the use of these works exist in many jurisdictions. In the traditional broadcast environment this monitoring is a difficult task. With Internet Protocol Television (IPTV) and Next Generation Networks (NGN) this situation is further complicated. In this work we focus on digitally watermarking next generation media broadcasts. We present a framework which provides the ability to monitor media broadcasts and which also utilises a Public Key Infrastructure (PKI) and Digital Certificates. Furthermore, the concept of an independent monitoring agency, which would operate the framework and act as an arbiter, is introduced. We evaluate appropriate short signature schemes, suitable watermarking algorithms and watermark robustness. Finally, the application of the proposed framework in other related scenarios is dis-
The automatic analysis of opinions, usually called opinion mining or sentiment analysis, has gained great importance during the last decade. This is mainly due to the rapid growth of online content on the Internet. So-called aspect-based opinion mining systems aim to detect sentiment at the “aspect” level (i.e. the precise feature on which an opinion is expressed in a clause or sentence). Detecting such aspects requires some knowledge about the domain under analysis. The vocabulary varies between domains, and different words are interesting features in different domains. We aim to generate a list of domain-related words and expressions from unlabeled domain texts, in a completely unsupervised way, as a first step towards a more complex opinion mining system.
Currently there are many companies offering Content Analytics and Social Internet Mining services for the purposes of Opinion Mining and Reputation Management. A high percentage of Small and Medium Enterprises (SMEs) are active offering niche solutions to specific segments of the market and/or domains. However, acquiring or developing the base qualifying technologies required to enter the market is an expensive undertaking that redirects the already limited resources of SMEs away from offering products and services that the market demands. The main goal of the OpeNER European project is the reuse and repurposing of existing language resources and data sets to provide a set of underlying technologies to the broader industrial and academic community.
ABSTRACT The question of the capacity of artificial intelligence to make moral decisions has been a key focus of investigation in robotics for decades. This question has now become pertinent to automated vehicle technologies, as a question of understanding the capacity of artificial driving intelligence to respond to unavoidable road traffic accidents. Artificial driving intelligence will make a calculated decision that could equate to deciding who lives and who dies. In calculating such important decisions, does the driving intelligence require moral intelligence and a capacity to make informed moral decisions? Artificial driving intelligence will be determined by, at the very least, state laws, driving codes, and codes of conduct relating to driving behaviour and safety. Does it also need to be informed by ethical theories, human values, and human rights frameworks? If so, how can this be achieved and how can we ensure there are no moral biases in the moral decision-making algorithms? The question of moral capacity is complex and has become the ethical focal point of this technology. Research has centred on applying Philippa Foot’s famous trolley dilemma. We claim that before applications attempt to focus on moral theories, there is a necessary precedent to utilise the trolley dilemma as an ontological experiment. The trolley dilemma is succinct in identifying important ontological differences between human driving intelligence and artificial driving intelligence. In this paper, we argue that when the trolley dilemma is focused upon ontology, it has the potential to become an important elucidatory tool. It can act as a prism through which one can perceive different ontological aspects of driving intelligence and assess response decisions to unavoidable road traffic accidents.
The identification of the ontological differences is integral to understanding the underlying variances that support human and artificial driving decisions. Ontologically differentiating between these two contexts allows for a more complete interrogation of the moral decision-making capacity of the artificial driving intelligence.
Computer vision methods for advanced driver assistance systems (ADAS) must be developed considering the strong requirements imposed by the industry, including real-time performance on low-cost and low-consumption hardware (HW), and rapid time to market. These two apparently contradictory requirements create the necessity of adopting careful development methodologies. In this study the authors review existing approaches and describe the methodology to optimise computer vision applications without incurring costly code optimisation or migration into special HW. This approach is exemplified by the improvements achieved in the successive re-designs of vehicle detection algorithms for monocular systems. In the experiments the authors observed a ×15 speed-up between the first and fourth prototypes, progressively optimised using the proposed methodology from the very first naive approach to a fine-tuned algorithm.
reputation. A high percentage of small and medium-sized enterprises (SMEs) offer solutions specific to one sector or industrial domain. However, acquiring the basic technology needed to offer such services is too complex and represents too high an additional cost for their limited resources. The goal of the European project OpeNER is the reuse and development of components and resources for language processing that provide the necessary technology for industrial and/or academic use. Keywords: Abstract: Currently there are many companies offering Content Analytics and Social Internet Mining services for the purposes of Opinion Mining and Reputation Management. A high percentage of Small and Medium Enterprises (SMEs) are active offering niche solutions to specific segments of the market and/or domains. However, acquiring or developing the base qualifying technologies required to enter the market is an expensive undertaking that r...
There is currently a lack of available language resources for French, especially for basic tasks such as Named Entity Recognition and Classification (NERC), which makes it difficult to build natural language processing systems for this language. This paper presents a supervised NERC model for French that has been trained and tested under a maximum entropy approach. The Apache OpenNLP libraries have also been extended to support the required part-of-speech feature extraction component. The model achieves state-of-the-art results for French when compared to similar systems developed for other languages, and will be made publicly available.
The Internet presents a problem for the protection of intellectual property. Those who create content must be adequately compensated for the use of their works. Rights agencies who monitor the use of these works exist in many jurisdictions. In the traditional broadcast environment this monitoring is a difficult task. With Internet Protocol Television (IPTV) and Next Generation Networks (NGN) this
ACM ICMR is the premier scientific conference for multimedia retrieval held worldwide, with the stated mission "to illuminate the state of the art in multimedia retrieval by bringing together researchers and practitioners in the field of multimedia retrieval". The conference aims to promote intellectual exchanges and interactions among scientists, engineers, students, and multimedia researchers in academia as well as industry through various events, including keynote talks; oral, special, and poster sessions focused on research challenges and solutions; technical and industrial demonstrations of prototypes; tutorials; and research and industrial panels.
Papers by Seán Gaines