{style:type=div|align=center}<h1>Quality of Data and Multi-Source Information Systems</h1>\\<h2>ARA Masses de Données 2006-2009 (ANR)</h2>{style}
Data quality problems in databases, data warehouses and, more generally, multi-source information systems are endemic across all types of data and all application domains: commercial, biomedical, industrial, scientific and geographical data. Among the many problems encountered in the massive data sets now available (structured or semi-structured) are data errors, outliers, duplicates, inconsistencies, missing values, and incomplete, uncertain, obsolete or unreliable data. These problems seriously degrade the results of information searches, even when the search process itself is effective, as well as the results of the data analysis that precedes any decision-making.\\\\QUADRIS is a project funded by the [ARA «Masses de Données»>http://acimd.labri.fr/] research program of the French [Agence Nationale de la Recherche>http://www.agence-nationale-recherche.fr/].\\\\The objective of the QUADRIS project (36 months) is to solve the various data quality problems that arise when modelling and designing information systems, when integrating and querying multi-source information and, finally, when evaluating multi-source information systems. QUADRIS will provide theoretical solutions to the many problems of data quality and information system quality, validated in real settings on very large data volumes in three representative fields.

The multi-disciplinary research project QUADRIS will tackle these problems by organizing its research work along four main axes: \\
<blockquote>1. The *~~methodological axis~~* (mainly carried out by the *[CEDRIC>http://cedric.cnam.fr]* Lab of CNAM) aims at adapting current methods for the conceptual analysis and design, engineering, reverse engineering and migration of multi-source information systems so that they include the evaluation and control of the various facets of data quality, jointly with the evaluation of system quality,</blockquote><blockquote>
2. The *~~theoretical and technical axis~~* (carried out jointly by the *[IRISA>http://www.irisa.fr]* and *[PRISM>http://www.prism.uvsq.fr]* Labs) has two objectives: i) proposing metrics, methods and algorithmic approaches to continuously analyze, detect, control and "clean" various data quality problems in multi-source information systems; ii) reconsidering multi-source information mediation and integration and the optimization of multi-source queries so that they take data quality control methods into account, using adaptive query processing based on a trade-off between the query cost and the quality of the data retrieved from the multiple sources, \\
</blockquote><blockquote>
3. The *~~technological axis~~* aims at developing an original, configurable experimental middleware prototype that makes it possible to: i) detect, measure, control and correct the quality of large data volumes for any type of multi-source information system; ii) evaluate the quality of a multi-source information system; iii) study mediation and integration driven by data quality, and the optimization of multi-source queries based on data quality control and system quality control. \\
</blockquote><blockquote>
4. The *~~applicative axis~~*, in which the QUADRIS project will validate the aforementioned research proposals in three application areas that are representative because of their huge volumes of data, their complex underlying models and their numerous and specific data quality problems. These application domains are: the biomedical domain (medical records collected by health professionals, *[Curie Institute>http://www.curie.fr]*), the commercial domain (data from *[EDF>http://www.edf.fr]*'s Customer Relationship Management, CRM) and the geographical domain (*[LSIS>http://www.lsis.org]*). \\
</blockquote>