A Semantic Enrichment of Data Tables Applied to Food Risk Assessment
Our work deals with the automatic construction of domain-specific data warehouses. Our application domain concerns microbiological risks in food products. The MIEL++ system, implemented during the Sym’Previus project, is a tool based on a database containing experimental and industrial results about the behavior of pathogenic germs in food products. This database is incomplete by nature, since the number of possible experiments is potentially infinite. Our work, developed within the e.dot project, presents a way of compensating for that incompleteness by complementing the database with data automatically extracted from the Web. We propose to query these data through a mediated architecture based on a domain ontology, so the data must first be made compatible with that ontology. In the e.dot project, we focus exclusively on documents in HTML or PDF format that contain data tables. Data tables are a very common presentation scheme for describing synthetic data in scientific articles. These tables are semantically enriched, and we want this enrichment to be as automatic and flexible as possible. We have therefore defined a Document Type Definition named SML (Semantic Markup Language), which can represent additional or incomplete information in a semantic relation, as well as ambiguities and possible interpretation errors. In this paper, we present this semantic enrichment step.
- 2. Buche, P., Dibie-Barthélemy, J., Haemmerlé, O., Houhou, M.: Towards flexible querying of xml imprecise data in a dataware house opened on the web. In: Christiansen, H., Hacid, M.-S., Andreasen, T., Larsen, H.L. (eds.) FQAS 2004. LNCS (LNAI), vol. 3055, pp. 28–40. Springer, Heidelberg (2004)
- 5. e.dot: Progress report of the e.dot project (2004), http://www-rocq.inria.fr/gemo/edot