At line 0 added 95 lines. |
+ Semantic Data Integration |
+ |
+ The area of scientific data integration provides a number of |
+ challenges in addition to the traditional ones in data integration and |
+ database mediation. While data volume can and often is a problem, in a |
+ number of disciplines (e.g., ecology, life sciences in general, |
+ geosciences, etc.) the **semantic heterogeneity and complexity** of |
+ the data can be a significant impediment in itself. For example, an |
+ ecologist may want to combine a number of different data sources as |
+ part of an analytical pipeline or scientific workflow. In order to |
+ facilitate data integration, additional semantic information is often |
+ necessary, for example, on the unit type of a measurement, the |
+ protocol by which data was created or derived, or simply to provide |
+ additional information at the conceputal/ontological level about the |
+ data. |
+ |
+ The purpose of a semantic mediation system is to utilize semantic |
+ annotations, e.g., for smarter (ontology-enabled) data discovery, |
+ semantic type checking and conversion when linking analytical steps to |
+ one another or when binding data sets to analysis steps. |
+ |
+ The goal of this meeting is to identify common techniques and |
+ procedures for semantic mediation and to explore opportunities for |
+ collaborations between UK and US research groups. |
+ |
+ In particular, we plan to concentrate on Semantic registration of data sets, provenance sets, parameter sets, workflow sets, services sets. We will attempt to address questions like the following: |
+ |
+ * What kind of resources should be semantically typed (datasets, databases, services), and what is the semantic typing language for those? |
+ * How is semantic typing employed for data discovery, query rewriting, and scientific workflow planning? |
+ * What does a registry of semantic types look like, and how are data and services registered to it? What is the semantic registration procedure? |
+ * What tools exists to support semantic registration, querying, and reasoning with ontologies, schemas, etc. ? |
+ * What standards can be employed and extended? In particular, what support does EML already provide, and what else is needed? |
+ |
+ Prior to the meeting attendees will be given some **homework**, i.e., to illustrate their approach of semantic registration of data sets, web services, and workflows using two examples. One of them (tentative) can be found here: http://www.sdsc.edu/~ludaesch/Paper/dils04.html. |
+ |
+ |
+ |
+ {{{ |
+ 09:00-10:30 PRESENTATIONS: SEMANTIC EXTENSIONS IN SEEK |
+ |
+ The presentations will specificially address requirements and current |
+ architecture in the context of the SEEK Semantic Mediation System |
+ (SMS): |
+ |
+ 1. How to register data sets, ontologies, workflows, and associations |
+ between them (semantic registration). |
+ |
+ 2. How to put the above to good use, e.g., for "smart data discovery", |
+ semantics-enhanced data integration and mediation, semantics-enhanced |
+ workflow design and execution (this also includes, e.g., required |
+ reasoning services). |
+ |
+ Hopefully we will also be able to report on the linkage between the |
+ SEEK EcoGrid and the SEEK SMS: what are the "structural and semantic |
+ commitments" of the EcoGrid that can be used by SMS. |
+ |
+ In a sense, SMS uses the EcoGrid to do (1). Conversely, applications |
+ such as the SEEK workflow system (AMS/Kepler) use both the EcoGrid and |
+ SMS. This overall picture should also be fleshed out to some extent as |
+ part of this session. |
+ |
+ 10:30-11:00 TEA AND COFFEE |
+ |
+ 11:00-12:30 PRESENTATIONS: SEMANTIC EXTENSIONS IN MYGRID |
+ |
+ Same as above for SEEK, but now for MyGrid! As part of the "homework |
+ assignment" both SEEK and MyGrid folks use the same running example(s) |
+ to illustrate their approaches. |
+ |
+ 12:30-14:00 LUNCH |
+ |
+ 14:00-16:30 INTEROPERABLE SEMANTIC REGISTRATION, MEDIATION, WORKFLOWS I |
+ |
+ This 2.5 hour session might be parallelized into break-out |
+ sessions. The goal is to flesh out an interoperable semantic |
+ registration approach that will work across SEEK, MyGrid, and related |
+ "semantics-aware" systems. A MyGrid semantically registered service |
+ should be usable from a semantics-aware SEEK workflow. Conversely, a |
+ semantics-aware MyGrid workflow should be able to invoke SEEK services |
+ and take advantage of semantic types. |
+ |
+ Part of this discussion should also deal with non-procedural data |
+ integration, i.e., based on declarative views as opposed to procedural |
+ workflows. |
+ |
+ 15:30-16:00 TEA AND COFFEE |
+ |
+ 16:00-17:00 INTEROPERABLE SEMANTIC REGISTRATION, MEDIATION, WORKFLOWS II |
+ |
+ 17:00-18:00 PLENARY SESSION: REPORTING, NEXT STEPS |
+ |
+ 18:00 CLOSE |
+ }}} |
+ ----------------------------------------------- |
+ [Back to the meeting agenda | EdinburghMeeting] |