Science Environment for Ecological Knowledge

Beam Knowledge Rep Sept 04

Difference between version 54 and version 53:

At line 271 added 30 lines.
+ ** General steps outlined:
+ *** Data Request
+ *** Quality Control and Assurance (if from different sites)
+ *** Data Integration
+ *** Quality Control and Assurance (of the integration)
+ *** Analysis
+ *** Capture result of analysis …
+ ** Workflow we developed:
+ *** (workflow here ...)
+ ** Useful Actors
+ *** List Summarizer
+ **** A set of values in a data column
+ *** List Comparator
+ **** Given two sets (lists), do they match?
+ **** Which values in the first list aren’t in the second?
+ **** Map values in the first list to new values
+ *** Nested Transpose
+ **** (site, taxon, count)
+ **** [(A, x, 3), (A, y, 1), (B, y, 4), (C, z, 2)]
+ **** Transpose to:
+ ***** (site, x, y, z)
+ ***** [(A, 3, 1, 0), (B, 0, 4, 0), (C, 0, 0, 2)]
+ **** Notes about this from Bertram and Shawn after meeting:
+ ***** Given an annotated schema S, denoted S*, and a white-box actor q s.t. q(S*) -> S’, we want to “push through” the annotations to obtain S’*.
+ ***** The “nested” transpose is basically a combination of lower-level algebraic operators, such as (theoretical) group-by, matrix transpose, projection, etc. So, given q as such a plan of operators, can we reason over the operators (white-box actors) in q to obtain S’*? Using symbolic manipulation? Using the chase, e.g., as for similar problems with integrity constraints?
+ ** Often-found pattern of computation
+ *** Can Kepler/Ptolemy efficiently and conveniently support the following pattern?
+ *** Given a data set, construct a scatter plot for each pair of variables, let the user select a subset of the plots (or the pairs of variables of interest), and return the data subsets for the chosen pairs, with no extraneous variables
+ *** Similarly, given data sets, an actor computes a set of regressions, the user is shown the results and selects the regressions of interest, and the workflow then proceeds using only the selected regressions
+ *** These "patterns" can be supported now (with lots of plumbing) using the browser actor. Can we also add functionality to better support/model these patterns?
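
The three "Useful Actors" listed above can be sketched as plain Python functions. This is only an illustration of the intended behaviour, not Kepler/Ptolemy actor code: the function names (`summarize`, `compare`, `remap`, `nested_transpose`) are hypothetical, and two choices are assumptions rather than anything decided in the notes: values keep first-seen order, and repeated (site, taxon) pairs are summed.

```python
def summarize(values):
    """List Summarizer: distinct values of a data column, in first-seen order."""
    return list(dict.fromkeys(values))


def compare(first, second):
    """List Comparator: report whether the two lists match as sets,
    and which values of `first` are not in `second`."""
    second_set = set(second)
    missing = [v for v in summarize(first) if v not in second_set]
    return set(first) == second_set, missing


def remap(values, mapping):
    """Assign values to new values; unmapped values pass through unchanged
    (pass-through is an assumption)."""
    return [mapping.get(v, v) for v in values]


def nested_transpose(rows, fill=0):
    """Turn (site, taxon, count) rows into one row per site
    with one column per taxon; absent combinations get `fill`."""
    rows = list(rows)
    sites = summarize(site for site, _, _ in rows)
    taxa = summarize(taxon for _, taxon, _ in rows)
    counts = {}
    for site, taxon, count in rows:
        # Assumption: repeated (site, taxon) pairs are summed.
        counts[(site, taxon)] = counts.get((site, taxon), 0) + count
    header = ("site", *taxa)
    table = [(site, *(counts.get((site, t), fill) for t in taxa)) for site in sites]
    return header, table


rows = [("A", "x", 3), ("A", "y", 1), ("B", "y", 4), ("C", "z", 2)]
header, table = nested_transpose(rows)
print(header)  # ('site', 'x', 'y', 'z')
print(table)   # [('A', 3, 1, 0), ('B', 0, 4, 0), ('C', 0, 0, 2)]
```

The example input is the one from the Nested Transpose notes, and the result matches the transposed form given there. Note that the output schema (the taxon columns) depends on the data values, which is what makes pushing annotations through this operator non-trivial.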
Removed lines 273-274
-
-
