ENM Pipeline Conference Call 20 Oct 2004

Your trail: SemanticsInKepler | KRSMSOntoCreationGuide | SMSServiceInterfaces | KRSMSSemanticAnnotationLanguage | SMSHotTopics | AllHandsMeetingSMSNotesNov04 | EScienceLinkUpOct04 | SDSCWeeklyMeetings | KBISMeetingNotes | TaxonSMSApril04PrototypeNotes

This is version 13. It is not the current version, and thus it cannot be edited.
[Back to current version] [Restore this version]

Participants

Jones, Zhang, Pereira, Higgins, Tao, Schildhauer, Spears, Berkley

Discussion

Dan gave overview of pipeline refactoring

Discussion of how to handle distributed execution

Dan has "bulletin-board" model in mind right now
Matt would like a more 'cluster' oriented approach with a controller

Ricardo suggests that we should reduce granularity of inputs to make it computationally tractable and then address the parallelization more comprehensively in a second iteration
Ricardo: cluster at Kansas should be examined to discover issues relevant to the parallelization effort for Kepler

Rod gave overview or DiGIR/DarwinCore data sources in Kepler

Exposes data as fields, rows, tables

Jianting's progress on the GIS actors

Convex hull, rasterization, buffering

Decisions

Delay parallelization effort until we can do it right
Implement a simpler GARP workflow that can run in one day on one machine

Preprocessing to a reasonable grid density
Choose fewer species (e.g., 10-50)
Possibly eliminate the best subsets approach to reduce comnputational demand
Implement whole end-to-end iteration in the workflow for demonstration purposes

Action items

Rod will create another option in the DarwinCore data source to allow aggregations by species (across providers)
Dan and Chad will handle format conversion from the .raw files to ascii
Dan will enumerate tasks to complete ENM pipelinein bugzilla, with an overall tracker bug

Go to top More info... Attach file...

This particular version was published on 20-Oct-2004 12:28:05 PDT by NCEAS.jones.

This material is based upon work supported by the National Science Foundation under award 0225676. Any opinions, findings and conclusions or recomendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF).


Long Term Ecological Research Network, UNM	National Center for Ecological Analysis and Synthesis, UCSB	Biodiversity Research Center, KU	San Diego Supercomputer Center, UCSD


Arizona State University	Napier University	University of North Carolina	University of Vermont


UC Davis Genome Center

Copyright 2004 Partnership for Biodiversity Informatics, University of New Mexico, The Regents of the University of California, and University of Kansas