ENM Pipeline Conference Call 20 Oct 2004

Your trail: SemanticsInKepler | KRSMSOntoCreationGuide | SMSServiceInterfaces | KRSMSSemanticAnnotationLanguage | SMSHotTopics | AllHandsMeetingSMSNotesNov04 | EScienceLinkUpOct04 | SDSCWeeklyMeetings | KBISMeetingNotes | TaxonSMSApril04PrototypeNotes

Participants

Jones, Zhang, Pereira, Higgins, Tao, Schildhauer, Spears, Berkley

Discussion

Dan gave overview of pipeline refactoring

Discussion of how to handle distributed execution

Dan has "bulletin-board" model in mind right now
Matt would like a more 'cluster' oriented approach with a controller

Ricardo suggests that we should reduce granularity of inputs to make it computationally tractable and then address the parallelization more comprehensively in a second iteration
Ricardo: cluster at Kansas should be examined to discover issues relevant to the parallelization effort for Kepler

Rod gave overview or DiGIR/DarwinCore data sources in Kepler

Exposes data as fields, rows, tables

Jianting's progress on the GIS actors

Convex hull, rasterization, buffering

Decisions

Delay parallelization effort until we can do it right
Implement a simpler GARP workflow that can run in one day on one machine

Preprocessing to a reasonable grid density
Choose fewer species (e.g., 10-50)
Possibly eliminate the best subsets approach to reduce computational demand
Implement whole end-to-end iteration in the workflow for demonstration purposes

Action items

Rod will create another option in the DarwinCore data source to allow aggregations by species (across providers)
Dan and Chad will handle format conversion from the .raw files to ascii
Dan will enumerate tasks to complete ENM pipeline in bugzilla, with an overall tracker bug

Will assign developers as needed to get these steps done

Go to top Edit this page More info... Attach file...

This page last changed on 20-Oct-2004 12:29:05 PDT by NCEAS.jones.

This material is based upon work supported by the National Science Foundation under award 0225676. Any opinions, findings and conclusions or recomendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF).


Long Term Ecological Research Network, UNM	National Center for Ecological Analysis and Synthesis, UCSB	Biodiversity Research Center, KU	San Diego Supercomputer Center, UCSD


Arizona State University	Napier University	University of North Carolina	University of Vermont


UC Davis Genome Center

Copyright 2004 Partnership for Biodiversity Informatics, University of New Mexico, The Regents of the University of California, and University of Kansas