From - Mon Jan 27 09:07:34 2003
X-Mozilla-Status: 0001
X-Mozilla-Status2: 00800000
Message-ID: <3E352105.2090105@cria.org.br>
Date: Mon, 27 Jan 2003 09:07:33 -0300
From: Ricardo Scachetti Pereira <ricardo@cria.org.br>
Organization: Centro de =?ISO-8859-1?Q?Refer=EAncia_em_Informa=E7=E3o?=
 =?ISO-8859-1?Q?_Ambiental_=28CRIA=29?=
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.0.1) Gecko/20020823 Netscape/7.0
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: "Stewart, Aimee Marian" <astewart@ku.edu>
CC: "Beach, James H" <beach@ku.edu>, "Vorontsov, Gregory Yakovlevich"
 <voron999@ku.edu>, "Downie, Scott Russell" <sdownie@ku.edu>, 
 reed.beaman@yale.edu
Subject: Re: Value added products to collection as webservices
References: <F73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu>
Content-Type: multipart/alternative;
 boundary="------------010809040406030205070706"


--------------010809040406030205070706
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 8bit

    It is good that you liked, Aimee.
    See answers below:

Stewart, Aimee Marian wrote:

> Hi Ricardo,
>  
> Great suggestions!  I think that this is an excellent way to increase 
> our value to data providers.   I will think about some more services 
> that can be implemented.  
>  
> By the way, 2 services are available 
> at http:\\beta.lifemapper.org\Services\QueryByLocation 
> and  http:\\beta.lifemapper.org\Services\QueryTaxa .  Number 6 on your 
> spreadsheet could be easily be put together from methods in these 
> services. 

You are right!
There is also another implementation of this service that creates many 
models, jackknifing the input datapoints (submitted paper by Town and 
I), that might be interesting, too. Reed is involved on this one, too.

>  
> Number 7 would be easy once we gather layers for areas outside the 
> US.  I will get to that once we have successfully moved to production.

Yeah! The data repositories that those kind of services rely on can be 
build in phases, as more data is added on. For the parts of the world 
that we don't have data, we output a diagnostic message: no data 
available. Those datasets can also be used to normalize those same data 
fields.

>  
> We can create the initial services, then create some applications 
> using the services to implement commiting the new data back to 
> provider databases (interactively, one record at a time or groups at a 
> time for normalization, in Specify perhaps).  To get some interest 
> quickly, I think we need to allow providers a way to improve their own 
> data without on-site programmers. 

Right.

>  
> Report cards could include geostatistical analysis of entire 
> collections, genus, species, subspecies.  I've been wanting to explore 
> that ESRI extension!

Yeah! I've been thinking about measuring correlation between a given set 
of points (per species, genus, museum, collector, etc) and features 
related to infrastructure (roads, rivers), and also with county 
centroids. That would give a basis for comparing sampling between 
datasets. Also, we could measure correlation of datapoints and 
ecological dimensions, but that is for other purposes.
In this line, I was also thinking about some kind of density measures 
(spatial and temporal, such as, points per square mile, or per year), 
and also measures of clustering.

>  
> Perhaps a short questionaire for existing contributors could determine 
> whether there is any interest in these services, or if they have some 
> ideas we haven't yet come up with.  The initial Fishnet (Ed) reaction 
> was very positive!

Did you show that to Ed? Cool!!!

>  
> Aimee
>  
>  
>
>  -----Original Message-----
> From: Ricardo Scachetti Pereira [mailto:ricardo@cria.org.br]
> Sent: Friday, January 24, 2003 11:47 AM
> To: Beach, James H; voron999@UKANS.EDU; Stewart, Aimee Marian; Downie, 
> Scott Russell; reed.beaman@yale.edu
> Subject: Value added products to collection as webservices
>
>         Hey, guys.
>         How have you been?
>
>         Let me bring something up, so we can discuss it and think
>     about it a little, if you want.
>         I've been thinking about value added products to collections
>     and came up with some ideas.
>         Among the categories of value added products, I could list those:
>         1) Error checking and correction tools, such as: normalization
>     of field values, checking for invalid characters, checking
>     compliance of date or coordinate formats, checking mispellings and
>     typos, cross-reference of textual location descriptions (country,
>     state, county, etc) with georeferences, etc; Reed's georeferencing
>     tool could be put in this category, if you consider that putting a
>     georeference in a record that does not contain one means
>     correcting it, i.e., filling in the blanks on georeference field.
>         2) Visualization tools: Showing data on maps (like GIS tools
>     and WMS services);
>         3) Statistics and other analysis tools: Measuring clustering
>     of points, correlation between data points and infrastructure,
>     sampling densities, etc.
>         As far as I know, the tools in category #3 don't lend
>     themselves well to a webservice implementation. They could be
>     implemented as services, but they are not very important to stand
>     up for themselves.
>         Tools in #2 are currently being well covered by Aimee's WMS.
>         So I tried to think about possible #1 tools that would become
>     good webservices. The result of that is summarized in the Excel
>     spreadsheet attached.
>         I'm still not pretty sure what would be the best architecture
>     to implement them.
>         A possibility would be implement each of them as one object,
>     or methods within an object.
>         Each object would have a SOAP and/or XML/HTTP wrapper so they
>     become webservices.
>         This way, local server components can use the objects
>     directly. Remote components and other users can call the services
>     programmatically using the webservices. Some can be implemented in
>     Kansas, other at CRIA, or we could have copies in both places.
>     Pretty flexible.
>         Last but not least, there are the report cards to the
>     collections, that could use some of those services and some other
>     statistics from Lifemapper.
>         Questions:
>         1) What do you think of the general idea and the proposed
>     architecture?
>         2) What do you think of the services listed? Can you add some
>     more to the list?
>         3) Do you guys have more ideas for the report cards? What
>     should be in there?
>         4) Market analysis: would any one out there (collections
>     community) in need to use such services?
>
>         That is about it.
>         Cheers,
>
>     Ricardo
>
>
>
>     Attached is a set of 10 possible value added products that
>
>-- 
>Ricardo Scachetti Pereira
>Gerente de Pesquisa
>Centro de Referência em Informação Ambiental - CRIA
>http://www.cria.org.br/
>    
>
>
>     -- 
>     This message has been scanned for viruses and
>     dangerous content and is believed to be clean. 
>
>
> -- 
> This message has been scanned for viruses and
> dangerous content and is believed to be clean. 


-- 
Ricardo Scachetti Pereira
Gerente de Pesquisa
Centro de Referência em Informação Ambiental - CRIA
http://www.cria.org.br/



--------------010809040406030205070706
Content-Type: text/html; charset=us-ascii
Content-Transfer-Encoding: 7bit

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
  <meta http-equiv="Content-Type" content="text/html;charset=ISO-8859-1">
  <title></title>
</head>
<body>
&nbsp;&nbsp;&nbsp; It is good that you liked, Aimee.<br>
&nbsp;&nbsp;&nbsp; See answers below:<br>
<br>
Stewart, Aimee Marian wrote:<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu"> 
 
  <meta http-equiv="Content-Type" content="text/html; ">
  <title></title>
   
  <meta content="MSHTML 6.00.2800.1126" name="GENERATOR">
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Hi Ricardo,</font></span></div>
 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Great suggestions!&nbsp; I think that this is an excellent
way to  increase our value to data providers.&nbsp;&nbsp; I will think about some more
 services that can be implemented.&nbsp;&nbsp;</font></span></div>
 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">By the way, 2 services are available  at&nbsp;<a class="moz-txt-link-freetext" href="http:\\beta.lifemapper.org\Services\QueryByLocation">http:\\beta.lifemapper.org\Services\QueryByLocation</a>
 and&nbsp;&nbsp;<a class="moz-txt-link-freetext" href="http:\\beta.lifemapper.org\Services\QueryTaxa">http:\\beta.lifemapper.org\Services\QueryTaxa</a> .&nbsp; Number 6 on  your
spreadsheet could be easily be put together from methods in these  services.&nbsp;</font></span></div>
</blockquote>
You are right! <br>
There is also another implementation of this service that creates many models,
jackknifing the input datapoints (submitted paper by Town and I), that might
be interesting, too. Reed is involved on this one, too.<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu">
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2"> </font></span></div>
 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Number 7 would be easy once we gather layers for
areas outside the  US.&nbsp; I will get to that once we have successfully moved
to production.</font></span></div>
</blockquote>
Yeah! The data repositories that those kind of services rely on can be build
in phases, as more data is added on. For the parts of the world that we don't
have data, we output a diagnostic message: no data available. Those datasets
can also be used to normalize those same data fields.<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu">
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">  </font></span></div>
 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">We can create the initial services, then create
some applications using  the services to implement commiting the new data
back to provider databases  (interactively, one record at a time or groups
at a time for normalization, in  Specify perhaps).&nbsp; To get some interest
quickly,&nbsp;I think we need to  allow providers a way to improve their own data
without on-site  programmers.&nbsp;</font></span></div>
</blockquote>
Right.<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu">
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2"> </font></span></div>
 
  <div><span class="137075719-24012003"></span><span
 class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Report cards could include geostatistical analysis
of entire collections,  genus, species, subspecies.&nbsp; I've been wanting to
explore that ESRI  extension!</font></span></div>
</blockquote>
Yeah! I've been thinking about measuring correlation between a given set
of points (per species, genus, museum, collector, etc) and features related
to infrastructure (roads, rivers), and also with county centroids. That would
give a basis for comparing sampling between datasets. Also, we could measure
correlation of datapoints and ecological dimensions, but that is for other
purposes.<br>
In this line, I was also thinking about some kind of density measures (spatial
and temporal, such as, points per square mile, or per year), and also measures
of clustering.<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu"> 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Perhaps a short questionaire for existing contributors
could determine  whether there is any interest in these services, or if they
have some ideas we  haven't yet come up with.&nbsp; The initial Fishnet (Ed) reaction
was very  positive!</font></span></div>
</blockquote>
Did you show that to Ed? Cool!!!<br>
<blockquote type="cite"
 cite="midF73B91BED37FD5119F970008C7CF50FC0EE2D511@skylark.mail.ukans.edu"> 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"><font face="Palatino Linotype"
 color="#0000ff" size="2">Aimee</font></span></div>
 
  <div><span class="137075719-24012003"></span>&nbsp;</div>
 
  <div><span class="137075719-24012003"></span><font face="Tahoma">&nbsp;</font></div>
 
  <div><font face="Tahoma"><br>
  <font size="2"><span class="137075719-24012003">&nbsp;</span>-----Original  Message-----<br>
  <b>From:</b> Ricardo Scachetti Pereira  [<a class="moz-txt-link-freetext" href="mailto:ricardo@cria.org.br">mailto:ricardo@cria.org.br</a>]<br>
  <b>Sent:</b> Friday, January 24, 2003 11:47  AM<br>
  <b>To:</b> Beach, James H; <a class="moz-txt-link-abbreviated" href="mailto:voron999@UKANS.EDU">voron999@UKANS.EDU</a>; Stewart, Aimee Marian;  Downie,
Scott Russell; <a class="moz-txt-link-abbreviated" href="mailto:reed.beaman@yale.edu">reed.beaman@yale.edu</a><br>
  <b>Subject:</b> Value added  products to collection as webservices<br>
  <br>
  </font></font></div>
 
  <blockquote>&nbsp;&nbsp;&nbsp; Hey, guys.<br>
&nbsp;&nbsp;&nbsp; How have you    been?<br>
    <br>
&nbsp;&nbsp;&nbsp; Let me bring something up, so we can discuss    it and think about it
a little, if you want.<br>
&nbsp;&nbsp;&nbsp; I've been    thinking about value added products to collections and came
up with some    ideas.<br>
&nbsp;&nbsp;&nbsp; Among the categories of value added products, I    could list those:
    <br>
&nbsp;&nbsp;&nbsp; 1) <b>Error checking and correction    tools</b>, such as: normalization
of field values, checking for invalid    characters, checking compliance
of date or coordinate formats, checking    mispellings and typos, cross-reference
of textual location descriptions    (country, state, county, etc) with georeferences,
etc; Reed's georeferencing    tool could be put in this category, if you
consider that putting a    georeference in a record that does not contain
one means correcting it, i.e.,    filling in the blanks on georeference field.<br>
&nbsp;&nbsp;&nbsp; 2)    <b>Visualization tools</b>: Showing data on maps (like GIS tools
and WMS    services);<br>
&nbsp;&nbsp;&nbsp; 3) <b>Statistics and other analysis    tools</b>: Measuring clustering
of points, correlation between data points and    infrastructure, sampling
densities, etc.<br>
&nbsp;&nbsp;&nbsp; As far as I    know, the tools in category #3 don't lend themselves well
to a webservice    implementation. They could be implemented as services,
but they are not very    important to stand up for themselves.<br>
&nbsp;&nbsp;&nbsp; Tools in #2 are    currently being well covered by Aimee's WMS.<br>
&nbsp;&nbsp;&nbsp; So I tried    to think about possible #1 tools that would become good
webservices. The    result of that is summarized in the Excel spreadsheet
   attached.<br>
&nbsp;&nbsp;&nbsp; I'm still not pretty sure what would be the    best architecture to implement
them.<br>
&nbsp;&nbsp;&nbsp; A possibility would    be implement each of them as one object, or methods
within an object.    <br>
&nbsp;&nbsp;&nbsp; Each object would have a SOAP and/or XML/HTTP wrapper    so they become
webservices.<br>
&nbsp;&nbsp;&nbsp; This way, local server    components can use the objects directly. Remote
components and other users can    call the services programmatically using
the webservices. Some can be    implemented in Kansas, other at CRIA, or
we could have copies in both places.    Pretty flexible.<br>
&nbsp;&nbsp;&nbsp; Last but not least, there are the    report cards to the collections,
that could use some of those services and    some other statistics from Lifemapper.<br>
&nbsp;&nbsp;&nbsp;    Questions:<br>
&nbsp;&nbsp;&nbsp; 1) What do you think of the general idea and    the proposed architecture?<br>
&nbsp;&nbsp;&nbsp; 2) What do you think of the    services listed? Can you add some more
to the list?<br>
&nbsp;&nbsp;&nbsp; 3)    Do you guys have more ideas for the report cards? What should be
in    there?<br>
&nbsp;&nbsp;&nbsp; 4) Market analysis: would any one out there    (collections community)
in need to use such    services?<br>
    <br>
&nbsp;&nbsp;&nbsp; That is about it.<br>
&nbsp;&nbsp;&nbsp;    Cheers,<br>
    <br>
Ricardo<br>
    <br>
    <br>
    <br>
Attached is a set of 10 possible value    added products that<br>
    <pre class="moz-signature" cols="$mailwrapcol">-- 
Ricardo Scachetti Pereira
Gerente de Pesquisa
Centro de Refer&ecirc;ncia em Informa&ccedil;&atilde;o Ambiental - CRIA
<a class="moz-txt-link-freetext" href="http://www.cria.org.br/">http://www.cria.org.br/</a>
    </pre>
    <br>
-- <br>
This message has been scanned for viruses and <br>
dangerous    content and is believed to be clean. </blockquote>
  <br>
-- <br>
This message has been scanned for viruses and <br>
dangerous content and is believed to be clean. </blockquote>
<br>
<pre class="moz-signature" cols="$mailwrapcol">-- 
Ricardo Scachetti Pereira
Gerente de Pesquisa
Centro de Refer&ecirc;ncia em Informa&ccedil;&atilde;o Ambiental - CRIA
<a class="moz-txt-link-freetext" href="http://www.cria.org.br/">http://www.cria.org.br/</a>
</pre>
<br>
</body>
</html>

--------------010809040406030205070706--

