Nomina V Recap

Another session of Nomina has been concluded.  EOL was nice enough to again host this workshop at the Marine Biological Laboratory in Woods Hole.  This time the focus was on methods for reconciling heterogeneous taxonomic data.  The group this time was made up of myself from SilverBiology, GBif, EOL, and Atlas of Living Australia (ALA).

The initial part of the workshop was an update on identifying the current status of existing projects, data, and services currently available or in development.  Once we figured out where everyone stood and what needed to be done we spent the remainder of the week improving and developing tools to get us towards some of the goals defined for the workshop.  EOL focused on integrating TaxaMatch in the the Global Names Index (globalnames.org) and way to generate their lexical groups using distributed computers accessing a jobs queue.  EOL also looked into offering socket style methods for faster processing throughput.  Bridging the different languages between all the projects was also a hot topic throughout the week and all of us looked into different ways to use resources from Ruby, Java, and PHP.

To view the project in detail follow this Nomina V Workshop link.

SilverBiology’s role during the workshop was to accomplish two things.

The first was to provide Darwin Core Archive (DwC-A) files for what is being called the NUB. These and other files can be found at: http://code.google.com/p/gbif-ecat/wiki/Nom5DataSource

Note: Most of these files still need to be revised to match the required meta.xml format.

The second project while I was there was to help develop a tool the easily adjust the lexical groups, blacklist and ratings of what the algorithems create.  We called this the Lexical Editor and a prototype can be found here.  Below is a sample screen shot of the tool in its development state. More on the lexical grouping can be found here.

Additional Useful Links:

Tags: , , , , , ,

Comments are closed.