New summer course on anatomy ontologies

February 27, 2012

A summer course on anatomy ontologies is being offered for the first time through the NESCent Academy and the Phenotype Ontology Research Coordination Network.  The intended audience is postgraduate researchers in evolutionary biology and informatics – including students, postdocs and faculty – who are relative newcomers to ontologies.  It will be held from 30-Jul to 3-Aug  2012 in Durham, NC.  Spread the word, and if you are interested be sure to apply before the deadline of 6-Apr.  [Full disclosure: a number of long-time friends of Phenoscape are among the instructors].

From the course website:

Evolutionary research has been revolutionized by the explosion of genetic information available, and anatomy ontologies must play a central crucial in relating this knowledge to observable diversity. Anatomy ontologies and vocabularies are widely used to index data and are critical for relating gene expression and phenotype data across taxa. Within a single species, anatomy ontologies provide scaffolding that interconnects many kinds of observations; across species, they provide evolutionary, developmental, and mechanistic insights. In order for anatomy ontologies to successfully serve all of these purposes, they must be constructed consistently so that they can be utilized and understood by both researcher and software alike. This course aims to teach proper ontology design principles and practices such that anatomical interoperability across evolutionarily disparate taxa is achieved. It further seeks to promote community growth and adoption of ontology-based methods and tools. The subsequent benefit is in the form of shared access to the unique data store of each community (e.g. genetic, genomic, developmental, and evolutionary data).

The course covers a basic introduction to ontology design principles and usage, specific ontology considerations for anatomy, application of anatomy ontologies in the context of evolutionary phenotype comparison, and use of anatomy ontologies for image annotation in different taxa. There will be strong emphasis on hands-on exercises that will develop ontology skills and provide exposure to different software applications that are useful in variety of areas of evolutionary biology.


Integrating the Paleobiology Database (PaleoDB) into our taxonomy workflow

February 14, 2012

In the original Phenoscape project, our focus was on asking comparative questions regarding living taxa. Although we added fossil taxa to the Teleost Taxonomy Ontology (TTO) when our publications included them, we had no general need to add fossil taxa to the contemporary groups provided by the Catalog of Fishes.   However, in our renewal, the focus has both expanded taxonomically (to all vertebrates) and narrowed to the evolution of fins and limbs.   The evolution of limbs from fins occurred over 300 million years ago, meaning the morphological data for this transition exists only in the fossil record.  Therefore, including fossil data and taxonomy has become essential.

These fossil taxa are not available in the major online sources of names, whether taxon-specific, such as Catalog of Fishes, or general such as Catalog of Life or the NCBI taxonomy. Although NCBI includes some fossil taxa, taxa are only included when a related molecular sequence is submitted, which will never be the case for the vast majority of fossil taxa. These latter taxa will only ever be represented as morphological remains.

This need for fossil data, along with the absence of names from recognized sources, requires us to either add names (and hopefully plausible taxonomy) as curators encounter them in papers, or find an alternative source for names of fossil taxa. Although we have and will continue to add fossil taxa to our taxonomy, we do not, and did not intend to become a name or taxonomy authority in our own right.  In light of the strengths and weaknesses of the Phenoscape team allying with a recognized source of fossil taxonomy seems the best option.

The Paleobiology database also called PaleoDB or simply PBDB is an online repository covering a wide range of paleontological data across all taxa represented in the fossil record. These data include names as well as taxonomic opinions appearing in paleontology publications. These data are available and queryable on the PBDB website and are also available for bulk download. As part of developing the Vertebrate Taxonomy Ontology (VTO), an expansion of the TTO to cover all vertebrates and several chordate groups of interest, I have implemented a tool that adds the content of these bulk downloads to a taxonomy ontology. The process of updating from PBDB was designed to minimize disruption to the existing taxonomy by only adding new taxa from PBDB along with whatever taxonomic lineage is required to link each new taxon to a taxon already known to the existing taxonomy. This way, updating from PBDB does not disrupt any existing taxonomic hierarchy we have either incorporated from other resources or were the result of prior curators’ efforts.

However, no taxonomic resource is ever complete. As our term of curators annotate publications, they are encountering fossil taxa unknown to PBDB, and have begun contributing the publication and taxonomy information back to the PBDB. John Alroy and the PBDB board have accepted several project members as authorizers and enterers of data into the PBDB. This allows us to give back to the PBDB as well as simplify the process of adding fossil taxa to our vertebrate taxonomy. We have developed a workflow where a curator can enter publications, names, and taxonomic opinions directly into the PBDB. This immediately makes our additions visible to a wider community and the opportunity to engage expertise we may not have known existed. Subsequent PBDB bulk downloads will include these new names and reflect any changes to the taxonomic opinions entered during curation. These will then be added to the next update of the VTO.


Collaborative editing in Phenex 1.2

February 13, 2012

We have recently released version 1.2.1 of our Phenex annotation software. This release adds some functionality for easier collaborative editing of data files. While our curators have used Subversion revision control software in the past, the new features make it more reliable to share Phenex data files with user-friendly file synchronization software such as Dropbox. While a NeXML document is open in Phenex, the application monitors for changes to the document file in the background. If the file is being shared via Dropbox and is simultaneously edited by someone else, Phenex will alert the user that the file has changed and offer to load the new version. If there are no unsaved edits then Phenex will reload the file automatically. Phenex 1.2 also provides an autosave feature which saves the document after every edit—this reduces the chance that the file might be edited elsewhere while one has unsaved changes, avoiding complicated file merges.


Teleost Anatomy Ontology adds French terms and synonyms

February 3, 2012

With the help of Phenoscape and DeepFin intern Ben Frable, I recently finished adding 117 French anatomical terms and synonyms from Chanet & Desoutter’s glossary publication [1] to the Teleost Anatomy Ontology (TAO). These authors spent many years defining and translating Paul Chabanaud’s anatomical analyses of flatfishes into modern French and English to help researchers understand his important publications. Adding these terms to the TAO takes their translation one step further, enabling computers to link Chabanaud’s unusual terms to an ontology ID for each anatomical ‘concept’, which in turn enables connections among all phenotypic and related data that reference this ID.

These synonyms can now be used in searches of the Phenoscape Knowledgebase. For example, you can see the French synonyms for ‘paired fin’. One can imagine ultimately being able to select a preferred language or term label when browsing the ontology in the Knowledgebase.

These were the first set of foreign terms to be added to the teleost ontology, and we had to tweak the Phenoscape Knowledgebase interface to display the diacritical marks correctly. We are ready to accept more! Please send me anything you’d like added or changed to the TAO term tracker.

[1] Chanet, B., & Desoutter-Meniger, M. (2008). French-English glossary of terms found in Chabanaud’s published works on Pleuronectiformes. Cybium, Electronic Publication no 1:1-23. PDF download