phyloXML is an XML language for the analysis, exchange, and storage of phylogenetic trees (or networks) and associated data. The structure of phyloXML is described by XML Schema Definition (XSD) language.
2009.07.03: Beta version for phyloXML 1.10 released: http://www.phyloxml.org/1.10/phyloxml.xsd
2009.06.26: Proposed changes for version 1.10: phyloxml.blogspot.com
2009.04.20: BioPython (»project page, »wiki, »source code) and BioRuby (»project page, »wiki, »source code) implementations of the phyloXML format are being supported by Google Inc. as part of the Google Summer of Code™ 2009 program and sponsored by the National Evolutionary Synthesis Center (NESCent) as part of the Phyloinformatics Summer of Code 2009.
http://www.phyloxml.org/1.00/phyloxml.xsd
1. Download the newest version of the forester libraries:
» forester.jar
2. Download an example phyloXML file:
» bcl_2.xml (Bcl-2 gene family tree with gene duplications, support values, and taxonomy data) or
» apaf.xml (Apaf-1 gene family tree with domain architectures) or
» ncbi_taxonomy_mollusca.xml.zip (Mollusca from the NCBI taxonomy with 5632 external nodes)
3. Click on file "forester.jar" (ATV should start; or use "java -cp path\to\forester.jar org.forester.atv.ATV") and use "File"|"Read tree from file..." to load file "bcl_2.xml", "apaf.xml", or "ncbi_taxonomy_mollusca.xml.zip".
A shortcoming of current formats for describing
phylogenetic trees (such as
Nexus and
Newick/New Hampshire) is a lack of a standardized means
to annotate tree nodes and branches with distinct data
fields (which in the case of a basic species tree might be:
taxon names, branch lengths, and, possibly multiple,
support values).
Data storage and exchange is even more cumbersome in
studies in which trees are the result of a
reconciliation of some kind:
To alleviate this, a variety of ad-hoc, special purpose formats have come into use (such as the NHX format, which focuses on the needs of gene-function and phylogenomic studies).
A well defined XML format addresses these problems in a general and extensible manner and allows for interoperability, both between and in between specialized and general purpose software.
BioPerl (project demo) | TreeViewJ | TreeGradients | Archaeopteryx | phyloxml_converter | decorator
The purposes of this group are (i) to inform about new developments concerning phyloXML and (ii) to provide a platform for discussing ideas regarding phyloXML.
groups.yahoo.com/group/phyloxml
Retree | TreeML | NeXML | GraphML | Representation and Retrieval Method for Molecular Phylogenetic Tree Data Based on XML [Joho Shori Gakkai Kenkyu Hokoku article] | Taxonomic markup language [Bioinformatics article] | XGMML (eXtensible Graph Markup and Modeling Language) | BioPax | GraphXML [pdf] | XML for Molecular Biology as compiled by Paul Gordon | XML-biology links
Many of the ideas for the phyloXML format are the results of discussions by the members of the phyloXML discussion group. Ethalinda Cannon contributed to the development of ATV/Archaeopteryx and phyloXML. Additional progress on phyloXML and its implementations are due to the BioHackathon 2008 (towards integrated web service in life science with Open Bio* libraries). The BioPerl implementation of the phyloXML format was supported by Google Inc. as part of the Google Summer of Code™ 2008 program and was sponsored by the National Evolutionary Synthesis Center (NESCent).
Christian M Zmasek | Burnham Institute for Medical Research | cmzmasek yahoo com
phyloXML blog | Archaeopteryx | forester | www.phylosoft.org