[ensembl-dev] Bio::Species doesn't like scientific names containing brackets

Nick Langridge nickl at ebi.ac.uk
Wed Jun 29 12:02:26 BST 2011


Hi,

I'm having problems with Bio::Species and species that have brackets in 
their scientific names, e.g. "Buchnera aphidicola (subsp. Acyrthosiphon 
pisum, strain 5A)".

Bio::Species tries to extract the genus, species, and subspecies from 
the scientific name, but it ends up with mismatched brackets, e.g.
genus:  "Buchnera"
species: "aphidicola (subsp."
subspecies: "Acyrthosiphon pisum, strain 5A)"

This causes an 'Unmatched ( in regex' runtime error when the module 
later tries to use the species value directly in a regex.

Does anyone know what should be happening here? Are brackets allowed, 
and if so, how should Bio::Species be dealing with them?

The runtime error would be easy to trap by escaping the text in the 
regex, but I suspect that really the problem is that species/subspecies 
shouldn't contain brackets in the first place (?)
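
To make the failure and the workaround concrete, here is a minimal sketch. 
It is written in Python purely for illustration and is not the Bio::Species 
code; the function names are hypothetical. In Perl the equivalent escape 
would be quotemeta() or \Q...\E inside the pattern.

```python
import re

# Values as Bio::Species ends up with them for this organism.
species = "aphidicola (subsp."
full_name = "Buchnera aphidicola (subsp. Acyrthosiphon pisum, strain 5A)"


def matches_raw(value, text):
    """Use the value directly as a pattern (hypothetical stand-in for the failing code)."""
    return re.search(value, text) is not None


def matches_escaped(value, text):
    """Escape regex metacharacters first, so the value is matched literally."""
    return re.search(re.escape(value), text) is not None


# The unbalanced "(" makes the raw value an invalid pattern.
try:
    matches_raw(species, full_name)
    raw_failed = False
except re.error:
    raw_failed = True

# With escaping, the same value matches as a literal string.
escaped_ok = matches_escaped(species, full_name)
```

This only stops the runtime error; the species value would still carry a 
stray opening bracket, which is why I suspect the parsing is the real issue.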

Cheers,
Nick