[ensembl-dev] Patch for "XrefParser::RefSeqGPFFParser"
Youens-Clark, Ken
kclark at cshl.edu
Wed Aug 29 20:22:43 BST 2012
While building xrefs for A. thaliana, I found a GenBank record was not parsed correctly because the word "ORIGIN" was used in a context other than starting the sequence. This patch fixes that problem.
$ cvs diff RefSeqGPFFParser.pm
Index: RefSeqGPFFParser.pm
===================================================================
RCS file: /cvsroot/ensembl/ensembl/misc-scripts/xref_mapping/XrefParser/RefSeqGPFFParser.pm,v
retrieving revision 1.55
diff -r1.55 RefSeqGPFFParser.pm
226c226
< my ($seq) = $_ =~ /ORIGIN\s+(.+)/s; # /s allows . to match newline
---
> my ($seq) = $_ =~ /^\s*ORIGIN\s+(.+)/ms; # /s allows . to match newline
--
Ken Youens-Clark
kclark at cshl.edu
Ware Lab/CSHL/USDA-ARS
More information about the Dev
mailing list