[ensembl-dev] Patch for "XrefParser::RefSeqGPFFParser"

Youens-Clark, Ken kclark at cshl.edu
Wed Aug 29 20:22:43 BST 2012


While building xrefs for A. thaliana, I found a GenBank record was not parsed correctly because the word "ORIGIN" was used in a context other than starting the sequence.  This patch fixes that problem.

$ cvs diff RefSeqGPFFParser.pm
Index: RefSeqGPFFParser.pm
===================================================================
RCS file: /cvsroot/ensembl/ensembl/misc-scripts/xref_mapping/XrefParser/RefSeqGPFFParser.pm,v
retrieving revision 1.55
diff -r1.55 RefSeqGPFFParser.pm
226c226
<       my ($seq) = $_ =~ /ORIGIN\s+(.+)/s; # /s allows . to match newline
---
>       my ($seq) = $_ =~ /^\s*ORIGIN\s+(.+)/ms; # /s allows . to match newline

--
Ken Youens-Clark
kclark at cshl.edu
Ware Lab/CSHL/USDA-ARS





More information about the Dev mailing list