[ensembl-dev] HG375_PATCH features out of bounds

Kamil Slowikowski kslowikowski at gmail.com
Wed Aug 14 17:26:02 BST 2013


There exist features outside the coordinates listed for HG375_PATCH. I'm
wondering if this is expected or if this is an error.


ftp://ftp.ensembl
.org/pub/release-72/fasta/homo_sapiens/dna/Homo_sapiens.GRCh37.72.dna.chromosome.HG375_PATCH.fa.gz

zcat Homo_sapiens.GRCh37.72.dna.chromosome.HG375_PATCH.fa.gz | head -n1
>HG375_PATCH dna:chromosome
chromosome:GRCh37:HG375_PATCH:104423968:104489001:1 PATCH_FIX

Notice that the last position is 104489001.


ftp://ftp.ensembl
.org/pub/release-72/gtf/homo_sapiens/Homo_sapiens.GRCh37.72.gtf.gz

zcat Homo_sapiens.GRCh37.72.gtf.gz | grep HG375_PATCH | cut -f1-5 | head
HG375_PATCH protein_coding exon 103810996 103811732
HG375_PATCH protein_coding exon 103903576 103903676
HG375_PATCH protein_coding CDS 103903595 103903676
HG375_PATCH protein_coding start_codon 103903595 103903597
HG375_PATCH protein_coding exon 104440157 104440430
HG375_PATCH protein_coding CDS 104440157 104440430
HG375_PATCH protein_coding exon 104478500 104478686
HG375_PATCH protein_coding CDS 104478500 104478686
HG375_PATCH protein_coding exon 104512069 104512222
HG375_PATCH protein_coding CDS 104512069 104512222

Notice the positions such as 104512069 are greater than 104489001.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20130814/861cc42f/attachment.html>


More information about the Dev mailing list