[ensembl-dev] VEP 86 GRCh38 - alt contig names not recognized

MEYNERT Alison alison.meynert at igmm.ed.ac.uk
Mon Dec 5 15:38:33 GMT 2016


Hi Will,


Thanks, have just tested the update. Some of the contig names now work:


chr11_KI270721v1
chr14_GL000009v2
chr14_GL000194v1
chr14_GL000225v1
chr14_KI270726v1
chr15_KI270727v1
chr16_KI270728v1
chr17_GL000205v2
chr1_KI270711v1
chr1_KI270713v1
chr22_KI270731v1
chr22_KI270733v1
chr22_KI270734v1
chrUn_GL000195v1
chrUn_GL000213v1
chrUn_GL000216v2
chrUn_GL000218v1
chrUn_GL000219v1
chrUn_GL000220v1
chrUn_KI270442v1
chrUn_KI270744v1
chrUn_KI270750v1


but a large number (131) still give warnings (attached).
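
A list like the attached one can be pulled out of the VEP warnings with a few lines of Python. This is only a sketch: it assumes the warnings have the form quoted later in this thread ("WARNING: Chromosome ... not found in cache on line ..."), and the function name and the second sample line are made up for illustration.

```python
import re

# Pattern for the warning format quoted in this thread. Anything VEP prints
# in a different format is simply ignored.
WARNING = re.compile(r"^WARNING: Chromosome (\S+) not found in cache on line (\d+)")

def missing_contigs(lines):
    """Count how many warnings each unrecognised contig name produced."""
    counts = {}
    for line in lines:
        match = WARNING.match(line)
        if match:
            name = match.group(1)
            counts[name] = counts.get(name, 0) + 1
    return counts

# First line is the warning quoted in the thread; the second is a
# hypothetical repeat on another input line, and the third is unrelated text.
sample = [
    "WARNING: Chromosome Un_GL000218v1 not found in cache on line 3416412",
    "WARNING: Chromosome Un_GL000218v1 not found in cache on line 3416413",
    "some other message",
]

for name, count in sorted(missing_contigs(sample).items()):
    print(name, count)
```

The same function can be fed an open file handle instead of a list, since it only iterates over lines.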


Cheers,

Alison


--

Alison Meynert
alison.meynert at igmm.ed.ac.uk

MRC Human Genetics Unit
MRC IGMM
University of Edinburgh
Western General Hospital
Crewe Road, Edinburgh EH4 2XU
United Kingdom
________________________________
From: dev-bounces at ensembl.org <dev-bounces at ensembl.org> on behalf of Will McLaren <wm2 at ebi.ac.uk>
Sent: 22 November 2016 10:01:25
To: Ensembl developers list
Subject: Re: [ensembl-dev] VEP 86 GRCh38 - alt contig names not recognized

Hi Alison,

Thanks for bringing this to our attention. There was a bug in the way VEP handled chromosome names beginning with "chr".

This should now be fixed in the release/86 branch of the ensembl-variation module; if you used INSTALL.pl to set up your command-line VEP, you can re-run it to pick up the changes.

Regards

Will McLaren
Ensembl Variation

On 21 November 2016 at 15:53, MEYNERT Alison <alison.meynert at igmm.ed.ac.uk> wrote:

I've been annotating some GRCh38 (hs38DH) VCFs with VEP 86, and it's working fine for the primary assembly, but I'm getting warnings on the alt contig names, e.g.


WARNING: Chromosome Un_GL000218v1 not found in cache on line 3416412

It looks like VEP is just chopping the 'chr' prefixes and not fully mapping the alt contig names to the Ensembl-style versions. Both the web interface and the standalone Perl script give the same results; however, interestingly, when I enter a variant like this into the web interface, e.g.


chrUn_GL000218v1 160424 . C CTTT,CTTTT

and run the "instant VEP", I get a result (see attached screenshot). If I fix the contig name manually to GL000218.1, I get full results.
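
The manual fix described above (chrUn_GL000218v1 to GL000218.1) can be sketched in Python as follows. This is an illustration under assumptions, not VEP's own mapping: the regular expression encodes the usual UCSC naming pattern for GRCh38 scaffolds (placement, accession, "v" plus version, optional suffix), the function name is hypothetical, and only the GL000218.1 case is confirmed in this thread. Names that do not fit the pattern, such as primary chromosomes, are returned as None rather than guessed.

```python
import re

# Assumed UCSC-style scaffold pattern: chr<placement>_<accession>v<version>,
# optionally followed by _alt, _random or _fix.
UCSC_SCAFFOLD = re.compile(
    r"^chr(?:[0-9]{1,2}|X|Y|Un)_([A-Z]{2}[0-9]{6})v([0-9]+)(?:_(?:alt|random|fix))?$"
)

def ucsc_to_accession(name):
    """Return the accession.version form of a UCSC scaffold name, or None."""
    match = UCSC_SCAFFOLD.match(name)
    if match is None:
        return None
    accession, version = match.groups()
    return f"{accession}.{version}"

print(ucsc_to_accession("chrUn_GL000218v1"))  # GL000218.1
print(ucsc_to_accession("chr1"))              # None (not a scaffold name)
```

Rewriting the first column of a VCF with a function like this before running VEP would mimic the manual edit; whether every converted name is present in the cache would still need checking.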

Cheers,
Alison




-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: contigs_not_found.txt
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20161205/62f1d66a/attachment.txt>

