[ensembl-dev] VEP result does not match start-end coordinate and allele for hgvs syntax

Daniel Suveges ds26 at sanger.ac.uk
Fri Sep 16 16:15:20 BST 2016


HI Bhavana,

I think the transcript is on the -1 strand so the start/end coordinates have to be swapped.
http://grch37.rest.ensembl.org/vep/human/hgvs/ENST00000460680.1:c.783_661+1del124?content-type=application/json;canonical=1&ccds=1&hgvs=1

But that's just a guess.
Best,
Daniel

From: <dev-bounces at ensembl.org<mailto:dev-bounces at ensembl.org>> on behalf of Bhavana Harsha <bh4 at sanger.ac.uk<mailto:bh4 at sanger.ac.uk>>
Reply-To: Ensembl developers list <dev at ensembl.org<mailto:dev at ensembl.org>>
Date: Friday, September 16, 2016 at 3:40 PM
To: "dev at ensembl.org<mailto:dev at ensembl.org>" <dev at ensembl.org<mailto:dev at ensembl.org>>
Subject: [ensembl-dev] VEP result does not match start-end coordinate and allele for hgvs syntax

Hello,

We’re trying to validate some mutations and see other co-located variants by using the VEP REST service for GRCh37, like this:

http://grch37.rest.ensembl.org/vep/human/hgvs/ENST00000460680.1:c.661_783+1del124?content-type=application/json;canonical=1&ccds=1&hgvs=1

And I get the following error:
{“error":"Start (52440390) must be less than or equal to end+1 (52440270)"}


If I include the deleted sequence in the hgvs syntax, I get a different error:

Input:
http://grch37.rest.ensembl.org/vep/human/hgvs/ENST00000460680.1:c.661_783+1delGAGCCCTACCACGACATCCGCTTCAACCTGATGGCAGTGGTGCCCGACCGCAGGATCAAGTATGAGGCCAGGCTGCATGTGCTGAAGGTGAACCGTCAGACAGTACTAGAGGCTCTGCAGCAGG?content-type=application/json;canonical=1&ccds=1&hgvs=1

Output:
{“error":"Reference allele extracted from ENST00000460680:52440390-52440269 (AGCCCTACCACGACATCCGCTTCAACCTGATGGCAGTGGTGCCCGACCGCAGGATCAAGTATGAGGCCAGGCTGCATGTGCTGAAGGTGAACCGTCAGACAGTACTAGAGGCTCTGCAGCAG) does not match reference allele given by HGVS notation ENST00000460680.1:c.661_783+1delGAGCCCTACCACGACATCCGCTTCAACCTGATGGCAGTGGTGCCCGACCGCAGGATCAAGTATGAGGCCAGGCTGCATGTGCTGAAGGTGAACCGTCAGACAGTACTAGAGGCTCTGCAGCAGG (GAGCCCTACCACGACATCCGCTTCAACCTGATGGCAGTGGTGCCCGACCGCAGGATCAAGTATGAGGCCAGGCTGCATGTGCTGAAGGTGAACCGTCAGACAGTACTAGAGGCTCTGCAGCAGG)"}


This is strange because shouldn’t the transcript coordinates ‘ENST00000460680.1:c.661-783+1’ correspond to ‘3:g.52440268-52440391’? But these aren’t the coordinates returned and the allele sequence is inconsistent with the input.

Is there something I’m missing here?

Cheers,
Bhavana


---------------------------------------------
Bhavana Harsha
Bioinformatician
COSMIC
Wellcome Trust Sanger Institute
Hinxton, UK
CB10 1SA



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20160916/717f7a75/attachment.html>


More information about the Dev mailing list