[ensembl-dev] Allele specific custom annotation

Levine, Adam a.levine at ucl.ac.uk
Mon Mar 10 15:30:11 GMT 2014


I have a query regarding performing custom annotation using the VEP. I would like to annotate specific allele changes with a score, i.e. a G to T with score X but G to A at the same position with score Y. It seems, however, that the VEP only annotates on the basis of position and does not consider the allele change. Am I correct? If so, is there a way to set it to use custom annotation tracks in an allele specific manner?

The custom annotations are in VCF format, e.g.:

##fileformat=VCFv4.0
#CHROM        POS     ID        REF     ALT     QUAL  FILTER            INFO
21        26960070        GT_scoreX      G         T          .           .           .
21        26960070        GA_scoreY      G         A         .           .           .

The input file looks like this:

##fileformat=VCFv4.0
#CHROM        POS     ID        REF     ALT     QUAL  FILTER            INFO
21        26960070        rs116645811   G         A         .           .           .

My command is:

perl variant_effect_predictor.pl \
--input_file example_single_variant.vcf \
--format vcf \
--custom test_custom.vcf.gz,test_custom,vcf,exact \
--cache

The output looks like this:

## ENSEMBL VARIANT EFFECT PREDICTOR v75
## Output produced at 2014-03-10 14:34:53
## Connected to homo_sapiens_core_75_37 on ensembldb.ensembl.org
## Using cache in /home/Levine/.vep/homo_sapiens/75
## Using API version 75, DB version 75
## Extra column keys:
## DISTANCE : Shortest distance from variant to transcript
## STRAND : Strand of the feature (1/-1)
## test_custom : test_custom.vcf.gz (exact)
#Uploaded_variation   Location          Allele   Gene    Feature Feature_type    Consequence   cDNA_position            CDS_position  Protein_position           Amino_ac
ids        Codons            Existing_variation        Extra
rs116645811   21:26960070   A         ENSG00000260583    ENST00000567517     Transcript            upstream_gene_variant            -           -           -           -           -           -
STRAND=-1;test_custom=GT_scoreX,GA_scoreY;DISTANCE=4432
rs116645811   21:26960070   A         ENSG00000154719    ENST00000352957     Transcript        intron_variant  -            -           -           -           -           -           STRAND=-
1;test_custom=G_A,G_T
rs116645811   21:26960070   A         ENSG00000154719    ENST00000307301     Transcript        missense_variant            1043    1001    334      T/M      aCg/aTg           -
STRAND=-1;test_custom=GT_scoreX,GA_scoreY

You can see the variant in the input (G>A) is annotated with both G_A and G_T. I can of course, pull out the relevant annotation (score X for G>T, score Y for G>A) myself manually after the fact but it would be great if the VEP could do it directly.

Thank you,

Adam

Adam P. Levine

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20140310/53063433/attachment.html>


More information about the Dev mailing list