[ensembl-dev] Variant effect predictor: regulatory information and MATRIX/HIGH_INF_POS

A. P. Levine a.levine at ucl.ac.uk
Wed Oct 5 21:49:16 BST 2011


I have two questions regarding the Variant Effect Predictor (VEP).

1. Regulatory information

When I use the VEP online it gives me the regulatory feature information without a problem. However, when I use the Perl script it does not report this information, e.g.:

Input (VCF format):
18   10304   .       TACCC   TAACCC
18   10333   .       T       C
18   10334   .       T       A
18   10405   .       C       T
18   10411   .       TA      TAA

Output from web version:
Uploaded Variation    Location    Allele    Gene    Feature    Feature type    Consequence    Position in cDNA    Position in CDS    Position in protein    Amino acid change    Codon change    Co-located Variation    Extra
18_10305_ACCC/AACCC    18:10305-10308    -    -    -    -    INTERGENIC    -    -    -    -    -    -    -
18_10305_ACCC/AACCC    18:10305-10308    AACCC    -    ENSR00000667451    RegulatoryFeature    REGULATORY_REGION    -    -    -    -    -    -    -
18_10333_T/C    18:10333    C    -    ENSR00000667451    RegulatoryFeature    REGULATORY_REGION    -    -    -    -    -    -    -
18_10333_T/C    18:10333    -    -    -    -    INTERGENIC    -    -    -    -    -    -    -
18_10334_T/A    18:10334    A    -    ENSR00000667451    RegulatoryFeature    REGULATORY_REGION    -    -    -    -    -    -    -
18_10334_T/A    18:10334    -    -    -    -    INTERGENIC    -    -    -    -    -    -    -
18_10405_C/T    18:10405    -    -    -    -    INTERGENIC    -    -    -    -    -    -    -
18_10405_C/T    18:10405    T    -    ENSR00000667451    RegulatoryFeature    REGULATORY_REGION    -    -    -    -    -    -    -
18_10441_C/T    18:10441    T    -    ENSR00000667451    RegulatoryFeature    REGULATORY_REGION    -    -    -    -    -    rs56928311    -

Output from Perl script (perl variant_effect_predictor.pl -i test -o test.out --sift b --polyphen b --condel b --regulatory --hgvs --gene --hgnc --check_existing):
#Uploaded_variation     Location        Allele  Gene    Feature Feature_type    Consequence     cDNA_position   CDS_position    Protein_position        Amino_acids     Codons  Existing_variation    Extra
18_10305_ACCC/AACCC     18:10305-10308  -       -       -       -       INTERGENIC      -       -       -       -       -       -       -
18_10333_T/C    18:10333        -       -       -       -       INTERGENIC      -       -       -       -       -       -       -
18_10334_T/A    18:10334        -       -       -       -       INTERGENIC      -       -       -       -       -       -       -
18_10405_C/T    18:10405        -       -       -       -       INTERGENIC      -       -       -       -       -       -       -
18_10441_C/T    18:10441        -       -       -       -       INTERGENIC      -       -       -       -       -       rs56928311      -

Any thoughts on why it might not be working would be appreciated.

2. MATRIX/HIGH_INF_POS

The header from running the perl script is as follows:
    ## ENSEMBL VARIANT EFFECT PREDICTOR v2.1
    ## Output produced at 2011-10-05 21:44:03
    ## Connected to homo_sapiens_core_63_37 on ensembldb.ensembl.org
    ## Using API version 63, DB version 63
    ## Extra column keys:
    ## HGNC         : HGNC gene identifier
    ## ENSP         : Ensembl protein identifer
    ## HGVSc        : HGVS coding sequence name
    ## HGVSp        : HGVS protein sequence name
    ## SIFT         : SIFT prediction
    ## PolyPhen     : PolyPhen prediction
    ## Condel       : Condel SIFT/PolyPhen consensus prediction
    ## MATRIX       : The source and identifier of a transcription factor binding profile aligned at this position
    ## HIGH_INF_POS : A flag indicating if the variant falls in a high information position of a transcription factor binding profile

Are MATRIX and HIGH_INF_POS operational?

Thank you,

Adam

Adam P. Levine
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20111005/fb9c776f/attachment.html>


More information about the Dev mailing list