[ensembl-dev] Question Regarding VEP script warnings and --html option

Mathias Lesche mathias.lesche at biotec.tu-dresden.de
Wed Dec 11 12:27:13 GMT 2013


Hey Will,

thanks for the answers.

On 12/11/2013 01:02 PM, Will McLaren wrote:
> The first warning, ($data[4] is uninitialized), happens because the 5th
> column (the alternate allele, ALT) of your VCF line is not present -
> this usually means that there are fewer than 5 columns of data on that
> particular line, or the delimiter between the columns (space or tab) is
> not present or corrupted somehow.
>
> VEP expects at least the first 5 columns of the VCF (CHR, POS, ID, REF,
> ALT) to be populated or to have a null "." value - have a check of your
> VCF file to make sure that it looks OK. The vcftools package has a tool
> called vcf-validator which you can use to validate the format of VCF
> files, see http://vcftools.sourceforge.net/docs.html

You were absolutely right. The script complains about the last line 
which was an empty line. I removed it and it ran smoothly.

>
> Currently it's not possible to produce HTML (--html) and VCF (--vcf)
> output - to see the HTML output, remove --vcf from your command line. If
> you want the VCF output as well, you'd have to do that in a separate VEP
> run. If you intended to indicate that the input format was VCF, you
> should use "--format vcf".

Ah okay. I always ran both options and did not know that it is not 
possible. I ran them separately now and


Cheers
Mathias



>
> Regards
>
> Will McLaren
> Ensembl Variation
>
>
>
>
> On 11 December 2013 11:14, Mathias Lesche
> <mathias.lesche at biotec.tu-dresden.de
> <mailto:mathias.lesche at biotec.tu-dresden.de>> wrote:
>
>     Dear VEP developers,
>
>     I have two questions regarding the current VEP script (Version 74).
>
>     I call the script in the following way:
>     /opt/rocks/bin/perl
>     /projects/seq-work/user/all/__vep/vep-74/variant_effect___predictor.pl
>     <http://variant_effect_predictor.pl> -i input.vcf -o ouput.vcf
>     --species homo_sapiens --fork 6 --format vcf --cache --dir_cache
>     /projects/seq-work/user/all/__vep/vep-cache/ --sift p --polyphen p
>     --humdiv --total_length --numbers --protein --canonical --pubmed
>     --gmaf --maf_1kg --maf_esp --vcf --html --offline
>
>     When running the script I get the following warning:
>     Use of uninitialized value $data[4] in string eq at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 415, <GEN0> line 837.
>     Use of uninitialized value $ref in length at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 448, <GEN0> line 837.
>     Use of uninitialized value $alt in pattern match (m//) at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 451, <GEN0> line 837.
>     Use of uninitialized value $alt in split at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 540, <GEN0> line 837.
>     Use of uninitialized value $alt in pattern match (m//) at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 549, <GEN0> line 837.
>     Use of uninitialized value $ref in concatenation (.) or string at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 621, <GEN0> line 837.
>     Use of uninitialized value $alt in concatenation (.) or string at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 621, <GEN0> line 837.
>     Use of uninitialized value $data[2] in string eq at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 621, <GEN0> line 837.
>     Use of uninitialized value in pattern match (m//) at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 2395, <GEN0> line 837.
>     Use of uninitialized value in substitution (s///) at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 2395, <GEN0> line 837.
>     Use of uninitialized value in string eq at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 2396, <GEN0> line 837.
>     Use of uninitialized value in pattern match (m//) at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 2401, <GEN0> line 837.
>     Use of uninitialized value in concatenation (.) or string at
>     /projects/seq-work/user/all/__vep/vep-74/Bio/EnsEMBL/__Variation/Utils/VEP.pm
>     line 2402, <GEN0> line 837.
>
>     But the calculation of the variations finishes. I'm just wondering
>     if it comes from the vcf file or is from the script itself.
>
>     Secondly the output.vcf.html is empty. It doesn't show the variants
>     or any links to Ensembl. Is there a way to fix it? But I use the
>     --html option.
>
>     I have to say I don't run vep on the whole vcf file. It is only a
>     portion of it including the header.
>
>     Regards,
>     Mathias Lesche
>     --
>     Mathias Lesche, Dipl. Bioinformatician
>     Deep Sequencing Group - SFB655
>     c/o DFG-Center for Regenerative Therapies Dresden
>     Cluster of Excellence / TU Dresden
>     Fetscherstraße 105
>     01307 Dresden
>
>     Phone: +49 (351) 458 82362 <tel:%2B49%20%28351%29%20458%2082362>
>     Email: mathias.lesche(at)biotec.tu-__dresden.de
>     <http://biotec.tu-dresden.de>
>
>     _________________________________________________
>     Dev mailing list Dev at ensembl.org <mailto:Dev at ensembl.org>
>     Posting guidelines and subscribe/unsubscribe info:
>     http://lists.ensembl.org/__mailman/listinfo/dev
>     <http://lists.ensembl.org/mailman/listinfo/dev>
>     Ensembl Blog: http://www.ensembl.info/
>
>
>
>
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/
>




More information about the Dev mailing list