[ensembl-dev] VEP does not like whitespace in VCF input

Andrew Parton aparton at ebi.ac.uk
Mon May 14 17:41:20 BST 2018


Hi João,

Currently, VEP does not support blank lines within VCF files, however this is something that we intend to change in a future release. In the meantime however, you will need to remove blank lines from your input files for VEP to process them correctly.

Kind Regards,
Andrew

> On 12 May 2018, at 12:58, João Eiras <joao.eiras at gmail.com> wrote:
> 
> Hi.
> 
> I've upgraded VEP in my pipeline to version 92, from version 87.
> I have a big test suite (275 vcf files) with many VCF files which test
> my pipeline, and those are artificial files, created manually which
> naturally contain comments and whitespace.
> 
> VEP 92 stop reading the input when it finds a blank line immediately
> after the header :( Meaning, if my VCF headers and variant lines are
> condensed without any whitespace in the middle, VEP will be able to
> annotate my variants, else it stops at the first blank line.
> 
> Example:
> $ perl $vep_dist/vep \
>  -i a.vcf --format vcf \
>  --cache --offline --merged \
>  --species mus_musculus --assembly GRCm38 \
>  --force_overwrite --verbose \
>  --dir $vep_cache \
>  -o a.vcf.txt \
> 
> $ cat a.vcf
> ##fileformat=VCFv4.1
> ##reference=GRCm38
> #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT
> 
> # Substitutions, non synonymous
> # AGA -> AGT (R/s)
> chr1 99772782 . A T 5000 . . .
> 
> $ cat a.vcf.txt
> ## ENSEMBL VARIANT EFFECT PREDICTOR v92.1
> # ...
> #Uploaded_variantion
> 
> ^^^ End of file
> 
> If my input file however looks like the following it does produce output:
> 
> 
> $ cat a.vcf
> ##fileformat=VCFv4.1
> ##reference=GRCm38
> #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT
> # Substitutions, non synonymous
> # AGA -> AGT (R/s)
> chr1 99772782 . A T 5000 . . .
> 
> # CTG -> ATG (L/m)
> chr1 99772783 . C A 5000 . . .
> 
> $ cat a.vcf.txt
> ## ENSEMBL VARIANT EFFECT PREDICTOR v92.1
> # ...
> #Uploaded_variantion
> chr1_99772782_A/T       ...
> chr1_99772783_C/A       ...
> 
> Is this behavior expected ?
> 
> Thank you.
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/




More information about the Dev mailing list