[ensembl-dev] Variant Effect Predictor not using HTS to read gzipped fata

Hollizeck, Sebastian Sebastian.Hollizeck at med.uni-muenchen.de
Wed Dec 7 13:12:12 GMT 2016


Hi,

I installed the standalone script in my module system for usage on our server, but i cant seem to get it working with the cache.
The script works with just using the --database option, but when i use the --cache i get the following error 

MSG: ERROR: Cannot index bgzipped FASTA file with Bio::DB::Fasta

this is the full output with verbose

[hollizeck at kkf0f6ee kleinPipeline]$ variant_effect_predictor.pl  --check_existing --gmaf --maf_1kg --maf_esp --maf_exac  --pubmed --regulatory --species homo_sapiens --port 3337 --buffer_size 40000 --cache --minimal  --sift b --polyphen b --ccds --symbol --biotype --gene_phenotype --variant_class  --fork 6 --force_overwrite --filter_common --dir_cache /opt/modules/i12g/ensembl-tools/85/cachedir -i /archive/sample_data/scn281/scn281pa_fa.recalibrated_variants.vcf.gz --vcf -o /archive/sample_data/scn281/scn281pa_fa.recalibrated_variants_vep.vcf.gz -v#----------------------------------#
# ENSEMBL VARIANT EFFECT PREDICTOR #
#----------------------------------#

version 85

By Will McLaren (wm2 at ebi.ac.uk)

Configuration options:

biotype            1
buffer_size        40000
cache              1
ccds               1
check_existing     1
core_type          core
dir                /opt/modules/i12g/ensembl-tools/85/cachedir
dir_cache          /opt/modules/i12g/ensembl-tools/85/cachedir
dir_plugins        /home/hollizeck/.vep/Plugins
filter_common      1
force_overwrite    1
fork               6
gene_phenotype     1
gmaf               1
host               ensembldb.ensembl.org
input_file         /archive/sample_data/scn281/scn281pa_fa.recalibrated_variants.vcf.gz
maf_1kg            1
maf_esp            1
maf_exac           1
minimal            1
numbers            1
output_file        /archive/sample_data/scn281/scn281pa_fa.recalibrated_variants_vep.vcf.gz
polyphen           b
port               3337
pubmed             1
regulatory         1
sift               b
species            homo_sapiens
stats              HASH(0x58ccbc0)
symbol             1
variant_class      1
vcf                1
verbose            1

--------------------

Will only load v85 databases
Species 'homo_sapiens' loaded from database 'homo_sapiens_core_85_37'
Species 'homo_sapiens' loaded from database 'homo_sapiens_cdna_85_37'
Species 'homo_sapiens' loaded from database 'homo_sapiens_vega_85_37'
Species 'homo_sapiens' loaded from database 'homo_sapiens_otherfeatures_85_37'
Species 'homo_sapiens' loaded from database 'homo_sapiens_rnaseq_85_37'
homo_sapiens_variation_85_37 loaded
homo_sapiens_funcgen_85_37 loaded
No ancestral database found
No ontology database found
No taxonomy database found
No ensembl_metadata database found
No production database or adaptor found
2016-12-07 14:04:31 - Connected to core version 85 database and variation version 85 database
2016-12-07 14:04:31 - Read existing cache info
2016-12-07 14:04:32 - Auto-detected FASTA file in cache directory

-------------------- EXCEPTION --------------------
MSG: ERROR: Cannot index bgzipped FASTA file with Bio::DB::Fasta

STACK Bio::EnsEMBL::Variation::Utils::FastaSequence::setup_fasta /opt/modules/i12g/ensembl-tools/85/modules/Bio/EnsEMBL/Variation/Utils/FastaSequence.pm:194
STACK main::configure /opt/modules/i12g/ensembl-tools/85/bin/variant_effect_predictor.pl:835
STACK toplevel /opt/modules/i12g/ensembl-tools/85/bin/variant_effect_predictor.pl:146
Date (localtime)    = Wed Dec  7 14:04:32 2016
Ensembl API version = 85
---------------------------------------------------

This looks like the program does not select the Bio:DB:HTS module but it is present and also in the perl path

[hollizeck at kkf0f6ee kleinPipeline]$ ll /opt/modules/i12g/ensembl-tools/85/modules/Bio/DB/HTS.pm
-rw-rw-rw- 1 root root 77700 Sep 21 18:44 /opt/modules/i12g/ensembl-tools/85/modules/Bio/DB/HTS.pm
[hollizeck at kkf0f6ee kleinPipeline]$ echo $PERL5LIB
/opt/modules/i12g/perl/5.24.0/perl5lib/lib/perl5:/opt/modules/i12g/ensembl-tools/85/modules

I do not know, if I even look at the right spot.
Can you please help me to debug this further?

Is there any way to specify which plugin to use?


Sebastian Hollizeck
Bioinformatics

Dr. von Hauner Children's Hospital
Kubus Research Center
Lindwurmstraße 2a
D-80337 München
phone +49 89-4400-57488
fax +49 89-4400-57979



More information about the Dev mailing list