[ensembl-dev] VEP dbNSFP error

Abhishek Nag abhishek at well.ox.ac.uk
Tue Jun 4 11:00:19 BST 2019


Dear Anja,

Please see below my responses to your queries in bold. Hope that helps.

Thanks
Abhishek
________________________________
From: Dev [dev-bounces at ensembl.org] on behalf of Anja Thormann [anja at ebi.ac.uk]
Sent: Monday, June 03, 2019 3:53 PM
To: Ensembl developers list
Subject: Re: [ensembl-dev] VEP dbNSFP error

Thank you for the extra details.

Could you please confirm one more time the return value for running:

tabix -fh /home/VEP/Plugins/dbNSFP4.0b2a.txt.gz 1:1-1 2>&1

It should be #chr    pos(1-based)    ref     alt     aaref   aaalt   rs_dbSNP151     hg19_chr ….

I get exactly the same output as you mentioned, on running the tabix -fh command

The error (ERROR: Could not read headers from dbNSFP4.0b2a.txt.gz) most likely points to a missing header line in the dbNSFP file.

If the plugin couldn't read from the dbNSFP file the error would be ERROR: Data file dbNSFP4.0b2a.txt.gz not found

There are 2 things which slightly confuse me:

1) If you run vep with /apps/well/ensembl-tools/93/ensembl-vep/vep -i variants.txt -o output.txt --cache --dir /home/VEP/ --plugin dbNSFP, /home/VEP/Plugins/dbNSFP4.0b2a.txt.gz
ALL --tab --force_overwrite --port 3337
I would expect the full path name in the error message: ERROR: Could not read headers from /home/VEP/Plugins/dbNSFP4.0b2a.txt.gz

Yes, true, I do get the full path name in the error message

2) tabix -H is not supported by tabix version: 0.2.5. But you are able to run tabix -H?

When I just run tabix (version 0.2.5) from my command line, the list of available options shows that the -H option for printing just the header line is available.

Once we figured out what is going on you need to switch to a more recent release version for the plugin<https://github.com/Ensembl/VEP_plugins/blob/release/97/dbNSFP.pm>. In the more recent versions of the dbNSFP plugin we are dealing with the different header names for grch37 and grch38 coordinates that have been introduced by dbNSFP for their last file version 2.*, 3.* and 4.*. But this is only relevant after the plugin has retrieved the header line.

Thank you,
Anja


On 3 Jun 2019, at 14:43, Abhishek Nag <abhishek at well.ox.ac.uk<mailto:abhishek at well.ox.ac.uk>> wrote:

Dear Anja,

Thanks for your response.

I'm using the VEP version 93 and perl version v5.16.3

Thanks
Abhishek
________________________________
From: Dev [dev-bounces at ensembl.org<mailto:dev-bounces at ensembl.org>] on behalf of Anja Thormann [anja at ebi.ac.uk<mailto:anja at ebi.ac.uk>]
Sent: Monday, June 03, 2019 2:25 PM
To: Ensembl developers list
Subject: Re: [ensembl-dev] VEP dbNSFP error

Dear Abhishek,

so far I wasn’t able to reproduce the error. I have been running tests with the same dbNSFP file and tabix version. Could you please also let me know your perl version and your vep version?

Best regards,
Anja

On 26 May 2019, at 15:47, Abhishek Nag <abhishek at well.ox.ac.uk<mailto:abhishek at well.ox.ac.uk>> wrote:

Dear Anja,

Thanks for responding to my email.

The header line that is returned on running tabix -H seems fine.

Here is the VEP command that I'm running:


/apps/well/ensembl-tools/93/ensembl-vep/vep -i variants.txt -o output.txt --cache --dir /home/VEP/ --plugin dbNSFP,/home/VEP/Plugins/dbNSFP4.0b2a.txt.gz,ALL --tab --force_overwrite --port 3337


The /home/VEP/ path contains the Plugins subdirectory (which contains the dbNSFP4.0b2a.txt.gz file) as well as the cache folder homo_sapiens.


Thanks for your help!


Abhishek

________________________________
From: Dev [dev-bounces at ensembl.org<mailto:dev-bounces at ensembl.org>] on behalf of Anja Thormann [anja at ebi.ac.uk<mailto:anja at ebi.ac.uk>]
Sent: Thursday, May 23, 2019 3:52 PM
To: Ensembl developers list
Subject: Re: [ensembl-dev] VEP dbNSFP error

Dear Abhishek,

could you please run tabix -H dbNSFP4.0b2a.txt.gz and let me know if it returns the header line starting with #chr  pos(1-based)..? If that is the case could you please let me know who exactly your are running the VEP command together with the dbNSFP plugin command? If the header is returned correctly I’m wondering if the dbNSFP file path is maybe not found correctly by the plugin.

Regards,
Anja

On 23 May 2019, at 13:53, Abhishek Nag <abhishek at well.ox.ac.uk<mailto:abhishek at well.ox.ac.uk>> wrote:

Hi,

I have been trying to use the dbNSFP database to annotate my variants. Currently, I'm trying to use the latest version of the database (v4.0), but I have also tried using the version 3.5. With both versions, I'm getting the following error message whenever I'm trying to run it -


Failed to instantiate plugin dbNSFP: ERROR: Could not read headers from dbNSFP4.0b2a.txt.gz


When I zcat the database file, I am able to read the header line, which was obtained from one of the chromosome files, as suggested in the online manual.


The formatting steps for the database (including tab indexing) seem to have gone fine, so I'm not sure where the problem is. Also, I am being able to use the other VEP Plugins just fine; for now, I'm getting this error messge with only the dbNSFP plugin.


It would be great if I could have some pointers regarding where the issue might be.

Many Thanks
Abhishek
_______________________________________________
Dev mailing list    Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
Ensembl Blog: http://www.ensembl.info/

_______________________________________________
Dev mailing list    Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
Ensembl Blog: http://www.ensembl.info/

_______________________________________________
Dev mailing list    Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
Ensembl Blog: http://www.ensembl.info/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20190604/0ef8fa11/attachment.html>


More information about the Dev mailing list