[ensembl-dev] VEP using CADD plugin for indels
Kirsley CHENNEN
kchennen at unistra.fr
Fri Apr 17 09:41:59 BST 2015
Hi again Will,
I have a l'll question concerning dbNSFP data. With the
latest release (dbNSFP3.0b2a, dbscSNV), by default the tabix indexing is
on the hg38 coordinates, but if I want to use hg19 coordinates, will the
following tabix indexing work?
tabix -s 1 -b 9 -e 9 dbNSFP.gz
Columns of dbNSFP_variant:
chr: chromosome number
pos(1-based): physical position on the chromosome as to hg38
(1-based coordinate).
For mitochondrial SNV, this position refers to the rCRS
(GenBank: NC_012920).
ref: reference nucleotide allele (as on the + strand)
alt: alternative nucleotide allele (as on the + strand)
aaref: reference amino acid
"." if the variant is a splicing site SNP (2bp on each end of
an intron)
aaalt: alternative amino acid
"." if the variant is a splicing site SNP (2bp on each end of
an intron)
rs_dbSNP142: rs number from dbSNP 142
hg19_chr: chromosome as to hg19, "." means missing
hg19_pos(1-based): physical position on the chromosome as to hg19
(1-based coordinate).
For mitochondrial SNV, this position refers to a YRI sequence
(GenBank: AF347015)
.
.
.
Regards,
Kirsley
On 14/04/2015 17:18, Will McLaren wrote:
> Hi Kirsley,
>
> The dbscSNV data works slightly differently than the dbNSFP data; I've
> added a separate plugin for this.
>
> https://github.com/ensembl-variation/VEP_plugins/blob/master/dbscSNV.pm
>
> Note the different process for tabix-indexing the data file.
>
> Regards
>
> Will
>
>
>
> On 8 April 2015 at 23:22, Kirsley Chennen <kchennen at unistra.fr
> <mailto:kchennen at unistra.fr>> wrote:
>
> Hi Will,
> Marvellous! Is it also possible to have an update of the
> dbNSFP.pm plugin, as
> dbNSFP[https://sites.google.com/site/jpopgen/dbNSFP] now comes in
> two files; dbNSFP.XXX.zip for non-synonymous SNPs and dbscSNV.zip
> for splicing consensus regions?
> Thanks again.
>
> Best regards,
> Kirsley
>
>
>
> On 08/04/2015 14:51, Will McLaren wrote:
>> Hello,
>>
>> Looking at the CADD website, it seems the data files for indels
>> are separate from those for SNVs.
>>
>> I've updated the CADD plugin
>> (https://github.com/ensembl-variation/VEP_plugins/blob/master/CADD.pm)
>> to be able to run with more than one CADD data file specified, e.g.:
>>
>> perl variant_effect_predictor.pl
>> <http://variant_effect_predictor.pl> [your_options] -plugin
>> CADD,whole_genome_SNVs.tsv.gz,InDels.tsv.gz
>>
>> Hopefully this should work for you
>>
>> Will McLaren
>> Ensembl Variation
>>
>> On 3 April 2015 at 10:32, Kirsley CHENNEN <kchennen at unistra.fr
>> <mailto:kchennen at unistra.fr>> wrote:
>>
>> Hi,
>> I would like to know how to use the CADD plugin with VEP
>> to annotate both SNP and indels? In the plugin head
>> description, it seems to refer only to SNP. Is there a way to
>> annotate the indels too?
>>
>>
>> Regards,
>> Kirsley Chennen
>>
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20150417/83222932/attachment.html>
More information about the Dev
mailing list