[ensembl-dev] VEP: Is it possible to add LRG annotations?

Andrew Carson acarson at invivoscribe.com
Mon Apr 7 22:28:57 BST 2014


Hi Dr. McLaren,

Sorry for the additional query, but I would like to add to my last posted question. As I investigate further, it seems that I don't understand how the --lrg flag works. Is it adding a second separate annotation if there is an LRG overlap?
I notice that when I run the commands:



perl variant_effect_predictor.pl --fork 4 --no_stats --everything --lrg --cache --format vcf --force_overwrite --check_existing --check_alleles --vcf --no_progress --pubmed --gmaf --maf_1kg --pick -i input.vcf -o output.VEP.vcf



Whenever I get a LRG annotation, there is a separate regular annotation. For example:



1       92941604        .     C       T       .       .       CSQ=T|ENSG00000162676|ENST00000370332|Transcript|synonymous_variant|1570|1251|417|T|acG/acA|COSM1344916|||7/7||||||

|-1||YES|GFI1|HGNC||||protein_coding|ENSP00000359357|PROSITE_profiles:PS50157&SMART_domains:SM00355&Superfamily_domains:SSF57667|CCDS30773.1|ENST00000370332.1:c.1251G>A|ENST00000370332.1:c.1251G>A(p.%3D)|||||



Followed 105 lanes later by:



1       92941604        .     C       T       .       .       CSQ=A|LRG_63|LRG_63t1|Transcript|synonymous_variant|1501|1251|417|T|acG/acA||||7/7|||||||1||YES|LRG_63|LRG||||LRG_g

ene|LRG_63p1|||LRG_63t1.1:c.1251G>A|LRG_63t1.1:c.1251G>A(p.%3D)|||||



This causes the output .vcf to have out of order variants (no longer properly sorted). Not that this is the same consequence but one shown as the + strand (CSQ=T) and one is shown on the - strand (CSQ=A).



Am I doing something wrong here?

Any help would be appreciated.

Thanks!
Andrew





>Thank you very much Dr. McLaren.

>Just one clarification to the LRG choice. Is the LRG always presented as the first consequence (if it exists)? If this is true, then if the --pick chooses the worst consequence, and there are multiple transcripts with the same >"worst consequence", does VEP --pick the first transcript with that consequence? If that is true, if the LRG contains the "worst consequence" along with other similar transcripts, will --pick successfully choose this consequence >over other equal transcripts?



>Any help on this would be appreciated.

>Thank you again!

>Andrew

>

>I should also say that there's currently no way to prioritise LRG

>consequences other than filtering using filter_vep.pl, though this wouldn't

>be a complete solution.

>

>Will

>

>

>On 1 April 2014 09:51, Will McLaren <wm2 at ebi.ac.uk<http://lists.ensembl.org/mailman/listinfo/dev>> wrote:

>

> Hi Andrew,

>

> In fact this is already possible; just add the flag --lrg at runtime. Note

> however that using LRGs depends on connecting to our database, so this will

> not work using --offline and will connect to ensembldb.ensembl.org when

> using --cache. Because of this database connection you may find that the

> script runs more slowly as it attempts to remap your input variants to LRG

> coordinates.

>

> I'm afraid this is missing from the documentation currently, I will get

> that updated.

>

> Regards

>

> Will McLaren

> Ensembl Variation

>

>

> On 31 March 2014 22:24, Andrew Carson <acarson at invivoscribe.com<http://lists.ensembl.org/mailman/listinfo/dev>> wrote:

>

>> Hi ensembl-dev team,

>>

>> I was just wondering if there are plans to incorporate the LRG (locus

>> reference genomic) records into the VEP annotation pipeline (from here:

>> http://www.lrg-sequence.org/home). I only ask because in the HGVS new

>> clinical reporting guidelines they recommend using the LRG sequence (if one

>> is present) to standardize variant reporting.

>>

>>

>>

>> It would also be very useful to have an option where, if a variant

>> overlaps an LRG, you can choose to "pick" that consequence over other

>> consequences.

>>

>>

>>

>> Any thoughts on if this could be added to the development for the next

>> release cycle?

>>

>> Thanks for all of your help!

>>

>>

>>

>> Andrew R. Carson, Ph.D.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20140407/a0c67830/attachment.html>


More information about the Dev mailing list