[ensembl-dev] VEP command line
Linan, Margaret
margaret.linan at mssm.edu
Mon Sep 16 18:25:10 BST 2019
Thanks Irina,
Regarding the counting of the overlapped transcripts and regulatory features (using stats_html), should I just count how many times the string "transcript" or "regulatory features" appears in the 'Feature' column?
Also, what string would I be searching for in the 'Feature_type' column? In an example VEP annotated VCF, the only relevant string was: 'sense_overlapping'
Best regards,
Margaret Linan, MPH MS
Independent Consultant
Serving the CBIPM @ Icahn School of Medicine at Mount Sinai
Margaret.Linan at mssm.edu
________________________________
From: Irina Armean <iarmean at ebi.ac.uk>
Sent: Monday, September 16, 2019 8:22:53 AM
To: Ensembl developers list; Linan, Margaret
Subject: Re: [ensembl-dev] VEP command line
USE CAUTION: External Message.
Hi Margaret,
Sorry for the delay.
The stats written out in stats_html are collected internally simultaneously with the VEP annotation and therefore are not generated based on the VCF columns of the output file.
Depending on what VEP run options were selected, the counts could be reproduced based on the output file. For example the number of overlapped genes corresponds to the unique count of ENSG identifiers in the 'Gene' output column. The number of overlapped transcripts and regulatory features could be computed based on the 'Feature' and 'Feature_type' columns.
Kind regards,
Irina
On 12/09/2019 19:27, Linan, Margaret wrote:
Hi -
Does anyone know how the VEP command line program's stats_html utility calculates the following (i.e., what VCF columns and operations it uses)?
- VCF file pre-processing
- Number of overlapped genes
- Number of overlapped transcripts
- Number of overlapped regulatory features
Thank you,
Margaret
_______________________________________________
Dev mailing list Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org<https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.ensembl.org_mailman_listinfo_dev-5Fensembl.org&d=DwMC-g&c=shNJtf5dKgNcPZ6Yh64b-A&r=kRxZpbitOhDkEC3BuUN1vDtzo3iicYrRn6woDJL_jnA&m=w9gjaZF2-WgEeSoFXEwsblFfwJmVFz1CEmhpSp9zXtY&s=SpZOBETLvgtXkDPVAYD1y-NoSVS2-Gm6y5Og0WsbrqU&e=>
Ensembl Blog: http://www.ensembl.info/<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ensembl.info_&d=DwMC-g&c=shNJtf5dKgNcPZ6Yh64b-A&r=kRxZpbitOhDkEC3BuUN1vDtzo3iicYrRn6woDJL_jnA&m=w9gjaZF2-WgEeSoFXEwsblFfwJmVFz1CEmhpSp9zXtY&s=5upY6Tga0npIqKtFlwp1cmQIuwbtshPzDJQJPRAHMYg&e=>
--
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20190916/1f6562ca/attachment.html>
More information about the Dev
mailing list