[ensembl-dev] Question regarding the fetching if gene info using external names

Duarte Molha Duarte.Molha at ogt.co.uk
Wed Nov 21 16:28:11 GMT 2012


Thank you... but this is not the only one like this... almost all the mitochondrial genes fail in the same way:

Failed > Query:MT-TG
Failed > Query:MT-TH
Failed > Query:MT-TI
Failed > Query:MT-TK
Failed > Query:MT-TL1
Failed > Query:MT-TP
Failed > Query:MT-TQ
Failed > Query:MT-TS1
Failed > Query:MT-TS2

So this leads be to believe it is not a isolated case. In the list above most of these genes are present in the link you gave me ... so why were they not imported?

Cheers

Duarte

-----Original Message-----
From: dev-bounces at ensembl.org [mailto:dev-bounces at ensembl.org] On Behalf Of Andy Yates
Sent: 21 November 2012 16:16
To: Ensembl developers list
Subject: Re: [ensembl-dev] Question regarding the fetching if gene info using external names

Hi Duarte,

This is not possible to do since we have not imported this symbol into Ensembl which looks to be a bug in our xref pipeline. Our major source of HGNC mappings comes from the following URL:

http://www.genenames.org/cgi-bin/hgnc_downloads.cgi?title=Genew+output+data&col=gd_hgnc_id&col=gd_app_sym&col=gd_app_name&col=gd_prev_sym&col=gd_aliases&col=gd_pub_eg_id&col=gd_pub_refseq_ids&col=md_eg_id&col=md_refseq_id&col=gd_pub_ensembl_id&col=md_prot_id&col=gd_lsdb_links&status=Approved&status=Approved+Non-Human&status_opt=3&=on&where=&order_by=gd_hgnc_id&limit=&format=text&submit=submit&.cgifields=&.cgifields=status&.cgifields=chr

The association you have highlighted is also in this file. It seems that for some reason we have not recorded this link. We will register a bug about this and will be able to feed back once it is fixed.

For the moment you can use HGNC's biomart to provide the symbol -> ENSG lookup as we look into what has gone wrong

Andy

On 21 Nov 2012, at 15:49, Duarte Molha wrote:

> Dear developers
>  
> I have a question regarding the use of the methods "fetch_all_by_display_label" and "fetch_all_by_external_name"
>  
> Consider this snippet of code:
>  
> my $query_gene = "MT-TG";
> my @fetched_genes = 
> @{$gene_adaptor->fetch_all_by_display_label($query_gene)};
> unless (@fetched_genes){
> @fetched_genes =  @{ 
> $gene_adaptor->fetch_all_by_external_name($query_gene) }; } foreach my 
> $a (@fetched_genes) {
>                                 [do something]...
> }
>  
> Although, MT-TG is the approved HGNC symbol for this gene ( http://www.genenames.org/data/hgnc_data.php?hgnc_id=7486 ), I am unable to retrieve data for it from the ensemble database.
> In ensembl this gene is called J01415.16 with the ENSGID: 
> ENSG00000210164 (http://www.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000210164;r=MT:9991-10058;t=ENST00000387429 ) Can you tell me what I am doing wrong and how to change my code so I can retrieve is particular gene of interest?
> Best regards
>  
> Duarte Molha
>  
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: 
> http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/


_______________________________________________
Dev mailing list    Dev at ensembl.org
Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
Ensembl Blog: http://www.ensembl.info/




More information about the Dev mailing list