[ensembl-dev] Bug?? Biomart mapping of Mouse ID to MGI, ids

Ragavendran, Ashok ARAGAVENDRAN at mgh.harvard.edu
Fri Sep 4 15:09:41 BST 2015


hi Magali,

   Thanks again for the prompt response. This again relates back to my previous post on the Entrez IDs and the webpage issue

    http://useast.ensembl.org/Mus_musculus/Gene/Summary?db=core;g=ENSMUSG00000106677;r=5:87459490-87490871;t=ENSMUST00000147854

    http://useast.ensembl.org/Mus_musculus/Gene/Summary?db=core;g=ENSMUSG00000029268;r=5:87459490-87482260

As you can see each of the genes above are assigned to only one MGI id which was misleading whereby I was unsure what the correct  set of results were.

Does this mean we should not rely on the search results for a gene and for anything definite use the biomart tool instead??

Please don't get me wrong, I think that the Ensembl database is a really wonderful resource and I just wanted to bring these aspects to your attention in the hope that it helps in making it better.

 Furthermore, if I search using ensembl gene id for Ugt2a1 i get the following results
        Ensembl Gene ID    Associated Gene Name    MGI ID    MGI symbol    Ensembl Transcript ID
        ENSMUSG00000106677    Ugt2a1    MGI:2149905    Ugt2a1    ENSMUST00000147854

Whereas results from MGI data show that ENSMUSG00000106677  is  associated with both MG1 ids.
>From MGI database:

MGI:2149905    Ugt2a1    O    Gene    UDP glucuronosyltransferase 2 family, polypeptide A1    43.56    5    87459490    87490871    -    BB375653|AK140757|AF184901|BC048926    NM_053184    OTTMUST00000066277|OTTMUST00000138762    ENSMUST00000079811|ENSMUST00000144144|ENSMUST00000147854            OTTMUSP00000033195|OTTMUSP00000072911    ENSMUSP00000078740|ENSMUSP00000114842|ENSMUSP00000114583    NP_444414

MGI:3576095    Ugt2a2    O    Gene    UDP glucuronosyltransferase 2 family, polypeptide A2    43.56    5    87459493    87482258    -    BC058786|BC048920    NM_001024148    OTTMUST00000066277|OTTMUST00000138762    ENSMUST00000144144|ENSMUST00000147854|ENSMUST00000079811            OTTMUSP00000072911|OTTMUSP00000033195    ENSMUSP00000078740|ENSMUSP00000114842|ENSMUSP00000114583    NP_001019319

I see their web page is inconsistent and only lists one of the Ensembl gene IDs, I will write them and see if they can correct the inconsitency.


Thanks again for all your help.

Cheers
    Ashok


On 9/4/15 7:00 AM, dev-request at ensembl.org<mailto:dev-request at ensembl.org> wrote:

Message: 1
Date: Fri, 04 Sep 2015 10:11:35 +0100
From: mag <mr6 at ebi.ac.uk><mailto:mr6 at ebi.ac.uk>
Subject: Re: [ensembl-dev] Bug?? Biomart mapping of Mouse ID to MGI
        ids
To: dev at ensembl.org<mailto:dev at ensembl.org>
Message-ID: <55E96047.5010702 at ebi.ac.uk><mailto:55E96047.5010702 at ebi.ac.uk>
Content-Type: text/plain; charset=windows-1252; format=flowed

Hi Ashok,

MGI mappings, like HGNC mappings, are direct mappings we import directly
from the source.
According to this database, both Ugt2a1 and Ugt2a2 map to the Ensembl gene.
http://www.informatics.jax.org/marker/MGI:2149905
http://www.informatics.jax.org/marker/MGI:3576095

Looking at the gene models, there is an overlap with ENSMUSG00000106677,
which corresponds to Ugt2a1.
I suspect MGI:2149905 needs to be updated to link to ENSMUSG00000106677
instead of ENSMUSG00000029268, but we cannot change it at our end.
If you have any experimental evidence to correct those mappings, please
contact MGI (mgi-help at jax.org<mailto:mgi-help at jax.org>) to let them know.


Regards,
Magali

On 03/09/2015 21:34, Ragavendran, Ashok wrote:


> Hello,
>      I have come upon a subsequent bug where there is an erroneous
> mapping of Ensembl Mouse ID to the corresponding MGI:ID using BioMart (
> both in bioconductor and the Ensembl web interface). Please let me know
> if I am missing something in this regard. Correct mapping should be only
> to 3576095 and not to 2149905.
>
>      Cheers
>      Ashok
>
> ===== Text based Results from querying the gene id  ENSMUSG00000029268
> =======
>
> Ensembl Gene ID    Status (gene)    MGI ID    MGI symbol
> ENSMUSG00000029268    KNOWN    MGI:2149905    Ugt2a1
> ENSMUSG00000029268    KNOWN    MGI:3576095    Ugt2a2
>







--
Ashok Ragavendran
Bioinformatics Specialist
Center for Human Genetic Research
Massachusetts General Hospital
Richard B. Simches Research Center
185 Cambridge St, Boston MA 02114
aragavendran at mgh.harvard.edu<mailto:aragavendran at mgh.harvard.edu>
ph: +1-617-726-1329


The information in this e-mail is intended only for the person to whom it is
addressed. If you believe this e-mail was sent to you in error and the e-mail
contains patient information, please contact the Partners Compliance HelpLine at
http://www.partners.org/complianceline . If the e-mail was sent to you in error
but does not contain patient information, please contact the sender and properly
dispose of the e-mail.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20150904/ac51b6bf/attachment.html>


More information about the Dev mailing list