[ensembl-dev] Mouse Genomes Project gene symbols missing in Ensembl release 114

Leanne Haggerty leanne at ebi.ac.uk
Mon Jul 7 08:43:17 BST 2025


Hi Eric,

Just to add to Jorge’s response, in case it’s useful, you can find the genomes and Ensembl annotations for the latest Mouse Genome Project on the Ensembl project page:
https://projects.ensembl.org/mouse_genomes/

From this page, you can download the GFF files. Gene symbols have been added using a machine learning approach that closely replicates traditional Ensembl assignment methods with high accuracy. You can read more about this approach here:
https://www.embl.org/news/science/a-machine-learning-approach-for-allocating-gene-function/, and more on Ensembl’s conventional gene naming methods here: https://www.ensembl.org/info/genome/genebuild/gene_names.html

I hope this is helpful while we await the release of these genomes in the new website.

All the best
Leanne


> On 4 Jul 2025, at 16:47, Jorge Batista da Rocha <jrocha at ebi.ac.uk> wrote:
> 
> Hi Eric, 
> 
> Hope you are well, and thank you for reaching out. 
> 
> I’m sorry to say that the gene symbols are missing due to an error in our external references pipeline. Our data teams are looking into it. 
> 
> It will not be fixed for the upcoming 115 release in the autumn. Any fixes would be incorporated into the subsequent 116 release.
> 
> We expect the updated builds to become available in our new site, beta.ensembl.org <http://beta.ensembl.org/> within the next few months. 
> 
> Thanks and best wishes
> Jorge 
> 
> 
> 
> 
> 
> Dr Jorge Batista da Rocha
> Ensembl Outreach Project Leader
> European Bioinformatics Institute (EMBL-EBI)
> 
>> On 1 Jul 2025, at 14:58, Eric Engelhard <eric.engelhard at regeneron.com> wrote:
>> 
>> Ensembl release 114 includes new genome builds and annotations for the non-canonical Mouse Genome Project strains (https://www.ensembl.info/2025/05/07/ensembl-114-has-been-released/). I recently noticed, however, that new gene annotations now lack gene symbols for both the web service and within the MySQL sources. Additionally, BioMart sources are no longer synchronized to the new gene identifiers, all of which have been replaced.
>>  
>> Are gene symbol annotations and a BioMart update planned for the next release?
>>  
>> Thank you,
>> Eric
>> 
>> Regeneron - Internal
>> ******************************************************************** 
>> This e-mail and any attachment hereto, is intended only for use by the addressee(s) named above and may contain legally privileged and/or confidential information. If you are not the intended recipient of this e-mail, any dissemination, distribution or copying of this email, or any attachment hereto, is strictly prohibited. If you receive this email in error please immediately notify me by return electronic mail and permanently delete this email and any attachment hereto, any copy of this e-mail and of any such attachment, and any printout thereof. Finally, please note that only authorized representatives of Regeneron Pharmaceuticals, Inc. have the power and authority to enter into business dealings with any third party. 
>> ********************************************************************
>> _______________________________________________
>> Dev mailing list    Dev at ensembl.org <mailto:Dev at ensembl.org>
>> Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
>> Ensembl Blog: http://www.ensembl.info/
> 
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
> Ensembl Blog: http://www.ensembl.info/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20250707/4330c9c9/attachment.html>


More information about the Dev mailing list