[ensembl-dev] Genes with identical symbols but different ENSG

Mark McDowall mcdowall at ebi.ac.uk
Fri Mar 22 08:59:30 GMT 2019


Hi Duarte,

We are looking into this at the moment.

Cheers,

Mark

On 19/03/2019 14:13, Duarte Molha wrote:
> Dear Devs
> 
> Could you help me understand why the gene
> 
> ATXN7
> 
> Has 2 ENSG ids. but they both map to the same external Reference:
> ATAXIN 7; ATXN7 [*607640] (MIM gene record; description: ATAXIN 7; ATXN7,)
> 
> The Gene with the most transcripts associated with it is:
> ATXN7 (Human Gene)
> ENSG00000163635 3:63898399-64003453:1
> 
> But Overlapping with it you have
> ATXN7 (Human Gene)
> ENSG00000285258 3:63864557-64003462:1
> 
> Would it not make sense to add all transcript to the 1st ID and drop the second?
> This unfortunately is not the only gene where this occurs
> 
> For example the HGNC symbol DIABLO is associated with 2 ENSG IDs (also overlaping
> The same with CCDC39, IGF2, MATR3, PDE11A, RMRP, SCO2, SPATA13 and TBCE
> 
> I am sure this is not the only ones where this is true and it creates a bit of a problem because now I need to be merging 
> distinct entities or choose between one of the 2 entries as the main entry for that gene symbol.
> 
> Your help on why this occurs and any possible solutions as to how to only select the main one would me much appreciated
> 
> Best regards
> 
> Duarte Molha
> 
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/
> 

-- 
Mark McDowall, PhD | Bioinformatician, Ensembl - Multiscale Genomics
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Tel: +44-(0)1223-494589
WWW: http://www.ensembl.org
WWW: http://www.multiscalegenomics.eu



More information about the Dev mailing list