[ensembl-dev] Genes with identical symbols but different ENSG
Mark McDowall
mcdowall at ebi.ac.uk
Fri Mar 22 08:59:30 GMT 2019
Hi Duarte,
We are looking into this at the moment.
Cheers,
Mark
On 19/03/2019 14:13, Duarte Molha wrote:
> Dear Devs
>
> Could you help me understand why the gene
>
> ATXN7
>
> Has 2 ENSG ids. but they both map to the same external Reference:
> ATAXIN 7; ATXN7 [*607640] (MIM gene record; description: ATAXIN 7; ATXN7,)
>
> The Gene with the most transcripts associated with it is:
> ATXN7 (Human Gene)
> ENSG00000163635 3:63898399-64003453:1
>
> But Overlapping with it you have
> ATXN7 (Human Gene)
> ENSG00000285258 3:63864557-64003462:1
>
> Would it not make sense to add all transcript to the 1st ID and drop the second?
> This unfortunately is not the only gene where this occurs
>
> For example the HGNC symbol DIABLO is associated with 2 ENSG IDs (also overlaping
> The same with CCDC39, IGF2, MATR3, PDE11A, RMRP, SCO2, SPATA13 and TBCE
>
> I am sure this is not the only ones where this is true and it creates a bit of a problem because now I need to be merging
> distinct entities or choose between one of the 2 entries as the main entry for that gene symbol.
>
> Your help on why this occurs and any possible solutions as to how to only select the main one would me much appreciated
>
> Best regards
>
> Duarte Molha
>
> _______________________________________________
> Dev mailing list Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/
>
--
Mark McDowall, PhD | Bioinformatician, Ensembl - Multiscale Genomics
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Tel: +44-(0)1223-494589
WWW: http://www.ensembl.org
WWW: http://www.multiscalegenomics.eu
More information about the Dev
mailing list