[ensembl-dev] Possibly incorrect entrezgene ids for CCL3L1, CCL3L1, HLA-DQA2
Ameya Chaubal
ameya at ebi.ac.uk
Mon Nov 9 14:13:16 GMT 2020
Hello Gerhard,
Within the system, genes identified as alleles get associated with the external references of each other.
From the biomart, this gets picked up as another attribute of a given Ensembl id and hence the mapping between the attributes.
Thanks,
Ameya
> On 3 Nov 2020, at 14:07, Gerhard Burger <g.a.burger at lacdr.leidenuniv.nl> wrote:
>
> Hi ensemble-dev,
>
> I'm trying to get entrizid for hgnc gene symbols, but I got some confusing results for CCL3L1, CCL3L1, HLA-DQA1, HLA-DQA2. I am using the following R code:
>
> library(biomaRt)
> ensembl <- useEnsembl(biomart = "genes", dataset = "hsapiens_gene_ensembl")
> getBM(values = c("CCL3L1", "CCL3L3", "HLA-DQA1", "HLA-DQA2"),
> attributes = c("entrezgene_id", "hgnc_symbol"),
> filters = "hgnc_symbol", mart = ensembl)
>
> which gives:
>
> entrezgene_id hgnc_symbol
> 1 6349 CCL3L1
> 2 414062 CCL3L1
> 3 6349 CCL3L3
> 4 414062 CCL3L3
> 5 3117 HLA-DQA1
> 6 3117 HLA-DQA2
> 7 3118 HLA-DQA2
>
> However, when I look up these hgnc symbols online I believe the result should be:
>
> entrezgene_id hgnc_symbol
> 1 6349 CCL3L1
> 2 414062 CCL3L3
> 3 3117 HLA-DQA1
> 4 3118 HLA-DQA2
>
> The online sources:
>
> CCL3L1: https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:10628 <https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:10628>
> CCL3L3: https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:30554 <https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:30554>
> HLA-DQA1: https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:4942 <https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:4942>
> HLA-DQA2: https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:4943 <https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:4943>
>
> Also AnnotationDbi does give me the expected results:
>
> require(AnnotationDbi)
> require(org.Hs.eg.db)
> AnnotationDbi::mapIds(org.Hs.eg.db,
> keys=c("CCL3L1", "CCL3L3", "HLA-DQA1", "HLA- DQA2"),
> column = "ENTREZID", keytype = "SYMBOL")
>
> Am I doing something wrong or is this is bug?
>
> (biomaRt version 2.44.4, ensemble version 101)
>
> Kind regards,
> Gerhard
> _______________________________________________
> Dev mailing list Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
> Ensembl Blog: http://www.ensembl.info/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20201109/d6bfbb25/attachment.html>
More information about the Dev
mailing list