[ensembl-dev] Annotation discrepancy
ian Longden
ianl at ebi.ac.uk
Fri Nov 19 14:59:34 GMT 2010
150000, 51275, 55449, 57126, 503646, 8693 are all unmapped in human.
These will have entrys in the xref table but are not linked to any genes.
I am not sure how the search is done but this may affect it.
984:-
Not sure what the problem is here we find the genes of interest but
this gene also has other EntrezGene ids.
8857:-
as above
9026:-
as above.
Were you expecting only 1 EntrezGene per gene? In time i hope this
becomes true but as these are very similar the software cannot choose
between them and uses both.
I think the data is correct but maybe the search is not giving you
exactly what you want.
We need to look at having the unmapped cases searchable.
0Ian.
On Fri, Nov 19, 2010 at 11:55 AM, Oliver, Gavin
<gavin.oliver at almacgroup.com> wrote:
> I have a few more examples of discrepancies which will hopefully help.
>
>
>
> For all examples, the search was performed on Entrez ID but returned
> nothing. I have looked a bit deeper into a handful of examples. Details
> below:
>
>
>
> Entrez ID 150000 Associated gene Symbol ABCC13 in database but
> with no associated entrez ID
>
> Entrez ID 51275 Associated gene symbol C12orf47 in database but
> no associated entrez ID
>
> Entrez ID 55449 Associated gene symbol C14orf167 in database but
> no associated entrez ID
>
> Entrez ID 57126 Associated gene symbol CD177 in database with no
> associated entrez id
>
> Entrez ID 984 Associated gene symbol CDK11B is not in
> database. CDK11A is in database but is annotated as cyclin-dependent kinase
> 11B with entrez id 100294398 which entrez describes as LOC100294398 (cell
> division protein kinase 11B-like).
>
> Entrez ID 503646 Neither this ID nor associated gene symbol DPRXP5
> are in the database.
>
> Entrez ID 8857 Associated gene symbol FCGBP (Fc fragment of
> IgG binding protein) is there but with Entrez gene ID 100133944 which
> corresponds to LOC100133944 IgGFc-binding protein-like.
>
> Entrez ID 8693 Neither this ID nor associated gene symbol
> GALNT4 are in the database.
>
> Entrez ID 9026 Gene symbol HIP1R (huntingtin interacting
> protein 1 related) is in the database but with entrez ID 100294412 which
> corresponds to huntingtin-interacting protein 1-related protein-like
>
>
>
> Best,
>
>
>
> Gavin
>
>
>
>
>
>
>
> ________________________________
>
> From: dev-bounces at ensembl.org [mailto:dev-bounces at ensembl.org] On Behalf Of
> Oliver, Gavin
> Sent: 19 November 2010 10:29
> To: dev at ensembl.org
> Subject: [ensembl-dev] Annotation discrepancy
>
>
>
> Hi all,
>
>
>
> I have been using Ensembl human for internal annotation of microarrays.
>
>
>
> Yesterday someone did a search for Entrez Gene ID 3336 in our database. It
> returned no hits.
>
>
>
> When they searched with the Gene symbol for this ID (HSPE1), they got 5 hits
> but the Entrez ID associated with the gene was 100132346 (and not 3336 as
> would be expected).
>
>
>
> I ran a search for 100132346 against the Ensembl genome browser and it
> brings back 2 genes on 2 different chromosomes.
>
>
>
> Can someone explain what might be happening here?
>
>
>
> Best,
>
>
>
> Gavin
>
>
>
>
>
> _______________________________________________
> Dev mailing list
> Dev at ensembl.org
> http://lists.ensembl.org/mailman/listinfo/dev
>
>
More information about the Dev
mailing list