[ensembl-dev] NCBI vs Ensemble
Koul, Sunita
sunita at pathology.wustl.edu
Wed Mar 6 17:00:31 GMT 2013
Thanks that was helpful
From: dev-bounces at ensembl.org [mailto:dev-bounces at ensembl.org] On Behalf Of Bert Overduin
Sent: Wednesday, March 06, 2013 10:21 AM
To: Ensembl developers list
Subject: Re: [ensembl-dev] NCBI vs Ensemble
Hello Sunita,
NCBI and Ensembl generate their gene models in different ways, so they can differ in their coordinates, as is the case for the human DNMT3A gene.
The patches have nothing to do with this as this gene is not located in a patch region.
Just for your information, we indeed annotate at the moment the GRCh37.p10 assembly. The patches themselves however are existing as separate entities from the reference genome assembly. The reference genome itself has not changed between GRCh37 and GRCh37.p10. There are basically two kind of patches, novel patches, which represent alternate loci and fix patches, which represent corrected regions. The fix patches will be incorporated in the reference genome assembly in the next major release (i.e. GRCh38), thus replacing the faulty regions. Until that time genes located in fix patch regions will be annotated both on the reference genome assembly and the fix patch, as you can see for e.g. the human ABO gene:
http://www.ensembl.org/Homo_sapiens/Search/Details?db=core;end=2;idx=Gene;q=ABO;species=Homo_sapiens
But again, this does not pertain to the DNMT3A gene, as it is not located in a fix patch region.
Hope this helps.
Cheers,
Bert
On Wed, Mar 6, 2013 at 4:03 PM, Koul, Sunita <sunita at pathology.wustl.edu<mailto:sunita at pathology.wustl.edu>> wrote:
I was looking at the Gene DNMT3A (ENSG00000119772) in the ensemble genome browser
As per the release the ensemble 70 has updated patches for the human assembly (GRCh37.p10)
In NCBI the genomic location of this gene is (25455830-25565459) http://www.ncbi.nlm.nih.gov/gene?db=gene&cmd=retrieve&dopt=full_report&list_uids=1788
How is that the ensemble still shows the (25455845-25565459) in their browser http://useast.ensembl.org/Homo_sapiens/Location/View?db=core;g=ENSG00000119772;r=2:25455845-25565459
Am I missing something here?
Basically we are trying to annotate our genes with annotations from Ensemble. We are currently using NCBI build 37.p10 and want to pull in more information from Ensemble
Unless we find the compatibility between these 2 builds we can't proceed
Can someone guide me how we can do this
Thanks
Sunita
The materials in this email are private and may contain Protected Health Information. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying, distribution or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return email.
_______________________________________________
Dev mailing list Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
Ensembl Blog: http://www.ensembl.info/
--
Bert Overduin, Ph.D.
Vertebrate Genomics Team
EMBL - European Bioinformatics Institute
Wellcome Trust Genome Campus
Hinxton, Cambridge CB10 1SD
United Kingdom
http://www.ebi.ac.uk/~bert
Ensembl browser: http://www.ensembl.org<http://www.ensembl.org/>
Mailing lists: http://www.ensembl.org/info/about/contact/mailing.html
Blog: http://www.ensembl.info<http://www.ensembl.info/>
YouTube: http://www.youtube.com/user/EnsemblHelpdesk
Facebook: http://www.facebook.com/Ensembl.org
Twitter: http://twitter.com/Ensembl
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20130306/26f34260/attachment.html>
More information about the Dev
mailing list