[ensembl-dev] translation question GTG --> Methionine?
Hervé Pagès
hpages.on.github at gmail.com
Wed Mar 10 23:06:59 GMT 2021
Hi Julie, Emily,
It's worth noting that the TTG and CTG start codons are totally expected
and officially part of the Standard Code:
https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi#SG1
Also the GTG start codon is expected for transcripts found on the
Mitochondrial chromosome and is officially part of the Vertebrate
Mitochondrial Code:
https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi#SG2
So what breaks the rule in the case of transcript ENST00000678634.1 is
that the GTG start codon is found on chromosome 20.
H.
On 3/10/21 7:58 AM, Emily Perry wrote:
> Hi Julie
>
> The information is stored in the transcript_attrib table under
> attribution_id 380:
> http://www.ensembl.org/info/docs/api/core/core_schema.html#transcript_attrib
> <http://www.ensembl.org/info/docs/api/core/core_schema.html#transcript_attrib>
>
> You can fetch it from the Perl API using $transcript->get_all_Attributes
> http://www.ensembl.org/info/docs/Doxygen/core-api/classBio_1_1EnsEMBL_1_1Transcript.html#a59f9ff2079a28ba80bc8b62a5e636327
> <http://www.ensembl.org/info/docs/Doxygen/core-api/classBio_1_1EnsEMBL_1_1Transcript.html#a59f9ff2079a28ba80bc8b62a5e636327>
>
> All the best
>
> Emily
>
>> On 10 Mar 2021, at 14:37, Julie Sullivan <julie.sullivan at gmail.com
>> <mailto:julie.sullivan at gmail.com>> wrote:
>>
>> Thank you! That answers my question!
>>
>> I would really like to be able to access that tag (non-ATG start)
>> programmatically. Are there plans for putting it with the other
>> transcript flags in the GTF file?
>>
>> On Wed, 10 Mar 2021 at 10:09, Emily Perry <emily at ebi.ac.uk
>> <mailto:emily at ebi.ac.uk>> wrote:
>>
>> Hi Julie
>>
>> We have some information about non-ATG start codons in our blog
>> post from release 102:
>> https://www.ensembl.info/2020/11/30/ensembl-102-has-been-released/
>> <https://www.ensembl.info/2020/11/30/ensembl-102-has-been-released/>
>>
>> Quite simply, there is not a rule. This is a situation of
>> exceptional biology which we are only able to annotate correctly
>> because of our expert manual gene annotators analysing the data in
>> detail.
>>
>> All the best
>>
>> Emily
>>
>>> On 10 Mar 2021, at 09:08, Julie Sullivan
>>> <julie.sullivan at gmail.com <mailto:julie.sullivan at gmail.com>> wrote:
>>>
>>> https://www.ensembl.org/Homo_sapiens/Transcript/Sequence_cDNA?db=core;g=ENSG00000288649;r=20:33667144-33668235;t=ENST00000678634
>>> <https://www.ensembl.org/Homo_sapiens/Transcript/Sequence_cDNA?db=core;g=ENSG00000288649;r=20:33667144-33668235;t=ENST00000678634>
>>> The first codon is GTG. I would not have expected that to be
>>> Methionine.
>>>
>>> I looked in the text files, and there are 123 of these
>>> transcripts where the start codon is NOT ATG but the aa is M, in
>>> Homo sapiens.
>>> {'error': 0,
>>> 'methionine': 91434,
>>> 'GTG': 22,
>>> 'ATA': 10,
>>> 'CTG': 67,
>>> 'ACG': 8,
>>> 'TTG': 9,
>>> 'ATT': 5,
>>> 'AAC': 1,
>>> 'AAG': 1}
>>>
>>> Why is that?
>>>
>>> Specifically I would like a rule I can use, as my HGVSp strings
>>> are different from VEP for this reason.
>>>
>>> Thanks!
>>> Julie
>>> _______________________________________________
>>> Dev mailing list Dev at ensembl.org <mailto:Dev at ensembl.org>
>>> Posting guidelines and subscribe/unsubscribe info:
>>> https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
>>> <https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org>
>>> Ensembl Blog: http://www.ensembl.info/ <http://www.ensembl.info/>
>>
>> —
>>
>> Dr Emily Perry (Pritchard)
>> Ensembl Outreach Project Leader
>> (she/her)
>>
>> European Bioinformatics Institute (EMBL-EBI)
>> European Molecular Biology Laboratory
>> Wellcome Genome Campus
>> Hinxton
>> Cambridge
>> CB10 1SD
>> UK
>>
>>
>>
>>
>> _______________________________________________
>> Dev mailing list Dev at ensembl.org <mailto:Dev at ensembl.org>
>> Posting guidelines and subscribe/unsubscribe info:
>> https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
>> <https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org>
>> Ensembl Blog: http://www.ensembl.info/ <http://www.ensembl.info/>
>>
>> _______________________________________________
>> Dev mailing list Dev at ensembl.org <mailto:Dev at ensembl.org>
>> Posting guidelines and subscribe/unsubscribe info:
>> https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
>> <https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org>
>> Ensembl Blog: http://www.ensembl.info/ <http://www.ensembl.info/>
>
> —
>
> Dr Emily Perry (Pritchard)
> Ensembl Outreach Project Leader
> (she/her)
>
> European Bioinformatics Institute (EMBL-EBI)
> European Molecular Biology Laboratory
> Wellcome Genome Campus
> Hinxton
> Cambridge
> CB10 1SD
> UK
>
>
>
>
>
> _______________________________________________
> Dev mailing list Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
> Ensembl Blog: http://www.ensembl.info/
>
--
Hervé Pagès
Bioconductor Core Team
hpages.on.github at gmail.com
More information about the Dev
mailing list