[ensembl-dev] MySQL query to retrieve protein annotation

Fri Feb 24 13:47:14 GMT 2017

Hi Charles,

Could you please include a link to the display you are looking at?

The data you are retrieving from the MySQL database is displayed on the 
'Domains and Features' tab, for example
http://www.ensembl.org/Homo_sapiens/Transcript/Domains?db=core;g=ENSG00000167657;r=19:3959639-3961159;t=ENST00000594894
This displays a description like 'Protein kinase domain', which is the 
protein_feature.hit_description field from the SQL query.

Hope that helps,
mag

On 24/02/2017 13:39, Charles Joseph Murphy wrote:
> Thanks for your suggestion.
>
> Sorry to bother with another question, but I’ve spent a couple hours 
> trying to figure this out and wonder if either of you could help me 
> with something else. How might I go about getting the descriptions of 
> the protein annotations. For example, for PFAM domains I want what is 
> displayed on the Ensembl website. E.g. “Pkinase” instead of PF00069.
>
> Thanks!
>
>> On Feb 23, 2017, at 04:30, Michal Dabrowski 
>> <m.dabrowski at nencki.gov.pl <mailto:m.dabrowski at nencki.gov.pl>> wrote:
>>
>> How about removing the table "translation" in the FROM part of your 
>> query?
>> Or adding sth like " AND translation.translation_id='291089’ " to the 
>> WHERE part.
>> Now it seems you will get all the entries from the "translation" table.
>> m.
>>
>> Michal Dabrowski
>> Laboratory of Bioinformatics
>> Nencki Institute
>> Pasteur 3 Str
>> 02-093 Warsaw, Poland
>> phone: 48 22 58 92 575
>>
>>
>> On Feb 22, 2017, at 5:43 PM, Charles Joseph Murphy wrote:
>>
>>> Hi,
>>>
>>> I’ am trying to use the Ensembl public MySQL query to retrieve 
>>> protein annotation for all of a genome’s genes. While trying to 
>>> figure out the right MySQL queries to make, I’ am testing it out 
>>> with just one transcript’s translation (ENSMUST00000120187):
>>>
>>> mysql> use mus_musculus_core_75_38;
>>> mysql> SELECT protein_feature.hit_name, protein_feature.hit_start, 
>>> protein_feature.hit_end, protein_feature.seq_start, 
>>> protein_feature.seq_end, protein_feature.evalue, 
>>> protein_feature.perc_ident, protein_feature.hit_description, 
>>> protein_feature.translation_id FROM protein_feature, translation 
>>> WHERE protein_feature.translation_id = '291089’;
>>>
>>> But this returns around 1.4 million entries. What might be the best 
>>> way to get all the protein annotation for ENSMUST00000120187? I’ am 
>>> aware of BioMart and the Perl API, but would rather query via MySQL. 
>>> I’ am continuing to read into the InterProScan pipeline and Ensembl 
>>> documentation, but am not quite sure yet how to go about this.
>>>
>>> Thanks for any help!
>>> Charlie
>>> _______________________________________________
>>> Dev mailing list Dev at ensembl.org <mailto:Dev at ensembl.org>
>>> Posting guidelines and subscribe/unsubscribe info: 
>>> http://lists.ensembl.org/mailman/listinfo/dev 
>>> <https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ensembl.org_mailman_listinfo_dev&d=DwMF-g&c=lb62iw4YL4RFalcE2hQUQealT9-RXrryqt9KZX2qu2s&r=O3yXKBF_L8Fov58BXORGXKqPP85pYddrOwCg4PV2BCY&m=r6AnPQHwIIxjPPMd57hredgdCfvDhAjn5pgueJbAAnw&s=H9KJ4D9X5HDFPMb_Zi-YBdhwedbaUpfuIqz1qSDpqh0&e=>
>>> Ensembl Blog: http://www.ensembl.info/ 
>>> <https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ensembl.info_&d=DwMF-g&c=lb62iw4YL4RFalcE2hQUQealT9-RXrryqt9KZX2qu2s&r=O3yXKBF_L8Fov58BXORGXKqPP85pYddrOwCg4PV2BCY&m=r6AnPQHwIIxjPPMd57hredgdCfvDhAjn5pgueJbAAnw&s=81t5xwFAfijclq_J5KpIknzBPxqqbJvgKSHMxehclZs&e=>
>>
>
>
>
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20170224/e96a6a78/attachment.html>