[ensembl-dev] release 105 MySQL protein domain data severely truncated

Eric Engelhard eric.engelhard at regeneron.com
Thu Jan 20 16:24:13 GMT 2022

Hello Ensembl team,

I am working from the Ensembl MySQL downloads and have found that the protein domain data in
ftp://ftp.ensembl.org/pub/current_mysql/homo_sapiens_core_105_38/ and
ftp://ftp.ensembl.org/pub/current_mysql/mus_musculus_core_105_39/ is severely truncated.
This directly affects the Core Perl API, which can only extract domain information for about 20 proteins.
Specifically, the interpro.txt.gz file in each data set is empty. I am still comparing the sizes of the
other files against release 104.
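
For anyone who wants to repeat the size comparison on their own mirror, here is a minimal sketch of the kind of check described above. It assumes both releases have already been downloaded into two local directories; the function names, the directory arguments, and the 50% shrink threshold are hypothetical choices for illustration and are not part of any Ensembl tooling.

```python
import gzip
from pathlib import Path


def is_empty_gzip(path):
    """Return True if the gzip file decompresses to zero bytes."""
    with gzip.open(path, "rb") as handle:
        return handle.read(1) == b""


def compare_dumps(old_dir, new_dir, shrink_ratio=0.5):
    """Yield (name, old_size, new_size, reason) for suspicious table dumps.

    old_dir and new_dir are local copies of the same database from two
    releases. shrink_ratio is an arbitrary threshold (an assumption):
    a file is reported as "shrunk" when its compressed size falls below
    that fraction of the older file's compressed size.
    """
    old_dir, new_dir = Path(old_dir), Path(new_dir)
    for old_file in sorted(old_dir.glob("*.txt.gz")):
        new_file = new_dir / old_file.name
        old_size = old_file.stat().st_size
        if not new_file.exists():
            yield old_file.name, old_size, None, "missing"
            continue
        new_size = new_file.stat().st_size
        # A gzip file with no content still has a header, so test the
        # decompressed stream rather than the on-disk size.
        if is_empty_gzip(new_file) and not is_empty_gzip(old_file):
            yield old_file.name, old_size, new_size, "empty"
        elif new_size < old_size * shrink_ratio:
            yield old_file.name, old_size, new_size, "shrunk"
```

Calling list(compare_dumps("path/to/release_104_dump", "path/to/release_105_dump")) with your own local paths lists every table dump that is missing, empty, or much smaller in the newer directory.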

Is this a known issue?


