[ensembl-dev] Ensembl release 108 is out

Adzhubey, Ivan,PHD iadzhubey at rics.bwh.harvard.edu
Thu Nov 10 02:25:41 GMT 2022


Hi,

It looks like installation for 108_GRCh37 is broken:

$ perl INSTALL.pl -a c -s homo_sapiens -y GRCh37 -c /data/vep --NO_HTSLIB --NO_TEST
 - getting list of available cache files
 - downloading https://ftp.ensembl.org/pub/release-108/variation/indexed_vep_cache/homo_sapiens_vep_108_GRCh37.tar.gz<https://urldefense.proofpoint.com/v2/url?u=https-3A__ftp.ensembl.org_pub_release-2D108_variation_indexed-5Fvep-5Fcache_homo-5Fsapiens-5Fvep-5F108-5FGRCh37.tar.gz&d=DwMGaQ&c=WO-RGvefibhHBZq3fL85hQ&r=fpFYsQD9efuj6qk0TjK1MbUTPI07tTdDii-uM88QGpo&m=uEeAtmcBVmdX_pYTuAZvb2qERdaRosoAWop6hummx2YX5fU9PHe2Eg2QxKw-9-aA&s=aJ4A-kNCHvqTdpd3zD-IdHQPMtNEscAdIhnVin8ADK4&e=>
 - unpacking homo_sapiens_vep_108_GRCh37.tar.gz
 - converting cache, this may take some time but will allow VEP to look up variants and frequency data much faster
 - use CTRL-C to cancel if you do not wish to convert this cache now (you may run convert_cache.pl later)
ERROR: Version 108_GRCh37 not found
WARNING: Failed to run convert script

All done

For some reason, it attempts to run indexing script and fails, even though the caches downloaded were already indexed. Caches are in fact downloaded & installed successfully, or at least it looks like they did. I have not run actual tests yet.

Update: In fact, caches are successfully downloaded, checked and uncompressed into the tmp directory but then deleted for no apparent reason, hence installation failed completely. Probably triggered by  the "ERROR: Version 108_GRCh37 not found", see above but what it means is unclear.

Also, there is no way to install FASTA sequence since it will first remove all cache files installed previously. And if I try the other way round it refuses to download caches *after* FASTA sequence has been downloaded because cache directory already exists.

$ perl INSTALL.pl -a f -s homo_sapiens -y GRCh37 -c /data/vep --NO_HTSLIB --NO_TEST

WARNING: Changing URL for GRCh37
 - downloading Homo_sapiens.GRCh37.75.dna.primary_assembly.fa.gz
 - extracting data
 - attempting to index
Indexing failed - VEP will attempt to index the file the first time you use it

The FASTA file should be automatically detected by the VEP when using --cache or --offline.
If it is not, use "--fasta /data/vep/homo_sapiens/108_GRCh37/Homo_sapiens.GRCh37.75.dna.primary_assembly.fa"


All done

$ perl INSTALL.pl -a c -s homo_sapiens -y GRCh37 -c /data/vep --NO_HTSLIB --NO_TEST
 - getting list of available cache files

WARNING: It looks like you already have the cache for homo_sapiens GRCh37 (v108) installed.

Delete the folder /data/vep/homo_sapiens/108_GRCh37 and re-run INSTALL.pl if you want to re-install
 - skipping homo_sapiens

All done

I had to first move .fa file out of the 108_GRCh37 folder, delete empty folder, re-install caches and then move .fa file back and index it manually. Not a big deal but it took me some time to figure out what's going on. Does not seems to be an intended behavior, so I assume this is due to some bug(s) in INSTALL.pl script.

Thanks,

Ivan

On 10/20/22 06:08, Louisse Paola Mirabueno wrote:
Hi all,

We are pleased to announce that the latest Ensembl and Ensembl Genomes updates have been released!

Highlights of this release include:

  *   Changes in the human default tracks
  *   New genomes added in Ensembl Plants and Ensembl Metazoa
  *   Addition of mitochondrial annotation for Tasmanian devil
  *   New and additional data reported by VEP

The release blog post with all news can be viewed here.


For the latest news from the Ensembl project visit our blog. Thank you all!


All the best,

Louisse (on behalf of the Ensembl team)


_______________________________________________
Dev mailing list    Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://secure-web.cisco.com/1MfCaT5zfp2XxSKssKChihDiw2CGBTnJm4D7hk5KnoTo93SBrKU1rfL2aIFst_Ettkrjt550lltc8X6J89jL2RCVoYJoXCPamtBL2d2D6bPnjVSUbmyPEpdbMQSkwaUc2ZD0rqg0sgq3sYoXurlctqXgp61WfBNMlf-VbQUX2GpY1x8Ag48kbh8fwuSlVK56oDSYgkKVFD9OENVGtQds00VBNxt43rx3FLwO3_FR2vydi1z2s2kcwea9iw-ZD1RbMHSZCKBPZ_IxGH7wEh4UDI9AdAot01ONaqo0r6auO-2kFSQSb5ZdaMMGvr70fqaab_jnVdY5NtPEZ184KgGK5ww/https%3A%2F%2Flists.ensembl.org%2Fmailman%2Flistinfo%2Fdev_ensembl.org<https://urldefense.proofpoint.com/v2/url?u=https-3A__secure-2Dweb.cisco.com_1MfCaT5zfp2XxSKssKChihDiw2CGBTnJm4D7hk5KnoTo93SBrKU1rfL2aIFst-5FEttkrjt550lltc8X6J89jL2RCVoYJoXCPamtBL2d2D6bPnjVSUbmyPEpdbMQSkwaUc2ZD0rqg0sgq3sYoXurlctqXgp61WfBNMlf-2DVbQUX2GpY1x8Ag48kbh8fwuSlVK56oDSYgkKVFD9OENVGtQds00VBNxt43rx3FLwO3-5FFR2vydi1z2s2kcwea9iw-2DZD1RbMHSZCKBPZ-5FIxGH7wEh4UDI9AdAot01ONaqo0r6auO-2D2kFSQSb5ZdaMMGvr70fqaab-5FjnVdY5NtPEZ184KgGK5ww_https-253A-252F-252Flists.ensembl.org-252Fmailman-252Flistinfo-252Fdev-5Fensembl.org&d=DwMGaQ&c=WO-RGvefibhHBZq3fL85hQ&r=fpFYsQD9efuj6qk0TjK1MbUTPI07tTdDii-uM88QGpo&m=uEeAtmcBVmdX_pYTuAZvb2qERdaRosoAWop6hummx2YX5fU9PHe2Eg2QxKw-9-aA&s=1mreefmWGvwdHdB1P8S3rK5vhe6LYBUsVdYanbEdCs4&e=>
Ensembl Blog: http://secure-web.cisco.com/1VbKctd9cZfiXxK3C-nGWNflK4RzmFkZ7pj9CCe0hgVCMkR2c_vVIL89SiWqnHgyAVW4LNnNPnjgwc8OSVqcj-k1L2-SAeY1QBws931eP6dPUoJxf3_gaaFGOCZZ4LSwp4mmNvVvLDGlPF6UHA5Bn9wtfzK0k1msvRUt864sNIQ88lMDxJUvPRp5B66MQn3roy5yWXO_U4XqoRko8bY6TB11Rv4VaODGGQXC9nFI9iR-ptOo2bQanwY57C1w7LDqB2RCt-pYH2OfxT615QUhsS4eYen5bCMWTprX-LaYsJnLNFdRML8fKAwW3JaGDlykFiQ1noWYpNjWAf48d0PJp2A/http%3A%2F%2Fwww.ensembl.info%2F<https://urldefense.proofpoint.com/v2/url?u=http-3A__secure-2Dweb.cisco.com_1VbKctd9cZfiXxK3C-2DnGWNflK4RzmFkZ7pj9CCe0hgVCMkR2c-5FvVIL89SiWqnHgyAVW4LNnNPnjgwc8OSVqcj-2Dk1L2-2DSAeY1QBws931eP6dPUoJxf3-5FgaaFGOCZZ4LSwp4mmNvVvLDGlPF6UHA5Bn9wtfzK0k1msvRUt864sNIQ88lMDxJUvPRp5B66MQn3roy5yWXO-5FU4XqoRko8bY6TB11Rv4VaODGGQXC9nFI9iR-2DptOo2bQanwY57C1w7LDqB2RCt-2DpYH2OfxT615QUhsS4eYen5bCMWTprX-2DLaYsJnLNFdRML8fKAwW3JaGDlykFiQ1noWYpNjWAf48d0PJp2A_http-253A-252F-252Fwww.ensembl.info-252F&d=DwMGaQ&c=WO-RGvefibhHBZq3fL85hQ&r=fpFYsQD9efuj6qk0TjK1MbUTPI07tTdDii-uM88QGpo&m=uEeAtmcBVmdX_pYTuAZvb2qERdaRosoAWop6hummx2YX5fU9PHe2Eg2QxKw-9-aA&s=rJjUQRqyLU_7Yj7UMCRduq1RkJ6QvM3U5DQd4pTM_NA&e=>


On 10/20/22 06:08, Louisse Paola Mirabueno wrote:
Hi all,

We are pleased to announce that the latest Ensembl and Ensembl Genomes updates have been released!

Highlights of this release include:

  *   Changes in the human default tracks
  *   New genomes added in Ensembl Plants and Ensembl Metazoa
  *   Addition of mitochondrial annotation for Tasmanian devil
  *   New and additional data reported by VEP

The release blog post with all news can be viewed here.


For the latest news from the Ensembl project visit our blog. Thank you all!


All the best,

Louisse (on behalf of the Ensembl team)

--
Ivan Adzhubey, Ph.D.
Research Associate
Dept of Biomedical Informatics
Harvard Medical School
10 Shattuck Street, Suite 514
Boston, MA 02115
tel: (617) 432-2144
fax: (617) 432-0693
web: http://genetics.bwh.harvard.edu/wiki/sunyaevlab/
The information in this e-mail is intended only for the person to whom it is addressed.  If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Mass General Brigham Compliance HelpLine at https://www.massgeneralbrigham.org/complianceline <https://www.massgeneralbrigham.org/complianceline> .
Please note that this e-mail is not secure (encrypted).  If you do not wish to continue communication over unencrypted e-mail, please notify the sender of this message immediately.  Continuing to send or respond to e-mail after receiving this message means you understand and accept this risk and wish to continue to communicate over unencrypted e-mail. 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20221110/e4d45f75/attachment-0001.html>


More information about the Dev mailing list