[ensembl-dev] Species metadata file ("assembly" or "assembly_accession" labels) to species ftp paths mapping
Stefano Giorgetti
sgiorgetti at ebi.ac.uk
Thu Jul 10 11:58:12 BST 2025
Dear Allan,
Thanks for your email and for using Ensembl services.
We have two main cases: stand-alone species (for instance, all the plants)
and species sharing a "collection DB", like bacteria.
For the stand-alone species - say plants - the path to the (release 60)
GTF would be
http://ftp.ensemblgenomes.org/pub/plants/release-60/gtf/<species>/
where <species> can be found from one of the species metadata files.
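As a rough illustration of that lookup, here is a minimal Python sketch that finds the metadata row for a given "assembly_accession". It assumes the metadata file is tab-separated with a header line starting with "#", and that it has columns named "species", "assembly_accession" and "core_db"; the function name find_species_row is hypothetical, and the column names should be checked against the actual file.

```python
import csv
from typing import Iterable, Optional


def find_species_row(lines: Iterable[str], assembly_accession: str) -> Optional[dict]:
    """Return the first metadata row matching assembly_accession, or None.

    Assumption: tab-separated text whose first non-empty line is a header
    such as '#name<TAB>species<TAB>...' (column names are not confirmed here).
    """
    header = None
    for fields in csv.reader(lines, delimiter="\t"):
        if not fields:
            continue  # skip blank lines
        if header is None:
            # Drop the leading '#' from the first header name.
            header = [name.lstrip("#").strip() for name in fields]
            continue
        # zip() silently ignores any surplus trailing field on a data row.
        row = dict(zip(header, fields))
        if row.get("assembly_accession") == assembly_accession:
            return row
    return None
```

The function takes any iterable of text lines, so it can be fed an open file handle for a locally downloaded copy of the metadata file; the "species" value of the returned row would then be the <species> path component.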
For species belonging to a collection - say bacteria - the path to the
(release 60) GTF would be
http://ftp.ensemblgenomes.org/pub/release-60/bacteria/gtf/<collection>/<species>/
Regrettably, there is no trivial way to get the collection the species
belongs to.
One hopefully not-too-cumbersome way would be to extract it from the
"core_db" field of the species metadata file.
For instance, for "acetobacter_syzygii_gca_002276805" we have the core db
"bacteria_60_collection_core_60_113_1", so the collection name would be
"bacteria_60_collection", thus giving
http://ftp.ensemblgenomes.org/pub/release-60/bacteria/gtf/bacteria_60_collection/acetobacter_syzygii_gca_002276805/
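The extraction described above could be sketched in Python roughly as follows. This is only an illustration: the function names are hypothetical, the regular expression assumes collection core db names always look like "<collection>_core_<n>_<n>_<n>", and the two URL templates are simply generalised from the plants and bacteria examples in this email, so other divisions may need checking.

```python
import re
from typing import Optional

BASE = "http://ftp.ensemblgenomes.org/pub"

# Assumed naming pattern: '<something>_collection_core_<n>_<n>_<n>'.
_COLLECTION_RE = re.compile(r"^(?P<collection>.+_collection)_core_\d+_\d+_\d+$")


def collection_from_core_db(core_db: str) -> Optional[str]:
    """Return the collection name embedded in core_db, or None if stand-alone."""
    match = _COLLECTION_RE.match(core_db)
    return match.group("collection") if match else None


def gtf_url(division: str, release: int, species: str, core_db: str) -> str:
    """Build the GTF directory URL using the two layouts given in this email."""
    collection = collection_from_core_db(core_db)
    if collection is None:
        # Stand-alone species (e.g. plants): no collection directory.
        return f"{BASE}/{division}/release-{release}/gtf/{species}/"
    # Species stored in a collection DB (e.g. bacteria).
    return f"{BASE}/release-{release}/{division}/gtf/{collection}/{species}/"
```

For the example above, gtf_url("bacteria", 60, "acetobacter_syzygii_gca_002276805", "bacteria_60_collection_core_60_113_1") would reproduce the URL given in this email.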
Hope it helps.
Any questions, please do not hesitate to ask.
Kind regards,
Stefano on behalf of the Ensembl team

On 10/07/2025 6:49 am, Allan Kamau wrote:
> Greetings,
>
> Given an entry from one of the species metadata files such as
> "ftp.ensemblgenomes.org/pub/plants/release-60/species_EnsemblPlants.txt"
> I would like to determine the ftp path to the "gtf" data of the given
> species.
>
> Is there such a mapping file or mechanism that I can use?
>
> Or, in short, if I have an "assembly" value such as "ASM16007v2" or
> an "assembly_accession" label, for example "GCA_000160075.2", is there a
> way to determine the ftp path to the gtf data, which is
> "ftp.ensemblgenomes.org/pub/release-60/bacteria/gtf/bacteria_118_collection/abiotrophia_defectiva_atcc_49176_gca_000160075"
> in this case?
>
> Regards,
> - Allan.
>
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: https://lists.ensembl.org/mailman/listinfo/dev_ensembl.org
> Ensembl Blog: http://www.ensembl.info/