[ensembl-dev] Why are ncRNA separate from cDNA?

Charles Joseph Murphy chm2059 at med.cornell.edu
Fri May 5 18:11:15 BST 2017


To be more specific, why are these two files separate? Why not just have one FASTA file? I’ am asking this question because I’ am working with another individual on a python package for downloading/managing Ensembl data (https://github.com/hammerlab/pyensembl)

ftp://ftp.ensembl.org/pub/release-88/fasta/homo_sapiens/cdna//Homo_sapiens.GRCh38.cdna.all.fa.gz

ftp://ftp.ensembl.org/pub/release-88/fasta/homo_sapiens/ncrna//Homo_sapiens.GRCh38.ncrna.fa.gz




On May 5, 2017, at 13:06, Charles Joseph Murphy <chm2059 at med.cornell.edu<mailto:chm2059 at med.cornell.edu>> wrote:

Hi,

Just out of curiosity, why are the cDNA and ncRNA sequences in separate FASTA files? Is this due to each set of transcripts being produced via different computational pipelines?

Charlie
_______________________________________________
Dev mailing list    Dev at ensembl.org<mailto:Dev at ensembl.org>
Posting guidelines and subscribe/unsubscribe info: https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ensembl.org_mailman_listinfo_dev&d=DwICAg&c=lb62iw4YL4RFalcE2hQUQealT9-RXrryqt9KZX2qu2s&r=O3yXKBF_L8Fov58BXORGXKqPP85pYddrOwCg4PV2BCY&m=B40uzJI94tMQJCoiCjo2YDtFRCEF4iO0NmS-d5N4NTs&s=Cf-GoJ6S_a79LVttuidb-CoHo7kO3MFlLOmVA7MIDBg&e=
Ensembl Blog: https://urldefense.proofpoint.com/v2/url?u=http-3A__www.ensembl.info_&d=DwICAg&c=lb62iw4YL4RFalcE2hQUQealT9-RXrryqt9KZX2qu2s&r=O3yXKBF_L8Fov58BXORGXKqPP85pYddrOwCg4PV2BCY&m=B40uzJI94tMQJCoiCjo2YDtFRCEF4iO0NmS-d5N4NTs&s=5oDdpJxBYhEXclMvPVyBi8CUBgHL8PzI3kS0OvFKsWA&e=

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20170505/66a15e60/attachment.html>


More information about the Dev mailing list