[ensembl-dev] Bad data in MySQL dumps

Nathan Johnson njohnson at ebi.ac.uk
Mon Apr 11 15:17:33 BST 2011


Hi again Fedor

It seems gzip has problems inspecting this file (so the gunzip -l
output is unreliable), but it does unpack successfully.  The file
itself unpacks to ~330GB, which is more than the >100GB you have free,
hence the 'No space left on device' error.
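
A likely explanation for the odd gunzip -l figures, offered as a sketch
rather than a confirmed diagnosis: the gzip format (RFC 1952) stores the
uncompressed length in a 4-byte trailer field, ISIZE, modulo 2^32, so for
any file over 4GB the listed size and ratio wrap around.  The Python below
reads that field and, separately, counts the real size by streaming the
decompressed data without writing anything to disk.  The function names
and the example path are illustrative assumptions, not part of any
Ensembl tooling.

```python
import gzip
import os
import struct


def isize_from_trailer(path):
    """Return ISIZE: the last 4 bytes of a gzip file, little-endian.

    This is the uncompressed length modulo 2**32 (of the last member,
    if the file holds several), which is what `gunzip -l` reports.
    """
    with open(path, "rb") as fh:
        fh.seek(-4, os.SEEK_END)
        return struct.unpack("<I", fh.read(4))[0]


def true_uncompressed_size(path, chunk_size=1 << 20):
    """Count the decompressed bytes by streaming; nothing is written out."""
    total = 0
    with gzip.open(path, "rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    return total


# Example (hypothetical local path; streaming a file this large is slow):
# print(isize_from_trailer("result_feature.txt.gz"))
# print(true_uncompressed_size("result_feature.txt.gz"))
```

If this is what is happening, the 1712567965 reported below would be the
true size modulo 4294967296, i.e. true size = 1712567965 + k * 4294967296
for some whole number k; k = 77, for instance, gives 332425049757 bytes,
which is in the ~330GB range.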

As I said previously, from v62 onwards this data will be hosted outside
the DB for performance and practical reasons.

Nath

On 9 Apr 2011, at 09:39, Fedor Gusev wrote:

> Basically, it says:
>  gzip: result_feature.txt: No space left on device
> And I have >100GB left on this partition.
>
> gunzip -l reports this:
>        compressed        uncompressed  ratio uncompressed_name
>        11304575360          1712567965 -560.1% result_feature.txt
> Seems kinda weird.
>
>
> On Fri, Apr 8, 2011 at 4:42 PM, Nathan Johnson <njohnson at ebi.ac.uk>  
> wrote:
>> Hi Fedor
>>
>> Can you be more specific when you say 'fails to unpack'? What  
>> happens exactly?
>>
>> I suspect what you are seeing is that it is taking a long time to  
>> unpack as these files are particularly large.  Also, please see the  
>> README file for more information on how to handle the  
>> result_feature imports, as they require some non-standard  
>> parameters to handle the BLOB data.
>>
>> Thanks
>>
>>
>> Nath
>>
>> On 3 Apr 2011, at 12:14, Fedor Gusev wrote:
>>
>>> Hello everyone.
>>>
>>> I have some problems with the file
>>>  ftp://ftp.ensembl.org/pub/release-61/mysql/homo_sapiens_funcgen_61_37f/result_feature.txt.gz
>>>
>>> It is not present in the CHECKSUMS file and fails to unpack. Can you
>>> please check on this?
>>>
>>> --
>>> Kind regards,
>>> Fedor Gusev.
>>>
>>
>> Nathan Johnson
>> Senior Scientific Programmer
>> Ensembl Regulation
>> European Bioinformatics Institute
>> Wellcome Trust Genome Campus
>> Hinxton
>> Cambridge CB10 1SD
>>
>> http://www.ensembl.info/
>> http://twitter.com/#!/ensembl
>>
>
> -- 
> Kind regards,
> Fedor Gusev.




