[ensembl-dev] Fwd: Re: 1000 Genomes SNPS

Andrea Edwards edwardsa at cs.man.ac.uk
Wed Mar 2 16:21:54 GMT 2011



I used v61

C:\Documents and Settings\Administrator>mysql -h ensembldb.en
ous -P 5306
Welcome to the MySQL monitor.  Commands end with ; or \g.
Your MySQL connection id is 41648834
Server version: 5.1.34-log Source distribution

Type 'help;' or '\h' for help. Type '\c' to clear the buffer.

mysql>  use homo_sapiens_variation_61_37f;
Database changed
mysql>  select distinct name from variation_set;
+--------------------------------------------+

On 02/03/2011 16:17, cj5 at sanger.ac.uk wrote:
>  Hi
>
>  Many thanks Andrea and Pontus for your help with this. May I ask which
>  version of the API you are both using to get your variation sets?
>
>  I am using version 60, and I got a list of available sets by iterating
>  get_all_sub_VariationSets within fetch_all_top_VariationSets, I found the
>  following 1000 genomes sets only:
>  1000 genomes
>           1000 genomes - Low coverage
>           1000 genomes - Trios - YRI
>           1000 genomes - Trios - CEU
>
>  Thanks
>  Chris
>
>>  Hi Chris,
>>
>>  Andrea is right, we have grouped these variations together into various
>>  variation sets. For example, you can get all variations belonging to the
>>  different pilots from the sets '1000 genomes - Low coverage', '1000
>>  genomes
>>  - High coverage - Trios' and '1000 genomes - High coverage exons' for
>>  pilot
>>  1,2 and 3, respectively. You'll need to use the VariationSet and
>>  VariationSetAdaptor modules for this. It is not possible to retrieve the
>>  variations conditional on submission date.
>>
>>  As Andrea points out, if you call the 'get_all_Variations' method on a
>>  VariationSet object, the API will create all variation objects and return
>>  them. For large sets like these, this can easily cause you to run out of
>>  memory but you can use the 'get_Variation_Iterator' method to get an
>>  Iterator object and iterate over the variations instead.
>>
>>  /Pontus
>>
>>
>>
>>  2011/3/2<cj5 at sanger.ac.uk>
>>
>>>  Hi,
>>>  Is it possible using the variations API to get a list of SNPS which have
>>>  been submitted from the 1000 Genomes project?
>>>
>>>  I have a vague idea that it should be possible to retrieve such a list
>>>  using the SS (submission) ID and/or the validation status, however I am
>>>  unsure of the details and what version of the API should be used.
>>>
>>>  The latest 100 genomes pilot release (2010_07) would be great, but any
>>>  earlier release would also be useful.
>>>
>>>  Thanks
>>>  Chris
>>>
>>>
>>>  _______________________________________________
>>>  Dev mailing list
>>>  Dev at ensembl.org
>>>  http://lists.ensembl.org/mailman/listinfo/dev
>>>
>
>
>  _______________________________________________
>  Dev mailing list
>  Dev at ensembl.org
>  http://lists.ensembl.org/mailman/listinfo/dev






More information about the Dev mailing list