[ensembl-dev] Regulatory Features by CellType
Fiona Nielsen
fnielsen at cmbi.ru.nl
Tue Apr 12 13:45:41 BST 2011
Still digging into the Ensembl Regulatory Features...
I am trying to retrieve the regulatory features defined on/by multiple
cell lines, i.e. a dataset similar to the result of this BioMart
query:
-
Homo sapiens features (GRCh37.p2)
  Filters
    Feature Type : Gene Associated,Non-Gene Associated,Promoter Associated,RegulatoryFeature,Unclassified
    Cell Type : MultiCell
  Attributes
    Feature Set
    Feature Type
    Chromosome Name
    Start (bp)
    End (bp)
    Cell Type
-
However, the FeatureSetAdaptor requires a CellType object to specify
the cell type, and the CellTypeAdaptor does not work with the name
'MultiCell':
my $ct_adaptor = $efg_db->get_CellTypeAdaptor();
my $ct = $ct_adaptor->fetch_by_name('HeLa'); # does not work with 'MultiCell'
my @rf_fsets = @{$fset_adaptor->fetch_all_by_CellType($ct)};
foreach my $rf_fset (@rf_fsets) {
    $returnstring .= $rf_fset->name . ",";
}
What, then, is the best way to retrieve the regulatory features for
Cell Type = 'MultiCell'?
Is there a function that returns all possible cell type names? (My
hypothesis is that these features might be named differently in the
database.)
Next, if I want only a subset of the results above, e.g. only the
Feature Type = 'Promoter Associated', do I then have to sort through
the result myself, or is there another way to specify both of these
filters through the API?
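To make the "sort through the result myself" option concrete, here is a minimal sketch of such a manual filter. It uses plain hashes as hypothetical stand-ins for the regulatory features, so the field names (chr, start, end, feature_type) and the helper filter_by_feature_type are assumptions for illustration only, not part of the Ensembl API:

```perl
use strict;
use warnings;

# Hypothetical stand-ins for regulatory features: plain hashes, not real
# Ensembl objects. The field names are assumptions made for this sketch.
my @features = (
    { chr => '1', start => 1000, end => 1500, feature_type => 'Promoter Associated' },
    { chr => '1', start => 4000, end => 4200, feature_type => 'Gene Associated' },
    { chr => '2', start => 300,  end => 900,  feature_type => 'Promoter Associated' },
    { chr => '2', start => 5000, end => 5100, feature_type => 'Unclassified' },
);

# Hypothetical helper: keep only the features whose feature type name
# equals the wanted name.
sub filter_by_feature_type {
    my ($wanted, @feats) = @_;
    return grep { $_->{feature_type} eq $wanted } @feats;
}

my @promoters = filter_by_feature_type('Promoter Associated', @features);

# Print one tab-separated line per retained feature.
print join("\t", @{$_}{qw(chr start end feature_type)}), "\n" for @promoters;
```

With real feature objects the comparison would be made on whatever accessor exposes the feature type name, which is exactly the step I would like to avoid if the API can apply both filters directly.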
All suggestions are appreciated,
Thanks,
-Fiona-