[ensembl-dev] Regulatory Features by CellType

Fiona Nielsen fnielsen at cmbi.ru.nl
Tue Apr 12 13:45:41 BST 2011


Still digging into the Ensembl Regulatory Features...

I am trying to retrieve the regulatory features defined on/by multiple
cell lines, e.g. I am trying to retrieve a dataset similar to the
BioMart query of:
-
Homo sapiens features (GRCh37.p2)
Filters
 Feature Type : Gene Associated,Non-Gene Associated,Promoter
Associated,RegulatoryFeature,Unclassified
 Cell Type : MultiCell
Attributes
 Feature Set
 Feature Type
 Chromosome Name
 Start (bp)
 End (bp)
 Cell Type
-

However, the Feature Set Adaptor requires a CellType object to specify
the cell type, and the CellTypeAdaptor does not work with the name
'MultiCell':

    my $ct_adaptor = $efg_db->get_CellTypeAdaptor();
    my $ct = $ct_adaptor->fetch_by_name('HeLa'); # does not work with
'MultiCell'
    my @rf_fsets = @{$fset_adaptor->fetch_all_by_CellType($ct)};

    foreach my $rf_fset(@rf_fsets){
 	$returnstring .= $rf_fset->name.",";
    }

How then is the best way to retrieve the regulatory features from Cell
Type = 'MultiCell'?
Is there a function that returns all possible Cell Type names? (by the
hypothesis that these features might be named differently in the
database)

Next, if I want only a subset of the results above, e.g. only the
Feature Type = 'Promoter Associated', do I then have to sort through
the result myself, or is there another way to specify both of these
filters through the API?


All suggestions are appreciated,

Thanks,
-Fiona-




More information about the Dev mailing list