[ensembl-dev] question about chain-net results in table 'genomic_align'
Zhang Di
aureliano.jz at gmail.com
Sun Dec 4 13:11:19 GMT 2011
Hi,
Finally I got the compara pipeline for whole genome alignment to work.
The results of RAW, CHAIN, and NET are all stored in table
'genomic_align' distinguished by different method_link_species_id.
I found that the some records of CHAIN and NET, contain a few base
pairs belong to gap region of its scaffold.
e.g.
mysql> select method_link_species_set_id, dnafrag_id, dnafrag_start,
dnafrag_end from genomic_align where dnafrag_id = 4465 and
dnafrag_start=486;
+----------------------------+------------+---------------+-------------+
| method_link_species_set_id | dnafrag_id | dnafrag_start | dnafrag_end |
+----------------------------+------------+---------------+-------------+
| 2 | 4465 | 486 | 567 |
| 3 | 4465 | 486 | 567 |
+----------------------------+------------+---------------+-------------+
while for the dnafrag_id = 4465 , in my core database it is scaffold_2621 ,
seq_region_id = 429785:
mysql> select * from assembly where asm_seq_region_id = 429785;
+-------------------+-------------------+-----------+---------+-----------+---------+-----+
| asm_seq_region_id | cmp_seq_region_id | asm_start | asm_end | cmp_start |
cmp_end | ori |
+-------------------+-------------------+-----------+---------+-----------+---------+-----+
| 429785 | 181573 | 488 | 717 | 1 |
230 | -1 |
| 429785 | 191688 | 1 | 419 | 1 |
419 | 1 |
| 429785 | 220761 | 718 | 1086 | 1 |
369 | 1 |
+-------------------+-------------------+-----------+---------+-----------+---------+-----+
the 420 - 487 interval is a gap.
Is this normal result of CHAIN-NET ?
It is quite annoying because I want to use the compara_db for low coverage
gene build, and It will complain:
EXCEPTION:
Could not find sequence-level pieces for scaffold_2621/486-744
Best reguards
--
Zhang Di
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.ensembl.org/pipermail/dev_ensembl.org/attachments/20111204/c4273e42/attachment.html>
More information about the Dev
mailing list