[ensembl-dev] Possible error in grch37 database

Sarah Hunt seh at ebi.ac.uk
Thu Jan 5 17:16:11 GMT 2017


Hi Beat,

Thanks for reporting this one. As you point out, the minor allele should 
be the second most common A at 0.327875 rather than T at 0.00339457. We 
will fix this when we do our next data update.

Best wishes,

Sarah

On 05/01/2017 14:15, Wolf Beat wrote:
> Hi,
>
>
> i'm using ensembl through both the REST API for grch37 and biomart on the old feb2014 archive (because i need a stable archive).
>
>
> I do have a couple of unit tests to verify that my application works correctly, which is why i noticed one change i dont understand.
>
>
> Until now, rs1054487 (G>A) had a MAF of 0.34, in ensembl as well as exac.
>
> Now suddendly on grch37, the MAF changed to 0.0034.
>
>
> This looks suspicously like a parsing error somewhere.
>
>
> This must have been introduced quite recently. You can compare the grch37 and feb2014 versions here:
>
> http://feb2014.archive.ensembl.org/Homo_sapiens/Variation/Explore?r=19:12771665-12772665;v=rs1054487;vdb=variation;vf=839082
>
> http://grch37.ensembl.org/Homo_sapiens/Variation/Explore?r=19:12771665-12772665;v=rs1054487;vdb=variation;vf=757537
>
>
> I know archives of grch37 are not planned. But i would like to repeat my desire for archives of feb2014. Not only to avoid issues like this to randomly appear, but also for a more stable API. There have been for example, to my knowledge, undocumented changes in the REST api for the VEP entry point. Also subtile changes in the biomart API have been made. Having stable versions would help to reduce those types of problems.
>
>
> Kind regards
>
>
> Beat Wolf
>
> _______________________________________________
> Dev mailing list    Dev at ensembl.org
> Posting guidelines and subscribe/unsubscribe info: http://lists.ensembl.org/mailman/listinfo/dev
> Ensembl Blog: http://www.ensembl.info/





More information about the Dev mailing list