Google Summer of Code 2013: The Centre for Computational Medicine

Parallelization of Queries in MedSavant

by Miroslav Cupak for The Centre for Computational Medicine

MedSavant is a search engine for genomic variants. It is a client-server application whose server is mainly dedicated to fetching results from Infobright Community Edition, a specialized, SQL-based, single-threaded database. Because the database is single-threaded, a query on a very large dataset can be sped up by dividing the dataset into pieces called shards, running the query on each shard in parallel, and assembling the partial results. This project aims to enable this kind of query parallelization in MedSavant.
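The shard-and-assemble idea can be illustrated with a minimal sketch. This is not MedSavant's implementation: the in-memory `VARIANTS` list stands in for a variant table, and the names `make_shards`, `count_matches`, and `parallel_count` are hypothetical helpers introduced only for illustration. In a real deployment each shard would presumably live in its own database instance, and the per-shard function would issue an SQL query instead of scanning a list.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-in for a variant table:
# (chromosome, position, gene).
VARIANTS = [
    ("chr1", 1005, "GENE_A"),
    ("chr1", 2040, "GENE_B"),
    ("chr2", 3300, "GENE_A"),
    ("chr2", 4100, "GENE_C"),
    ("chr3", 5200, "GENE_A"),
]


def make_shards(rows, shard_count):
    """Split rows into shard_count pieces by round-robin assignment."""
    return [rows[i::shard_count] for i in range(shard_count)]


def count_matches(shard, gene):
    """Per-shard query: count the variants annotated with the given gene."""
    return sum(1 for _, _, g in shard if g == gene)


def parallel_count(rows, gene, shard_count=3):
    """Run the per-shard query concurrently, then sum the partial counts."""
    shards = make_shards(rows, shard_count)
    with ThreadPoolExecutor(max_workers=shard_count) as pool:
        partial_counts = list(pool.map(lambda s: count_matches(s, gene), shards))
    return sum(partial_counts)


if __name__ == "__main__":
    print(parallel_count(VARIANTS, "GENE_A"))
```

A count is easy to assemble because partial counts simply add up. Other query types, such as sorted or limited result sets, need a more careful merge step when the per-shard results are combined.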