MiStrVar
MiStrVar (Micro Structural Variant Caller) is a computational tool for detecting micro structural variations based on one-end anchors (OEA) from paired-end Whole Genome Sequencing (WGS) reads.
How do I get MiStrVar?
Just clone our repository and issue the make
command:
git clone --recursive https://git@bitbucket.org/compbio/mistrvar.git
cd mistrvar/pipeline && make
You can also download the latest version of MiStrVar directly from
the Downloads link.
Note: You will need at least g++ 4.7 and Python 2.7 to compile the source code.
How do I run MiStrVar?
You can use
mistrvar/pipeline/mistrvar.py -h
to get a description of each parameter. For more details, please check doc/MiStrVar_manual.pdf.
Simulation Datasets Used for Evaluation
Please check this link to download the simulation datasets that we used for evaluating MiStrVar. The folder contains 2 data files:
- SV.fastq.gz: a 3.3GB gzipped fastq file
- chr1.fa: reference genome for chromosome 1 from GRCh37.
We also provide a checksum file md5.sum for checking file integrity.
To run MiStrVar on this dataset, type
./mistrvar.py -p my_project -r chr1.fa --files fastq=SV.fastq.gz
Feel free to change my_project to any name you want. Remember to specify the paths correctly for chr1.fa and SV.fastq.gz if you download these 2 files to a folder other than the one you run MiStrVar. The VCF file with the prediction results will be generated in my_project.
Feel free to drop any inquiry to agawrons at sfu dot ca.