Minimac4
Minimac4 is a lower-memory and more computationally efficient implementation of the genotype imputation algorithms in minimac/minimac2/minimac3.
A prebuilt Linux x86 executable is available at https://github.com/statgen/Minimac4/releases/latest by downloading and running minimac4-{version}-Linux-x86_64.sh. Alternatively, you can build from source.
Minimac4 requires GCC >= v5 or a compiler with full C++11 support. Automatic installation of Minimac4 requires cget and CMake >= v3.2.
The simplest way to build the latest Minimac4 and its dependencies is to use cget:
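As a minimal sketch, the cget route can be wrapped in a small shell function so that nothing is installed until it is called. The function name and the example prefix are hypothetical; statgen/Minimac4 is the GitHub repository named in the release URL above, and the --prefix option is assumed to behave as described in the cget documentation.

```shell
# Hypothetical helper: build and install Minimac4 plus dependencies via cget.
# Defining the function has no side effects; call it to start the install.
install_minimac4_cget() {
    prefix="$1"   # installation prefix chosen by the user
    cget install --prefix "$prefix" statgen/Minimac4
}

# Example call (the prefix is a placeholder):
#   install_minimac4_cget "$HOME/minimac4-install"
```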
Alternatively, you can build manually:
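One possible sequence for the manual build is sketched below. It assumes a checkout of the repository in ./Minimac4, a requirements.txt at its root, and cget's default local prefix (./cget), which is where the toolchain file path comes from. Treat the paths as assumptions and adjust them to your checkout.

```shell
# Hypothetical helper; the subshell body "( ... )" keeps the cd calls from
# changing the caller's working directory.
build_minimac4_manually() (
    cd Minimac4 &&
    cget install -f ./requirements.txt &&                        # install dependencies locally
    mkdir -p build && cd build &&                                # out-of-source build directory
    cmake -DCMAKE_TOOLCHAIN_FILE=../cget/cget/cget.cmake .. &&   # configure with dependency paths
    make &&                                                      # build
    make install                                                 # install
)
```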
To build and run tests from build directory:
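A sketch of the test workflow, meant to be run from inside the build directory. The BUILD_TESTS option name is an assumption that should be checked against the project's CMakeLists.txt; CTEST_OUTPUT_ON_FAILURE is the standard CTest variable that prints the output of failing tests.

```shell
# Hypothetical helper; call it from the build directory.
run_minimac4_tests() (
    # Assumption: the project exposes a BUILD_TESTS CMake option.
    cmake -DCMAKE_TOOLCHAIN_FILE=../cget/cget/cget.cmake -DBUILD_TESTS=ON .. &&
    make &&
    make CTEST_OUTPUT_ON_FAILURE=1 test   # show output only for failing tests
)
```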
Since some users have reported issues with installing cget with pip, a cmake-only alternative for installing dependencies is available:
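The cmake-only route might look like the following, run from the repository root. It assumes the repository ships a dependencies.cmake script that takes the dependency install directory as its argument; the deps/ directory name is a placeholder.

```shell
# Hypothetical helper; the subshell body keeps the cd local to the function.
build_minimac4_without_cget() (
    cmake -P dependencies.cmake deps/ &&   # assumed script: fetch and build dependencies into deps/
    mkdir -p build && cd build &&
    cmake -DCMAKE_PREFIX_PATH="$(pwd)/../deps/" \
          -DCMAKE_CXX_FLAGS="-I$(pwd)/../deps/include" .. &&
    make &&
    make install
)
```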
See minimac4 --help for detailed usage.
A typical Minimac4 command line for imputation is as follows:
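The sketch below spells such a command out with shell variables and only prints it, so it is safe to run without Minimac4 installed. The positional argument order (reference panel, then target) and the use of standard output for the imputed data are assumptions; confirm them with minimac4 --help.

```shell
# Placeholder file names; substitute your own paths.
REF=reference.msav      # reference panel in MVCF encoding
TARGET=target.vcf.gz    # indexed VCF with phased array genotypes
OUT=imputed.sav         # imputed output

# Assumption: reference and target are positional arguments and the result
# is written to standard output.
IMPUTE_CMD="minimac4 $REF $TARGET"

# Dry run: print the full command line instead of executing it.
echo "$IMPUTE_CMD > $OUT"
```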
Here reference.msav is a reference panel (e.g. a 1000 Genomes download) compressed with MVCF encoding, target.vcf.gz is an indexed VCF containing phased genotype array data, and imputed.sav is the imputed output.
The file formats VCF, BCF, and SAV are supported for both input and output:
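For illustration, the loop below prints one command per target format without running anything. It assumes that -o names the output file and that the output format is inferred from the file extension; the file names are placeholders.

```shell
REF=reference.msav

for TARGET in target.bcf target.vcf.gz; do
    # Reuse the target's extension for the output (bcf or vcf.gz).
    EXT="${TARGET#target.}"
    # Dry run: print the command. Assumption: -o selects the output file.
    echo "minimac4 $REF $TARGET -o imputed.$EXT"
done
```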
A sites-only file can be generated with:
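A hedged sketch of such a call, printed rather than executed. The -s option (for the sites file) and -o option are assumptions to verify with minimac4 --help, and all file names are placeholders.

```shell
REF=reference.msav
TARGET=target.bcf
OUT=imputed.sav
SITES=imputed.sites.vcf.gz   # variant records without per-sample data

# Assumption: -s names the sites-only output alongside the main output (-o).
SITES_CMD="minimac4 $REF $TARGET -o $OUT -s $SITES"

# Dry run: print the command instead of executing it.
echo "$SITES_CMD"
```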
Meta-imputation with MetaMinimac2 requires --empirical-output (or -e) to be specified:
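As a sketch, a command producing both the regular and the empirical dosage files could be composed as follows. That -e takes an output file name, and the file names themselves, are assumptions; the command is only printed.

```shell
DOSE=imputed.dose.sav                  # regular imputed dosages
EMPIRICAL=imputed.empirical_dose.sav   # empirical dosages for MetaMinimac2

# Assumption: -e (--empirical-output) is followed by the output file name.
META_CMD="minimac4 reference.msav target.bcf -o $DOSE -e $EMPIRICAL"

# Dry run: print the command instead of executing it.
echo "$META_CMD"
```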
If an M3VCF file is already available, it can be converted to the new MVCF format with:
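A possible conversion command is sketched below and only printed. The --update-m3vcf option name and the use of standard output for the converted panel are assumptions to check against minimac4 --help; the file names are placeholders.

```shell
M3VCF=reference.m3vcf.gz   # existing M3VCF reference panel
MSAV=reference.msav        # converted panel in MVCF encoding

# Assumption: --update-m3vcf reads the M3VCF and writes MVCF to standard output.
UPDATE_CMD="minimac4 --update-m3vcf $M3VCF"

# Dry run: print the command with its redirect instead of executing it.
echo "$UPDATE_CMD > $MSAV"
```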
Otherwise, phased VCFs containing the reference haplotypes can be compressed into an MVCF with:
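A minimal sketch of the compression step, again printed rather than run. The --compress-reference option name and the output via standard output are assumptions; the input is shown as a bgzipped VCF, though BCF or SAV input is expected to work the same way.

```shell
PHASED=reference.vcf.gz   # phased reference haplotypes (placeholder name)
MSAV=reference.msav       # compressed panel in MVCF encoding

# Assumption: --compress-reference writes the MVCF to standard output.
COMPRESS_CMD="minimac4 --compress-reference $PHASED"

# Dry run: print the command with its redirect instead of executing it.
echo "$COMPRESS_CMD > $MSAV"
```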