Movi indexes

Movi (GitHub repo) is an efficient and scalable approach for indexing and querying pangenomes. It uses the move structure of Nishimoto and Tabei as its core, cache-efficient index structure.

Indexes below are compatible with Movi 2. They include threshold information and are intended for use with Movi’s default regular-thresholds mode.

The following table lists each complete .tar.gz archive, then the individual component files contained in that archive.

Index / file Size HTTPS
HPRC Release 2, 466 human haplotypes (complete .tar.gz) 25.8 GB .tar.gz
Link above is recommended; links below are its individual components    
build.log 3.4 KB .log
index.movi 48.4 GB .movi
movi.pml.nulldb 1.1 MB .pml.nulldb
movi.zml.nulldb 1.1 MB .zml.nulldb
null_reads.fasta 157 KB .fasta
ref.list 17 KB .list
HPRC Release 1, 96 human haplotypes (complete .tar.gz) 18.8 GB .tar.gz
Link above is recommended; links below are its individual components    
CMakeLists.txt 3.2 KB .txt
build.log 10.1 KB .log
index.html 141 B .html
index.movi 37.7 GB .movi
movi.pml.nulldb 1.1 MB .pml.nulldb
movi.zml.nulldb 1.1 MB .zml.nulldb
null_reads.fasta 157 KB .fasta
reference_names 1.2 KB reference_names

Corresponding S3 URLs can be obtained by replacing the https://genome-idx.s3.amazonaws.com/ prefix with s3://genome-idx/.

Movi is the work of Mohsen Zakeri, Nathaniel Brown, Travis Gagie and Ben Langmead.