Catwalk: Identifying closely related sequences n lrg microbial sequence database
microbiologyresearch.org"compiled solution, coded in Nim to increase performance" ...
"Catwalk operates about 1700 times faster than, and uses about 8 % of the RAM of, a Python reference-based compression and comparison tool in current use for outbreak detection."
You/the abstract round down - deeper table in the paper says 1750X faster. :-) But really it's a big 1650X..2020X range for the 25th to 75th percentiles.