Advancements in sequencing technology have made extensive collections of mutations and genomic information available 1,2,3,4. These datasets include millions of novel mutations that cannot all be ...