A k-mer is a contiguous sequence of k nucleotides (the building blocks of DNA) in a genome. Biologists often use k-mers to identify patterns or motifs in genomic sequences, such as repeated sequences or conserved regions. Let’s build an algorithm to do this.
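To make the idea concrete, here is a minimal sketch of k-mer counting in Python. The function name `count_kmers` and its parameters are assumptions made for illustration, not the code from the article itself. It slides a window of length k across the sequence one position at a time, so overlapping k-mers are counted too.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count every overlapping k-mer in a sequence (hypothetical helper)."""
    if k <= 0:
        raise ValueError("k must be a positive integer")
    sequence = sequence.upper()
    # The last valid window starts at index len(sequence) - k,
    # hence the "+ 1" in the range bound.
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


counts = count_kmers("ATATA", 2)
print(counts["AT"], counts["TA"])  # prints: 2 2
```

A sequence shorter than k simply produces an empty counter, because there is no room for even one window.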
Category: Genome Toolkit Series
![](https://rebelscience.club/wp-content/uploads/2022/11/image-3.jpg)
In the previous article (Part 2 here), we wrote our first Genome Toolkit algorithm. It was a very simple algorithm for finding repeating patterns (k-mers) in a DNA/genome sequence, and although it seemed to work correctly, it actually had a bug in it. Let’s take a look at what the bug is and how we can fix it.
![](https://rebelscience.club/wp-content/uploads/2022/09/image.jpg)
First function – counting patterns in a sequence.
![](https://rebelscience.club/wp-content/uploads/2021/02/2.png)
Welcome to a new series called “Genome Toolkit”. In this series, we will write a set of tools that will help us find patterns in, and build statistical data around, any DNA, RNA, or protein sequence.