A k-mer is a contiguous sequence of k nucleotides (the building blocks of DNA) in a genome. Biologists often use k-mers to identify patterns or motifs in genomic sequences, such as repeated sequences or conserved regions. Let’s build an algorithm to do this.
data:image/s3,"s3://crabby-images/1dd13/1dd137ca0299121f5cb409ec21e2de4dfdfbdbec" alt=""
In this article, we are going to take a look at one of the algorithms we wrote in Genome Toolkit series, Part 2, and attempt to optimize it.
data:image/s3,"s3://crabby-images/e2365/e2365d0371cb482e66c189a3fc3fac36022b3cad" alt=""
In the previous article (Part 2 here), we wrote our first Genome Toolkit algorithm. Even though, it was a very simple algorithm to help us search for repeating patterns (k-mers) in a DNA/Genome sequences, and it seemed to worked correctly, we actually had a bug in it. Let’s take a look at what it is, and how we can fix it.
data:image/s3,"s3://crabby-images/c02bf/c02bf5f32477e0e3229c85c66d91c7f3285026a8" alt=""
Bioinformatics with Python Cookbook: Use modern Python libraries and applications to solve real-world computational biology problems, 3rd Edition.
data:image/s3,"s3://crabby-images/19b12/19b129ae1e5bf3b76f3a40d59877d62807c8e035" alt=""
First function – counting patterns in a sequence.
data:image/s3,"s3://crabby-images/078a0/078a078c5bfeb5ba740f389957a7caa571941362" alt=""
Welcome to the new series, called “Genome Toolkit”. In this series, we will write a set of tools, that will help us find and build statistical data around any DNA, RNA and Protein sequences.
data:image/s3,"s3://crabby-images/b125d/b125dd4d8b49b279cee0a177bac935c7ba73993d" alt=""
A guide and advice on how to get started, or how to transition into Bioinformatics for people with biology or programming backgrounds.
data:image/s3,"s3://crabby-images/4da8b/4da8b0499ee720136793be250e4c16070c0b1608" alt=""
Let’s look at how we can program Hamming Distance algorithm in three different ways.
data:image/s3,"s3://crabby-images/2a9f0/2a9f0822a55e6e0b2a26f5c37259d2c3ca4cffb7" alt=""
DNA Engine project structure and class setup.
data:image/s3,"s3://crabby-images/95297/952971067c5abc7b51dd632196a42147b3c06cc9" alt=""
Python dictionary, Rust HashMap and a DNA Reverse Complement function.