DNA Replication Process Simulator

This project is a simulation of the DNA replication process prior to mitosis or meiosis, implemented in Python. The program simulates the complete process of DNA replication, including the key stages of the work of enzymes such as helicase and DNA polymerase, the creation of the leading and lagging strand (using Okazaki fragments).

1. Description of the Process

The process of DNA replication involves duplicating a DNA molecule so that each daughter cell obtains a complete copy of the genetic material. In this simulation, DNA is modelled as a sequence of nucleotides, and the replication process is carried out, which includes the following steps:

Separation of the DNA strands by the enzyme helicase, which unwinds the DNA double helix.
Creation of the leading and lagging strands by the enzyme DNA polymerase:
- The leading strand is synthesised continuously.
- The lagging strand is synthesised into fragments, called Okazaki fragments, which are then joined together.
Replication check to ensure that both resulting strands match the original strands.

The programme provides a complete example of the replication process with specific details on how each step occurs.

2. Project Structure

The project is divided into several modules to organise the code in a way that complies with some of the SOLID principles such as Single Responsibility or the Open-Closed principle:

dna.py: Contains the DNA class, which models the structure of a DNA molecule, stores the nucleotide sequence and generates its complementary strand.
dna_utils.py: Provides DNA-related utilities, such as complementary strand generation and verification of replication accuracy.
helicase.py: Defines the Helicase class, which represents the behaviour of the helicase enzyme during DNA replication, and the HelicaseOrchestrator class, which organises the binding process and helicase activity.
polymerase.py: Defines the Polymerase class, which contains the methods for replicating the leading and lagging strands, and the PolymeraseOrchestrator class, which coordinates the entire replication process for both strands.
replication.py: Controls the execution of the complete replication process using the classes and functions defined in the previous modules.
replication.ipynb: A Jupyter Notebook that runs the complete simulation process, allowing file input and output, and displaying the replication flow.

The following simplified UML diagram explains the functioning of each of the classes described above.

3. Order of Code Execution

The DNA class takes a DNA sequence as input and automatically generates its complementary strand. It also contains functions to display the length of the sequence.
The Helicase class simulates the separation of DNA strands by helicases, which are assigned to different positions in the sequence. The HelicaseOrchestrator class organises these helicases and displays their positions on the DNA strand.
The Polymerase class contains the functions to replicate both the leading strand (continuously) and the lagging strand (using Okazaki fragments). The PolymeraseOrchestrator class coordinates the complete replication.
The Replication class controls the overall flow of the replication process. It uses helicases to separate the strands and then polymerases to replicate the leading and lagging strands.

4. Check Results

At the end of the process, the generated daughter strands are checked:

The first daughter strand is compared to the complementary strand of the original DNA.
The second daughter strand is compared to the original strand.

If both comparisons are correct, the replication process is successful.

5. How to use

To use this simulation, we have released an API that can either return previously replicated data (using chromosome IDs) or replicate new sequences via a POST method. To obtain a replicated sample:

curl -X GET http://localhost:5000/replica -d '{"id": "[chr. id]"}'

To replicate a new sequence, you can use the following command:

curl -X POST http://localhost:5000/dna -d '{"sequence": "[sequence]"}'

6. Credits

This study was carried out by Ricardo Cárdenes Pérez and Susana Suárez Mendoza as part of an internship for the Bioinformatics course taught in the Data Science and Engineering degree at the University of Las Palmas de Gran Canaria.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
__pycache__		__pycache__
data		data
images		images
replication		replication
README.md		README.md
animation.py		animation.py
data_loader.py		data_loader.py
replication.ipynb		replication.ipynb
repo.md		repo.md
web_service.py		web_service.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DNA Replication Process Simulator

1. Description of the Process

2. Project Structure

3. Order of Code Execution

4. Check Results

5. How to use

6. Credits

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

deepamino/dnaReplication

Folders and files

Latest commit

History

Repository files navigation

DNA Replication Process Simulator

1. Description of the Process

2. Project Structure

3. Order of Code Execution

4. Check Results

5. How to use

6. Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages