Difference between revisions of "Designer Genes"

From Wiki - Scioly.org
Jump to navigation Jump to search
m (Added link to protein modeling CRISPR page in the CRISPR section)
(48 intermediate revisions by 12 users not shown)
Line 1: Line 1:
{{Incomplete}}
 
 
{{EventLinksBox
 
{{EventLinksBox
|active=
+
    | active       = yes
|type=Life Science
+
    | type         = Life Science
|cat=Study
+
    | cat           = Study
|2013thread=[http://www.scioly.org/phpBB3/viewtopic.php?f=144&t=3703 2013]
+
    | 2013thread   = [http://www.scioly.org/phpBB3/viewtopic.php?f=144&t=3703 2013]
|2013tests=2013
+
    | testsArchive  = true
|2014thread=[http://www.scioly.org/phpBB3/viewtopic.php?f=167&t=4967 2014]
+
    | 2014thread   = [http://www.scioly.org/phpBB3/viewtopic.php?f=167&t=4967 2014]
|2014tests=2014
+
    | 2014questions = [http://www.scioly.org/phpBB3/viewtopic.php?f=173&t=5024 2014]
|2014questions=[http://www.scioly.org/phpBB3/viewtopic.php?f=173&t=5024 2014]
+
    | 2019thread    = [https://scioly.org/forums/viewtopic.php?f=285&t=12165 2019]
|B Champion=[[Daniel Wright Junior High School]]
+
    | 2019tests    = 2019
|C Champion=[[Munster High School]]
+
    | 2019questions = [https://scioly.org/forums/viewtopic.php?f=297&t=12397 2019]
 +
    | 2020thread    = [https://scioly.org/forums/viewtopic.php?f=285&t=15386 2020]
 +
    | 2020questions = [https://scioly.org/forums/viewtopic.php?f=297&t=15735 2020]
 +
    | 2021thread    = [https://scioly.org/forums/viewtopic.php?f=348&t=18281 2021]
 +
    | 1stCName      = Acton-Boxborough Regional High School
 +
    | 2ndCName      = Troy High School
 +
    | 3rdCName      = New Trier High School
 +
    | Website      = https://www.soinc.org/designer-genes-c
 
}}
 
}}
 +
'''Designer Genes''' is a [[Division C]] biology event for the [[2021]] season. It was previously an event for the [[2013]], [[2014]]. [[2019]], and [[2020]] seasons. The event covers topics relating to genetics, biotechnology, and the molecular biology of inheritance.
  
'''Designer Genes''' ([[Division C]]) and '''Heredity''' ([[Division B]]) are based on genetics and molecular biology (introns/exons, mitosis/meiosis, leading/lagging strand, etc).
+
Many topics listed in the table in section 3 of the rules consist of the material in the [[Division B]] event [[Heredity]] (although generally in less detail since it is only one part of the event). Several sections of this page link to the appropriate sections of the Heredity page.
  
You are allowed to bring a [[Notes|note]] sheet and 2 non-graphing calculators.
+
==Inheritance==
 +
''Refer to [[Heredity#Inheritance]]''
  
 
==DNA==  
 
==DNA==  
DNA is made up of three things: a phosphate group, a deoxyribose sugar, and heterocyclic rings of carbon and nitrogen (purines have two such rings; pyrimidines, one). The bases found in DNA are adenine and guanine (purines), thymine and cytosine (pyrimidines). The sugar and phosphate groups form the backbone or the sides of the double helix "ladder" and the nitrogenous bases stick out from the chain like "rungs" of the ladder.
+
''Refer to [[Heredity#DNA]]''
  
Watson, Crick, and Maurice Wilkins are credited with finding the structure of DNA. Rosalind Franklin, however, was the first to use X-ray diffraction to see the DNA helix. Watson and Crick used her work to discover the helical structure in 1953. See ''The Double Helix'' by James D. Watson for more information.
+
==RNA==
 +
''Refer to [[Heredity#RNA]]''
  
===Base Pairing===  
+
==Mitosis==
Adenine only bonds with thymine and guanine bonds only with cytosine. This is called base pairing. According to Chargaff's rules, an organism should have equal percentages of adenine and thymine and cytosine and guanine. One way to remember which base pairs with which is to remember the "curvy" letters go together. If Chargaff's rules do not hold, the organism's DNA may be single-stranded rather than double-stranded.
+
''Refer to [[Heredity#Mitosis]]''
  
In RNA, uracil replaces thymine and thus, uracil binds with adenine.
+
==Meiosis==
 +
''Refer to [[Heredity#Meiosis]]''
  
===DNA Replication===
+
==DNA Repair==
When a cell divides, it makes a duplicate of its DNA in the S-phase of the cell cycle so the daughter cells will have a complete set of chromosomes. This process is DNA replication, also called DNA synthesis. First, topisomerase unwinds the DNA strands, after which the enzyme helicase separates the strands by breaking the hydrogen bonds between nitrogenous bases. This area of separation is called a replication fork.  
+
Cells have three built-in mechanisms for repairing their DNA. DNA damage can be caused by normal internal factors, as well as environmental factors like radiation. DNA damage causes structural damage to the molecule and can affect a cell's ability to transcribe affected genes. Harmful mutations can also occur, and they can affect the survival of a cell and its daughter cells. DNA repair happens constantly throughout the body, responding to any damage to the DNA structure. When DNA repair is not performed and a cell does not undergo apoptosis (programmed cell death/cell suicide), then permanent damage can occur leading to malignant tumors or cancer.  
  
Before DNA Polymerase can enter the replication fork to make copies of the DNA strands, RNA Primase puts down a "primer" to attract RNA nucleotides, which form hydrogen bonds with DNA bases. The next step is elongation, which creates some difficulties because of how enzymes "read." DNA Polymerase can only copy from 5' to 3'; however, one DNA strand is 3' to 5'. This strand is called the lagging strand, as opposed to the 5'-3' leading strand. While DNA Polymerase can copy the leading strand without a problem, it can only replicate the lagging strand in spurts. The spaces between replicated portions of the lagging strand are called Okazaki fragments.  
+
===Direct reversal===
 +
Direct reversal repair occurs when consecutive pyrimidine bases become fused together when they are exposed to UV light, forming pyrimidine dimers. Photoreactivation directly reverses this process by using an enzyme called photolyase that reacts directly to exposure to blue/UV light. Photolyase no longer functions in humans, but can be found in bacteria, fungi and some animals. Humans use a process known as nucleotide excision repair to repair damage done by UV light.
  
Once replication is complete, an exonuclease (an enzyme that cleaves nitrogenous bonds) removes the RNA primer. Finally, ligase connects the strands with their complements by catalyzing the phosphodiester bonds with the 3' hydroxyl group and the 5' phosphate.
+
===Excision repair===
 +
When only one strand of a double helix is damaged, the other strand can be used to identify the missing or incorrect bases. Excision repair mechanisms remove the damaged nucleotide and replace it with the correct undamaged nucleotide.  
  
==RNA==
+
Base excision repair (BER) repairs damage to a single base using glycosylases. These enzymes remove the specific affected base, and DNA polymerase correctly synthesizes the new strand.
RNA (ribonucleic acid) is a single stranded nucleotide chain, not a double helix. While it does not share the same structure as DNA, it has many similar properties. RNA consists of a '''ribose''' sugar and hydroxyl group, as opposed to deoxyribose. RNA also consists of Adenine, Guanine and Cytosine, but Thymine is replaced with Uracil.
 
===Types===
 
There are three major types of RNA. While there are many other minor types, these three are heavily involved in translation.
 
* Messenger RNA (mRNA): Encodes the sequence of amino acids that becomes a protein.
 
* Transfer RNA (tRNA): Transports amino acids to ribosomes during translation. It contains about 80 RNA nucleotides, with an amino acid attached to the 3' end and an a complimentary anticodon attached to the 5' end.
 
* Ribosomal RNA (rRNA): Along with ribosomal proteins, rRNA makes up the ribosome which is the organelle that translates mRNA into proteins.
 
  
===Transcription===
+
Nucleotide excision repair (NER) is less specific, and is typically used in cases where a large portion of the helix is distorted. Damaged regions are removed in a three step process with the recognition of the damage, excision of the damaged area, and resynthesis of the removed region. NER occurs in almost every organism.  
Transcription is the process of transcribing DNA into mRNA so that it can be translated into proteins. It is also the first major step of gene expression. Transcription produces a complimentary sequence to the DNA; A bonds with T in DNA, U bonds with A, G bonds with C and C bonds with G. For example:
 
  
DNA: GCACGTGTAGCATAGTACTAG<br>
+
===Postreplication repair===
mRNA: CGUGCACAUCGUAUCAUGAUC
 
  
Transcription occurs in the nucleus during the G1 and G2 phases of the cell cycle. In eukaryotes, it occurs in three distinct stages.
+
Postreplication repair (also known as translesion synthesis) occurs when the replication process is allowed to replicate past DNA lesions. A gap is left at the damaged site when the Okazaki fragments are synthesized, filled in later by either recombination repair or error-prone repair. Recombination repair uses the sequence from a sister chromosome to repair the damaged DNA, and error-prone repair uses the damaged strand as a sequence template. Error-prone repair is typically inaccurate, and commonly results in mutations.
====Initiation====
 
#Activator proteins bind to distal control elements that are located before the DNA sequence known as a promoter. Promoters are located near the start sites of genes, and allow various proteins and enzymes (such as RNA polymerase II) to form an initiation complex that begins transcription.
 
#Proteins called transcription factors bind to a specific DNA sequence known as a promoter. At this point in the process, the DNA is still double stranded. RNA polymerase binds to the promoter region shortly after the transcription factors.  
 
#RNA polymerase unwinds approximately 14 base pairs to form an "open complex" that becomes the transcription bubble. As the RNA polymerase begins creating RNA, it enters the RNA exit channel and leaves behind the initial transcription factors.
 
====Elongation====
 
#RNA polymerase begins unwinding the double helix and exposes 10-20 nucleotides for transcription at a time. To do this, RNA polymerase uses free-floating RNA nucleotides in the nucleoplasm.
 
#RNA polymerase travels from the 3' → 5' direction on the template strand of DNA, producing a mRNA strand in the 5' → 3' direction. This process produces an RNA copy of the 5' → 3' strand of DNA.
 
#RNA transcription occurs very quickly, and can involve multiple RNA polymerase working on a single gene. The typical rate of elongation is 10-100 nucleotides/sec.
 
#Elongation also involves a proofreading mechanism that can replace incorrect nucleotides. Transcription pauses, allowing RNA editing factors to bind to the new strand of mRNA and edit base order.
 
 
 
====Termination====
 
#The RNA codes for the polyadenylantion (AAUAAA), and the proteins that have been associated with the RNA polymerase stop moving.
 
#RNA polymerase continues moving, adding hundreds of adenine nucleotides to the end of the mRNA strand. Spare RNA created like this may be used by enzymes.
 
#This termination factor releases the newly created mRNA, which leaves the nucleus and travels to the ribosome where it is translated into a protein.
 
 
 
===Interpreting Genetic Code===
 
A sequence of three mRNA nucleotides is called a codon. Each of these codons corresponds with a complimentary anticodon attached to a strand of tRNA. Different tRNA molecules are attached to different amino acids, meaning that each codon corresponds with one amino acid. Since there are only four different RNA nucleotides, there are 64 possible codons. However, there are only 20 standard amino acids. This means that multiple codons can code for the same amino acid.
 
 
 
A chain of amino acids is called a protein. They are responsible for most biological functions in the body such as DNA replication, transcription, transporting molecules, and regulation of gene expression. Proteins are very complex macromolecules and have four different levels of structure. The amino acid sequence created in translation is known as the primary structure.
 
 
 
It is possible to interpret a DNA sequence into an amino acid sequence by using a chart like the one shown below.
 
*Find the RNA nucleotides that would pair with the DNA nucleotides.
 
**DNA: TAC AGG TAG CTA GTT ATT
 
**RNA: AUG UCC AUC GAU CAA UAA
 
*Follow the sequence of nucleotides on the chart from the inside out. For example, the RNA sequence AUG is found at the beginning of every protein and codes for Methionine. In the center of the circle, start with the A quadrant, then follow the U quadrant and then the G quadrant.
 
**RNA: AUG UCC AUC GAU CAA UAA
 
**Amino Acids: Methionine Serine Isoleucine Aspartic Acid Glutamine Stop
 
 
 
====List of Amino Acids====
 
[[File:AminoWheel.jpg|right]]
 
{|class="wikitable" style="text-align: center; font-size: 100%; border: 1px solid #888;"
 
|-
 
! Name !! Abbreviation !! One Letter Code
 
|-
 
| Alanine
 
| Ala
 
| A
 
|-
 
| Arginine
 
| Arg
 
| R
 
|-
 
| Asparagine
 
| Asn
 
| N
 
|-
 
| Aspartic Acid
 
| Asp
 
| D
 
|-
 
| Cysteine
 
| Cys
 
| C
 
|-
 
| Glutamine
 
| Gln
 
| Q
 
|-
 
| Glutamic Acid
 
| Glu
 
| E
 
|-
 
| Glycine
 
| Gly
 
| G
 
|-
 
| Histidine
 
| His
 
| H
 
|-
 
| Isoleucine
 
| Ile
 
| I
 
|-
 
| Leucine
 
| Leu
 
| L
 
|-
 
| Lysine
 
| Lys
 
| K
 
|-
 
| Methionine (Start)
 
| Met
 
| M
 
|-
 
| Phenylalanine
 
| Phe
 
| F
 
|-
 
| Proline
 
| Pro
 
| P
 
|-
 
| Serine
 
| Ser
 
| S
 
|-
 
| Threonine
 
| Thr
 
| T
 
|-
 
| Tryptophan
 
| Trp
 
| W
 
|-
 
| Tyrosine
 
| Tyr
 
| Y
 
|-
 
| Valine
 
| Val
 
| V
 
|}
 
===Translation===
 
Translation is the process of translating the mRNA created during transcription into a protein. These proteins are responsible for different genetic traits such as hair/eye color, blood type, or hereditary conditions such as color blindness. It takes place in the ribosome, an organelle with three chambers and two subunits that consists of rRNA and other proteins. The three chambers are the A site (Aminoacyl-tRNA binding site), the P site (Peptidyl-tRNA binding site) and the E site (Exit site). All of these chambers are located in the large subunit. Like transcription, it occurs in three steps.
 
====Initiation====
 
#The small subunit attaches to the mRNA, holding it in place throughout translation.
 
#The Methionine tRNA bonds to the start codon AUG.
 
#The large subunit arrives and completes the translation initiation complex.
 
====Elongation====
 
#Amino acids are brought to the ribosome by tRNA molecules and are added to the polypeptide chain one by one.
 
#The anticodon on a tRNA molecule binds to the mRNA codon at the A site.
 
#An rRNA molecule in the large subunit catalyzes the formation of a peptide bond between the amino acid on the tRNA and the polypeptide chain.
 
#The ribosome moves the mRNA to from the P site to the E site, where the tRNA is released.
 
====Termination====
 
#The stop codon on the mRNA reaches the A site.
 
#Release factors bind to the stop codon at the A site.
 
#A water molecule is added to the end of the polypeptide instead of an amino acid, and hydrolysis releases the chain so it can be folded into its final structure.
 
  
 
==Gene Expression==
 
==Gene Expression==
===Epigenetics===
+
Several different factors interact with RNA transcription and translation to control gene expression.
Epigenetics is the study of changes in organisms not caused by the alteration of genetic code. Epigenetics revolves around gene expression, not the DNA itself. It affects how genes are read by cells, and how they produce proteins. Think of the human genome as a filing cabinet, and the genes as folders that contain the instructions to make a protein. Certain folders might be marked as important, or others could be marked as less important. These epigenetic marks control the expression of genes. It's the reason that even though every cell in your body has the same DNA at its core, different cells have different functions. A liver cell would open different folders in the filing cabinet than a brain cell would, because it would need to make different proteins.
 
 
 
Epigenetic marks take the form of molecular tags that are placed in different places on the histone, and each one has a different effect. They can make DNA more accessible to proteins, or purposefully make it less accessible so that a specific gene isn't transcribed or translated. Some epigenetic marks are very long and cover large stretches of DNA, or others are gathered at the start of genes. Epigenetic marks can also change over time. These changes can be caused by anything from chemical additives in plastics to DNA errors during replication.
 
 
 
Some epigenetic marks can also be inherited through generations. This is how environmental factors are passed down through generations. Addictive behavior is inherited in this way, and the effects nutrient deprivation can be passed down in this way too. However, passing down epigenetic tags is different than passing down genes. Reproductive cells undergo a process called reprogramming, and this process is supposed to erase all epigenetic tags. However, on some genes it fails and leaves these tags in place to be passed down to another generation. In mammals, about 1% of genes escape epigenetic reprogramming.
 
  
 
===Transcriptional===
 
===Transcriptional===
Line 197: Line 67:
 
The process for alternative splicing is similar to the process for regular RNA splicing. However, the key difference is that alternative splicing produces '''different RNA from the same primary transcript'''. Exons are mixed and matched to create different proteins from the same length of mRNA. This process is also called exon shuffling, and is the reason why humans produce so many proteins despite having a limited number of genes.  
 
The process for alternative splicing is similar to the process for regular RNA splicing. However, the key difference is that alternative splicing produces '''different RNA from the same primary transcript'''. Exons are mixed and matched to create different proteins from the same length of mRNA. This process is also called exon shuffling, and is the reason why humans produce so many proteins despite having a limited number of genes.  
 
====microRNA====
 
====microRNA====
 +
MicroRNA (abbreviated to miRNA) is a small sequence of RNA that regulates gene expression. It is typically about 22 nucleotides in length. miRNA works by bonding with complementary sequences in mRNA, which destabilizes the mRNA strand by separating it into two pieces or slowing down translation into proteins. miRNA is involved in a variety of biological functions, including cell cycle control, apoptosis and developmental processes like aging and immune responses. miRNA has also been implicated in various diseases including cancer and certain types of heart and neurological diseases. One miRNA can target multiple genes, regulating the expression of multiple proteins.
  
 
===Translational===
 
===Translational===
===Post-translational===
+
Gene expression can also be regulated or modified during or after RNA translation.
The way a protein functions hinges on the way it is folded. Hydrogen bonds form between the nucleotides, which produce the tertiary structure of the protein. Chaperonins assist the folding of proteins, and ensure that it does not fold improperly. A protein that is folded improperly and not destroyed can cause numerous diseases such as Alzheimer's disease, cystic fibrosis, and cancer. Enzymes can also process the polypeptide once it is folded by removing residues or amino acids.
 
 
 
Carbohydrates, lipids and phosphate groups can also be attached to the polypeptides. The attachment of carbohydrates is known as glycosylation, and often promotes protein folding and stability in proteins. Lipidation often occurs in proteins that are going to be attached to the cell membrane. The most common type of post-translational modification is known as phosphorylation and typically regulates the activity of enzymes.
 
  
 
===''Lac'' and ''Trp'' Operons===
 
===''Lac'' and ''Trp'' Operons===
Lac and Trp Operons are examples in prokaryotic gene regulation. Most prokaryotic genes such as in E.coli are always turned "on", but others are active only when products are needed by the cell, so their expression must be regulated.   
+
Lac and Trp Operons are examples in prokaryotic gene regulation. Most prokaryotic genes such as in ''E. coli'' are always turned "on", but others are active only when products are needed by the cell. As such, their expression must be regulated.   
  
 
An operon is a group of genes transcribed together by a single promoter. The ''lac'' operon was the first to be discovered. In the model bacterium ''E. coli'', this operon is transcribed in the presence of lactose to give the bacterium the ability to digest this source of energy. It has three parts: lacA, lacY, and lacZ, as well as a promoter, a regulator, a terminator, and an operator. To activate lactose digestion abilities, an isomer of lactose (allolactose) binds to the gene's repressor, allowing the operon to be transcribed
 
An operon is a group of genes transcribed together by a single promoter. The ''lac'' operon was the first to be discovered. In the model bacterium ''E. coli'', this operon is transcribed in the presence of lactose to give the bacterium the ability to digest this source of energy. It has three parts: lacA, lacY, and lacZ, as well as a promoter, a regulator, a terminator, and an operator. To activate lactose digestion abilities, an isomer of lactose (allolactose) binds to the gene's repressor, allowing the operon to be transcribed
Line 211: Line 79:
 
Whereas the ''lac'' operon gives ''E. coli'' the ability to digest lactose, the ''trp'' operon shuts off the bacterium's capability to metabolize tryptophan. As such, it is an example of a repressible operon. In the presence of lactose, its five structural genes (trpA, trpB, trpC, trpD, and trpE), which code for tryptophan synthase, will be repressed so ''E. coli'' can metabolize lactose instead. Lac operons are inductible operons due to the fact that genes are expressed in the presence of a substance (lactose).
 
Whereas the ''lac'' operon gives ''E. coli'' the ability to digest lactose, the ''trp'' operon shuts off the bacterium's capability to metabolize tryptophan. As such, it is an example of a repressible operon. In the presence of lactose, its five structural genes (trpA, trpB, trpC, trpD, and trpE), which code for tryptophan synthase, will be repressed so ''E. coli'' can metabolize lactose instead. Lac operons are inductible operons due to the fact that genes are expressed in the presence of a substance (lactose).
  
==Punnett squares==
+
===Post-translational===
===Single-Factor Crosses (Monohybrid)===
+
The way a protein functions hinges on the way it is folded. Hydrogen bonds form between the nucleotides, which produce the tertiary structure of the protein. Chaperonins assist the folding of proteins, and ensure that it does not fold improperly. A protein that is folded improperly and not destroyed can cause numerous diseases such as Alzheimer's disease, cystic fibrosis, and cancer. Enzymes can also process the polypeptide once it is folded by removing residues or amino acids.  
[[Image:punnett.jpg|frame|right|2 Punnet squares.]]
 
  
The images to the right are examples of Punnett squares, named after the geneticist Reginald C. Punnett. Punnett squares show the cross between alleles and the genotype of the resulting offspring. Since both of the Punnet squares in the diagram only cross one trait (one pair of alleles), it is called a '''monohybrid''' or '''single-factor cross.''' Likewise, when two traits (two pairs of alleles) are crossed, it is called a '''dihybrid''' or '''two-factor cross.'''
+
Carbohydrates, lipids and phosphate groups can also be attached to the polypeptides. The attachment of carbohydrates is known as glycosylation, and often promotes protein folding and stability in proteins. Lipidation often occurs in proteins that are going to be attached to the cell membrane. The most common type of post-translational modification is known as phosphorylation and typically regulates the activity of enzymes.
  
The first Punnett square shows a cross between two heterozygous plants. The second Punnett square shows a cross between a homozygous tall plant and a homozygous short plant. The letters inside the boxes represent the genotype of each offspring. For example, in the first square, the genotypes of the offspring will be TT, Tt, and tt (2 of the 4 offspring will have the same genotype-''Tt'').
+
===Epigenetics===
 +
Epigenetics is the study of changes in organisms not caused by the alteration of genetic code. Epigenetics revolves around gene expression, not the DNA itself. It affects how genes are read by cells, and how they produce proteins. Think of the human genome as a filing cabinet, and the genes as folders that contain the instructions to make a protein. Certain folders might be marked as important, or others could be marked as less important. These epigenetic marks control the expression of genes. It is the reason that even though every cell in the body has the same DNA at its core, different cells have different functions. A liver cell would open different folders in the filing cabinet than a brain cell would, because it would need to make different proteins.
  
It is helpful to memorize the genotypic and phenotypic ratios of a heterozygous monohybrid cross. If two heterozygotes are crossed (like the first Punnet Square in the image to the right) then the genotypic ratio will always be:
+
Epigenetic marks take the form of molecular tags that are placed in different places on the histone, and each one has a different effect. They can make DNA more accessible to proteins, or purposefully make it less accessible so that a specific gene is not transcribed or translated. Some epigenetic marks are very long and cover large stretches of DNA, or others are gathered at the start of genes. Epigenetic marks can also change over time. These changes can be caused by anything from chemical additives in plastics to DNA errors during replication.
  
''1 D/D: 2 H: 1 R/R''
+
Some epigenetic marks can also be inherited through generations. This is how environmental factors are passed down through generations. Addictive behavior is inherited in this way, and the effects nutrient deprivation can be passed down in this way too. However, passing down epigenetic tags is different than passing down genes. Reproductive cells undergo a process called reprogramming, and this process is supposed to erase all epigenetic tags. However, on some genes it fails and leaves these tags in place to be passed down to another generation. In mammals, about 1% of genes escape epigenetic reprogramming.
  
and the phenotypic ratio will be:
+
===Epistasis===
 +
Epistasis is yet another form of gene expression that describes the relationship between multiple genes. Epistasis typically features one allele masking the phenotype of another separate gene. This is different from a dominant/recessive relationship because those alleles are different types of the same gene. For example, in humans the gene that codes for albinism is separate from the gene that codes for skin tone. If a human has the gene responsible for albinism then their skin tone is "masked" and not displayed.
  
''3 D: 1 R''
+
==Phylogenetics==
 +
Phylogenetics is the study of evolutionary relationships. A phylogenetic tree displays these relationships based upon their similarities and differences. Rooted trees have a common ancestor, and in some cases the length of a line can indicate time estimates. Unrooted trees only show the relationship between a couple of organisms and do not require an ancestral root. Phylogenetic trees are based on speculation and do not show exact evolutionary history, but they can still display how animals could have possibly evolved.
  
where ''D=homozygous dominant, R=homozygous recessive, and H=heterozygous.''
+
==Hardy-Weinberg Equilibrium==
 
+
The Hardy-Weinberg equilibrium is a common population model used in genetics.
Memorizing other simple crosses (such as a single-factor ''homozygous dominant x homozygous recessive'' cross) is useful and saves time on tests. Here are some simple monohybrid crosses with their respective genotypic and phenotypic ratios.
 
 
 
<li>'''AA x aa''' ''(Homozygous dominant x Homozygous recessive)''<br />
 
Genotypic ratio: 0 D/D: 4 H: 0 R/R<br />
 
Phenotypic ratio: 4 D: 0 R<br />
 
 
 
<li>'''AA x Aa''' ''(Homozygous dominant x Heterozygous)''<br />
 
Genotypic ratio: 2 D/D: 2 H: 0 R/R<br />
 
Phenotypic ratio: 4 D: 0 R<br />
 
 
 
<li>'''Aa x aa''' ''(Heterozygous x homozygous recessive)''<br />
 
Genotypic ratio: 0 D/D: 2 H: 2 R/R<br />
 
Phenotypic ratio: 2 D: 2 R<br />
 
 
 
Some important Punnett Square terms are defined below. On tests, be extra careful when you spot these terms as they are easily confused with each other.
 
 
 
:;Genotype: The different combinations of the alleles.
 
:;Phenotype: The physical appearance of the offspring.
 
;;Genotypic ratio: The ratio of the combination of alleles.
 
:;Phenotypic ratio: The ratio of the physical appearance.
 
<br clear="all"/>
 
 
 
===Two-Factor Crosses (Dihybrid)===  
 
 
 
''Two factor crosses,'' or ''dihybrid crosses,'' are similar to single-factor crosses except that in a two-factor cross, two traits are crossed rather than one trait in a single-factor cross. An example of a two-factor cross is pictured to the left.
 
 
 
[[Image:2xNEW.jpg|left|border]]
 
 
 
Here, two heterozygotes are crossed (RrYy x RrYy). The "R" allele represents the shape of the seed and the "Y" allele represents the color. It is important to note the genotypic and phenotypic ratios for a heterozygous dihybrid cross. Regardless of the alleles, if two dihybrid heterzygotes are crossed, then the resulting phenotypic ratio will be:
 
 
 
''9 D/D: 3 D/R: 3 R/D: 1 R/R (D = dominant, R = recessive).''
 
 
 
and the genotypic ratio will be:
 
 
 
''1 D/D: 2 D/H: 1 D/R: 4 H/H: 4 H/D: 1 R/D: 2 R/H: 1 R/R (H = heterozygous).''
 
 
 
So, the phenotypic ratio for the pictured dihybrid cross is:
 
 
 
9 round/yellow:3 round/green: 3 wrinkled/yellow: 1 wrinkled/green.
 
 
 
===Three-Factor Crosses (Trihybrid)===
 
[[Image:trihybrid.jpg|right|border]]<br />
 
 
 
Like single- and double-factor crosses, ''three-factor crosses'' (trihybrid) show three different traits that are crossed (see the image to the right for an example). Trihybrid crosses are rarely seen on tests, so don't spend too much time practicing them until the later stages of competition.
 
<br clear="all"/>
 
 
 
===Special Punnet Squares===
 
====Incomplete dominance====
 
 
 
In some unusual cases such as ''4 o'clock flowers'', gene pairs for a given trait fail to establish dominance and the heterozygous condition is expressed as an intermediate between the two alleles. Often, to draw attention to this situation, the letter 'I' is used to designate the gene allele.<br clear="all"/>
 
 
 
'''Example:''' In 4 o'clock flowers, the genotype RR (homozygous dominant) appears red, rr (homozygous recessive)appears white, and Rr (heterozygous)appears pink. In all cases of incomplete dominance, the number of genotypes equals the number of phenotypes.<br clear="all"/>
 
[[File:IncDom.png|thumb|center|700px|Incomplete dominance in the F1 generation. When the F1's are selfed, all 3 phenotypes will be expressed.]]
 
 
 
====Epistasis====
 
 
 
Epistatis is where one set of genes stops or inhibits the action of another genes. Epistasis genes can either be recessive or dominant. The gene for no pigment (p) in the skin(albinism) is recessive to normal pigmentation(P). For any pigment to appear at all, at least one gene for enzyme S must also be present. That's like even if there is a pigment, but enzyme S is not present, the person is albino. PpSs? is normal, PPss? is albino, ppSS is albino, and so on. To not be albino, there needs to be at least one P and one S.
 
 
 
===Sex-linked traits===
 
 
 
Sex-linked traits are features that are associated with the genes on the sex chromosomes, usually X. Examples of those are recessive genes for color-blindness and hemophilia.
 
 
 
===Sex-influenced traits===
 
 
 
Sex influenced traits are traits that show up more in one sex than they do in the other as a definite phenotype. Usually influenced more by hormones in the male or female.
 
 
 
===Multiple genes===
 
 
 
Most phenotypic features are controlled by more than one set of non-allelic genes acting on them, such as height, skin color, intelligence, and hair and eye color. Usually this type of problem is seen as a typical two or three, etc factor cross with the more dominants, the more expression of the trait in question.
 
 
 
===Multiple alleles===
 
 
 
There may be more than the usual two alleles for any given gene. Especially, this appears in fur or pelt conditions of domestic animals. The problem usually uses 'I' (for incomplete dominance) and some prearranged superscript. The most common example found on tests is the ABO blood type system found in humans.
 
 
 
===Lethal alleles===
 
While lethal alleles do not affect the way you set up your Punnett square, they can appear to alter Mendelian ratios. A lethal genotype is one that causes death before the individual can reproduce and pass their genes on to the next generation. As such, they remove an expected progeny class after a specific cross. For example, in Mexican hairless dogs, the genotype hh means that te dog is hairy, Hh means that the dog is hairless, but HH means that they die as embryos--thus the term "lethal".  
 
  
==Hardy-Weinberg Equilibrium==
 
 
===Conditions===
 
===Conditions===
The ''Hardy-Weinberg Law'' states that a population will maintain the exact allele and genotype frequencies over each generation unless five specific influences are introduced into the population. These are:
+
The Hardy-Weinberg Law states that a population will maintain the exact allele and genotype frequencies over each generation unless five specific influences are introduced into the population. For a population to be in Hardy-Weinberg equilibrium, it must meet all of the 5 conditions listed below:
  
#Mutations
+
#'''No mutations:''' Mutations introduce new alleles into the population.
#Gene flow (migration in/out of the population)
+
#'''No gene flow:''' Like mutations, immigration or emigration can introduce new alleles (or bolster/diminish existing alleles)
#Small population
+
#'''Very large population:''' Genetic drift is likely to occur in a smaller population. Hardy-Weinberg equilibrium can only occur in a population approaching infinity.
#Natural selection
+
#'''No natural selection:''' If some traits are discriminated for/against by environmental conditions, the genotype frequencies will not be in equilibrium over the generations.
#Non-random mating
+
#'''Random mating:''' Like natural selection, sexual selection involved in non-random mating could discriminate for/against traits.
  
For a population to be in Hardy-Weinberg equilibrium, it must '''not''' have any of the 5 conditions listed above. Here are the explanations for each condition:
+
An example of Hardy Weinberg: Consider a world where everyone has either purple or blue skin. "S" is purple skin, and "s' is blue skin. The probability of either one of these traits occurring is constant, and both that and the allele freqeuncies have to add to 1. The probabilities of the alleles are represented as [math]p[/math] (for S) and [math]q[/math] (for s). Therefore, the probability of being homozygous purple (SS) would be [math]p*p[/math] or [math]p^2[/math], the probability of being heterozygous (Ss) would be [math]p*q + q*p[/math] or [math]2pq[/math], and the probability of being homozygous blue (ss) would be [math]q*q[/math] or [math]q^2[/math]. These probabilities form the two equations used in the Hardy-Weinberg equilibrium.
  
#'''Mutations:''' Mutations introduce new alleles into the population.
+
===Equations===
#'''Gene flow:''' Like mutations, migration can introduce new alleles (or diminish another allele)
 
#'''Small population:''' Genetic drift is likely to occur in a smaller population.
 
#'''Natural selection:''' If some traits are discriminated for/against, the genotype frequencies will not be in equilibrium over the generations.
 
#'''Non-random mating:''' Like natural selection, non-random mating could discriminate for/against traits.
 
 
 
An example of Hardy Weinberg: Let's say we were in a world where everyone had either purple or blue skin. S is purple skin, and s is blue skin. The probability of either one of these genes occurring is constant, and both probabilities have to add to 1. Given the probabilities of both blue and purple skin, lets say p for purple and b for blue, the
 
probability of having two purple skin alleles (SS) would be pp, and having a blue and a purple (Ss) would be pb, and so on.
 
 
 
===Equation===
 
  
 
There are two equations used in the Hardy-Weinberg Law:
 
There are two equations used in the Hardy-Weinberg Law:
Line 336: Line 120:
 
where
 
where
  
[math]p[/math] is the frequency of the (homozygous) dominant '''allele''' in the population <br clear="all"/>
+
[math]p[/math] is the frequency of the (homozygous) dominant ''allele'' in the population, as a percentage {{clear}}
[math]q[/math] is the frequency of the (homozygous) recessive '''allele''' in the population<br clear="all"/>
+
[math]q[/math] is the frequency of the (homozygous) recessive ''allele'' in the population, as a percentage {{clear}}
[math]p^2[/math] is the '''percentage''' of the homozygous dominant individuals<br clear="all"/>
+
[math]p^2[/math] is the ''percentage'' of the homozygous dominant individuals{{clear}}
[math]2pq[/math] is the '''percentage''' of the heterozygous individuals<br clear="all"/>
+
[math]2pq[/math] is the ''percentage'' of the heterozygous individuals{{clear}}
and<br clear="all"/>
+
[math]q^2[/math] is the ''percentage'' of the homozygous recessive individuals.{{clear}}
[math]q^2[/math] is the '''percentage''' of the homozygous recessive individuals.<br clear="all"/>
 
  
Remember, the equations only apply if the population is in Hardy-Weinberg equilibrium.
+
These equations ''only'' apply if the population is in Hardy-Weinberg equilibrium.
  
 
===Solving a Hardy-Weinberg Problem===
 
===Solving a Hardy-Weinberg Problem===
 
A typical Hardy-Weinberg problem will resemble the sample problem below:
 
A typical Hardy-Weinberg problem will resemble the sample problem below:
  
In a certain population, the percentage of the homozygous recessive genotype (aa) is 36%. Using only that information, find:
+
{|class="wikitable"
 +
|-
 +
| In a certain population, the percentage of the homozygous recessive genotype (aa) is 36%. Using only that information, find:
  
#The frequency of the recessive genotype.
+
# The frequency of the recessive genotype.
#The frequency of the recessive allele.
+
# The frequency of the recessive allele.
#The frequency of the dominant allele.
+
# The frequency of the dominant allele.
#The percent of the heterozygous individuals.
+
# The percent of the heterozygous individuals.
 +
|}
  
'''IMPORTANT:''' Before attempting to solve the problem, it is critical to analyze all of the given information and approach it in the correct manner. ''Make sure to check your work after finishing!'' One mistake will throw off the entire problem. When solving a problem, make sure to work in the order as follows:
+
'''IMPORTANT:''' Before attempting to solve the problem, it is critical to analyze all of the given information and approach it in the correct manner. ''Make sure to check the math after finishing!'' One mistake will throw off the entire problem. When solving a problem, make sure to work in the order as follows:
  
''Step 1: Determine [math]q[/math].'' Since a dominant phenotype can have either a homozygous or heterozygous genotype, it is easier to find the recessive allele first (unless an exact homozygous/heterozygous dominant value is given).<br/>
+
:''Step 1: Determine [math]q[/math].'' Since a dominant phenotype can have either a homozygous or heterozygous genotype, it is easier to find the recessive allele first (unless an exact homozygous/heterozygous dominant value is given).<br/>
''Step 2: Determine [math]p[/math].'' Using the second equation, [math]p[/math] can be found once [math]q[/math] has been determined.<br/>
+
:''Step 2: Determine [math]p[/math].'' Using the second equation, [math]p[/math] can be found once [math]q[/math] has been determined.<br/>
''Step 3: Determine [math]p^2[/math] and [math]q^2[/math].'' Steps 3 and 4 are interchangeable, but finding [math]p^2[/math] and [math]q^2[/math] first is generally the common practice.<br/>
+
:''Step 3: Determine [math]p^2[/math] and [math]q^2[/math].'' Steps 3 and 4 are interchangeable, but finding [math]p^2[/math] and [math]q^2[/math] first is generally the common practice.<br/>
''Step 4: Determine [math]2pq[/math].''<br/>
+
:''Step 4: Determine [math]2pq[/math].''<br/>
  
 
The answers and work (using the four steps) for the sample problem are shown below:
 
The answers and work (using the four steps) for the sample problem are shown below:
  
'''Step 1: Determine [math]q[/math].''' Since '''aa''', or [math]q^2[/math] is 36%, then '''a''' (the frequency of the recessive allele-this is '''q''' in Hardy-Weinberg terms) must be 60%, or 0.6. <br/>
+
:''Step 1: Determine [math]q[/math].'' Since '''aa''', or [math]q^2[/math] is 36%, then '''a''' (the frequency of the recessive allele-this is '''q''' in Hardy-Weinberg terms) must be 60%, or 0.6. <br/>
'''Step 2: Determine [math]p[/math].''' Using the second equation, [math] p + q = 1[/math]. Therefore, [math]p[/math], or '''A''' must be 0.4. (40%)<br/>
+
:''Step 2: Determine [math]p[/math].'' Using the second equation, [math] p + q = 1[/math]. Therefore, [math]p[/math], or '''A''' must be 0.4. (40%)<br/>
'''Step 3: Determine [math]p^2[/math] and [math]q^2[/math].''' Now that [math]p[/math] and [math]q[/math] ('''A''' and '''a''' respectively) are both known, [math]p^2[/math] and [math]q^2[/math] can be found by squaring each term. In this case, [math]p^2 = .16[/math] and [math]q^2 = .36[/math] (16% and 36% respectively).<br/>
+
:''Step 3: Determine [math]p^2[/math] and [math]q^2[/math].'' Now that [math]p[/math] and [math]q[/math] ('''A''' and '''a''' respectively) are both known, [math]p^2[/math] and [math]q^2[/math] can be found by squaring each term. In this case, [math]p^2 = .16[/math] and [math]q^2 = .36[/math] (16% and 36% respectively).<br/>
'''Step 4: Determine [math]2pq[/math].''' This can be done two ways. Rearranging the first equation, [math]2pq = 1 - p^2 - q^2[/math], so [math]2pq = .48[/math] (48%). Additionally, [math]2pq[/math] can be found by multiplying [math]p[/math] and [math]q[/math] together, then multiplying that by [math]2[/math].<br/>
+
:''Step 4: Determine [math]2pq[/math].'' This can be done two ways. Rearranging the first equation, [math]2pq = 1 - p^2 - q^2[/math], so [math]2pq = .48[/math] (48%). Additionally, [math]2pq[/math] can be found by multiplying [math]p[/math] and [math]q[/math] together, then multiplying that by [math]2[/math].<br/>
  
 
So, the answers to the sample questions are:
 
So, the answers to the sample questions are:
Line 388: Line 174:
 
Next Generation Sequencing (NGS), or high-throughput sequencing, is a name that describes several different ways to sequence DNA. It is faster and cheaper than Sanger sequencing since many sequencing reactions can take place at once, it is very low-cost, and the reactions are much smaller.
 
Next Generation Sequencing (NGS), or high-throughput sequencing, is a name that describes several different ways to sequence DNA. It is faster and cheaper than Sanger sequencing since many sequencing reactions can take place at once, it is very low-cost, and the reactions are much smaller.
  
===Microarray===
+
====RNA-Seq and Tn-Seq====
A microarray consists of a small solid surface with various known single-stranded segments of DNA attached. It is primarily used for testing unknown DNA sequences - the level of binding of an unknown sequence to one of the microarray segments (known as probes) indicates whether the unknown strand is complementary to a particular known strand.
+
RNA sequencing (also known as WTSS) is the use of Next Generation Sequencing to reveal how much RNA is in a sample at a given moment and is replacing microarrays in many labs. RNA-Seq sequences the mRNA and can be used to analyze gene expression, typically in different conditions (such as with drugs and without drugs). It can also find variations in RNA and detect post-transcriptional alterations, whereas microarrays can only determine gene expression.
 +
 
 +
Tn-Seq (transposon sequencing) determines genetic interactions and can determine the frequency of mutations. However, it is limited to bacterial studies.
 +
 
 +
===Microarrays===
 +
A microarray consists of a small solid surface with various known single-stranded segments of DNA attached. It is primarily used for testing unknown DNA sequences - the level of binding of an unknown sequence to one of the microarray segments (known as probes) indicates whether the unknown strand is complementary to a particular known strand. DNA microarrays are also used to measure the expression levels of a large amount of genes simultaneously.
  
 
===RFLP Analysis===
 
===RFLP Analysis===
DNA sample is broken into pieces (and digested) by restriction enzymes and the resulting restriction fragments are separated according to their lengths by gel electrophoresis. Though now largely obsolete due to the rise of inexpensive DNA sequencing technologies, RFLP analysis was previously used for DNA profiling.
+
In RFLP analysis (also known as restriction enzyme analysis), a DNA sample is broken into pieces and digested by restriction enzymes and the resulting restriction fragments are separated according to their lengths by gel electrophoresis. Though now largely obsolete due to the rise of inexpensive DNA sequencing technologies, RFLP analysis was previously used for DNA profiling (also known as DNA fingerprinting).
 +
 
 
===Molecular Cloning===
 
===Molecular Cloning===
 
Molecular cloning is the process of inserting recombinant DNA into various host organisms - for example, certain types of bacteria - and replicating them. It is most often used to manufacture large quantities of desirable proteins. For example, synthetic insulin is primarily produced using recombinant DNA inside bacteria such as ''E. coli''.
 
Molecular cloning is the process of inserting recombinant DNA into various host organisms - for example, certain types of bacteria - and replicating them. It is most often used to manufacture large quantities of desirable proteins. For example, synthetic insulin is primarily produced using recombinant DNA inside bacteria such as ''E. coli''.
  
 
===Polymerase Chain Reaction===  
 
===Polymerase Chain Reaction===  
Polymerase Chain Reaction, abbreviated as PCR, is a method of quickly making billions of copies of a desired section of DNA. For a virtual lab that clearly explains the process, visit http://learn.genetics.utah.edu/content/labs/pcr/.
+
Polymerase Chain Reaction, abbreviated as PCR, is a method of quickly making billions of copies of a desired section of DNA. For a virtual lab that clearly explains the process, visit [http://learn.genetics.utah.edu/content/labs/pcr/ this website].
  
 
The Polymerase Chain Reaction is another way of creating large numbers of a specific piece of DNA, other than cloning DNA.  
 
The Polymerase Chain Reaction is another way of creating large numbers of a specific piece of DNA, other than cloning DNA.  
Line 410: Line 202:
  
 
===Gel Electrophoresis===
 
===Gel Electrophoresis===
Gel Electrophoresis is one of the most useful techniques to study macromolecules, especially proteins or nucleic acids. In gel electrophoresis, charged molecules are pulled through a gel (usually purified agar known as agarose) and this separates the molecules. Larger molecules move more slowly through the gel since they get caught in the gel matrix, and molecules with greater charges move faster since the electric field is what's pulling the molecules.  
+
Gel Electrophoresis is one of the most useful techniques to study macromolecules, especially proteins or nucleic acids. In gel electrophoresis, charged molecules are pulled through a gel (usually purified agar known as agarose) and this separates the molecules. Larger molecules move more slowly through the gel since they get caught in the gel matrix, and molecules with greater charges move faster since the electric field is what is pulling the molecules.  
  
 
Molecules will have a negative charge, and thus will move towards the positive poles.
 
Molecules will have a negative charge, and thus will move towards the positive poles.
  
 
===Blotting===
 
===Blotting===
Blotting is a method used for isolating some certain molecule from a sample. In the case of DNA, it is first cut by restriction enzymes and sorted by size with gel electrophoresis. A blotting membrane is placed over the gel, and a paper towel is used to absorb buffer through the membrane. The buffer moves through the membrane and flows upward, leaving the DNA behind on the other side of the membrane.
+
 
 +
Blotting is a method for identifying specific biomolecules (DNA, RNA, proteins) in a sample mixture. If you wanted to determine if a particular protein is present in a tissue sample, you could determine it by blotting.
 +
 
 +
First, a sample is processed. This step usually involves '''extracting''' all of a particular type of molecule (e.g. protein extraction, DNA extraction), producing an extract. Additional steps may be performed such as reducing steps (to break apart disulfide bonds in proteins), denaturing steps (e.g. adding detergent to denature proteins as in SDS-PAGE), or other digestion steps (such as adding restriction enzymes to a nucleic acid sample).
 +
 
 +
Then, the extract is run by '''gel electrophoresis''' to separate the biomolecules in a gel by size and/or charge. The contents of the gel are then transferred to a membrane sheet (usually nitrocellulose or PVDF) by applying a current. The negatively-charged biomolecules will move out of the gel and be trapped in the membrane.
 +
 
 +
Finally, the biomolecule of interest is '''detected''' using a probe that can be visualized. Probes are biomolecules with specificity for the target protein that are conjugated (bonded) to a detectable marker or enzyme. Usually, if you want to detect a protein, your probe will also be a protein (an antibody); if you want to detect a specific DNA sequence, your probe will be a complementary DNA sequence. After excess probes are washed away, only probes bound to your target biomolecule remain. If your detectable marker is fluorescent, it can be viewed with an imager; the bound marker will fluoresce when imaged, producing a visible band that is distinct from your membrane, allowing you to see the presence of your biomolecule of interest.
 +
 
 +
Sometimes, as mentioned before, the probe can involve a conjugated enzyme. This is common for Western blots, where horseradish peroxidase (HRP) is a popular choice of enzyme. These probes require addition of a chemiluminescent substrate, which when processed by the enzyme will luminesce, and can be visualized by imaging.
 +
 
 +
The different variants of blotting are listed in the table below. A mnemonic for remembering these is '''SNoW DRoP'''. Each letter in the first word is the type of blot that corresponds to a letter in the second word, indicating the type of biomolecule involved.
 +
 
 
{|class="wikitable"
 
{|class="wikitable"
 
! colspan="2"| Types of Blotting
 
! colspan="2"| Types of Blotting
Line 433: Line 237:
 
|}
 
|}
  
==Genetic Disorders==
+
When analyzing the images from a blot, the separation step by gel electrophoresis is helpful. The position of the detected band on the membrane can be used to verify the size of the detected biomolecule. In actual experiments, non-specific binding (the probe binding to something that is not the intended target) may occur; these bands can sometimes be ruled out if they are especially faint. Finally, protein degradation may occur as a result of improper experimental procedure; in this case, you might see a long smear of bands from degraded protein sequences that are different sizes from the actual protein but still bind to the probe.
Genetic disorders are inherited medical conditions caused by abnormalities in the DNA. There are a variety of types of genetic disorders, and some are rarer than others. They are typically caused by mutations in specific genes, deletion of genes, or a person having an additional chromosome. While these genes can be known as disease-causing genes, the abnormality of a gene is the cause of the disorder.
 
  
One of the most common genetic disorders is known as trisomy 21, or Down Syndrome. An individual with this disorder has a third copy of chromosome 21. Cystic fibrosis is also a genetic disorder, caused by mutation in a protein known as CFTR. Even color blindness is a genetic disorder, caused by a mutation on the X chromosome.  
+
===Gene Therapy===
 +
Gene therapy is the process of introducing genes into a patient in order to cure a disease. It has the potential to eliminate hereditary diseases like cystic fibrosis and could cure other diseases like cancer or AIDS. Many different approaches to gene therapy are being tested such as deactivating problematic genes, replacing mutated genes and introducing new genes into the body but most of these are experimental and can be dangerous. Gene therapy is commonly only tested on diseases that have no other cures.  
 +
====CRISPR-Cas technology====
 +
CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeats, which is a bacterial defense mechanism that can be used to target and edit DNA in specific locations. CRISPR technology is typically used for gene therapy, and is currently being used to correct mutations that cause diseases. Other systems also exist that target RNA and diagnose illnesses.
  
===Karyotypes===
+
CRISPR "spacer" sequences are first translated into RNA sequences called crRNAs that can guide the system to the matching portions of DNA. The crRNAs bind to a protein called Cas9. With this RNA, Cas9 is able to detect complementary sequences of DNA in the genome (this process is similar to the one involving [[Designer Genes#microRNA|miRNA]]). When the DNA is found, Cas9 binds to the DNA and cuts it, disabling the gene. Scientists can introduce a Cas9-gRNA complex (guide RNA is made of crRNA and other RNA sequences which allow it to bind to Cas9) into a cell so that Cas9 targets a specific gene that they may want to study the function of. <br>
[[Image:karyotpye.jpg|thumb|right|Karyotype of a male with no chromosomal polysomy or monosomy.]]
+
CRISPR-Cas9 can be used to repair a gene which has a mutation. If a functional gene is introduced to the cell along with the Cas9-RNA complex, repair enzymes can use this functional gene as a template when they repair the DNA which has been cut by Cas9.
A karyotype is a chart that shows each chromosome. Each karyotype displays 23 pairs of chromosomes, including the X/Y chromosomes. Every pair is assigned a number (except for the sex chromosomes; they are always referred to as the X and Y chromosomes). Some genetic disorders can be detected by analyzing the number of chromosomes and/or the sex chromosomes. The gender of the individual can also be deduced from looking at the sex chromosomes. If there is an X and a Y, the individual is a male. A female has two X chromosomes and no Y chromosome.
 
  
A karyotype is created by stopping cells in cell division and staining the chromosomes, then observing them under a light microscope.
+
Check out the [[Protein Modeling/CRISPR-Cas9|CRISPR/Cas9 Protein Modeling page]] for some more information!
  
Karyotypes can be used to diagnose genetic diseases. For example, a karyotype can reveal a third chromosome 21, resulting in Down syndrome. It can also reveal Turner syndrome (45, X), a disorder that results in females with one X chromosome, and Klinefelter's syndrome (47, XXY), in which a man has two X chromosomes and one Y chromosome.
+
===Plasmid Cloning===
 +
Plasmids are small DNA molecules located outside of the chromosome that can replicate independently. They are typically found in bacteria, but can also be found in both single-celled and eukaryotic organisms. Plasmids can be used as vectors in genetic engineering, meaning that they can be used to transfer foreign genetic material into another cell. Plasmids are typically used to clone and amplify the expression of certain genes, and they can also be used in gene therapy. Plasmid cloning vectors are typically used to clone shorter DNA fragments, and other means are used for cloning larger fragments.  
  
====Sex determination====  
+
===Bioethics===
 +
Bioethics is an intersection of ethics and technology, and studies the way that advancements in science and medicine interact with society and the environment. It is concerned with basic human rights and ensuring that humans are responsible with the advances that they make. It also raises the question of whether or not biotechnology like cloning, life extension, and gene therapy are ethical and should be performed. Human experimentation is another common issue faced by bioethics, and basic ethical principles have been established by a wide variety of organizations to ensure that patients have the freedom to choose their own treatment. Bioethicists come from a variety of different backgrounds, and it typically influences their viewpoints on science. However, they typically follow four different principles that were established by the Belmont Report in 1978. These principles for using humans in research focus around maintaining respect for the people involved, keeping their safety a priority, and ensuring that procedures are not exploitative and are administered fairly. 15 federal agencies adopted a set of rules protecting human subjects following the Belmont Report, and this document enables people to understand regulations on human experimentation.
  
In humans, the male and female share 22 of the 23 pairs of chromosomes in each body cell. The 23rd pair is known as the sex chromosomes because it determines the sex of the individual. In the male, the sex chromosome consists of an X and a Y chromosome(XY) while the pair in females consists of two X chromosomes(XX). The male is the one who determines the sex of the child and the female gives an X to all eggs while the male randomly produces about 50% X sperm and 50% Y sperm.
+
==Genetic Disorders==
 
+
''Refer to [[Heredity#Genetic Disorders]]''
In rare cases, through nondisjunction, a person will have three sex chromosomes. If they have three X (XXX) chromosomes, they are female. If they have even one Y chromosome (XXY), they are male. Although they will show more feminine qualities, any person who has a Y chromosome is considered a male. Other types of sex chromosome polysomy, as well as one monosomy (X), have been known to occur, though much more rarely.
 
 
 
==Bioethics==
 
{{Incomplete|section}}
 
 
 
==Practice Test==
 
This practice test focuses on DNA. For full-length tests see the [http://scioly.org/wiki/Test_Exchange_Archive#Designer_Genes Designer Genes Test Exchange.]
 
 
 
'''1. Which of the following nucleotide pair bonds would be found in a DNA molecule?'''<br clear="all"/>
 
a. adenine-guanine<br clear="all"/>
 
b. guanine-cytosine <br clear="all"/>
 
c. adenine-cytosine<br clear="all"/>
 
d. cytosine-uracil<br clear="all"/>
 
 
 
'''2. The backbone of a DNA molecule is made of which two components?'''<br clear="all"/>
 
a. phosphate molecules and ribose sugars<br clear="all"/>
 
b. deoxyphosphate molecules and ribose sugars<br clear="all"/>
 
c. phosphate molecules and deoxyribose sugars<br clear="all"/>
 
d. deoxyphosphate molecules and deoxyribose sugars <br clear="all"/>
 
 
 
'''3. Ribosomes are made of:''' <br clear="all"/>
 
a. rRNA and protein<br clear="all"/>
 
b. tRNA and mRNA<br clear="all"/>
 
c. rRNA and mRNA<br clear="all"/>
 
d. protein and mRNA<br clear="all"/>
 
 
 
'''4. Watson and Crick were the first to suggest that DNA is:'''<br clear="all"/>
 
a. a short molecule<br clear="all"/>
 
b. the shape of a double helix<br clear="all"/>
 
c. a protein molecule<br clear="all"/>
 
d. protein and tRNA <br clear="all"/>
 
 
 
'''5. The chromosome abnormality that occurs when part of one chromosome breaks off and is added to a different chromosome is:'''<br clear="all"/>
 
a. deletion<br clear="all"/>
 
b. nondisjunction<br clear="all"/>
 
c. translocation<br clear="all"/>
 
d. inversion <br clear="all"/>
 
 
 
'''6. Which of the following would be least likely to happen as a result of a mutation in a person's skin cells?'''<br clear="all"/>
 
a. skin cancer<br clear="all"/>
 
b. reduced functioning of the skin cell<br clear="all"/>
 
c. no change in the functioning of the skin cell<br clear="all"/>
 
d. the person's offspring have mutated skin <br clear="all"/>
 
 
 
'''7. The process by which a DNA molecule is copied is called:'''<br clear="all"/>
 
a. binary fission<br clear="all"/>
 
b. mitosis<br clear="all"/>
 
c. replication<br clear="all"/>
 
d. translation <br clear="all"/>
 
 
 
'''8. A DNA nucleotide may be made up of a phosphate group along with:''' <br clear="all"/>
 
a. a deoxyribose sugar and uracil <br clear="all"/>
 
b. ribose sugar and adenine<br clear="all"/>
 
c. deoxyribose sugar and thymine<br clear="all"/>
 
d. ribose sugar and cytosine<br clear="all"/>
 
 
 
'''9. Which series is arranged in order from largest to smallest in size?'''<br clear="all"/>
 
a. chromosome, nucleus, cell, DNA, nucleotide<br clear="all"/>
 
b. cell, nucleus, chromosome, DNA, nucleotide<br clear="all"/>
 
c. nucleotide, chromosome, cell, DNA, nucleus<br clear="all"/>
 
d. cell, nucleotide, nucleus, DNA, chromosome <br clear="all"/>
 
 
 
'''10. Messenger RNA is formed in the process of:'''<br clear="all"/>
 
a. transcription<br clear="all"/>
 
b. translation<br clear="all"/>
 
c. replication<br clear="all"/>
 
d. mutation <br clear="all"/>
 
 
 
'''11. X rays, ultraviolet light, and radioactive substances that can change the chemical nature of DNA are classified as:'''<br clear="all"/>
 
a. growth regulators<br clear="all"/>
 
b. metamorphic molecules<br clear="all"/>
 
c. hydrolytic enzymes<br clear="all"/>
 
d. mutagens <br clear="all"/>
 
 
 
'''12. After DNA replication, the two DNA strands that are produced are:'''<br clear="all"/>
 
a. are complimentary<br clear="all"/>
 
b. are identical<br clear="all"/>
 
c. must replicate again<br clear="all"/>
 
d. cannot replicate again <br clear="all"/>
 
 
 
'''13. Bacteriophages are:'''<br clear="all"/>
 
a. tiny bacteria<br clear="all"/>
 
b. bacteria of the same type<br clear="all"/>
 
c. lipids and ribonucleic acids <br clear="all"/>
 
d. viruses <br clear="all"/>
 
 
 
'''14. In translation, the order in which codons bond to mRNA is determined by:'''<br clear="all"/>
 
a. rRNA<br clear="all"/>
 
b. tRNA<br clear="all"/>
 
c. Base pairing rules<br clear="all"/>
 
d. Lagging strand <br clear="all"/>
 
 
 
'''15. In RNA, the code word AUG that specifies methionine can also serve as a(n):'''<br clear="all"/>
 
a. anticodon<br clear="all"/>
 
b. stop codon<br clear="all"/>
 
c. initiator codon<br clear="all"/>
 
d. all are correct <br clear="all"/>
 
 
 
'''16. The two strands of a DNA double helix are:'''<br clear="all"/>
 
a. identical<br clear="all"/>
 
b. purines<br clear="all"/>
 
c. pyrimidines<br clear="all"/>
 
d. complementary <br clear="all"/>
 
 
 
'''17. Both DNA and RNA:'''<br clear="all"/>
 
a. contain ribose<br clear="all"/>
 
b. are single stranded<br clear="all"/>
 
c. contain nucleotides<br clear="all"/>
 
d. contain uracil<br clear="all"/>
 
 
 
{{SpoilerBoxBegin}}'''Answers to the Review Questions'''
 
{{SpoilerBoxContent}}
 
1. B<br clear="all"/>
 
2. C<br clear="all"/>
 
3. A<br clear="all"/>
 
4. B<br clear="all"/>
 
5. C<br clear="all"/>
 
6. D<br clear="all"/>
 
7. C<br clear="all"/>
 
8. C<br clear="all"/>
 
9. B<br clear="all"/>
 
10. A<br clear="all"/>
 
11. D<br clear="all"/>
 
12. A<br clear="all"/>
 
13. D<br clear="all"/>
 
14. C<br clear="all"/>
 
15. D<br clear="all"/>
 
16. D<br clear="all"/>
 
17. C<br clear="all"/>
 
{{SpoilerBoxEnd}}
 
  
 
==Resources==
 
==Resources==
[[Media:Gangsta DG Notes.pdf|gangsta_duck's Designer Genes Notes]]
+
:[[Media:Gangsta DG Notes.pdf|gangsta_duck's Designer Genes Notes]]
 +
:[[Media:GFNowhere_designer_notes.pdf|GuyFromNowhere's Designer Genes Notes]]
 +
:[[Media:Designer Genes Short Practice Test and Key.pdf|Short Practice Test (and key)]]
  
[[Media:GFNowhere_designer_notes.pdf|GuyFromNowhere's Designer Genes Notes]]
+
{{Life Science Event}}
[https://docs.google.com/document/d/1h4tr65StoK_H09z4u3fmmTI0PBWizeBRPz4tOPMBEw8/pub Molecular Biology of the Cell notes]
+
{{2021Events}}
 +
[[Category:Life and Social Science Events]]
 
[[Category:Event Pages]]
 
[[Category:Event Pages]]
 
[[Category:Study Event Pages]]
 
[[Category:Study Event Pages]]

Revision as of 02:44, 27 November 2020

Template:EventLinksBox Designer Genes is a Division C biology event for the 2021 season. It was previously an event for the 2013, 2014. 2019, and 2020 seasons. The event covers topics relating to genetics, biotechnology, and the molecular biology of inheritance.

Many topics listed in the table in section 3 of the rules consist of the material in the Division B event Heredity (although generally in less detail since it is only one part of the event). Several sections of this page link to the appropriate sections of the Heredity page.

Inheritance

Refer to Heredity#Inheritance

DNA

Refer to Heredity#DNA

RNA

Refer to Heredity#RNA

Mitosis

Refer to Heredity#Mitosis

Meiosis

Refer to Heredity#Meiosis

DNA Repair

Cells have three built-in mechanisms for repairing their DNA. DNA damage can be caused by normal internal factors, as well as environmental factors like radiation. DNA damage causes structural damage to the molecule and can affect a cell's ability to transcribe affected genes. Harmful mutations can also occur, and they can affect the survival of a cell and its daughter cells. DNA repair happens constantly throughout the body, responding to any damage to the DNA structure. When DNA repair is not performed and a cell does not undergo apoptosis (programmed cell death/cell suicide), then permanent damage can occur leading to malignant tumors or cancer.

Direct reversal

Direct reversal repair occurs when consecutive pyrimidine bases become fused together when they are exposed to UV light, forming pyrimidine dimers. Photoreactivation directly reverses this process by using an enzyme called photolyase that reacts directly to exposure to blue/UV light. Photolyase no longer functions in humans, but can be found in bacteria, fungi and some animals. Humans use a process known as nucleotide excision repair to repair damage done by UV light.

Excision repair

When only one strand of a double helix is damaged, the other strand can be used to identify the missing or incorrect bases. Excision repair mechanisms remove the damaged nucleotide and replace it with the correct undamaged nucleotide.

Base excision repair (BER) repairs damage to a single base using glycosylases. These enzymes remove the specific affected base, and DNA polymerase correctly synthesizes the new strand.

Nucleotide excision repair (NER) is less specific, and is typically used in cases where a large portion of the helix is distorted. Damaged regions are removed in a three step process with the recognition of the damage, excision of the damaged area, and resynthesis of the removed region. NER occurs in almost every organism.

Postreplication repair

Postreplication repair (also known as translesion synthesis) occurs when the replication process is allowed to replicate past DNA lesions. A gap is left at the damaged site when the Okazaki fragments are synthesized, filled in later by either recombination repair or error-prone repair. Recombination repair uses the sequence from a sister chromosome to repair the damaged DNA, and error-prone repair uses the damaged strand as a sequence template. Error-prone repair is typically inaccurate, and commonly results in mutations.

Gene Expression

Several different factors interact with RNA transcription and translation to control gene expression.

Transcriptional

Gene expression can be controlled during or after transcription. The rate of gene transcription is often controlled by allowing or denying RNA polymerase access to the gene. Termination can also occur early, preventing the gene from being transcribed properly. Transcriptional regulation can also occur when RNA polymerase is attempting to escape the promoter complex to start transcribing DNA. Protein factors can also alter the rate of transcription.

Post-transcriptional

There are three main post-transcriptional processes: the processing of 3' and 5' ends, RNA splicing, and alternative splicing. At the end of the transcription process, the 3' end gains 50-250 adenine nucleotides known as the Poly-A tail. The 5' end is capped with a 7-methylguanosine residue which is an altered version of guanine. This cap prevents the RNA from degrading and stabilizes the mRNA, enabling it to undergo translation into proteins. Certain enzymes are able to break down the Poly-A tail and cap, allowing nuclease enzymes to break down the RNA.

Alternative splicing

mRNA can be divided into two parts known as introns and exons. Introns are parts of the mRNA not used for translation and the production of proteins. Exons are all of the other parts of the mRNA that are used for the production of proteins. During RNA splicing, the unnecessary introns are removed and the expressed exons are brought together. Small nuclear ribonuclearproteins (snRNPS) recognize the splice sites, and join together additional proteins to form an assembly known as the spliceosome. This assembly removes the introns and facilitates RNA splicing.

The process for alternative splicing is similar to the process for regular RNA splicing. However, the key difference is that alternative splicing produces different RNA from the same primary transcript. Exons are mixed and matched to create different proteins from the same length of mRNA. This process is also called exon shuffling, and is the reason why humans produce so many proteins despite having a limited number of genes.

microRNA

MicroRNA (abbreviated to miRNA) is a small sequence of RNA that regulates gene expression. It is typically about 22 nucleotides in length. miRNA works by bonding with complementary sequences in mRNA, which destabilizes the mRNA strand by separating it into two pieces or slowing down translation into proteins. miRNA is involved in a variety of biological functions, including cell cycle control, apoptosis and developmental processes like aging and immune responses. miRNA has also been implicated in various diseases including cancer and certain types of heart and neurological diseases. One miRNA can target multiple genes, regulating the expression of multiple proteins.

Translational

Gene expression can also be regulated or modified during or after RNA translation.

Lac and Trp Operons

Lac and Trp Operons are examples in prokaryotic gene regulation. Most prokaryotic genes such as in E. coli are always turned "on", but others are active only when products are needed by the cell. As such, their expression must be regulated.

An operon is a group of genes transcribed together by a single promoter. The lac operon was the first to be discovered. In the model bacterium E. coli, this operon is transcribed in the presence of lactose to give the bacterium the ability to digest this source of energy. It has three parts: lacA, lacY, and lacZ, as well as a promoter, a regulator, a terminator, and an operator. To activate lactose digestion abilities, an isomer of lactose (allolactose) binds to the gene's repressor, allowing the operon to be transcribed

Whereas the lac operon gives E. coli the ability to digest lactose, the trp operon shuts off the bacterium's capability to metabolize tryptophan. As such, it is an example of a repressible operon. In the presence of lactose, its five structural genes (trpA, trpB, trpC, trpD, and trpE), which code for tryptophan synthase, will be repressed so E. coli can metabolize lactose instead. Lac operons are inductible operons due to the fact that genes are expressed in the presence of a substance (lactose).

Post-translational

The way a protein functions hinges on the way it is folded. Hydrogen bonds form between the nucleotides, which produce the tertiary structure of the protein. Chaperonins assist the folding of proteins, and ensure that it does not fold improperly. A protein that is folded improperly and not destroyed can cause numerous diseases such as Alzheimer's disease, cystic fibrosis, and cancer. Enzymes can also process the polypeptide once it is folded by removing residues or amino acids.

Carbohydrates, lipids and phosphate groups can also be attached to the polypeptides. The attachment of carbohydrates is known as glycosylation, and often promotes protein folding and stability in proteins. Lipidation often occurs in proteins that are going to be attached to the cell membrane. The most common type of post-translational modification is known as phosphorylation and typically regulates the activity of enzymes.

Epigenetics

Epigenetics is the study of changes in organisms not caused by the alteration of genetic code. Epigenetics revolves around gene expression, not the DNA itself. It affects how genes are read by cells, and how they produce proteins. Think of the human genome as a filing cabinet, and the genes as folders that contain the instructions to make a protein. Certain folders might be marked as important, or others could be marked as less important. These epigenetic marks control the expression of genes. It is the reason that even though every cell in the body has the same DNA at its core, different cells have different functions. A liver cell would open different folders in the filing cabinet than a brain cell would, because it would need to make different proteins.

Epigenetic marks take the form of molecular tags that are placed in different places on the histone, and each one has a different effect. They can make DNA more accessible to proteins, or purposefully make it less accessible so that a specific gene is not transcribed or translated. Some epigenetic marks are very long and cover large stretches of DNA, or others are gathered at the start of genes. Epigenetic marks can also change over time. These changes can be caused by anything from chemical additives in plastics to DNA errors during replication.

Some epigenetic marks can also be inherited through generations. This is how environmental factors are passed down through generations. Addictive behavior is inherited in this way, and the effects nutrient deprivation can be passed down in this way too. However, passing down epigenetic tags is different than passing down genes. Reproductive cells undergo a process called reprogramming, and this process is supposed to erase all epigenetic tags. However, on some genes it fails and leaves these tags in place to be passed down to another generation. In mammals, about 1% of genes escape epigenetic reprogramming.

Epistasis

Epistasis is yet another form of gene expression that describes the relationship between multiple genes. Epistasis typically features one allele masking the phenotype of another separate gene. This is different from a dominant/recessive relationship because those alleles are different types of the same gene. For example, in humans the gene that codes for albinism is separate from the gene that codes for skin tone. If a human has the gene responsible for albinism then their skin tone is "masked" and not displayed.

Phylogenetics

Phylogenetics is the study of evolutionary relationships. A phylogenetic tree displays these relationships based upon their similarities and differences. Rooted trees have a common ancestor, and in some cases the length of a line can indicate time estimates. Unrooted trees only show the relationship between a couple of organisms and do not require an ancestral root. Phylogenetic trees are based on speculation and do not show exact evolutionary history, but they can still display how animals could have possibly evolved.

Hardy-Weinberg Equilibrium

The Hardy-Weinberg equilibrium is a common population model used in genetics.

Conditions

The Hardy-Weinberg Law states that a population will maintain the exact allele and genotype frequencies over each generation unless five specific influences are introduced into the population. For a population to be in Hardy-Weinberg equilibrium, it must meet all of the 5 conditions listed below:

  1. No mutations: Mutations introduce new alleles into the population.
  2. No gene flow: Like mutations, immigration or emigration can introduce new alleles (or bolster/diminish existing alleles)
  3. Very large population: Genetic drift is likely to occur in a smaller population. Hardy-Weinberg equilibrium can only occur in a population approaching infinity.
  4. No natural selection: If some traits are discriminated for/against by environmental conditions, the genotype frequencies will not be in equilibrium over the generations.
  5. Random mating: Like natural selection, sexual selection involved in non-random mating could discriminate for/against traits.

An example of Hardy Weinberg: Consider a world where everyone has either purple or blue skin. "S" is purple skin, and "s' is blue skin. The probability of either one of these traits occurring is constant, and both that and the allele freqeuncies have to add to 1. The probabilities of the alleles are represented as [math]p[/math] (for S) and [math]q[/math] (for s). Therefore, the probability of being homozygous purple (SS) would be [math]p*p[/math] or [math]p^2[/math], the probability of being heterozygous (Ss) would be [math]p*q + q*p[/math] or [math]2pq[/math], and the probability of being homozygous blue (ss) would be [math]q*q[/math] or [math]q^2[/math]. These probabilities form the two equations used in the Hardy-Weinberg equilibrium.

Equations

There are two equations used in the Hardy-Weinberg Law:

  1. [math]p^2 + 2pq + q^2 = 1[/math]
  2. [math]p + q = 1[/math]

where

[math]p[/math] is the frequency of the (homozygous) dominant allele in the population, as a percentage
[math]q[/math] is the frequency of the (homozygous) recessive allele in the population, as a percentage
[math]p^2[/math] is the percentage of the homozygous dominant individuals
[math]2pq[/math] is the percentage of the heterozygous individuals
[math]q^2[/math] is the percentage of the homozygous recessive individuals.

These equations only apply if the population is in Hardy-Weinberg equilibrium.

Solving a Hardy-Weinberg Problem

A typical Hardy-Weinberg problem will resemble the sample problem below:

In a certain population, the percentage of the homozygous recessive genotype (aa) is 36%. Using only that information, find:
  1. The frequency of the recessive genotype.
  2. The frequency of the recessive allele.
  3. The frequency of the dominant allele.
  4. The percent of the heterozygous individuals.

IMPORTANT: Before attempting to solve the problem, it is critical to analyze all of the given information and approach it in the correct manner. Make sure to check the math after finishing! One mistake will throw off the entire problem. When solving a problem, make sure to work in the order as follows:

Step 1: Determine [math]q[/math]. Since a dominant phenotype can have either a homozygous or heterozygous genotype, it is easier to find the recessive allele first (unless an exact homozygous/heterozygous dominant value is given).
Step 2: Determine [math]p[/math]. Using the second equation, [math]p[/math] can be found once [math]q[/math] has been determined.
Step 3: Determine [math]p^2[/math] and [math]q^2[/math]. Steps 3 and 4 are interchangeable, but finding [math]p^2[/math] and [math]q^2[/math] first is generally the common practice.
Step 4: Determine [math]2pq[/math].

The answers and work (using the four steps) for the sample problem are shown below:

Step 1: Determine [math]q[/math]. Since aa, or [math]q^2[/math] is 36%, then a (the frequency of the recessive allele-this is q in Hardy-Weinberg terms) must be 60%, or 0.6.
Step 2: Determine [math]p[/math]. Using the second equation, [math] p + q = 1[/math]. Therefore, [math]p[/math], or A must be 0.4. (40%)
Step 3: Determine [math]p^2[/math] and [math]q^2[/math]. Now that [math]p[/math] and [math]q[/math] (A and a respectively) are both known, [math]p^2[/math] and [math]q^2[/math] can be found by squaring each term. In this case, [math]p^2 = .16[/math] and [math]q^2 = .36[/math] (16% and 36% respectively).
Step 4: Determine [math]2pq[/math]. This can be done two ways. Rearranging the first equation, [math]2pq = 1 - p^2 - q^2[/math], so [math]2pq = .48[/math] (48%). Additionally, [math]2pq[/math] can be found by multiplying [math]p[/math] and [math]q[/math] together, then multiplying that by [math]2[/math].

So, the answers to the sample questions are:

  1. .36 (this was given to us in the problem)
  2. .6
  3. .4
  4. 48%

Note: Frequency is always expressed as a decimal (and percentages are expressed as percents).

Biotechnology

Sequencing

There are a variety of ways to sequence DNA, or determine the specific order that nucleotides are in. One of the most reliable methods of sequencing is the chain-termination method, or Sanger sequencing. This method was one of the earliest and is typically used on strands of DNA that have 900 base pairs or less. It is expensive and inefficient for larger scale projects, but useful for individual pieces of DNA. The Sanger method is a three step process, and was used in the Human Genome Project to sequence all 22 autosomes and the X and Y chromosomes.

  • A DNA fragment is denatured into a single strand and cooled so that the primer can bind to it. A primer and DNA polymerase are added, along with regular deoxynucleotides and fluorescent chain-terminating dideoxynucleotides. These special nucleotides lack a hydroxyl group on the 3' carbon, preventing the addition of further nucleotides.
  • Once the primer binds to the DNA fragment, the temperature is raised again and the DNA polymerase begins to add DNA nucleotides to appropriate sites on the template DNA. This continues until the polymerase adds a tagged nucleotide instead of a regular one.
  • This process creates numerous strands of varying lengths. These strands can be separated by length using gel electrophoresis which can be used to show which dideoxynucleotide is at the end. The strand created with the Sanger method is the complementary strand of DNA.

Next Generation Sequencing

Next Generation Sequencing (NGS), or high-throughput sequencing, is a name that describes several different ways to sequence DNA. It is faster and cheaper than Sanger sequencing since many sequencing reactions can take place at once, it is very low-cost, and the reactions are much smaller.

RNA-Seq and Tn-Seq

RNA sequencing (also known as WTSS) is the use of Next Generation Sequencing to reveal how much RNA is in a sample at a given moment and is replacing microarrays in many labs. RNA-Seq sequences the mRNA and can be used to analyze gene expression, typically in different conditions (such as with drugs and without drugs). It can also find variations in RNA and detect post-transcriptional alterations, whereas microarrays can only determine gene expression.

Tn-Seq (transposon sequencing) determines genetic interactions and can determine the frequency of mutations. However, it is limited to bacterial studies.

Microarrays

A microarray consists of a small solid surface with various known single-stranded segments of DNA attached. It is primarily used for testing unknown DNA sequences - the level of binding of an unknown sequence to one of the microarray segments (known as probes) indicates whether the unknown strand is complementary to a particular known strand. DNA microarrays are also used to measure the expression levels of a large amount of genes simultaneously.

RFLP Analysis

In RFLP analysis (also known as restriction enzyme analysis), a DNA sample is broken into pieces and digested by restriction enzymes and the resulting restriction fragments are separated according to their lengths by gel electrophoresis. Though now largely obsolete due to the rise of inexpensive DNA sequencing technologies, RFLP analysis was previously used for DNA profiling (also known as DNA fingerprinting).

Molecular Cloning

Molecular cloning is the process of inserting recombinant DNA into various host organisms - for example, certain types of bacteria - and replicating them. It is most often used to manufacture large quantities of desirable proteins. For example, synthetic insulin is primarily produced using recombinant DNA inside bacteria such as E. coli.

Polymerase Chain Reaction

Polymerase Chain Reaction, abbreviated as PCR, is a method of quickly making billions of copies of a desired section of DNA. For a virtual lab that clearly explains the process, visit this website.

The Polymerase Chain Reaction is another way of creating large numbers of a specific piece of DNA, other than cloning DNA.

In PCR, DNA primers are employed on opposite ends of the DNA sequence. They are necessary of the initiation of DNA replication. Then, a single strand of DNA is used as the template to produce double stranded DNA through polymerization.

Individual strands of DNA are unwinded from double stranded DNA using heat. Thus, PCR consists of heat treatment to unwind the DNA, then the binding of primers to the DNA, then polymerization to form another strand. This repeats, and will quickly and exponentially multiply the amount of DNA available.

The key step in the development of PCR was the isolation and use of a heat resistant DNA polymerase (TacDNA Polymerase).

PCR is better than the conventional cloning of DNA due to the fact that PCR can be used with only very small and impure samples of DNA.

Gel Electrophoresis

Gel Electrophoresis is one of the most useful techniques to study macromolecules, especially proteins or nucleic acids. In gel electrophoresis, charged molecules are pulled through a gel (usually purified agar known as agarose) and this separates the molecules. Larger molecules move more slowly through the gel since they get caught in the gel matrix, and molecules with greater charges move faster since the electric field is what is pulling the molecules.

Molecules will have a negative charge, and thus will move towards the positive poles.

Blotting

Blotting is a method for identifying specific biomolecules (DNA, RNA, proteins) in a sample mixture. If you wanted to determine if a particular protein is present in a tissue sample, you could determine it by blotting.

First, a sample is processed. This step usually involves extracting all of a particular type of molecule (e.g. protein extraction, DNA extraction), producing an extract. Additional steps may be performed such as reducing steps (to break apart disulfide bonds in proteins), denaturing steps (e.g. adding detergent to denature proteins as in SDS-PAGE), or other digestion steps (such as adding restriction enzymes to a nucleic acid sample).

Then, the extract is run by gel electrophoresis to separate the biomolecules in a gel by size and/or charge. The contents of the gel are then transferred to a membrane sheet (usually nitrocellulose or PVDF) by applying a current. The negatively-charged biomolecules will move out of the gel and be trapped in the membrane.

Finally, the biomolecule of interest is detected using a probe that can be visualized. Probes are biomolecules with specificity for the target protein that are conjugated (bonded) to a detectable marker or enzyme. Usually, if you want to detect a protein, your probe will also be a protein (an antibody); if you want to detect a specific DNA sequence, your probe will be a complementary DNA sequence. After excess probes are washed away, only probes bound to your target biomolecule remain. If your detectable marker is fluorescent, it can be viewed with an imager; the bound marker will fluoresce when imaged, producing a visible band that is distinct from your membrane, allowing you to see the presence of your biomolecule of interest.

Sometimes, as mentioned before, the probe can involve a conjugated enzyme. This is common for Western blots, where horseradish peroxidase (HRP) is a popular choice of enzyme. These probes require addition of a chemiluminescent substrate, which when processed by the enzyme will luminesce, and can be visualized by imaging.

The different variants of blotting are listed in the table below. A mnemonic for remembering these is SNoW DRoP. Each letter in the first word is the type of blot that corresponds to a letter in the second word, indicating the type of biomolecule involved.

Types of Blotting
Southern Getting DNA from a film
Northern Getting RNA from a film
Western Getting proteins from a film
Eastern Detecting post-translational modifications in proteins

When analyzing the images from a blot, the separation step by gel electrophoresis is helpful. The position of the detected band on the membrane can be used to verify the size of the detected biomolecule. In actual experiments, non-specific binding (the probe binding to something that is not the intended target) may occur; these bands can sometimes be ruled out if they are especially faint. Finally, protein degradation may occur as a result of improper experimental procedure; in this case, you might see a long smear of bands from degraded protein sequences that are different sizes from the actual protein but still bind to the probe.

Gene Therapy

Gene therapy is the process of introducing genes into a patient in order to cure a disease. It has the potential to eliminate hereditary diseases like cystic fibrosis and could cure other diseases like cancer or AIDS. Many different approaches to gene therapy are being tested such as deactivating problematic genes, replacing mutated genes and introducing new genes into the body but most of these are experimental and can be dangerous. Gene therapy is commonly only tested on diseases that have no other cures.

CRISPR-Cas technology

CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeats, which is a bacterial defense mechanism that can be used to target and edit DNA in specific locations. CRISPR technology is typically used for gene therapy, and is currently being used to correct mutations that cause diseases. Other systems also exist that target RNA and diagnose illnesses.

CRISPR "spacer" sequences are first translated into RNA sequences called crRNAs that can guide the system to the matching portions of DNA. The crRNAs bind to a protein called Cas9. With this RNA, Cas9 is able to detect complementary sequences of DNA in the genome (this process is similar to the one involving miRNA). When the DNA is found, Cas9 binds to the DNA and cuts it, disabling the gene. Scientists can introduce a Cas9-gRNA complex (guide RNA is made of crRNA and other RNA sequences which allow it to bind to Cas9) into a cell so that Cas9 targets a specific gene that they may want to study the function of.
CRISPR-Cas9 can be used to repair a gene which has a mutation. If a functional gene is introduced to the cell along with the Cas9-RNA complex, repair enzymes can use this functional gene as a template when they repair the DNA which has been cut by Cas9.

Check out the CRISPR/Cas9 Protein Modeling page for some more information!

Plasmid Cloning

Plasmids are small DNA molecules located outside of the chromosome that can replicate independently. They are typically found in bacteria, but can also be found in both single-celled and eukaryotic organisms. Plasmids can be used as vectors in genetic engineering, meaning that they can be used to transfer foreign genetic material into another cell. Plasmids are typically used to clone and amplify the expression of certain genes, and they can also be used in gene therapy. Plasmid cloning vectors are typically used to clone shorter DNA fragments, and other means are used for cloning larger fragments.

Bioethics

Bioethics is an intersection of ethics and technology, and studies the way that advancements in science and medicine interact with society and the environment. It is concerned with basic human rights and ensuring that humans are responsible with the advances that they make. It also raises the question of whether or not biotechnology like cloning, life extension, and gene therapy are ethical and should be performed. Human experimentation is another common issue faced by bioethics, and basic ethical principles have been established by a wide variety of organizations to ensure that patients have the freedom to choose their own treatment. Bioethicists come from a variety of different backgrounds, and it typically influences their viewpoints on science. However, they typically follow four different principles that were established by the Belmont Report in 1978. These principles for using humans in research focus around maintaining respect for the people involved, keeping their safety a priority, and ensuring that procedures are not exploitative and are administered fairly. 15 federal agencies adopted a set of rules protecting human subjects following the Belmont Report, and this document enables people to understand regulations on human experimentation.

Genetic Disorders

Refer to Heredity#Genetic Disorders

Resources

gangsta_duck's Designer Genes Notes
GuyFromNowhere's Designer Genes Notes
Short Practice Test (and key)