Reflections. Tales from the Genome. Exam 2

In [1]:
import webbrowser
from IPython.core.display import HTML
from IPython.display import Image

Glossary

Activator: transcription factors that bind to an enhancer and increase transcriptional activity

Adenine (A): a type of nitrogenous base that is typically used in both DNA and RNA (A basepairs with U or T)

Allele: Specific variant of a genetic sequence for which more than one variation exists, sometimes associated with a unique phenotype

Allele Frequency: the frequency of an allele is equal to the number of that allele divided by the total number of alleles in a given population

Amino Acid: the basic building blocks of proteins, combined by ribosomes during the process of translation; there are 20 different amino acids

Aneuploidy: a chromosomal aberration in which certain chromosomes are present in extra copies or are deficient in number

Autosome: a nuclear chromosome that is not a sex chromosome (X or Y)

Basepair: the phenomenon of nitrogenous bases in nucleic acid pairing with one another in double-stranded DNA or RNA, following the rules A:T, G:C in DNA and A:U, G:C in RNA

Behavioral Trait: any trait that concerns an organism's action or interaction with or within an environment, for example: aggressiveness

Bioethicist: someone who focuses on ethical issues relating to biological topics

Blending Inheritance: the idea that a particular trait in an offspring is a mix of the parents’ traits

Bottleneck: genetic drift resulting from the reduction of a population, typically by a natural disaster, such that the surviving population is no longer genetically representative of the original population

Chromosome: a super-coiled structure of organized DNA wrapped around histones; contains a single molecule of DNA

Coding DNA: sometimes referred to as "protein-coding DNA"; refers to any sequence in the genome that specifies amino acids and translation signals (initiation and termination codons)

Codon: a three-nucleotide sequence of mRNA that specifies a particular amino acid or termination signal

Combinatorial Regulation: the idea that transcription of most genes is controlled by more than one activator or repressor to achieve a particular level of activity

Computational Biologist: someone who applies their knowledge of computer science or computer coding to biological problems

Concordance: the presence of the same trait in both members of a pair of twins or set of individuals

Consensus Sequence: a single sequence that represents the most prevalent individual unit at each position, derived by comparing variants of the sequence from different sources

Correlation: refers to an observable relationship between any paired values

Cytochrome P450: a large and diverse group of enzymes that catalyze the oxidation (metabolism) of organic substances (drugs)

Cytoplasm: the interior of a cell, excluding the nucleus

Cytosine (C): a type of nitrogenous base that is typically used in both DNA and RNA (C basepairs with G)

Deoxynucleotides: the building blocks of DNA; there are four different bases used in deoxynucleotides: Adenine (A), Thymine (T), Guanine (G), and Cytosine (C)

Direct Selection: (see natural selection)

Director Of Clinical Operations: someone who designs and manages clinical trials

Dizygotic Twins: twins that came from two different fertilized eggs or zygotes; fraternal twins

DNA: deoxyribonucleic acid; the hereditary material of almost all cells that makes up their genomes

DNA Amplification: the in vitro replication of a DNA sequence to make many more copies

DNA Extraction: the isolation of DNA from a biological sample

DNA Sequence: a string of DNA letters (bases) in consecutive order

Dominant Trait: can mask the presence of a recessive allele or trait

Double Helix: the structure of DNA, referring to its two adjacent strands wound into a spiral shape and held together through basepairing

Duplicated Chromosome: the stereotypical structure in the shape of an "X" for a nuclear chromosome that appears only before cell division

Duplication: a replication error that doubles a large segment of DNA

Efficacy: a drug's ability to produce a therapeutic effect

Egg: female reproductive cell

Enhancer: a DNA sequence that binds certain transcription factors, activators, that can stimulate transcription of nearby genes

Enzyme: a class of proteins that enable chemical reactions without being consumed by the reaction

Exon: a sequence from a gene that is transcribed and remains in the mRNA after splicing and includes codes for amino acids

Founder Effect: a cause of genetic drift attributable to colonization by a limited number of individuals from a parent population

Frameshift: any mutation that results in changing the reading frame of translation

Gain-Of-Function Mutation: changes the gene product such that it gains a new and/or abnormal function

Gamete: sperm or egg cells; produce as a result of meiosis from germ cells

Gene: a discrete unit of hereditary information consisting of a specific deoxynucleotide sequence in DNA

Gene Expression: the process by which information from a gene is used in the synthesis of a functional gene product

Genetic Counselor: someone who explains and discusses personal genetic information with individuals and families

Genetic Genealogist: someone who uses genetic information to determine and catalog family relationships and uncover ancestry

Genome: the complete complement of an organism's genetic material

Genome Wide Association Study (GWAS): GWAS seek to correlate, in populations, the association of specific alleles with the trait or disorder being studied

Genotype: the genetic makeup of an organism, specifically the composition of alleles

Germ Cell: the type of the cell in the body that makes gametes; this is the only cell type where mutations affect the next generation of an organism

Guanine (G): a type of nitrogenous base that is typically used in both DNA and RNA (G basepairs with C)

Hidden Trait: any trait that is not apparent through outward observation

Histone: a protein that is used to organize and fold DNA like a string on a spool

Heterozygous: having two different alleles for a given gene

Heterozygote Advantage: describes the case in which the heterozygous genotype has a higher relative fitness than either the homozygous dominant or homozygous recessive genotype

Heritability: the proportion, between 0 and 1, of observable differences in variation of a trait between individuals within a population that is due to genetic differences

Homozygous: having two identical alleles for a given gene

Human Geneticist: someone who studies human genetics and inheritance

Hybridization: the process of basepairing that can occur between any types of nucleic acids (DNA:DNA, DNA:RNA, or RNA:RNA)

Identity By Descent (IBD): genetic sequence shared through ancestry

Identity By Descent (IBD): genetic sequence shared through ancestry

Identity By State (IBS): genetic sequence that is identical between two individuals

Inheritance: the passing down of traits from one generation to the next, at the level of the cell or the organism

Innate Trait: any trait that is in-born, for example: your pancreas secreting enzymes that break down the food in your gut

Intron: a sequence from a gene that is transcribed but cut out of the mRNA by splicing and typically does not code for any amino acids

Learned Trait: any trait that is not in-born and instead acquired through environmental (typically cognitive) influence, for example: belief in a particular religion

Loss-Of-Function Mutation: results in the gene product having less or no function

Meiosis: a two-stage type of cell division in germ cells that results in gametes with half the chromosome number of the original cell

Mitochondrial DNA: a circular DNA molecule that can only be found in the mitochondria of all cells in the body and is inherited only from the mother

Mitosis: the normal chromosome doubling and division that all somatic (body) cells do to maintain the same number of chromosomes at the end of each division

Missense/Non-Synonymous Mutation: a mutation that changes an amino acid

Monogenic Trait: traits that are significantly influenced by a single gene

Monozygotic Twins: twins that came from the same fertilized egg or zygote; identical twins

Multifactorial Trait: a trait that is controlled by many genes and is also influenced by the environment

Mutation: a change in the genetic sequence

Natural Selection: the process by which traits become either more or less common in a population because of pressures directly affecting the reproductive fitness of individuals carrying particular alleles; also considered "direct selection" due to pressures that directly affect the fitness of particular alleles

Non-coding DNA: any sequence that does not specify amino acids and translation signals (initiation and termination codons)

Non-Duplicated Chromosome: a non-"X"-shaped nuclear chromosome;

Nonsense Mutation: a mutation that changes an amino acid codon to a STOP codon

Nuclear Genome: the complete set of 23 pairs of chromosomes that reside within the nucleus of the cell

Nucleotide: building block of DNA and/or RNA consisting of a base, a ribose or deoxyribose sugar and a phosphate group; there are five different bases used in nucleotides: Adenine (A), Thymine (T) in DNA or Uracil (U) in RNA, Guanine (G), and Cytosine (C)

Nucleus: a separate, membrane-bound compartment of eukaryotic cells that houses the DNA and separates it from the rest of the cell; this is where transcription occurs

Particulate Inheritance: the idea that characteristics can be passed down from generation to generation through discrete particles, i.e. genes

Pedigree: organized way of illustrating (drawing) family relations and traits

Penetrance: the degree to which a particular allele causes a trait

Personal Genome: the entirety of your own individual DNA

Pharmacodynamics: the target effects of a drug; what a drug does to the body

Pharmacogenetics/Pharmacogenomics: the study of how different drugs interact with the body in different ways based on genetic variation

Pharmacokinetics: how a drug is metabolized; what the body does to a drug

Pharmacology: the study of drugs and their origins, as well as how they interact with the body of a living organism

PharmGKB: Pharmacogenomics Knowledge Base is an interactive tool for researchers to investigate how genetic variation affects drug response, both with regards to pharmacodynamics and pharmacokinetics

Phenotype: the physical makeup, or appearance, of an organism or individual trait

Physical Trait: any trait that concerns our material makeup

Polygenic Trait: a trait that is the result of multiple gene interactions with very little environmental impact

Population: a group of individuals of one species that live in a particularly defined area

Promoter: a region of DNA at which transcription of a particular gene is initiated

Protein: a large chain or combination of multiple chains of amino acids

Promoter: a region of DNA at which transcription of a particular gene is initiated

Qualitative Trait: a trait that is described by either its presence or absence

Quantitative Trait: a trait that varies continuously over a range of measurements and displays a normal distribution (bell curve)

Random Selection: the process by which traits become either more or less common in a population due to random chance, not because of pressures directly affecting the reproductive fitness of particular alleles; it is random because the pressures do not directly affect the fitness of particular alleles

Recessive Trait: masked by the presence of a dominant allele or trait

Recombination: a special process during meiosis that can swap pieces of your maternal and paternal chromosome copies

Relative Risk: an individual's risk based on family or genetic background compared to the general population

Repressor: transcription factors that bind to a silencer and inhibit transcriptional activity

Ribosome: a molecular machine that translates, or reads, the genetic code within the mRNA sequence and synthesizes a corresponding chain of amino acids

RNA and mRNA: ribonucleic acid; A type of nucleic acid, usually single-stranded, consisting of nucleotides with the nitrogenous bases of A, C, G, and U

RSID: reference SNV identification

Sex Chromosome: a nuclear chromosome that distinguishes the sexes: XY - male, XX - female, and can affect sex-specific traits

Silencer: a DNA sequence capable of binding transcription regulation factors, called repressors, and inhibit transcriptional activity

Silent/Synonymous Mutation: a mutation that does not change the amino acid sequence

Single Nucleotide Variation (SNV): single base change in DNA; SNVs (also known as single nucleotide polymorphisms, or SNPs) are one of the smallest kinds of mutations and are responsible for a large number of differences among humans

SNV Genotyping: determining the base at any given position in a genome, not through total genome sequencing

Somatic Cell: any cell of the body that is not a germ cell (not directly responsible for carrying the information passed on to the next generation)

Splicing: the process of removing introns and combining exons in a mRNA sequence after transcription

TATA-Box: the TATAAA sequence that can be found in the promoter of many genes and is essential to initiate transcription

Thymine (T): a type of nitrogenous base that is typically used in DNA (T basepairs with A)

Toxicity: the degree to which a drug causes negative effects

Trait: any distinguishing feature of an individual

Transcription: the step of gene expression in which a particular segment of DNA is copied into RNA

Transcription Factor: any protein that joins the transcription process by binding to DNA or to other proteins that bind DNA to regulate transcription

Translation: the process by which ribosomes read the mRNA sequence and connect the amino acids in the order specified by the sequence

Translocation: rearrangement of a large sequence of genetic information, typically transferring from one chromosome to another through DNA breakage and resealing

Uracil (U): a type of nitrogenous base that is typically used in RNA (U basepairs with A)

Variant: version of a genetic sequence or gene for which more than one version exist

Variation: diversity among members of a population

Visible Trait: any trait that is apparent through outward observation

X Chromosome: one of two mammalian sex chromosomes that can be found in both males (XY) and females (XX)

Y Chromosome: the mammalian sex-determining chromosome that can only be found in males (XY) and is passed from father to son

Problem solving tips:

♦ p and q represent allele frequencies for a gene with two alleles; the sum of p + q must equal 100% of the alleles (or gametes) in the population. In other words, p + q = 1.

♦ p2, 2pq, q2 represent genotype frequencies; p2 corresponds to one homozygous genotype, 2pq represents the heterozygous genotype and q2 is the other homozygous genotype.

♦ Determining allele and genotype frequencies can be done two slightly different ways. One method involves converting the initial numbers of each genotype to frequencies and then doing all calculations as frequencies. In this case the frequency of the p allele = the frequency of the p/p homozygotes + 1/2 the frequency of the heterozygotes. The frequency of the q allele = the frequency of the q/q homozygotes + 1/2 the frequency of the heterozygotes. If the problem asks for numbers of individuals then the genotype frequencies will be multiplied by the number of individuals in the population, as in problem 19-2 solution # 1. The problems in this chapter will be solved using this method. Alternately the problems can be solved by calculating the numbers of each allele present in the population. When doing this the total must be divided by the total number of alleles, as in problem 19-2 solution # 2.

♦ When any values of p and q that sum to 1 are binomially expanded (p2 + 2pq + q2) the resulting genotype frequencies are those of a population in Hardy-Weinberg equilibrium.

♦ The binomial expansion is the equivalent of a Punnett square. If p (or the A allele) = 0.7 and q (or the a allele) = 0.3 then the binomial expansion gives the genotype frequencies in the next generation as 0.49 AA + 0.42 Aa + 0.09 aa. You can use a Punnett square to generate the same genotype frequencies in the F1 generation:

# # 0.7 A ### 0.3 a

0.7 A ### 0.49 AA # 0.21 Aa

0.3 a ### 0.21 aA # 0.09 aa

♦ If the allele frequencies are calculated correctly then p + q MUST sum to 1. If the genotype frequencies are calculated correctly than p2 + 2pq + q2 MUST sum to 1.

♦ Once a population is at equilibrium allele and genotype frequencies do not change.

♦ If a genotype is selected against or if populations are combined (by migration) or if there is significant mutation (usually together with selection) allele frequencies will change.

♦ Selection for one genotype and selection against another genotype are balanced at a particular allele frequency (equilibrium frequency).

♦ Genetic drift is most often seen in small populations.

♦ Polygenic traits are controlled solely by alleles of two or more gene; multifactorial traits include polygenic traits and traits that are influenced by both genes and environment.

♦ Genetic and environmental contributions to a phenotype are sorted out by setting up conditions in which the genetic background is consistent (to analyze environmental contributions) or conditions in which the environment is constant (to analyze genetic contributions).

Exam 2

#1

For an autosomal gene with four alleles, there are 10 possible genotypes (4 homozygous and 6 heterozygous). How many unique genotypes are possible with four alleles of an X-linked gene (no correlate on the Y).Remember to think about the possibilities that include both males and females.

14 (4 homozygous female, 6 heterozygous female, 4 hemizygous male)

#2

Normally the thyroid growth gormone thyroxine is made in the body by an enzyme as follows:

Tirosine -> enzyme -> Thyroxine

If the enzyme is deficient, the symptoms are called genetic goiterous cretinism (GGC), a rare syndrome consisting of slow growth, enlarged thyroid (called a goiter), and mental retardation. If one copy of the normal allele for this enzyme is enough to keep normal thyroid function, would you expect GGC to be inherited as:

A. Dominant

B. Recessive

B, dominant is a normal allele (one copy is enough)

#3

A base deletion early in the coding sequence of a gene may result in which of the following?

A. a nonsense mutation
B. a frameshift mutation
C. multiple missense mutation
D. a nonfunctional protein

A,B,C,D

#4

Assume that genes M and A are on different chromosomes.

How many different gamete genotypes could come from a person of genotype:

A. M+/M-

B. A+/A+

C. M+/M- A+/A-

A. 2 (M+, M-)

B. 1 (A+)

C. 4 (M+A+, M-A+, M+A-, M-A-)

#5

A homozygous recessive results from the mating of a heterozygote and a parent with a dominant phenotype.

What does this tell you about the parent with a dominant phenotype?

A. +/+

B. +/-

C. -/-

B. +/-

#6

Two parent with the dominant phenotype produce nine offspring. Two have recessive phenotype.

What does this tell you about the genotype of the parents?

A. +/+

B. +/-

C. -/-

B. +/-

#7

One parent has a dominant phenotype and the other has a recessive phenotype.

Two children result and both have the dominant phenotype.

What phenotypes are possible for the parent with the dominant phenotype?

A. +/+

B. +/-

C. -/-

A,B

#8

True or False. An individual with majority genetic ancestry from Europe would be expected to share more neanderthal SNVs than an individual with majority genetic ancestry from sub-Saharan Africa. Why?

True, an individual with majority genetic ancestry from Europe has more neanderthal SNVs

#9

In [3]:
Image(filename='/Users/olgabelitskaya/Downloads/bio023.jpg')
Out[3]:

#10

In [2]:
Image(filename='bio024.jpg')
Out[2]:

#11

In [3]:
Image(filename='bio025.jpg')
Out[3]:

#12

In [4]:
Image(filename='bio026.jpg')
Out[4]:

#13

In [5]:
Image(filename='bio027.jpg')
Out[5]:

#14

In [6]:
Image(filename='bio028.jpg')
Out[6]:

#15

In [7]:
Image(filename='bio029.jpg')
Out[7]:

#16

In [8]:
Image(filename='bio030.jpg')
Out[8]:
In [13]:
# p(child -/-) = 1/4 (variants +/+, +/-, +/-, -/-)
# p(mother +/-) = 2/3 (variants from her mother +/+, +/-, +/-) * 1/2 (variants +/+, +/+, +/-, -/-) = 1/3
# p(father +/-) = 1
# 1 * 1/3 * 1/4 = 1/12

#17

In [14]:
Image(filename='/Users/olgabelitskaya/Downloads/bio031.jpg')
Out[14]:
In [ ]:
# p(daughter +/-) = 1/2 (variants +/+, +/-)
# father (X+Y), mother (X+X-)

#18

In [9]:
Image(filename='bio032.jpg')
Out[9]:
In [15]:
# father (X+Y), mother (X+X-)
# p(his daughter +/-) = 1/2 (variants +/+, +/-)
# her next generation with a man (X+Y), p for males (+, -) = 1/2
# 1/2 * 1/2 = 1/4

#19

In [10]:
Image(filename='bio033.jpg')
Out[10]:

#20

In [11]:
Image(filename='bio034.jpg')
Out[11]:

Met-His-Pro-Gly-Ser-Leu-Val

#21

What is the amino acid sequence with mutation X? (Q.20)

Met-Ala-Ser-Gly-Leu-Ile-Ser

#22

What is the amino acid sequence with mutation Y? (Q.20)

Met-His-Pro-Gly-Ser

#23

What is the amino acid sequence with both mutations? (Q.20)

Met-Ala-Ser-Gly-Leu-Leu-Val

#24

In [12]:
Image(filename='bio035.jpg')
Out[12]:

#25

In [13]:
Image(filename='bio036.jpg')
Out[13]:

#26

In [14]:
Image(filename='bio037.jpg')
Out[14]:

#27

In [15]:
Image(filename='bio038.jpg')
Out[15]:

#28

In [16]:
Image(filename='bio039.jpg')
Out[16]:
In [24]:
print 'A', (50+50+300+400)/2000.0

print 'B', (300+75+75+150)/2000.0

print 'C', (150+25+25+400)/2000.0
A 0.4
B 0.3
C 0.3

#29

In [17]:
Image(filename='bio040.jpg')
Out[17]:

#30

In [18]:
Image(filename='bio041.jpg')
Out[18]:

#31

In [19]:
Image(filename='bio042.jpg')
Out[19]:

#32

In [20]:
Image(filename='bio043.jpg')
Out[20]:

#33

In [21]:
Image(filename='bio044.jpg')
Out[21]:

#34

In [22]:
Image(filename='bio045.jpg')
Out[22]:

#35

In [23]:
Image(filename='bio046.jpg')
Out[23]:

#36

In [24]:
Image(filename='bio047.jpg')
Out[24]:

#37

In [25]:
Image(filename='bio048.jpg')
Out[25]:
In [41]:
a1=(2*0.25 + 0.75)/2
b1 = 1-a1
print '1. +',a1, '  -', b1
1. + 0.625   - 0.375
In [42]:
a2=(2*0.1 + 0.3)/2
b2 = 1-a2
print '2. +',a2, '  -', b2
2. + 0.25   - 0.75
In [43]:
a3 = (2*0.1 + 0.8)/2
b3 = 1-a3
print '1. +',a3, '  -', b3
1. + 0.5   - 0.5
In [44]:
a3 = (2*0.25 + 0.5)/2
b3 = 1-a3
print '1. +',a3, '  -', b3
1. + 0.5   - 0.5

#38

In [26]:
Image(filename='bio049.jpg')
Out[26]:

#39

In [27]:
Image(filename='bio050.jpg')
Out[27]:

#40

In [28]:
Image(filename='bio051.jpg')
Out[28]:

#41

In [29]:
Image(filename='bio052.jpg')
Out[29]:

#42

In [30]:
Image(filename='bio053.jpg')
Out[30]:

#43

In [31]:
Image(filename='bio054.jpg')
Out[31]:

#44

In [52]:
4188/(2.0*84000)
Out[52]:
0.02492857142857143
In [32]:
Image(filename='bio055.jpg')
Out[32]: