Immunogenomics: Difference between revisions
Line 29: | Line 29: | ||
[[Image:VDJ recombination.png|thumb|400px|Simplistic overview of V(D)J recombination of immunoglobulin heavy chains]] | [[Image:VDJ recombination.png|thumb|400px|Simplistic overview of V(D)J recombination of immunoglobulin heavy chains]] | ||
* Apart from transposable elements, virus insertions and random mutations, all somatic cells (=not egg or sperm) have the same genome. There are only two types of cells that change their genome somatically: B-cells and T-cells. | * Apart from transposable elements, virus insertions and random mutations, all somatic cells (=not egg or sperm) have the same genome. There are only two types of cells that change their genome somatically: B-cells and T-cells. | ||
* B-cells are one of these two somatic cell types that can glue together two different parts of their genome (recombine), in what is called [http://en.wikipedia.org/wiki/VDJ_recombination somatic recombination]. This happens during embryonic development and only once. The loci that recombine are called IGH@ (heavy chains), IGK@ and IGL@ (light chains) (how the heck do you pronounce the @ here?). They are located on chromosomes 14, 2 and 22. (NB: do not confuse IGH, the locus, with the antibody types IgA, IgE, IgM, IgD and IgG). | * B-cells are one of these two somatic cell types that can glue together two different parts of their genome (recombine), in what is called [http://en.wikipedia.org/wiki/VDJ_recombination somatic recombination]. This happens during embryonic development and only once. The loci that recombine are called [http://en.wikipedia.org/wiki/IGH@ IGH@] (heavy chains), IGK@ and IGL@ (K=kappa, L=lamba, both are light chains) (how the heck do you pronounce the @ here?). They are located on chromosomes 14 (heavy), 2 (kapp) and 22 (lambda). (NB: do not confuse IGH, the locus, with the antibody types IgA, IgE, IgM, IgD and IgG). | ||
* On these loci many (somewhere between 6-60) little genes are located next to each other. From left to right, they are separated into a stretch of V genes, then D genes, then J genes, though light chain loci do not have D genes, so we usually write it as V(D)J. On the genome browser, these loci [http://genome.ucsc.edu/cgi-bin/hgTracks?position=chr14:105994256-107283087&db=hg19 don't look great], because all VDJ genes are all very similar, so the RefSeq sequences match everywhere and are filtered out. | * On these loci many (somewhere between 6-60) little genes are located next to each other. From left to right, they are separated into a stretch of V genes, then D genes, then J genes, though light chain loci do not have D genes, so we usually write it as V(D)J. On the genome browser, these loci [http://genome.ucsc.edu/cgi-bin/hgTracks?position=chr14:105994256-107283087&db=hg19 don't look great], because all VDJ genes are all very similar, so the RefSeq sequences match everywhere and are filtered out. | ||
* All Vs, Ds and Js are separated by [http://en.wikipedia.org/wiki/Recombination_Signal_Sequences recombination signal sequences]. The recombinase is activated for a short time during embryonic B-cell development (the mammals probably got that enzyme that from a transposase) and first recombines a D to J and then a V to this DJ combination. | * All Vs, Ds and Js are separated by [http://en.wikipedia.org/wiki/Recombination_Signal_Sequences recombination signal sequences]. The recombinase is activated for a short time during embryonic B-cell development (the mammals probably got that enzyme that from a transposase) and first recombines a D to J and then a V to this DJ combination. |
Revision as of 22:35, 28 June 2011
Intro
Our blood is so energy rich that any quickly dividing organism could use all of it within hours and would kill us. Therefore, our resources have to be defended against any form of cell that tries to steal it. These cells can be either invaders that got across the skin (bacteria, worms, insects). They can also be our own cells that might suddenly use too much energy (infected by a virus or with a mutated cancer-genome).
Immunology, like any part of biology, is daunting at first because of the many strange technical terms, many of them derived from ancient Greek. Learning about the immune system revolves mostly around many different types of cells that are transported by our blood together with the red blood cells. When you spin blood in a centrifuge, you get three layers: red, white and transparent water (called "plasma"). The immune system cells accumulate in a white layer above the red blood cells. The different cells of the immune system are called white blood cells or leukocytes (leuko=white, cyte=cell).
They travel towards the organs in the blood vessels and return via the lymphatic system (lympha=water), a network of conduits through the whole body that empties into the heart, from where they re-enter the blood stream. There are eight major types of cells and we will not cover all of them here. The most difficult part of immunology is the communication between the different cell types, but we will not cover this here either.
Some of these cell types (like macrophages, neutrophils or natural killer cells) can destroy other cells with various means (not covered here). The tricky part is how do these cells recognize foreign cells or mutated body cells. This task is accomplished by one type of cells, called the lymphocytes (lympho=related to the lymphatic system, cyto=cell). Lymphocytes can be divided into B-cells, produced in the bone marrow, and T-cell, produced in the thymus.
T-Cells check body cells and try to recognize the "unusual" ones by scanning them with special receptors on their cell surface. If they think that the cell is "unusual", they will then recruit other cells to kill them, induce them to commit suicide or kill them directly.
B-Cells recognize non-body stuff, which is called antigen. They do this by either binding to a part of the the antigen called epitope with receptors on their cell membrane. Or they can secrete these receptors in the form of antibodies into the blood (or mucus) where the antibodies will bind themselves to antigen, without any other cell around.
B-Cells
- B-Cells recognize non-body stuff, which is called antigen. They do this by either binding to a part of the the antigen called epitope with receptors on their cell membrane. Or they can secrete these receptors in the form of antibodies into the blood (or mucus) where the antibodies will bind themselves to antigen, without any other cell around.
- An Antibody comes in five different flavors (in placental mammals) IgA, IgD, IgE, IgG, IgM. They all share the same basic structure, but differ in their additions. Somehow we assume that IgG is the most important, because it is the most common type and we don't know that much about the other types anyways.
- IgG is one of those antibody types that looks like a Y. The *ends* of the arms of the Y recognize the antigen, the *ends* not the middle part.
- The two arms of the Y of all antibodies - and so IgG - are composed of two identical Heavy and two identical Light Chains
- The two ends of the Y, the part that recognizes the antigen are together called Fab. One Fab comprises a bit of the heavy chain and a bit of the light chain. Not everything touches the antigen, so the most important region in the Fab fragment is called Fv (variable).
- The Fv can be split into its three main parts, the complementary determining regions, called CDR1, CDR2 and CDR3. Remember that there is a light and a heavy chain in one Fv, so there are six complementary determining regions in total.
- Of the three heavy-chain CDRs, CDR3 has a much larger diversity than CDR1 and CDR2. Structural studies showed that CDR3 is the main antigen-binding site. It was believed that CDR1 and CDR2 interacted with the MHC molecule, while CDR3 interacted with the antigen. However, later CDR1 was also shown to have some contact with the antigen. CDR1 and CDR2 are easier to investigate (no hypermutation, see below)
- For a long time, no one knew how the body produces the right Fa fragments that bind to "non-self" but not to "self". A popular theory was that proteins "fold" themselves into the right shape, around some part of antigen (epitope). This turned out to be wrong, the answer lies in a particular property of the genome of these cells: Each B-cell has a different genome and each cell produces only one antibody.
B-cell genomes
- Apart from transposable elements, virus insertions and random mutations, all somatic cells (=not egg or sperm) have the same genome. There are only two types of cells that change their genome somatically: B-cells and T-cells.
- B-cells are one of these two somatic cell types that can glue together two different parts of their genome (recombine), in what is called somatic recombination. This happens during embryonic development and only once. The loci that recombine are called IGH@ (heavy chains), IGK@ and IGL@ (K=kappa, L=lamba, both are light chains) (how the heck do you pronounce the @ here?). They are located on chromosomes 14 (heavy), 2 (kapp) and 22 (lambda). (NB: do not confuse IGH, the locus, with the antibody types IgA, IgE, IgM, IgD and IgG).
- On these loci many (somewhere between 6-60) little genes are located next to each other. From left to right, they are separated into a stretch of V genes, then D genes, then J genes, though light chain loci do not have D genes, so we usually write it as V(D)J. On the genome browser, these loci don't look great, because all VDJ genes are all very similar, so the RefSeq sequences match everywhere and are filtered out.
- All Vs, Ds and Js are separated by recombination signal sequences. The recombinase is activated for a short time during embryonic B-cell development (the mammals probably got that enzyme that from a transposase) and first recombines a D to J and then a V to this DJ combination.
- While this is going on, a terminaldeoxyneucleotidetransferase adds and deletes some nucleotides between the different segments to add a little bit of noise between them.
B-cell diversity
- As a result, each B-cell has its own unique combination of VDJ genes in all of their loci. That means that each B-cell is different from each other B-cell when we are very young.
- In each cell, this V(D)J-combination is transcribed to produce a heavy or light chain immunoglobulin protein and assembled into receptors
- At around 2 months of embryonic development, all B-cells that bind something in the body kill themselves. At our age, only B-cells circulate in our blood that bind to stuff that is *not* part of the body. All others have been eliminated already.
- In any of these B-cells, when the receptors built from the VDJ combinations bind to some antigen, this particular B-cell will start to divide and many of the daughter cells will produce antibodies from this VDJ combination. In addition, the daughter cells will hypermutate their VDJ segments and those that bind better will turn out to produce even more daughter cells and more antibodies. Macrophages will then kill the cells that are marked with antibodies.
Immunogenomics
- To amplify the VDJ fragment with PCR, primers are designed for each V and each J gene in the genome
- Blood is extracted and PCR is run on the mixed cells
- The resulting DNA is sequenced and the identity of each V, D, J determined by blasting against a database of VDJ genes
- The number of each VDJ combination can be determined as a result
- The big dream is to help identify the origin of allergies (which antibody? what does it bind to?) with this, to detect diseases before they produce symptoms (e.g. by finding antibodies in them) or to find antibodies against tumors. No one has achieved this yet.
- Q: Somehow we think that the heavy chains are more important than the light chains, and CDR3 more important than CDR1 and CDR2
- The heavy chains are encoded by one of the few loci that can recombine somatically, the IGH-locus