Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells

Harlan S Robins, Paulo V Campregher, Santosh K Srivastava, Abigail Wacher, Cameron J Turtle, Orsalem Kahsai, Stanley R Riddell, Edus H Warren, Christopher S Carlson, Harlan S Robins, Paulo V Campregher, Santosh K Srivastava, Abigail Wacher, Cameron J Turtle, Orsalem Kahsai, Stanley R Riddell, Edus H Warren, Christopher S Carlson

Abstract

The adaptive immune system uses several strategies to generate a repertoire of T- and B-cell antigen receptors with sufficient diversity to recognize the universe of potential pathogens. In alphabeta T cells, which primarily recognize peptide antigens presented by major histocompatibility complex molecules, most of this receptor diversity is contained within the third complementarity-determining region (CDR3) of the T-cell receptor (TCR) alpha and beta chains. Although it has been estimated that the adaptive immune system can generate up to 10(16) distinct alphabeta pairs, direct assessment of TCR CDR3 diversity has not proved amenable to standard capillary electrophoresis-based DNA sequencing. We developed a novel experimental and computational approach to measure TCR CDR3 diversity based on single-molecule DNA sequencing, and used this approach to determine the CDR3 sequence in millions of rearranged TCRbeta genes from T cells of 2 adults. We find that total TCRbeta receptor diversity is at least 4-fold higher than previous estimates, and the diversity in the subset of CD45RO(+) antigen-experienced alphabeta T cells is at least 10-fold higher than previous estimates. These methods should prove valuable for assessment of alphabeta T-cell repertoire diversity after hematopoietic cell transplantation, in states of congenital or acquired immunodeficiency, and during normal aging.

Figures

Figure 1
Figure 1
Strategy for PCR amplification, hybridization, and sequencing of rearranged TCRβ CDR3 regions. A generic rearranged TCRβ CDR3 region PCR product is shown, indicating the constituent Vβ segment, Dβ segment, Jβ segment, and the nontemplated nucleotides inserted at the Vβ-Dβ and Dβ-Jβ junctions. Universal adapter sequences that permit solid-phase PCR on the Illumina Genome Analyzer Cluster Station (GA F and GA R) are incorporated into the 5′ and 3′ ends of the PCR products that capture each rearranged TCRβ CDR3 region. Forty-five forward primers were designed, each specific to a single functional Vβ segment or a small family of Vβ segments. The 3′ end of each Vβ forward primer is anchored at position −43 in the Vβ segment, relative to the recombination signal sequence, thereby providing a unique Vβ tag sequence within the amplified region. Vβ forward primers were designed for all known nonpseudogenes in the TCRβ locus. The 13 reverse primers specific to each Jβ segment are anchored in the 3′ intron, with the 3′ end of each primer crossing the intron/exon junction. The Jβ reverse primers were designed to be anchored at their 3′ ends on a consensus splice site motif to minimize overlap with the sequencing primers. Thirteen sequencing primers were designed that are complementary to the amplified portion of the Jβ segment, such that the first few bases of sequence generated will capture the unique Jβ tag sequence. The sequencing primers were designed so that promiscuous priming of a sequencing reaction for one J segment by a primer specific to another J segment would generate sequence data starting at exactly the same nucleotide as sequence data from the correct sequencing primer.
Figure 2
Figure 2
Observed TCRβ CDR3 sequence copy number per 5 mL whole blood. Frequency histograms of TCRβ CDR3 sequences observed in 4 different T-cell subsets distinguished by expression of CD4, CD8, and CD45RO and present in 5 mL blood of one male donor. For example, the square at 200,10 means that 10 unique sequences were each observed 200 times in the CD4+CD45RO+ (antigen-experienced) T-cell sample. The data were resampled from the sequences generated by the Genome Analyzer to approximate the expected CDR3 sequence distribution in the T cells present in 5 mL blood, as determined by flow cytometry. A small set of sequences found in the CD45RO+ compartments were found with very large copy number (> 10 000 copies) but are not displayed.
Figure 3
Figure 3
Assessment of PCR bias. The rearranged TCRβ CDR3 regions present in approximately 30 000 T-cell genomes were amplified through 25 cycles of PCR, and the PCR products were split into 2 pools. One pool was amplified an additional 15 cycles, and then the PCR products from the 25-cycle and 40-cycle reactions were sequenced in separate lanes of a GA1 flow cell. Of the TCRβ CDR3 sequences observed in the 25-cycle PCR lane, 97% were also observed in the 40-cycle PCR lane. Each point on the graph represents a single unique CDR3 sequence, plotted according to the number of times that sequence was observed in the data from 25-cycle (abscissa) and 40-cycle (ordinate) PCR reactions, respectively. The density of sequences at each point in the plot is indicated by color, with purple the highest density and red the lowest. The solid line represents a linear regression of the data, and the dotted lines 1 SD above and below the mean.
Figure 4
Figure 4
Jβgene segment use in 4 different T-cell compartments. Jβ gene segment use of TCRβ CDR3 sequences observed in the 4 different flow cytometrically defined T-cell compartments from donor 1.
Figure 5
Figure 5
Relative abundance of unique TCRβ CDR3 sequences correlates inversely with divergence from germline. (A) Observed frequency (top panels) and average observed frequency (bottom panels) of TCRβ CDR3 sequences in the CD8+CD45RO+/− and CD4+CD45RO+/− T-cell compartments of 2 male donors plotted, from left to right, according to their Jβ and Vβ gene segment use, CDR3 length, and total number of (inserted + deleted) nucleotides at the Vβ-Dβ and Dβ-Jβ junctions. (B) Heat map representation of relative abundance of TCRβ CDR3 sequences observed in CD4+ naive, CD4+ memory, and CD8+ naive T-cell compartments of the 2 male donors arrayed according to the number of nucleotides deleted or inserted at the Vβ-Dβ and Dβ-Jβ junctions. Color indicates the log10(observed frequency) of the sequences with the indicated number of inserted or deleted junctional nucleotides.
Figure 6
Figure 6
Direct TCRβ CDR3 sequencing captures all of the TCR diversity information present in a conventional spectratype. (A) Comparison of standard TCRβ spectratype data and calculated TCRβ CDR3 length distributions for sequences using representative TCR Vβ gene segments and present in CD4+CD45RO+ cells from male donor 1. CDR3 length is plotted along the x-axis and the number of unique CDR3 sequences with that length (GA sequence data) or the relative intensity of the corresponding peak in the spectratype is plotted along the y-axis. Reducing the information contained in the GA sequence data to a frequency histogram of the unique CDR3 sequences with different lengths within each Vβ family readily reproduces all of the information contained in the spectratype data. The length of the differently colored segments within each bar of the histograms indicates the fraction of unique CDR3 sequences that were observed 1 to 5 times (black), 6 to 10 times (blue), 11 to 100 times (green), or more than 100 times (red). (B) A representative “virtual spectratype” of TCRβ CDR3 sequences extracted from CD4+CD45RO+ T cells from donor 1 that use the Vβ10 gene segment. The CDR3 sequences using Vβ10 were sorted by CDR3 length into a frequency histogram, and the sequences within each length bin were then color-coded on the basis of their Jβ use. The inset shows all of the CDR3 sequences using Vβ10 and Jβ2-6, and having a length of 39 nt, as well as the number of times that each of these sequences was observed in the data. The origin of the nucleotides in each sequence is color-coded as follows: Vβ gene segment, red; template-independent N nucleotide, black; Dβ gene segment, blue; Jβ gene segment, green.

Source: PubMed

3
Subscribe