Stem Cells http://www.stemcellsportal.com/
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


First published online October 25, 2007
Stem Cells Vol. 26 No. 1 January 2008, pp. 119 -126
doi:10.1634/stemcells.2007-0283; www.StemCells.com
© 2008 AlphaMed Press

OPEN ACCESS ARTICLE
This Article
Free via Open Access: OA
Right arrow OA Abstract
Right arrow Full Text (PDF)
Right arrowOA All Versions of this Article:
2007-0283v1
2007-0283v2
26/1/119    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Reprints/Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Thyagarajan, B.
Right arrow Articles by Chesnut, J. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Thyagarajan, B.
Right arrow Articles by Chesnut, J. D.

TECHNOLOGY DEVELOPMENT

Creation of Engineered Human Embryonic Stem Cell Lines Using phiC31 Integrase

Bhaskar Thyagarajana, Ying Liua, Soojung Shina, Uma Lakshmipathya, Kelly Scheyhinga, Haipeng Xuea, Catharina Ellerströmb, Raimund Strehlb, Johan Hyllnerb, Mahendra S. Raoa, Jonathan D. Chesnuta

aInvitrogen Corporation, Carlsbad, California, USA;
bCellartis AB, Göteborg, Sweden

Key Words. Human embryonic stem cells • phiC31 integrase • Site-specific integration

Correspondence: Bhaskar Thyagarajan, B.V.Sc., Ph.D., Invitrogen Corporation, 1600 Faraday Avenue, Carlsbad, California 92008, USA. Telephone: 760-268-7460; Fax: 760-602-6691; e-mail: bhaskar.thyagarajan{at}invitrogen.com

Received April 17, 2007; accepted for publication October 9, 2007.
First published online in STEM CELLS EXPRESS   October 25, 2007.


    ABSTRACT
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
It has previously been shown that the phage-derived phiC31 integrase can efficiently target native pseudo-attachment sites in the genome of various species in cultured cells, as well as in vivo. To demonstrate its utility in human embryonic stem cells (hESC), we have created hESC-derived clones containing expression constructs. Variant human embryonic stem cell lines BG01v and SA002 were used to derive lines expressing a green fluorescent protein (GFP) marker under control of either the human Oct4 promoter or the EF1{alpha} promoter. Stable clones were selected by antibiotic resistance and further characterized. The frequency of integration suggested candidate hot spots in the genome, which were mapped using a plasmid rescue strategy. The pseudo-attP profile in hESC differed from those reported earlier in differentiated cells. Clones derived using this method retained the ability to differentiate into all three germ layers, and fidelity of expression of GFP was verified in differentiation assays. GFP expression driven by the Oct4 promoter recapitulated endogenous Oct4 expression, whereas persistent stable expression of GFP expression driven by the EF1{alpha} promoter was seen. Our results demonstrate the utility of phiC31 integrase to target pseudo-attP sites in hESC and show that integrase-mediated site-specific integration can efficiently create stably expressing engineered human embryonic stem cell clones.

Disclosure of potential conflicts of interest is found at the end of this article.


    INTRODUCTION
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
Human embryonic stem cells isolated from the inner cell mass of 5-day blastocysts have the ability to differentiate into cells of various lineages in culture [1, 2]. The pluripotency of these cells makes them useful as platforms for basic research and drug discovery and could eventually allow them to be used as therapeutic material for diseases that are currently difficult or impossible to treat. The use of ESC-derived cells in these various applications would benefit greatly from the development of systems to facilitate rapid and efficient introduction of exogenous genetic elements into the cell genome.

An example of this is the generation of reporter cell lines, where specific promoter activity can easily be monitored via expression of fluorescent protein. The exact culture conditions and medium additives required to maintain human embryonic stem cells (hESC) in their pluripotent state are currently being identified and optimized. In addition, the pathways involved in directing specific differentiation of hESC are still under investigation. The development of a hESC reporter line expressing a fluorescent marker under control of a developmental stage-specific promoter would aid in the study of stem cell biology by providing a rapid readout that indicates the state of cells in culture. Earlier studies have used randomly integrating vectors, either retroviruses or plasmid DNA, to generate hESC-derived lines expressing green fluorescent protein (GFP) [35]. These studies describe the development of hESC lines expressing GFP driven by either a constitutive (EF1{alpha}), an inducible (EF1{alpha} tied to tetO element), or a lineage-specific (Oct4) promoter. A recent study describes the construction of human embryonic stem cells by transfection of plasmid DNA [6]. Although stable lines were successfully generated, the silencing of randomly integrated transgenes in this study underscores the importance of choosing the appropriate vector.

Use of lentiviral vectors for engineering hESC is popular because of the high efficiency of gene delivery afforded by viral infection. Although this system has proven useful, limited payload capacity and the potential for gene disruption from random integration of DNA could limit its utility in stem cells. Two recent articles describe the generation of hESC lines with integrated transgenes [7, 8]. Vallier et al. describe the use of a recombinase to create GFP-expressing hESC lines [7]. This approach first requires the creation of a recombinase-expressing line, followed by random integration of the expression construct. Although this is an elegant method to induce expression of the gene of interest, the efficiency of producing transgenic lines using this method can be low. Zeng et al. describe a baculovirus vector coupled with elements of adenoassociated virus to obtain transduction, integration, and long-term expression of the transgene [8]. This protocol was efficient at generating transgenic hESC, and the integrated transgene showed long-term expression. The large capacity of baculoviruses also overcomes the disadvantage of retroviruses, which have a limited payload capacity. However, the long-term effects of expression of the adeno-associated virus rep protein in hESC are still unknown, and it is possible that they could lead to undesirable effects [9].

Here, we focus on using a site-specific integration approach to direct expression constructs to transcriptionally active chromosomal regions using phiC31 integrase. This study outlines a method for using this site-specific integration system to generate engineered hESC lines. We describe here the construction of a pluripotency-specific reporter line using the human Oct4 promoter to drive GFP expression, as well as a constitutively expressing GFP line.

The integrase from the Streptomyces phage phiC31 has been shown to target donor plasmids containing a native attB site into pseudo-attP sites in the human genome [10, 11]. phiC31 integrase has been shown to function in vitro, in cell culture systems, as well as in vivo. This integrase has been successfully used in cells derived from a number of species, including human, mouse, rat, rabbit, Chinese hamsters, Drosophila, and plants [1225]. Unlike the better-known recombinases Cre and Flp, the phiC31 integrase catalyzes recombination between two nonidentical sites. This feature, along with the apparent lack of a corresponding excisionase enzyme, makes the recombination reaction unidirectional, ensuring that constructs integrated into the genome do not act as substrates for the reverse reaction. The result is an improvement in integration efficiency compared with random integration. Another extremely useful feature of phiC31 integrase is the ability of the enzyme to target pseudo-attP sites present in the genome. These pseudo-attP sites bear some resemblance to the native attP site and have been shown to be present in transcriptionally active areas of the genome [10]. Many of the sites described have been shown to be in intronic regions of genes. Since these pseudo-attP sites typically tend to be in open chromatin [10], our hypothesis was that there would less interference with expression of the transgene, and any changes in expression of the transgene would be solely due to regulation of the promoter used. These features make this integrase a potentially useful tool for construction of transgenic lines from unmodified cells, since the targeted sites are already present in the genome.

In this study, we have used phiC31 integrase to create variant hESC-derived lines [26, 27] containing the GFP gene driven by either the human Oct4 promoter or the human EF1{alpha} promoter. We also describe a simplified vector construction design using a targeting vector that is a substrate for Multisite Gateway (Invitrogen Corporation, Carlsbad, CA, http://www.invitrogen.com). This greatly reduces the effort involved in cloning, and allows the creation of multiple constructs in the same background and with little effort. The combination of Multisite Gateway technology and site-specific recombinases provides a powerful tool for the construction of transgenic lines in human embryonic stem cells, which in turn can be used as versatile platforms for the study of stem cell biology.


    MATERIALS AND METHODS
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
Plasmid Construction
The plasmids used in this study are shown in Figure 1. The plasmid pCMV-phiC31 Int has been described earlier [18]. The plasmid pB2H1-DEST was cloned as follows. The phiC31 attB site was amplified from the plasmid pBC-PB [18] and cloned into pCR2.1 using the TA Cloning Kit (Invitrogen, Carlsbad, CA, http://www.invitrogen.com) to generate pCR2.1-phiC31attB. This plasmid was restricted with EcoRI to release the attB fragment and treated with Klenow to generate blunt ends. This fragment was ligated with ZraI-restricted pUC19 vector to generate pUC-phiC31attB2. An expression cassette containing the Hygromycin phosphatase gene driven by the HSV-TK promoter was amplified from pTKHyg and T/A-cloned into pCR2.1. The resulting plasmid was restricted with SpeI and EcoRV, treated with Klenow to generate blunt ends, and ligated with pUC-phiC31attB2 that had been restricted with AflIII and treated with Klenow to generate pB2H1. A fragment containing the R1-R2 DEST cassette was amplified from pUC-DEST (Invitrogen) and T/A-cloned into pCR2.1. The resulting plasmid was restricted with SpeI and EcoRV, treated with Klenow, and cloned into pB2H1 treated with SalI and Klenow to generate the plasmid pB2H1-R1R2DEST. This plasmid was used as a recipient for the expression constructs used in this study.


Figure 1
View larger version (20K):
[in this window]
[in a new window]

 
Figure 1. Integration strategy and plasmids used in the study. Multisite Gateway technology was used to assemble the phOG construct from the appropriate Entry vectors and the Destination vector pB2H1-DEST. This plasmid was then used to transfect variant hESC lines, along with a plasmid expressing the phiC31 integrase (pCMV-phiC31 Int). Cotransfection resulted in integration of the expression plasmid into pseudo-attP sites in the genome. Abbreviations: AMP, ampicillin-resistance gene; CM, chloramphenicol resistance gene; gDNA, genomic DNA; GFP, green fluorescent protein; hESC, human embryonic stem cells; Hyg, Hygromycin; pCMV, cytomegalovirus promoter; phOct4, human Oct4 promoter; pTK, thymidine kinase promoter; SV40 pA, Simian Virus 40 polyA; TKpA, thymidine kinase polyA.

 
A 3.2-kilobase fragment containing the human Oct4 promoter [28, 29] was amplified from human genomic DNA using the primers hO-For (5'-GGAGAGGTGGGCCTCACC-3') and hO-Rev (5'-GGGGAAGGAAGGCGCCCC-3'). The resulting fragment was T/A-cloned into pCR2.1 to generate pCR2.1-phOct4. Assembly of the final phOct4-GFP and pEF1a-GFP expression constructs was accomplished by using protocols recommended for Multisite Gateway.

Cell Culture and Transfection
BG01v cells (49, XXY, +12, +17,) were obtained from BresaGen, Inc., (Athens, GA). SA002 cells (47, +13, XY) were obtained from Cellartis AB (Goteborg, Sweden, http://www.cellartis.se). All reagents were obtained from Invitrogen unless indicated otherwise. The cells were maintained either on a mouse embryonic fibroblast (MEF) feeder layer in Dulbecco's modified Eagle's medium (DMEM)/Ham's F-12 medium (F12) medium supplemented with 20% knock-out serum replacement (KSR), 4 ng/ml basic fibroblast growth factor, 1 ml of nonessential amino acids, and 100 µM β-mercaptoethanol or on Matrigel (BD Biosciences, Franklin Lakes, NJ, http://www.bdbiosciences.com) in the same medium conditioned on MEF feeder layer. Fresh medium was provided to the cells every day, and the cells were passaged every 4–5 days.

One day prior to transfection with Lipofectamine 2000 (Invitrogen), cells were treated with Accutase (Sigma-Aldrich, St. Louis, http://www.sigmaaldrich.com) and plated on Matrigel in conditioned medium. Lipofectamine 2000-mediated transfection was carried out according to the manufacturer's protocol. We typically used 4 µg of the expression vector and 4 µg of the phiC31 integrase expression vector to transfect 2 million cells. Control transfections omitted the phiC31 integrase plasmid or the GFP expression vector. After transfection, cells were allowed to recover for 1 day, and selection was started with medium containing Hygromycin at a concentration of 10 µg/ml. After 14–21 days of selection, individual colonies were manually picked and expanded for further analysis.

Electroporation was carried out with the BTX ECM630 electroporator (Harvard Bioscience, Holliston, MA, http://www.btxonline.com). Six to 8 million cells were harvested using Accutase and resuspended in 800 µl of OptiPro SFM (Invitrogen). These cells were placed in an electroporation cuvette with a gap of 0.4 cm. Cells were electroporated with a pulse of 500 V at 250 µF. Electroporated cells were plated on MEF feeders and allowed to recover for 48–72 hours before selection was started with Hygromycin (10 µg/ml; Invitrogen). As with lipid-mediated transfection, individual drug-resistant clones were manually picked and expanded for further analysis.

Plasmid Rescue and Sequence Analysis
Genomic DNA isolated from individual clones was restricted with the restriction enzymes NheI, SpeI, and XbaI. The enzymes were heat-inactivated, and the DNA was self-ligated at low DNA and T4 DNA ligase concentrations. After overnight incubation at 16°C, the DNA was extracted with phenol:chloroform, ethanol-precipitated, and resuspended in water. Electrocompetent DH10B Escherichia coli were then electroporated with the ligated DNA using the Gene Pulser II (Bio-Rad, Hercules, CA, http://www.bio-rad.com) using the recommended conditions. The resulting transformation was plated on Luria-Bertani (LB)-agar plates containing ampicillin. Plasmid DNA isolated from the resulting colonies was sequenced using the primer ChoSeqR (5'-TCCCGTGCTCACCGTGACCAC-3'). Sequence data were analyzed using Sequencher software. The genomic integration site was determined by matching the sequence read to the database at BLAT (http://genome.ucsc.edu) [30].

Analysis of 23 pseudo-site sequences rescued in this study was carried out by the Web-based MEME motif finder (http://meme.sdsc.edu/meme/meme.html) [31]. This program was used to find motifs ranging from 6 to 50 base pairs in 100 base pairs of sequence surrounding the point of crossover. The wild-type phiC31 attP site was also included in the analysis. A common motif was discovered in all the pseudo-sites, and a consensus sequence was generated based on these analyses using WebLogo Version 2.8.2 (http://weblogo.berkeley.edu).

Differentiation and Silencing Assays
Cells were induced to form embryoid bodies in differentiation medium as described, with some modifications [32]. Differentiation medium was composed of DMEM/F12 supplemented with 10% fetal bovine serum, 1% nonessential amino acids, and 100 µM β-mercaptoethanol. Four days after the start of differentiation, embryoid bodies were plated on culture plate to be further differentiated as monolayers. After 21 days, the differentiation potential was measured by immunocytochemistry for markers specific for the three different lineages. Primary antibodies were obtained from various sources and used at the following dilutions: pluripotent marker of Oct4, 1:500 (Abcam, Cambridge, MA, http://www.abcam.com); endoderm marker of {alpha}-Fetoprotein, 1:500 (Santa Cruz Biotechnology Inc., Santa Cruz, CA, http://www.scbt.com); mesoderm marker of Smooth Muscle Actin, 1:200 (Sigma-Aldrich); mesoderm marker of Brachyury, 1:1,000 (R&D Systems Inc., Minneapolis, http://www.rndsystems.com); ectoderm marker of βIII-Tubulin (TUJ1), 1:1,000 (Invitrogen); and ectoderm marker of Nestin, 1:500 (BD Biosciences). Secondary markers were obtained from Molecular Probes (Eugene, OR, http://probes.invitrogen.com) and used at the following dilutions: Alexa 594-conjugated anti-mouse IgG, 1:1,000; and Alexa 594-conjugated anti-rabbit IgG, 1:1,000. Data on GFP expression levels were collected using a FACScan instrument (BD Biosciences), and data were analyzed using FlowJo (Tree Star, Ashland, OR, http://www.treestar.com).


    RESULTS
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
Plasmid Construction and Site-Specific Integration Strategy
Cloning of recombinant DNA molecules involves multiple steps that can be time-consuming and in some cases extremely difficult to achieve. To streamline the process of cloning complex expression constructs, we used Multisite Gateway technology. This involved construction of a Destination vector (pB2H1-DEST in our case; Fig. 1A), which acted as a recipient for the expression elements. Entry vectors containing the promoter and gene to be expressed were constructed via polymerase chain reaction amplification using specific primers flanked by {lambda} phage recombination site sequences. Recombination of the amplified products with the recipient pDONR vectors generated the Entry vectors, which could then be used for multiple constructions. Appropriate Entry vectors were recombined with the Destination vector in one step to generate expression vectors containing the gene of interest driven by promoter of choice. In this study, we used this strategy to generate two vectors that consist of the GFP gene driven by either the constitutive EF1{alpha} promoter or the hESC-specific human Oct4 promoter (Fig. 1B).

We then used phiC31 integrase to insert the plasmids into the hESC genome. This enzyme directs integration of expression vectors into pseudo-attP sites in the human genome in an efficient manner. To this end, we engineered our Destination vector such that it would contain a recombination site for phiC31 integrase. To allow for selection of integration events, we also incorporated the Hygromycin phosphotransferase gene driven by the HSV-TK promoter. To obtain cells with integration events, the cells of interest were transfected with the expression vectors along with a plasmid encoding the expression of phiC31 integrase. The integrase protein catalyzed the integration of the expression vector into genomic pseudo-attP locations. Stable integration events were selected by expression of the drug-resistance marker present on the plasmids.

The expression constructs were transfected in the absence and presence of the phiC31 integrase plasmid into BG01v cells. Typically, the frequency of integration after 2 weeks of drug selection in the presence of integrase was ~2 x 10–5. Data from three controlled experiments show that the average increase in colony number was 1.4-fold over random integration. In the absence of integrase, 80 colonies were obtained from three experiments, and in the presence of integrase, 114 colonies were obtained. These data suggest that phiC31 integrase can mediate integration into pseudosites in hESC.

Pseudosite Profile in hESC
To show that clones obtained were the result of phiC31-mediated site-specific integration, the site of integration was determined by a plasmid rescue strategy. The attB-genome junctions were sequenced, and the data were analyzed by comparison with the BLAT database (http://genome.ucsc.edu/cgi-bin/hgBlat). Table 1 shows the sites of integration of various clones derived from BG01v or SA002. Of 90 clones screened, plasmid rescue data were obtained for 56 clones. Of these, 51 clones were a result of phiC31-mediated integration and 5 were a result of random integration. The chromosomal loci for the random integration events were not determined. The 51 integrase-mediated clones showed integration into 23 different pseudo-attP sites. As has previously been observed, there were small deletions (5–25 bases) observed at the site of integration [11, 15].


View this table:
[in this window]
[in a new window]

 
Table 1. phiC31 pseudo-attP sites in human ESC

 
Our data show that there are numerous hot spots of integration in stem cells, many of which have not been previously reported in other cell types. There are, however, some integration sites that are common to hESC and differentiated cell types such as the 293, HepG2, and D407 lines. The number of integration events at each pseudosite is shown in Table 1. As shown in a previous study, most of these hot spots are present in introns of genes, with a few present in intergenic regions or exons [10]. In this study, we found that the most commonly used integration sites were present on chromosomes 13, 6, 21, 9, 17, and 2. Of these, only the site on chromosome 21 has been observed previously. The other hot spots have not been reported in differentiated cell types and seem to be exclusive to hESC. However, the integration sites on chromosomes 1 and 6 (6p25.2) and the two sites on chromosome 20 have been reported earlier, suggesting that they might also be hot spots for integration. Since the majority of the clones we analyzed were derived from the BG01v line, we could not make a meaningful comparison as to the pseudosite profile in these cells vis a vis SA002 cells. However, two of the hot spots were present in both cell lines, suggesting that there is at least some commonality between the two independently derived lines. A few clones showed integration into multiple sites (data not shown). It was not clear whether the clones that showed integration into multiple sites were a mix of two independent clones or whether those clones truly had multiple integrations.

It has previously been reported that pseudo-attP sites show some similarity to the native phiC31 attP site and that they share a common motif that contains a strong inverted repeat [10]. The pseudosites observed in hESC were subjected to similar analysis, and we found that these sites shared a common motif with the phiC31 attP site (Fig. 2A). This motif is present close to the crossover region in most of the sites, suggesting involvement in the recombination reaction. A consensus sequence for this motif was derived using the MEME motif finder [31]. The consensus sequence of this motif is shown in Figure 2B. The consensus shows a strong inverted repeat centered on the core, providing further evidence for the hypothesis that the integrase binds to each half-site [33].


Figure 2
View larger version (34K):
[in this window]
[in a new window]

 
Figure 2. DNA sequence features of phiC31 hot spots. PhiC31 integrase-mediated pseudosites obtained in human ESC were analyzed along with the native phiC31 attP for the presence of a common motif by using the MEME motif finder to analyze 100 base pairs (bp) of genomic DNA surrounding the observed crossover site. (A): Presence of the principal motif in the pseudosites. The 26-bp attP motif appeared in all 24 of the included sequences close to the area of the observed crossover (indicated by the 50-bp midpoint of the sequence). (B): The multilevel consensus sequence derived from the motif shown in (A). The consensus is symmetrical about the core and contains inverted repeats (arrows) extending over the length of the consensus. Also shown is a sequence logo diagram for the MEME motif. The probability of a given base occurring at a position is represented by the size of the letter.

 
Generation of GFP-Expressing hESC Lines
We evaluated both lipid-mediated transfection and electroporation to introduce DNA into BG01v and SA002 cells. Typically, we obtained transfection efficiencies ranging from 5% to 20% with minimal cell death. After transfection, the cells were allowed to recover and then placed under selection with Hygromycin. Drug-resistant colonies obtained after 2 weeks of selection were picked and expanded for further observation. Multiple GFP-expressing clones were obtained with both cell types, and the colonies that were morphologically closest to the parent lines were selected for further analysis. Figure 3A shows bright-field and fluorescent microscope views of three different BG01v-derived lines and one SA002-derived line. Counterstaining with an antibody specific for human Oct4 demonstrated that Oct4 and GFP expression are colocalized.


Figure 3
View larger version (49K):
[in this window]
[in a new window]

 
Figure 3. Coexpression of Oct4 and GFP in variant human ESC clones. (A): Clones resulting from transfection of GFP expression plasmid and phiC31 integrase were picked and expanded, and their integration sites were mapped. Representative hOct4-GFP clones derived from BG01v and SA002 cells and an EF1{alpha}-GFP clone derived from BG01v were analyzed for expression of Oct4 (immunostaining; red) and GFP (fluorescence; green). The cells were counterstained with 4,6-diamidino-2-phenylindole (blue). (BI): Expression of GFP driven by either the Oct4 promoter or the EF1{alpha} promoter. EF1{alpha}-driven expression is typically an order of magnitude higher than Oct4-driven expression. (BII, BIII): Long-term expression of GFP in transgenic lines. phiC31 integrase-derived cells were cultured in the presence of the selectable marker for an extended period, and GFP expression was analyzed by fluorescence-activated cell sorting at regular intervals. Typically, the cells were cultured for at least 10 P, which is approximately 4–5 weeks. Abbreviations: GFP, green fluorescent protein; P, passage.

 
A similar strategy was used to obtain BG01v-derived lines expressing the GFP gene driven by the constitutive human EF1{alpha} promoter [34]. As shown in Figure 3A, the EF1{alpha} promoter directs strong expression of GFP in these cells. Fluorescence-activated cell sorting analysis of three independent Oct4-GFP clones and one EF1{alpha}-GFP clone revealed that the EF1{alpha} promoter directs higher levels of expression compared with the hOct4 promoter (Fig. 3BI). This expression is maintained upon long-term culture, as shown in Figure 3BII and 3BIII. Irrespective of the promoter, there was no significant reduction in GFP expression even after 10 passages, which is approximately 4–5 weeks in culture.

Characteristics of GFP Lines
Three independent BG01v-phOct4-GFP clones (YA06, YA15, and YA18) and one SA002-phOct4-GFP clone (YB1403) were studied for their ability to differentiate into the three germ layers by inducing formation of embryoid bodies (EBs). Immunostaining of the embryoid bodies is shown in Figure 4. Expression of endodermal ({alpha}-Fetoprotein), ectodermal (βIII-Tubulin and Nestin), and mesodermal (muscle-specific actin and Brachyury) markers was detected in EBs derived from all four lines.


Figure 4
View larger version (110K):
[in this window]
[in a new window]

 
Figure 4. Differentiation of Oct4-GFP clones into embryonal lineages. Three BG01v-derived Oct4-GFP lines (YA06, YA15, and YA18) and one SA002-derived Oct4-GFP line (YB1403) were allowed to form embryoid bodies to characterize the differentiation potential of phiC31 integrase-derived lines. Differentiation into the endodermal (AFP), mesodermal (muscle-specific actin and Brachyury), and ectodermal (βIII-Tubulin and Nestin) lineages was analyzed by immunostaining with specific antibodies (red). The cells were counterstained with 4,6-diamidino-2-phenylindole (blue). Abbreviations: AFP, {alpha}-fetoprotein; SMA, smooth muscle actin.

 
Differentiation of human ESC results in downregulation of Oct4 expression. To demonstrate that the promoter fragment used in this study was subject to the same regulation as the native Oct4 promoter, expression of GFP was monitored in EBs derived from the Oct4-GFP transgenic lines. As expected, expression of GFP driven by the human Oct4 promoter was eliminated following differentiation (Fig. 5), showing that elements required for proper control of gene expression are present in the promoter fragment. Furthermore, upon knockdown of Oct4 protein message with inhibitory RNA, we noticed a significant decrease in GFP fluorescence (data not shown). In contrast, expression of GFP driven by the EF1{alpha} promoter was still present upon differentiation.


Figure 5
View larger version (42K):
[in this window]
[in a new window]

 
Figure 5. Effect of differentiation on GFP expression. The BG01v-derived Oct4-GFP clones YA06, YA15, and YA18 and the BG01v-derived EF1{alpha}-GFP clone EG101 were allowed to form embryoid bodies for 21 days under selection, and GFP expression was analyzed by fluorescence-activated cell sorting. The red curves indicate a control line that did not express GFP, green curves indicate undifferentiated cells, and blue curves indicate EBs derived from those cells. GFP expression was shut down in all three Oct4-GFP clones upon formation of embryoid bodies, as opposed to the EF1{alpha}-GFP clone. Abbreviation: GFP, green fluorescent protein.

 

    DISCUSSION
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
Engineered human embryonic stem cell lines will provide an important tool to study their growth, differentiation, and physiology, as well as a method to create cell line platforms useful for studying human disease and screening therapeutic agents. The work presented here is meant to set a foundation for a broader cell engineering effort that combines rapid assembly of complex expression vectors and their site-specific insertion into the cell. The key to this system is an efficient and reliable method of inserting genetic elements that supports controllable and homogeneous expression of multiple genes in a population of cultured hESC. In this study, we have shown that the phage phiC31 integrase can be used to successfully target sites in the genome of hESC and lead to the generation of transgenic lines. We also show that integration into pseudo-attP sites leads to sustained expression of the introduced transgene without significant silencing and that these lines retain pluripotency.

The utility of the site-specific integration system described here is enhanced by the ability to use recombinase-mediated cloning (Multisite Gateway technology) to generate expression constructs for delivery into hESC. As part of this study, we show assembly of an expression vector containing the GFP gene driven by either the human Oct4 promoter or the EF1{alpha} promoter. Although these were relatively simple two-fragment constructions, our laboratory and others commonly use this system to assemble as many as five fragments in one reaction, and assembly of up to eight fragments in two steps has been reported [35]. The ability to assemble multiple regulatory elements and markers using this technology allows the creation of complex expression constructs that would otherwise be next to impossible to generate.

Although site-specific integration using phage integrases ensures stable and controllable levels of gene expression, a main challenge with using plasmid DNA lies in gaining efficient transfection of hESC in culture. In this study, we have obtained satisfactory results using two nonviral gene delivery methods. First, a lipid-based transfection reagent, Lipofectamine 2000, was used to transiently transfect BG01v cells, with efficiencies averaging 10%–20%. A second variant line SA002, however, was refractory to transfection by Lipofectamine 2000. For this line, we used electroporation (BTX ECM630) and obtained reasonable transfection levels, typically between 20% and 30%. Electroporation was also effective in BG01v cells, indicating that this method could potentially be adapted for efficient transfection of other human ESC lines. These methods do not seem to affect the growth characteristics of these cells, as we have been able to keep them in culture for many passages without differentiation. The frequency of stable transfection was independent of the method of transfection used.

One of the attributes of the phiC31 family of integrases is that they tend to target a relatively small number of loci (pseudo-attP sites) in the human genome. Of particular benefit to cell engineering is that these loci tend to exist in transcriptionally active regions. For ESC engineering, it is important to categorize these loci and determine which, if any, remain active upon differentiation. Once a database of active and nonsilenced pseudo-attP sites has been developed, it could be used to help streamline the selection process for subsequent engineered cell lines. Using BG01v and SA002 as relatively easily managed models for hESC lines [26, 27], we sought to map and categorize the loci that we targeted to determine whether pseudosites used in hESC are the same as those previously identified in terminally differentiated cells. We used a plasmid rescue-based strategy that has been used successfully in previous studies to determine the site of integration of the plasmids [10, 11]. We hypothesized that since significant chromatin remodeling occurs from the time cells differentiate past the ESC stage to a terminal adult tissue, we would identify at least some previously unmapped pseudosites in hESC. We were able to obtain integration site data for only 60% of the clones analyzed. It is possible that the clones for which plasmid rescue data could not be obtained are a result of random integration or that the integration site was not amenable to plasmid rescue. At this stage, it is not possible to distinguish between these two events.

Interestingly, our data indicate that the pseudo-attP site profile in human ESC is clearly distinct from that seen in differentiated cells. Of the 23 pseudosites that we identified in this study, only five have been reported previously. All the others, including the two most prominent hot spots for recombination in hESC (6p11.2 and 13q32.3), have not been identified previously. We are currently determining expression levels at these pseudosites in human ESC and the effect of differentiation on expression level. Only 2 of the 23 pseudo-attP sites were present in exons, and of these, only the site on chromosome 1 is a possible hot spot. The gene disrupted by integration into the chromosome 1 pseudosite, CDCP2 (NM_201546 [GenBank] ), has not been the subject of extensive study and is described as being weakly similar to Procollagen C-Proteinase enhancer protein. All the other hotpots for integration are either in intergenic regions or in introns of genes, suggesting that phiC31-mediated integration into stem cell lines will not result in deleterious consequences. Previous reports have suggested that the number of phiC31 pseudo-attP sites could be as high as 1,000. This suggests that the system may not be as site-specific as one would prefer. Our data, however, suggest that a large fraction of the integration events (30%–60%) occur in fewer than 10 hot spots in hESC. These data are very similar to those obtained in differentiated and transformed cells [11, 15]. Given the observed frequency of targeting at hot spots of recombination, one could screen approximately 20 colonies and have a high probability of obtaining one that is in a desirable locus. This offers a significant advantage over established methods of generating transgenic hESC lines.

Analysis of the pseudo-attP sequences revealed the presence of a common sequence motif at the crossover site (Fig. 3). The consensus sequence derived for this motif showed the presence of inverted repeats around the core, similar to earlier reports [10]. The consensus sequence that we observed shows significant identity to the previously reported consensus [10]. Of the 26-base pair sequence, 17 bases are identical to the previously reported consensus at the first level, and 25 of the 26 bases are identical when the second and third level consensus sequences are included. These data strongly suggest that phiC31 integrase is targeting pseudo-attP sites with a very similar sequence pattern in hESC. The difference in observed pseudosites could be attributed to the differences in open regions of the chromosome between hESC and terminally differentiated cells.

As expected, GFP expression driven by the Oct4 promoter was not as strong as the expression driven by the EF1{alpha} promoter. The Oct4 promoter fragment that was used in this study has been shown to have the elements necessary for expression in ESC [28, 29]. Our studies confirmed that this is indeed the case, as can be seen in multiple lines (Fig. 2). GFP expression coincided with Oct4 expression and was not present in cells that did not express Oct4, indicating that the promoter fragment used was ESC-specific. This was further confirmed by differentiation of the Oct4-GFP cells into embryoid bodies. GFP expression driven by the Oct4 promoter was completely absent after differentiation, and EBs derived from these cells could not be distinguished from a control line. In contrast, despite a slight reduction in expression, we were still able to detect GFP expression driven by the EF1{alpha} promoter after differentiation. Integration into pseudosites mediated by phiC31 integrase did not affect the pluripotency of these cells. Upon differentiation, the cells stopped expressing GFP and began expressing markers of differentiation. As seen in Figure 4, these cells maintained the ability to differentiate into all three lineages, indicating that they had not lost their pluripotency. The EF1{alpha}-GFP clone was also able to differentiate into all three lineages (data not shown).

The ability to target well-expressed loci in unmodified cells allows for easy creation of transgenic ESC lines expressing markers of interest. Multisite Gateway technology facilitates the construction of plasmids with multiple expression constructs. Combining Multisite Gateway technology with site-specific integration provides a powerful tool to study stem cell biology and differentiation and an efficient solution to some of the factors impeding generation of stably transfected hESC clones. The use of phiC31 integrase allows for efficient generation of stable clones with sustained expression of the gene of interest. Therefore, this system satisfies several of the criteria that we set for the ideal cell engineering tool: reliable and stable insertion into the host genome, long-term expression of the introduced genes, and no detectable adverse effects. The method described here targets pseudo-attP sites, some of which are hot spots for integration. These sites can be mapped and characterized as to their position with respect to neighboring genes. As expected from previous studies, most pseudo-attP sites identified in this study appear in regions not known to code for any open reading frames. This does not rule out the possibility that insertion disrupts expression of noncoding RNA or some yet undefined regulatory region, however. The sites identified in this study are mostly distinct from pseudo-attP sites previously described in terminally differentiated and transformed cells. The ability to map the insertion sites and categorize them as hot spots allows them to be analyzed with respect to their transcriptional activity after differentiation to specific lineages. This offers the opportunity to create a database of hot spots and the embryonic lineages they may be active in. New cell clones could then be generated, mapped, and compared with the database for a rapid triage of those usable for a particular experiment.


    DISCLOSURE OF POTENTIAL CONFLICTS OF INTEREST
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 
B.T., S.S., Y.L., U.L., H.X., M.S.R., and J.D.C. own stock in Invitrogen Corporation. C.E., R.S., and J.H. own stock in Cellartis AB.


    REFERENCES
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Disclosure of Potential...
 References
 

  1. Hoffman LM, Carpenter MK. Characterization and culture of human embryonic stem cells. Nat Biotechnol 2005;23:699–708.[CrossRef][Medline]

  2. Thomson JA, Itskovitz-Eldor J, Shapiro SS et al. Embryonic stem cell lines derived from human blastocysts. Science 1998;282:1145–1147.[Abstract/Free Full Text]

  3. Gerrard L, Zhao D, Clark AJ et al. Stably transfected human embryonic stem cell clones express OCT4-specific green fluorescent protein and maintain self-renewal and pluripotency. STEM CELLS 2005;23:124–133.[Abstract/Free Full Text]

  4. Liu YP, Dovzhenko OV, Garthwaite MA et al. Maintenance of pluripotency in human embryonic stem cells stably over-expressing enhanced green fluorescent protein. Stem Cells Dev 2004;13:636–645.[CrossRef][Medline]

  5. Zhou BY, Ye Z, Chen G et al. Inducible and reversible transgene expression in human stem cells after efficient and stable gene transfer. STEM CELLS 2007;25:779–789.[Abstract/Free Full Text]

  6. Liew CG, Draper JS, Walsh J et al. Transient and stable transgene expression in human embryonic stem cells. Stem Cells 2007;25:1521–1528.[Abstract/Free Full Text]

  7. Vallier L, Alexander M, Pedersen R. Conditional gene expression in human embryonic stem cells. Stem Cells 2007;25:1490–1497.[Abstract/Free Full Text]

  8. Zeng J, Du J, Zhao Y et al. Baculoviral vector-mediated transient and stable transgene expression in human embryonic stem cells. STEM CELLS 2007;25:1055–1061.[Abstract/Free Full Text]

  9. McCarty DM, Young SM Jr, Samulski RJ. Integration of adeno-associated virus (AAV) and recombinant AAV vectors. Ann Rev Genet 2004;38:819–845.[CrossRef][Medline]

  10. Chalberg TW, Portlock JL, Olivares EC et al. Integration specificity of phage phiC31 integrase in the human genome. J Mol Biol 2006;357:28–48.[CrossRef][Medline]

  11. Thyagarajan B, Olivares EC, Hollis RP et al. Site-specific genomic integration in mammalian cells mediated by phage phiC31 integrase. Mol Cell Biol 2001;21:3926–3934.[Abstract/Free Full Text]

  12. Allen BG, Weeks DL. Transgenic Xenopus laevis embryos can be generated using phiC31 integrase. Nat Methods 2005;2:975–979.[CrossRef][Medline]

  13. Bateman JR, Lee AM, Wu CT. Site-specific transformation of Drosophila via phiC31 integrase-mediated cassette exchange. Genetics 2006;173:769–777.[Abstract/Free Full Text]

  14. Bauer JW, Laimer M. Gene therapy of epidermolysis bullosa. Expert Opin Biol Ther 2004;4:1435–1443.[CrossRef][Medline]

  15. Chalberg TW, Genise HL, Vollrath D et al. phiC31 integrase confers genomic integration and long-term transgene expression in rat retina. Invest Ophthalmol Vis Sci 2005;46:2140–2146.[Abstract/Free Full Text]

  16. Dong J, Stuart GW. Transgene manipulation in zebrafish by using recombinases. Methods Cell Biol 2004;77:363–379.[Medline]

  17. Groth AC, Fish M, Nusse R et al. Construction of transgenic Drosophila by using the site-specific integrase from phage phiC31. Genetics 2004;166:1775–1782.[Abstract/Free Full Text]

  18. Groth AC, Olivares EC, Thyagarajan B et al. A phage integrase directs efficient site-specific integration in human cells. Proc Natl Acad Sci U S A 2000;97:5995–6000.[Abstract/Free Full Text]

  19. Khan MS, Khalid AM, Malik KA. Phage phiC31 integrase: A new tool in plastid genome engineering. Trends Plant Sci 2005;10:1–3.[CrossRef][Medline]

  20. Ma QW, Sheng HQ, Yan JB et al. Identification of pseudo attP sites for phage phiC31 integrase in bovine genome. Biochem Biophys Res Commun 2006;345:984–988.[CrossRef][Medline]

  21. Nakayama G, Kawaguchi Y, Koga K et al. Site-specific gene integration in cultured silkworm cells mediated by phiC31 integrase. Mol Genet Genomics 2005;1–8.

  22. Olivares EC, Hollis RP, Chalberg TW et al. Site-specific genomic integration produces therapeutic Factor IX levels in mice. Nat Biotechnol 2002;20:1124–1128.[CrossRef][Medline]

  23. Ortiz-Urda S, Thyagarajan B, Keene DR et al. Stable nonviral genetic correction of inherited human skin disease. Nat Med 2002;8:1166–1170.[CrossRef][Medline]

  24. Thorpe HM, Smith MC. In vitro site-specific integration of bacteriophage DNA catalyzed by a recombinase of the resolvase/invertase family. Proc Natl Acad Sci U S A 1998;95:5505–5510.[Abstract/Free Full Text]

  25. Thyagarajan B, Calos MP. Site-specific integration for high-level protein production in mammalian cells. Methods Mol Biol 2005;308:99–106.[Medline]

  26. Heins N, Englund MC, Sjoblom C et al. Derivation, characterization, and differentiation of human embryonic stem cells. STEM CELLS 2004;22:367–376.[Abstract/Free Full Text]

  27. Zeng X, Chen J, Liu Y et al. BG01V: A variant human embryonic stem cell line which exhibits rapid growth after passaging and reliable dopaminergic differentiation. Restor Neurol Neurosci 2004;22:421–428.[Medline]

  28. Nordhoff V, Hubner K, Bauer A et al. Comparative analysis of human, bovine, and murine Oct-4 upstream promoter sequences. Mamm Genome 2001;12:309–317.[CrossRef][Medline]

  29. Yeom YI, Fuhrmann G, Ovitt CE et al. Germline regulatory element of Oct-4 specific for the totipotent cycle of embryonal cells. Development 1996;122:881–894.[Abstract]

  30. Kent WJ, Sugnet CW, Furey TS et al. The human genome browser at UCSC. Genome Res 2002;12:996–1006.[Abstract/Free Full Text]

  31. Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 1994;2:28–36.[Medline]

  32. Bhattacharya B, Cai J, Luo Y et al. Comparison of the gene expression profile of undifferentiated human embryonic stem cell lines and differentiating embryoid bodies. BMC Dev Biol 2005;5:22.[CrossRef][Medline]

  33. Smith MC, Thorpe HM. Diversity in the serine recombinases. Mol Microbiol 2002;44:299–307.[CrossRef][Medline]

  34. Kim DW, Uetsuki T, Kaziro Y et al. Use of the human elongation factor 1 alpha promoter as a versatile and efficient expression system. Gene 1990;91:217–223.[CrossRef][Medline]

  35. Sone T, Yahata K, Sasaki Y et al. Multi-gene gateway clone design for expression of multiple heterologous genes in living cells: Modular construction of multiple cDNA expression elements using recombinant cloning. J Biotechnol 2005; [Epub ahead of print].




This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
N. Sharma, B. Moldt, T. Dalsgaard, T. G. Jensen, and J. G. Mikkelsen
Regulated gene insertion by steroid-induced {Phi}C31 integrase
Nucleic Acids Res., June 1, 2008; 36(11): e67 - e67.
[Abstract] [Full Text] [PDF]


This Article
Free via Open Access: OA
Right arrow OA Abstract
Right arrow Full Text (PDF)
Right arrowOA All Versions of this Article:
2007-0283v1
2007-0283v2
26/1/119    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Reprints/Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Thyagarajan, B.
Right arrow Articles by Chesnut, J. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Thyagarajan, B.
Right arrow Articles by Chesnut, J. D.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
STEM CELLS THE ONCOLOGIST CME ALPHAMED PRESS JOURNALS
http://www.stemcellsportal.com/