
- 积分
- 0
- 威望
- 0
- 包包
- 5
|
Dynamic transcription programs during ES cell differentiation towards 3 m% u, M4 M, s3 s$ V
mesoderm in serum versus serum-freeBMP4 culture Stephen J Bruce1,2 , Brooke B Gardiner1,3,4 , Les J Burke1,4 , M Milena , _- h7 }. f+ N8 m- v) S6 k
Gongora1,3 , Sean M Grimmond1,3,4 and Andrew C Perkins1,2,3,4
: c X4 _6 n1 V0 p2 R7 |1 z 1Institute for Molecular Bioscience, University of Queensland, Brisbane,
! |* u! @3 K" Z- O Queensland, Australia& S6 k3 E( ^7 H+ k( A, T: s& `( q
2Wesley Research Institute, Queensland, Australia
& G, W+ X6 a# D1 x' [- w' T 3ARC Special Research Centre in Functional and Applied Genomics, Australia& U. t5 U3 K6 [, E' B
4Australian Stem Cell Centre, Melbourne, Australia
! H" @0 x6 ~# t3 X author email corresponding author email( A/ u2 b7 {/ t/ \: F! L
BMC Genomics 2007, 8:365doi:10.1186/1471-2164-8-365
3 }2 A% Y, w) E* P7 E, { The electronic version of this article is the complete one and can be
5 q7 V* o, w+ s found online at: http://www.biomedcentral.com/1471-2164/8/365
2 q% I e" H) `- ? Received:28 May 2007$ _7 G ]9 y( D! r
Accepted:10 October 2007( W* m. Q1 H+ C. u; Z, P: ~' w
Published:10 October 2007
. t( A( T' u2 f' ] P
/ r7 z# M( g) }1 v © 2007 Bruce et al; licensee BioMed Central Ltd.
' v$ B: p" n9 @4 B: [ This is an Open Access article distributed under the terms of the Creative
& q1 o- j6 s; k+ c, d Commons Attribution License (http://creativecommons.org/licenses/by/2.0), # N3 e Y4 x; m1 u# K; N
which permits unrestricted use, distribution, and reproduction in any / a8 \. H9 [0 w1 m: s& O- f
medium, provided the original work is properly cited.
4 r9 N U3 X! |% i+ t; G w Abstract
: K5 ^, ~ n/ h. u8 h6 _ Background
7 K7 a% D0 M" {. [; j2 s Expression profiling of embryonic stem (ES) cell differentiation in the
- V2 N) B' P X# Z/ s presence of serum has been performed previously. It remains unclear if 4 W9 r; W$ k; K1 u; c. r' m) W
transcriptional activation is dependent on complex growth factor mixtures % L( A& c, j3 ^# L6 g* v
in serum or whether this process is intrinsic to ES cells once the stem 5 y# d7 F9 _6 E/ Q
cell program has been inactivated. The aims of this study were to
$ ]8 m" [. s+ j$ F5 p& m+ L determine the transcriptional programs associated with the stem cell state 1 p; x9 E% r: }7 `: A+ t0 e
and to characterize mesoderm differentiation between serum and serum-free 7 S9 M+ M. [% q+ ~
culture.
3 k2 I3 q6 {7 p/ u Results) c! `1 ^$ z. E U
ES cells were differentiated as embryoid bodies in 10% FBS or serum-free % }: \* }6 R1 v$ J
media containing BMP4 (2 ng/ml), and expression profiled using 47 K 9 j6 G/ e: m5 o
Illumina(R) Sentrix arrays. Statistical methods were employed to define
% i+ U3 o8 P; w9 p1 q$ @6 T gene sets characteristic of stem cell, epiblast and primitive streak
) x4 G6 Q" l( Z/ V! d; @' \ programs. Although the initial differentiation profile was similar between
" {2 D2 Y" A" {0 b2 }: ?& y1 \ the two culture conditions, cardiac gene expression was inhibited in serum
! H$ V" w) M4 D3 U$ e8 c! c, G whereas blood gene expression was enhanced. Also, expression of many
' d) x5 k U" \: I/ f members of the Kruppel-like factor (KLF) family of transcription factors 7 r4 U4 v! ~6 m( S2 X: `
changed dramatically during the first few days of differentiation. KLF2
2 m2 }/ o8 j9 h* h% o* e and KLF4 co-localized with OCT4 in a sub-nuclear compartment of ES cells, 9 `% M& X3 F6 P
dynamic changes in KLF-DNA binding activities occurred upon
F/ \# U' @' ]$ T differentiation, and strong bio-informatic evidence for direct regulation 7 Q* |4 S' Q3 D3 {3 E
of many stem cell genes by KLFs was found.# _" G: w$ R9 b' y% d$ g- `
Conclusion6 K- T8 C6 S) `9 w% E% ]
Down regulation of stem cell genes and activation of epiblast/primitive
( I# B1 X- R( B. I0 }1 A streak genes is similar in serum and defined media, but subsequent
3 p. \% O$ y( N mesoderm differentiation is strongly influenced by the composition of the
6 K1 X1 C- |6 L/ }# u media. In addition, KLF family members are likely to be important
; w# ~4 |0 {9 S* O1 A% z! @ regulators of many stem cell genes.
* J6 B! V0 }, r) s$ Z Background
. B6 ^- X0 T$ l: p7 a `+ w$ c Embryonic stem (ES) cells isolated from the inner cell mass of the early $ y# F& I/ h) N1 n6 v9 e" n
mammalian blastocyst-stage embryo retain pluripotency when cultured on 5 @5 b v1 S8 _; K; ^
mouse embryonic fibroblasts (MEFs) in the presence of leukemia inhibitory
$ s/ I' L4 t9 [8 y! N factor (LIF) [1]. In the absence of LIF and MEF attachment, ES cells $ i, [) `8 a$ A) b" Q6 ?
spontaneously differentiate into multi-cellular aggregates termed embryoid " H( L3 Q* Q) b
bodies (EBs). Although the spatial complexity of organogenesis is not : A S/ W: ` R3 h+ _
established during EB maturation, the dynamics of gene expression closely
* ~9 m3 R' U: x8 y! O mimic those which characterize early stages of mouse development [2-5]. 1 Z. }( m) D: o7 A( B$ U
Thus, ES cell differentiation is an excellent model system for the
6 x: D& u) |4 L, U3 y# Q3 H discovery of genes involved in developmental processes.
% Z' L/ Q2 f* q1 S Many studies have attempted to identify genes that define the stem cell 2 B& R. X2 x- M; J* X9 Y
state by mining for genes co-expressed in ES cells and various other stem
& q6 l, G' i9 s; h7 \2 K5 B0 u: { cell populations such as hematopoietic and neural stem cells [6,7], or ES . _6 f- t2 D& l1 g/ |1 H
cells and trophoblast stem cells [8]. Some of these studies have been
/ R6 J. A( F- b* f1 i' m. Y9 I criticized because they were unable to define a similar cohort of stem 6 d2 {) @& |! f% F+ r' c! o
cell genes [9]. Also, this approach is likely to find co-expression of
U- ?9 @& Y9 z% W& B 'housekeeping' genes that are irrelevant to the stem cell state. More
; y/ h4 z6 ~9 H+ Q) j( Y recently, dynamic expression profiling during ES cell differentiation has / x( S! e. E6 g6 Y! G
been undertaken using various EB culture techniques [10,11] and the
+ m/ h" `8 W& x) i: o/ D addition of exogenous growth factors [7]. Most of these studies followed
+ P! l0 h' O5 Y/ e8 t differentiation for a short time period (up to 6–8 days) due to inherent 0 C; Z; m- Y v# `5 f/ E" S
limitations of liquid or adherent cultures cell differentiation systems.
' }0 e7 e3 C- {7 a Various platforms have been used to determine gene expression profiles,
. x+ X8 |5 R2 M7 ] including differential display, SAGE [12] and slide micro-arrays, such as
1 ]3 n+ w7 g! {9 k9 D! D; j: @ those provided by Affymetrix, Agilent and Compugen. In this study we $ z, U' l" f- ^. F
employed Illumina® Sentrix Mouse-6 oligo bead arrays [13], which have ~47
' Q' \3 j2 R3 R& f6 {: m; L( r1 d' q K probes that are largely based on the MEEBO set of transcripts, plus ESTs , Q! j4 p$ M$ N
discovered as part of the FANTOM2 transcriptome project [14], some 3 F& F' g1 y, r5 E6 K! m3 S
alternatively employed exons, and a small subset of non-coding
! [" \9 c; U6 M7 r- R( e1 g- Y transcripts. This is a new, rich and sensitive platform for expression
! ^4 e' U; r( o" y+ b1 Z7 v profiling [15].
& @+ L1 o' `! M& J1 } ES cells were differentiated in methylcellulose cultures, which allowed
) N& A0 o& l; y5 R& Q consistent EB development over 16 days in either serum [2,16] or defined / d& C4 k3 d7 M8 ~$ f1 L: c
serum-free media containing BMP4 (2 ng/ml), termed serum-freeB4L hereafter ( E: g- O4 y8 a h4 B- B
[17]. The dynamics of loss of stem cell gene expression and activation of ; y- n$ t- v" }! n3 _8 h
epiblast and primitive streak gene expression programs were similar in I2 e5 e N6 r
serum and serum-freeB4L but QT clustering revealed a significant ; f, c8 ~9 W' l# l# O9 J1 {0 N( _
difference in mesoderm outcomes from day 6 onwards. In particular, cardiac
; r* h' W% r' J3 u9 j gene expression was greater in serum-freeB4L whereas activation of the
7 q7 ~" @, i2 Y' N5 c$ K$ c* k* j blood program was enhanced in serum.
1 z+ s8 Y0 T' j& J High expression levels of Kruppel-like factor (KLF) family members: Klf2, . I2 c9 W! s; Q; P0 K
Klf4, Klf5 and Klf9 was detected in undifferentiated ES cells, and 7 A2 o9 C/ `- ]: j% x8 l
confirmed by quantitative real time RT-PCR, indirect immuno-fluorescence
. J) ?2 t1 t2 u3 m, u and DNA-binding assays. Expression was down regulated rapidly upon o @4 V+ s( M& o0 C. ^8 M; p
differentiation and a different set of Klfs: Klf3, Klf16 and Klf1/Eklf,
0 r$ E. Q0 M. y" N- I were activated upon ES cell differentiation. Also, KLF-binding sites were / S+ Y3 D* I/ a) }, y5 f
markedly over represented in the proximal promoters of most stem cell
. K/ ?! D* [" {" ?& ~ specific genes, suggesting possible direct regulation by KLFs. We suggest
, A3 u0 ?% E: ?/ ^+ H* u5 ?9 G a model in which the balance between self-renewal and differentiation is
( n" k. r7 ?- V$ F regulated by competitive occupancy of the proximal promoters of key stem 4 L- ^" y1 C1 ^' h C/ g
cell genes such as Zfp42/Rex-1, Nanog, Pou5f1(Oct4), Lefty1 and Lefty2, 4 f* L2 @' e! {9 b& z
and others such as the Klf genes themselves.3 b7 R2 |( |2 q3 M
Results
+ |- W9 U- j3 x" U$ ^( P Differentiation of ES cells in serum and serum-freeB4L culture$ g5 n2 x3 L8 V% b4 T
The directed differentiation of embryonic stem cells into desirable cell
% H4 C) {. F0 [/ b types will depend upon the addition of specific growth factors to defined
2 U2 w3 }- {6 { K+ p, j# ^ media and reproducible physical conditions of culture. To establish a
% v; A. @: z* s3 }9 n8 @1 j baseline for such studies, we undertook a comparison of murine ES cell
" j% `' F b5 @' y$ O: ~2 x3 Y' G+ | differentiation using an embryoid body (EB) methylcellulose culture system % n4 \/ h9 T4 U/ W
containing either 10% FBS [2] or a chemically defined BSA-based media , q& ^( H5 ~! \ Z3 C. F. h/ U
containing 2 ng/ml of BMP4, termed serum-freeB4L [17]. Triplicate
0 ~4 a" K- k/ A: f6 `+ Q8 ~ experiments were performed in which ES cells were harvested after feeder
* E4 A/ E* P( C2 K& { A( n depletion, and set up in parallel EB cultures in serum versus 5 Q2 A. g8 |* v1 ^7 A3 B+ M
serum-freeB4L. RNA was collected at regular intervals up to 16 days, ! \3 P, L8 Q$ {, q q3 K& [
quantitative RT-PCR was performed on a panel of genes representing defined
3 G5 P6 [; J# z5 l0 E stages of development to establish the robustness of the culture system
) e4 s6 R1 O7 I# F6 [6 T( i (Figure 1), and expression profiling was performed using Illumina® Sentrix - b H& `1 |5 z6 `3 T
Mouse-6 bead arrays (see Methods and [13]).1 Q! c$ D5 C. i$ K" i
Figure 1. Dynamic gene expression during EB differentiation in serum and
) N& a! e* z& T4 Z serum-free media containing BMP4 [2 ng/ml]. Expression of stem cell,
% P/ A$ q x/ ~: [) U7 E% Q- Q primitive streak and late mesoderm genes were analysed by quantitative 8 u2 p7 [( Y) C u4 u7 J! F: r& j
RT-PCR in undifferentiated ES cells (0, grey bar) and in EBs collected for
E: G# M& ^6 x( l0 l m) R up to 16 days in serum (white bars) or defined media (black bars) from the * {9 G& j" f/ q5 n% t5 W
same starting ES cell populations. Bars represent the means of three
' U6 [% s, @! f$ U5 z. Q (serum) or two (serum-freeB4L) biological replicates. The Y-axis m# f2 W# r H( ~$ Q. n; R8 `
represents a log scale normalized relative to the housekeeping gene HPRT.
4 e* g! y2 T- _! E* T$ m Error bars indicate standard deviation./ V& D2 I3 h' R, x
Changes in gene expression were extremely reproducible across ES
4 r; N3 d2 M. ]& \$ d% }/ ` differentiation experiments with overall correlation of global gene $ _" m+ Y. F9 I
expression for biological replicates in the order or 0.95–0.96.
' b2 I) N6 P6 Z5 E- o/ G Approximately 30% of the probes were differentially expressed at one or
; O/ c% I9 w7 W- |& _7 a more stages of EB differentiation as determined by one-way Welch ANOVA
. h7 v+ M% ]- E7 s. i9 o# G- h/ J with multiple testing corrections using the Benjamini and Hochberg False
' I# M. i1 _; y* b- `: o' ]' \$ ]" m Discovery Rate algorithm (7,967 probes with a p-value of < 0.01 by Welch " ^) D( ^8 c% ?+ v3 i& n
t-test) (Figure 2A).
" G- W9 L8 H) V. i7 J Figure 2. Gene expression profiles during 16 days of EB differentiation in
1 C! M/ U( _: o serum and serum-free media containing BMP4 [2 ng/ml]. (A) GeneSpring 0 _* O- L2 H/ A1 r" i) V, o. G
representation of the subset of genes from the 48 K Illumina array that
% w( s% {1 v3 ~) j2 O5 s showed significant differences in gene expression at 2 or more time 6 h" z( S: ^ K- k% D+ w% O
points. Each line represents the mean normalised expression of an - i# L8 o; D0 q" S3 e% B
individual Illumina probe. Colours represent relative expression (red 4× 1 i$ i% }. l4 j+ t; O# t
increased; green 0.25× reduced) compared to a gene mean of 1 (black) with # q- t N8 t5 L9 v
expression in undifferentiated ES cells (day 0) as the decision point for
: J5 H, X0 |* V3 C& q4 X, R2 a$ T colour coding. The expression profiles of T (brachyury), Hba (α-globin), * f4 K( Q5 W5 _( I; V) X/ a
and Igf2 are indicated for reference. (B) Clustering (tree) of the genes
2 d5 k/ ?, ^8 Y: t# ] that showed significant (p < 0.01) changes in gene expression upon + S, x S& E- I, o
differentiation by One-way Welch ANOVA. Expression levels are heat map 4 i! I- ?' u1 b
colour coded (inset) with high expression at Day 0 coloured red and low
! c6 k9 b1 Q. h expression coloured green.
7 K4 D& @$ Q0 G$ G. s To mine for genes with similar patterns of expression across the time 6 J' E# L( R5 ] Z) }3 I- D" K
course, hierarchical clustering was performed on the 7,967 probes and 1 b$ E$ L# [7 H3 ~, D
represented graphically. A large percentage of these (~30%) displayed
/ c& S+ w* i/ h: F. K4 o7 t highest expression in undifferentiated ES cells and subsequent reduced
% K* H5 f4 N- U& Q- ~. Q expression over time (labeled stem in Figure 2B). A second major cluster . Y' M4 y) F% V" A- f
showed marked induction after ~day 4–6 of differentiation (labeled
% Y$ z J% {1 @% G mesoderm in Figure 2B).
2 Z X& b: O" C1 d$ z Putative stem cell genes sets
5 y% }+ }6 \. ^5 p- g Previous studies have identified genes important to the stem cell state by A/ x0 ]! @5 F$ h! m6 W& F6 \( e
comparing expression profiling data from different stem cell populations
7 @! y) o( ^- S/ Z' e J- m- C such as ES cells, haematopoietic stem cells (HSCs) and neural stem cells
+ | N2 D/ Z9 n6 @7 j$ l [6,18]. We used two alternative approaches to identify specific stem cell
2 ?7 x8 E; {# v( J) K specific gene expression. Firstly, genes with dynamic expression patterns ! b( t( h7 @# A ]8 ?
which correlated closely with Oct4, Sox2 or Nanog expression profiles
& D+ G1 V1 T5 \! V" _1 p8 H) { (Pearson correlation > 0.9) were determined as previously described [6,7]. 4 N/ Y6 b/ `) A* I3 X7 U
Second, genes rapidly down regulated ≥3 fold by day 3 of differentiation,
- v+ V0 h2 Q2 h- d a time point where ES cell clonogenicity decreases dramatically, were
4 a) |+ z' u; a: T$ N( G. J9 m determined. Not surprisingly there was very little overlap between these
1 L; M9 u$ a& L; h gene lists, primarily because Oct4 and Nanog expression persists during EB + v; V, H r/ K1 n- v9 {1 d
differentiation longer than many other stem cell genes (Figure 1). Full
- D u. A, [( O lists of genes ranked according to similarity of expression (Pearson
% M+ P1 b" E q. B q- J correlation >0.9) with expression patterns for Oct4, Nanog or Sox2 are 7 d, {, g$ C1 p: W* p( |3 r4 p+ R
provided in Additional files 1, 2, 3, respectively.
4 Q, H# ^4 W3 P3 `' A Additional file 1. Oct4 gene list. The data provided lists all genes
" G) u2 K5 B# w& \) ?3 z% S expressed during 16 days of embryoid body differentiation with similarity - {& g1 p7 u% K; @
to Oct4 (Pearson correlation >0.9)." N: B2 n8 _2 M" o3 V
Format: DOC Size: 630KB Download file3 B( D% b( l0 e) R
This file can be viewed with: Microsoft Word ViewerAdditional file 2. * [- k0 |3 a/ k6 c
Nanog gene list. The data provided lists all genes expressed during 16
t8 h& V8 K7 h; P7 Y) n) T% u days of embryoid body differentiation with similarity to Nanog (Pearson + | q, @8 ?# G) l" i
correlation >0.9).6 d2 F4 U+ D+ q8 A' U) A
Format: DOC Size: 111KB Download file, w( w" D$ Q- H( F6 T( i% z3 P
This file can be viewed with: Microsoft Word ViewerAdditional file 3. Sox2 3 F9 L3 z$ v. q7 t! u, ? t) _
gene list. The data provided lists all genes expressed during 16 days of
4 N0 y7 U$ t! ^5 i9 [' m3 C- N embryoid body differentiation with similarity to Sox2 (Pearson correlation 3 M3 s$ m8 H$ }9 p2 H5 R
>0.9).4 j* [$ V0 J8 V f; i5 b2 ]
Format: DOC Size: 237KB Download file
3 w6 {# c# r( [6 _ This file can be viewed with: Microsoft Word Viewer
3 v% D" x* T- G. L' f/ k Only fifty-nine genes were down regulated ≥3 fold by day 3 of EB
# x7 I5 f( c. F, s" ~4 k differentiation (Table 1). Many of these have been identified as stem cell
2 a* s& W3 L. Q5 p8 e% P, H% @ markers by other groups using alternative strategies [6,7,19]. Information ' N/ S. W" X0 M8 k
was compiled on their known expression patterns and/or gene knockout 1 u. @6 X+ E+ j& H
phenotypes via literature searches as referenced in Table 1. Insufficient
1 [" H, F. B, o! f space exists to discuss every gene and most have been previously
# q. i9 p: M4 Q" E% s" K identified as ES cell enriched transcripts. Osteopontin is secreted by ( N% n6 G( V" N& |" d3 F: h7 G
osteoblasts in the bone marrow niche where it is thought to play an
" O# \- E9 N( D3 q important role in the maintenance of HSC quiescence and maintenance of 4 {# X1 [+ A7 S% }- V8 @
'stemness' [20]. It has also been discovered by other groups as a strong
( U+ `0 h" p M' I2 S marker of ES cells and is regulated directly by the key stem cell 8 W5 Y: ?% I4 M0 {* L
transcription factors, Oct4 and Sox2 [21]. Thus, osteopontin is likely to
! \3 O% g+ V7 m be a useful marker of ES cell pluripotency and may play autocrine or
3 G+ l7 O( Q" X paracrine roles in maintenance of stemness. The second most rapidly down
' I0 X7 M. V) |2 A. A# e+ c2 ] regulated gene was estrogen related receptor β, Esrrb, which had
* c5 y; v+ ]) J! O* L previously been identified as a stem cell specific gene and shown to be
4 ?& }( H/ J0 A1 l2 c critical for maintenance of the stem cell state [19]. F-box 15 (Fbxo15), a : ?& ^& G9 Z; [! n z
member of the large F-box family of genes [22] was also rapidly down
5 Z; p# y: b. H% b! L7 t' t regulated. Fbxo15 is known to be highly expressed in ES cells, is + F4 e2 L; k/ P7 D5 G, I
regulated by Oct4 and Sox2, but is dispensable for ES cell self-renewal
0 G3 R! H, L: G" | [23]. Recently, the stem cell specificity of this locus was used to screen " M: Q# y5 F4 x# ]5 T
for cDNAs capable of transforming mouse fibroblasts into ES cell ( s6 R4 _ K( S; {8 p( {
populations [24]. Other notable rapidly down regulated genes included the / D8 k- i0 w% A+ `5 I
nodal inhibitors Lefty-1 and Lefty-2, and Zfp42/Rex-1, which are also well ) @; z8 x, e* O" Q- t Z. L2 o
established as stem cell markers [19].0 h) a3 Z) Y( g+ Q* s
Table 1. Genes rapidly down regulated (>3 fold in 3 days) upon ES cell
+ P- I3 j. O+ g! b4 _1 Q& i differentiation) y: A0 K$ h+ h5 o. c6 L
Oct4, Nanog and Sox2 are not included in Table 1, even though each is 7 Z2 x. \0 m1 \* K
required for the maintenance of ES cell pluripotency [25-27] due to their
* w$ x" n& j9 h8 t& s6 G persistent expression during the first 3 days of EB differentiation
& a# S& H7 H7 ?" K* u' V0 M, C# Y& R8 W9 P (Figure 1 and 5A, and Additional file 1). Indeed, Oct4 expression is known
% ]. u. O- A$ x0 Z" e# @0 D: U; w* S to persist beyond the inner cell mass stage of development in vivo, with " ~- y7 V" D1 B+ l$ _# m0 F5 P
transcripts detected in the epiblast and later in the primitive streak, % i9 k9 w) {5 X) \* s% G
before it becomes further restricted to primordial germ cells [10,28]. 779
6 F3 U7 o- U! U$ x- r1 d1 | genes with a similar pattern of expression to Oct4 (Pearson correlation of
5 ]+ o. j9 K& i, @ > 0.9) were identified (Additional file 1). As expected this list also 9 ?+ _# I1 V7 z* B
includes many well known stem cell genes such as Sal-like 1, nodal, d3 U9 K0 D0 Y5 t
chromobox homolog 7 (Cbx7), Dnmt3b and Sprouty homolog 4 [19]. We also . E: D6 t: L+ P$ r3 T4 x$ ~7 y
mined for stem cell genes by looking for similarity (Pearson correlation - f: ?7 ^7 V1 u& V3 t) @
of > 0.9) with Nanog and Sox2 gene expression profiles. These provided . S! g; @- G1 a) A
overlapping but not identical sets of putative stem cell specific genes
! H/ m: d2 a1 H( Q9 V$ D% q" m (Additional files 2, 3). Full data for the 16 days of differentiation can ) H3 g/ h1 k% U$ C1 a' m% x
be interrogated and mined for specific differentiation outcomes via Signet
" ?- h3 E2 G. j: c" Z (Login_Bruce) (see Methods).# Z& J. R! N) o5 Y6 j. K
Tightly co-ordinated waves of gene expression suggest sequential . V' y4 ]3 u. q' M& Z- ^* W6 T. t
activation of epiblast, primitive streak and mesoderm differentiation
7 E6 ?- D% T5 e! u+ D, |* Q& _, B programs9 a% I! ^$ h9 m; w* `
Many alternative differentiation programs occur simultaneously during ES 4 P3 C" r# x: W! W1 t* N* ]4 G
cell differentiation. Therefore, it is difficult to find gene
/ @' p7 n& b3 i# I syn-expression patterns that identify specific programs using standard
. ?1 Z$ A% g& n0 t( a) f9 v clustering methods. An alternative approach is to search for expression
- o. d) o: `6 q# d3 @+ u7 a patterns that closely resemble genes of known function. For example, the T
) o/ D6 U8 v' Q% E( E box gene, brachyury, is a well characterized specific marker of the 8 \; v5 h$ f1 K3 b: x
primitive streak [29]. Brachyury expression is transient in vivo, a
# L4 K( p7 c8 t, j feature recapitulated during ES cell differentiation [2]. Using qRT-PCR, / c$ M# h, q `- o& k; z' Q
brachyury was induced ~1000 fold from D0 to day 4 and then rapidly ' w% Q: ~- h' R0 O4 ]2 |
silenced to baseline levels by D6 (Figure 1). The dynamics of brachyury
! w) b; P1 j, o# J! ? expression were similar between serum and serum-freeB4L media (Figure 1
- X$ w, i" h9 x7 c+ G0 F# N5 [ and 3A), but serum-free media was unable to support brachyury induction in
- {& r$ h& O4 A/ Z% { the absence of BMP-4 (data not shown). Thus, BMP4 (or related factors in & w9 T% R8 C& H! a; V9 P: X
serum such as activin) is essential for mesoderm generation. Thirty genes ; Z* |$ N% S8 D- C
were identified with a similar expression pattern to brachyury (Pearson
5 ?0 Q5 J3 Q( \: x5 i correlation >0.9) (Table 2) including many genes with established
: f- y! ]( P- m0 L2 r expression or critical functions during primitive streak formation, such + \, t3 `9 R- _6 J
as Mix1 [30], Lim1 [31], Sp5 [32] and eomesodermin [33]. Transient ; R2 _( |! Y& w9 ?2 ~) {
expression of many of these 'streak-specific' genes was validated by 3 G. k8 H: r6 O) d
qRT-PCR (Figure 3B and data not shown). In all cases there was a close
/ c" {& G1 h* {! k3 P; V& D correlation between the qRT-PCR results and the profiling data (Figure . ~/ f8 \0 U2 \; c8 T f* U2 ? y
3A).2 j) V4 ?% m3 U* b6 N
Figure 3. Mining for genes expressed in patterns consistent with primitive & C8 K# e( U# t" `' J& g9 C
streak, and epiblast differentiation programs. (A) GeneSpring
! G/ }& K3 {1 H& T representation of epiblast and streak gene expression profiles during EB
2 n# N7 l7 r% I7 X; o& \3 P) E differentiation in 10% serum or serum-freeB4L culture. The up-regulation
$ u. x( Q4 v ^/ e9 J% d of genes between days 2–4 were similar under both conditions. (B) qRT-PCR
4 n$ i g6 K6 a$ k: @4 v% B0 `! F) K/ W analysis of Mixl, Lim1, Cdx4 and Riken clone 8430415E04Rik. The Y-axis
8 e$ d/ ^3 j" f6 a( _ represents expression relative to the housekeeping gene HPRT. Error bars 5 W& J. Z/ J' k! w" d
indicate ± SD from three biological replicates. (C) 102 genes transiently
* r' l+ f' t' L& S2 Z& S3 g% ` up regulated during the first 4 days of ES cell differentiation were ! P4 `& K- s. K/ [3 w- K1 x7 U- u
clustered using a tree algorithm and Pearson correlation of > 0.9. The
: J& g" [2 u7 ]' \6 w 'Brief' group represents gene with very transient expression at day 3. (D) 2 {; T: `( h- I# p
Coding region for 8430415E04Rik. The shaded areas represent the three Heat
1 v/ H/ U0 Z) R" \- w( ` domains.Table 2. Brachyury-like (Primitive Streak) gene list (Pearson . {; }5 y. s6 B* p* t, t
correlation >0.9 with brachyury)
& {( v9 E+ z7 H Interestingly, four novel transcripts were identified in this list (Table 7 A* }8 E0 s. y' {0 @( {' e/ s
2). One of these, 8430415E04Rik, was detected by two independent probes, 5 }# M' S! v- N8 b5 O$ }, {% L
strongly supporting its 'streak-specific' expression; by qRT-PCR there was + f7 v4 q l1 B
~50 fold induction from day 2 to day 3 of differentiation and a decrease
& k" Z# y9 f, U to near baseline levels by day 6 (Figure 3B). The gene contains 24 exons,
2 R$ z) n2 \- Y/ U7 j) i and the full length cDNA encodes for a protein of 868 amino acids (Figure * y: j, f+ E d% u: c3 I p. G8 m0 E
3D) which is highly conserved in vertebrates (Additional file 4). It has 7 S3 H" V- O0 {4 w
little homology to other proteins apart from three predicted HEAT domains
: P! y& ^* ]2 h: E or armadillo domains (grey shading in Figure 3D). Drosophila armadillo 3 s3 V. ~- m$ i5 j! M- F* J9 a
interacts with β-catenin via its HEAT/armadillo repeat [34], which
9 V: ~8 Q) N3 P7 ?' v% I0 U3 E4 U/ c suggests the protein encoded by 8430415E04Rik may also be involved in
" ]9 |9 o+ O# H" m3 K8 G+ d( Z protein-protein interactions.
# T: c' D; v% ?, j% ]2 v Additional file 4. Sequence alignment of RIKEN clone 8430415E04RIK. The
( b- F0 Z/ j& I; D5 g# [ data provides an alignment of RIKEN clone 8430415E04RIK using human,
. @0 ^, i. Y- ~/ ` mouse, chick and zebrafish sequences./ |* f- `3 L# [" F; o2 E
Format: DOC Size: 31KB Download file
' f1 H5 p9 b7 I' {- V1 {- c This file can be viewed with: Microsoft Word Viewer
/ H3 {) |7 d' R3 v& a# G9 C& d% E The list of primitive streak genes in table 2 is smaller than that
: a8 l, ?4 `0 a7 h9 C1 V previously published [10] due to the very stringent criteria used to - ?: \0 V; \7 m
determine brachyury-like patterns of expression. Thus, to find further ; C! q2 j1 h% V; q! q/ ^9 z
genes characteristic of epiblast, early and late primitive streak 4 x; S6 |3 ^& ]) A
programs, probes were identified which were transiently up and down
$ ?$ m# g. E1 ]: F regulated at least two fold with peak expression at day 2, 3 or 4 of EB + b% R4 t1 v* B) e& i9 u z
differentiation. Gene lists were combined (102 Illumina probes) and
* |; ~3 G! p2 ]- ^4 M. Z$ @+ ? clustered into epiblast, early, mid and late streak groups plus a unique 9 B; \1 I; w: I; Y* D. r/ K
group in which expression was very transient at day 3 of differentiation : X) c& \& D, ~; o9 B7 m. ?& f
(designated 'brief' at the bottom of Figure 3C). Also, lists were
! _) I+ n; b- D3 N% [; h L. Q; U8 k generated by searching for similarity of dynamic expression profiles to " R$ q1 k1 A/ q) h2 [7 p. w; ^0 X
known genes which are accepted specific markers of specific stages of 2 Q* B: x, B/ H
mouse development. For example, Fgf5 is specifically expressed in the
+ x3 C5 T7 L2 U6 V( b. L0 ~4 x epiblast [35], and Wnt5a is expressed in the late primitive streak [36]. ! E3 U, H, S+ U: t7 J4 ]; Y5 U- q
The Fgf5-like (epiblast) list includes the methyltransferase, Dnmt3b, ) C' Y: C* ~ @8 V$ R
which was massively up-regulated in the first 2 days of EB differentiation
/ Y: j; G1 Y- o5 d (Additional file 5) and is known to be essential for mesoderm 6 h% @( B; j0 j" E ]# M
differentiation [37]. This list also includes Pim2, Zic2, Wnt8a, the SP1 a& m9 t- j7 F' Z" A9 I
(and possibly SP5) cofactor Crsp2, and Irf1. The Wnt5a-like (late
" [& W, T( R! Y primitive streak) list contains 76 genes including many homeobox genes : a) ^$ ?1 K6 P {0 z1 g5 u
(Hoxb2, Hoxd1, Hoxb6 and Hox8/Msx2), Cdx2, Hand1, Tbx3, Bmp4 and the VEGF $ }- y/ P' L! K3 p$ ~
receptor Flt1 (Additional file 6). Many are expressed at E10 of mouse
/ q, {& B5 c y* \! P8 j' g development in the remnants of the primitive streak [36], suggesting the 9 y9 L) w1 E& X1 A# T9 f
Wnt5a-like list includes many genes known to pattern the embryo following ) M- X1 H4 P+ R5 B
formation of the germ layers. There is a significant overlap between the 3 N* I+ @. S5 e& B; J1 } K
genes we identified as being epiblast-, early and late streak-specific and
1 m9 t' i P% b% p/ g7 G1 q9 e5 d" ? those recently identified using a similar EB differentiation system and 1 q# ]1 a3 T) Z9 H
Affymetrix arrays [10], and by other groups [2,38]. In addition, there are
# j' {* s" N. V" g3 G) b addition novel RIKEN cDNAs in these lists, which are likely to encode for
9 ^9 q, y, Z! k. a interesting proteins worthy of further study (Additional files 5, 6).4 r* m7 j4 B- b7 e5 G& B: @
Additional file 5. FGF5 gene list. The data provided lists all genes
. O0 }' `, U+ ~8 T3 @ expressed during 16 days of embryoid body differentiation with similarity * e( w4 o& x, {9 L7 j0 J" k
to FGF5 (Pearson correlation >0.9).. H: G( ]& |& Q, t
Format: DOC Size: 69KB Download file
5 v6 f9 a; T! \2 J" ^3 x X7 V This file can be viewed with: Microsoft Word ViewerAdditional file 6.
1 c/ l1 H2 R9 a, @: W Wnt5a gene list. The data provided lists all genes expressed during 16
" `# Q e4 v' v9 A. y& ~7 t days of embryoid body differentiation with similarity to Wnt5 (Pearson - t- A) { l5 ]2 m
correlation >0.9).
7 e! S9 J! I0 ` Format: DOC Size: 76KB Download file
. o0 k$ [# Z; k This file can be viewed with: Microsoft Word Viewer+ {4 K2 d/ b6 p' x7 O% }. L; c; M, q/ j
Following the primitive streak wave of gene expression there was a " f7 f" ^& V4 [( g0 |( F
dramatic activation of many genes (Figure 2A). Many of the most highly
: e# U2 `4 L- d" G& b induced genes, such as globin genes, heme synthesis enzymes and
* q- ^5 B; a% a1 O haematopoietic transcription factors, are known to be expressed in the X9 F" t& c, Q$ y8 J9 b# U! g
blood. qRT-PCR for globin genes confirmed > 10,000 fold induction between + d9 G" N# K% M( y
days 3 and 8 with a steady decline thereafter (Figure 1). Other mesoderm
' j+ W9 @ `/ E: m& Q: `7 J and endoderm programs are also activated at this stage, so it is difficult
8 V' i+ u; U0 F4 [/ u to mine for these different outcomes., k# E! n& q3 G
Enhanced cardiac gene expression and reduced erythropoiesis in & ?, {3 j/ }: B$ y" h
serum-freeB4L EB culture3 Q4 p4 ~. P! @: F" Q% Q
Overall there were very strong similarities between ES cell
T3 h: k+ y, k! V+ T# V differentiation in serum and serum-free cultures supplemented with 2 ng/ml
( N/ i4 a: Z7 t; g- C BMP4. For example, all 29 of the primitive streak gene set, as defined
* @! v, _! S- W- w8 _: H from serum-containing cultures, were also expressed transiently and to a
. r6 a0 i! M! S. S similar level in serum-freeB4L media (Figure 3A). Genes differentially
4 [/ @- B- t! B- r! p% k8 { expressed at each EB time point (Day 1–16) were pooled and subjected to QT
# o6 s# D3 ]: l. k+ ~+ R5 m clustering to find sets differentially expressed between serum and
" `( e8 |8 B" B6 x serum-freeB4L EB culture. Using a minimal cut-off of five genes per group, 5 P; I9 ?# w1 l0 `( b' \/ m
nine gene sets were identified; seven showed higher expression in serum 1 T2 {, J6 w+ i& R8 T9 d7 I8 ~
and two showed higher expression in serum-freeB4L (Table 3 and Figure 4).
5 V- j' u& J, Y1 C/ w6 Y: }. E Table 3. Genes differentially expressed during ES cell differentiation in
8 [/ p. ]3 X0 F( y8 R9 F7 o serum versus serum-free defined media (BMP4 2 ng/ml)Figure 4. QT 1 |5 N4 c/ [+ V5 L( q# z% z
clustering revealed sets of genes differentially expressed between serum 9 T8 J% g' J# d1 V% I
and serum-freeB4L EB culture. QT clustering revealed nine sets of genes
$ M* d. o! {; B% Y6 C with similar dynamic expression profiles (Pearson Correlation coefficient + L* `; F/ q# A8 o h
of > 0.9). In sets 1–7, expression is higher in serum (black sets). In 5 m* b8 @6 W1 \( ?
sets 8–9, expression is greater in serum-freeB4L EB culture (red sets). 0 u7 o& |3 A( ^# c0 R) b
Some sets have been named based on the similarity of overall gene function
) V+ z: P) Z: v' H( Y; g during differentiation pathways (see Table 3).- }' n7 l# f3 Z- @
The genes in set 9 were expressed more highly in serum-freeB4L media from 5 P8 C, M9 J4 u; S* c- M; o g; K& K
day 6, and genes in set 8 showed up-regulation at day 12 and 16 in the - p- v) p9 m9 Z! O
absence of serum (Figure 4). Both sets are comprised almost exclusively of $ X# h" P8 k" p9 L! }
cardiac-specific genes such as myosin heavy chain 7 (Myh7), myosin light
' [: n; g( a3 a5 N chain 7 (Myl7), the cardiac isoform of α-actin (Actc1), troponin (Tnnc2),
' W; D2 b& j3 V tropomysin (Tpm2), the nicotinic cholinergic receptor (Chrng), and cardiac
$ V8 { d7 e7 K/ a. q" k, Q muscle fast twitch 1 specific ATPase (Atp2a1) (Table 3). Further + o( F. Z/ j0 n; D6 T9 J
interrogation of the array data revealed a cohort of cardiac transcription ! c0 @% Y2 h8 J! Q: x9 ~
factors, including ISL1, MEF2C, Bop/Smyd1 and Hand2 [39], which were 4 Q& r* P: X" D1 E9 O5 d( e/ ?
transiently up regulated 1.4 to 3.5 fold in serum-freeB4L media compared
/ S m; r9 H& R& P& o to serum at day 6 (Signet Login_Bruce). Importantly, this elevated cardiac ; P" C! }6 S$ @! d
gene expression program correlated with increased numbers of spontaneously
3 \* v& n F }" h8 e beating EBs in serum-freeB4L media (data not shown). Enhanced expression
; x& X4 K4 @! ]1 T0 V (~100 fold) of Actc1, Myl4 and Myl7 in serum-freeB4L media compared with 2 J4 X2 G- Y/ \# |
serum-containing media was confirmed by qRT-PCR (Figure 1). Taken , ^& `9 Z g# F a( Q
together, this data strongly suggests cardiac muscle cell differentiation
P6 ^+ v5 i4 A is either inhibited in serum or enhanced by the addition of BMP4 to
% I) ]( Q2 S9 k* q' H serum-free culture.
" _' Z4 o4 {# ]! M The genes in set 2 were expressed in both serum and serum-freeB4L
/ m, O# z H0 C conditions immediately following primitive streak formation, but expressed 9 @1 [& {/ h6 |1 Q0 d
to higher levels and slightly earlier in serum (Table 3 and Figure 4).
/ {1 p7 x% |) u+ R) ` Many of these genes are globins (both embryonic and definitive) suggesting
+ ~4 S' P, O9 z$ M' o- | erythropoiesis is activated more robustly in the presence of serum (See
) A) Y' c1 S2 O. r" H* N$ d1 u Discussion). Alas2, a gene encoding the erythroid specific isoform of the : r$ A4 M7 o* u. h$ s. a
first and rate limiting enzyme in the heme biosynthesis pathway, and the
2 d9 ^/ e: D: m+ _7 O# m8 U erythroid specific transcription factor, p45-NF-E2 are also in set 2 [40]. . c6 @( y2 C* ^4 _" M% k
qRT-PCR for α- and βh1-globin confirmed more rapid expression in serum 3 r" Q" `! G ^) P5 B* s' j
versus SFB4L media (Figure 1), suggesting factors in addition to BMP4
+ A- D6 J. Q- E/ K& X6 j) z! N enhance activation of the blood program from a similar primitive streak
3 i. y" ]: {/ }0 T5 E$ x3 f platform.5 `6 G2 D+ b# A8 J# h3 y9 c
The genes in set 3 were expressed at a similar time in serum but expressed
8 e' _! o6 x! @: }8 l: \ at very low levels or not at all in serum-freeB4L media. Again, this set 1 ?. i* b% m2 s
is highly enriched for erythroid specific genes such as alpha hemoglobin 4 |' n! E3 E& n; E- l
stabilising protein (Ahsp) [41], Glycophorin A, erythroid Kruppel-like 6 |5 \, J K- ~
factor (EKLF/Klf1) [42] and mitochondrial solute carrier protein . J! q8 b1 b' N) Q, [
(mitoferrin) [43] (Figure 4 and Table 3). Genes in sets 1 and 5 are
$ z7 B1 \( s( n. n activated more robustly at late time points in serum cultures. Most of
! {5 Y& K: k$ O8 _- O% j0 f these are specifically expressed in macrophages. Together, these results
9 m; h; j4 b0 D5 G) k! F suggest delayed and less robust primitive haematopoiesis occurs in SFB4L
8 I2 n, n9 L4 j* `! H culture compared with serum, an observation confirmed by less robust ' R8 H& Q7 x6 y9 L
erythroid cell generation and 'redness' of the EBs from day 6–10 of ! o1 O+ q( \0 p) S! e( f4 ?
differentiation (data not shown).- W7 l0 A7 I; D9 N+ Y. n
Rapid changes in Kruppel-like factor gene expression upon ES cell
7 A% D" a. k2 k8 c" t4 x6 Q( G differentiation' ?+ U v6 }: T7 K! p
Three Kruppel-like factor genes, Klf2, Klf4 and Klf5, were rapidly down
* Y! W. _. _! J7 O6 Q* x regulated during the first few days of EB development (bold type in Table 6 [6 P L6 Z8 o
1). All are members of the KLF family of transcription factors which are 7 S6 _# \& \0 t& D& E
characterized by a similar C-terminal domain of three C2H2 zinc fingers
, ^% k' D# S% j- w% p& L3 }+ Y which confers binding to CACC box elements in promoters and more distant
6 V: M( n7 d: f0 W: V. c2 m/ V regulatory elements [44,45]. Klf2 and Klf4 have N-terminal transcriptional
% t j. n2 J% w0 j j9 c" N( z7 g action domains and act primarily as transcriptional activators, whereas
& D5 W, o' M: s7 u Klf5 is best known as a transcriptional repressor [46]. Until recently,
% I% ?. l7 d9 z3 ~+ m Klf2 and Klf4 were considered to have restricted expression patterns
7 ~- t& |8 r* n/ y# w$ h [47,48] with gene knockout animals showing late developmental defects in ) r( W" r: H# X& M
vasculogenesis and skin differentiation, respectively [49,50]. Recently ! F8 |; T8 [3 u `
however, Klf4 expression was noted in ES cells [19,51], and forced
! z5 D+ J) x& e- ~& h" n expression was shown to enhance Oct4 expression and promote self-renewal 3 T3 h! r( M i
[51].. E( p$ r/ n1 Z- @
qRT-PCR confirmed the rapid down regulation of Klf2, Klf4 and Klf5 in both
3 ]. _. c) O. l& W serum and serum-freeB4L differentiation conditions, although Klf5
6 k% [* ]# e: T1 V Y$ b2 h" K expression was down regulated more slowly (Figure 5A and 5B, and Table 4).
0 l( ^) T9 z6 q. j* u6 p1 X Klf2, Klf4 and Klf5 were all re-induced from day 3–5 of differentiation as # v9 A: U5 Y$ y3 _% K8 @' g: W; S
mesoderm and endoderm developmental pathways are activated (Figure 5A and
' c# L% ?" a+ m$ A/ p 5B). Sixteen of the 17 known KLFs were examined in detail throughout the ; q1 z5 X" @: f! [# s# P. @8 J
entire differentiation time course (Table 4). Klf9 was also steadily down
$ Y6 b$ o: U* D0 V" N regulated (>10 fold) between ES cells and day 5 EBs in serum and 8 s( Q) v" O' Z6 v3 B) ^7 w/ v6 A
serum-freeB4L media (Figure 5B). Klf3, a well characterized 6 E {& y2 m% h
transcriptional repressor [52] via its ability to recruit the 2 _- O- O; k% P
co-repressor, CtBP [53], displayed an interesting biphasic expression 2 w+ I5 F! J# e; _
pattern with initial rapid down regulation then re-induction from days 5–6
6 Y. ~0 r7 j1 ~ (Figure 5B). In contrast, Klf16 displayed little change in expression 3 k* J. N; @, ~( \. R! i1 \. y
levels over the 16 days of EB maturation (Figure 5B).
. s# d- f- @- h; B" a- y Table 4. Kruppel-like factor gene expression during ES cell ) ]! f+ t# ]3 ~* X1 S( u% J% T4 Q. J0 E4 n
differentiationFigure 5. Kruppel like factors (Klfs) are dynamically ' w! j/ P) @" M
expressed during the first few days of ES cell differentiation. (A)
' {- J9 r# p6 Q7 _/ W( F2 q GeneSpring plot of normalized gene expression for all KLFs detected during 0 [6 E" V0 R% |' R: p1 \* G5 I8 |/ ?
ES cell differentiation in serum or serum-freeB4L culture. Most of the ' e4 {5 W4 e5 r
genes are listed on the y-axis in order from their highest relative 1 }0 Y, n/ n8 i) C& F
expression in ES cells. There is dramatic up regulation of Klf1 (Eklf) * b) F4 b1 U0 x% Q
only in serum following day 6 of differentiation. Plots representing Nanog # i/ @- q! a4 \% i6 b9 d
(red), Sox2 (red) and Pou5f1(Oct4) (green) are shown for comparison. 4 e; G3 i; e% D' K% ~* L
Pou5f1 and Nanog gene expression persists at high levels for 2–3 days
! f( l. F. [. q after Klf2, Klf4 and Klf5 are down regulated. (B) Validation of changes in ! f+ ?' g2 n4 A: ~ r" N
gene expression of six members of the KLF family by quantitative real time 1 L7 j$ F: s: f0 o- e2 R8 f) D
RT-PCR. Scheme as described for Figure 2B.
W, y3 l* u7 B% D/ o7 l6 a Klf2 and Klf4 co-localize with OCT4 in the nuclei of ES cells (Figure 6A). * ]/ S% N: U- a' J% F( T+ x
Interestingly, all three proteins were found to preferentially reside in a
! {, C/ d! e3 k1 X nuclear sub-compartment (possibly nucleoli for transcription factories), : S3 m1 c. K6 |7 f: ?( j
suggesting possible co-involvement in a protein complex or network. Also,
: ^; ]: t2 @+ C1 o+ A, a0 x endogenous KLF2 was detected by specific CACC box DNA-binding activity in 3 m" t! I% j1 y7 O# X1 c7 s
undifferentiated ES cells, which was lost upon EB differentiation (Figure
, S# D( G% \% m! u3 `) ~ 6B). In contrast, endogenous ES cell KLF4 DNA binding activity was not
3 @4 S. b; Q/ d detected using the p18INK4c promoter CACC box sequence, although
0 w9 `, }8 d0 s1 p5 Y# R recombinant KLF4 was shown to bind to this probe (data not shown). SP1 and ' x" `) ]2 \, J7 Y5 v* v0 X; T
SP3 are ubiquitously expressed CACC-box binding proteins, which are
0 \' Y7 S. Q1 D9 R dominant in ES cell nuclear extracts (Figure 6B). We hypothesize KLF2,
: Z n4 S j) v6 V, F5 b8 B KLF3, KLF4, KLF5 and KLF6 compete with SP1 family members for DNA binding
7 V! q `7 C- Y0 p) t to key cis elements in various stem cell genes to regulate expression in
0 Z( M/ N2 }& [ positive and negative fashions (see Discussion).8 n. s: s; B1 c1 y6 ~
Figure 6. Kruppel like factor expression and DNA binding activity during : s2 n9 J: {/ p5 ]% z
ES cell differentiation. (A) Co-expression of Oct4 with KLF2 or KLF4 in ES 6 y$ l; I+ G$ D2 p3 ~
cells. Indirect immuno-fluorescence shows co-localization of KLF2 and KLF4 , @/ B) ~3 ?7 ?; z! I
with Oct4 in sub-nuclear compartments (possibly nucleoli). Individual
6 j2 L1 u" v+ o9 k confocal images for OCT4, KLF2, KLF4, and DAPI are shown with the
( }' h; b) a2 n' J; E corresponding composite image. Scale bar 40 μm. (B) Electro-mobility gel 8 Z3 o4 z1 ?6 g! l/ |5 D) ^" |3 s
shift assay showing changes in DNA binding activities at a conserved CACC : \; v3 p7 E3 j# @' P3 N
box site in the p18-INK4c gene promoter. Nuclear extracts were generated
& B# v9 g2 `' r6 q) c, r6 a6 f from ES cells or EBs differentiated for four days in serum. Super-shifts
) v4 R M- Y" ]6 h+ U: @! o, K were performed with specific antisera for SP1, SP3, KLF2, KLF3, and KLF4
8 |" _4 ]) B/ p7 B6 t# t/ Y (See Methods). There is strong binding of endogenous Sp1 to the CACC
) u c! f6 u6 W5 q' k9 ]8 U element in ES cells and EB cells. KLF2 DNA-binding activity is present in
0 z9 U5 a" N, u) a4 P6 O. D# q0 S! Z ES cells as determined by a specific inhibition of binding of the A( L! C' |* ~; }0 ?8 _# F: K
indicated DNA complex with a KLF2 antibody. This activity is lost upon + Z5 n3 S4 B0 M% F( @
differentiation into EBs. The identity of the CACC box binding activity in - }- m+ C3 X+ W1 a( s r+ ]4 l
EBs denoted CAC-X, and the binding activity in ES cells denoted CAC-Y, was + X# I1 _! a9 M4 N, F1 I( }, |
not definitively identified using this panel of antibodies.% K9 Z3 k7 d; |
Enrichment of KLF-binding cis elements in the proximal promoters of stem
. P3 ?( `8 e( {# B7 g cell specific genes
8 W* {3 K) i" S6 A4 ^& U% g& L To investigate whether Kruppel-like factors might directly regulate stem ) l' e9 G9 [( [( E5 p1 W. d+ k
cell gene expression, and therefore play a role in the maintenance or loss
9 c5 H) I1 c, F9 R of 'stemness', the presence of KLF transcription factor binding sites 4 P) R+ a9 |* m; D& ?' N
(TFBS) within the proximal promoters of all genes in Table 1 was
B4 P; e5 \* t! g4 D! ]/ w determined. Although a position weighted matrix (PWM) for KLF4 binding
3 |8 M2 F6 o' z% B6 y# Z9 s0 s8 A site specificity has been published [54], this site does not resemble the
* _6 d. |# o, |! r" ^ CACC box sequences known to bind other KLFs. Since KLF4 has identical 8 ]! ^) _, m! a* w2 R/ x1 d
DNA-binding residues in each of its three C-terminal C2H2 zinc fingers to
: U% J. r* j/ V' A9 A d$ y other members of the Kruppel-like factor family [45], it should bind to
; f8 j) p3 f8 F& X, u similar sequences. Based on the crystal structure of the zinc fingers of
$ q% S; g- x( H& h, } Zif268 and SP1 bound to DNA [55,56], and testing of specific binding of
/ L; A( J n- N/ r CACC box variants to recombinant KLF1, KLF3 and KLF17 [45,52,57,58], a
/ q. H6 J1 t# y) o- b9 g' x generic KLF TFBS has been proposed. Although a SELEX experiment for KLF1 # o. g @6 q' T: w1 A# S* t1 h
binding has not been published, experiments from our group suggest a C at ?2 c& e$ {/ j. h! |) z% s% t
position 1, an A or C at position 3, an A at position 5, and an A, T or C
: `* {8 |0 r5 ]; [& x at position 9 of the 9 bp consensus provides enhanced affinity for KLF1 6 n9 f) w9 \5 f+ j8 J7 A/ c
binding. Taken together, these studies have enabled the development of a
2 e5 \' N9 f/ l) M9 t) L PWM (called here KLF-A) that should predict KLF4, KLF2 and other KLF TFBSs
J# m, k' E6 W, t (see Methods and Additional file 7).
4 n+ {8 x1 B. R9 Z. W Additional file 7. Position Weighted Matrices. Provides published data on
( w. F7 E# E$ Z1 b- d& }" V8 M5 ] position weighted matrices for Oct, Sox, KLF-A, KLF4, Nanog, E-box, Gata-1
) q, Z) ?0 ]' ~/ S7 n. v and their variants.
; P) D4 Z* s5 k( e7 u/ e3 O6 m Format: DOC Size: 116KB Download file+ w L7 g3 o, A: ?3 u
This file can be viewed with: Microsoft Word Viewer& N- o- N/ {8 I, W
Clover [59] was used to determine if defined TFBSs were statistically 5 }( `. Z+ L# |+ j( A5 s1 B
over-represented in the stem cell lists. We searched for PWMs for KLF-A,
1 r2 V$ n' h7 G# e KLF4 [54], octamer, extended octamer, nanog, and Oct-Sox binary sites as " n' k9 \" X7 B; g& E
well as negative control sites (gata-1 and E-box) in sequences 2 kb
! V' T' w* |8 _8 ^ upstream of the TSSs of stem cell genes (Table 1), versus 2 kb of
2 z+ O6 L" g% U% z4 @ sequences upstream of the entire murine transcriptome (See Additional file 7 I' \# Y5 {6 O* F8 _7 r
7 for details of PWMs). There was significant over-representation of KLF-A - C' @& _% H, J
TFBS within the first 2 kb of promoter sequences from the stem list gene 8 u2 Z& G% z# N! g) @- Z
list (p value of < 0.01) (Figure 7A), as well as over-representation of $ D5 W6 J a. m* B7 o# ^
octamer sites and extended Oct4-binding sites as defined from ChIP-PET
+ G& ?7 s; H/ Q& @% |: c data [60]. No statistically significant over-representation of KLF4 [54], 4 w' ~8 Q4 |$ ]( I
NANOG, GATA1 or E-box TFBSs were identified. All genes in the rapidly down
5 i- C3 f8 u. C7 [4 e" k+ Z regulated gene list have at least one KLF-A type CACC site, or octamer or
( U% u, l2 r( g/ i6 ~. y6 X Oct4 [60] TFBS. A full list of the promoters with highlighted sites is
0 M/ s" c6 J1 B available on request. Together, this data strongly suggests direct 4 w" h1 H/ x. X% h9 h; T- H
transcriptional regulation of many stem cell genes by KLFs.
) J) e+ _% m( w* s! y Figure 7. KLF and octamer binding sites are highly enriched in stem cell
' K( {8 W# `7 x/ }! T) |4 R gene promoters. (A) A Clover analysis was used to identify
) F& Y) E' f Q W" B) ?/ s over-represented transcription factor binding sites within the promoter
{2 W# I# |! h( _1 w sequences of all stem cell genes identified in Table 1. Representative
: w9 D( N9 j4 W% X% H; j gene promoters are shown, indicating KLF-A binding sites (pink), octamer # C, ~% n5 m8 `+ Z
sites (ATGCWAAT) (green) and extended Oct4 binding sites (Oct4-Loh)(cyan)
1 V- F- b A5 s( H7 b [60]. (B) Clover output for 2 kb of promoter sequence of murine
# e5 ?- \( `; k. v1 i* l Zfp42/Rex1. The positions and sequences corresponding to PWMs for KLF-A,
5 ?/ g" f3 g4 k+ D. W octamer and Oct4 occupancy sites are indicated in the table and colour 5 N1 C8 L: n& S0 N
coded in the sequence. The positions are relative to the transcriptional
8 w2 k8 G% _- ] start site. (C) ECR Browser output of conserved sequence identity between
3 _% ^! z% Q& n( |, N. L mouse and rat in the promoter and part of the first intron of the Zfp42
6 J0 G8 K4 F2 T: Q8 i gene. The blue arrows indicate the direction of transcription. Yellow
6 B' f1 t3 c1 @' }- V9 h1 ~$ S indicates the extent of the first (non-coding) exon, pink indicates : G+ M- X6 s& M- M% l, g
regions of sequence conservation in the first intron and red indicates
* T% A& R' |5 ]! I, e( K3 T regions of sequence conservation in the 5' upstream region. rVISTA was , w2 n$ {* s5 }" ]* x
used to find all of the KLF-A and octamer sites in the murine gene (Murine
6 |" m* }4 v; a& N/ m- q only) and conserved sites between mouse and rat (Conserved mouse, rat).0 c, v9 q2 X" X' n
The Zfp42/Rex1 gene promoter is presented as an example of output from the
8 o! R) s: a0 J U+ i- }8 H Clover program (Figure 7B). It contains five KLF-A, one octamer and three ( }3 M& k: o4 b4 l* y
OCT4 TFBS. The two CACC box elements within the first 250 base pairs of 5 Z9 _9 k! |; o/ I0 h
the TSS represent classical extended KLF-binding sites [42,61]. These
/ B( ~/ Z& s6 o% t) O4 I2 E% ~2 l. K regions are the same as previously reported [62] although their functional 1 y: I) P: ]& e/ M0 `6 }9 z
importance has not been determined. Interestingly, two of the three KLF-A
6 j1 ~7 V# }$ w TFBS were evolutionarily conserved between the mouse and rat gene 3 ?. j: Y7 }4 f+ X+ f$ v
promoters (Figure 7C). Also, two further conserved CACC sites at -1.4 and
K# ~7 c. ^+ {4 I& ?% I9 H -1.5 kb were identified within 500 bp of extended evolutionary / S6 U- c6 P- {6 R4 f3 w
conservation (>70% identity) (Figure 7C). This might act as a KLF 5 f" u& f! M, Z( u4 @4 w
dependent enhancer. Most of the stem cell genes in list 1 have conserved 2 Y( V9 Z4 F: [" h9 F6 N0 r2 U/ {
CACC box elements in their promoters [63-66]. In some cases these have 8 W) S/ C4 s% \" C4 M1 f
been reported to bind the ubiquitous SP1, but our Clover analysis suggests . W. |+ O* l0 ?- u* ~4 W" B' n
they are also likely to bind KLFs (see Discussion). ~8 K6 h2 l( ]" R
Discussion
4 z% `' D% z1 j. |8 V K Expression profiling of murine ES cell differentiation was undertaken over & S$ Q: h" W$ `- u' m9 k$ m7 j1 i) G- ^
a 16-day time course. We compared gene expression in methylcellulose
$ |* }9 P- C8 V4 Y cultures containing serum versus chemically defined media containing LIF 3 }7 m* D) f6 X$ Y
(1 U/ml) and low concentrations of BMP4 (2 ng/ml) [67]. The Illumina®
% h4 t/ f( ?% C5 k; t* \/ j Sentrix Mouse 6 bead array provided a sensitive and detailed platform for : w& r6 b; B3 N, U
analysis of dynamic gene expression. Using various data mining approaches,
; o8 E; U5 f) F8 c* D1 K* O lists of stem cell-enriched genes and genes that are induced during the in
" N i5 k+ {: Z" Z vitro equivalent of epiblast and primitive streak stages of $ [& l8 z3 p2 e; S
differentiation were generated. In combination with other ES cell . u0 \' M4 Q f( l0 I$ M
profiling studies [7], our detailed expression data provides a useful 5 O, u+ y! w+ |. e
resource for future reverse genetic approaches (i.e. siRNA knockdown) to
5 y2 p9 B& w9 m" i+ q2 i6 {8 l study the function of these genes during ES cell differentiation and in
3 C6 j8 q& O ~# ~) k0 Y7 h vivo development. We also found a number of previously uncharacterized
. i! b8 f% l$ x. f, Q8 W' p cDNAs (RIKEN clones) which could play important roles during development.
3 i$ I6 _! h: e0 [' `4 t z Importantly, the loss of pluripotency, measured by Oct4 gene expression,
) X9 C" v9 C+ P, d6 G2 t' s was comparable between serum and serum-freeB4L EB culture, following a
: |) t5 l+ B" z8 `8 w8 z predicted decrease over the first 6 days. Surprisingly, Oct4 gene 8 |" f$ K% Z0 X0 i; o. g6 `4 j
expression gradually increased following day 6, an observation independent ' q4 S2 C" x1 q U0 s1 K7 i0 o
of the cell lines used (data not shown). It remains undetermined if . ]# C4 c. P/ x" ]
expansion of Oct4 positive ES cells persists as undifferentiated
0 w. W. b! w B% c4 m: j: c populations within our EB culture system. Previous studies have identified
/ o+ ]$ }! [% v& q& y the development of Oct4 positive primordial germ cells (PGC) following 12
) [9 D2 N; Z9 [ days of ES cell differentiation [68-70], suggesting the Oct4 profile may
& x0 Y2 P0 T: ~/ W! H% \2 o; } ] alternatively represent the expansion of non-ES cell populations. Although
3 t& x/ F, D4 C- e1 F. g the expansion of some undifferentiated ES cells is possible, the array
3 Q V, h+ u4 m8 u8 p! k& @ profile does not suggest global persistence or up-regulation of ES gene
/ ]. q* j! i. Z" c2 n [3 ^6 |. s$ e expression late in the EB program. Also, markers of mesoderm induction
1 V) L/ H) l6 k5 G, G+ g {. Z! P such as brachyury and Mixl, do not show persistent up-regulation following
5 B* K8 R3 g m3 T7 G3 v peak expression at day 4, suggesting EBs are unlikely to harbour cells . r7 q9 {- o2 {7 R
which are delayed or arrested from entering the differentiation program.
q- ]7 |; x& v9 r We found significant differences in cardiac gene expression during EB
$ A" y" y' d# n differentiation in serum and serum-freeB4L culture. Our current 7 D. s' Z6 n/ I
understanding of cardiac development provides possible insights into this
( t: B! {5 q% g! o% T observation. Briefly, in vivo studies have revealed BMPs secreted from the + W* Q! B+ d3 k
anterior lateral plate, and ill-defined signals from anterior primitive
' }" i% r3 ~1 S/ ?0 p% `0 t7 N endoderm, are key inducers of cardiac development [71-73]. The
~+ @+ g. X# {! l administration of recombinant BMP2 or BMP4 to chick explant cultures
2 m, s+ |1 J- |, H induces cardiac differentiation in non-cardiogenic mesoderm [71] and Bmp2
G7 p" o6 g4 N8 b knockout animals develop cardiac abnormalities [74]. Conversely, Wnts 8 }! D: L3 n% F- x5 Y$ t
secreted by the neural tube are strong suppressors of cardiogenesis.
- J: l7 S) H$ @% P" [ Together, these opposing signals act to establish the borders of the heart
! K6 w. n3 D) |% C5 ^0 y4 I' `. H field [71,75]. By mimicking the environment that establishes cardiogenesis 7 G1 B4 j7 |6 I- \- C) i& i
in vivo, assays capable of directing cardiomyocyte production from ES & w( M: Z) }$ h
cells have been established. BMPs can efficiently enhance the cardiac $ h! N5 P3 o6 p4 r3 A4 U
program when added to EB culture [76,77], whereas BMP inhibition . a# D7 A$ u! P6 ]& L* G
drastically suppresses this outcome. In our hands cultures supplemented - \( V2 E6 A# S; b
with BMP4 (2 ng/ml) supported cardiomyocyte maturation (as determined by $ U8 W3 w6 z: u6 \
expression of cardiac specific genes) with greatest expression detected at
9 y ?8 a& F( f6 w Day 8–10. This correlated with increased numbers of spontaneously beating 5 ~$ m' ^+ v& L! d
EBs, and the timing of initial spontaneous contractions during murine
6 Y1 W; B+ R- q2 O3 U; ?* b; |/ s embryogenesis [78]. The cardiac program was significantly reduced in
2 E: k9 G3 Y4 Q6 A6 u culture containing 10% serum, an observation supported by a number of ' y# W1 O8 H+ m4 S. d# H
other studies [79,80]. This suggests the constituents of serum are
! U: s! |# l( I; M% B0 _0 ?8 I2 e inhibitory to cardiomyocyte development. Although a detailed assessment of $ b! a. L5 C4 Y2 | H4 p9 Z+ N
cardiomyocyte differentiation was not the focus of this analysis, the . y- {/ b3 q6 e* l' [* j
array output, qRT-PCR profiles and morphological observations described
' b5 R3 Y0 p# n6 G) j" X* Z! T+ _& ^ supports the usefulness of this assay in future investigations. In
1 D) W" R, ]5 X9 K( ? addition, the defined constituents of serum-freeB4L media provides an : j0 X7 ]! p v$ o, _% U
excellent opportunity to identify additional recombinant factors required
) `6 p4 ~, d7 Q( Y; y to further expand cardiac progenitor cell production from EBs.6 v' [" L) W/ x/ X; q! w1 ], A4 O
In contrast, the hematopoietic program was more pronounced within EBs
- F3 Y6 t2 H' k7 @' a grown in 10% serum than BMP4 alone (Figure 1 and 4). In particular, EKLF * `0 S0 g/ ] c, e C0 B( G7 @
(Klf1) expression was significantly reduced in serum-free media 9 a" e* {% l2 O
supplemented with BMP4 (2 ng/ml) compared with serum (Figure 5A and Table / l) [( U# l* f3 k) d. M# u, i
4). EKLF is essential for regulation of a large cohort of erythroid 9 _& w0 |1 O$ h5 h
specific genes [81,82], many of which were identified in sets 2 and 3
) ?: h4 f0 |$ n. H. ] (Figure 4). Thus, it is likely that reduced EKLF expression in SFB4L media
1 l. s, ]1 v! k1 e) | directly results in inhibition of a cascade of erythroid gene expression.
- ^3 F6 ^7 y7 ] x Previous work showed the addition of BMP (at 5 ng/ml) to serum-free ES + Y9 x6 t, }0 _5 A
cell culture induces EKLF expression and restores hematopoietic cell
5 C# E/ ?" V" L0 p7 ~ differentiation [83]. Although a weak hematopoietic program was observed 9 q) Z. C# _% A. u
in SFB4L culture, the concentration of BMP4 used was less than that used
+ m2 d/ n2 i3 s5 Z, r; P* b: N' W, O by Adelman et al. Thus, we suggest a robust cardiac (anterior-ventral $ x( ]( ~! }3 X3 D* ?
mesoderm) gene expression program is induced by low conentrations of BMP4 . g! `7 ^( O7 g1 n+ Q- B. j8 d( p
(or other BMPs), whereas a robust blood gene expression program 7 e5 C+ `8 U. {( F8 [. P
(posterior-ventral mesoderm) requires >2 ng/ml of BMP4 or additional 8 Z$ b# e! x ]6 G* \
growth factors. Recently, mesoderm derived progenitor cell populations for
/ m- h1 @/ B$ i& l2 U* x' v. h hematopoietic and cardiac lineages were studied during EB development
, z [) K. n1 b7 r$ ~/ u using ES cells in which GFP is targeted to the brachyury locus [80]. 6 I2 f" X' k7 c" i" U
Within day 3.25 EBs, GFP+Flk1+ cells were shown to represent hematopoietic / X4 Y0 U9 Y; i. \
precursors, whereas GFP+Flk1- cells were significantly enriched for
7 a3 R5 @5 b0 E6 U9 p6 i0 q cardiac progenitor populations. It would be interesting to determine if 9 n1 |6 |; \) Z# |5 }
the ratio of Flk1+/Flk1- within the brachyury positive population is 4 x8 l1 f+ W" t/ t. z" t
altered between our two culture conditions.
b& }# j$ ?1 r, J A number of the Kruppel-like factor family of transcription factors were
4 m/ P4 S n+ z# F* m# Q4 G dynamically expressed in the first few days of ES cell differentiation. It $ \3 D2 b- S$ Y& r) M& k
was initially surprising to find highly enriched Klf2 and Klf4 expression
: ~% n/ t4 }, s1 q b in ES cells, since both are considered markers of terminally , l7 E" V7 ?- U; H
differentiated cell types such as skin [50], gut [84], vascular smooth
9 C, {; T8 X& ^' m: `7 r muscle [49,85] and lymphocytes [86]. However, recent evidence suggests
& J0 L% \% ?% J6 _9 n KLFs may regulate stem cell function, since Klf4 is enriched in ES cells
1 ]& G Q9 g" O [19] and forced over-expression within these cells can maintain 5 R' C8 I" j( j
pluripotency in the absence of LIF [51]. Furthermore, Klf4 can bind the 4 S& f/ o) L% ]# s5 m) ~; i5 d
Lefty1 gene core promoter co-operatively with Oct4 and Sox2 [65]. Although
: `9 }: q! s1 }* { Lefty 1 and 2 are best characterized as repressors of nodal, acting to , k8 Y) f O$ D2 X2 w8 H
regulate left-right patterning [87], our results and those of others,
6 o" X7 i6 {/ L4 Q suggest a possible redundant role for lefty proteins during the
3 z& |; f; @9 Q3 h maintenance of ES cell pluripotency [65]. Similarly to the Oct4 expression ; u) [. i. h% \
profile, many of the KLF family members also increased in expression late 0 _5 X* P7 v: ?
in the differentiation program. As mentioned, the KLFs are expressed in ; C+ ?+ X. A; g6 ]' Z) j
diverse tissues during development. It is therefore expected that the * {& q# O4 @& [7 f2 K) Z; A b0 I7 K
profiles obtained reflect the generation and early specification of + ~" t0 z) _& `9 s1 s9 g# Q! W
mesoderm and endoderm cells following primitive streak gene activation at
3 D$ b" y" o' |$ l day 4.
8 N- }% n! W: B; w+ [ Based on the likely identical binding specificity of the KLF family,
- H# f( ]/ j- K0 g; u established transcriptional activation and repression roles of certain
7 Q% n2 j, c, n" u family members [44], and bioinformatic evidence of a high prevalence of ' i9 `) l) _, r# S% s, {
KLF TFBSs in many stem cell genes, we propose KLF competition for * d2 g w3 U W N7 L9 D
occupancy of these CACC box elements might determine self-renewal versus 5 O u& y/ p+ r9 K
differentiation of ES cells. According to this model, high levels of KLF2 1 V$ k3 Z8 G( W$ B. o$ V' c# O9 W6 X
and KLF4 expression in undifferentiated ES cells would lead to occupancy
" l# C6 A. Z$ x( p0 ]7 _ of CACC box elements in promoters of stem cell genes such as Pou5f1/Oct4, 9 R6 N Q6 ^8 r1 f( B% o/ d
Nanog, Esrrb, Zfp42/Rex1, Lefty1 and Lefty2. Furthermore, the Klf2 and " c( {. e8 ?5 U4 e1 [, R. v2 G
Klf4 genes themselves have CACC box elements in their proximal promoters
: @4 J" J( ^( y (Figure 7A), suggesting a positive feedback loop within ES cells is
0 k* [+ ]2 [) }, s likely. Interestingly, gene knockout and lentiviral shRNA gene knockdown
& ~, m$ t- R/ D& S of Klf2 or Klf4, does not lead to an obvious stem cell defect
; @; m& H. m+ |9 a& Z! ~/ c [19,49,50,85], suggesting these two KLFs may have functionally redundant
0 j1 U1 v# W- I; r roles during the maintenance of pluripotency. Future analysis of KLF2/KLF4 ) r7 c: `( f8 V9 Q. c
double knockout ES lines or the knockdown of both proteins using RNAi
2 \! l- d3 u# x technologies will be necessary to validate this hypothesis. Upon ES cell 0 T1 G# n2 T0 y: P+ Q
differentiation, down regulation of Klf2 and Klf4 was very rapid, whereas + v% M" j* G4 L
down regulation of Oct4 and Nanog was delayed for one to two days. We % ]1 ?4 [6 H! ]2 K/ g
suggest loss of KLF2 and KLF4 binding to Pouf1/Oct4, Nanog, Esrrb and 6 O& e4 B2 D; s" x5 R
perhaps other stem cell promoters, could be directly responsible for their
: ]. y5 j: C; B3 c1 q5 Z1 H, k! R* J down regulation. Moreover, Klf5 was down regulated more slowly, and Klf3 2 v1 i2 b, g1 p; W- k
was up-regulated in the first two days of differentiation, suggesting
1 R6 a% t/ N% e7 c" t4 k these KLFs may function primarily as transcriptional repressors at the
0 x: I6 U: I* n, Y2 C$ B: L Pouf1/Oct4, nanog, Zfp42 and Esrrb gene promoters. Once this
1 L& X7 t5 v# G1 w& Y differentiation driving transcription network is activated we suggest 2 K# [6 X; ~7 @5 e( Q# @5 W
KLF3, KLF5 and KLF9 can accelerate loss of the stem cell state by directly ) A9 c( `) _. t9 L* }) Q
repressing expression of the Klf2 and Klf4 genes themselves. Therefore, a ( ^7 r% D! ^8 E6 D$ c8 e' W
combination of cross regulation of transcriptional outputs, and 1 ^8 U: P8 K' \, S
competition for occupancy of key stem cell promoters by KLF proteins with
5 y F2 K* G6 N2 N differing biochemical properties but identical DNA-binding capacities, may 1 j4 \8 }1 X. L
determine self-renewal versus differentiation outcomes. Again validation
$ t) T R P7 y of this model requires functional analysis. It is important to note we ! |% f5 e$ p; j+ |( W
have not proven occupancy of stem cell gene promoters by the KLFs in vivo.
0 ]6 H+ M9 z9 y1 S( I% S Chromatin immuno-precipitation (ChIP) experiments in ES cells and
0 T2 n* u! J0 K# P4 q8 v; s differentiated populations are essential to validate this model, however
k* @. p5 k. O1 ^% s2 F6 x6 |0 u ChiP grade antibodies are not currently available.
9 Z5 c% z$ ]; R8 i* C2 P$ ^ This scenario is not necessarily limited to ES cell differentiation. A
9 O% N3 G7 M' U similar program may take place in many adult stem and progenitor cells 3 o5 h% u5 N) q6 ]# F7 L
populations. Certainly competition between Klf4 and Klf5 has been 7 t3 o. n8 x8 d. V
suggested to regulate cell growth [88], and Klf1 and Klf3 compete for
, [$ n, R' t% |9 w0 Z9 @3 h/ J binding and have opposing effects in blood cells [89]. Furthermore, an ) a1 V" H$ o1 N
imbalance between occupancy of key stem cell or proliferation gene 9 X& v& q; }# U+ x) V
promoters by KLFs, with differing activating and repressing functions, 2 _' c1 d3 @* v$ J0 l0 J# ^' q
could be responsible for the development or progression of many common
8 |3 b" a* L7 I; t forms of cancer. For example, loss of hetero-zygosity for Klf4 and Klf5 5 K4 z C M/ g: Y1 L. g! z: R
has been found in colon, stomach, breast, prostate and liver cancers ! t5 j0 V6 P6 f, B2 M* k
[90,91]." w9 p" z, x! c' L
In addition to the proposed complex transcriptional interplay between KLFs ; O4 X) ~- I6 }. p: X2 U
and stem cell gene promoters, direct protein-protein interactions and ! _ F4 O* B' o
protein networks involving KLFs are possible. We showed sub-nuclear
/ p$ g+ ^8 C f, L6 ` localization of KLF2 and KLF4 with OCT4 in ES cells (Figure 6A), and KLF4 , l" f1 m! @. u- m
can co-operate with OCT4 and SOX2 to drive expression of the Lefty1 core
5 E4 t2 v3 \. }& J promoter [65]. Also, very recent biochemical purification of , O; n9 a; V. k! K
NANOG-interacting proteins discovered KLF4 as part of a NANOG-OCT4 network 7 a7 W7 E: I! |" T `" `
in ES cells (Stuart Orkin, personal communication). This is also
1 f+ A# o8 d$ ]/ h consistent with the ability of KLF4 to co-operate with OCT4, SOX2 and
/ H" V' r( V8 L% j* ~ Y% j- ]' F c-MYC to drive de-differentiation of fibroblasts into ES-like cells [24].
) M. g- |& s0 @7 ~! s- L Conclusion
, l# J; ^- `" z- ?6 E Defining the genetic regulation of ES cell self renewal and
# \# ^/ t% k. G differentiation will be instrumental for the development of future cell
8 p/ a+ `8 M8 V3 V+ s, t0 K. q' [* s based therapies. Using 47 K Illumina® Sentrix bead arrays, the
: c0 {" @9 |5 B* W" U6 v* U differential expression of genes during 16 days of ES cell differentiation + ? T f& s3 p" _
was determined. Hierarchical gene clustering and correlation statistical 8 c# k7 {% J# h3 y$ U( H
analyses lead to the identification of a small cohort of genes which
+ n+ R& S1 Y' A7 E define the stem cell state. Historically, ES cell differentiation is 5 A9 g3 L) ^6 A# R
achieved in culture media containing 10–15% FBS. A direct comparison
8 d$ K0 C9 _6 c+ M$ A. t between ES cell differentiation in serum and a serum-free media was ( @1 s+ e4 Z- n, Y- f. @0 ?
undertaken. Surprisingly, global gene expression profiles were comparable * ~! L) ]/ u' W! n7 e7 m S% e3 l0 x
between culture conditions, with the exception of mesoderm derived cardiac
B1 |$ w3 U- v9 _ and haematopoietic transcripts. These results support the use of defined ; Q/ @' G. @1 v7 i
serum-free approaches for the directed differentiation of ES cell / f6 ^$ _5 Z# k( n% C. H7 g. Y
commitment. Expression of many KLF family members were enriched in ES ) v1 g( ]$ a F9 G* z: y: x5 |% Z
cells and rapidly down regulated upon differentiation. KLF2 and KLF4 were + z1 f4 j8 B4 l: E b" B+ ?3 \
co-localisation with OCT4 in ES cell nuclei and KLF transcription factor a O4 G" Z0 Z0 N
binding sites were over-represented within the promoters of the putative
0 c# W: H: _0 f B5 a: I ES cell gene list (p < 0.01). Taken together, this data strongly suggests 6 o4 g8 I9 ^8 S) t3 ^( g; o
KLF family members regulate the maintenance of ES cell pluripotency.6 ]1 N. } A3 D: q: O
Methods1 e# k% I8 @5 e
ES cell culture, embryoid body formation and immunohistochemistry
6 m& ^; J5 e8 d W {! S W9.5 ES cell differentiation was performed as described (Bruce et al., . d/ N. p4 o. l' Y: T+ p
Differentiation, in press). For immuno-fluorescence, feeder depleted W9.5 & k8 H3 e1 ^. y- O' H2 s8 _$ J, i
ES cells were seeded onto sterile gelatin-coated glass cover slips in ES . [3 y K* W/ u8 D' y4 d
cell media, fixed in 4% PFA for 10 min then washed in PBS. Cover slips
0 e) R9 g: e2 A7 C: w were boiled in 10 mM citric acid (pH 6.4) for 10 min, washed in PBS and 1 u- c$ \, a2 [- _0 U+ `, o
permeabilized in 0.18% triton X-100 in PBS for 10 mins. For OCT4/KLF4 3 W5 J8 M0 E3 k. p
detection, the cover slips were blocked in 1% BSA then incubated with * U+ k4 X5 w6 S/ {6 L6 m
rabbit polyclonal antibody raised against OCT4 (Abcam #ab19857) and goat : w" H, W; |1 p7 S$ [9 \
Anti-mouse KLF4 (R&D Systems #AF3158) overnight at 4°C, then incubated in + m* H0 O" g# r A2 \( @
goat anti-rabbit Alexa-488 (1:400) (Molecular Probes #A11008) and donkey
S3 o- I9 h7 f* a8 J+ v8 I anti-goat Alexa-647 (1:400) for 1 hour. For co-detection of OCT4/KLF2, the
% S/ o4 ^4 ^5 }& l& F) | Zenon Rabbit IgG labelling kit was used (Molecular Probes #Z-25308). 1 μg
2 u' A( u @" T- A) C$ v of KLF2 rabbit polyclonal antibody (Santa Cruz SC-28675) was labeled with % q# s: U1 X' b, h
Alexa647 Fab fragment following the manufacturer's instructions. Cover
1 o4 w2 C8 z7 t& h) E5 ^$ e+ C8 | slips already stained for OCT4 (Alexa488) were incubated with labeled KLF2 - c9 O9 ~5 X- z4 J& q4 m
for 1 hr then washed and post-fixed in 4% PFA to ensure strong signal m4 O: y4 v9 G% m1 h% y# D2 o
detection. DAPI 1:5000 (Molecular Probes #D3571) was used to detect
G* R1 ?* V! t, a) y4 a nuclear localization. Cover slips were mounted on SuperFrost® Plus slides
) ]- ^0 t) R6 L/ _5 L! a: V1 ~ (Menzel-Glaser) with VectaShield mounting media (Vector Laboratories
3 z" z& I% S4 P# g7 { H-1000). Confocal microscopy was performed on LSM 510 META Carl Zeiss
( H! F, r8 ^/ s0 ^4 D microscope system.
9 ]" S0 _& ?" o8 ^ RNA extraction, cDNA synthesis and real-time RT-PCR data analysis" I& O/ N7 ^& B1 n' y+ u2 M
Total RNA was made using TRIzol (Invitrogen), and cDNA was synthesized ) h8 _/ r1 U3 u& U
from 2 μg of DNase1-treated total RNA using Superscript III (Invitrogen)
, F/ |9 w& R* ?1 R, b3 t and oligo-DT12–15 (Promega) according to the manufacturer's instructions.
" `3 v' {8 Z8 d0 m; Z- r" C, z! ?- \ Quantitative RT-PCR (qRT-PCR) was carried out using the Applied Biosystems $ I$ p' ]: ^5 j4 W a
SYBR-green dye system and 7500 Real Time Cycler in 96-well plates. Cycling
# L1 g# A0 h8 h( u variables were as follows: 50°C for 2 minutes, 95°C for 10 minutes, then 0 n1 c k8 {4 E+ t- ]
40 cycles of 15-second denaturation at 95°C and 1-minute at appropriate % B" Q4 L* p6 c, q0 B( h6 L- `
annealing temperatures, optimized for each set of primers based on ( d4 C8 M9 G3 ]0 v: Z0 K
dissociation curves (Additional file 8). Expression levels were normalized
' f1 Y# w: Q! x7 N to HPRT as determined from the ratio of delta CT values. Mean of relative . n7 h8 O2 c: x0 U1 n8 B
expression ± SD was determined from three biological replicates.
& V0 P3 z8 z) e# J7 ?' { Sequencing of gel-purified amplicons was performed to ensure correct * B3 s3 g' `- C$ A% {& K# t
product amplification.) D. Q$ ? y& y' k; v* R+ N
Additional file 8. Oligonucleotide sequences used for qRT-PCR analysis.
4 \$ l- y g) ~( x- i) C( W5 [7 k$ q Lists the oligonucleotide sequences and cycling conditions used for the # Z" G+ s* q1 J+ T |
qRT-PCR analysis.7 @9 B' e& ~3 }2 F5 {
Format: DOC Size: 48KB Download file
u" E Q: W$ r/ P4 M This file can be viewed with: Microsoft Word Viewer2 S3 |" W8 e+ w9 B
Micro-array hybridization and analysis% M4 i# x/ x! I2 J3 o# R: O& F9 `7 p
Expression profiling was performed using Sentrix Mouse-6 Expression , k. Q5 K' T# A1 U7 \
BeadChip arrays from Illumina®. RNA was assessed for integrity using the
Z4 v+ p0 B5 s5 Z Agilent Bioanalyzer 2100 and RNA integrity (RIN) scores above 9.5 were
' }+ W9 b/ a5 N2 n present in all samples. Amplification was performed with 500 ng of total & ^ M4 N$ k7 F% F
RNA using the Illumina TotalPrep RNA Amplification kit (Ambion) with a 12 ) ^. X* M: P. n, q
hour in vitro transcription reaction period. The quantity and quality of
2 P1 _& Y7 ]( s1 C% v) G- E biotin-UTP incorporated cRNA was also assessed on the Agilent Bioanalyzer
" k. b+ }; j# m& ~$ t* m; b 2100. Amplified cRNA (1500 ng per array) was hybridized to Mouse-6.v1 8 m+ K" v' Y) {
BeadChip arrays according to the manufacturer guidelines and detected with
& G9 W1 h3 u; }/ F5 p& ^; X6 y Fluorolink Streptavidin-Cy3 (Amersham Biosciences). Arrays were scanned
9 C& v+ }4 |7 f( G2 e( q. C4 Y9 q using the Illumina BeadStation Scanner. The raw intensity values obtained
}, c, y- E+ u# s- W for the scanned array images were compiled using the proprietary
% F7 F' ?5 L4 t) t( E6 G BeadStudio v1.5.1.3 software and imported into GeneSpring GX v7.3.1
& A- H5 m8 T3 A2 m; V (Agilent). A mouse Illumina probe set was defined in the GeneSpring
, Y4 T/ Z% |6 R6 W Workgroup using the Illumina targetIDs as the unique identifiers and
+ V" R+ f0 j; A3 c7 J& K; T annotated according to array content files supplied by illumina®. Data 1 ~3 T3 `: `- L$ G
normalization was performed by first setting all measurements less than . Q7 O; R' O- ?
0.01 to 0.01, then applying per chip normalization to the 50th percentile,
& |2 c9 P v n# L% b5 R" I6 V' m and per gene normalization to the median.( |% d) b* c6 U6 s- r+ \
From an interpretation which included the three FBS experiments, a
$ i7 |" p5 C$ z- m6 ?) u/ |9 e non-parametric Welch ANOVA (where variances were not assumed equal) was
3 s. z' R2 o" B+ ^# h' ] performed on all 46,120 probes to find a subset of genes whose expression
3 A5 v S5 g: U& `! X+ l9 | varied significantly throughout the differentiation time course. A 4 R; X! q; X8 R, R
Benjamini and Hochberg False Discovery Rate multiple testing correction : b$ W# q N* u0 U5 D5 B
was applied to reduce the number of false positives. This yielded a set of
+ z! S$ y! Y: B3 h" h8 g* @ 7,967 probes that showed statistically significant (p < 0.05) differences
( P, x. u" X1 R" p4 d2 c3 C in expression by Welch t-test. This approach simplified running the data
& e& E: D4 |/ c/ P1 ^- B mining algorithms to find syn-expression patterns by removing genes such
9 a2 P5 k1 M# Q, t( C+ l3 D as housekeeping genes which did not display differential expression, or & W O0 ?: d i! B. y4 R F
genes which did not reach a threshold of expression at any time points.9 ^8 P8 N9 L9 H* N3 Y& O6 G
Genome interpretations were generated in GeneSpring GXv7.3.1 (Agilent) in 0 S3 A1 Q x Z6 M0 C6 k
which the mean relative expression level for each gene was represented at " ~6 }' w9 U% I
each time point relative to the average of all time points. A number of 8 B& q, k' i5 }. s" M" Z
statistical approaches were used to find syn-expression patterns. A - a8 T* i A8 y, L
Pearson correlation (between 0.9 and 1.0) was performed to find genes with & B$ Q& C& l2 G5 f
similar expression profiles to Pouf1/Oct4, nanog, sox2, fgf5, wnt5 and
% G4 n. C7 V. X% d: H brachyury. Hierarchical clustering was performed using the gene by gene Z" S0 A/ X2 {& u+ Y) S
and group by group 'tree' algorithm within the GeneSpring (Agilent)
/ U, ~0 X2 E; i9 n) D7 t7 i program. This compares each probe with every other in the lists to " J& I0 k( B- E5 s/ n4 }
generate similarity statistics and a tree representation of similarity % R; h4 K4 H- a6 c- D; ~, ]
based on a Pearson correlation. Also, quality threshold (QT) clustering
+ | ]7 C0 a8 [# p+ b$ L1 q8 Q [92] was used to define distinct subsets of co-expressed genes. The entire $ o1 Z" C6 G& D" S* F, g& |( X
data set for the experiments is available via GEO as well as via our
# d% A) N/ F5 J, W \! x5 x instance of Genespring (contact corresponding author).
/ s8 Q# | ~ H" {, r- W Electro-mobility gel shift assays (EMSA)
0 d* ?! D/ Q: O T Nuclear extracts were made from undifferentiated ES cells and embryoid
! X' l1 F( R6 N8 N& W/ e. p" I( p bodies grown for four days in 10% serum. Electromobility gel shift assays
" k. \- q6 e7 Q& B* v (EMSA) were performed as previously described [61] using double stranded ]& G r* t6 @/ {
oligonucleotide probes corresponding to an extended CACC site in the
( W- i: S( X, `) h8 x p18INK4c promoter (sense strand 5'-gttgggcggggcgtgggcggggcc-3') (Tallack,
5 L% d- y; B: ]* i6 B% L9 E9 q et al., submitted). Supershifts were performed with specific antibodies 7 d9 W! M9 v* f; w j- t8 m
raised against SP1 (Santa Cruz SC-059x), SP3 (Santa Cruz SC-644x), KLF2 7 P" a) `- Z4 P* h9 q0 ~
(Santa Cruz SC-28675), KLF3 [52] and KLF4 (Santa Cruz SC-12538x).1 \ n P/ t& e' M0 k! |" c
Bioinformatic searches for CACC sites and other over represented elements ; D }2 Y2 ~2 r3 F
in promoters of stem cell genes
: x7 Q" u/ D2 x" O* h6 X The data-mining tool Biomart (accessed via Ensembl release 36 of the mouse
: d( B7 W4 l' a+ ?/ a& u" j genome), was used to collate 2 kb of sequences upstream of the
4 s0 G0 ]+ t) q transcriptional start sites (TSSs) of all genes listed in Table 1. The
5 ]3 r7 O% i0 H Clover program, run via the MotifViz web interface [59], was used to
8 T1 o5 i8 \/ X1 K6 D search for over-representation of position weighted matrices (PWMs) for 2 x$ ~1 [8 b; C4 v, `; Y
octamer [93], Oct4 [60], Oct-Sox [60], KLF4 [54], E-box, GATA-1 [94] and * U7 `8 u* | b
CACC (KLF-A) elements in these promoter sets. The PWMs are listed in
7 Q f [2 t% G4 i7 r. h2 b Additional file 7. The ECR Browser [95] was used to determine phylogenetic 2 V+ e) z0 @0 t" V( `1 {, M
conservation of cis elements between mouse, rat and human genomes." k7 F) M, I4 V, S* \
Abbreviations- I$ z; s$ \ _2 \+ f, O
BSA, bovine serum albumin; EB, embryoid body; EMSA, electromobility gel
4 {: z; f$ n& L3 W0 H8 i. H0 T0 F shift assay; ES, embryonic stem; FBS, fetal bovine serum; KL,
4 d2 z" D0 w+ e Fkruppel-like factor; LIF, leukemia inhibitory factor; MEF, mouse % }# Z( Y6 l& h3 }) c) Y) z% \. z
embryonic fibroblasts; PFA, paraformaldehyde; PWM, position weighted 4 l! u+ Y E. d8 I0 o3 c' U! @1 k
matrix; TFBS, transcription factor binding sites.0 D& ~) \* Q: Y; i' |3 |! d
Competing interests
, `% a6 I: M3 { The author(s) declares that there are no competing interests. L; s* L5 ~" @! F% k
Authors' contributions
' M1 n7 D' c7 S# o |6 D# a SB established the EB differentiation assay, collected and purified RNA
# }; v5 n6 f- E; ^; z. n samples, designed and carried out the quantitative RT-PCR studies and
6 a1 d, w; m/ D7 S drafted the manuscript. BG carried out the micro array protocol. LB $ P; R- p% v/ u
Carried out experimentation of mesoderm specific gene expression. MG + Z5 f1 l0 _1 ^
performed the array statistical analysis and annotation. SG participated
- G/ X5 ?" P3 X) \ in the design of the study and performed the statistical analysis. AP
5 ?6 m9 a( X3 x conceived the study, and participated in its design and coordination and
6 l0 d; `( v, J0 x, V) B4 U7 t O$ v helped to draft the manuscript. All authors read and approved the final
6 K6 ~) T& W8 o0 i- T manuscript. |
|