干细胞之家 - 中国干细胞行业门户第一站

 

 

搜索
朗日生物

免疫细胞治疗专区

欢迎关注干细胞微信公众号

  
查看: 15017|回复: 1
go

Dynamic transcription programs during ES cell differentiation towards [复制链接]

Rank: 1

积分
威望
0  
包包
5  
楼主
发表于 2009-3-6 21:08 |只看该作者 |倒序浏览 |打印

  Dynamic transcription programs during ES cell differentiation towards 3 m% u, M4 M, s3 s$ V
      mesoderm in serum versus serum-freeBMP4 culture

      Stephen J Bruce1,2 , Brooke B Gardiner1,3,4 , Les J Burke1,4 , M Milena , _- h7 }. f+ N8 m- v) S6 k
      Gongora1,3 , Sean M Grimmond1,3,4  and Andrew C Perkins1,2,3,4
: c  X4 _6 n1 V0 p2 R7 |1 z      1Institute for Molecular Bioscience, University of Queensland, Brisbane,
! |* u! @3 K" Z- O      Queensland, Australia& S6 k3 E( ^7 H+ k( A, T: s& `( q
      2Wesley Research Institute, Queensland, Australia
& G, W+ X6 a# D1 x' [- w' T      3ARC Special Research Centre in Functional and Applied Genomics, Australia& U. t5 U3 K6 [, E' B
      4Australian Stem Cell Centre, Melbourne, Australia
! H" @0 x6 ~# t3 X       author email corresponding author email( A/ u2 b7 {/ t/ \: F! L
      BMC Genomics 2007, 8:365doi:10.1186/1471-2164-8-365
3 }2 A% Y, w) E* P7 E, {      The electronic version of this article is the complete one and can be
5 q7 V* o, w+ s      found online at: http://www.biomedcentral.com/1471-2164/8/365
2 q% I  e" H) `- ?            Received:28 May 2007$ _7 G  ]9 y( D! r
            Accepted:10 October 2007( W* m. Q1 H+ C. u; Z, P: ~' w
            Published:10 October 2007
. t( A( T' u2 f' ]  P
/ r7 z# M( g) }1 v      © 2007 Bruce et al; licensee BioMed Central Ltd.
' v$ B: p" n9 @4 B: [      This is an Open Access article distributed under the terms of the Creative
& q1 o- j6 s; k+ c, d      Commons Attribution License (http://creativecommons.org/licenses/by/2.0), # N3 e  Y4 x; m1 u# K; N
      which permits unrestricted use, distribution, and reproduction in any / a8 \. H9 [0 w1 m: s& O- f
      medium, provided the original work is properly cited.
4 r9 N  U3 X! |% i+ t; G  w      Abstract
: K5 ^, ~  n/ h. u8 h6 _      Background
7 K7 a% D0 M" {. [; j2 s      Expression profiling of embryonic stem (ES) cell differentiation in the
- V2 N) B' P  X# Z/ s      presence of serum has been performed previously. It remains unclear if 4 W9 r; W$ k; K1 u; c. r' m) W
      transcriptional activation is dependent on complex growth factor mixtures % L( A& c, j3 ^# L6 g* v
      in serum or whether this process is intrinsic to ES cells once the stem 5 y# d7 F9 _6 E/ Q
      cell program has been inactivated. The aims of this study were to
$ ]8 m" [. s+ j$ F5 p& m+ L      determine the transcriptional programs associated with the stem cell state 1 p; x9 E% r: }7 `: A+ t0 e
      and to characterize mesoderm differentiation between serum and serum-free 7 S9 M+ M. [% q+ ~
      culture.
3 k2 I3 q6 {7 p/ u      Results) c! `1 ^$ z. E  U
      ES cells were differentiated as embryoid bodies in 10% FBS or serum-free % }: \* }6 R1 v$ J
      media containing BMP4 (2 ng/ml), and expression profiled using 47 K 9 j6 G/ e: m5 o
      Illumina(R) Sentrix arrays. Statistical methods were employed to define
% i+ U3 o8 P; w9 p1 q$ @6 T      gene sets characteristic of stem cell, epiblast and primitive streak
) x4 G6 Q" l( Z/ V! d; @' \      programs. Although the initial differentiation profile was similar between
" {2 D2 Y" A" {0 b2 }: ?& y1 \      the two culture conditions, cardiac gene expression was inhibited in serum
! H$ V" w) M4 D3 U$ e8 c! c, G      whereas blood gene expression was enhanced. Also, expression of many
' d) x5 k  U" \: I/ f      members of the Kruppel-like factor (KLF) family of transcription factors 7 r4 U4 v! ~6 m( S2 X: `
      changed dramatically during the first few days of differentiation. KLF2
2 m2 }/ o8 j9 h* h% o* e      and KLF4 co-localized with OCT4 in a sub-nuclear compartment of ES cells, 9 `% M& X3 F6 P
      dynamic changes in KLF-DNA binding activities occurred upon
  F/ \# U' @' ]$ T      differentiation, and strong bio-informatic evidence for direct regulation 7 Q* |4 S' Q3 D3 {3 E
      of many stem cell genes by KLFs was found.# _" G: w$ R9 b' y% d$ g- `
      Conclusion6 K- T8 C6 S) `9 w% E% ]
      Down regulation of stem cell genes and activation of epiblast/primitive
( I# B1 X- R( B. I0 }1 A      streak genes is similar in serum and defined media, but subsequent
3 p. \% O$ y( N      mesoderm differentiation is strongly influenced by the composition of the
6 K1 X1 C- |6 L/ }# u      media. In addition, KLF family members are likely to be important
; w# ~4 |0 {9 S* O1 A% z! @      regulators of many stem cell genes.
* J6 B! V0 }, r) s$ Z      Background
. B6 ^- X0 T$ l: p7 a  `+ w$ c      Embryonic stem (ES) cells isolated from the inner cell mass of the early $ y# F& I/ h) N1 n6 v9 e" n
      mammalian blastocyst-stage embryo retain pluripotency when cultured on 5 @5 b  v1 S8 _; K; ^
      mouse embryonic fibroblasts (MEFs) in the presence of leukemia inhibitory
$ s/ I' L4 t9 [8 y! N      factor (LIF) [1]. In the absence of LIF and MEF attachment, ES cells $ i, [) `8 a$ A) b" Q6 ?
      spontaneously differentiate into multi-cellular aggregates termed embryoid " H( L3 Q* Q) b
      bodies (EBs). Although the spatial complexity of organogenesis is not : A  S/ W: `  R3 h+ _
      established during EB maturation, the dynamics of gene expression closely
* ~9 m3 R' U: x8 y! O      mimic those which characterize early stages of mouse development [2-5]. 1 Z. }( m) D: o7 A( B$ U
      Thus, ES cell differentiation is an excellent model system for the
6 x: D& u) |4 L, U3 y# Q3 H      discovery of genes involved in developmental processes.
% Z' L/ Q2 f* q1 S      Many studies have attempted to identify genes that define the stem cell 2 B& R. X2 x- M; J* X9 Y
      state by mining for genes co-expressed in ES cells and various other stem
& q6 l, G' i9 s; h7 \2 K5 B0 u: {      cell populations such as hematopoietic and neural stem cells [6,7], or ES . _6 f- t2 D& l1 g/ |1 H
      cells and trophoblast stem cells [8]. Some of these studies have been
/ R6 J. A( F- b* f1 i' m. Y9 I      criticized because they were unable to define a similar cohort of stem 6 d2 {) @& |! f% F+ r' c! o
      cell genes [9]. Also, this approach is likely to find co-expression of
  U- ?9 @& Y9 z% W& B      'housekeeping' genes that are irrelevant to the stem cell state. More
; y/ h4 z6 ~9 H+ Q) j( Y      recently, dynamic expression profiling during ES cell differentiation has / x( S! e. E6 g6 Y! G
      been undertaken using various EB culture techniques [10,11] and the
+ m/ h" `8 W& x) i: o/ D      addition of exogenous growth factors [7]. Most of these studies followed
+ P! l0 h' O5 Y/ e8 t      differentiation for a short time period (up to 6–8 days) due to inherent 0 C; Z; m- Y  v# `5 f/ E" S
      limitations of liquid or adherent cultures cell differentiation systems.
' }0 e7 e3 C- {7 a      Various platforms have been used to determine gene expression profiles,
. x+ X8 |5 R2 M7 ]      including differential display, SAGE [12] and slide micro-arrays, such as
1 ]3 n+ w7 g! {9 k9 D! D; j: @      those provided by Affymetrix, Agilent and Compugen. In this study we $ z, U' l" f- ^. F
      employed Illumina® Sentrix Mouse-6 oligo bead arrays [13], which have ~47
' Q' \3 j2 R3 R& f6 {: m; L( r1 d' q      K probes that are largely based on the MEEBO set of transcripts, plus ESTs , Q! j4 p$ M$ N
      discovered as part of the FANTOM2 transcriptome project [14], some 3 F& F' g1 y, r5 E6 K! m3 S
      alternatively employed exons, and a small subset of non-coding
! [" \9 c; U6 M7 r- R( e1 g- Y      transcripts. This is a new, rich and sensitive platform for expression
! ^4 e' U; r( o" y+ b1 Z7 v      profiling [15].
& @+ L1 o' `! M& J1 }      ES cells were differentiated in methylcellulose cultures, which allowed
) N& A0 o& l; y5 R& Q      consistent EB development over 16 days in either serum [2,16] or defined / d& C4 k3 d7 M8 ~$ f1 L: c
      serum-free media containing BMP4 (2 ng/ml), termed serum-freeB4L hereafter ( E: g- O4 y8 a  h4 B- B
      [17]. The dynamics of loss of stem cell gene expression and activation of ; y- n$ t- v" }! n3 _8 h
      epiblast and primitive streak gene expression programs were similar in   I2 e5 e  N6 r
      serum and serum-freeB4L but QT clustering revealed a significant ; f, c8 ~9 W' l# l# O9 J1 {0 N( _
      difference in mesoderm outcomes from day 6 onwards. In particular, cardiac
; r* h' W% r' J3 u9 j      gene expression was greater in serum-freeB4L whereas activation of the
7 q7 ~" @, i2 Y' N5 c$ K$ c* k* j      blood program was enhanced in serum.
1 z+ s8 Y0 T' j& J      High expression levels of Kruppel-like factor (KLF) family members: Klf2, . I2 c9 W! s; Q; P0 K
      Klf4, Klf5 and Klf9 was detected in undifferentiated ES cells, and 7 A2 o9 C/ `- ]: j% x8 l
      confirmed by quantitative real time RT-PCR, indirect immuno-fluorescence
. J) ?2 t1 t2 u3 m, u      and DNA-binding assays. Expression was down regulated rapidly upon   o  @4 V+ s( M& o0 C. ^8 M; p
      differentiation and a different set of Klfs: Klf3, Klf16 and Klf1/Eklf,
0 r$ E. Q0 M. y" N- I      were activated upon ES cell differentiation. Also, KLF-binding sites were / S+ Y3 D* I/ a) }, y5 f
      markedly over represented in the proximal promoters of most stem cell
. K/ ?! D* [" {" ?& ~      specific genes, suggesting possible direct regulation by KLFs. We suggest
, A3 u0 ?% E: ?/ ^+ H* u5 ?9 G      a model in which the balance between self-renewal and differentiation is
( n" k. r7 ?- V$ F      regulated by competitive occupancy of the proximal promoters of key stem 4 L- ^" y1 C1 ^' h  C/ g
      cell genes such as Zfp42/Rex-1, Nanog, Pou5f1(Oct4), Lefty1 and Lefty2, 4 f* L2 @' e! {9 b& z
      and others such as the Klf genes themselves.3 b7 R2 |( |2 q3 M
      Results
+ |- W9 U- j3 x" U$ ^( P      Differentiation of ES cells in serum and serum-freeB4L culture$ g5 n2 x3 L8 V% b4 T
      The directed differentiation of embryonic stem cells into desirable cell
% H4 C) {. F0 [/ b      types will depend upon the addition of specific growth factors to defined
2 U2 w3 }- {6 {  K+ p, j# ^      media and reproducible physical conditions of culture. To establish a
% v; A. @: z* s3 }9 n8 @1 j      baseline for such studies, we undertook a comparison of murine ES cell
" j% `' F  b5 @' y$ O: ~2 x3 Y' G+ |      differentiation using an embryoid body (EB) methylcellulose culture system % n4 \/ h9 T4 U/ W
      containing either 10% FBS [2] or a chemically defined BSA-based media , q& ^( H5 ~! \  Z3 C. F. h/ U
      containing 2 ng/ml of BMP4, termed serum-freeB4L [17]. Triplicate
0 ~4 a" K- k/ A: f6 `+ Q8 ~      experiments were performed in which ES cells were harvested after feeder
* E4 A/ E* P( C2 K& {  A( n      depletion, and set up in parallel EB cultures in serum versus 5 Q2 A. g8 |* v1 ^7 A3 B+ M
      serum-freeB4L. RNA was collected at regular intervals up to 16 days, ! \3 P, L8 Q$ {, q  q3 K& [
      quantitative RT-PCR was performed on a panel of genes representing defined
3 G5 P6 [; J# z5 l0 E      stages of development to establish the robustness of the culture system
) e4 s6 R1 O7 I# F6 [6 T( i      (Figure 1), and expression profiling was performed using Illumina® Sentrix - b  H& `1 |5 z6 `3 T
      Mouse-6 bead arrays (see Methods and [13]).1 Q! c$ D5 C. i$ K" i
      Figure 1. Dynamic gene expression during EB differentiation in serum and
) N& a! e* z& T4 Z      serum-free media containing BMP4 [2 ng/ml]. Expression of stem cell,
% P/ A$ q  x/ ~: [) U7 E% Q- Q      primitive streak and late mesoderm genes were analysed by quantitative 8 u2 p7 [( Y) C  u4 u7 J! F: r& j
      RT-PCR in undifferentiated ES cells (0, grey bar) and in EBs collected for
  E: G# M& ^6 x( l0 l  m) R      up to 16 days in serum (white bars) or defined media (black bars) from the * {9 G& j" f/ q5 n% t5 W
      same starting ES cell populations. Bars represent the means of three
' U6 [% s, @! f$ U5 z. Q      (serum) or two (serum-freeB4L) biological replicates. The Y-axis   m# f2 W# r  H( ~$ Q. n; R8 `
      represents a log scale normalized relative to the housekeeping gene HPRT.
4 e* g! y2 T- _! E* T$ m      Error bars indicate standard deviation./ V& D2 I3 h' R, x
      Changes in gene expression were extremely reproducible across ES
4 r; N3 d2 M. ]& \$ d% }/ `      differentiation experiments with overall correlation of global gene $ _" m+ Y. F9 I
      expression for biological replicates in the order or 0.95–0.96.
' b2 I) N6 P6 Z5 E- o/ G      Approximately 30% of the probes were differentially expressed at one or
; O/ c% I9 w7 W- |& _7 a      more stages of EB differentiation as determined by one-way Welch ANOVA
. h7 v+ M% ]- E7 s. i9 o# G- h/ J      with multiple testing corrections using the Benjamini and Hochberg False
' I# M. i1 _; y* b- `: o' ]' \$ ]" m      Discovery Rate algorithm (7,967 probes with a p-value of < 0.01 by Welch " ^) D( ^8 c% ?+ v3 i& n
      t-test) (Figure 2A).
" G- W9 L8 H) V. i7 J      Figure 2. Gene expression profiles during 16 days of EB differentiation in
1 C! M/ U( _: o      serum and serum-free media containing BMP4 [2 ng/ml]. (A) GeneSpring 0 _* O- L2 H/ A1 r" i) V, o. G
      representation of the subset of genes from the 48 K Illumina array that
% w( s% {1 v3 ~) j2 O5 s      showed significant differences in gene expression at 2 or more time 6 h" z( S: ^  K- k% D+ w% O
      points. Each line represents the mean normalised expression of an - i# L8 o; D0 q" S3 e% B
      individual Illumina probe. Colours represent relative expression (red 4× 1 i$ i% }. l4 j+ t; O# t
      increased; green 0.25× reduced) compared to a gene mean of 1 (black) with # q- t  N8 t5 L9 v
      expression in undifferentiated ES cells (day 0) as the decision point for
: J5 H, X0 |* V3 C& q4 X, R2 a$ T      colour coding. The expression profiles of T (brachyury), Hba (α-globin), * f4 K( Q5 W5 _( I; V) X/ a
      and Igf2 are indicated for reference. (B) Clustering (tree) of the genes
2 d5 k/ ?, ^8 Y: t# ]      that showed significant (p < 0.01) changes in gene expression upon + S, x  S& E- I, o
      differentiation by One-way Welch ANOVA. Expression levels are heat map 4 i! I- ?' u1 b
      colour coded (inset) with high expression at Day 0 coloured red and low
! c6 k9 b1 Q. h      expression coloured green.
7 K4 D& @$ Q0 G$ G. s      To mine for genes with similar patterns of expression across the time 6 J' E# L( R5 ]  Z) }3 I- D" K
      course, hierarchical clustering was performed on the 7,967 probes and 1 b$ E$ L# [7 H3 ~, D
      represented graphically. A large percentage of these (~30%) displayed
/ c& S+ w* i/ h: F. K4 o7 t      highest expression in undifferentiated ES cells and subsequent reduced
% K* H5 f4 N- U& Q- ~. Q      expression over time (labeled stem in Figure 2B). A second major cluster . Y' M4 y) F% V" A- f
      showed marked induction after ~day 4–6 of differentiation (labeled
% Y$ z  J% {1 @% G      mesoderm in Figure 2B).
2 Z  X& b: O" C1 d$ z      Putative stem cell genes sets
5 y% }+ }6 \. ^5 p- g      Previous studies have identified genes important to the stem cell state by   A/ x0 ]! @5 F$ h! m6 W& F6 \( e
      comparing expression profiling data from different stem cell populations
7 @! y) o( ^- S/ Z' e  J- m- C      such as ES cells, haematopoietic stem cells (HSCs) and neural stem cells
+ |  N2 D/ Z9 n6 @7 j$ l      [6,18]. We used two alternative approaches to identify specific stem cell
2 ?7 x8 E; {# v( J) K      specific gene expression. Firstly, genes with dynamic expression patterns ! b( t( h7 @# A  ]8 ?
      which correlated closely with Oct4, Sox2 or Nanog expression profiles
& D+ G1 V1 T5 \! V" _1 p8 H) {      (Pearson correlation > 0.9) were determined as previously described [6,7]. 4 N/ Y6 b/ `) A* I3 X7 U
      Second, genes rapidly down regulated ≥3 fold by day 3 of differentiation,
- v+ V0 h2 Q2 h- d      a time point where ES cell clonogenicity decreases dramatically, were
4 a) |+ z' u; a: T$ N( G. J9 m      determined. Not surprisingly there was very little overlap between these
1 L; M9 u$ a& L; h      gene lists, primarily because Oct4 and Nanog expression persists during EB + v; V, H  r/ K1 n- v9 {1 d
      differentiation longer than many other stem cell genes (Figure 1). Full
- D  u. A, [( O      lists of genes ranked according to similarity of expression (Pearson
% M+ P1 b" E  q. B  q- J      correlation >0.9) with expression patterns for Oct4, Nanog or Sox2 are 7 d, {, g$ C1 p: W* p( |3 r4 p+ R
      provided in Additional files 1, 2, 3, respectively.
4 Q, H# ^4 W3 P3 `' A      Additional file 1. Oct4 gene list. The data provided lists all genes
" G) u2 K5 B# w& \) ?3 z% S      expressed during 16 days of embryoid body differentiation with similarity - {& g1 p7 u% K; @
      to Oct4 (Pearson correlation >0.9)." N: B2 n8 _2 M" o3 V
      Format: DOC Size: 630KB Download file3 B( D% b( l0 e) R
      This file can be viewed with: Microsoft Word ViewerAdditional file 2. * [- k0 |3 a/ k6 c
      Nanog gene list. The data provided lists all genes expressed during 16
  t8 h& V8 K7 h; P7 Y) n) T% u      days of embryoid body differentiation with similarity to Nanog (Pearson + |  q, @8 ?# G) l" i
      correlation >0.9).6 d2 F4 U+ D+ q8 A' U) A
      Format: DOC Size: 111KB Download file, w( w" D$ Q- H( F6 T( i% z3 P
      This file can be viewed with: Microsoft Word ViewerAdditional file 3. Sox2 3 F9 L3 z$ v. q7 t! u, ?  t) _
      gene list. The data provided lists all genes expressed during 16 days of
4 N0 y7 U$ t! ^5 i9 [' m3 C- N      embryoid body differentiation with similarity to Sox2 (Pearson correlation 3 M3 s$ m8 H$ }9 p2 H5 R
      >0.9).4 j* [$ V0 J8 V  f; i5 b2 ]
      Format: DOC Size: 237KB Download file
3 w6 {# c# r( [6 _      This file can be viewed with: Microsoft Word Viewer
3 v% D" x* T- G. L' f/ k      Only fifty-nine genes were down regulated ≥3 fold by day 3 of EB
# x7 I5 f( c. F, s" ~4 k      differentiation (Table 1). Many of these have been identified as stem cell
2 a* s& W3 L. Q5 p8 e% P, H% @      markers by other groups using alternative strategies [6,7,19]. Information ' N/ S. W" X0 M8 k
      was compiled on their known expression patterns and/or gene knockout 1 u. @6 X+ E+ j& H
      phenotypes via literature searches as referenced in Table 1. Insufficient
1 [" H, F. B, o! f      space exists to discuss every gene and most have been previously
# q. i9 p: M4 Q" E% s" K      identified as ES cell enriched transcripts. Osteopontin is secreted by ( N% n6 G( V" N& |" d3 F: h7 G
      osteoblasts in the bone marrow niche where it is thought to play an
" O# \- E9 N( D3 q      important role in the maintenance of HSC quiescence and maintenance of 4 {# X1 [+ A7 S% }- V8 @
      'stemness' [20]. It has also been discovered by other groups as a strong
( U+ `0 h" p  M' I2 S      marker of ES cells and is regulated directly by the key stem cell 8 W5 Y: ?% I4 M0 {* L
      transcription factors, Oct4 and Sox2 [21]. Thus, osteopontin is likely to
! \3 O% g+ V7 m      be a useful marker of ES cell pluripotency and may play autocrine or
3 G+ l7 O( Q" X      paracrine roles in maintenance of stemness. The second most rapidly down
' I0 X7 M. V) |2 A. A# e+ c2 ]      regulated gene was estrogen related receptor β, Esrrb, which had
* c5 y; v+ ]) J! O* L      previously been identified as a stem cell specific gene and shown to be
4 ?& }( H/ J0 A1 l2 c      critical for maintenance of the stem cell state [19]. F-box 15 (Fbxo15), a : ?& ^& G9 Z; [! n  z
      member of the large F-box family of genes [22] was also rapidly down
5 Z; p# y: b. H% b! L7 t' t      regulated. Fbxo15 is known to be highly expressed in ES cells, is + F4 e2 L; k/ P7 D5 G, I
      regulated by Oct4 and Sox2, but is dispensable for ES cell self-renewal
0 G3 R! H, L: G" |      [23]. Recently, the stem cell specificity of this locus was used to screen " M: Q# y5 F4 x# ]5 T
      for cDNAs capable of transforming mouse fibroblasts into ES cell ( s6 R4 _  K( S; {8 p( {
      populations [24]. Other notable rapidly down regulated genes included the / D8 k- i0 w% A+ `5 I
      nodal inhibitors Lefty-1 and Lefty-2, and Zfp42/Rex-1, which are also well ) @; z8 x, e* O" Q- t  Z. L2 o
      established as stem cell markers [19].0 h) a3 Z) Y( g+ Q* s
      Table 1. Genes rapidly down regulated (>3 fold in 3 days) upon ES cell
+ P- I3 j. O+ g! b4 _1 Q& i      differentiation) y: A0 K$ h+ h5 o. c6 L
      Oct4, Nanog and Sox2 are not included in Table 1, even though each is 7 Z2 x. \0 m1 \* K
      required for the maintenance of ES cell pluripotency [25-27] due to their
* w$ x" n& j9 h8 t& s6 G      persistent expression during the first 3 days of EB differentiation
& a# S& H7 H7 ?" K* u' V0 M, C# Y& R8 W9 P      (Figure 1 and 5A, and Additional file 1). Indeed, Oct4 expression is known
% ]. u. O- A$ x0 Z" e# @0 D: U; w* S      to persist beyond the inner cell mass stage of development in vivo, with " ~- y7 V" D1 B+ l$ _# m0 F5 P
      transcripts detected in the epiblast and later in the primitive streak, % i9 k9 w) {5 X) \* s% G
      before it becomes further restricted to primordial germ cells [10,28]. 779
6 F3 U7 o- U! U$ x- r1 d1 |      genes with a similar pattern of expression to Oct4 (Pearson correlation of
5 ]+ o. j9 K& i, @      > 0.9) were identified (Additional file 1). As expected this list also 9 ?+ _# I1 V7 z* B
      includes many well known stem cell genes such as Sal-like 1, nodal,   d3 U9 K0 D0 Y5 t
      chromobox homolog 7 (Cbx7), Dnmt3b and Sprouty homolog 4 [19]. We also . E: D6 t: L+ P$ r3 T4 x$ ~7 y
      mined for stem cell genes by looking for similarity (Pearson correlation - f: ?7 ^7 V1 u& V3 t) @
      of > 0.9) with Nanog and Sox2 gene expression profiles. These provided . S! g; @- G1 a) A
      overlapping but not identical sets of putative stem cell specific genes
! H/ m: d2 a1 H( Q9 V$ D% q" m      (Additional files 2, 3). Full data for the 16 days of differentiation can ) H3 g/ h1 k% U$ C1 a' m% x
      be interrogated and mined for specific differentiation outcomes via Signet
" ?- h3 E2 G. j: c" Z      (Login_Bruce) (see Methods).# Z& J. R! N) o5 Y6 j. K
      Tightly co-ordinated waves of gene expression suggest sequential . V' y4 ]3 u. q' M& Z- ^* W6 T. t
      activation of epiblast, primitive streak and mesoderm differentiation
7 E6 ?- D% T5 e! u+ D, |* Q& _, B      programs9 a% I! ^$ h9 m; w* `
      Many alternative differentiation programs occur simultaneously during ES 4 P3 C" r# x: W! W1 t* N* ]4 G
      cell differentiation. Therefore, it is difficult to find gene
/ @' p7 n& b3 i# I      syn-expression patterns that identify specific programs using standard
. ?1 Z$ A% g& n0 t( a) f9 v      clustering methods. An alternative approach is to search for expression
- o. d) o: `6 q# d3 @+ u7 a      patterns that closely resemble genes of known function. For example, the T
) o/ D6 U8 v' Q% E( E      box gene, brachyury, is a well characterized specific marker of the 8 \; v5 h$ f1 K3 b: x
      primitive streak [29]. Brachyury expression is transient in vivo, a
# L4 K( p7 c8 t, j      feature recapitulated during ES cell differentiation [2]. Using qRT-PCR, / c$ M# h, q  `- o& k; z' Q
      brachyury was induced ~1000 fold from D0 to day 4 and then rapidly ' w% Q: ~- h' R0 O4 ]2 |
      silenced to baseline levels by D6 (Figure 1). The dynamics of brachyury
! w) b; P1 j, o# J! ?      expression were similar between serum and serum-freeB4L media (Figure 1
- X$ w, i" h9 x7 c+ G0 F# N5 [      and 3A), but serum-free media was unable to support brachyury induction in
- {& r$ h& O4 A/ Z% {      the absence of BMP-4 (data not shown). Thus, BMP4 (or related factors in & w9 T% R8 C& H! a; V9 P: X
      serum such as activin) is essential for mesoderm generation. Thirty genes ; Z* |$ N% S8 D- C
      were identified with a similar expression pattern to brachyury (Pearson
5 ?0 Q5 J3 Q( \: x5 i      correlation >0.9) (Table 2) including many genes with established
: f- y! ]( P- m0 L2 r      expression or critical functions during primitive streak formation, such + \, t3 `9 R- _6 J
      as Mix1 [30], Lim1 [31], Sp5 [32] and eomesodermin [33]. Transient ; R2 _( |! Y& w9 ?2 ~) {
      expression of many of these 'streak-specific' genes was validated by 3 G. k8 H: r6 O) d
      qRT-PCR (Figure 3B and data not shown). In all cases there was a close
/ c" {& G1 h* {! k3 P; V& D      correlation between the qRT-PCR results and the profiling data (Figure . ~/ f8 \0 U2 \; c8 T  f* U2 ?  y
3A).2 j) V4 ?% m3 U* b6 N
      Figure 3. Mining for genes expressed in patterns consistent with primitive & C8 K# e( U# t" `' J& g9 C
      streak, and epiblast differentiation programs. (A) GeneSpring
! G/ }& K3 {1 H& T      representation of epiblast and streak gene expression profiles during EB
2 n# N7 l7 r% I7 X; o& \3 P) E      differentiation in 10% serum or serum-freeB4L culture. The up-regulation
$ u. x( Q4 v  ^/ e9 J% d      of genes between days 2–4 were similar under both conditions. (B) qRT-PCR
4 n$ i  g6 K6 a$ k: @4 v% B0 `! F) K/ W      analysis of Mixl, Lim1, Cdx4 and Riken clone 8430415E04Rik. The Y-axis
8 e$ d/ ^3 j" f6 a( _      represents expression relative to the housekeeping gene HPRT. Error bars 5 W& J. Z/ J' k! w" d
      indicate ± SD from three biological replicates. (C) 102 genes transiently
* r' l+ f' t' L& S2 Z& S3 g% `      up regulated during the first 4 days of ES cell differentiation were ! P4 `& K- s. K/ [3 w- K1 x7 U- u
      clustered using a tree algorithm and Pearson correlation of > 0.9. The
: J& g" [2 u7 ]' \6 w      'Brief' group represents gene with very transient expression at day 3. (D) 2 {; T: `( h- I# p
      Coding region for 8430415E04Rik. The shaded areas represent the three Heat
1 v/ H/ U0 Z) R" \- w( `      domains.Table 2. Brachyury-like (Primitive Streak) gene list (Pearson . {; }5 y. s6 B* p* t, t
      correlation >0.9 with brachyury)
& {( v9 E+ z7 H      Interestingly, four novel transcripts were identified in this list (Table 7 A* }8 E0 s. y' {0 @( {' e/ s
      2). One of these, 8430415E04Rik, was detected by two independent probes, 5 }# M' S! v- N8 b5 O$ }, {% L
      strongly supporting its 'streak-specific' expression; by qRT-PCR there was + f7 v4 q  l1 B
      ~50 fold induction from day 2 to day 3 of differentiation and a decrease
& k" Z# y9 f, U      to near baseline levels by day 6 (Figure 3B). The gene contains 24 exons,
2 R$ z) n2 \- Y/ U7 j) i      and the full length cDNA encodes for a protein of 868 amino acids (Figure * y: j, f+ E  d% u: c3 I  p. G8 m0 E
      3D) which is highly conserved in vertebrates (Additional file 4). It has 7 S3 H" V- O0 {4 w
      little homology to other proteins apart from three predicted HEAT domains
: P! y& ^* ]2 h: E      or armadillo domains (grey shading in Figure 3D). Drosophila armadillo 3 s3 V. ~- m$ i5 j! M- F* J9 a
      interacts with β-catenin via its HEAT/armadillo repeat [34], which
9 V: ~8 Q) N3 P7 ?' v% I0 U3 E4 U/ c      suggests the protein encoded by 8430415E04Rik may also be involved in
" ]9 |9 o+ O# H" m3 K8 G+ d( Z      protein-protein interactions.
# T: c' D; v% ?, j% ]2 v      Additional file 4. Sequence alignment of RIKEN clone 8430415E04RIK. The
( b- F0 Z/ j& I; D5 g# [      data provides an alignment of RIKEN clone 8430415E04RIK using human,
. @0 ^, i. Y- ~/ `      mouse, chick and zebrafish sequences./ |* f- `3 L# [" F; o2 E
      Format: DOC Size: 31KB Download file
' f1 H5 p9 b7 I' {- V1 {- c      This file can be viewed with: Microsoft Word Viewer
/ H3 {) |7 d' R3 v& a# G9 C& d% E      The list of primitive streak genes in table 2 is smaller than that
: a8 l, ?4 `0 a7 h9 C1 V      previously published [10] due to the very stringent criteria used to - ?: \0 V; \7 m
      determine brachyury-like patterns of expression. Thus, to find further ; C! q2 j1 h% V; q! q/ ^9 z
      genes characteristic of epiblast, early and late primitive streak 4 x; S6 |3 ^& ]) A
      programs, probes were identified which were transiently up and down
$ ?$ m# g. E1 ]: F      regulated at least two fold with peak expression at day 2, 3 or 4 of EB + b% R4 t1 v* B) e& i9 u  z
      differentiation. Gene lists were combined (102 Illumina probes) and
* |; ~3 G! p2 ]- ^4 M. Z$ @+ ?      clustered into epiblast, early, mid and late streak groups plus a unique 9 B; \1 I; w: I; Y* D. r/ K
      group in which expression was very transient at day 3 of differentiation : X) c& \& D, ~; o9 B7 m. ?& f
      (designated 'brief' at the bottom of Figure 3C). Also, lists were
! _) I+ n; b- D3 N% [; h  L. Q; U8 k      generated by searching for similarity of dynamic expression profiles to " R$ q1 k1 A/ q) h2 [7 p. w; ^0 X
      known genes which are accepted specific markers of specific stages of 2 Q* B: x, B/ H
      mouse development. For example, Fgf5 is specifically expressed in the
+ x3 C5 T7 L2 U6 V( b. L0 ~4 x      epiblast [35], and Wnt5a is expressed in the late primitive streak [36]. ! E3 U, H, S+ U: t7 J4 ]; Y5 U- q
      The Fgf5-like (epiblast) list includes the methyltransferase, Dnmt3b, ) C' Y: C* ~  @8 V$ R
      which was massively up-regulated in the first 2 days of EB differentiation
/ Y: j; G1 Y- o5 d      (Additional file 5) and is known to be essential for mesoderm 6 h% @( B; j0 j" E  ]# M
      differentiation [37]. This list also includes Pim2, Zic2, Wnt8a, the SP1   a& m9 t- j7 F' Z" A9 I
      (and possibly SP5) cofactor Crsp2, and Irf1. The Wnt5a-like (late
" [& W, T( R! Y      primitive streak) list contains 76 genes including many homeobox genes : a) ^$ ?1 K6 P  {0 z1 g5 u
      (Hoxb2, Hoxd1, Hoxb6 and Hox8/Msx2), Cdx2, Hand1, Tbx3, Bmp4 and the VEGF $ }- y/ P' L! K3 p$ ~
      receptor Flt1 (Additional file 6). Many are expressed at E10 of mouse
/ q, {& B5 c  y* \! P8 j' g      development in the remnants of the primitive streak [36], suggesting the 9 y9 L) w1 E& X1 A# T9 f
      Wnt5a-like list includes many genes known to pattern the embryo following ) M- X1 H4 P+ R5 B
      formation of the germ layers. There is a significant overlap between the 3 N* I+ @. S5 e& B; J1 }  K
      genes we identified as being epiblast-, early and late streak-specific and
1 m9 t' i  P% b% p/ g7 G1 q9 e5 d" ?      those recently identified using a similar EB differentiation system and 1 q# ]1 a3 T) Z9 H
      Affymetrix arrays [10], and by other groups [2,38]. In addition, there are
# j' {* s" N. V" g3 G) b      addition novel RIKEN cDNAs in these lists, which are likely to encode for
9 ^9 q, y, Z! k. a      interesting proteins worthy of further study (Additional files 5, 6).4 r* m7 j4 B- b7 e5 G& B: @
      Additional file 5. FGF5 gene list. The data provided lists all genes
. O0 }' `, U+ ~8 T3 @      expressed during 16 days of embryoid body differentiation with similarity * e( w4 o& x, {9 L7 j0 J" k
      to FGF5 (Pearson correlation >0.9).. H: G( ]& |& Q, t
      Format: DOC Size: 69KB Download file
5 v6 f9 a; T! \2 J" ^3 x  X7 V      This file can be viewed with: Microsoft Word ViewerAdditional file 6.
1 c/ l1 H2 R9 a, @: W      Wnt5a gene list. The data provided lists all genes expressed during 16
" `# Q  e4 v' v9 A. y& ~7 t      days of embryoid body differentiation with similarity to Wnt5 (Pearson - t- A) {  l5 ]2 m
      correlation >0.9).
7 e! S9 J! I0 `      Format: DOC Size: 76KB Download file
. o0 k$ [# Z; k      This file can be viewed with: Microsoft Word Viewer+ {4 K2 d/ b6 p' x7 O% }. L; c; M, q/ j
      Following the primitive streak wave of gene expression there was a " f7 f" ^& V4 [( g0 |( F
      dramatic activation of many genes (Figure 2A). Many of the most highly
: e# U2 `4 L- d" G& b      induced genes, such as globin genes, heme synthesis enzymes and
* q- ^5 B; a% a1 O      haematopoietic transcription factors, are known to be expressed in the   X9 F" t& c, Q$ y8 J9 b# U! g
      blood. qRT-PCR for globin genes confirmed > 10,000 fold induction between + d9 G" N# K% M( y
      days 3 and 8 with a steady decline thereafter (Figure 1). Other mesoderm
' j+ W9 @  `/ E: m& Q: `7 J      and endoderm programs are also activated at this stage, so it is difficult
8 V' i+ u; U0 F4 [/ u      to mine for these different outcomes., k# E! n& q3 G
      Enhanced cardiac gene expression and reduced erythropoiesis in & ?, {3 j/ }: B$ y" h
      serum-freeB4L EB culture3 Q4 p4 ~. P! @: F" Q% Q
      Overall there were very strong similarities between ES cell
  T3 h: k+ y, k! V+ T# V      differentiation in serum and serum-free cultures supplemented with 2 ng/ml
( N/ i4 a: Z7 t; g- C      BMP4. For example, all 29 of the primitive streak gene set, as defined
* @! v, _! S- W- w8 _: H      from serum-containing cultures, were also expressed transiently and to a
. r6 a0 i! M! S. S      similar level in serum-freeB4L media (Figure 3A). Genes differentially
4 [/ @- B- t! B- r! p% k8 {      expressed at each EB time point (Day 1–16) were pooled and subjected to QT
# o6 s# D3 ]: l. k+ ~+ R5 m      clustering to find sets differentially expressed between serum and
" `( e8 |8 B" B6 x      serum-freeB4L EB culture. Using a minimal cut-off of five genes per group, 5 P; I9 ?# w1 l0 `( b' \/ m
      nine gene sets were identified; seven showed higher expression in serum 1 T2 {, J6 w+ i& R8 T9 d7 I8 ~
      and two showed higher expression in serum-freeB4L (Table 3 and Figure 4).
5 V- j' u& J, Y1 C/ w6 Y: }. E      Table 3. Genes differentially expressed during ES cell differentiation in
8 [/ p. ]3 X0 F( y8 R9 F7 o      serum versus serum-free defined media (BMP4 2 ng/ml)Figure 4. QT 1 |5 N4 c/ [+ V5 L( q# z% z
      clustering revealed sets of genes differentially expressed between serum 9 T8 J% g' J# d1 V% I
      and serum-freeB4L EB culture. QT clustering revealed nine sets of genes
$ M* d. o! {; B% Y6 C      with similar dynamic expression profiles (Pearson Correlation coefficient + L* `; F/ q# A8 o  h
      of > 0.9). In sets 1–7, expression is higher in serum (black sets). In 5 m* b8 @6 W1 \( ?
      sets 8–9, expression is greater in serum-freeB4L EB culture (red sets). 0 u7 o& |3 A( ^# c0 R) b
      Some sets have been named based on the similarity of overall gene function
) V+ z: P) Z: v' H( Y; g      during differentiation pathways (see Table 3).- }' n7 l# f3 Z- @
      The genes in set 9 were expressed more highly in serum-freeB4L media from 5 P8 C, M9 J4 u; S* c- M; o  g; K& K
      day 6, and genes in set 8 showed up-regulation at day 12 and 16 in the - p- v) p9 m9 Z! O
      absence of serum (Figure 4). Both sets are comprised almost exclusively of $ X# h" P8 k" p9 L! }
      cardiac-specific genes such as myosin heavy chain 7 (Myh7), myosin light
' [: n; g( a3 a5 N      chain 7 (Myl7), the cardiac isoform of α-actin (Actc1), troponin (Tnnc2),
' W; D2 b& j3 V      tropomysin (Tpm2), the nicotinic cholinergic receptor (Chrng), and cardiac
$ V8 {  d7 e7 K/ a. q" k, Q      muscle fast twitch 1 specific ATPase (Atp2a1) (Table 3). Further + o( F. Z/ j0 n; D6 T9 J
      interrogation of the array data revealed a cohort of cardiac transcription ! c0 @% Y2 h8 J! Q: x9 ~
      factors, including ISL1, MEF2C, Bop/Smyd1 and Hand2 [39], which were 4 Q& r* P: X" D1 E9 O5 d( e/ ?
      transiently up regulated 1.4 to 3.5 fold in serum-freeB4L media compared
/ S  m; r9 H& R& P& o      to serum at day 6 (Signet Login_Bruce). Importantly, this elevated cardiac ; P" C! }6 S$ @! d
      gene expression program correlated with increased numbers of spontaneously
3 \* v& n  F  }" h8 e      beating EBs in serum-freeB4L media (data not shown). Enhanced expression
; x& X4 K4 @! ]1 T0 V      (~100 fold) of Actc1, Myl4 and Myl7 in serum-freeB4L media compared with 2 J4 X2 G- Y/ \# |
      serum-containing media was confirmed by qRT-PCR (Figure 1). Taken , ^& `9 Z  g# F  a( Q
      together, this data strongly suggests cardiac muscle cell differentiation
  P6 ^+ v5 i4 A      is either inhibited in serum or enhanced by the addition of BMP4 to
% I) ]( Q2 S9 k* q' H      serum-free culture.
" _' Z4 o4 {# ]! M      The genes in set 2 were expressed in both serum and serum-freeB4L
/ m, O# z  H0 C      conditions immediately following primitive streak formation, but expressed 9 @1 [& {/ h6 |1 Q0 d
      to higher levels and slightly earlier in serum (Table 3 and Figure 4).
/ {1 p7 x% |) u+ R) `      Many of these genes are globins (both embryonic and definitive) suggesting
+ ~4 S' P, O9 z$ M' o- |      erythropoiesis is activated more robustly in the presence of serum (See
) A) Y' c1 S2 O. r" H* N$ d1 u      Discussion). Alas2, a gene encoding the erythroid specific isoform of the : r$ A4 M7 o* u. h$ s. a
      first and rate limiting enzyme in the heme biosynthesis pathway, and the
2 d9 ^/ e: D: m+ _7 O# m8 U      erythroid specific transcription factor, p45-NF-E2 are also in set 2 [40]. . c6 @( y2 C* ^4 _" M% k
      qRT-PCR for α- and βh1-globin confirmed more rapid expression in serum 3 r" Q" `! G  ^) P5 B* s' j
      versus SFB4L media (Figure 1), suggesting factors in addition to BMP4
+ A- D6 J. Q- E/ K& X6 j) z! N      enhance activation of the blood program from a similar primitive streak
3 i. y" ]: {/ }0 T5 E$ x3 f      platform.5 `6 G2 D+ b# A8 J# h3 y9 c
      The genes in set 3 were expressed at a similar time in serum but expressed
8 e' _! o6 x! @: }8 l: \      at very low levels or not at all in serum-freeB4L media. Again, this set 1 ?. i* b% m2 s
      is highly enriched for erythroid specific genes such as alpha hemoglobin 4 |' n! E3 E& n; E- l
      stabilising protein (Ahsp) [41], Glycophorin A, erythroid Kruppel-like 6 |5 \, J  K- ~
      factor (EKLF/Klf1) [42] and mitochondrial solute carrier protein . J! q8 b1 b' N) Q, [
      (mitoferrin) [43] (Figure 4 and Table 3). Genes in sets 1 and 5 are
$ z7 B1 \( s( n. n      activated more robustly at late time points in serum cultures. Most of
! {5 Y& K: k$ O8 _- O% j0 f      these are specifically expressed in macrophages. Together, these results
9 m; h; j4 b0 D5 G) k! F      suggest delayed and less robust primitive haematopoiesis occurs in SFB4L
8 I2 n, n9 L4 j* `! H      culture compared with serum, an observation confirmed by less robust ' R8 H& Q7 x6 y9 L
      erythroid cell generation and 'redness' of the EBs from day 6–10 of ! o1 O+ q( \0 p) S! e( f4 ?
      differentiation (data not shown).- W7 l0 A7 I; D9 N+ Y. n
      Rapid changes in Kruppel-like factor gene expression upon ES cell
7 A% D" a. k2 k8 c" t4 x6 Q( G      differentiation' ?+ U  v6 }: T7 K! p
      Three Kruppel-like factor genes, Klf2, Klf4 and Klf5, were rapidly down
* Y! W. _. _! J7 O6 Q* x      regulated during the first few days of EB development (bold type in Table 6 [6 P  L6 Z8 o
      1). All are members of the KLF family of transcription factors which are 7 S6 _# \& \0 t& D& E
      characterized by a similar C-terminal domain of three C2H2 zinc fingers
, ^% k' D# S% j- w% p& L3 }+ Y      which confers binding to CACC box elements in promoters and more distant
6 V: M( n7 d: f0 W: V. c2 m/ V      regulatory elements [44,45]. Klf2 and Klf4 have N-terminal transcriptional
% t  j. n2 J% w0 j  j9 c" N( z7 g      action domains and act primarily as transcriptional activators, whereas
& D5 W, o' M: s7 u      Klf5 is best known as a transcriptional repressor [46]. Until recently,
% I% ?. l7 d9 z3 ~+ m      Klf2 and Klf4 were considered to have restricted expression patterns
7 ~- t& |8 r* n/ y# w$ h      [47,48] with gene knockout animals showing late developmental defects in ) r( W" r: H# X& M
      vasculogenesis and skin differentiation, respectively [49,50]. Recently ! F8 |; T8 [3 u  `
      however, Klf4 expression was noted in ES cells [19,51], and forced
! z5 D+ J) x& e- ~& h" n      expression was shown to enhance Oct4 expression and promote self-renewal 3 T3 h! r( M  i
      [51].. E( p$ r/ n1 Z- @
      qRT-PCR confirmed the rapid down regulation of Klf2, Klf4 and Klf5 in both
3 ]. _. c) O. l& W      serum and serum-freeB4L differentiation conditions, although Klf5
6 k% [* ]# e: T1 V  Y$ b2 h" K      expression was down regulated more slowly (Figure 5A and 5B, and Table 4).
0 l( ^) T9 z6 q. j* u6 p1 X      Klf2, Klf4 and Klf5 were all re-induced from day 3–5 of differentiation as # v9 A: U5 Y$ y3 _% K8 @' g: W; S
      mesoderm and endoderm developmental pathways are activated (Figure 5A and
' c# L% ?" a+ m$ A/ p      5B). Sixteen of the 17 known KLFs were examined in detail throughout the ; q1 z5 X" @: f! [# s# P. @8 J
      entire differentiation time course (Table 4). Klf9 was also steadily down
$ Y6 b$ o: U* D0 V" N      regulated (>10 fold) between ES cells and day 5 EBs in serum and 8 s( Q) v" O' Z6 v3 B) ^7 w/ v6 A
      serum-freeB4L media (Figure 5B). Klf3, a well characterized 6 E  {& y2 m% h
      transcriptional repressor [52] via its ability to recruit the 2 _- O- O; k% P
      co-repressor, CtBP [53], displayed an interesting biphasic expression 2 w+ I5 F! J# e; _
      pattern with initial rapid down regulation then re-induction from days 5–6
6 Y. ~0 r7 j1 ~      (Figure 5B). In contrast, Klf16 displayed little change in expression 3 k* J. N; @, ~( \. R! i1 \. y
      levels over the 16 days of EB maturation (Figure 5B).
. s# d- f- @- h; B" a- y      Table 4. Kruppel-like factor gene expression during ES cell ) ]! f+ t# ]3 ~* X1 S( u% J% T4 Q. J0 E4 n
      differentiationFigure 5. Kruppel like factors (Klfs) are dynamically ' w! j/ P) @" M
      expressed during the first few days of ES cell differentiation. (A)
' {- J9 r# p6 Q7 _/ W( F2 q      GeneSpring plot of normalized gene expression for all KLFs detected during 0 [6 E" V0 R% |' R: p1 \* G5 I8 |/ ?
      ES cell differentiation in serum or serum-freeB4L culture. Most of the ' e4 {5 W4 e5 r
      genes are listed on the y-axis in order from their highest relative 1 }0 Y, n/ n8 i) C& F
      expression in ES cells. There is dramatic up regulation of Klf1 (Eklf) * b) F4 b1 U0 x% Q
      only in serum following day 6 of differentiation. Plots representing Nanog # i/ @- q! a4 \% i6 b9 d
      (red), Sox2 (red) and Pou5f1(Oct4) (green) are shown for comparison. 4 e; G3 i; e% D' K% ~* L
      Pou5f1 and Nanog gene expression persists at high levels for 2–3 days
! f( l. F. [. q      after Klf2, Klf4 and Klf5 are down regulated. (B) Validation of changes in ! f+ ?' g2 n4 A: ~  r" N
      gene expression of six members of the KLF family by quantitative real time 1 L7 j$ F: s: f0 o- e2 R8 f) D
      RT-PCR. Scheme as described for Figure 2B.
  W, y3 l* u7 B% D/ o7 l6 a      Klf2 and Klf4 co-localize with OCT4 in the nuclei of ES cells (Figure 6A). * ]/ S% N: U- a' J% F( T+ x
      Interestingly, all three proteins were found to preferentially reside in a
! {, C/ d! e3 k1 X      nuclear sub-compartment (possibly nucleoli for transcription factories), : S3 m1 c. K6 |7 f: ?( j
      suggesting possible co-involvement in a protein complex or network. Also,
: ^; ]: t2 @+ C1 o+ A, a0 x      endogenous KLF2 was detected by specific CACC box DNA-binding activity in 3 m" t! I% j1 y7 O# X1 c7 s
      undifferentiated ES cells, which was lost upon EB differentiation (Figure
, S# D( G% \% m! u3 `) ~      6B). In contrast, endogenous ES cell KLF4 DNA binding activity was not
3 @4 S. b; Q/ d      detected using the p18INK4c promoter CACC box sequence, although
0 w9 `, }8 d0 s1 p5 Y# R      recombinant KLF4 was shown to bind to this probe (data not shown). SP1 and ' x" `) ]2 \, J7 Y5 v* v0 X; T
      SP3 are ubiquitously expressed CACC-box binding proteins, which are
0 \' Y7 S. Q1 D9 R      dominant in ES cell nuclear extracts (Figure 6B). We hypothesize KLF2,
: Z  n4 S  j) v6 V, F5 b8 B      KLF3, KLF4, KLF5 and KLF6 compete with SP1 family members for DNA binding
7 V! q  `7 C- Y0 p) t      to key cis elements in various stem cell genes to regulate expression in
0 Z( M/ N2 }& [      positive and negative fashions (see Discussion).8 n. s: s; B1 c1 y6 ~
      Figure 6. Kruppel like factor expression and DNA binding activity during : s2 n9 J: {/ p5 ]% z
      ES cell differentiation. (A) Co-expression of Oct4 with KLF2 or KLF4 in ES 6 y$ l; I+ G$ D2 p3 ~
      cells. Indirect immuno-fluorescence shows co-localization of KLF2 and KLF4 , @/ B) ~3 ?7 ?; z! I
      with Oct4 in sub-nuclear compartments (possibly nucleoli). Individual
6 j2 L1 u" v+ o9 k      confocal images for OCT4, KLF2, KLF4, and DAPI are shown with the
( }' h; b) a2 n' J; E      corresponding composite image. Scale bar 40 μm. (B) Electro-mobility gel 8 Z3 o4 z1 ?6 g! l/ |5 D) ^" |3 s
      shift assay showing changes in DNA binding activities at a conserved CACC : \; v3 p7 E3 j# @' P3 N
      box site in the p18-INK4c gene promoter. Nuclear extracts were generated
& B# v9 g2 `' r6 q) c, r6 a6 f      from ES cells or EBs differentiated for four days in serum. Super-shifts
) v4 R  M- Y" ]6 h+ U: @! o, K      were performed with specific antisera for SP1, SP3, KLF2, KLF3, and KLF4
8 |" _4 ]) B/ p7 B6 t# t/ Y      (See Methods). There is strong binding of endogenous Sp1 to the CACC
) u  c! f6 u6 W5 q' k9 ]8 U      element in ES cells and EB cells. KLF2 DNA-binding activity is present in
0 z9 U5 a" N, u) a4 P6 O. D# q0 S! Z      ES cells as determined by a specific inhibition of binding of the   A( L! C' |* ~; }0 ?8 _# F: K
      indicated DNA complex with a KLF2 antibody. This activity is lost upon + Z5 n3 S4 B0 M% F( @
      differentiation into EBs. The identity of the CACC box binding activity in - }- m+ C3 X+ W1 a( s  r+ ]4 l
      EBs denoted CAC-X, and the binding activity in ES cells denoted CAC-Y, was + X# I1 _! a9 M4 N, F1 I( }, |
      not definitively identified using this panel of antibodies.% K9 Z3 k7 d; |
      Enrichment of KLF-binding cis elements in the proximal promoters of stem
. P3 ?( `8 e( {# B7 g      cell specific genes
8 W* {3 K) i" S6 A4 ^& U% g& L      To investigate whether Kruppel-like factors might directly regulate stem ) l' e9 G9 [( [( E5 p1 W. d+ k
      cell gene expression, and therefore play a role in the maintenance or loss
9 c5 H) I1 c, F9 R      of 'stemness', the presence of KLF transcription factor binding sites 4 P) R+ a9 |* m; D& ?' N
      (TFBS) within the proximal promoters of all genes in Table 1 was
  B4 P; e5 \* t! g4 D! ]/ w      determined. Although a position weighted matrix (PWM) for KLF4 binding
3 |8 M2 F6 o' z% B6 y# Z9 s0 s8 A      site specificity has been published [54], this site does not resemble the
* _6 d. |# o, |! r" ^      CACC box sequences known to bind other KLFs. Since KLF4 has identical 8 ]! ^) _, m! a* w2 R/ x1 d
      DNA-binding residues in each of its three C-terminal C2H2 zinc fingers to
: U% J. r* j/ V' A9 A  d$ y      other members of the Kruppel-like factor family [45], it should bind to
; f8 j) p3 f8 F& X, u      similar sequences. Based on the crystal structure of the zinc fingers of
$ q% S; g- x( H& h, }      Zif268 and SP1 bound to DNA [55,56], and testing of specific binding of
/ L; A( J  n- N/ r      CACC box variants to recombinant KLF1, KLF3 and KLF17 [45,52,57,58], a
/ q. H6 J1 t# y) o- b9 g' x      generic KLF TFBS has been proposed. Although a SELEX experiment for KLF1 # o. g  @6 q' T: w1 A# S* t1 h
      binding has not been published, experiments from our group suggest a C at   ?2 c& e$ {/ j. h! |) z% s% t
      position 1, an A or C at position 3, an A at position 5, and an A, T or C
: `* {8 |0 r5 ]; [& x      at position 9 of the 9 bp consensus provides enhanced affinity for KLF1 6 n9 f) w9 \5 f+ j8 J7 A/ c
      binding. Taken together, these studies have enabled the development of a
2 e5 \' N9 f/ l) M9 t) L      PWM (called here KLF-A) that should predict KLF4, KLF2 and other KLF TFBSs
  J# m, k' E6 W, t      (see Methods and Additional file 7).
4 n+ {8 x1 B. R9 Z. W      Additional file 7. Position Weighted Matrices. Provides published data on
( w. F7 E# E$ Z1 b- d& }" V8 M5 ]      position weighted matrices for Oct, Sox, KLF-A, KLF4, Nanog, E-box, Gata-1
) q, Z) ?0 ]' ~/ S7 n. v      and their variants.
; P) D4 Z* s5 k( e7 u/ e3 O6 m      Format: DOC Size: 116KB Download file+ w  L7 g3 o, A: ?3 u
      This file can be viewed with: Microsoft Word Viewer& N- o- N/ {8 I, W
      Clover [59] was used to determine if defined TFBSs were statistically 5 }( `. Z+ L# |+ j( A5 s1 B
      over-represented in the stem cell lists. We searched for PWMs for KLF-A,
1 r2 V$ n' h7 G# e      KLF4 [54], octamer, extended octamer, nanog, and Oct-Sox binary sites as " n' k9 \" X7 B; g& E
      well as negative control sites (gata-1 and E-box) in sequences 2 kb
! V' T' w* |8 _8 ^      upstream of the TSSs of stem cell genes (Table 1), versus 2 kb of
2 z+ O6 L" g% U% z4 @      sequences upstream of the entire murine transcriptome (See Additional file 7 I' \# Y5 {6 O* F8 _7 r
      7 for details of PWMs). There was significant over-representation of KLF-A - C' @& _% H, J
      TFBS within the first 2 kb of promoter sequences from the stem list gene 8 u2 Z& G% z# N! g) @- Z
      list (p value of < 0.01) (Figure 7A), as well as over-representation of $ D5 W6 J  a. m* B7 o# ^
      octamer sites and extended Oct4-binding sites as defined from ChIP-PET
+ G& ?7 s; H/ Q& @% |: c      data [60]. No statistically significant over-representation of KLF4 [54], 4 w' ~8 Q4 |$ ]( I
      NANOG, GATA1 or E-box TFBSs were identified. All genes in the rapidly down
5 i- C3 f8 u. C7 [4 e" k+ Z      regulated gene list have at least one KLF-A type CACC site, or octamer or
( U% u, l2 r( g/ i6 ~. y6 X      Oct4 [60] TFBS. A full list of the promoters with highlighted sites is
0 M/ s" c6 J1 B      available on request. Together, this data strongly suggests direct 4 w" h1 H/ x. X% h9 h; T- H
      transcriptional regulation of many stem cell genes by KLFs.
) J) e+ _% m( w* s! y      Figure 7. KLF and octamer binding sites are highly enriched in stem cell
' K( {8 W# `7 x/ }! T) |4 R      gene promoters. (A) A Clover analysis was used to identify
) F& Y) E' f  Q  W" B) ?/ s      over-represented transcription factor binding sites within the promoter
  {2 W# I# |! h( _1 w      sequences of all stem cell genes identified in Table 1. Representative
: w9 D( N9 j4 W% X% H; j      gene promoters are shown, indicating KLF-A binding sites (pink), octamer # C, ~% n5 m8 `+ Z
      sites (ATGCWAAT) (green) and extended Oct4 binding sites (Oct4-Loh)(cyan)
1 V- F- b  A5 s( H7 b      [60]. (B) Clover output for 2 kb of promoter sequence of murine
# e5 ?- \( `; k. v1 i* l      Zfp42/Rex1. The positions and sequences corresponding to PWMs for KLF-A,
5 ?/ g" f3 g4 k+ D. W      octamer and Oct4 occupancy sites are indicated in the table and colour 5 N1 C8 L: n& S0 N
      coded in the sequence. The positions are relative to the transcriptional
8 w2 k8 G% _- ]      start site. (C) ECR Browser output of conserved sequence identity between
3 _% ^! z% Q& n( |, N. L      mouse and rat in the promoter and part of the first intron of the Zfp42
6 J0 G8 K4 F2 T: Q8 i      gene. The blue arrows indicate the direction of transcription. Yellow
6 B' f1 t3 c1 @' }- V9 h1 ~$ S      indicates the extent of the first (non-coding) exon, pink indicates : G+ M- X6 s& M- M% l, g
      regions of sequence conservation in the first intron and red indicates
* T% A& R' |5 ]! I, e( K3 T      regions of sequence conservation in the 5' upstream region. rVISTA was , w2 n$ {* s5 }" ]* x
      used to find all of the KLF-A and octamer sites in the murine gene (Murine
6 |" m* }4 v; a& N/ m- q      only) and conserved sites between mouse and rat (Conserved mouse, rat).0 c, v9 q2 X" X' n
      The Zfp42/Rex1 gene promoter is presented as an example of output from the
8 o! R) s: a0 J  U+ i- }8 H      Clover program (Figure 7B). It contains five KLF-A, one octamer and three ( }3 M& k: o4 b4 l* y
      OCT4 TFBS. The two CACC box elements within the first 250 base pairs of 5 Z9 _9 k! |; o/ I0 h
      the TSS represent classical extended KLF-binding sites [42,61]. These
/ B( ~/ Z& s6 o% t) O4 I2 E% ~2 l. K      regions are the same as previously reported [62] although their functional 1 y: I) P: ]& e/ M0 `6 }9 z
      importance has not been determined. Interestingly, two of the three KLF-A
6 j1 ~7 V# }$ w      TFBS were evolutionarily conserved between the mouse and rat gene 3 ?. j: Y7 }4 f+ X+ f$ v
      promoters (Figure 7C). Also, two further conserved CACC sites at -1.4 and
  K# ~7 c. ^+ {4 I& ?% I9 H      -1.5 kb were identified within 500 bp of extended evolutionary / S6 U- c6 P- {6 R4 f3 w
      conservation (>70% identity) (Figure 7C). This might act as a KLF 5 f" u& f! M, Z( u4 @4 w
      dependent enhancer. Most of the stem cell genes in list 1 have conserved 2 Y( V9 Z4 F: [" h9 F6 N0 r2 U/ {
      CACC box elements in their promoters [63-66]. In some cases these have 8 W) S/ C4 s% \" C4 M1 f
      been reported to bind the ubiquitous SP1, but our Clover analysis suggests . W. |+ O* l0 ?- u* ~4 W" B' n
      they are also likely to bind KLFs (see Discussion).  ~8 K6 h2 l( ]" R
      Discussion
4 z% `' D% z1 j. |8 V  K      Expression profiling of murine ES cell differentiation was undertaken over & S$ Q: h" W$ `- u' m9 k$ m7 j1 i) G- ^
      a 16-day time course. We compared gene expression in methylcellulose
$ |* }9 P- C8 V4 Y      cultures containing serum versus chemically defined media containing LIF 3 }7 m* D) f6 X$ Y
      (1 U/ml) and low concentrations of BMP4 (2 ng/ml) [67]. The Illumina&reg;
% h4 t/ f( ?% C5 k; t* \/ j      Sentrix Mouse 6 bead array provided a sensitive and detailed platform for : w& r6 b; B3 N, U
      analysis of dynamic gene expression. Using various data mining approaches,
; o8 E; U5 f) F8 c* D1 K* O      lists of stem cell-enriched genes and genes that are induced during the in
" N  i5 k+ {: Z" Z      vitro equivalent of epiblast and primitive streak stages of $ [& l8 z3 p2 e; S
      differentiation were generated. In combination with other ES cell . u0 \' M4 Q  f( l0 I$ M
      profiling studies [7], our detailed expression data provides a useful 5 O, u+ y! w+ |. e
      resource for future reverse genetic approaches (i.e. siRNA knockdown) to
5 y2 p9 B& w9 m" i+ q2 i6 {8 l      study the function of these genes during ES cell differentiation and in
3 C6 j8 q& O  ~# ~) k0 Y7 h      vivo development. We also found a number of previously uncharacterized
. i! b8 f% l$ x. f, Q8 W' p      cDNAs (RIKEN clones) which could play important roles during development.
3 i$ I6 _! h: e0 [' `4 t  z      Importantly, the loss of pluripotency, measured by Oct4 gene expression,
) X9 C" v9 C+ P, d6 G2 t' s      was comparable between serum and serum-freeB4L EB culture, following a
: |) t5 l+ B" z8 `8 w8 z      predicted decrease over the first 6 days. Surprisingly, Oct4 gene 8 |" f$ K% Z0 X0 i; o. g6 `4 j
      expression gradually increased following day 6, an observation independent ' q4 S2 C" x1 q  U0 s1 K7 i0 o
      of the cell lines used (data not shown). It remains undetermined if . ]# C4 c. P/ x" ]
      expansion of Oct4 positive ES cells persists as undifferentiated
0 w. W. b! w  B% c4 m: j: c      populations within our EB culture system. Previous studies have identified
/ o+ ]$ }! [% v& q& y      the development of Oct4 positive primordial germ cells (PGC) following 12
) [9 D2 N; Z9 [      days of ES cell differentiation [68-70], suggesting the Oct4 profile may
& x0 Y2 P0 T: ~/ W! H% \2 o; }  ]      alternatively represent the expansion of non-ES cell populations. Although
3 t& x/ F, D4 C- e1 F. g      the expansion of some undifferentiated ES cells is possible, the array
3 Q  V, h+ u4 m8 u8 p! k& @      profile does not suggest global persistence or up-regulation of ES gene
/ ]. q* j! i. Z" c2 n  [3 ^6 |. s$ e      expression late in the EB program. Also, markers of mesoderm induction
1 V) L/ H) l6 k5 G, G+ g  {. Z! P      such as brachyury and Mixl, do not show persistent up-regulation following
5 B* K8 R3 g  m3 T7 G3 v      peak expression at day 4, suggesting EBs are unlikely to harbour cells . r7 q9 {- o2 {7 R
      which are delayed or arrested from entering the differentiation program.
  q- ]7 |; x& v9 r      We found significant differences in cardiac gene expression during EB
$ A" y" y' d# n      differentiation in serum and serum-freeB4L culture. Our current 7 D. s' Z6 n/ I
      understanding of cardiac development provides possible insights into this
( t: B! {5 q% g! o% T      observation. Briefly, in vivo studies have revealed BMPs secreted from the + W* Q! B+ d3 k
      anterior lateral plate, and ill-defined signals from anterior primitive
' }" i% r3 ~1 S/ ?0 p% `0 t7 N      endoderm, are key inducers of cardiac development [71-73]. The
  ~+ @+ g. X# {! l      administration of recombinant BMP2 or BMP4 to chick explant cultures
2 m, s+ |1 J- |, H      induces cardiac differentiation in non-cardiogenic mesoderm [71] and Bmp2
  G7 p" o6 g4 N8 b      knockout animals develop cardiac abnormalities [74]. Conversely, Wnts 8 }! D: L3 n% F- x5 Y$ t
      secreted by the neural tube are strong suppressors of cardiogenesis.
- J: l7 S) H$ @% P" [      Together, these opposing signals act to establish the borders of the heart
! K6 w. n3 D) |% C5 ^0 y4 I' `. H      field [71,75]. By mimicking the environment that establishes cardiogenesis 7 G1 B4 j7 |6 I- \- C) i& i
      in vivo, assays capable of directing cardiomyocyte production from ES & w( M: Z) }$ h
      cells have been established. BMPs can efficiently enhance the cardiac $ h! N5 P3 o6 p4 r3 A4 U
      program when added to EB culture [76,77], whereas BMP inhibition . a# D7 A$ u! P6 ]& L* G
      drastically suppresses this outcome. In our hands cultures supplemented - \( V2 E6 A# S; b
      with BMP4 (2 ng/ml) supported cardiomyocyte maturation (as determined by $ U8 W3 w6 z: u6 \
      expression of cardiac specific genes) with greatest expression detected at
9 y  ?8 a& F( f6 w      Day 8–10. This correlated with increased numbers of spontaneously beating 5 ~$ m' ^+ v& L! d
      EBs, and the timing of initial spontaneous contractions during murine
6 Y1 W; B+ R- q2 O3 U; ?* b; |/ s      embryogenesis [78]. The cardiac program was significantly reduced in
2 E: k9 G3 Y4 Q6 A6 u      culture containing 10% serum, an observation supported by a number of ' y# W1 O8 H+ m4 S. d# H
      other studies [79,80]. This suggests the constituents of serum are
! U: s! |# l( I; M% B0 _0 ?8 I2 e      inhibitory to cardiomyocyte development. Although a detailed assessment of $ b! a. L5 C4 Y2 |  H4 p9 Z+ N
      cardiomyocyte differentiation was not the focus of this analysis, the . y- {/ b3 q6 e* l' [* j
      array output, qRT-PCR profiles and morphological observations described
' b5 R3 Y0 p# n6 G) j" X* Z! T+ _& ^      supports the usefulness of this assay in future investigations. In
1 D) W" R, ]5 X9 K( ?      addition, the defined constituents of serum-freeB4L media provides an : j0 X7 ]! p  v$ o, _% U
      excellent opportunity to identify additional recombinant factors required
) `6 p4 ~, d7 Q( Y; y      to further expand cardiac progenitor cell production from EBs.6 v' [" L) W/ x/ X; q! w1 ], A4 O
      In contrast, the hematopoietic program was more pronounced within EBs
- F3 Y6 t2 H' k7 @' a      grown in 10% serum than BMP4 alone (Figure 1 and 4). In particular, EKLF * `0 S0 g/ ]  c, e  C0 B( G7 @
      (Klf1) expression was significantly reduced in serum-free media 9 a" e* {% l2 O
      supplemented with BMP4 (2 ng/ml) compared with serum (Figure 5A and Table / l) [( U# l* f3 k) d. M# u, i
      4). EKLF is essential for regulation of a large cohort of erythroid 9 _& w0 |1 O$ h5 h
      specific genes [81,82], many of which were identified in sets 2 and 3
) ?: h4 f0 |$ n. H. ]      (Figure 4). Thus, it is likely that reduced EKLF expression in SFB4L media
1 l. s, ]1 v! k1 e) |      directly results in inhibition of a cascade of erythroid gene expression.
- ^3 F6 ^7 y7 ]  x      Previous work showed the addition of BMP (at 5 ng/ml) to serum-free ES + Y9 x6 t, }0 _5 A
      cell culture induces EKLF expression and restores hematopoietic cell
5 C# E/ ?" V" L0 p7 ~      differentiation [83]. Although a weak hematopoietic program was observed 9 q) Z. C# _% A. u
      in SFB4L culture, the concentration of BMP4 used was less than that used
+ m2 d/ n2 i3 s5 Z, r; P* b: N' W, O      by Adelman et al. Thus, we suggest a robust cardiac (anterior-ventral $ x( ]( ~! }3 X3 D* ?
      mesoderm) gene expression program is induced by low conentrations of BMP4 . g! `7 ^( O7 g1 n+ Q- B. j8 d( p
      (or other BMPs), whereas a robust blood gene expression program 7 e5 C+ `8 U. {( F8 [. P
      (posterior-ventral mesoderm) requires >2 ng/ml of BMP4 or additional 8 Z$ b# e! x  ]6 G* \
      growth factors. Recently, mesoderm derived progenitor cell populations for
/ m- h1 @/ B$ i& l2 U* x' v. h      hematopoietic and cardiac lineages were studied during EB development
, z  [) K. n1 b7 r$ ~/ u      using ES cells in which GFP is targeted to the brachyury locus [80]. 6 I2 f" X' k7 c" i" U
      Within day 3.25 EBs, GFP+Flk1+ cells were shown to represent hematopoietic / X4 Y0 U9 Y; i. \
      precursors, whereas GFP+Flk1- cells were significantly enriched for
7 a3 R5 @5 b0 E6 U9 p6 i0 q      cardiac progenitor populations. It would be interesting to determine if 9 n1 |6 |; \) Z# |5 }
      the ratio of Flk1+/Flk1- within the brachyury positive population is 4 x8 l1 f+ W" t/ t. z" t
      altered between our two culture conditions.
  b& }# j$ ?1 r, J      A number of the Kruppel-like factor family of transcription factors were
4 m/ P4 S  n+ z# F* m# Q4 G      dynamically expressed in the first few days of ES cell differentiation. It $ \3 D2 b- S$ Y& r) M& k
      was initially surprising to find highly enriched Klf2 and Klf4 expression
: ~% n/ t4 }, s1 q  b      in ES cells, since both are considered markers of terminally , l7 E" V7 ?- U; H
      differentiated cell types such as skin [50], gut [84], vascular smooth
9 C, {; T8 X& ^' m: `7 r      muscle [49,85] and lymphocytes [86]. However, recent evidence suggests
& J0 L% \% ?% J6 _9 n      KLFs may regulate stem cell function, since Klf4 is enriched in ES cells
1 ]& G  Q9 g" O      [19] and forced over-expression within these cells can maintain 5 R' C8 I" j( j
      pluripotency in the absence of LIF [51]. Furthermore, Klf4 can bind the 4 S& f/ o) L% ]# s5 m) ~; i5 d
      Lefty1 gene core promoter co-operatively with Oct4 and Sox2 [65]. Although
: `9 }: q! s1 }* {      Lefty 1 and 2 are best characterized as repressors of nodal, acting to , k8 Y) f  O$ D2 X2 w8 H
      regulate left-right patterning [87], our results and those of others,
6 o" X7 i6 {/ L4 Q      suggest a possible redundant role for lefty proteins during the
3 z& |; f; @9 Q3 h      maintenance of ES cell pluripotency [65]. Similarly to the Oct4 expression ; u) [. i. h% \
      profile, many of the KLF family members also increased in expression late 0 _5 X* P7 v: ?
      in the differentiation program. As mentioned, the KLFs are expressed in ; C+ ?+ X. A; g6 ]' Z) j
      diverse tissues during development. It is therefore expected that the * {& q# O4 @& [7 f2 K) Z; A  b0 I7 K
      profiles obtained reflect the generation and early specification of + ~" t0 z) _& `9 s1 s9 g# Q! W
      mesoderm and endoderm cells following primitive streak gene activation at
3 D$ b" y" o' |$ l      day 4.
8 N- }% n! W: B; w+ [      Based on the likely identical binding specificity of the KLF family,
- H# f( ]/ j- K0 g; u      established transcriptional activation and repression roles of certain
7 Q% n2 j, c, n" u      family members [44], and bioinformatic evidence of a high prevalence of ' i9 `) l) _, r# S% s, {
      KLF TFBSs in many stem cell genes, we propose KLF competition for * d2 g  w3 U  W  N7 L9 D
      occupancy of these CACC box elements might determine self-renewal versus 5 O  u& y/ p+ r9 K
      differentiation of ES cells. According to this model, high levels of KLF2 1 V$ k3 Z8 G( W$ B. o$ V' c# O9 W6 X
      and KLF4 expression in undifferentiated ES cells would lead to occupancy
" l# C6 A. Z$ x( p0 ]7 _      of CACC box elements in promoters of stem cell genes such as Pou5f1/Oct4, 9 R6 N  Q6 ^8 r1 f( B% o/ d
      Nanog, Esrrb, Zfp42/Rex1, Lefty1 and Lefty2. Furthermore, the Klf2 and " c( {. e8 ?5 U4 e1 [, R. v2 G
      Klf4 genes themselves have CACC box elements in their proximal promoters
: @4 J" J( ^( y      (Figure 7A), suggesting a positive feedback loop within ES cells is
0 k* [+ ]2 [) }, s      likely. Interestingly, gene knockout and lentiviral shRNA gene knockdown
& ~, m$ t- R/ D& S      of Klf2 or Klf4, does not lead to an obvious stem cell defect
; @; m& H. m+ |9 a& Z! ~/ c      [19,49,50,85], suggesting these two KLFs may have functionally redundant
0 j1 U1 v# W- I; r      roles during the maintenance of pluripotency. Future analysis of KLF2/KLF4 ) r7 c: `( f8 V9 Q. c
      double knockout ES lines or the knockdown of both proteins using RNAi
2 \! l- d3 u# x      technologies will be necessary to validate this hypothesis. Upon ES cell 0 T1 G# n2 T0 y: P+ Q
      differentiation, down regulation of Klf2 and Klf4 was very rapid, whereas + v% M" j* G4 L
      down regulation of Oct4 and Nanog was delayed for one to two days. We % ]1 ?4 [6 H! ]2 K/ g
      suggest loss of KLF2 and KLF4 binding to Pouf1/Oct4, Nanog, Esrrb and 6 O& e4 B2 D; s" x5 R
      perhaps other stem cell promoters, could be directly responsible for their
: ]. y5 j: C; B3 c1 q5 Z1 H, k! R* J      down regulation. Moreover, Klf5 was down regulated more slowly, and Klf3 2 v1 i2 b, g1 p; W- k
      was up-regulated in the first two days of differentiation, suggesting
1 R6 a% t/ N% e7 c" t4 k      these KLFs may function primarily as transcriptional repressors at the
0 x: I6 U: I* n, Y2 C$ B: L      Pouf1/Oct4, nanog, Zfp42 and Esrrb gene promoters. Once this
1 L& X7 t5 v# G1 w& Y      differentiation driving transcription network is activated we suggest 2 K# [6 X; ~7 @5 e( Q# @5 W
      KLF3, KLF5 and KLF9 can accelerate loss of the stem cell state by directly ) A9 c( `) _. t9 L* }) Q
      repressing expression of the Klf2 and Klf4 genes themselves. Therefore, a ( ^7 r% D! ^8 E6 D$ c8 e' W
      combination of cross regulation of transcriptional outputs, and 1 ^8 U: P8 K' \, S
      competition for occupancy of key stem cell promoters by KLF proteins with
5 y  F2 K* G6 N2 N      differing biochemical properties but identical DNA-binding capacities, may 1 j4 \8 }1 X. L
      determine self-renewal versus differentiation outcomes. Again validation
$ t) T  R  P7 y      of this model requires functional analysis. It is important to note we ! |% f5 e$ p; j+ |( W
      have not proven occupancy of stem cell gene promoters by the KLFs in vivo.
0 ]6 H+ M9 z9 y1 S( I% S      Chromatin immuno-precipitation (ChIP) experiments in ES cells and
0 T2 n* u! J0 K# P4 q8 v; s      differentiated populations are essential to validate this model, however
  k* @. p5 k. O1 ^% s2 F6 x6 |0 u      ChiP grade antibodies are not currently available.
9 Z5 c% z$ ]; R8 i* C2 P$ ^      This scenario is not necessarily limited to ES cell differentiation. A
9 O% N3 G7 M' U      similar program may take place in many adult stem and progenitor cells 3 o5 h% u5 N) q6 ]# F7 L
      populations. Certainly competition between Klf4 and Klf5 has been 7 t3 o. n8 x8 d. V
      suggested to regulate cell growth [88], and Klf1 and Klf3 compete for
, [$ n, R' t% |9 w0 Z9 @3 h/ J      binding and have opposing effects in blood cells [89]. Furthermore, an ) a1 V" H$ o1 N
      imbalance between occupancy of key stem cell or proliferation gene 9 X& v& q; }# U+ x) V
      promoters by KLFs, with differing activating and repressing functions, 2 _' c1 d3 @* v$ J0 l0 J# ^' q
      could be responsible for the development or progression of many common
8 |3 b" a* L7 I; t      forms of cancer. For example, loss of hetero-zygosity for Klf4 and Klf5 5 K4 z  C  M/ g: Y1 L. g! z: R
      has been found in colon, stomach, breast, prostate and liver cancers ! t5 j0 V6 P6 f, B2 M* k
      [90,91]." w9 p" z, x! c' L
      In addition to the proposed complex transcriptional interplay between KLFs ; O4 X) ~- I6 }. p: X2 U
      and stem cell gene promoters, direct protein-protein interactions and ! _  F4 O* B' o
      protein networks involving KLFs are possible. We showed sub-nuclear
/ p$ g+ ^8 C  f, L6 `      localization of KLF2 and KLF4 with OCT4 in ES cells (Figure 6A), and KLF4 , l" f1 m! @. u- m
      can co-operate with OCT4 and SOX2 to drive expression of the Lefty1 core
5 E4 t2 v3 \. }& J      promoter [65]. Also, very recent biochemical purification of , O; n9 a; V. k! K
      NANOG-interacting proteins discovered KLF4 as part of a NANOG-OCT4 network 7 a7 W7 E: I! |" T  `" `
      in ES cells (Stuart Orkin, personal communication). This is also
1 f+ A# o8 d$ ]/ h      consistent with the ability of KLF4 to co-operate with OCT4, SOX2 and
/ H" V' r( V8 L% j* ~  Y% j- ]' F      c-MYC to drive de-differentiation of fibroblasts into ES-like cells [24].
) M. g- |& s0 @7 ~! s- L      Conclusion
, l# J; ^- `" z- ?6 E      Defining the genetic regulation of ES cell self renewal and
# \# ^/ t% k. G      differentiation will be instrumental for the development of future cell
8 p/ a+ `8 M8 V3 V+ s, t0 K. q' [* s      based therapies. Using 47 K Illumina&reg; Sentrix bead arrays, the
: c0 {" @9 |5 B* W" U6 v* U      differential expression of genes during 16 days of ES cell differentiation + ?  T  f& s3 p" _
      was determined. Hierarchical gene clustering and correlation statistical 8 c# k7 {% J# h3 y$ U( H
      analyses lead to the identification of a small cohort of genes which
+ n+ R& S1 Y' A7 E      define the stem cell state. Historically, ES cell differentiation is 5 A9 g3 L) ^6 A# R
      achieved in culture media containing 10–15% FBS. A direct comparison
8 d$ K0 C9 _6 c+ M$ A. t      between ES cell differentiation in serum and a serum-free media was ( @1 s+ e4 Z- n, Y- f. @0 ?
      undertaken. Surprisingly, global gene expression profiles were comparable * ~! L) ]/ u' W! n7 e7 m  S% e3 l0 x
      between culture conditions, with the exception of mesoderm derived cardiac
  B1 |$ w3 U- v9 _      and haematopoietic transcripts. These results support the use of defined ; Q/ @' G. @1 v7 i
      serum-free approaches for the directed differentiation of ES cell / f6 ^$ _5 Z# k( n% C. H7 g. Y
      commitment. Expression of many KLF family members were enriched in ES ) v1 g( ]$ a  F9 G* z: y: x5 |% Z
      cells and rapidly down regulated upon differentiation. KLF2 and KLF4 were + z1 f4 j8 B4 l: E  b" B+ ?3 \
      co-localisation with OCT4 in ES cell nuclei and KLF transcription factor   a  O4 G" Z0 Z0 N
      binding sites were over-represented within the promoters of the putative
0 c# W: H: _0 f  B5 a: I      ES cell gene list (p < 0.01). Taken together, this data strongly suggests 6 o4 g8 I9 ^8 S) t3 ^( g; o
      KLF family members regulate the maintenance of ES cell pluripotency.6 ]1 N. }  A3 D: q: O
      Methods1 e# k% I8 @5 e
      ES cell culture, embryoid body formation and immunohistochemistry
6 m& ^; J5 e8 d  W  {! S      W9.5 ES cell differentiation was performed as described (Bruce et al., . d/ N. p4 o. l' Y: T+ p
      Differentiation, in press). For immuno-fluorescence, feeder depleted W9.5 & k8 H3 e1 ^. y- O' H2 s8 _$ J, i
      ES cells were seeded onto sterile gelatin-coated glass cover slips in ES . [3 y  K* W/ u8 D' y4 d
      cell media, fixed in 4% PFA for 10 min then washed in PBS. Cover slips
0 e) R9 g: e2 A7 C: w      were boiled in 10 mM citric acid (pH 6.4) for 10 min, washed in PBS and 1 u- c$ \, a2 [- _0 U+ `, o
      permeabilized in 0.18% triton X-100 in PBS for 10 mins. For OCT4/KLF4 3 W5 J8 M0 E3 k. p
      detection, the cover slips were blocked in 1% BSA then incubated with * U+ k4 X5 w6 S/ {6 L6 m
      rabbit polyclonal antibody raised against OCT4 (Abcam #ab19857) and goat : w" H, W; |1 p7 S$ [9 \
      Anti-mouse KLF4 (R&D Systems #AF3158) overnight at 4°C, then incubated in + m* H0 O" g# r  A2 \( @
      goat anti-rabbit Alexa-488 (1:400) (Molecular Probes #A11008) and donkey
  S3 o- I9 h7 f* a8 J+ v8 I      anti-goat Alexa-647 (1:400) for 1 hour. For co-detection of OCT4/KLF2, the
% S/ o4 ^4 ^5 }& l& F) |      Zenon Rabbit IgG labelling kit was used (Molecular Probes #Z-25308). 1 μg
2 u' A( u  @" T- A) C$ v      of KLF2 rabbit polyclonal antibody (Santa Cruz SC-28675) was labeled with % q# s: U1 X' b, h
      Alexa647 Fab fragment following the manufacturer's instructions. Cover
1 o4 w2 C8 z7 t& h) E5 ^$ e+ C8 |      slips already stained for OCT4 (Alexa488) were incubated with labeled KLF2 - c9 O9 ~5 X- z4 J& q4 m
      for 1 hr then washed and post-fixed in 4% PFA to ensure strong signal   m4 O: y4 v9 G% m1 h% y# D2 o
      detection. DAPI 1:5000 (Molecular Probes #D3571) was used to detect
  G* R1 ?* V! t, a) y4 a      nuclear localization. Cover slips were mounted on SuperFrost&reg; Plus slides
) ]- ^0 t) R6 L/ _5 L! a: V1 ~      (Menzel-Glaser) with VectaShield mounting media (Vector Laboratories
3 z" z& I% S4 P# g7 {      H-1000). Confocal microscopy was performed on LSM 510 META Carl Zeiss
( H! F, r8 ^/ s0 ^4 D      microscope system.
9 ]" S0 _& ?" o8 ^      RNA extraction, cDNA synthesis and real-time RT-PCR data analysis" I& O/ N7 ^& B1 n' y+ u2 M
      Total RNA was made using TRIzol (Invitrogen), and cDNA was synthesized ) h8 _/ r1 U3 u& U
      from 2 μg of DNase1-treated total RNA using Superscript III (Invitrogen)
, F/ |9 w& R* ?1 R, b3 t      and oligo-DT12–15 (Promega) according to the manufacturer's instructions.
" `3 v' {8 Z8 d0 m; Z- r" C, z! ?- \      Quantitative RT-PCR (qRT-PCR) was carried out using the Applied Biosystems $ I$ p' ]: ^5 j4 W  a
      SYBR-green dye system and 7500 Real Time Cycler in 96-well plates. Cycling
# L1 g# A0 h8 h( u      variables were as follows: 50°C for 2 minutes, 95°C for 10 minutes, then 0 n1 c  k8 {4 E+ t- ]
      40 cycles of 15-second denaturation at 95°C and 1-minute at appropriate % B" Q4 L* p6 c, q0 B( h6 L- `
      annealing temperatures, optimized for each set of primers based on ( d4 C8 M9 G3 ]0 v: Z0 K
      dissociation curves (Additional file 8). Expression levels were normalized
' f1 Y# w: Q! x7 N      to HPRT as determined from the ratio of delta CT values. Mean of relative . n7 h8 O2 c: x0 U1 n8 B
      expression ± SD was determined from three biological replicates.
& V0 P3 z8 z) e# J7 ?' {      Sequencing of gel-purified amplicons was performed to ensure correct * B3 s3 g' `- C$ A% {& K# t
      product amplification.) D. Q$ ?  y& y' k; v* R+ N
      Additional file 8. Oligonucleotide sequences used for qRT-PCR analysis.
4 \$ l- y  g) ~( x- i) C( W5 [7 k$ q      Lists the oligonucleotide sequences and cycling conditions used for the # Z" G+ s* q1 J+ T  |
      qRT-PCR analysis.7 @9 B' e& ~3 }2 F5 {
      Format: DOC Size: 48KB Download file
  u" E  Q: W$ r/ P4 M      This file can be viewed with: Microsoft Word Viewer2 S3 |" W8 e+ w9 B
      Micro-array hybridization and analysis% M4 i# x/ x! I2 J3 o# R: O& F9 `7 p
      Expression profiling was performed using Sentrix Mouse-6 Expression , k. Q5 K' T# A1 U7 \
      BeadChip arrays from Illumina&reg;. RNA was assessed for integrity using the
  Z4 v+ p0 B5 s5 Z      Agilent Bioanalyzer 2100 and RNA integrity (RIN) scores above 9.5 were
' }+ W9 b/ a5 N2 n      present in all samples. Amplification was performed with 500 ng of total & ^  M4 N$ k7 F% F
      RNA using the Illumina TotalPrep RNA Amplification kit (Ambion) with a 12 ) ^. X* M: P. n, q
      hour in vitro transcription reaction period. The quantity and quality of
2 P1 _& Y7 ]( s1 C% v) G- E      biotin-UTP incorporated cRNA was also assessed on the Agilent Bioanalyzer
" k. b+ }; j# m& ~$ t* m; b      2100. Amplified cRNA (1500 ng per array) was hybridized to Mouse-6.v1 8 m+ K" v' Y) {
      BeadChip arrays according to the manufacturer guidelines and detected with
& G9 W1 h3 u; }/ F5 p& ^; X6 y      Fluorolink Streptavidin-Cy3 (Amersham Biosciences). Arrays were scanned
9 C& v+ }4 |7 f( G2 e( q. C4 Y9 q      using the Illumina BeadStation Scanner. The raw intensity values obtained
  }, c, y- E+ u# s- W      for the scanned array images were compiled using the proprietary
% F7 F' ?5 L4 t) t( E6 G      BeadStudio v1.5.1.3 software and imported into GeneSpring GX v7.3.1
& A- H5 m8 T3 A2 m; V      (Agilent). A mouse Illumina probe set was defined in the GeneSpring
, Y4 T/ Z% |6 R6 W      Workgroup using the Illumina targetIDs as the unique identifiers and
+ V" R+ f0 j; A3 c7 J& K; T      annotated according to array content files supplied by illumina&reg;. Data 1 ~3 T3 `: `- L$ G
      normalization was performed by first setting all measurements less than . Q7 O; R' O- ?
      0.01 to 0.01, then applying per chip normalization to the 50th percentile,
& |2 c9 P  v  n# L% b5 R" I6 V' m      and per gene normalization to the median.( |% d) b* c6 U6 s- r+ \
      From an interpretation which included the three FBS experiments, a
$ i7 |" p5 C$ z- m6 ?) u/ |9 e      non-parametric Welch ANOVA (where variances were not assumed equal) was
3 s. z' R2 o" B+ ^# h' ]      performed on all 46,120 probes to find a subset of genes whose expression
3 A5 v  S5 g: U& `! X+ l9 |      varied significantly throughout the differentiation time course. A 4 R; X! q; X8 R, R
      Benjamini and Hochberg False Discovery Rate multiple testing correction : b$ W# q  N* u0 U5 D5 B
      was applied to reduce the number of false positives. This yielded a set of
+ z! S$ y! Y: B3 h" h8 g* @      7,967 probes that showed statistically significant (p < 0.05) differences
( P, x. u" X1 R" p4 d2 c3 C      in expression by Welch t-test. This approach simplified running the data
& e& E: D4 |/ c/ P1 ^- B      mining algorithms to find syn-expression patterns by removing genes such
9 a2 P5 k1 M# Q, t( C+ l3 D      as housekeeping genes which did not display differential expression, or & W  O0 ?: d  i! B. y4 R  F
      genes which did not reach a threshold of expression at any time points.9 ^8 P8 N9 L9 H* N3 Y& O6 G
      Genome interpretations were generated in GeneSpring GXv7.3.1 (Agilent) in 0 S3 A1 Q  x  Z6 M0 C6 k
      which the mean relative expression level for each gene was represented at " ~6 }' w9 U% I
      each time point relative to the average of all time points. A number of 8 B& q, k' i5 }. s" M" Z
      statistical approaches were used to find syn-expression patterns. A - a8 T* i  A8 y, L
      Pearson correlation (between 0.9 and 1.0) was performed to find genes with & B$ Q& C& l2 G5 f
      similar expression profiles to Pouf1/Oct4, nanog, sox2, fgf5, wnt5 and
% G4 n. C7 V. X% d: H      brachyury. Hierarchical clustering was performed using the gene by gene   Z" S0 A/ X2 {& u+ Y) S
      and group by group 'tree' algorithm within the GeneSpring (Agilent)
/ U, ~0 X2 E; i9 n) D7 t7 i      program. This compares each probe with every other in the lists to " J& I0 k( B- E5 s/ n4 }
      generate similarity statistics and a tree representation of similarity % R; h4 K4 H- a6 c- D; ~, ]
      based on a Pearson correlation. Also, quality threshold (QT) clustering
+ |  ]7 C0 a8 [# p+ b$ L1 q8 Q      [92] was used to define distinct subsets of co-expressed genes. The entire $ o1 Z" C6 G& D" S* F, g& |( X
      data set for the experiments is available via GEO as well as via our
# d% A) N/ F5 J, W  \! x5 x      instance of Genespring (contact corresponding author).
/ s8 Q# |  ~  H" {, r- W      Electro-mobility gel shift assays (EMSA)
0 d* ?! D/ Q: O  T      Nuclear extracts were made from undifferentiated ES cells and embryoid
! X' l1 F( R6 N8 N& W/ e. p" I( p      bodies grown for four days in 10% serum. Electromobility gel shift assays
" k. \- q6 e7 Q& B* v      (EMSA) were performed as previously described [61] using double stranded   ]& G  r* t6 @/ {
      oligonucleotide probes corresponding to an extended CACC site in the
( W- i: S( X, `) h8 x      p18INK4c promoter (sense strand 5'-gttgggcggggcgtgggcggggcc-3') (Tallack,
5 L% d- y; B: ]* i6 B% L9 E9 q      et al., submitted). Supershifts were performed with specific antibodies 7 d9 W! M9 v* f; w  j- t8 m
      raised against SP1 (Santa Cruz SC-059x), SP3 (Santa Cruz SC-644x), KLF2 7 P" a) `- Z4 P* h9 q0 ~
      (Santa Cruz SC-28675), KLF3 [52] and KLF4 (Santa Cruz SC-12538x).1 \  n  P/ t& e' M0 k! |" c
      Bioinformatic searches for CACC sites and other over represented elements ; D  }2 Y2 ~2 r3 F
      in promoters of stem cell genes
: x7 Q" u/ D2 x" O* h6 X      The data-mining tool Biomart (accessed via Ensembl release 36 of the mouse
: d( B7 W4 l' a+ ?/ a& u" j      genome), was used to collate 2 kb of sequences upstream of the
4 s0 G0 ]+ t) q      transcriptional start sites (TSSs) of all genes listed in Table 1. The
5 ]3 r7 O% i0 H      Clover program, run via the MotifViz web interface [59], was used to
8 T1 o5 i8 \/ X1 K6 D      search for over-representation of position weighted matrices (PWMs) for 2 x$ ~1 [8 b; C4 v, `; Y
      octamer [93], Oct4 [60], Oct-Sox [60], KLF4 [54], E-box, GATA-1 [94] and * U7 `8 u* |  b
      CACC (KLF-A) elements in these promoter sets. The PWMs are listed in
7 Q  f  [2 t% G4 i7 r. h2 b      Additional file 7. The ECR Browser [95] was used to determine phylogenetic 2 V+ e) z0 @0 t" V( `1 {, M
      conservation of cis elements between mouse, rat and human genomes." k7 F) M, I4 V, S* \
      Abbreviations- I$ z; s$ \  _2 \+ f, O
      BSA, bovine serum albumin; EB, embryoid body; EMSA, electromobility gel
4 {: z; f$ n& L3 W0 H8 i. H0 T0 F      shift assay; ES, embryonic stem; FBS, fetal bovine serum; KL,
4 d2 z" D0 w+ e      Fkruppel-like factor; LIF, leukemia inhibitory factor; MEF, mouse % }# Z( Y6 l& h3 }) c) Y) z% \. z
      embryonic fibroblasts; PFA, paraformaldehyde; PWM, position weighted 4 l! u+ Y  E. d8 I0 o3 c' U! @1 k
      matrix; TFBS, transcription factor binding sites.0 D& ~) \* Q: Y; i' |3 |! d
      Competing interests
, `% a6 I: M3 {      The author(s) declares that there are no competing interests.  L; s* L5 ~" @! F% k
      Authors' contributions
' M1 n7 D' c7 S# o  |6 D# a      SB established the EB differentiation assay, collected and purified RNA
# }; v5 n6 f- E; ^; z. n      samples, designed and carried out the quantitative RT-PCR studies and
6 a1 d, w; m/ D7 S      drafted the manuscript. BG carried out the micro array protocol. LB $ P; R- p% v/ u
      Carried out experimentation of mesoderm specific gene expression. MG + Z5 f1 l0 _1 ^
      performed the array statistical analysis and annotation. SG participated
- G/ X5 ?" P3 X) \      in the design of the study and performed the statistical analysis. AP
5 ?6 m9 a( X3 x      conceived the study, and participated in its design and coordination and
6 l0 d; `( v, J0 x, V) B4 U7 t  O$ v      helped to draft the manuscript. All authors read and approved the final
6 K6 ~) T& W8 o0 i- T      manuscript.

Rank: 1

积分
10 
威望
10  
包包
20  
沙发
发表于 2009-3-6 23:14 |只看该作者
很好的资料,非常感谢
‹ 上一主题|下一主题
你需要登录后才可以回帖 登录 | 注册
验证问答 换一个

Archiver|干细胞之家 ( 吉ICP备2021004615号-3 )

GMT+8, 2025-6-2 14:39

Powered by Discuz! X1.5

© 2001-2010 Comsenz Inc.