|

- 积分
- 133
- 威望
- 133
- 包包
- 316
|
本帖最后由 细胞海洋 于 2010-9-19 20:20 编辑 & G; x* c v- j$ d# ~5 o
o* Q. T+ b3 e3 m
SECTION I INTRODUCTION – DATA DIVERSITY AND INTEGRATION 1
9 F& b3 {! E$ C" g! k% a1 Integrative Data Analysis and Visualization: Introduction1 i* u6 O% }/ P/ R' ^8 `7 u% A
to Critical Problems, Goals and Challenges 3, F6 X9 r3 d0 t% Q1 h5 ^0 G4 ~( p1 k
Francisco Azuaje and Joaquı´n Dopazo2 E6 E% t+ [/ c2 D- g/ u* T- i
1.1 Data Analysis and Visualization: An Integrative Approach 34 G6 Q3 }0 L( K% G2 l$ d
1.2 Critical Design and Implementation Factors 57 M5 V" R) @4 D4 S2 w
1.3 Overview of Contributions 8" i; a6 u5 ]' r# B
References 9
! Z0 [( p) Y0 r9 b0 Q( d2 Biological Databases: Infrastructure, Content
1 l2 N3 E( d2 O. X8 ^$ L( land Integration 11
" O# d( F. B r: L; a* h h6 QAllyson L. Williams, Paul J. Kersey, Manuela Pruess' B, c9 Q1 X6 }
and Rolf Apweiler* c- g" }' w' c7 f
2.1 Introduction 11# T' ~5 q2 c% P# ?
2.2 Data Integration 12
& W' W( S8 T6 Q, Z2.3 Review of Molecular Biology Databases 17- Y$ F" [$ h* j3 c/ E9 b/ l
2.4 Conclusion 23; \) T- D1 }$ c& D: M
References 26
) H, X( y: I E& H* Y* |4 J3 Data and Predictive Model Integration: an Overview, F+ s" J! H. U. Z# i6 `
of Key Concepts, Problems and Solutions 29
4 `7 Y9 i2 |5 _3 c0 E/ [5 aFrancisco Azuaje, Joaquı´n Dopazo and Haiying Wang: r, g5 j& N7 d/ |+ J+ w
3.1 Integrative Data Analysis and Visualization: Motivation and Approaches 29$ j, D z! W8 x# s+ e+ W1 o0 o
3.2 Integrating Informational Views and Complexity for Understanding Function 31' T" F: v# C3 H$ H0 m; P
3.3 Integrating Data Analysis Techniques for Supporting Functional
6 }. {0 [' G' n' [3 d7 M, i: hSECTION II INTEGRATIVE DATA MINING AND VISUALIZATION –
+ _* G$ n2 ?3 Y1 ~7 \1 q. F( c- e6 f. x2 IEMPHASIS ON COMBINATION OF MULTIPLE
$ a* F/ ^: @7 L2 c$ jDATA TYPES 41% F6 \. J) [$ w! f5 ~
4 Applications of Text Mining in Molecular Biology, from Name3 M4 f1 Q0 t' p! L4 x$ W" F: |
Recognition to Protein Interaction Maps 43
8 U% w; h G3 ^- wMartin Krallinger and Alfonso Valencia
+ r9 I1 V7 v1 d0 `* ]4.1 Introduction 44
9 T- J( u p! H4 j5 Y4.2 Introduction to Text Mining and NLP 45
1 X6 d ]" K! s8 v5 }9 ]! N0 P4.3 Databases and Resources for Biomedical Text Mining 47
! X! y. _: L Y4.4 Text Mining and Protein–Protein Interactions 50! u6 a4 u( O1 j7 P5 l
4.5 Other Text-Mining Applications in Genomics 55. \: D* u4 b: R7 [" E
4.6 The Future of NLP in Biomedicine 56
3 k8 |) W: D+ r8 G5 ]" fAcknowledgements 565 x ^& S6 t' F5 E$ N
References 56
, @7 i8 N) {& b1 H0 r7 J+ G9 \, k5 Protein Interaction Prediction by Integrating Genomic
% j: T3 n3 j6 H2 E0 q, K# oFeatures and Protein Interaction Network Analysis 614 g: j3 s4 `( d. K* X. }0 H
Long J. Lu, Yu Xia, Haiyuan Yu, Alexander Rives, Haoxin Lu,
& X- J; t7 K% W/ w0 RFalk Schubert and Mark Gerstein/ p( z3 x; s* D- C g# }
5.1 Introduction 62! G1 R8 c( c0 b. L" m9 z1 o+ N
5.2 Genomic Features in Protein Interaction Predictions 63
0 [( s1 l" H' T5.3 Machine Learning on Protein–Protein Interactions 67( W3 b/ n, x% `+ O. l: n
5.4 The Missing Value Problem 739 |6 x8 t! _+ Z; v
5.5 Network Analysis of Protein Interactions 75+ P4 t/ @ f- u1 B5 ~ Q" ?* v
5.6 Discussion 79/ p! ^. Q4 U; D3 w
References 80* X3 w- c. B( G3 d9 H q6 m) W
6 Integration of Genomic and Phenotypic Data 83 U" J0 e# q: @# E6 S9 }$ z* D& ]/ S
Amanda Clare! e; Y7 M, N- w+ l1 j$ N; i% l
6.1 Phenotype 83
5 s/ z) Q/ e0 M- K9 o6.2 Forward Genetics and QTL Analysis 85* A u5 S8 y: m- N! @' H" ?" M
6.3 Reverse Genetics 87
8 y" `+ x: _# Y d6.4 Prediction of Phenotype from Other Sources of Data 88# F* \ i) b* n
6.5 Integrating Phenotype Data with Systems Biology 90
+ j0 t) D' Y K {% B- m; d6.6 Integration of Phenotype Data in Databases 93
2 S: E$ _ ~4 B% \2 n# m- j. M6.7 Conclusions 954 G0 @1 D1 e' S9 g& }9 v+ \$ x
References 95
- \! [) G- w9 B" Z( b$ t7 Ontologies and Functional Genomics 99) _: S. S3 O0 Y9 J+ q2 G7 m' x
Fa´tima Al-Shahrour and Joaquı´n Dopazo" }8 ]2 ?1 m: _* t. y3 P: Q9 P5 h
7.1 Information Mining in Genome-Wide Functional Analysis 99
2 [" ?4 X5 j3 I" ^9 _7.2 Sources of Information: Free Text Versus Curated Repositories 1006 V& T/ k- d/ F6 |/ d R$ R" ]
7.3 Bio-Ontologies and the Gene Ontology in Functional Genomics 101
+ a4 I3 t7 z( }. V' F' w( C& D# v9 G7.4 Using GO to Translate the Results of Functional Genomic Experiments into, \& u' Y# i! K. Q9 A! f
Biological Knowledge 103
! v' |" D, ], R1 z5 Q7.5 Statistical Approaches to Test Significant Biological Differences 1042 J8 A4 `% r) r' S& u5 X$ v3 S' Z z7 ~
7.6 Using FatiGO to Find Significant Functional Associations6 x3 O7 p* V0 J3 b! ~; q& Z2 P; x
in Clusters of Genes 106
+ k. g8 J! z# I4 H: I# O0 o8 V7.7 Other Tools 107
/ Z. i5 e5 q! V! [7.8 Examples of Functional Analysis of Clusters of Genes 108
+ J& g* H$ g* S; _& T: b7.9 Future Prospects 110
, E* v' c+ M. F4 ]+ rReferences 110
. ~: _0 Q( e6 ^8 L8 The C. elegans Interactome: its Generation and Visualization 113
2 x" Q: m# r6 y XAlban Chesnau and Claude Sardet2 l3 K- B+ Q5 w$ z) t: U% R- `4 i
8.1 Introduction 113
# l8 j: B( y: |4 B% S5 X8.2 The ORFeome: the first step toward the interactome of C. elegans 116
4 J: s! J2 g9 }% k4 _8.3 Large-Scale High-Throughput Yeast Two-Hybrid Screens to Map the C. elegans
4 E, |4 x+ k, e8 P2 u0 F8 g4 {Protein–Protein Interaction (Interactome) Network: Technical Aspects 118
% W" ?& h* ^5 q+ u3 B3 h1 Q8.4 Visualization and Topology of Protein–Protein Interaction Networks 1215 l" Y) v% e Y" D) Z d# ?
8.5 Cross-Talk Between the C. elegans Interactome and other Large-Scale
0 A; g( y$ |& Z: qGenomics and Post-Genomics Data Sets 123
' l: D$ N* p& s! q8.6 Conclusion: From Interactions to Therapies 129
1 c' @ \1 o) B+ J2 M$ z- @References 130
3 O4 c! j+ _( _' @0 {3 D& Y" QSECTION III INTEGRATIVE DATA MINING AND; c# s- p1 e& q7 j8 Q! T
VISUALIZATION – EMPHASIS ON
2 g& Y1 \$ }# @/ U7 r3 ]/ tCOMBINATION OF MULTIPLE
+ v/ ~5 w& x* z- W; e- bPREDICTION MODELS AND METHODS 135' a$ [$ B! W( C( j% b. X
9 Integrated Approaches for Bioinformatic Data Analysis
& D y' f: X8 _* [, A$ Uand Visualization – Challenges, Opportunities5 [( T0 e4 H8 k% s8 N( y& J
and New Solutions 137" r1 |. k( L% j) s( a
Steve R. Pettifer, James R. Sinnott and Teresa K. Attwood: V1 m7 k9 K) [) h! [
9.1 Introduction 1377 n3 C9 G, p! B D/ S
9.2 Sequence Analysis Methods and Databases 139
6 ^4 h& `/ w3 x3 }9.3 A View Through a Portal 141
( |2 Z4 Q- [) S* a8 J% \0 y- f) V9.4 Problems with Monolithic Approaches: One Size Does Not Fit All 142$ U3 U& |- j7 C) }0 v
9.5 A Toolkit View 143
1 z0 J& P- t; e8 A4 e3 w/ u9.6 Challenges and Opportunities 1455 ?! h) ?0 d6 p: r% b' B
9.7 Extending the Desktop Metaphor 147) V' v# k' |& e. ~( f" S
9.8 Conclusions 151
, w$ V" P5 @( j. h( gAcknowledgements 151
; D' {3 x a" z3 J; ^* gReferences 152
2 E1 u: C% e+ I4 u8 `. i10 Advances in Cluster Analysis of Microarray Data 153
" }/ i$ C: K& s5 @! X/ IQizheng Sheng, Yves Moreau, Frank De Smet, Kathleen Marchal
) R- G- A: C: Y* P2 gand Bart De Moor
# e# {1 A# g/ E: |9 H# e3 Y0 O+ n; A1 c10.1 Introduction 153
; q0 Q5 X. n) o: l/ J$ b10.2 Some Preliminaries 155
: a& t4 Z) \- P* W3 f2 s( y10.3 Hierarchical Clustering 157
9 Y: x0 R2 k D9 `5 Y$ |# ~# v10.4 k-Means Clustering 159
- ]$ u7 u2 Q% {# H10.5 Self-Organizing Maps 159* [3 E E3 h- A/ P3 S( O
10.6 A Wish List for Clustering Algorithms 160& I- a- u/ q: c* m6 F
10.7 The Self-Organizing Tree Algorithm 161: Y! S! b7 C$ ~& }
10.8 Quality-Based Clustering Algorithms 162
* V0 ], U9 O7 U& n10.9 Mixture Models 163- j) k! X0 s2 i, }. |2 r; C+ l
10.10 Biclustering Algorithms 166
% @, ?* N+ |) M7 \( ]10.11 Assessing Cluster Quality 168) S1 a, m X$ ?9 q& b
10.12 Open Horizons 170
% z1 Y, _$ f- O0 OReferences 171
& {4 d, m. Q: w8 p- d2 ^! }11 Unsupervised Machine Learning to Support Functional2 n g3 Z, d1 \/ K0 ^7 H. B
Characterization of Genes: Emphasis on Cluster+ E% ~) j. u7 \3 I3 G1 G. x3 N1 R
Description and Class Discovery 1757 |( x" g$ F# q1 I7 e1 Q' g
Olga G. Troyanskaya; r+ Z, w2 \5 I
11.1 Functional Genomics: Goals and Data Sources 175$ B4 C& q0 W7 b" c
11.2 Functional Annotation by Unsupervised Analysis of Gene
( ?* `1 _# z/ W; d6 C: FExpression Microarray Data 177% I0 t: ~, X+ D% C5 v/ v+ |
11.3 Integration of Diverse Functional Data For Accurate Gene Function" f, Y) r) ?$ u! D! x
Prediction 179- T# I1 `& z- ^( B8 b. I
11.4 MAGIC – General Probabilistic Integration of Diverse Genomic Data 180
" y- Q7 Y: ?$ U) h11.5 Conclusion 1882 P0 a3 b- V; z& s2 k$ ^
References 189+ p6 F8 L2 J: c! [: y6 Q
12 Supervised Methods with Genomic Data: a Review4 M( z4 n+ V3 H& k: p
and Cautionary View 193- r5 I1 H$ X9 g0 P, p
Ramo´n Dı´az-Uriarte
+ o: u4 V+ O9 V5 H C7 N12.1 Chapter Objectives 193! @, `. S+ M3 ^' B
12.2 Class Prediction and Class Comparison 1943 q2 i6 Y/ q! L l& b: x2 D
12.3 Class Comparison: Finding/Ranking Differentially Expressed Genes 194( {1 s* ?0 u6 t' ?+ u p! d4 Y
12.4 Class Prediction and Prognostic Prediction 198- C9 w! c- ~- W9 D; i
12.5 ROC Curves for Evaluating Predictors and Differential Expression 2011 t' R8 K7 c5 l2 E6 q: g
12.6 Caveats and Admonitions 203
; v' S1 {, |3 F12.7 Final Note: Source Code Should be Available 209
! m6 s0 v: Y+ |$ D( CAcknowledgements 2108 } h; P) ?# c2 W8 W) \
References 2107 \4 d; R4 Y: a/ D* M. d
13 A Guide to the Literature on Inferring Genetic Networks2 P4 ~2 L( _ p
by Probabilistic Graphical Models 215
' p8 t1 \3 I6 t- b1 k. N5 w, bPedro Larran˜aga, In˜aki Inza and Jose L. Flores
1 e( T% n, [* w" R8 c- V13.1 Introduction 2151 r7 q Q! a A9 Q! S4 x, K
13.2 Genetic Networks 216& i9 S2 l& k4 S5 @. h- w
13.3 Probabilistic Graphical Models 2180 D6 }; y3 x# {7 Q- h
13.4 Inferring Genetic Networks by Means of Probabilistic Graphical Models 229
: h' m ^9 Q, `5 _0 ^$ m2 B5 M6 U" k) \" b13.5 Conclusions 234' _/ _! I `6 K, b& V" D! f
Acknowledgements 2359 a+ V8 i- g; ]" \4 }
References 235
) i) V3 n$ O3 G: i; t14 Integrative Models for the Prediction and Understanding
0 P. c H' K! w+ c; R9 Sof Protein Structure Patterns 239( v1 I, v0 l- z1 v, j% u. p
Inge Jonassen" }# X$ ]/ z0 c4 [$ X* V# ?( Z
14.1 Introduction 239/ F; f7 @5 ]$ H; } |
14.2 Structure Prediction 241
6 C( l V k4 @/ A9 K14.3 Classifications of Structures 244
; m0 @; Y Z# J& Z3 T. z$ ^14.4 Comparing Protein Structures 246/ d" n# r+ C/ d
14.5 Methods for the Discovery of Structure Motifs 249, u. C* @& }4 T9 Z- w
14.6 Discussion and Conclusions 252$ q0 h' |! x* v
References 2540 C3 F4 P9 ?4 A
- k8 o1 ~3 B2 T0 P4 `9 H0 U. L/ P[hide][/hide] |
附件: 你需要登录才可以下载或查看附件。没有帐号?注册
-
总评分: 威望 + 2
包包 + 5
查看全部评分
|