干细胞之家 - 中国干细胞行业门户第一站

 

 

搜索
朗日生物

免疫细胞治疗专区

欢迎关注干细胞微信公众号

  
查看: 734514|回复: 278
go

[生物学相关学科类] PDF电子书:Biological.Data.Mining [复制链接]

Rank: 2

积分
50 
威望
50  
包包
541  
楼主
发表于 2010-7-20 15:17 |只看该作者 |倒序浏览 |打印
本帖最后由 细胞海洋 于 2010-7-20 15:26 编辑 % I2 ^3 ]1 D' [& ^: |

: I- F5 \! [0 r) S) p+ kModern biology has become an information science. Since the invention of a4 F$ h4 D- l. N  i$ q( o+ A) {
DNA sequencing method by Sanger in the late seventies, public repositories7 Q2 N. ?- M5 z( C0 N
of genomic sequences have been growing exponentially, doubling in size every
1 Z5 |) G, S. q! j% U16 months—a rate often compared to the growth of semiconductor transistor) [6 H+ m4 o4 T0 c- P+ ]
densities in CPUs known as Moore’s Law. In the nineties, the public–private1 ~; w: Q4 J7 r0 O1 V
race to sequence the human genome further intensified the fervor to gener-
. N; Q: R8 {& R2 G0 Qate high-throughput biomolecular data from highly parallel and miniaturized
+ P& v7 W7 x; I& ]& sinstruments. Today, sequencing data from thousands of genomes, including
8 `4 t* X) h( Bplants, mammals, and microbial genomes, are accumulating at an unprece-, U: D: ?5 p  D3 t) z6 r
dented rate. The advent of second-generation DNA sequencing instruments,2 I/ d4 y1 }  t( ?& F; D* [, n
high-density cDNA microarrays, tandem mass spectrometers, and high-power
2 o" T! z" Q: m2 n( B9 W" ]NMRs have fueled the growth of molecular biology into a wide spectrum of% g1 H' P5 U* }! F) ?6 t# K
disciplines such as personalized genomics, functional genomics, proteomics,: P# W: I) k9 o' P. X8 B  q
metabolomics, and structural genomics. Few experiments in molecular biol-. a+ H) l3 G4 j# {% y- D
ogy and genetics performed today can afford to ignore the vast amount of$ v* i. h0 K1 a3 f! A- |
biological information publicly accessible. Suddenly, molecular biology and
$ X+ q* S/ |, p7 qgenetics have become data rich.
( e3 ~$ U! Q6 q. `Biological data mining is a data-guzzling turbo engine for postgenomic
7 F; c6 m6 i1 E& X  nbiology, driving the competitive race toward unprecedented biological discov-5 Z* H* F# D  i: b# {3 D
ery opportunities in the twenty-first century. Classical bioinformatics emerged
- ?7 f2 l/ {: _) F+ C( Lfrom the study of macromolecules in molecular biology, biochemistry, and
  m. D: t0 C, W( J4 G  Bbiophysics. Analysis, comparison, and classification of DNA and protein se-
3 b) U' p6 }# I9 X) a/ S6 wquences were the dominant themes of bioinformatics in the early nineties.
' s# I1 ^2 t  y6 q" T. \4 r4 IMachine learning mainly focused on predicting genes and proteins functions( [) K$ ~" F/ j
from their sequences and structures. The understanding of cellular functions7 j) a& h2 _- }) m1 ?/ r/ j
and processes underlying complex diseases were out of reach. Bioinformatics
' P4 t- ^+ ]* P; M. U  ?# p  xscientists were a rare breed, and their contribution to molecular biology and% ?$ D' d; R  |: r; z  P
genetics was considered marginal, because the computational tools available( D* h5 N$ n' V# [; P  ~
then for biomolecular data analysis were far more primitive than the array
$ f3 `5 W5 c. l# pof experimental techniques and assays that were available to life scientists.3 i# y! D! q3 o, G, d3 U( ]5 c& N5 j
Today, we are now witnessing the reversal of these past trends. Diverse sets
" Q; y1 v, \# l! y5 Z9 Iof data types that cover a broad spectrum of genotypes and phenotypes, par-
; A" Z) A+ P. V  Eticularly those related to human health and diseases, have become available.1 o2 X$ B5 |' [
Many interdisciplinary researchers, including applied computer scientists, ap-3 H3 @4 w0 b9 g* x3 I
plied mathematicians, biostatisticians, biomedical researchers, clinical scien-
. Y8 m5 e, @/ M8 @2 ?, rtists, and biopharmaceutical professionals, have discovered in biology a goldmine of knowledge leading to many exciting possibilities: the unraveling of the
0 Q  g" t- `# \  ~7 h: o' {* z' ~tree of life, harnessing the power of microbial organisms for renewable energy,
- v7 d+ h1 P' l; E& Afinding new ways to diagnose disease early, and developing new therapeutic
2 A* \7 O1 H2 R" j( c! Tcompounds that save lives. Much of the experimental high-throughput biology
# K/ X5 _8 D: z- K& F. adata are generated and analyzed “in haste,” therefore leaving plenty of oppor-
+ }6 P! K/ U# @2 K: `tunities for knowledge discovery even after the original data are released. Most
, v1 D/ Q( J; A3 j& W$ Qof the bets on the race to separate the wheat from the chaff have been placed( \9 H. n& B9 w( r5 t
on biological data mining techniques. After all, when easy, straightforward,. }/ E, @2 }" k2 _# Z9 h; R
first-pass data analysis has not yielded novel biological insights, data mining
- h( w; B$ S9 @; [0 t$ p0 r  H2 E5 btechniques must be able to help—or, many presumed so.
( V6 ^( ~6 c6 ~0 |! D0 O) x
8 y# B5 M8 D+ ?/ k6 p3 E! W[hide][/hide]
附件: 你需要登录才可以下载或查看附件。没有帐号?注册
已有 1 人评分威望 包包 收起 理由
细胞海洋 + 5 + 10 极好资料

总评分: 威望 + 5  包包 + 10   查看全部评分

Rank: 1

积分
威望
2  
包包
86  
沙发
发表于 2010-7-20 17:19 |只看该作者
好的,谢谢楼主分享~

Rank: 2

积分
64 
威望
64  
包包
1734  
藤椅
发表于 2015-5-31 19:00 |只看该作者
羊水干细胞

Rank: 2

积分
64 
威望
64  
包包
1769  
板凳
发表于 2015-6-11 08:27 |只看该作者
干细胞之家微信公众号
加油啊!!!!顶哦!!!!!支持楼主,支持你~  

Rank: 2

积分
75 
威望
75  
包包
2193  
报纸
发表于 2015-6-11 18:17 |只看该作者
内皮祖细胞

Rank: 2

积分
129 
威望
129  
包包
1788  
地板
发表于 2015-8-11 08:01 |只看该作者
哈哈,顶你了哦.  

Rank: 2

积分
61 
威望
61  
包包
1757  
7
发表于 2015-8-15 11:41 |只看该作者
想都不想,就支持一下  

Rank: 2

积分
118 
威望
118  
包包
1769  
8
发表于 2015-8-17 04:22 |只看该作者
家财万贯还得回很多贴哦  

Rank: 2

积分
88 
威望
88  
包包
1897  
9
发表于 2015-8-20 15:52 |只看该作者
快毕业了 希望有个好工作 干细胞还是不错的方向

Rank: 2

积分
76 
威望
76  
包包
1772  
10
发表于 2015-8-22 08:27 |只看该作者
端粒酶研究
‹ 上一主题|下一主题
你需要登录后才可以回帖 登录 | 注册
验证问答 换一个

Archiver|干细胞之家 ( 吉ICP备2021004615号-3 )

GMT+8, 2025-5-21 05:32

Powered by Discuz! X1.5

© 2001-2010 Comsenz Inc.