PDF电子书：Biological.Data.Mining - 经典PDF电子书下载区干细胞之家



免疫细胞治疗专区	欢迎关注干细胞微信公众号

12 3 4 5 6 7 8 9 10 ... 28 下一页

返回列表

查看: 732642\|回复: 277	go [生物学相关学科类] PDF电子书：Biological.Data.Mining [复制链接]

viogriffin

注册会员

Rank: 2

积分: 50
威望: 50
包包: 541

楼主

发表于 2010-7-20 15:17 |只看该作者 |倒序浏览 |打印

本帖最后由细胞海洋于 2010-7-20 15:26 编辑 1 @  B1 Z8 Y9 n* n$ L! ?
  Q( o. o$ y0 A: l
Modern biology has become an information science. Since the invention of a
DNA sequencing method by Sanger in the late seventies, public repositories& ]2 i& |. L0 p( |+ F; \7 P( j7 g" ^1 d
of genomic sequences have been growing exponentially, doubling in size every
16 months—a rate often compared to the growth of semiconductor transistor7 G6 C, R5 _$ P0 n9 K* A+ }
densities in CPUs known as Moore’s Law. In the nineties, the public–private
race to sequence the human genome further intensiﬁed the fervor to gener-
ate high-throughput biomolecular data from highly parallel and miniaturized) p/ x8 g( o2 q  G" T- I! l7 X) R
instruments. Today, sequencing data from thousands of genomes, including' h$ r9 ]& d/ P6 Y4 }
plants, mammals, and microbial genomes, are accumulating at an unprece-$ s  y" H% ]# w; O# V) j
dented rate. The advent of second-generation DNA sequencing instruments,
high-density cDNA microarrays, tandem mass spectrometers, and high-power
NMRs have fueled the growth of molecular biology into a wide spectrum of
disciplines such as personalized genomics, functional genomics, proteomics,6 X( d( E6 r% i# |; m9 v
metabolomics, and structural genomics. Few experiments in molecular biol-
ogy and genetics performed today can aﬀord to ignore the vast amount of- ~  m5 {# o4 y; A& x6 n) r
biological information publicly accessible. Suddenly, molecular biology and5 H4 f# j6 u+ T7 H
genetics have become data rich.$ P- R4 A! |0 p. x
Biological data mining is a data-guzzling turbo engine for postgenomic% r& }8 |: k  X6 s" O+ T9 [
biology, driving the competitive race toward unprecedented biological discov-
ery opportunities in the twenty-ﬁrst century. Classical bioinformatics emerged
from the study of macromolecules in molecular biology, biochemistry, and; e0 e' S8 t+ m8 K
biophysics. Analysis, comparison, and classiﬁcation of DNA and protein se-
quences were the dominant themes of bioinformatics in the early nineties.
Machine learning mainly focused on predicting genes and proteins functions# a1 d9 W# p/ \# Y9 }. r1 e
from their sequences and structures. The understanding of cellular functions" M; y- o! r! J5 W- E
and processes underlying complex diseases were out of reach. Bioinformatics/ _0 `4 T. o5 i, w: m8 X
scientists were a rare breed, and their contribution to molecular biology and
genetics was considered marginal, because the computational tools available+ d. W& ^& p% m. p4 j
then for biomolecular data analysis were far more primitive than the array
of experimental techniques and assays that were available to life scientists.
Today, we are now witnessing the reversal of these past trends. Diverse sets) E: ^2 r" U. n5 s. e+ z/ y
of data types that cover a broad spectrum of genotypes and phenotypes, par-
ticularly those related to human health and diseases, have become available.% s; s% b8 {9 \1 d; u
Many interdisciplinary researchers, including applied computer scientists, ap-
plied mathematicians, biostatisticians, biomedical researchers, clinical scien-2 `8 ^7 d1 N6 C
tists, and biopharmaceutical professionals, have discovered in biology a goldmine of knowledge leading to many exciting possibilities: the unraveling of the
tree of life, harnessing the power of microbial organisms for renewable energy,( P3 Q$ \* c2 j& `4 N1 A7 R" }' v
ﬁnding new ways to diagnose disease early, and developing new therapeutic8 e8 S# t; Y$ z
compounds that save lives. Much of the experimental high-throughput biology' ~! P/ X' T( @0 {- u- L4 f
data are generated and analyzed “in haste,” therefore leaving plenty of oppor-
tunities for knowledge discovery even after the original data are released. Most2 u( u5 l- ^: z' \$ z
of the bets on the race to separate the wheat from the chaﬀ have been placed8 q! |3 M5 t+ ]; ]
on biological data mining techniques. After all, when easy, straightforward,
ﬁrst-pass data analysis has not yielded novel biological insights, data mining; L$ K# w8 \' q+ e* V+ {
techniques must be able to help—or, many presumed so.- Q3 |+ s2 Q  O$ c9 O/ {
% b1 O5 t; K# j3 g& m$ q' t
[hide][/hide]

附件: 你需要登录才可以下载或查看附件。没有帐号？注册