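Course name: Bioinformatics Algorithms, final exam
Course type: Elective, Department of Computer Science
Instructor: 歐陽彥正
College: College of Electrical Engineering and Computer Science
Department: Department of Computer Science
Exam date (Y/M/D): 2006/1/13
Time limit (minutes):
Reward requested: yes
(If not explicitly stated, no reward will be issued)
Exam questions: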
(20%) At a sub-root during the construction of a decision tree, the software
needs to determine whether the activity of a gene can be exploited to
predict the weight-based category that a person belongs to.
The following table gives the distribution of the samples at the sub-root.
Assume that the criterion for preventing overfitting is that the statistical
confidence of claiming independence between the attribute and the
decision is over 95%. Based on this criterion, can you tell whether overfitting
could occur if the activity of the gene is used to make the prediction?
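
As a rough illustration of the kind of test this question refers to, the sketch below computes a chi-square statistic for a contingency table and compares it with the 95% critical value. The counts are hypothetical (the exam's table is not reproduced here), and reading the criterion as "split only if independence can be rejected at the 95% level" is an assumption.

```python
# Hypothetical contingency table (NOT the exam's data): rows are gene
# activity levels, columns are weight categories.
observed = [
    [20, 10],   # gene active
    [10, 20],   # gene inactive
]

# Upper 5% critical values of the chi-square distribution, keyed by
# degrees of freedom (standard table values).
CHI2_CRIT_95 = {1: 3.841, 2: 5.991, 3: 7.815, 4: 9.488}

def chi_square_statistic(table):
    """Return (statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            # Expected count if attribute and decision were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (obs - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof

stat, dof = chi_square_statistic(observed)
reject_independence = stat > CHI2_CRIT_95[dof]
print(f"chi2 = {stat:.3f}, dof = {dof}, reject independence: {reject_independence}")
```

Under this reading, a statistic below the critical value would mean the gene's activity cannot be distinguished from noise at this node, so splitting on it would risk overfitting.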
(20%) A biochemist wants to test his hypothesis that the activities of gene1
and gene2 together determine whether a person suffers from a disease. The
following is the microarray data that the biochemist obtained in an experiment.
If the biochemist employs the univariate approach, can he figure out
the influences of these two genes correctly? Assume that a confidence level of
95% is normally required for making a claim, and that \sigma^2 for gene1 and
gene2 is estimated to be 16.63 and 21.38, respectively.
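
One possible form of a univariate check is a two-sample t statistic computed for a single gene, using the variance estimate given in the question. In the sketch below the expression values and group sizes are hypothetical (the exam's microarray data is not reproduced here), and treating \sigma^2 as a pooled within-group variance is an assumption.

```python
import math

def two_sample_t(group_a, group_b, pooled_var):
    """t statistic for the difference in means, given a pooled variance."""
    mean_a = sum(group_a) / len(group_a)
    mean_b = sum(group_b) / len(group_b)
    std_err = math.sqrt(pooled_var * (1 / len(group_a) + 1 / len(group_b)))
    return (mean_a - mean_b) / std_err

# Hypothetical gene1 activities (NOT the exam's data).
diseased = [12.0, 15.0, 9.0, 14.0, 10.0]
healthy = [10.0, 12.0, 8.0, 11.0, 14.0]

SIGMA2_GENE1 = 16.63    # variance estimate quoted in the question
T_CRIT_95_DF8 = 2.306   # two-sided 95% critical value for 8 degrees of freedom

t_gene1 = two_sample_t(diseased, healthy, SIGMA2_GENE1)
significant = abs(t_gene1) > T_CRIT_95_DF8
print(f"t = {t_gene1:.3f}, significant at 95%: {significant}")
```

A gene can fail such a one-at-a-time test and still matter jointly with another gene, which is the situation the question asks about.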
(20%) The plot of the dataset in problem 2 is given as follows. Is this dataset
linearly separable? If yes, can you figure out a linear decision function
in the form of sgn(v) = sgn[ax+by+c], where v = (x, y) is the feature vector
of a new sample and a, b, c are real numbers?
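
A decision function of the form sgn[ax+by+c] can be found mechanically when the data is linearly separable. The sketch below uses the classic perceptron update rule as one way to do this; the points and labels are hypothetical stand-ins (the exam's plot is not reproduced here), and the names `train_perceptron` and `sgn` are chosen for illustration only.

```python
def sgn(value):
    """Sign function; zero is mapped to +1 by convention here."""
    return 1 if value >= 0 else -1

def train_perceptron(points, labels, max_epochs=1000):
    """Return (a, b, c) separating the points, or None if none was found."""
    a = b = c = 0.0
    for _ in range(max_epochs):
        errors = 0
        for (x, y), label in zip(points, labels):
            if sgn(a * x + b * y + c) != label:
                # Move the boundary toward the misclassified point.
                a += label * x
                b += label * y
                c += label
                errors += 1
        if errors == 0:
            return a, b, c
    return None

# Hypothetical (gene1, gene2) activities; +1 = diseased, -1 = healthy.
points = [(1, 1), (2, 1), (1, 2), (4, 4), (5, 4), (4, 5)]
labels = [-1, -1, -1, 1, 1, 1]

coefficients = train_perceptron(points, labels)
print("a, b, c =", coefficients)
```

If the data is not linearly separable, the loop never reaches an error-free pass and the function returns None after `max_epochs`.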
(20%) Please describe the concept of PSI-BLAST by explaining how it combines
sequence alignment (BLAST) with multiple sequence alignment (MSA) and a
position-specific scoring matrix (PSSM) to find remote homologues.
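
To make the PSSM step concrete, the sketch below builds a log-odds matrix from a small multiple alignment and scores candidate segments against it. It is a simplified illustration, not PSI-BLAST's actual procedure: the alignment is hypothetical, the background distribution is assumed uniform, and a flat pseudocount replaces the sequence weighting and substitution-matrix-based pseudocounts used in practice.

```python
import math
from collections import Counter

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids

def build_pssm(alignment, pseudocount=1.0):
    """Return one {residue: log-odds score} dict per alignment column."""
    background = 1.0 / len(ALPHABET)   # uniform background (assumption)
    pssm = []
    for column in zip(*alignment):
        # Gap characters and other symbols are ignored.
        counts = Counter(res for res in column if res in ALPHABET)
        total = sum(counts.values()) + pseudocount * len(ALPHABET)
        pssm.append({
            res: math.log2(((counts[res] + pseudocount) / total) / background)
            for res in ALPHABET
        })
    return pssm

def score_segment(pssm, segment):
    """Sum the position-specific scores of an ungapped segment."""
    return sum(column[res] for column, res in zip(pssm, segment))

# Hypothetical MSA built from the hits of a first BLAST round.
alignment = ["ACDE", "ACDF", "ACEE", "GCDE"]
pssm = build_pssm(alignment)

consensus_score = score_segment(pssm, "ACDE")
unrelated_score = score_segment(pssm, "WWWW")
print(f"ACDE: {consensus_score:.2f}, WWWW: {unrelated_score:.2f}")
```

In the iterative scheme, the matrix would replace the original query in the next database search, and newly found hits would be added to the alignment before rebuilding the matrix.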
(20%) Assume that you are requested to implement a software program that
measures the structural similarity between a target protein and a reference
protein. Furthermore, assume that you have found a software package that can
provide you with the major principal components of a 3-dimensional object.
How will you design the structural similarity analysis software with the
Fast Fourier Transform algorithm?
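
One ingredient such a design could use is FFT-based cross-correlation, which scores every relative shift of two signals at once. The sketch below is a deliberately reduced 1-D illustration: it assumes both proteins have already been rotated onto their principal axes and summarized as density profiles along the first axis. The profiles are hypothetical, and a fuller design would more likely correlate 3-D grids.

```python
import cmath
import math

def fft(values, inverse=False):
    """Recursive radix-2 FFT (unnormalized); length must be a power of two."""
    n = len(values)
    if n == 1:
        return list(values)
    even = fft(values[0::2], inverse)
    odd = fft(values[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out

def circular_cross_correlation(f, g):
    """c[s] = sum over t of f[t] * g[(t + s) mod n], computed via the FFT."""
    n = len(f)
    spectrum = [a.conjugate() * b for a, b in zip(fft(f), fft(g))]
    return [value.real / n for value in fft(spectrum, inverse=True)]

# Hypothetical density profiles along the first principal axis.
reference = [0, 1, 3, 1, 0, 0, 0, 0]
target = [0, 0, 0, 1, 3, 1, 0, 0]    # same shape, shifted by two bins

correlation = circular_cross_correlation(reference, target)
best_shift = max(range(len(correlation)), key=lambda s: correlation[s])
norm = math.sqrt(sum(v * v for v in reference) * sum(v * v for v in target))
similarity = correlation[best_shift] / norm   # 1.0 means identical shape
print(f"best shift = {best_shift}, similarity = {similarity:.3f}")
```

The principal components fix the rotation up front, so the FFT only has to search over translations.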
--
※ Origin: 批踢踢實業坊 (ptt.cc)
◆ From: 61.228.41.12