Course name: Information Retrieval
Course type: Required, third year, Department of Library and Information Science
Instructor: 唐牧群
College: College of Liberal Arts
Department: Library and Information Science
Exam date (Y.M.D): 98.12.15
Time limit (minutes): 150
Reward requested: Yes (not granted unless explicitly stated)

Exam questions:

Here is an imaginary database that contains the following 5 documents:

D1: "a dog barks at a cat and it fell from a tree"
D2: "an ant watches ants on the bark of a tree"
D3: "a dog watches another dog barks a cat"
D4: "a dog barks at a cat watches another cat"
D5: "the bark fell from the tree as an ant watches"

(Terms in the stop-word list have been grayed out.)

1a. Calculate the DF (Document Frequency) and IDF (Inverse Document Frequency) weight for each index term (simply use N/n without the logarithm).

1b. Create an inverted file for the database in which each cell contains the TF*IDF weight of each term in each document (treat singulars and plurals as the same word stem).

1c. What will the search results look like for a system based on the Boolean model and for one based on the vector space model? To rank the search results using the vector space model, first calculate relevance scores (inner product without document length normalization) and rank the documents accordingly.

1d. After the initial iteration, the user examines the results and marks D2 and D5 as relevant, and D3 and D4 as non-relevant. Produce the new ranking using Rocchio's method, where α = 1.0, β = 1.0, γ = 1.0.

2. Unlike data retrieval, where perfect precision and recall are guaranteed, information retrieval is more of a trial-and-error process. What are the possible causes of this uncertainty in IR?

3. Define 4 of the following 5 concepts:
a. SIP (Statistically Improbable Phrases)
b. The "explode" function for PubMed search
c. The "subheading" function for PubMed search
d. Relevance feedback
e. "Building block" search strategy

4. From information behavior research we know that users' "information needs" or "search tasks" come in different shapes and sizes. List a few ways (two or more) by which users' information needs or search tasks can be categorized, and comment on how users' needs or tasks might impact information retrieval.

5. How is the interactive view of IR different from the traditional view of IR? What does it consider to be the most crucial component of the IR process? Name one or two search techniques inspired by interactive IR that we have talked about in class.

--
※ Posted from: PTT (ptt.cc)
◆ From: 140.112.180.107
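
For anyone working through question 1, here is a rough Python sketch of the 1a to 1d computations. Several pieces are assumptions and not part of the exam: the stop-word list (the grayed-out marking did not survive in this text), the two-entry stem table, the query "bark tree" (the exam text above does not state one), binary query weights, and Rocchio taken over the centroids (averages) of the marked documents. Treat it as a way to check the arithmetic, not as an answer key.

```python
from collections import Counter

DOCS = {
    "D1": "a dog barks at a cat and it fell from a tree",
    "D2": "an ant watches ants on the bark of a tree",
    "D3": "a dog watches another dog barks a cat",
    "D4": "a dog barks at a cat watches another cat",
    "D5": "the bark fell from the tree as an ant watches",
}

# ASSUMPTION: hypothetical stop-word list; the exam's grayed-out list is not recoverable here.
STOP_WORDS = {"a", "an", "the", "at", "and", "it", "from", "on", "of", "as", "another"}
# ASSUMPTION: minimal stem table so singular and plural forms share one stem.
STEMS = {"ants": "ant", "barks": "bark"}
# ASSUMPTION: hypothetical query; the exam text does not give one.
QUERY = "bark tree"


def tokenize(text):
    """Lowercase, drop stop words, and map plural forms to their stem."""
    return [STEMS.get(w, w) for w in text.lower().split() if w not in STOP_WORDS]


def build_index(docs):
    """Return DF, IDF (plain N/n, no logarithm), and per-document TF*IDF weights."""
    tf = {name: Counter(tokenize(text)) for name, text in docs.items()}
    df = Counter(term for counts in tf.values() for term in counts)
    idf = {term: len(docs) / n for term, n in df.items()}
    weights = {
        name: {term: count * idf[term] for term, count in counts.items()}
        for name, counts in tf.items()
    }
    return df, idf, weights


def boolean_and(query_terms, weights):
    """Boolean model: unranked set of documents containing every query term."""
    return sorted(n for n, vec in weights.items() if all(t in vec for t in query_terms))


def rank(query_vec, weights):
    """Vector space model: inner product, no document length normalization."""
    scores = {
        name: sum(w * vec.get(term, 0.0) for term, w in query_vec.items())
        for name, vec in weights.items()
    }
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


def centroid(names, weights):
    """Average TF*IDF vector of the named documents."""
    acc = {}
    for name in names:
        for term, w in weights[name].items():
            acc[term] = acc.get(term, 0.0) + w
    return {term: w / len(names) for term, w in acc.items()}


def rocchio(query_vec, relevant, non_relevant, weights, alpha=1.0, beta=1.0, gamma=1.0):
    """q' = alpha*q + beta*centroid(relevant) - gamma*centroid(non_relevant)."""
    rel = centroid(relevant, weights)
    non = centroid(non_relevant, weights)
    terms = set(query_vec) | set(rel) | set(non)
    return {
        t: alpha * query_vec.get(t, 0.0) + beta * rel.get(t, 0.0) - gamma * non.get(t, 0.0)
        for t in terms
    }


df, idf, weights = build_index(DOCS)
query_vec = {term: 1.0 for term in tokenize(QUERY)}  # binary query weights (assumption)

print("DF:", dict(df))
print("IDF:", idf)
print("Boolean AND:", boolean_and(query_vec, weights))
initial = rank(query_vec, weights)
print("Initial ranking:", initial)

new_query = rocchio(query_vec, ["D2", "D5"], ["D3", "D4"], weights)
reranked = rank(new_query, weights)
print("After Rocchio:", reranked)
```

Under these assumptions, D2 and D5 move to the top after feedback, followed by D1. Negative query weights are kept as computed here; some implementations clip them to zero instead, which changes the scores of the lower-ranked documents.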
cindy6603: Archived :D 12/15 17:14