Refer to the exhibit.
Click on the calculator icon in the upper left corner. An analyst is searching a corpus of documents for the topic “solid state disk”. In the Exhibit, Table A provides the inverse document frequency for each term across the corpus. Table B provides each term’s frequency in four documents selected from corpus. Which of the four documents is most relevant to the analyst’s search?

A.
Document B
B.
Document A
C.
Document C
D.
Document D
Why B is correct? I am looking for explanation? Thanks
0
0
Because the value of tf/idf should be max. If you calculate it, you can reach the max related to document B.
0
0
That’s correct.
0
0
TF-IDF of the documents as follows;
A: 0.6
B: 0.75
C: 0.6
D: 0.7
B is the most relevant.
4
0
TF-IDF for document A is 0.6 or 0.4 ?
0
0