Introduction - If you have any usage issues, please Google them yourself
TF-IDF is a statistical method to assess the importance of a word for a file set or a corpus of the importance of one of the documents. The importance of a word increases in proportion to the number of times it appears in a file, but it decreases inversely as it appears in the corpus. The various forms of TF-IDF weighting are often used by search engines as a measure or rating of the degree of correlation between a file and a user query