python - Use sklearn to find tfidf features of large text?

Question

The data above comes from the 7303 training samples of the Reuters dataset, with sklearn used to compute the tfidf features. Every value in the result is 0. What is going on? When I take only a small portion of the data, I get correct tfidf results for it.

扔个三星炸死你 · Answer

The all-zero result is probably caused by the precision being too low, or by a min_count setting.

For example, if a word occurs once and the total number of words is 1e9, the corresponding tf is 1e-9, which is small enough to be shown as 0 or ignored.
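The arithmetic in that example can be checked with a few lines of plain Python. This is only a sketch of the precision effect, not of what sklearn does internally: the count and corpus size are the numbers from the example, and the variable names are made up for illustration.

```python
# Numbers from the example above: one occurrence of a word
# in a corpus of 1e9 words (illustrative values, not real data).
count = 1
total_words = 10**9

# Raw term frequency: occurrences divided by total words.
tf = count / total_words

# The value is tiny, but it is not actually zero.
is_nonzero = tf > 0

# Formatted with a fixed number of decimals, it looks like zero.
shown_4dp = f"{tf:.4f}"

# Scientific notation keeps the magnitude visible.
shown_sci = f"{tf:.3e}"

print(is_nonzero, shown_4dp, shown_sci)
```

If the printed matrix looks like all zeros, it may be worth checking whether any stored value is greater than 0 before concluding that the features are really empty.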
