Home  >  Q&A  >  body text

python - Use sklearn to find tfidf features of large text?


The above data is obtained from 7303 training sets from the reuters data set, and sklearn is used to extract the tfidf features. The results obtained are all 0. What's going on?

When I take part of this data, I can get the correct tfidf results for these small parts of the data.

大家讲道理大家讲道理2669 days ago762

reply all(1)I'll reply

  • 扔个三星炸死你

    扔个三星炸死你2017-06-28 09:25:35

    The above code may be caused by your accuracy being too low or min_count

    For example, if the word frequency is 1 and the total number of words is 1e9, the corresponding tf is 1e-9, which is ignored.

    reply
    0
  • Cancelreply