
python - Text feature word extraction algorithm

PHP中文网  2741 days ago  654

All replies (2)

  • PHPz

    PHPz 2017-04-18 09:27:22

    The question is really about Chinese text processing. I'm not an expert, but I have done some research in this area, so here are my thoughts for the asker:
    1 Word processing requires a lexicon. Without one you cannot segment text into words or reduce words to their stems. A lexicon is not something an individual or a small team can build from scratch.
    2 A lexicon resource: http://www.afenxi.com/post/9700
    3 Once you have a lexicon, you still need to narrow down what you want to process. The business side has to "draw some boundaries" and set "rules" so the machine knows how to handle cases with several possible choices or conflicting choices. This is a bit like "machine learning".
    4 How do you teach a machine? You need textbooks and question banks for it to practice on, which here means a matching lexicon plus a large number (N or more) of articles.
    5 I have rambled on without naming many specific tools~~~
    6 The most direct way: go to Zhaopin, search for Chinese language processing positions, and read their skill requirements. That will basically tell you what you need.
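
To make points 1 and 4 concrete, here is a minimal sketch of lexicon-based segmentation followed by feature word ranking over a small set of articles. Everything in it is an assumption for illustration: the tiny `LEXICON`, the function names `segment` and `top_feature_words`, and the choice of forward maximum matching plus TF-IDF, which are common textbook techniques rather than anything the reply above specifies. In practice you would likely use a full lexicon through an existing segmentation library such as jieba.

```python
import math
from collections import Counter

# Hypothetical toy lexicon; a real one would hold many thousands of entries.
LEXICON = {"文本", "特征", "特征词", "提取", "算法", "机器", "学习", "机器学习", "的"}


def segment(text, lexicon=LEXICON):
    """Forward maximum matching: at each position, take the longest lexicon word.

    Characters not covered by the lexicon fall through as single-character tokens.
    """
    max_len = max(len(w) for w in lexicon)
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                words.append(piece)
                i += size
                break
    return words


def top_feature_words(docs, k=3, lexicon=LEXICON):
    """Rank each document's words by TF-IDF and return the top k per document."""
    tokenized = [segment(d, lexicon) for d in docs]
    n = len(tokenized)
    # Document frequency: in how many documents each word appears.
    df = Counter(w for toks in tokenized for w in set(toks))
    ranked = []
    for toks in tokenized:
        tf = Counter(toks)
        # Skip single-character tokens, which are mostly particles or unknowns.
        scores = {
            w: (c / len(toks)) * math.log(n / df[w])
            for w, c in tf.items()
            if len(w) > 1
        }
        ranked.append(sorted(scores, key=lambda w: (-scores[w], w))[:k])
    return ranked


if __name__ == "__main__":
    articles = ["文本特征词提取算法", "机器学习的算法"]
    for article, words in zip(articles, top_feature_words(articles)):
        print(article, "->", words)
```

Words that appear in every article (here "算法") get a TF-IDF score of zero, so they drop to the bottom of the ranking. That is the sense in which a larger set of articles helps the machine tell distinctive words from common ones.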

  • 怪我咯

    怪我咯 2017-04-18 09:27:22

    Effect

    Reference address
