python - 机器学习实战代码中的.split函数与.strip函数疑惑

Question

图1是学习到机器学习实战2.2.1节时，knn.py中需要的所有代码。图二是要处理的数据集合，可以看到有4列，行数很多。我的疑问是图三中33行与34行代码，既然用strip函数处理了每行的数据，那每行的空格和分行符都被...

大家讲道理 · Answer

strip(...) method of builtins.str instance
    S.strip([chars]) -> str
    
    Return a copy of the string S with leading and trailing
    whitespace removed.
    # 首尾去空(包括	
\s, 只要在字串首或者尾部)
    If chars is given and not None, remove characters in chars instead.

split(...) method of builtins.str instance
    S.split(sep=None, maxsplit=-1) -> list of strings
    # 按指定分隔符(定界符)拆分
    Return a list of the words in S, using sep as the
    delimiter string.  If maxsplit is given, at most maxsplit
    splits are done. If sep is not specified or is None, any
    whitespace string is a separator and empty strings are
    removed from the result.

Démonstration d'effet :

In[23]: a_str = " 啦啦	咳咳
少年	我粉你 	"
In[24]: a_str.strip()
Out[24]: '啦啦	咳咳
少年	我粉你'
In[25]: a_str.split("	")
Out[25]: [' 啦啦', '咳咳
少年', '我粉你 ', '']
In[26]: a_str.strip().split("	")
Out[26]: ['啦啦', '咳咳
少年', '我粉你']

迷茫 · Answer

S.strip([chars]) -> str

Return a copy of the string S with leading and trailing
whitespace removed.

ringa_lee · Answer

L'explication de strip est écrite à l'étage
le début et la fin font référence à la tête et à la queue, en laissant le milieu
De plus, je pense que la lecture des données dans tout le livre est trop maladroite, donc Je peux le faire en une seule ligne avec des pandas
pd.read_csv('dataSet.txt', sep='t', header=None)

python - 机器学习实战代码中的.split函数与.strip函数疑惑

répondre à tous(3)je répondrai