Word frequency count for 红楼梦 (Dream of the Red Chamber)

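import logging

import jieba

# Set jieba's logger to INFO level
jieba.setLogLevel(logging.INFO)

# Read the whole novel; the file is expected to be GB18030-encoded
with open('红楼梦.txt', 'r', encoding='gb18030') as f:
    txt = f.read()

# Segment the text into a list of words
words = jieba.lcut(txt)

# Count each word, skipping single-character tokens
counts = {}
for word in words:
    if len(word) == 1:
        continue
    counts[word] = counts.get(word, 0) + 1

# Sort by count in descending order and print the 20 most frequent words
items = sorted(counts.items(), key=lambda x: x[1], reverse=True)
for word, count in items[:20]:
    print('{0:<10}{1:>5}'.format(word, count))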
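
The same counting step can also be written with collections.Counter from the standard library. The following is only a minimal sketch, not part of the original script: the helper name top_words and the sample token list are made up for illustration, and the list stands in for the output of jieba.lcut(txt) so the snippet runs without the novel or jieba installed.

```python
from collections import Counter


def top_words(tokens, n=20):
    """Return the n most frequent tokens that are longer than one character."""
    # Hypothetical helper: single-character tokens are skipped,
    # matching the filter in the script above.
    counts = Counter(t for t in tokens if len(t) > 1)
    return counts.most_common(n)


# Made-up sample tokens standing in for jieba.lcut(txt)
sample = ['宝玉', '的', '黛玉', '宝玉', '了', '贾母', '宝玉', '黛玉']
for word, count in top_words(sample, n=3):
    print('{0:<10}{1:>5}'.format(word, count))
```

With the real text, passing words from the script above as tokens should give the same ranking as the manual dictionary and sort, since most_common also orders by descending count.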

Source: 《傻猫网络日志》 https://www.samool.com/41856.html