实现:
import stringdef str_count(str):'''找出字符串中的中英文、空格、数字、标点符号个数'''count_en = 0count_dg = 0count_sp = 0count_zh = 0count_pu = 0for s in str:# 英文if s in string.ascii_letters:count_en += 1# 数字elif s.isdigit():count_dg += 1# 空格elif s.isspace():count_sp += 1# 中文elif s.isalpha():count_zh += 1# 特殊字符else:count_pu += 1return count_zhdef str_count2(str):for s in str:# 中文字符范围if '\u4e00' <= s <= '\u9fff':print(s, end="\t")
示例:去掉文件中单个汉字词
with open('chs.txt', 'r') as open_file:for line in open_file:if str_count(line) > 1: print(line)
参见:
/xiamoyanyulrq/article/details/81504114