python统计文本文件内单词数量的方法
作者:不吃皮蛋
这篇文章主要介绍了python统计文本文件内单词数量的方法,涉及Python针对文本文件及字符串的相关操作技巧,需要的朋友可以参考下
本文实例讲述了python统计文本文件内单词数量的方法。分享给大家供大家参考。具体实现方法如下:
# count lines, sentences, and words of a text file # set all the counters to zero lines, blanklines, sentences, words = 0, 0, 0, 0 print '-' * 50 try: # use a text file you have, or google for this one ... filename = 'GettysburgAddress.txt' textf = open(filename, 'r') except IOError: print 'Cannot open file %s for reading' % filename import sys sys.exit(0) # reads one line at a time for line in textf: print line, # test lines += 1 if line.startswith('\n'): blanklines += 1 else: # assume that each sentence ends with . or ! or ? # so simply count these characters sentences += line.count('.') + line.count('!') + line.count('?') # create a list of words # use None to split at any whitespace regardless of length # so for instance double space counts as one space tempwords = line.split(None) print tempwords # test # word total count words += len(tempwords) textf.close() print '-' * 50 print "Lines : ", lines print "Blank lines: ", blanklines print "Sentences : ", sentences print "Words : ", words # optional console wait for keypress from msvcrt import getch getch()
希望本文所述对大家的python程序设计有所帮助。