求python脚本，从txt检索出特定字符的行（有很多行，行里面有记录的时间），并从行中抓出对应的时间字符

最后需要算出时间差

推荐答案 2014-02-18

逐行匹配。对于每行可以首先使用find来确定该行中有没有特定字符。如果有，则根据正则表达式从中提取时间字符。

以下举一个例子，假设特定字符串为name，时间格式为xxxx-xx-xx。

def main():
    import re
    time_format = "\d+-\d+-\d+" #时间格式
    special_string = "name" #特定字符串
    pattern = re.compile(time_format)
    txt_content = open("test.txt", "r")
    for line in txt_content:
        l = line.strip()
        if l.find(special_string)>=0:   #如果有特定字符串
            print l                     #打印对应的行
            match =pattern.match(l)     #如果有匹配的时间格式
            if match:
                print match.group()     #打印对应的时间
if __name__ == '__main__':
    main()

样例test.txt为

2014-2-11 Your behavior is causing our name to be dragged through the mud.
2014-2-12 I am so happy to get munificent birthday presents from my friends.
2014-2-13 He's the boss in name only, because I issue all the orders.
2014-2-14 We are happy to give the product our full endorsement.
2014-2-15 His name leaped out at me from the book.
2014-2-16 We'll be happy to help if you need us.

输出结果为

2014-2-11 Your behavior is causing our name to be dragged through the mud.
2014-2-11
2014-2-13 He's the boss in name only, because I issue all the orders.
2014-2-13
2014-2-15 His name leaped out at me from the book.
2014-2-15

追问

21:57:41.059-[DEBUG]-[M1]-[D7]-LigeEND Test Block 1/12 - test start time:2013-12-18 21:57:16 ms - test end time:2013-12-18 21:57:41 ms

需要 D7 1/12 21:57:16 21:57:41

追答

修改一下字符串和时间格式即可。

def main():
    import re
    time_format = "\d+:\d+:\d+ " #时间格式
    special_string1 = "D7" #特定字符串
    special_string2 = "1/12" #特定字符串
    pattern = re.compile(time_format)
    txt_content = open("test.txt", "r")
    for line in txt_content:
        l = line.strip()
        if l.find(special_string1)>=0:      #如果有特定字符串1
            if l.find(special_string2)>=0:  #如果有特定字符串2
                match =pattern.findall(l)   #如果有匹配的时间格式
                if match:
                    for i in match:
                        print i             #打印对应的时间

温馨提示：答案为网友推荐，仅供参考

当前网址：http://00.wendadaohang.com/zd/DjZBnDTeDejrDnZrrZ0.html

其他回答

第1个回答 2014-02-18

你至少应该贴几行你文件的内容吧？
时间什么格式啊？追问

21:57:41.059-[DEBUG]-[M1]-[D7]-LigeEND Test Block 1/12 - test start time:2013-12-18 21:57:16 ms - test end time:2013-12-18 21:57:41 ms

需要 D7 1/12 21:57:16 21:57:41

追答>>> s = '21:57:41.059-[DEBUG]-[M1]-[D7]-LigeEND Test Block 1/12 - test start time:2013-12-18 21:57:16 ms - test end time:2013-12-18 21:57:41 ms'
>>> import re
>>> pat = re.compile('[\d:.]+-(?:\[([^\]]+)\]-)+\D+(\d+\/\d+)\D+[\d-]+ ([\d:]+)\D+[\d-]+ ([\d:]+)')
>>> m = pat.findall(s)
>>> m
[('D7', '1/12', '21:57:16', '21:57:41')]
>>> t1 = datetime.datetime.strptime(m[0][2], '%H:%M:%S')
>>> t2 = datetime.datetime.strptime(m[0][3], '%H:%M:%S')
>>> print t2 - t1
0:00:25

上面的可以一次提取出来，并计算出时间

追问

怎么先从TXT中抓出很多类似行的 s = '21:57:41.059-[DEBUG]-[M1]-[D7]-LigeEND Test Block 1/12 - test start time:2013-12-18 21:57:16 ms - test end time:2013-12-18 21:57:41 ms'

追答def processText():
f = open("1.txt", "r") #把1.txt改成你文件的名字
for s in f:
pat = re.compile('[\d:.]+-(?:\[([^\]]+)\]-)+\D+(\d+\/\d+)\D+[\d-]+ ([\d:]+)\D+[\d-]+ ([\d:]+)')
m = pat.findall(s)
print m
t1 = datetime.datetime.strptime(m[0][2], '%H:%M:%S')
t2 = datetime.datetime.strptime(m[0][3], '%H:%M:%S')
print t2 - t1

本回答被提问者采纳

相似回答

Python实用技巧大学生来看答：1、all or any Python语言如此流行的众多原因之一，是因为它具有很好的可读性和表现力。人们经常开玩笑说Python是可执行的伪代码。当你可以像这样写代码时，就很难反驳。2、bash plot lib 你有没有想过在控制台中绘制图形吗?Bash plot lib是一个Python库，他能够帮助我们在命令行(粗旷的环境...

Python中怎么用爬虫爬答：拉勾网、智联：爬取各类职位信息，分析各行业人才需求情况及薪资水平。雪球网：抓取雪球高回报用户的行为，对股票市场进行分析和预测。爬虫是入门Python最好的方式，没有之一。Python有很多应用的方向，比如后台开发、web开发、科学计算等等，但爬虫对于初学者而言更友好，原理简单，几行代码就能实现基本的爬虫...

Python爬虫是什么?答：网络爬虫为一个自动提取网页的程序，它为搜索引擎从万维网上下载网页，是搜索引擎的重要组成。传统爬虫从一个或若干初始网页的URL开始，获得初始网页上的URL，在抓取网页的过程中，不断从当前页面上抽取新的URL放入队列,直到满足系统的一定停止条件。将根据一定的搜索策略从队列中选择下一步要抓取的网页URL...

想用python编写一添加标志位小程序很简单但是我不会,望高手指点编写出...答：●绝对有用的20条电脑使用超级技巧●高手新手都适用的137个技巧●键盘上每个键的作用●教你怎样抓图 ●释放C盘空间的技巧●一键恢复及重装系统步骤●帮你把电脑调到最佳状态●电脑问题解答 ●电脑高手常用的组合键●了解密码破解原理，确保QQ相册安全●电脑小技巧70个●四个你不知道的QQ绝密技巧！●介绍用...

大家正在搜

python截取特定字符前的字符 python找出字符串的重复字符 python取字符串之间的字符 python从字符串中提取字符 python字符串去掉指定字符 python查找字符串中某个字符 python取字符串的第几个字符 python字符串添加字符 python去掉特定字符串