python 正则表达式 提取一段信息中的段?

想要提取 data-srcset="https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/200,/0/default.jpg 200w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/400,/0/default.jpg 400w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/600,/0/default.jpg 600w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/800,/0/default.jpg 800w" 中最后一段网址

自己写的正则表达式为:
re.search("https\:\/\/www\.artic\.edu\/iiif\/2\/[\s\S]*\-[\s\S]*\-[\s\S]*\-[\s\S]*\-[\s\S]*\/full\/800\,\/0\/default\.jpg", str(response.css('img::attr(data-srcset)').extract_first()))

提取到的信息为:
https://www.artic.edu/iiif/2/d02e0079-8e82-733e-683c-cb83a387ee5e/full/200,/0/default.jpg 200w,
https://www.artic.edu/iiif/2/d02e0079-8e82-733e-683c-cb83a387ee5e/full/400,/0/default.jpg 400w,
https://www.artic.edu/iiif/2/d02e0079-8e82-733e-683c-cb83a387ee5e/full/600,/0/default.jpg 600w,
https://www.artic.edu/iiif/2/d02e0079-8e82-733e-683c-cb83a387ee5e/full/800,/0/default.jpg

如何只提取最后一段?

第1个回答  2020-05-22

正则如下:

str = 'data-srcset="https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/200,/0/default.jpg 200w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/400,/0/default.jpg 400w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/600,/0/default.jpg 600w, https://www.artic.edu/iiif/2/0c266d91-4d1a-9329-aa1b-6d1a64a35b15/full/800,/0/default.jpg 800w"'

finall_url = re.findall('600w,(.+)"$',str)

本回答被提问者采纳