Basic usage of the regex function re.findall (returns every substring of string that matches pattern, as a list)
Syntax:
findall(pattern, string, flags=0)
import re
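Python's re.findall scans the string from left to right and returns all non-overlapping matches as a list of strings
help(re.findall)        # help() prints the built-in documentation itself
print(dir(re.findall))  # lists the attributes of the function object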
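findall finds every match. The r prefix marks a raw string literal, so backslashes in the pattern reach the regex engine unchanged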
regular_v1 = re.findall(r"docs","https://docs./3/whatsnew/3.6.html")
print (regular_v1)
# ['docs']
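The shape of the returned list can be sketched with a small example. The sample string text below is made up for illustration and is not from the original; the sketch shows that matches do not overlap, that no match gives an empty list, and that a pattern with one capture group returns only the group's text.

```python
import re

# Hypothetical sample string, chosen only for illustration.
text = "aaaa docs docs"

# Matches are non-overlapping and returned left to right.
pairs = re.findall(r"aa", text)
print(pairs)    # ['aa', 'aa']

# No match gives an empty list, not None.
missing = re.findall(r"xyz", text)
print(missing)  # []

# With one capture group, findall returns the group contents only.
grouped = re.findall(r"d(oc)s", text)
print(grouped)  # ['oc', 'oc']
```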
The ^ symbol anchors the pattern to the start of the string, so ^https matches only when the string begins with https
regular_v2 = re.findall(r"^https","https://docs./3/whatsnew/3.6.html")
print (regular_v2)
# ['https']
The $ symbol anchors the pattern to the end of the string, so html$ matches only when the string ends with html
regular_v3 = re.findall(r"html$","https://docs./3/whatsnew/3.6.html")
print (regular_v3)
# ['html']
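The flags argument from the syntax line changes how these anchors behave. A minimal sketch, assuming a made-up two-line string (the example hosts are placeholders, not from the original): by default ^ and $ apply to the whole string, while re.MULTILINE applies them to each line.

```python
import re

# Hypothetical two-line sample; the hosts are placeholders.
text = "https://a.example/index.html\nHTTPS://b.example/page.html"

# Without flags, ^ only matches at the very start of the string.
starts = re.findall(r"^https", text)
print(starts)      # ['https']

# re.MULTILINE lets ^ and $ match at every line boundary;
# re.IGNORECASE makes the comparison case-insensitive.
starts_all = re.findall(r"^https", text, flags=re.MULTILINE | re.IGNORECASE)
print(starts_all)  # ['https', 'HTTPS']

ends_all = re.findall(r"html$", text, flags=re.MULTILINE)
print(ends_all)    # ['html', 'html']
```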
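[...] matches any single character listed inside the brackets. A comma inside the brackets is matched literally, so [t,w] below matches t, a comma, or w; [tw] is enough to match t or w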
regular_v4 = re.findall(r"[t,w]h","https://docs./3/whatsnew/3.6.html")
print (regular_v4)
# ['th', 'wh']
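The effect of a comma inside a character class can be checked with a short sketch. The sample string is an assumption chosen so that the difference is visible.

```python
import re

# Hypothetical sample containing a comma directly before an "h".
text = "a,h th wh"

# The comma is part of the class, so ",h" also matches.
with_comma = re.findall(r"[t,w]h", text)
print(with_comma)     # [',h', 'th', 'wh']

# Listing only the wanted characters avoids the stray match.
without_comma = re.findall(r"[tw]h", text)
print(without_comma)  # ['th', 'wh']
```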
"\d" is the regex token for a single digit from 0 to 9; findall returns each matched digit as a list element
regular_v5 = re.findall(r"\d","https://docs./3/whatsnew/3.6.html")
regular_v6 = re.findall(r"\d\d\d","https://docs./3/whatsnew/3.6.html/1234")
print (regular_v5)
# ['3', '3', '6']
print (regular_v6)
# ['123']
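Repeating \d by hand works, but quantifiers say the same thing more compactly. A minimal sketch on a made-up string (not from the original): + means one or more, and {3} means exactly three.

```python
import re

# Hypothetical sample string.
text = "release 3.6 build 1234"

# One list element per digit.
single = re.findall(r"\d", text)
print(single)  # ['3', '6', '1', '2', '3', '4']

# \d+ groups each run of consecutive digits into one match.
runs = re.findall(r"\d+", text)
print(runs)    # ['3', '6', '1234']

# \d{3} is equivalent to \d\d\d; the leftover "4" is too short to match.
triple = re.findall(r"\d{3}", text)
print(triple)  # ['123']
```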
Lowercase \d matches the digits 0-9; uppercase \D is its opposite and matches every character that is not a digit
regular_v7 = re.findall(r"\D","https://docs./3/whatsnew/3.6.html")
print (regular_v7)
# ['h', 't', 't', 'p', 's', ':', '/', '/', 'd', 'o', 'c', 's', '.', 'p', 'y', 't', 'h', 'o', 'n', '.', 'o', 'r', 'g', '/', '/', 'w', 'h', 'a', 't', 's', 'n', 'e', 'w', '/', '.', '.', 'h', 't', 'm', 'l']
"\w" matches a word character: lowercase a to z, uppercase A to Z, the digits 0 to 9, and the underscore
regular_v8 = re.findall(r"\w","https://docs./3/whatsnew/3.6.html")
print (regular_v8)
#['h', 't', 't', 'p', 's', 'd', 'o', 'c', 's', 'p', 'y', 't', 'h', 'o', 'n', 'o', 'r', 'g', '3', 'w', 'h', 'a', 't', 's', 'n', 'e', 'w', '3', '6', 'h', 't', 'm', 'l']
"\W" matches any character that is not a word character, such as punctuation and other symbols
regular_v9 = re.findall(r"\W","https://docs./3/whatsnew/3.6.html")
print (regular_v9)
# [':', '/', '/', '.', '.', '/', '/', '/', '.', '.']
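Combined with the + quantifier, these classes return whole tokens instead of single characters. A short sketch, assuming a made-up path-like string; note that the underscore counts as a word character.

```python
import re

# Hypothetical sample string.
text = "whats_new/3.6.html"

# \w+ collects runs of word characters (letters, digits, underscore).
words = re.findall(r"\w+", text)
print(words)       # ['whats_new', '3', '6', 'html']

# \W returns each separator character on its own.
separators = re.findall(r"\W", text)
print(separators)  # ['/', '.', '.']

# \D+ collects runs of anything that is not a digit.
non_digits = re.findall(r"\D+", text)
print(non_digits)  # ['whats_new/', '.', '.html']
```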