python 中正则表达式用法

时间:2025-02-11 14:47:18
符 号 表 示 以 h t m l 结 尾 的 字 符 串 返 回 , 判 断 是 否 字 符 串 结 束 的 字 符 串 &lt; / p r e &gt; &lt; d i v c l a s s = &quot; c n b l o g s H i g h l i g h t e r s h − g u t t e r &quot; &gt; &lt; d i v &gt; &lt; d i v c l a s s = &quot; s y n t a x h i g h l i g h t e r p y t h o n &quot; &gt; &lt; d i v c l a s s = &quot; t a b l e − b o x &quot; &gt; &lt; t a b l e c e l l s p a c i n g = &quot; 0 &quot; c e l l p a d d i n g = &quot; 0 &quot; b o r d e r = &quot; 0 &quot; &gt; &lt; t b o d y &gt; &lt; t r &gt; &lt; t d c l a s s = &quot; g u t t e r &quot; &gt; &lt; d i v c l a s s = &quot; l i n e n u m b e r 1 i n d e x 0 a l t 2 &quot; &gt; 1 &lt; / d i v &gt; &lt; d i v c l a s s = &quot; l i n e n u m b e r 2 i n d e x 1 a l t 1 &quot; &gt; 2 &lt; / d i v &gt; &lt; d i v c l a s s = &quot; l i n e n u m b e r 3 i n d e x 2 a l t 2 &quot; &gt; 3 &lt; / d i v &gt; &lt; / t d &gt; &lt; t d c l a s s = &quot; c o d e &quot; &gt; &lt; d i v &gt; &lt; d i v c l a s s = &quot; l i n e n u m b e r 1 i n d e x 0 a l t 2 &quot; &gt; &lt; c o d e c l a s s = &quot; p y t h o n p l a i n &quot; &gt; r e g u l a r v 3 &lt; / c o d e &gt; &lt; c o d e c l a s s = &quot; p y t h o n k e y w o r d &quot; &gt; = &lt; / c o d e &gt; &lt; c o d e c l a s s = &quot; p y t h o n p l a i n &quot; &gt; r e . f i n d a l l ( r &lt; / c o d e &gt; &lt; c o d e c l a s s = &quot; p y t h o n s t r i n g &quot; &gt; &quot; h t m l 符号表示以html结尾的字符串返回,判断是否字符串结束的字符串&lt;/pre&gt;&lt;div class=&quot;cnblogs_Highlighter sh-gutter&quot;&gt;&lt;div&gt;&lt;div class=&quot;syntaxhighlighter python&quot;&gt;&lt;div class=&quot;table-box&quot;&gt;&lt;table cellspacing=&quot;0&quot; cellpadding=&quot;0&quot; border=&quot;0&quot;&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class=&quot;gutter&quot;&gt;&lt;div class=&quot;line number1 index0 alt2&quot;&gt;1&lt;/div&gt;&lt;div class=&quot;line number2 index1 alt1&quot;&gt;2&lt;/div&gt;&lt;div class=&quot;line number3 index2 alt2&quot;&gt;3&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;code&quot;&gt;&lt;div&gt;&lt;div class=&quot;line number1 index0 alt2&quot;&gt;&lt;code class=&quot;python plain&quot;&gt;regular_v3 &lt;/code&gt;&lt;code class=&quot;python keyword&quot;&gt;=&lt;/code&gt; &lt;code class=&quot;python plain&quot;&gt;(r&lt;/code&gt;&lt;code class=&quot;python string&quot;&gt;&quot;html html,</pre><divclass="cnblogsHighlightershgutter"><div><divclass="syntaxhighlighterpython"><divclass="tablebox"><tablecellspacing="0"cellpadding="0"border="0"><tbody><tr><tdclass="gutter"><divclass="linenumber1index0alt2">1</div><divclass="linenumber2index1alt1">2</div><divclass="linenumber3index2alt2">3</div></td><tdclass="code"><div><divclass="linenumber1index0alt2"><codeclass="pythonplain">regularv3</code><codeclass="pythonkeyword">=</code><codeclass="pythonplain">re.findall(r</code><codeclass="pythonstring">"html",/3/whatsnew/3.)
print (regular_v3)
# [‘html’]
# […]匹配括号中的其中一个字符
1
2
3
regular_v4 = (r "[t,w]h" , /3/whatsnew/3. )
print (regular_v4)
# [‘th’, ‘wh’]
“d”是正则语法规则用来匹配0到9之间的数返回列表
1
2
3
4
5
6
regular_v5 = (r "\d" , /3/whatsnew/3. )
regular_v6 = (r "\d\d\d" , /3/whatsnew/3./1234 )
print (regular_v5)
# [‘3’, ‘3’, ‘6’]
print (regular_v6)
# [‘123’]
小d表示取数字0-9,大D表示不要数字,也就是出了数字以外的内容返回
1
2
3
regular_v7 = (r "\D" , /3/whatsnew/3. )
print (regular_v7)
# [‘h’, ‘t’, ‘t’, ‘p’, ‘s’, ‘:’, ‘/’, ‘/’, ‘d’, ‘o’, ‘c’, ‘s’, ‘.’, ‘p’, ‘y’, ‘t’, ‘h’, ‘o’, ‘n’, ‘.’, ‘o’, ‘r’, ‘g’, ‘/’, ‘/’, ‘w’, ‘h’, ‘a’, ‘t’, ‘s’, ‘n’, ‘e’, ‘w’, ‘/’, ‘.’, ‘.’, ‘h’, ‘t’, ‘m’, ‘l’]
“w”在正则里面代表匹配从小写a到z,大写A到Z,数字0到9
1
2
3
regular_v8 = (r "\w" , /3/whatsnew/3. )
print (regular_v8)
#[‘h’, ‘t’, ‘t’, ‘p’, ‘s’, ‘d’, ‘o’, ‘c’, ‘s’, ‘p’, ‘y’, ‘t’, ‘h’, ‘o’, ‘n’, ‘o’, ‘r’, ‘g’, ‘3’, ‘w’, ‘h’, ‘a’, ‘t’, ‘s’, ‘n’, ‘e’, ‘w’, ‘3’, ‘6’, ‘h’, ‘t’, ‘m’, ‘l’]

“W”在正则里面代表匹配除了字母与数字以外的特殊符号
1
2
3
regular_v9 = (r "\W" , /3/whatsnew/3. )
print (regular_v9)
# [’:’, ‘/’, ‘/’, ‘.’, ‘.’, ‘/’, ‘/’, ‘/’, ‘.’, ‘.’]