benzhou 发表于 2017-5-19 12:09:32

perl正则表达式中各种字符集的整理

UTF8

[ - ]
CODE:
||{2}|{3}


UTF16

[ - ]
CODE:
|{2}


JIS

[ - ]
CODE:
||{2}


SJIS

[ - ]
CODE:
||(|)(|)


EUC_JP

[ - ]
CODE:
|/x81||/x8f{2}


EUC_JP标点符号及特殊字符

[ - ]
CODE:



EUC_JP全角数字

[ - ]
CODE:
/xa3


EUC_JP全角大写英文

[ - ]
CODE:
/xa3


EUC_JP全角小写英文

[ - ]
CODE:
/xa3


EUC_JP全角平假名

[ - ]
CODE:
/xa4


EUC_JP全角片假名 2007-03-12 15:00更新

[ - ]
CODE:
/xa3|/xa3|/xa5||


EUC_JP全角汉字    2007-03-12 15:06更新

[ - ]
CODE:
||||||


Big5

[ - ]
CODE:
|(|)


GBK

[ - ]
CODE:
|


GB2312汉字

[ - ]
CODE:



GB2312半角标点符号及特殊符号

[ - ]
CODE:
/xa1


GB2312罗马数组及项目序号

[ - ]
CODE:
/xa2(|||||)


GB2312全角标点及全角字母

[ - ]
CODE:
/xa3


GB2312日文平假名

[ - ]
CODE:
/xa4


GB2312日文片假名

[ - ]
CODE:
/xa5


補充:
GB18030

[ - ]
CODE:
||


2007-03-12 21:35 补充
日文半角空格

[ - ]
CODE:
/x20


SJIS全角空格

[ - ]
CODE:
(?:/x81/x81)


SJIS全角数字

[ - ]
CODE:
(?:/x82)


SJIS全角大写英文

[ - ]
CODE:
(?:/x82)


SJIS全角小写英文

[ - ]
CODE:
(?:/x82)


SJIS全角平假名

[ - ]
CODE:
(?:/x82)


SJIS全角平假名扩展

[ - ]
CODE:
(?:/x82|/x81)


SJIS全角片假名

[ - ]
CODE:
(?:/x83)


SJIS全角片假名扩展

[ - ]
CODE:
(?:/x83|/x81)


EUC_JP全角空格

[ - ]
CODE:
(?:/xa1/xa1)


EUC半角片假名

[ - ]
CODE:
(?:/x8e)
页: [1]
查看完整版本: perl正则表达式中各种字符集的整理