Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
800 views
in Technique[技术] by (71.8m points)

regex - Ignore escaped double quote characters swift

I am trying to validate a phone number using NSPredicate and regex. The only problem is when setting the regex Swift thinks that I am trying to escape part of it due to the backslashes. How can I get around this?

My code is as follows:

let phoneRegEx = "^(((?0d{4})?s?d{3}s?d{3})|((?0d{3})?s?d{3}s?d{4})|((?0d{2})?s?d{4}s?d{4}))(s?#(d{4}|d{3}))?$"
See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

In Swift regular string literals, you need to double-escape the slashes to define literal backslashes:

let phoneRegEx = "^((\(?0\d{4}\)?\s?\d{3}\s?\d{3})|(\(?0\d{3}\)?\s?\d{3}\s?\d{4})|(\(?0\d{2}\)?\s???\d{4}\s?\d{4}))(\s?#(\d{4}|\d{3}))?$"

Starting from Swift 5, you can use raw string literals and escape regex escapes with a single backslash:

let phoneRegEx = #"^(((?0d{4})?s?d{3}s?d{3})|((?0d{3})?s?d{3}s?d{4})|((?0d{2})?s??d{4}s?d{4}))(s?#(d{4}|d{3}))?$"#

Please refer to the Regular Expression Metacharacters table on the ICU Regular Expressions page to see what regex escapes should be escaped this way.

Please mind the difference between the regex escapes (in the above table) and string literal escape sequences used in the regular string literals that you may check, say, at Special Characters in String Literals:

String literals can include the following special characters:

  • The escaped special characters (null character), \ (backslash), (horizontal tab), (line feed), (carriage return), " (double quotation mark) and ' (single quotation mark)
  • An arbitrary Unicode scalar value, written as u{n}, where n is a 1–8 digit hexadecimal number (Unicode is discussed in Unicode below)

So, in regular string literals, """ is a " string written as a string literal, and you do not have to escape a double quotation mark for the regex engine, so """ string literal regex pattern is enough to match a " char in a string. However, "\"", a string literal repesenting " literal string will also match " char, although you can already see how redundant this regex pattern is. Also, " " (an LF symbol) matches a newline in the same way as "\n" does, as " " is a literal representation of the newline char and "\n" is a regex escape defined in the ICU regex escape table.

In raw string literals, is just a literal backslash.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...