Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
428 views
in Technique[技术] by (71.8m points)

.net - Regular expression to select repeating groups

I have a series of grouped values that follow a specific format and would like to use a single expression to capture them into groups. For example, I have -group1 -group2 -group3 and am trying to use something similar to (-[sS]{1,}?) This is basically allowing me to capture the entire string into a single group but I'd like to be able to backreference each of the values separately. I figured the ? would force it to be non-greedy and, therefore, split the pattern match into three separate groups (for example). For now I am simply repeating the reference (-[sS]*?) but it seems there should be a more elegant expression.
Thanks!

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

You are in luck because C# is one of the few languages (if not the only one) that supports subexpression captures

https://msdn.microsoft.com/en-us/library/system.text.regularexpressions.capture(v=vs.110)

The .NET API can be looked at as follows

 Matches
     Groups (most regex engines stop here)
         Captures (unique for .NET)

It's not clear from your question what you want to match exactly but this should get you started. Ask again if you are stuck.

  string input = "-group1 -group2 ";
  string pattern = @"(-S*W){2}";
  foreach (Match match in Regex.Matches(input, pattern))
  {
     Console.WriteLine("Match: {0}", match.Value);
     for (int groupCtr = 0; groupCtr < match.Groups.Count; groupCtr++)
     {
        Group group = match.Groups[groupCtr];
        Console.WriteLine("   Group {0}: {1}", groupCtr, group.Value);
        for (int captureCtr = 0; captureCtr < group.Captures.Count; captureCtr++)
           Console.WriteLine("      Capture {0}: {1}", captureCtr, 
                             group.Captures[captureCtr].Value);
     }                      
  } 

This ouputs

Match: -group1 -group2 
   Group 0: -group1 -group2 
      Capture 0: -group1 -group2 
   Group 1: -group2 
      Capture 0: -group1 
      Capture 1: -group2 

As you can see (Group 1, Capture 0) and (Group 1, Capture 1) offer the individual captures of a group (and not the last as in most languages)

This address I think of what you describe as "to be able to backreference each of the values separately"

(You use the term backreference but I don't think you are aiming for a replacement pattern right?)


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...