PHP - preg_match_all - a little advenced
Asked Answered
A

1

6

I need to find specific part of text in string. That text need to have:

  • 12 characters (letters and numbers only)
  • whole string must contains at least 3 digits
  • 3*4 characters with spaces (ex. K9X6 6GM6 LM11)
  • every block from example above must contains at least 1 number
  • words like this, line, spod shouldn't be recognized

enter image description here

So I ended with this code:

preg_match_all("/(?<!\S)(?i:[a-z\d]{4}|[a-z\d]{12})(?!\S)/", $input_lines, $output_array); But it won't works for all of requirements. Of course I can use preg_repace or str_replace and remove all (!,?,#) and in a loop count numbers if there are 4 or more but I wonder if it is possible to do with preg_match_all...

Here is a string to search in:

?K9X6 6GM6 LM11  // not recognized - but it should be
!K9X6 6GM6 LM11  // not recognized - but it should be
K0X6 0GM7 LM12! // not recognized - but it should be
K1X6 1GM8 LM13@ // not recognized - but it should be
K2X6 2GM9 LM14? // not recognized - but it should be
K3X6 3GM0 LM15# // not recognized - but it should be
K4X6 4GM1 LM16* // not recognized - but it should be
K5X65GM2LM17
bla bla bla
this shouldn't be visible
spod also shouldn't be visible
but line below should be!!
K9X66GM6LM11! (see that "!" at the end? Help me with this)

Correct preg_match_all should returns this:

K9X6
6GM6
LM11
K9X6
6GM6
LM11
K0X6
0GM7
LM12
K1X6
1GM8
LM13
K2X6
2GM9
LM14
K3X6
3GM0
LM15
K4X6
4GM1
LM16
K5X65GM2LM17
K9X66GM6LM11

working example: http://www.phpliveregex.com/p/bHX

Angelia answered 27/6, 2015 at 9:50 Comment(3)
"every block from example above must contains at least 1 number" and "3*4 characters" implies "whole string must contains at least 3 numbers" :)Fairground
OK, maybe I can't exmplain it in words, so I've just updated my question and add picture to explain better what I needAngelia
When there are no spaces, does the block logic still apply? In other words, is ABCDE123FGHI a valid input?Coimbatore
C
2

The following should do the trick:

\b(?:(?=.{0,3}?\d)[A-Za-z\d]{4}\s??){3}\b

Demo

  • [A-Za-z\d]{4} matches 4 letters/digits
  • (?=.{0,3}?\d) checks there's a digit in these 4 characters
  • \s?? matches a whitespace character, but tries not to match it if possible
  • \b makes sure everything isn't contained in a larger word

Note that this will allow strings like K2X6 2GM9LM14, I'm not sure whether you want these to match or not.

Coimbatore answered 27/6, 2015 at 12:8 Comment(4)
Yea, that's it. Strings like K2X6 2GM9LM14 is also valid so thanks a lot!Angelia
one more question. Now it acepts only spaces, if i want to change \s?? to allow whitespace and also other separatos like "_", "-" (dash and minus) - how to modify that regex?Angelia
@Angelia change \s?? to [\s_-]??. You have to put the - dash either at the beginning or the end of the [...] character set.Coimbatore
thank you. //some unimportant text to reach the limitAngelia

© 2022 - 2024 — McMap. All rights reserved.