How should one proceed to prove (or find) if two regular expressions are same or equivalent?

Asked 22/1, 2012 at 14:8 Answered 22/8, 2016 at 4:57

Solved regex finite-automata regular-language equivalence

For example, in an assignment given to me, we were asked to find out if two regular expressions are equal or not.

(a+b+c)*  and ((ab)**c*)*

My question is how is one supposed to do that? If I draw the transition graphs for both and then run a few strings through it and show that both of the TGs are able to accept it, is that a sufficient proof ? If not, how do I do it? Is there a mathematical/axiomatic approach towards this?

Thanks in advance.

EDIT: There is another thing that I'd like to clear which is kind of related to this question. Are the two FAs depicted in the photo below the same?

enter image description here

i.e. Are (1) and (2) in the above picture the same?

Radiate answered 22/1, 2012 at 14:8 Comment(4)

Why there are spaces in the expression? – Gastropod 22/1, 2012 at 14:16

By hand, I assume? It's well-known how to do it programmatically, but I assume (I don't know the relevant algorithms very well) it's rather complicated to do the whole procedure by hand. – Weldon 22/1, 2012 at 14:20

( ( a b )* * c* )* is wrong! – Gastropod 22/1, 2012 at 14:23

@Shiplu Thanks for your response. Yes, the spaces were redundant. I've edited the question. – Radiate 22/1, 2012 at 15:38

There is an algorithm to determine whether they are equal:

Construct NFA-lambdas corresponding to each RE using Kleene's theorem

Construct DFAs for each using the subset/powerset construction

(optional) Minimize the DFAs using a standard DFA minimization algorithm.

Construct DFAs for L(M1) \ L(M2) and L(M2) \ L(M1) using the Cartesian Product Machine construction

(Optional) Minimize these CPMs.

Determine whether each one accepts any strings by testing all strings over alphabet E of size no greater than |Q| (works due to the pumping lemma for regular languages)

No novelty or genius is required; you could write a program to do this (although, in practice, using the powerset construction can be unwieldy, and failing to minimize at both steps can be costly).

EDIT: Yes, those DFAs are the same. The first is just a shorthand notation for the second.

Nonpros answered 22/1, 2012 at 15:51 Comment(1)

If the minimal DFA is unique, why do we need steps 4 - 6? Is it insufficient to construct the minimal DFA's and compare them? – Keenakeenan 23/9, 2020 at 18:58

Two regular expressions R and T are equivalent if the language defined by R (i.e., the set of strings generated by regular expression R) is equal to the language defined by T. To prove equivalences for regular expressions, we use containment proofs from set theory. That is, if S1 is the set of strings generated by regular expression R, and S2 is the set of strings generated by regular expression T, we must prove that S1 ⊆ S2 and S2 ⊆ S1. Both directions are necessary to prove equality of sets.

-- From the lecture notes of CSc 4340 GSU Fall 09 (Dr. Raj Sunderraman)

Shelving answered 22/1, 2012 at 14:39 Comment(1)

Right, but since the size of both of these languages is [countably] infinite, this doesn't help prove anything. – Diannediannne 23/1, 2012 at 18:45

Assuming

Spaces are inserted for illustration
( ( a b )* * c* )* is actually ((ab)*c*)**,
Each pattern is wrapped by ^ and $.

Those regular expressions are NOT same.

abccabcc will not match (a+b+c)* but will match ((ab)*c*)*

How did I find this?

When I closely looked at those patterns, I found 2 things.

First one accepts more than 1 of a and b {1,}. So there will be always a sequence of a and sequence of b side by side. like aaaabb, aabbbbb, etc. But in the second pattern a and be will be side by side with single instance. like ab, ababab, abababab, etc.
c will appear only 1 time following sequence of a and sequence of b. But in second pattern c can appear as many times as it can.

Gastropod answered 22/1, 2012 at 14:22 Comment(0)

They are different, which is easy to tell by the quantifiers. For the first expression to match anything, it must contain a c. The second can obviously do without a c. (There are many more differences, but that should get you started).

Ramos answered 22/1, 2012 at 14:36 Comment(0)

-3

((ab)^^c^)^=( a^b^c^)^ = (a+b+c)^

Polygyny answered 22/8, 2016 at 4:57 Comment(1)

Please try to avoid just dumping code as an answer and try to explain what it does and why. Your code might not be obvious for people who do not have the relevant coding experience. – Hellfire 22/8, 2016 at 5:18

-6

Since this is homework, I won't give you the complete answer, but I will tell you the key fact you need to know: for a given finite state language, the DFA which recognizes it with the minimum number of states is unique.

BTW, I don't believe that your professor would assign this homework without teaching you how to do it. Get off the internet and read your lecture notes and/or textbook.

Rorke answered 22/1, 2012 at 16:1 Comment(0)

Recommended topics

Hot tags