Is there a tool (or an algorithm) to convert a finite state machine into a regular expression?
(not the other way around, that would be easy).
Is there a tool (or an algorithm) to convert a finite state machine into a regular expression?
(not the other way around, that would be easy).
There are several algorithms to perform this task: the "state-elimination method" from Brzozowski and Mc Cluskey, the resolution of a system of linear equation, the method from McNaughton and Yamada, etc. They are very well described in Automata and rational expressions by Jacques Sakarovitch.
The state-elimination method in particular is simple to understand. The key idea is that you are going to build an automaton labeled by rational (aka regular) expressions rather than letters. First, make sure you have a single initial state and a single final state (you may add fresh states and spontaneous transitions if necessary). Then choose a state s to eliminate, say state 1 in the following picture.
Then consider all the couples (p, q) where p is a predecessor (states from which a transition reaches s, 0 in the picture) and q a successor (state 2). For each such couple (p, q) add a transition from p to q which is labeled by E(p, q) + E(p, s)E(s, s)*E(s, q) where E(p, s) means "the expression that labels the transition from p to s. Once you treated all the couple (p, q), remove the state s. In the previous example:
Do that until you eliminated all the inner states (i.e., keep the initial state and the final state), and just read the result on the transition from the initial state to the final state (here d+ab*c).
You may toy with this algorithm using Vcsn, a tool for rational expressions and automata. Here is a complete example you may reproduce at Vcsn Sandbox.
I believe the best tool I have used is greenery
. It is a FSM/regex conversion library for python. You can read more about the library here and the algorithm used is well described here.
The model that can be found on the website can be converted like this:
from greenery import fsm, lego
A, B, C, D, E = range(5)
a, b = 'a', 'b'
# create the FSM
machine = fsm.fsm(
alphabet = {a, b},
states = {A, B, C, D, E},
initial = A,
finals = {C, E},
map = {
A : {a: B, b: D},
B : {a: C, b: E},
C : {a: C, b: E},
D : {a: B, b: D},
E : {a: B, b: D}
},
)
# convert it to regex
rex = lego.from_fsm(machine)
The output is the following:
>>> print(machine)
name final? a b
------------------
* 0 False 1 3
1 False 2 4
2 True 2 4
3 False 1 3
4 True 1 3
>>> print(rex)
b*a((a*(a|b+))?ba)*(a+b?|b)
The version on PYPI has some problem with assertions, and the github version has some problems with the memory, but the combination python 3.x + github version is awesome.
You can use fsm2regex, an online tool that does the job.
I have implemented the state elimination algorithm for the http://www.brics.dk/automaton/ Java package. The implementation is based on the algorithm illustrated in Sipser, Michael. Introduction to the Theory of Computation. Vol. 2. Boston: Thomson Course Technology, 2006.
You can check it out at https://github.com/julianthome/autorex. Would be happy to get some feedback.
dk.brics.Automaton
it works out of the box. You can find some example test-cases here: github.com/julianthome/autorex/blob/master/src/test/java/org/…. In case you have a custom data-structure, you could translate it into an GNFA (github.com/julianthome/autorex/blob/master/src/main/java/org/…) which can be passed on to the StateEliminator (github.com/julianthome/autorex/blob/master/src/main/java/org/…). –
Trantrance You don't specify what you're doing, but you might want to know there is a tool called Ragel that specializes in FSMs. It generates code for a slew of languages, and when I looked a few years ago, it wasn't too hard to port the machines to other languages.
© 2022 - 2024 — McMap. All rights reserved.