I'm trying to find out what the algorithm would be by being given two languages L1 and L2 to determine if they are equivalent (L1 = L2). It's surprisingly difficult to come up with one as I've found, although I am pretty sure it needs to be converted to a DFA first and then reduce each of them to a minimal DFA.. Also, I know that if L1 - L2 and L2 - L1 are empty, then L1 = L2. Anyone good with theory here?

You can find a description of a reasonably efficient algorithm for testing r.e. equality here: http://arxiv.org/PS_cache/arxiv/pdf/0907/0907.5058v1.pdf Dig through references of the article to find other solutions that may be less efficient, but easier to implement.

Here's a conceptually simple answer (assuming L1 and L2 are regular). 1) Find DFAs D1 and D2 for L1 and L2 respectively. 2) Construct D'1 and D'2 from D1 and D2 by swapping accepting/non-accepting states (note that D'1 accepts exactly ~L1 and D'2 accepts ~L2 where ~ means "complement of") 3) Use the standard product construction three times to produce a DFA that accepts exactly (L1 intersect ~L2) union (L2 intersect ~L1) 4) Check to see if the DFA from part 3 accepts any strings by checking each accepting state for reachability from the start state. 5) If the DFA from part 3 accepts any strings, then L1 <> L2. Otherwise, L1=L2 There are a huge number of heuristics you could use to speed this up, but conceptually, this is probably the simplest algorithm. A good reference for the product construction in part 3 is "Automata and Computability" by Dexter Kozen.

Trying to find an algorithm which takes 2 regular expressions and tells whether they are equivalent

2 Answers

You can find a description of a reasonably efficient algorithm for testing r.e. equality here:

http://arxiv.org/PS_cache/arxiv/pdf/0907/0907.5058v1.pdf

Dig through references of the article to find other solutions that may be less efficient, but easier to implement.

answered Oct 05 '22 19:10

Gintautas Miliauskas

Here's a conceptually simple answer (assuming L1 and L2 are regular).

1) Find DFAs D1 and D2 for L1 and L2 respectively.

2) Construct D'1 and D'2 from D1 and D2 by swapping accepting/non-accepting states (note that D'1 accepts exactly ~L1 and D'2 accepts ~L2 where ~ means "complement of")

3) Use the standard product construction three times to produce a DFA that accepts exactly (L1 intersect ~L2) union (L2 intersect ~L1)

4) Check to see if the DFA from part 3 accepts any strings by checking each accepting state for reachability from the start state.

5) If the DFA from part 3 accepts any strings, then L1 <> L2. Otherwise, L1=L2

There are a huge number of heuristics you could use to speed this up, but conceptually, this is probably the simplest algorithm. A good reference for the product construction in part 3 is "Automata and Computability" by Dexter Kozen.

answered Oct 05 '22 20:10

Aubrey da Cunha

Related questions
                            
                                pandas extractall() is not extracting all cases given a regex?
                            
                                Negative regular expression before specific term
                            
                                Regex parsing from delimited string with sequential groups
                            
                                Get small amout of text out of large text present in database using laravel
                            
                                Search and Replace in pandas dataframe for large dataset
                            
                                Remove '\n' in text in pandas python
                            
                                How i can match root of domain name without www. using regex
                            
                                How can I use regex to catch unquoted array indices in PHP code and quote them?
                            
                                String Matching with wildcard in Python
                            
                                git word diff regex strange behaviour
                            
                                inputFormatter should allow just decimal numbers and negative numbers
                            
                                Why is CVE-2021-33623 vulnerable to ReDoS?
                            
                                Why would you ever need (?(R)...|...) if condition in a regex?
                            
                                How to add missing spaces after periods using regex, without changing decimals
                            
                                Count occurrences of a word in a row in MySQL
                            
                                Highlighting long sentences using jQuery
                            
                                Validating XML using XSD with regex pattern
                            
                                Why do these regular expressions execute slowly in Java?
                            
                                How can I make a regular expression which takes accented characters into account?
                            
                                PHP: preg_match regex not finding correct strings

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Trying to find an algorithm which takes 2 regular expressions and tells whether they are equivalent

Tags:

regex

expression

theory

equivalence

John

People also ask

2 Answers

Gintautas Miliauskas

Aubrey da Cunha

Recent Activity

Donate For Us