Testing a too general program

Tags:

Suppose the correct definition of a predicate would be

len([],0).
len([_|T],N)  :-  len(T,X), N is  X+1.

However, we end up with the following erroneous definition in stead.

Click to copy

len2([],0).
len2([_|T],N)  :-  len(T,X),  ( N  is  X+1 ; N is X + 2, N = 10000 ).

All the standard testing doesn’t reveal the mistake because it works just like len/2 except when it stumbles upon a list of length exactly 9999 elements where there are two possible answers.

as user mjano314 observes. How is it possible to detect such an error?

Note that len2/2 above uses len/2. In this manner there is precisely one single case where the definition is too general. Would len2/2 be directly recursive, we would have infinitely many cases that are too general. Obviously, in such a situation it would be easier to locate errors.

262

asked Jul 01 '21 19:07

false

2 Answers

If we already suspect that the predicate len2(X,Y) is not functional while we expect it to be, meaning in this case that there are no two answers with the same value for the first argument and different values for the second argument, then we can verify our suspicion by searching for such two answers with the following snippet:

Click to copy

len2(X,Y1), len2(X,Y2), Y1\=Y2

In this case, the program will give us an answer with Y1=9999, Y2=10000 and X a list of 9999 variables.

However, if the fault is not present or if the code of the predicate was such that the input triggering the fault was not generated in finite time (imagine that it generates all even-length lists before any odd-length list), then the code above will not finish. This means, in my opinion, that this approach is only useful as a debugging tool but not really suitable as part of some automated testing/validation of the predicate.

answered Sep 20 '22 09:09

jnmonette

As has been noted by @jnmonette, there is a functional dependency from the first to the second argument. With a query like

Click to copy

?- len2(L, N), dif(N, M), len2(L, M).
   L = [_A,_B,_C,_D,_E,_F,_G,_H,_I,_J|...], N = 9999, M = 10000
;  L = [_A,_B,_C,_D,_E,_F,_G,_H,_I,_J|...], N = 10000, M = 9999
;  loops.

the error in len2/2 can be detected. After all, L cannot have two different lengths. Additionally,

Click to copy

?- len2([_|L], N), len2(L, N).
   N = 10000, L = [_A,_B,_C,_D,_E,_F,_G,_H,_I,_J|...]
;  loops.

identifies the error going into the other direction. The lengths of L and [_|L] cannot be the same. This can be generalized to cover all such errors:

Click to copy

?- len2(L, N), phrase(([_],...), L,K), len2(K, N).
   L = [_A,_B,_C,_D,_E,_F,_G,_H,_I,_J|...], N = 10000, K = [_B,_C,_D,_E,_F,_G,_H,_I,_J,_K|...]
;  loops.

So far we have used Prolog directly. We were able to state general properties by posing this queries. However, we will only find counterexamples should there be some, but otherwise we are left in the uncertain. And, we were pretty lucky that the actual definition enumerates answers in a fair manner letting each and every answer appear in finite time. In case of a more demanding definition (think of exchanging the order of clauses in len/2) simply add length(L, _) in front of all queries so far.

However, in case of a running query we inevitably ask: Should we continue to wait for an answer, or can we already abort the query? After all, for a correct implementation the query will not produce any answer and thus loop indefinitely.

There is no way (in current implementations) to delegate such a query at least into the background running there with lower priority. And therefore such queries are not used for testing at all.

On the other hand, such queries are a very powerful way to express many testable properties. For example many of the bugs in clpfd-systems of SICStus, SWI, and Scryer have been identified in this manner using condor. The cruder support however does not lead to very elegant solutions.

To start to address this problem, the following annotation may help:

Click to copy

:/-& len2(L, N), dif(N, M), len2(L, M).
:/-& len2(L, N), phrase(([_],...), L,K), len2(K, N).

The :/- means there is no solution - similarly to :- \+ Q_0. and the additional &, an asciified ∞, meaning Prolog's execution will be infinite. This annotation therefore leaves room to try out better strategies that disprove that there is no solution.

In GUPU this annotation is executed as a Prolog goal with a (relatively short) timeout. Also alternate strategies are tried in particular iterative deepening which in this case do time out as well. So effectively, the error remains undetected. But with more resources or a better strategy the error might be discovered.

answered Sep 18 '22 09:09

false

Related questions
                            
                                When only "All Exceptions" breakpoint is set, Xcode 7 always stop on app start
                            
                                How can I change the debug path included in the DWARF info of a binary by the compiler
                            
                                Steps to diagnose translated UWP stack trace
                            
                                Likely and unlikely causes of Heisenbugs in Java?
                            
                                How can I tell a Python script to halt for debugger attach to process?
                            
                                Find category of MATLAB mlint warning ID
                            
                                Rust debug library with GDB
                            
                                Wildfly 10 not showing debug messages
                            
                                Error:Execution failed for task ':app:transformClassesAndResourcesWithProguardForDebug'
                            
                                How to debug `Error while processing function` in `vim` and `nvim`?
                            
                                SEH on Windows, call stack traceback is gone
                            
                                Where does Atom (or it's installed packages) store debug log files?
                            
                                Delphi - Trying to get StackTrace for an exception
                            
                                gdb break when entering child process
                            
                                How can I debug a stdin error that jq is throwing at me?
                            
                                Stall when debugging with gdbserver in VSCode - "The preLaunchTask 'docker gdb' cannot be tracked."
                            
                                iOS Swift: write print and debug prints into a file
                            
                                Showing class attributes in the PyCharm debugger when subclassing str
                            
                                Angular 7: no compile errors, but blank screen
                            
                                Remote debugging with mvnDebug, doesn't stop at breakpoint

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Testing a too general program

Tags:

debugging

declarative

prolog

false

People also ask

2 Answers

jnmonette

false

Recent Activity

Donate For Us