Fast algorithm for checking if binary arrays can be rotated to not have an elementwise sum over 1

Tags:

Let's say I have a set of constant-length arrays containing only zeroes and ones. My goal is to find out whether, after any rotation of any of the arrays, the element-wise sums of the arrays will not exceed 1.

For instance, let's say I have the following three arrays: [1, 0, 0, 0], [1, 0, 1, 0], and [1, 0, 0, 0]. I can rotate the second array by one element and the third array by two elements to obtain the arrays [1, 0, 0, 0], [0, 1, 0, 1], [0, 0, 1, 0], the element-wise sum of which is [1, 1, 1, 1]. However, had I not applied the rotations, I would have gotten a sum of [3, 0, 1, 0], which does not fit my requirements as one of the elements (the 3) is greater than 1.

Now, my question is, what is a fast way to determine if this is possible for an arbitrary number of arrays? For instance, there is no way to rotate [1, 0, 0, 0], [1, 0, 1, 0], [1, 0, 1, 0] so that the elements of the sum do not exceed 1.

Current heuristics

Obviously, if the total sum of the arrays, which are, say, length n, exceeds n, then this is trivially impossible.
The best idea for an approach I can think of so far is taking two arrays, finding a way to merge them together, and inverting the result. Then, we take this result and the next array, and repeat this process. However, this method does not guarantee to find a solution if one exists.

My question is, short of trying every possible rotation, what would be a good algorithm for this problem?

772

asked Feb 24 '18 02:02

VarmirGadkin

2 Answers

You could reduce this problem to an exact cover problem and use one of the known algorithms for exact cover (Knuth's Algorithm X, integer linear programming, SAT solving as sascha reminds me, possibly others). The reduction involves creating all rotations of each input array and extending them with an indicator to ensure that exactly one rotation is chosen. For the instance [1, 0, 0, 0], [1, 0, 1, 0], [1, 0, 0, 0], for example, the exact cover instance is

[1, 0, 0, 0; 1, 0, 0]  # from [1, 0, 0, 0]
[0, 1, 0, 0; 1, 0, 0]
[0, 0, 1, 0; 1, 0, 0]
[0, 0, 0, 1; 1, 0, 0]
[1, 0, 1, 0; 0, 1, 0]  # from [1, 0, 1, 0]
[0, 1, 0, 1; 0, 1, 0]
[1, 0, 0, 0; 0, 0, 1]  # from [1, 0, 0, 0]
[0, 1, 0, 0; 0, 0, 1]
[0, 0, 1, 0; 0, 0, 1]
[0, 0, 0, 1; 0, 0, 1]
[1, 0, 0, 0; 0, 0, 0]  # extra columns to solve the impedance mismatch
[0, 1, 0, 0; 0, 0, 0]  # between zeros being allowed and exact cover
[0, 0, 1, 0; 0, 0, 0]
[0, 0, 0, 1; 0, 0, 0]

I have a feeling that your problem is NP-hard, which would mean that there's a reduction in the reverse direction too and hence no hope for an algorithm whose provable worst-case running time is subexponential.

EDIT: yes, this problem is NP-hard. There's an easy reduction from 3-partition that I'll demonstrate by example.

Suppose that the 3-partition instance is [20, 23, 25, 45, 27, 40]. Then we make a binary array

[1, ..(20 ones in total).., 1, 0, ..., 0]
[1, ..(23 ones in total).., 1, 0, ..., 0]
[1, ..(25 ones in total).., 1, 0, ..., 0]
[1, ..(45 ones in total).., 1, 0, ..., 0]
[1, ..(27 ones in total).., 1, 0, ..., 0]
[1, ..(40 ones in total).., 1, 0, ..., 0].

We're looking for a partition where each of the two parts sums to 90, so the final array is a "stencil"

[1, 0, ..(90 zeros in total).., 0, 1, 0, ..(90 zeros in total).., 0]

that enforces the 3-partition constraints.

162

answered Oct 12 '22 11:10

David Eisenstat

~~I'm still undecided on the question if that problem is in P or NP-hard. There surely is a lot of mathematical-structure to exploit.~~. See David's answer.

But until someone else comes with a nice solution, here something which will work in theory and might also do in practice.

The basic idea is: formulate it as SAT-problem and use very efficient solvers for this kind of combinatorial-problems.

The SAT-solver kind we use here are CDCL-based solvers which are complete and sound (they will find a feasible solution or proof there is none!).

Analysis (naive approach)

While SAT-solving is NP-hard in general, often instances with millions of variables and constraints can be solved. But this does not guarantee that this will work here. It's hard to say without testing and sadly, no test-data was provided.

The formulation is as simple as it gets:

N * M binary-variables:
- N marks the data-row; M the roation/shift value
A: preprocess all possible pairwise conflicts
B: add constraints that at least one configuration of each row is used
C: add constraints forbidding conflicts

For N=100 rows of M=100 cols, there will be 4950 ordered pairs, each multiplied by M*M (pairwise rotation-combinations) each. So there are <= 4950 * 100 * 100 = 49.500.000 conflict-checks (which is even feasible in slow languages). This also is an upper-bound on the number of conflict-constraints.

Of course this might change if you got very sparse-data which allows N to grow while M is fixed (in feasible-instance world).

There are probably a lot of potential reductions possible.

The take-away message here:

Preprocessing is a lot of work (asymptotics!) and the approach is based on the comment the length of the arrays is less than 100
SAT-solving is very very fast in regards to propagation and if P or NP-hard, the kind of constraints we provided are very powerful in terms of propagation-efficiency
Testing this empirically (on your data) is recommended !

Remark:

There is no <= per row constraint and in some cases maybe two configurations are selected. The solution-reconstruction code does not check for this case (but there is no theoretical problem -> just pick one => will be compatible).

Code

from pyCadical import PyCadical  # own wrapper; not much tested; @github
import itertools
import numpy as np

""" DATA """
data = np.array([[1, 0, 0, 0],
                 [1, 0, 1, 0],
                 [1, 0, 0, 0]])

""" Preprocessing """
N = len(data)
M = len(data[0])

conflicts = []
for i, j in itertools.combinations(range(N), 2):
    for rotA in range(M):
        for rotB in range(M):
            if np.amax(np.roll(data[i], rotA) + np.roll(data[j], rotB)) > 1:
                conflicts.append((i, j, rotA, rotB))
conflicts = np.array(conflicts)

""" SAT """
cad = PyCadical()
vars = np.arange(N*M, dtype=int).reshape(N,M) + 1

# at least one rotation chosen per element
for i in range(N):
    cad.add_clause(vars[i, :])  # row0rot0 OR row0rot1 OR ...

# forbid conflicts
for i, j, rotA, rotB in conflicts:
    cad.add_clause([-vars[i, rotA], -vars[j, rotB]])  # (not rowIrotA) or (not rowJrotB)

""" Solve """
cad.solve()

""" Read out solution """
sol = cad.get_sol_np().reshape(N, M)
chosen = np.where(sol > 0)

solution = []  # could be implemented vectorized
for i in range(N):
    solution.append(np.roll(data[i], chosen[1][i]))

print(np.array(solution))

Output

[[0 1 0 0]
 [1 0 1 0]
 [0 0 0 1]]

answered Oct 12 '22 11:10

sascha

Related questions
                            
                                How to make custom navigation view animations like this one
                            
                                DAO / repository: Good practice return value after insert / update
                            
                                How to serialize java.math.BigDecimal in Avro?
                            
                                Spring Boot with MongoTemplate
                            
                                Suffix array nlogn creation
                            
                                java.lang.NoSuchMethodError - Ljava/lang/String;)Ljava/lang/String;
                            
                                How does 'maven jetty:run' work?
                            
                                What does <fork>true</fork> means in pom.xml while using maven?
                            
                                Spring Tool Suite (STS) - Can not import using Gradle (Buildship)
                            
                                Why does the JVM show more latency for the same block of code after a busy spin pause?
                            
                                How to store and reuse keypair in Java?
                            
                                Are cycles allowed between platform modules?
                            
                                Stop accepting new TCP connections without dropping any existing ones
                            
                                How to create OpenID Provider with Spring Boot
                            
                                How to integrate Firebase with Glide ('using' method)
                            
                                How to keep user information after login in JavaFX Desktop Application
                            
                                os.detected.classifier not set - os-maven-plugin and eclipse oxygen
                            
                                Convert Maybe to Single from another source if Maybe completes
                            
                                How can Spring's test annotation @Sql behave like @BeforeClass?
                            
                                AspectJ - pointcut for methods of interface's implementation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fast algorithm for checking if binary arrays can be rotated to not have an elementwise sum over 1

Tags:

java

arrays

language-agnostic

algorithm

heuristics