Subset-Sum in Linear Time

Tags:

This was a question on our Algorithms final exam. It's verbatim because the prof let us take a copy of the exam home.

(20 points) Let I = {r1,r2,...,rn} be a set of n arbitrary positive integers and the values in I are distinct. I is not given in any sorted order. Suppose we want to find a subset I' of I such that the total sum of all elements in I' is exactly 100*ceil(n^.5) (each element of I can appear at most once in I'). Present an O(n) time algorithm for solving this problem.

As far as I can tell, it's basically a special case of the knapsack problem, otherwise known as the subset-sum problem ... both of which are in NP and in theory impossible to solve in linear time?

So ... was this a trick question?

This SO post basically explains that a pseudo-polynomial (linear) time approximation can be done if the weights are bounded, but in the exam problem the weights aren't bounded and either way given the overall difficulty of the exam I'd be shocked if the prof expected us to know/come up with an obscure dynamic optimization algorithm.

986

asked Dec 17 '13 05:12

cjhin

1 Answers

There are two things that make this problem possible:

The input can be truncated to size O(sqrt(n)). There are no negative inputs, so you can discard any numbers greater than 100*sqrt(n), and all inputs are distinct so we know there are at most 100*sqrt(n) inputs that matter.
The playing field has size O(sqrt(n)). Although there are O(2^sqrt(n)) ways to combine the O(sqrt(n)) inputs that matter, you don't have to care about combinations that either leave the 100*sqrt(n) range or redundantly hit a target you can already reach.

Basically, this problem screams dynamic programming with each input being checked against each part of the 'reached number' space somehow.

The solution ends up being a matter of ensuring numbers don't reach off of themselves (by scanning in the right direction), of only looking at each number once, and of giving ourselves enough information to reconstruct the solution afterwards.

Here's some C# code that should solve the problem in the given time:

int[] FindSubsetToImpliedTarget(int[] inputs) {
    var target = 100*(int)Math.Ceiling(Math.Sqrt(inputs.Count));

    // build up how-X-was-reached table
    var reached = new int?[target+1];
    reached[0] = 0; // the empty set reaches 0
    foreach (var e in inputs) {
        // we go backwards to avoid reaching off of ourselves
        for (var i = target; i >= e; i--) {
            if (reached[i-e].HasValue) {
                reached[i] = e;
            }
        }
    }

    // was target even reached?
    if (!reached[target].HasValue) return null;

    // build result by back-tracking via the logged reached values
    var result = new List<int>();
    for (var i = target; reached[i] != 0; i -= reached[i].Value) {
        result.Add(reached[i].Value);
    }
    return result.ToArray();
}

I haven't actually tested the above code, so beware typos and off-by-ones.

174

answered Oct 06 '22 00:10

Craig Gidney

Related questions
                            
                                Pairing heap - implementation of decrease key
                            
                                Storing parent child mapping in memory. To list all reachable child for a parent efficiently
                            
                                How to traverse through a adjacency matrix?
                            
                                Under what conditions do these non-comparison sorts run in linear time?
                            
                                How to find out whether a triangle mesh is concave or not?
                            
                                Finding a pattern in a binary string
                            
                                Writing an algorithm to decide whether a target number can be reached with a set of other numbers and specific operators?
                            
                                find a matrix in a big matrix
                            
                                Haskell Linear-Time Online Algorithm
                            
                                Open a lock with the least number of moves
                            
                                How to arrange a graph linearly with no overlapping?
                            
                                Obtaining the minimum number of tails of coin after flipping the entire row or column multiple times
                            
                                Algorithm to make a polynomial fit of a part of a data set
                            
                                OpenCV find coloured in circle and position value Python
                            
                                Reasoning behind shifting over the text whem mismatch occurs in KMP algorithm?
                            
                                Three elements in array whose xor is maximum
                            
                                Finding all subsets of a multiset
                            
                                Push, Pop, Range in constant time
                            
                                Find a single number in a list when other numbers occur more than twice
                            
                                Rotating right an array of int in c#?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Subset-Sum in Linear Time

Tags:

algorithm

time-complexity

subset-sum

cjhin

People also ask

1 Answers

Craig Gidney

Recent Activity

Donate For Us