This problem is similar to blind SQL injections. The goal is to determine the exact value of a string, and the only test you can do is to see if a DOS-style wildcard (? = any character, * = any number of any characters) you specify is matched by the string. (So practically you only have access to a <code>bool DoesWildcardMatch(string wildcard)</code> function). The straight-forward way is to test against <code>a*, b*, c*...</code> until you find the first letter, then repeat. Some optimizations I can think of: <ul> <li>search for <code>*a*, *b*</code> etc. to determine the character set</li> <li>when a match on <code>*x*</code> is found, perform divide-et-impera (<code>*a*x*, *b*x*, ...</code>)</li> </ul>

A first thought. You can determin the length <code>n</code> of the string in <code>O(log2(n))</code>. <ul> <li>Check <code>Z*</code> where <code>Z</code> represents <code>k</code> question marks starting with 0, then 1, and then doubling the number of question marks with every check until no match occurs. <code>n</code> must be between <code>k / 2</code> and <code>k</code> </li> <li>Find the exact length using the same pattern changing <code>k</code> in the same way as binary search does.</li> </ul> Knowing the exact length might help to perform a kind of divide-et-impera in the spatial domain. UPDATE If you know the length, you can use the same pattern to correctly locate a symbol. Example: <pre class="prettyprint"> ..X. ..XX (spaces added for readability) + symbol may be X - symbol is not X X symbol is X *X* => MATCH ++++ ++++ *X* ???? => MATCH ++++ ++++ *X*?? ???? => NO MATCH --++ ++++ ??X? ???? => MATCH --X+ ++++ ??XX ???? => NO MATCH --X- ++++ ??X? *X*?? => NO MATCH --X- --++ ??X? ??X? => MATCH --X- --X+ ??X? ??XX => MATCH --X- --XX </pre> For string length <code>n</code> and alphabet size <code>m</code> this will take about <code>O(log2(n))</code> to find the length of the string, about <code>O(n • log2(n))</code> to correctly place <code>n</code> symbols, and <code>O(m)</code> to find the used symbols - summing all together yields <code>O(n • log2(n) + m)</code>. I could imagine that it is possible to speed this up by merging several steps - maybe test for used symbols while determining the string length or simultaneously locating two (or even more?) symbols in the first and second half of the string. This will require to recheck the merged steps in isolation if the check fails in order to determine which check faild. But as long as the merged check succeeds, you gain information on both. Maybe I will calculate that tomorrow in order to see if it will really speed the thing up.

Fastest way to bruteforce a string using a DOS wildcard

Tags:

string

algorithm

wildcard

brute-force

This problem is similar to blind SQL injections. The goal is to determine the exact value of a string, and the only test you can do is to see if a DOS-style wildcard (? = any character, * = any number of any characters) you specify is matched by the string. (So practically you only have access to a bool DoesWildcardMatch(string wildcard) function).

The straight-forward way is to test against a*, b*, c*... until you find the first letter, then repeat. Some optimizations I can think of:

search for *a*, *b* etc. to determine the character set
when a match on *x* is found, perform divide-et-impera (*a*x*, *b*x*, ...)

649

asked May 14 '09 19:05

Vladimir Panteleev

1 Answers

A first thought. You can determin the length n of the string in O(log2(n)).

Check Z* where Z represents k question marks starting with 0, then 1, and then doubling the number of question marks with every check until no match occurs. n must be between k / 2 and k
Find the exact length using the same pattern changing k in the same way as binary search does.

Knowing the exact length might help to perform a kind of divide-et-impera in the spatial domain.

UPDATE

If you know the length, you can use the same pattern to correctly locate a symbol.

Example:

    ..X. ..XX (spaces added for readability)

                              + symbol may be X
                              - symbol is not X
                              X symbol is X

    *X*         => MATCH      ++++ ++++
    *X*   ????  => MATCH      ++++ ++++
    *X*?? ????  => NO MATCH   --++ ++++
    ??X?  ????  => MATCH      --X+ ++++
    ??XX  ????  => NO MATCH   --X- ++++
    ??X?  *X*?? => NO MATCH   --X- --++
    ??X?  ??X?  => MATCH      --X- --X+
    ??X?  ??XX  => MATCH      --X- --XX

For string length n and alphabet size m this will take about O(log2(n)) to find the length of the string, about O(n • log2(n)) to correctly place n symbols, and O(m) to find the used symbols - summing all together yields O(n • log2(n) + m).

I could imagine that it is possible to speed this up by merging several steps - maybe test for used symbols while determining the string length or simultaneously locating two (or even more?) symbols in the first and second half of the string. This will require to recheck the merged steps in isolation if the check fails in order to determine which check faild. But as long as the merged check succeeds, you gain information on both.

Maybe I will calculate that tomorrow in order to see if it will really speed the thing up.

answered Sep 17 '22 12:09

Daniel Brückner

Related questions
                            
                                finding minimum number of rectangular pieces in a rectangular chocolate bar, with a rule
                            
                                Generating n binary vectors where each vector has a Hamming distance of d from every other vector
                            
                                Group array of items by their distinct id
                            
                                generate a random point within rectangles' areas uniformly (some rectangles could overlap)
                            
                                3Sum leetcode algorithm
                            
                                Peak finding algorithm in 2d-array with complexity O(n)
                            
                                Algorithm to find k smallest numbers in an array in same order using O(1) auxiliary space
                            
                                Any idea to optimise this algorithm?
                            
                                Sorted array except for first K and last K elements
                            
                                Sum of max elements in sub-triangles
                            
                                Building a tree recursively in JavaScript
                            
                                How to compute the min average sub-array better than O(n^2)? [duplicate]
                            
                                Progressively store the path from root node to node of multiway tree during insertion so that the storage operation does not have a complexity of O(n)
                            
                                Check if strings in a list can be formed by concatenation of elements in the same list
                            
                                Number of expressions of a given length
                            
                                Number of ways to change coins in constant time?
                            
                                Why is KNN so much faster with cosine distance than Euclidean distance?
                            
                                Are function parameters not polymorphic in Algorithm W (or Haskell)?
                            
                                Finding subarrays in an array where length equals P * (sum of elements)
                            
                                how to identify the minimal set of parameters describing a data set

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With