I am building an encryption program which produces a massive integer.It looks something like this: <pre class="prettyprint lang-py prettyprint-override"><code>a = plaintextOrd**bigNumber </code></pre> when i do <pre class="prettyprint lang-py prettyprint-override"><code>a = str(a) </code></pre> it takes over 28 minutes. Is there any possible way to convert an integer like this quicker that using the built in str() function? the reason i need it to be a string is because of this function here: <pre class="prettyprint lang-py prettyprint-override"><code>def divideStringIntoParts(parts,string): parts = int(parts) a = len(string)//parts new = [] firstTime = True secondTime = True for i in range(parts): if firstTime: new.append(string[:a]) firstTime = False elif secondTime: new.append(string[a:a+a]) secondTime = False else: new.append(string[a*i:a*(i+1)]) string2 = "" for i in new: for i in i: string2 += i if len(string2) - len(string) != 0: lettersNeeded = len(string) - len(string2) for i in range(lettersNeeded): new[-1] += string[len(string2) + i] return new </code></pre>

You wrote in the comments that you want to get the length of the integer in decimal format. You don't need to convert this integer to a string, you can use "common logarithm" instead: <pre class="prettyprint"><code>import math math.ceil(math.log(a, 10)) </code></pre> Moreover, if you know that: <pre class="prettyprint"><code>a = plaintextOrd**bigNumber </code></pre> then <code>math.log(a, 10)</code> is equal to <code>math.log(plaintextOrd, 10) * bigNumber</code>, which shouldn't take more than a few milliseconds to calculate: <pre class="prettyprint"><code>>>> plaintextOrd = 12345 >>> bigNumber = 67890 >>> a = plaintextOrd**bigNumber >>> len(str(a)) 277772 >>> import math >>> math.ceil(math.log(a, 10)) 277772 >>> math.ceil(math.log(plaintextOrd, 10) * bigNumber) 277772 </code></pre> It should work even if <code>a</code> wouldn't fit on your hard drive: <pre class="prettyprint"><code>>>> math.ceil(math.log(123456789, 10) * 123456789012345678901234567890) 998952457326621672529828249600 </code></pre> As mentioned by @kaya3, Python standard floats aren't precise enough to describe the exact length of such a large number. You could use <code>mpmath</code> (arbitrary-precision floating-point arithmetic) to get results with the desired precision: <pre class="prettyprint"><code>>>> from mpmath import mp >>> mp.dps = 1000 >>> mp.ceil(mp.log(123456789, 10) * mp.mpf('123456789012345678901234567890')) mpf('998952457326621684655868656199.0') </code></pre>

Some quick notes on the "I need it for this function". <ul> <li>You don't need the first/second logic: <ul> <li><code>[:a] == [a*0:a*(0+1)]</code></li> <li><code>[a:a+a] == [a*1:a*(1+1)]</code></li> </ul> </li> </ul> So we have <pre class="prettyprint"><code> new = [] for i in range(parts): new.append(string[a*i:a*(i+1)]) </code></pre> or just <code>new = [string[a*i:a*(i+1)] for i in range(parts)]</code>. Note that you have silently discarded the last <code>len(string) % parts</code> characters. In your second loop, you shadow <code>i</code> with <code>for i in i</code>, which happens to work but is awkward and dangerous. It can also be replaced with <code>string2 = ''.join(new)</code>, which means you can just do <code>string2 = string[:-(len(string) % parts)]</code>. You then see if the strings are the same length, and then add the extra letters to the end of the last list. This is a little surprising, e.g. you would have <pre class="prettyprint"><code>>>> divideStringIntoParts(3, '0123456789a') ['012', '345', '6789a'] </code></pre> When most algorithms would produce something that favors even distributions, and earlier elements, e.g.: <pre class="prettyprint"><code>>>> divideStringIntoParts(3, '0123456789a') ['0124', '4567', '89a'] </code></pre> Regardless of this, we see that you don't really care about the value of the string at all here, just how many digits it has. Thus you could rewrite your function as follows. <pre class="prettyprint"><code>def divide_number_into_parts(number, parts): ''' >>> divide_number_into_parts(12345678901, 3) [123, 456, 78901] ''' total_digits = math.ceil(math.log(number + 1, 10)) part_digits = total_digits // parts extra_digits = total_digits % parts remaining = number results = [] for i in range(parts): to_take = part_digits if i == 0: to_take += extra_digits digits, remaining = take_digits(remaining, to_take) results.append(digits) # Reverse results, since we go from the end to the beginning return results[::-1] def take_digits(number, digits): ''' Removes the last <digits> digits from number. Returns those digits along with the remainder, e.g.: >>> take_digits(12345, 2) (45, 123) ''' mod = 10 ** digits return number % mod, number // mod </code></pre> This should be very fast, since it avoids strings altogether. You can change it to strings at the end if you'd like, which may or may not benefit from the other answers here, depending on your chunk sizes.

Is it possible to convert a really large int to a string quickly in python

Tags:

python

I am building an encryption program which produces a massive integer.It looks something like this:

a = plaintextOrd**bigNumber

when i do

a = str(a)

it takes over 28 minutes.

Is there any possible way to convert an integer like this quicker that using the built in str() function?

the reason i need it to be a string is because of this function here:

def divideStringIntoParts(parts,string):
    parts = int(parts)
    a = len(string)//parts

    new = []
    firstTime = True
    secondTime = True
    for i in range(parts):
        if firstTime:
            new.append(string[:a])
            firstTime = False
        elif secondTime:
            new.append(string[a:a+a])
            secondTime = False
        else:
            new.append(string[a*i:a*(i+1)])

    string2 = ""
    for i in new:
        for i in i:
            string2 += i

    if len(string2) - len(string) != 0:
        lettersNeeded = len(string) - len(string2)
        for i in range(lettersNeeded):
            new[-1] += string[len(string2) + i] 

    return new

686

asked Nov 23 '19 15:11

Matthew Tranmer

2 Answers

You wrote in the comments that you want to get the length of the integer in decimal format. You don't need to convert this integer to a string, you can use "common logarithm" instead:

import math
math.ceil(math.log(a, 10))

Moreover, if you know that:

a = plaintextOrd**bigNumber

then math.log(a, 10) is equal to math.log(plaintextOrd, 10) * bigNumber, which shouldn't take more than a few milliseconds to calculate:

>>> plaintextOrd = 12345
>>> bigNumber = 67890
>>> a = plaintextOrd**bigNumber
>>> len(str(a))
277772
>>> import math
>>> math.ceil(math.log(a, 10))
277772
>>> math.ceil(math.log(plaintextOrd, 10) * bigNumber)
277772

It should work even if a wouldn't fit on your hard drive:

>>> math.ceil(math.log(123456789, 10) * 123456789012345678901234567890)
998952457326621672529828249600

As mentioned by @kaya3, Python standard floats aren't precise enough to describe the exact length of such a large number.

You could use mpmath (arbitrary-precision floating-point arithmetic) to get results with the desired precision:

>>> from mpmath import mp
>>> mp.dps = 1000
>>> mp.ceil(mp.log(123456789, 10) * mp.mpf('123456789012345678901234567890'))
mpf('998952457326621684655868656199.0')

answered Oct 04 '22 01:10

Eric Duminil

Some quick notes on the "I need it for this function".

You don't need the first/second logic:
- [:a] == [a*0:a*(0+1)]
- [a:a+a] == [a*1:a*(1+1)]

So we have

    new = []
    for i in range(parts):
        new.append(string[a*i:a*(i+1)])

or just new = [string[a*i:a*(i+1)] for i in range(parts)].

Note that you have silently discarded the last len(string) % parts characters.

In your second loop, you shadow i with for i in i, which happens to work but is awkward and dangerous. It can also be replaced with string2 = ''.join(new), which means you can just do string2 = string[:-(len(string) % parts)].

You then see if the strings are the same length, and then add the extra letters to the end of the last list. This is a little surprising, e.g. you would have

>>> divideStringIntoParts(3, '0123456789a')
['012', '345', '6789a']

When most algorithms would produce something that favors even distributions, and earlier elements, e.g.:

>>> divideStringIntoParts(3, '0123456789a')
['0124', '4567', '89a']

Regardless of this, we see that you don't really care about the value of the string at all here, just how many digits it has. Thus you could rewrite your function as follows.

def divide_number_into_parts(number, parts):
    '''
    >>> divide_number_into_parts(12345678901, 3)
    [123, 456, 78901]
    '''
    total_digits = math.ceil(math.log(number + 1, 10))
    part_digits = total_digits // parts
    extra_digits = total_digits % parts

    remaining = number
    results = []
    for i in range(parts):
        to_take = part_digits
        if i == 0:
            to_take += extra_digits
        digits, remaining = take_digits(remaining, to_take)
        results.append(digits)
    # Reverse results, since we go from the end to the beginning
    return results[::-1]


def take_digits(number, digits):
    '''
    Removes the last <digits> digits from number.
    Returns those digits along with the remainder, e.g.:
    >>> take_digits(12345, 2)
    (45, 123)
    '''
    mod = 10 ** digits
    return number % mod, number // mod

This should be very fast, since it avoids strings altogether. You can change it to strings at the end if you'd like, which may or may not benefit from the other answers here, depending on your chunk sizes.

answered Oct 04 '22 01:10

Cireo

Related questions
                            
                                Python, list of tuples split into dictionaries
                            
                                get all unicode variations of a latin character
                            
                                How to count consecutive repetitions in a pandas series
                            
                                How to use flask_jwt_extended with blueprints?
                            
                                how to convert perreplica to tensor?
                            
                                How to plot text clusters?
                            
                                Dictionary to Dataframe Error: "If using all scalar values, you must pass an index"
                            
                                Why do these two functions have the same bytecode when disassembled under dis.dis?
                            
                                DataFrame to list of list without change in data type of values
                            
                                cannot import name 'ft2font' from 'matplotlib' on windows10
                            
                                Decreasing the time necessary to enter the coefficients of a matrix
                            
                                How to install the specific version of Python with Anaconda?
                            
                                How can I integrate xgboost in spark? (Python)
                            
                                How to count no of rows in a data frame whose values divisible by 3 or 5?
                            
                                How to animate a line chart in a streamlit page
                            
                                How to popup success message in odoo?
                            
                                SQLAlchemy: Can't reconnect until invalid transaction is rolled back
                            
                                What is causing large jumps in training accuracy and loss between epochs?
                            
                                rllib use custom registered environments
                            
                                Is it possible to extract text from specific portion of image using pytesseract

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With