How can one modify the format for the output from a groupby operation in pandas that produces scientific notation for very large numbers? I know how to do string formatting in python but I'm at a loss when it comes to applying it here. <pre class="prettyprint"><code>df1.groupby('dept')['data1'].sum() dept value1 1.192433e+08 value2 1.293066e+08 value3 1.077142e+08 </code></pre> This suppresses the scientific notation if I convert to string but now I'm just wondering how to string format and add decimals. <pre class="prettyprint"><code>sum_sales_dept.astype(str) </code></pre>

Granted, the answer I linked in the comments is not very helpful. You can specify your own string converter like so. <pre class="prettyprint"><code>In [25]: pd.set_option('display.float_format', lambda x: '%.3f' % x) In [28]: Series(np.random.randn(3))*1000000000 Out[28]: 0 -757322420.605 1 -1436160588.997 2 -1235116117.064 dtype: float64 </code></pre> I'm not sure if that's the preferred way to do this, but it works. Converting numbers to strings purely for aesthetic purposes seems like a bad idea, but if you have a good reason, this is one way: <pre class="prettyprint"><code>In [6]: Series(np.random.randn(3)).apply(lambda x: '%.3f' % x) Out[6]: 0 0.026 1 -0.482 2 -0.694 dtype: object </code></pre>

Here is another way of doing it, similar to Dan Allan's answer but without the lambda function: <pre class="prettyprint"><code>>>> pd.options.display.float_format = '{:.2f}'.format >>> Series(np.random.randn(3)) 0 0.41 1 0.99 2 0.10 </code></pre> or <pre class="prettyprint"><code>>>> pd.set_option('display.float_format', '{:.2f}'.format) </code></pre>

Format / Suppress Scientific Notation from Python Pandas Aggregation Results

Tags:

python

floating-point

pandas

number-formatting

scientific-notation

How can one modify the format for the output from a groupby operation in pandas that produces scientific notation for very large numbers?

I know how to do string formatting in python but I'm at a loss when it comes to applying it here.

df1.groupby('dept')['data1'].sum()  dept value1       1.192433e+08 value2       1.293066e+08 value3       1.077142e+08

This suppresses the scientific notation if I convert to string but now I'm just wondering how to string format and add decimals.

sum_sales_dept.astype(str)

643

asked Jan 15 '14 12:01

horatio1701d

2 Answers

Granted, the answer I linked in the comments is not very helpful. You can specify your own string converter like so.

In [25]: pd.set_option('display.float_format', lambda x: '%.3f' % x)  In [28]: Series(np.random.randn(3))*1000000000 Out[28]:  0    -757322420.605 1   -1436160588.997 2   -1235116117.064 dtype: float64

I'm not sure if that's the preferred way to do this, but it works.

Converting numbers to strings purely for aesthetic purposes seems like a bad idea, but if you have a good reason, this is one way:

In [6]: Series(np.random.randn(3)).apply(lambda x: '%.3f' % x) Out[6]:  0     0.026 1    -0.482 2    -0.694 dtype: object

152

answered Sep 24 '22 15:09

Dan Allan

Here is another way of doing it, similar to Dan Allan's answer but without the lambda function:

>>> pd.options.display.float_format = '{:.2f}'.format >>> Series(np.random.randn(3)) 0    0.41 1    0.99 2    0.10

>>> pd.set_option('display.float_format', '{:.2f}'.format)

answered Sep 25 '22 15:09

tfhans

Related questions
                            
                                Is it possible to decompile a compiled .pyc file into a .py file?
                            
                                How do I get python's pprint to return a string instead of printing?
                            
                                Getting the docstring from a function
                            
                                Is there a zip-like function that pads to longest length?
                            
                                Update value of a nested dictionary of varying depth
                            
                                Python: Why is functools.partial necessary?
                            
                                Python csv string to array
                            
                                matplotlib error - no module named tkinter
                            
                                How to calculate the time interval between two time strings
                            
                                Execute code when Django starts ONCE only?
                            
                                Get the current git hash in a Python script
                            
                                What is the difference between pyenv, virtualenv, anaconda?
                            
                                When and why should I use a namedtuple instead of a dictionary? [duplicate]
                            
                                Pandas column of lists, create a row for each list element
                            
                                Is it possible to use argsort in descending order?
                            
                                Queue.Queue vs. collections.deque
                            
                                Formatting floats without trailing zeros
                            
                                SQLAlchemy: print the actual query
                            
                                django test app error - Got an error creating the test database: permission denied to create database
                            
                                How do I get current URL in Selenium Webdriver 2 Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With