How can I left justify text in a pandas DataFrame column in an IPython notebook

Tags: pandas, ipython, ipython-notebook

I am trying to format the output in an IPython notebook. I tried using the to_string function, and this neatly lets me eliminate the index column. But the textual data is right justified.

In [10]:

import pandas as pd
a = pd.DataFrame({'Text': ['abcdef', 'x'], 'Value': [12.34, 4.2]})
print(a.to_string(index=False))

   Text  Value
 abcdef  12.34
      x   4.20

The same is true when just printing the dataframe.

In [12]:

print(a)

     Text  Value
0  abcdef  12.34
1       x   4.20

The justify argument in the to_string function, surprisingly, only justifies the column heading.

In [13]:

import pandas as pd
a = pd.DataFrame({'Text': ['abcdef', 'x'], 'Value': [12.34, 4.2]})
print(a.to_string(justify='left', index=False))

Text     Value
 abcdef  12.34
      x   4.20

How can I control the justification settings for individual columns?

asked Sep 11 '14 00:09

Fred Mitchell

3 Answers

If you're willing to use another library, tabulate will do this:

$ pip install tabulate

and then

from tabulate import tabulate
df = pd.DataFrame ({'Text': ['abcdef', 'x'], 'Value': [12.34, 4.2]})
print(tabulate(df, showindex=False, headers=df.columns))

Text      Value
------  -------
abcdef    12.34
x          4.2

It has various other output formats also.
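
For comparison, the kind of per-column alignment shown above (text left, numbers right) can be sketched in plain Python when adding a dependency is not an option. This is only an illustration: `format_table` and its `aligns` argument are hypothetical names, not part of tabulate or pandas, and numbers are rendered with `str()`, so 4.2 prints as 4.2 rather than 4.20.

```python
def format_table(headers, rows, aligns):
    """Render rows as text; aligns holds '<' (left) or '>' (right) per column."""
    # Convert every cell to text once so widths and output agree.
    cells = [[str(value) for value in row] for row in rows]
    # Each column is as wide as its longest cell or its header.
    widths = [
        max([len(header)] + [len(row[i]) for row in cells])
        for i, header in enumerate(headers)
    ]

    def render(values):
        # Pad each value to its column width using the column's alignment.
        return "  ".join(
            f"{value:{align}{width}}"
            for value, align, width in zip(values, aligns, widths)
        ).rstrip()

    lines = [render(headers), render("-" * w for w in widths)]
    lines += [render(row) for row in cells]
    return "\n".join(lines)


table = format_table(["Text", "Value"], [["abcdef", 12.34], ["x", 4.2]], "<>")
print(table)
```

Passing `"<<"` instead of `"<>"` would left-align both columns.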

answered Oct 12 '22 16:10

Brian Burns

You could use a['Text'].str.len().max() to compute the length of the longest string in a['Text'], and use that number, N, in a left-justified formatter '{:<Ns}'.format:

In [211]: print(a.to_string(formatters={'Text':'{{:<{}s}}'.format(a['Text'].str.len().max()).format}, index=False))
   Text  Value
 abcdef  12.34
 x        4.20
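
The doubled braces in that one-liner can be hard to read. As a minimal sketch in plain Python (no pandas needed; the names `texts`, `width`, `spec`, and `left_align` are illustrative and not from the answer), the formatter is built in two steps:

```python
texts = ['abcdef', 'x']

# Step 1: the outer .format() fills in the width. '{{' and '}}' are escaped
# braces, so the result is itself a format string.
width = max(len(t) for t in texts)
spec = '{{:<{}s}}'.format(width)
print(spec)      # {:<6s}

# Step 2: the bound method spec.format is the callable that gets passed
# as the column's formatter.
left_align = spec.format
padded = [left_align(t) for t in texts]
print(padded)    # ['abcdef', 'x     ']
```

The `<` in the spec requests left alignment, the number is the minimum field width, and `s` restricts the formatter to strings.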

answered Oct 12 '22 17:10

unutbu

I like @unutbu's answer (it doesn't require any additional dependencies). @JS.'s additions are a step in the right direction (towards something reusable).

Since the construction of the formatter dict is the difficult part, let's create a function which creates the formatter dict from a DataFrame and an optional list of columns to format.

def make_lalign_formatter(df, cols=None):
    """
    Construct formatter dict to left-align columns.

    Parameters
    ----------
    df : pandas.core.frame.DataFrame
        The DataFrame to format
    cols : None or iterable of strings, optional
        The columns of df to left-align. The default, cols=None, will
        left-align all the columns of dtype object

    Returns
    -------
    dict
        Formatter dictionary

    """
    if cols is None:
        cols = df.columns[df.dtypes == 'object']

    return {col: f'{{:<{df[col].str.len().max()}s}}'.format for col in cols}

Let's create some example data to demonstrate using this function:

import pandas as pd

# Make some data
data = {'First': ['Tom', 'Dick', 'Harry'],
        'Last': ['Thumb', 'Whittington', 'Potter'],
        'Age': [183, 667, 23]}

# Make into a DataFrame
df = pd.DataFrame(data)

To left-align all the columns of dtype object in our DataFrame:

# Left-align all object-dtype columns
print(df.to_string(formatters=make_lalign_formatter(df),
                   index=False,
                   justify='left'))

To left-align only the 'First' column:

# Left-align the 'First' column only
print(df.to_string(formatters=make_lalign_formatter(df, cols=['First']),
                   index=False,
                   justify='left'))
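
One caveat with format strings built this way: the `s` format code raises `ValueError` for non-string values such as integers, and if the computed maximum length comes back as a float (which can happen when the column contains missing values), the spec becomes something like `'{:<6.0s}'`, which truncates every string to zero characters. A possible workaround, sketched here under the assumption that converting every cell with `str()` is acceptable, computes the width in plain Python. `make_left_formatter` is a hypothetical helper, not part of pandas.

```python
def make_left_formatter(values):
    """Return a callable that left-pads str(value) to the widest entry."""
    # default=0 keeps an empty column from raising ValueError in max().
    width = max((len(str(v)) for v in values), default=0)

    def left_align(value):
        # Convert with str() first so numbers and None format without error.
        return f'{str(value):<{width}}'

    return left_align


values = ['abcdef', 'x', None, 42]
fmt = make_left_formatter(values)
formatted = [fmt(v) for v in values]
print(formatted)
```

With pandas, this could presumably be used as `formatters={'Text': make_left_formatter(a['Text'])}`, since iterating over a Series yields its values; that pairing is a suggestion rather than something verified against every pandas version.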

answered Oct 12 '22 15:10

jwalton
