"Repel" annotations in matplotlib?

Tags:

I recently saw this package for R/ggplot2, which lets one to have multiple annotations on a plot and automatically adjust their position to minimize overlap, and this way improve the readability. Is there anything similar available for python/matplotlib?

EDIT: I've found Matplotlib overlapping annotations / text and it looks promising, but seems like the result is inferior to the R package.

Example:

from matplotlib import pyplot as plt
import numpy as np
xs = np.arange(10, step=0.1)+np.random.random(100)*3
ys = np.arange(10, step=0.1)+np.random.random(100)*3
labels = np.arange(100)
plt.scatter(xs, ys)
for x, y, s in zip(xs, ys, labels):
    plt.text(x, y, s)
plt.show()

enter image description here

You can see that even such short labels create a crazy mess when the data density is high.

350

asked Jan 09 '16 13:01

Phlya

2 Answers

[12-11-2016 updated the code and second figure again since the library has been significantly improved since then]

ANSWER COMPLETELY REWRITTEN

I've made a small library for this purpose, which works similarly to above mentioned ggrepel: https://github.com/Phlya/adjustText

With switched off repelling from points it produces something decent even for this difficult example:

from matplotlib import pyplot as plt
from adjustText import adjust_text
import numpy as np

np.random.seed(2016)
xs = np.arange(10, step=0.1) + np.random.random(100) * 3
ys = np.arange(10, step=0.1) + np.random.random(100) * 3
labels = np.arange(100)

f = plt.figure()
scatter = plt.scatter(xs, ys, s=15, c='r', edgecolors='w')
texts = []
for x, y, s in zip(xs, ys, labels):
    texts.append(plt.text(x, y, s))

plt.show()

enter image description here

adjust_text(texts, force_points=0.2, force_text=0.2,
            expand_points=(1, 1), expand_text=(1, 1),
            arrowprops=dict(arrowstyle="-", color='black', lw=0.5))
plt.show()

enter image description here

186

answered Sep 18 '22 08:09

Phlya

Building on tcaswell's answer, you could repel labels using networkx's spring_layout which implements the Fruchterman Reingold force-directed layout algorithm:

import matplotlib.pyplot as plt
import numpy as np
import networkx as nx
np.random.seed(2016)
xs = np.arange(10, step=0.1)+np.random.random(100)*3
ys = np.arange(10, step=0.1)+np.random.random(100)*3
labels = np.arange(100)

def repel_labels(ax, x, y, labels, k=0.01):
    G = nx.DiGraph()
    data_nodes = []
    init_pos = {}
    for xi, yi, label in zip(x, y, labels):
        data_str = 'data_{0}'.format(label)
        G.add_node(data_str)
        G.add_node(label)
        G.add_edge(label, data_str)
        data_nodes.append(data_str)
        init_pos[data_str] = (xi, yi)
        init_pos[label] = (xi, yi)

    pos = nx.spring_layout(G, pos=init_pos, fixed=data_nodes, k=k)

    # undo spring_layout's rescaling
    pos_after = np.vstack([pos[d] for d in data_nodes])
    pos_before = np.vstack([init_pos[d] for d in data_nodes])
    scale, shift_x = np.polyfit(pos_after[:,0], pos_before[:,0], 1)
    scale, shift_y = np.polyfit(pos_after[:,1], pos_before[:,1], 1)
    shift = np.array([shift_x, shift_y])
    for key, val in pos.iteritems():
        pos[key] = (val*scale) + shift

    for label, data_str in G.edges():
        ax.annotate(label,
                    xy=pos[data_str], xycoords='data',
                    xytext=pos[label], textcoords='data',
                    arrowprops=dict(arrowstyle="->",
                                    shrinkA=0, shrinkB=0,
                                    connectionstyle="arc3", 
                                    color='red'), )
    # expand limits
    all_pos = np.vstack(pos.values())
    x_span, y_span = np.ptp(all_pos, axis=0)
    mins = np.min(all_pos-x_span*0.15, 0)
    maxs = np.max(all_pos+y_span*0.15, 0)
    ax.set_xlim([mins[0], maxs[0]])
    ax.set_ylim([mins[1], maxs[1]])


fig, ax = plt.subplots()
ax.plot(xs, ys, 'o')
repel_labels(ax, xs, ys, labels, k=0.0025)
plt.show()

yields

enter image description here

answered Sep 18 '22 08:09

unutbu

Related questions
                            
                                Accessing elements in the shadow DOM
                            
                                Adjust padding inside matplotlib annotation box
                            
                                Creating a log-linear plot in matplotlib using hist2d
                            
                                How do I save results of a "for" loop into a single variable?
                            
                                How do I make this character counter be insensitive to case?
                            
                                django admin error - Unknown column 'django_content_type.name' in 'field list'
                            
                                Get the last modified date of a directory (including subdirectories) using Python?
                            
                                change variable in Pycharm debugger
                            
                                How to get the percentage of memory usage of a process?
                            
                                Copying a file to an existing directory results in IOError [Error 21] is a directory
                            
                                python - subprocess.Popen().pid return the pid of the parent script
                            
                                TypeError constructor returned NULL while importing pyplot in ssh
                            
                                What is the difference between u' ' prefix and unicode() in python?
                            
                                Element-wise XOR in pandas
                            
                                Pool within a Class in Python
                            
                                How to get rid of cursor id error in mongodb?
                            
                                ImportError: No module named 'appdirs'
                            
                                Django / postgres setup for database creation, for running tests
                            
                                How to use a dictionary to translate/replace elements of an array? [duplicate]
                            
                                windows pip installing libraries in wrong directory

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

"Repel" annotations in matplotlib?

Tags:

python

matplotlib

plot

Phlya

People also ask

2 Answers

Phlya

unutbu

Recent Activity

Donate For Us