Sprite Sheet Detect Individual Sprite Bounds Automatically

Tags:

Given a sprite sheet like this:

Sprite Sheet Example

I would like to write an algorithm that can loop through the pixel data and determine the bounding rectangle of each discreet sprite.

If we assume that for each pixel X, Y that I can pull either true (pixel is not totally transparent) or false (pixel is totally transparent), how would I go about automatically generating the bounding rectangles for each sprite?

The resulting data should be an array of rectangle objects with {x, y, width, height}.

Here's the same image but with the bounds of the first four sprites marked in light blue:

Sprite Sheet With Bounds

Can anyone give a step-by-step on how to detect these bounds as described above?

407

asked Nov 27 '12 12:11

Rob Evans

2 Answers

Here's an approach

Convert image to grayscale
Otsu's threshold to obtain binary image
Perform morphological transformations to smooth image
Find contours
Iterate through contours to draw bounding rectangle and extract ROI

After converting to grayscale, we Otsu's threshold to obtain a binary image

enter image description here

Next we perform morphological transformations to merge each sprite into a single contour

enter image description here

From here we find contours, iterate through each contour, draw the bounding rectangle, and extract each ROI. Here's the result

enter image description here

and here's each saved sprite ROI

enter image description here

I've implemented this method using OpenCV and Python but you can adapt the strategy to any language

import cv2

image = cv2.imread('1.jpg')
original = image.copy()
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
thresh = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (3,3))
close = cv2.morphologyEx(thresh, cv2.MORPH_CLOSE, kernel, iterations=2)
dilate = cv2.dilate(close, kernel, iterations=1)

cnts = cv2.findContours(dilate, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
cnts = cnts[0] if len(cnts) == 2 else cnts[1]

sprite_number = 0
for c in cnts:
    x,y,w,h = cv2.boundingRect(c)
    ROI = image[y:y+h, x:x+w]
    cv2.imwrite('sprite_{}.png'.format(sprite_number), ROI)
    cv2.rectangle(image, (x, y), (x + w, y + h), (36,255,12), 2)
    sprite_number += 1

cv2.imshow('thresh', thresh)
cv2.imshow('dilate', dilate)
cv2.imshow('image', image)
cv2.waitKey()

151

answered Nov 15 '22 08:11

nathancy

How about this? The only downside is that you'll need a writable version of your image to mark visited pixels in, or the floodfill will never terminate.

Process each* scan line in turn
  For each scanline, walk from left to right, until you find a non-transparent pixel P.
    If the location of P is already inside a known bounded box
      Continue to the right of the bounded box
    Else
      BBox = ExploreBoundedBox(P)
      Add BBox to the collection of known bounded boxes

Function ExploreBoundedBox(pStart)
  Q = new Queue(pStart)
  B = new BoundingBox(pStart)

  While Q is not empty
    Dequeue the front element as P
    Expand B to include P

    For each of the four neighbouring pixels N
      If N is not transparent and N is not marked
        Mark N
        Enqueue N at the back of Q

  return B

You don't need to process every scanline, you could do every 10th, or every 30th scanline. As long as it doesn't exceed the minimum sprite height.

answered Nov 15 '22 07:11

Leon Bouquiet

Related questions
                            
                                Distance between two convex polygons in 3D
                            
                                What's the best way to inherit properties in a tree-based structure?
                            
                                Using the Levenshtein distance in a spell checker
                            
                                Trilateration of a signal using Time Difference of Arrival
                            
                                Binary Search Tree Traversal - PreOrder
                            
                                parsing words in a continuous string
                            
                                How Can I Speed Up This Anagram Algorithm
                            
                                Which algorithm to use to get longest string matching in two big arrays?
                            
                                Closest Point on a Map
                            
                                Sum of 3 variables: strange behavior [duplicate]
                            
                                Improve a word search game worst case
                            
                                String distance, transpositions only [duplicate]
                            
                                Algorithm to find intersections between polylines
                            
                                How do I efficiently determine the longest individual character palindrome in a given string?
                            
                                BST from two unsorted array
                            
                                Solving String reduction Algorithm
                            
                                How we can perform Action on Sequence UIButtons?
                            
                                How does the Twitter timeline algorithm work?
                            
                                How can I figure out who can give gifts to whom on Christmas?
                            
                                Complexity of Prims Algorithm using Priority Queue?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Sprite Sheet Detect Individual Sprite Bounds Automatically

Tags:

algorithm

image

image-processing

computer-vision

sprite

Rob Evans

People also ask

2 Answers

nathancy

Leon Bouquiet

Recent Activity

Donate For Us