Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What would be the most efficient way to detect all closed paths in a collection of segments and connectors?

We have a data set which is comprised of Connectors and Segments. Each segment has exactly two connectors, but each connector can belong to zero or more segments (i.e. connector 'A' in the left image below has no segments, while connector 'M' has three, M-R, M-L and M-N.)

It is understood that wherever any lines meet or intersect, there will be a connector so we don't have to worry about even/odd rules, overlapping or partially-enclosed polygons, etc. as they don't apply.

In short, we're trying to identify all of the created polygons (the colored shapes in the right image.) I believe this can be completed in two steps.

Polygons

Part 1: Removing superfluous items

Stand-alone connectors (connector 'A' here) can simply be removed since they can't be part of a shape's outline.

Floating end-points referencing a single segment (connectors 'B' and 'E') can also be removed as they too can't be part of a shape's outline. This will also remove their referenced segments (B-C and E-D).

Performing the above recursively will next identify 'C' as an endpoint (since 'B' and B-C were already removed) so it and it's remaining segment C-D can also be removed. On the next recursive pass, connector 'D' and segment D-F will also be removed, etc.

However, I haven't found a good way to identify segment H-I. That said, I think that can be achieved during polygon detection since such segments would only be the result of compound paths and would be traced in both directions during one shape's detection. (More on that below.)

Step 2: Polygon Detection

Each segment can be traced in two directions. For instance, the segment connecting 'O' and 'P' can be either O-P or P-O. Picking a trace-direction of clockwise, O-P would belong to the polygon O-P-Q-N whereas P-O would belong to the polygon P-O-I.

The following logic assumes a trace-direction of clockwise.

Starting from any segment, when tracing around, if you get back to your starting point, you have identified a potential polygon. By keeping a running delta of your heading's angle as you trace around (this is how much your heading turns and is not to be confused with simply adding the angles between segments), when done, if that angle is positive, you've detected a valid polygon. If it's negative, you've detected a 'containing' polygon, meaning one that contains one or more 'valid' polygons. The outer perimeter of the entire shape (or shapes) are all containing polygons.

Consider the case of a square, diagonally divided into two triangles. Tracing each segment twice--once in each direction--you will end up with three potentially-valid polygons: a square and two triangles. The triangles will have a positive angle delta telling you they're valid, but the square's angle delta will be negative telling you that's the containing polygon.

Note: A containing polygon can be equal to a valid polygon too. It will just be 'wound' in the opposite direction.

Consider a simple triangle. The clockwise trace will yield the valid polygon. The second attempt to trace clockwise will actually yield a counter-clockwise trace which will give you a negative angle delta, telling you that's actually the outline of the shape.

Note: You also have to test for other polygons encountered along the way by also testing each point for any previously-encountered point during that shape detection. If you find you've revisited the same point, save off the polygon created since the first encounter of that point, check it's angle. If it's positive, it's a valid polygon (and you're actually currently tracing a containing polygon.) If it's negative, you've detected a containing polygon (in which case you're currently tracing a valid polygon.) Finally, remove all segments on your accumulation stack back to the first instance that point was last encountered and continue on with your detection.

For instance, if you started at 'J' and traced around counter-clockwise, you would go through 'I', 'H', then 'G', then 'F' then you'd be back at 'H'. You just found a polygon H-G-F which has a negative angle so you know it's a containing polygon. Remove those three segments from your stack and continue on. Now you'll again hit 'I'. In this case, you already visited that same segment during this pass, but in the other direction, so simply remove that segment completely from your stack and continue on, next to 'O' then 'N', etc. You'll eventually be back at 'J'.

When a segment has been traced in both directions, it can be considered 'used' and no further processing of that segment is needed. Continue processing all non-used segments. Once all segments have been traced in both directions, you can be sure all polygons--valid and containing--have been found.

Finally, check each containing polygon to see if it falls within any valid polygon. If so, exclude it from that valid polygon creating a compound path. In the example here, containing polygon H-G-F is contained by the valid cyan polygon so it should be excluded. Note there is also a valid H-F-G polygon which is marked in red here.

Anyway, that's what I've come up with, but I'm wondering if there's a better/simpler way. Thoughts?

like image 824
Mark A. Donohoe Avatar asked Oct 25 '15 21:10

Mark A. Donohoe


1 Answers

Hint:

Your problem has a geometrical aspect (not pure connectivity) because the faces may not overlap and are known to be simple. I would recommend a sweepline approach.

First cleanup to discard all floating endpoints.

Then consider an horizontal line that moves from top to bottom, vertex by vertex. On every position of the seewpline, it includes or intersects a number of segments. Sorting all vertices/intersections from left to right, you get non-overlapping line segments.

The trick is to track the endpoints as the sweepline progresses in order to find the left and right boundaries of the regions.

In the given example, you will successively consider the points

R  K        J
RM KL G     JI
 M  L GF GH JI
 MN    F GH JI
 MN       H JI
  N  O       I
  NQ   P
   Q

(pairs denote intersections).

From this, you should be able to reconstruct the left/right outlines from connectivity considerations

R M | K L
K L M N | G F H | G H | J I (and embedded G F H | G H)
N Q | O P Q
O P | I P

enter image description here

Here is the graph that you obtain by linking the endpoints and intersections of existing edges from scanline to scanline.

enter image description here

And after cleanup, removing the intermediate vertices:

enter image description here

like image 145
Yves Daoust Avatar answered Oct 16 '22 23:10

Yves Daoust