How to use Azure custom vision service response boundingBox to plot shape

Tags:

I am using Azure cognitive-service custom vision service to detect shapes from capture images. As per their documentation, I got the response as per their format.

But I am an facing issue to plot the shape above the image.

{
    "id": "0fbda4ee-8956-4979-bf57-a252441af98d",
    "project": "9ca4032b-beeb-40ad-9396-1c3fcfd9ba89",
    "iteration": "27c85265-a158-4fc4-b22a-d535dd758d80",
    "created": "2018-06-11T09:34:29.9496528Z",
    "predictions": [
        {
            "probability": 0.0102891214,
            "tagId": "677afcf8-bc4a-493f-b588-707663286125",
            "tagName": "ball",
            "boundingBox": {
                "left": 0.2889924,
                "top": 0.0169312358,
                "width": 0.7007024,
                "height": 0.8284572
            }
        },
        {
            "probability": 0.012788726,
            "tagId": "ca844f08-b6c0-4d9a-9010-73945d442708",
            "tagName": "cricket ball",
            "boundingBox": {
                "left": 0.304018974,
                "top": 0.413163722,
                "width": 0.299461246,
                "height": 0.436399817
            }
        },
        {
            "probability": 0.0229086485,
            "tagId": "ca844f08-b6c0-4d9a-9010-73945d442708",
            "tagName": "cricket ball",
            "boundingBox": {
                "left": 0.2889924,
                "top": 0.0169312358,
                "width": 0.7007024,
                "height": 0.8284572
            }
        },
        {
            "probability": 0.0100123268,
            "tagId": "4672144d-5593-446f-be63-5144a35d0e6e",
            "tagName": "pipe",
            "boundingBox": {
                "left": 0.711509764,
                "top": 0.377838552,
                "width": 0.07217276,
                "height": 0.113578767
            }
        },
        {
            "probability": 0.0167990718,
            "tagId": "4672144d-5593-446f-be63-5144a35d0e6e",
            "tagName": "pipe",
            "boundingBox": {
                "left": 0.9821227,
                "top": 0.9500536,
                "width": 0.0115685463,
                "height": 0.033854425
            }
        },
        {
            "probability": 0.923659563,
            "tagId": "4672144d-5593-446f-be63-5144a35d0e6e",
            "tagName": "pipe",
            "boundingBox": {
                "left": 0.288039029,
                "top": 0.411838,
                "width": 0.291451037,
                "height": 0.4237842
            }
        }
    ]
}

Above is the response I got in that Custom vision API call. But the issue is with boundingBox. It is having values always in a fraction, starting from 0. Now if I want to use that and want to draw a square, then it is not possible as because I don't know exact logic behind drawing square/rectangles from that values.

How can I use those values and draw a rectangles/square using it?

565

asked Jun 11 '18 09:06

Shreyash Mahajan

1 Answers

Reply / TL;DR

These boundingBox values are in percent of the image original size, so you can draw the rectangle by multiplying the values by the image width (for left and width values) or by the image height (for top and height values).

Keep in mind that the position is expressed from the top left corner, so position 0,0 is this corner.

Details with a sample

I got a small custom vision detecting cola bottles.

Original image is the following one:

I used the Custom Vision portal to make a prediction and got the following result - let's focus on this highlighted result with a 87,5% score:

portal result

Using the API (available here), I also made the Predict operation and got (among other details) this prediction:

{
    "probability": 0.875464261,
    "tagId": "1932c95f-ed4a-4675-bde4-c2457e1389e6",
    "tagName": "CocaLight",
    "boundingBox": {
      "left": 0.453497916,
      "top": 0.0,
      "width": 0.2523211,
      "height": 0.8738168
    }
}

Considering that my image dimension is 800 x 652 (so ImageWidth 800, ImageHeight 652):

Rectangle draw

Top left point position?

x (vertical distance from the left border) = left value from API x ImageWidth => 0.453497916 x 800 = 362
y (horizontal distance from the top border) = top value from the API x ImageHeight => 0.0 x 652 = 0

So my rectangle starting position is (362,0).

Size?

Rectangle width = width from the API x ImageWidth => 201
Rectangle height = height from the API x ImageHeight => 569

Let's draw it!

Draw API Result

Looks right!

179

answered Sep 19 '22 17:09

Nicolas R

Related questions
                            
                                Eye tracking: finding the pupil (x,y)
                            
                                Relation between horizontal, vertical and diagonal Field-of-View
                            
                                Is One-Hot Encoding required for using PyTorch's Cross Entropy Loss Function?
                            
                                Triangulation & Direct linear transform
                            
                                Principal Component Analysis in MATLAB
                            
                                How to apply the camera pose transformation computed using EPnP to the VTK camera?
                            
                                How to detect ellipses in image without using fitEllipse() in opencv?
                            
                                How to obtain the right alpha value to perfectly blend two images?
                            
                                Reconstruct scene like Photosynth
                            
                                The best way to calculate the best threshold with P. Viola, M. Jones Framework
                            
                                Real-time template matching - OpenCV, C++
                            
                                Advanced square detection (with connected region)
                            
                                In the Circle Hough Transform, what is the Inverse Ratio of Accumulator Resolution (dp) and how does it affect circle detection?
                            
                                Calculating aspect ratio of Perspective Transform destination image
                            
                                Opencv getGaborKernel parameters for filter bank
                            
                                Plane fitting in a 3d point cloud
                            
                                Opencv: AttributeError: module 'cv2' has no attribute 'dnn'
                            
                                Copying Region of an Image to another Region in another Image
                            
                                Remove background text and noise from an image using image processing with OpenCV
                            
                                Training Tensorflow Inception-v3 Imagenet on modest hardware setup

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to use Azure custom vision service response boundingBox to plot shape

Tags:

computer-vision

microsoft-cognitive

azure-cognitive-services