Submitted by hundley10 t3_10dqwqb in MachineLearning
What model structure would be recommended for detecting the coordinates of all 4 corners of a rectangle (e.g. index cards)? Most object detection models like YOLO produce rectangular bounding boxes; what tweaks can be made to trace the object regardless of orientation?
For my specific problem, classical edge/corner detectors aren't a good fit - so I'm falling back on ML. Currently have a dataset of about 1500 domain-specific labeled images; hoping to train a model on TF. Thanks for the suggestions!
Edit: here are a few examples from my dataset. The green dots aren't part of the images; they just show how the corners are annotated:
JiraSuxx2 t1_j4muqql wrote
I’m not a 100% sure how yolo works but I think images are cut into grids and then detection is done per grid square. The results are processed, the bounding boxes are computed the old fashioned way from the predictions. That’s also how they get multiple predictions per image I think.
In your case, even if you detect corners how do you know they belong to the same card?