
Robust Automatic Detection of a Document in an Image

This project presents an approach for robust detection of documents in images taken with cameras. The detection system combines variable image preprocessing, Harris corner detection, Canny edge detection, the Hough line transform and a steepest-ascent search to detect the page with increasing confidence over successive iterations. The approach is moderately successful on example images in which the document has relatively high contrast against its background and is the main subject of the image. It outperforms other contour-based approaches.
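The iterative search described above can be illustrated with a generic steepest-ascent loop. This is only a minimal sketch, not the project's code: the parameter names (`blur`, `canny_low`), the step sizes and the `toy_confidence` function are all hypothetical stand-ins. In the real system the score would come from running the detection pipeline and computing the confidence of the resulting bounding rectangle.

```python
from typing import Callable, Dict, Tuple

Params = Dict[str, int]


def steepest_ascent(
    score: Callable[[Params], float],
    start: Params,
    steps: Dict[str, int],
    max_iterations: int = 50,
) -> Tuple[Params, float]:
    """Move to the best-scoring neighbour until no neighbour improves."""
    current = dict(start)
    current_score = score(current)
    for _ in range(max_iterations):
        best, best_score = current, current_score
        # Try one step down and one step up along every parameter axis.
        for name, step in steps.items():
            for delta in (-step, step):
                candidate = dict(current)
                candidate[name] += delta
                candidate_score = score(candidate)
                if candidate_score > best_score:
                    best, best_score = candidate, candidate_score
        if best is current:
            break  # local maximum: no neighbour scores higher
        current, current_score = best, best_score
    return current, current_score


# Hypothetical stand-in for "run the pipeline and return a confidence".
# It peaks at blur=7, canny_low=50 purely for illustration.
def toy_confidence(p: Params) -> float:
    return -((p["blur"] - 7) ** 2) - ((p["canny_low"] - 50) ** 2) / 100.0


best_params, best_confidence = steepest_ascent(
    toy_confidence,
    start={"blur": 3, "canny_low": 80},
    steps={"blur": 2, "canny_low": 10},
)
print(best_params, best_confidence)
```

Steepest ascent only finds a local maximum, so the result depends on the starting parameters and the step sizes.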

A full report on the process and results is available in the GitHub repository.

Process Summary

1. Smoothing
2. Closing
3. Corner detection (Harris)
4. Edge detection (Canny)
5. Hough transform
6. Hough line filtering
7. Combine edge and corner data
8. Detect possible page bounding rectangles
9. Compute bounding rectangle confidence
10. Optimise parameters to improve confidence and repeat from step 1
11. Select highest confidence bounding rectangle
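As an illustration of how filtered Hough lines could yield a candidate page rectangle, the sketch below intersects lines given in the (rho, theta) normal form, where x·cos(theta) + y·sin(theta) = rho. This is the form returned by OpenCV's `cv2.HoughLines`. Whether the project builds its rectangles this way is an assumption, and the helper names `intersect` and `rectangle_corners` are hypothetical.

```python
import math
from typing import List, Optional, Tuple

Line = Tuple[float, float]   # (rho, theta): x*cos(theta) + y*sin(theta) = rho
Point = Tuple[float, float]


def intersect(a: Line, b: Line) -> Optional[Point]:
    """Return the intersection of two lines, or None if they are parallel."""
    rho1, theta1 = a
    rho2, theta2 = b
    c1, s1 = math.cos(theta1), math.sin(theta1)
    c2, s2 = math.cos(theta2), math.sin(theta2)
    det = c1 * s2 - s1 * c2  # equals sin(theta2 - theta1)
    if abs(det) < 1e-9:
        return None
    # Cramer's rule on the 2x2 system formed by the two line equations.
    x = (rho1 * s2 - rho2 * s1) / det
    y = (rho2 * c1 - rho1 * c2) / det
    return x, y


def rectangle_corners(
    left: Line, right: Line, top: Line, bottom: Line
) -> Optional[List[Point]]:
    """Corners in the order top-left, top-right, bottom-right, bottom-left."""
    corners = [
        intersect(left, top),
        intersect(right, top),
        intersect(right, bottom),
        intersect(left, bottom),
    ]
    if any(corner is None for corner in corners):
        return None
    return corners


# Two vertical lines (x = 10, x = 110) and two horizontal lines (y = 20, y = 220).
corners = rectangle_corners(
    left=(10.0, 0.0),
    right=(110.0, 0.0),
    top=(20.0, math.pi / 2),
    bottom=(220.0, math.pi / 2),
)
print(corners)
```

The resulting corner points could then be compared against the Harris corner responses when scoring a candidate rectangle.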

Experimental results

Under reasonable conditions, the proposed process performs very well, but it can struggle in low-contrast situations.

The following image shows the process at each stage:

For more detailed experimental results and analysis, see the project report in the GitHub repository.

GitHub Repository

https://github.com/ChrisSkorka/Perceptual-Computing-Project

Robust Automatic Detection of a Document in an Image - Report
