Clean room implementation for PDF table detection; inspired by camelot.
Starts with the raw image file; get it from their corresponding go-pardocs + go-dundocs commands.
This is called via simple os.exec for every page; pass back OCR-ed text to sync with calling function.
Assumption: Only target pure strongly separated tables with line; e.g. detect line
Ideas
Use edge detection (e.g. Canny) + line detection (e.g Hough Lines); available via gocv to create the row/column slices