在 OpenCV 中查找带有文本行的轮廓

Finding contours with lines of text in OpenCV

我正在编写一个文本识别程序，我在排序轮廓时遇到了问题。该程序对一行文本运行良好，但是当涉及到整个文本块时，我的程序在 80% 的情况下都无法检测到文本行。提取一行文本然后提取所有其他行（一次一个）的真正有效方法是什么？

我想达到的目标：

实现此目的有一系列步骤：

找到对图像进行二值化的最佳阈值。我用的是大津门槛。
找到合适的形态学操作，沿着水平方向形成一个单一的区域。选择宽度大于高度的内核。
在生成的轮廓上绘制边界框

更新

实现如下：

x = 'C:/Users/Desktop/text.jpg' 

img = cv2.imread(x)
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)  

#--- performing Otsu threshold ---
ret,thresh1 = cv2.threshold(gray, 0, 255,cv2.THRESH_OTSU|cv2.THRESH_BINARY_INV)
cv2.imshow('thresh1', thresh1)

#--- choosing the right kernel
#--- kernel size of 3 rows (to join dots above letters 'i' and 'j')
#--- and 10 columns to join neighboring letters in words and neighboring words
rect_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (15, 3))
dilation = cv2.dilate(thresh1, rect_kernel, iterations = 1)
cv2.imshow('dilation', dilation)

#---Finding contours ---
_, contours, hierarchy = cv2.findContours(dilation, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)

im2 = img.copy()
for cnt in contours:
        x, y, w, h = cv2.boundingRect(cnt)
        cv2.rectangle(im2, (x, y), (x + w, y + h), (0, 255, 0), 2)
cv2.imshow('final', im2)