如何使用 opencv 和 pytesseract python 提取文本？

Question

我正在使用 labelImg 在图像的行上绘制一个矩形。这给了我 xml 文件。借助此 xml 如何从图像 table 中提取文本。为了提取文本，我使用了水平和垂直 ine 检测，但没有得到很好的结果。现在我正在使用 labelImg，它给我想要提取的文本要点，但我不知道如何为此应用该方法。请告诉我该怎么做？

我的 xml 文件：

    <annotation>
      <folder>Test Images</folder>
      <filename>FreKa.jpg</filename>
      <path>/home/sumit/Desktop/office_works/Fusion_Code/BIS_Final/Test Images/FreKa.jpg</path>
      <source>
         <database>Unknown</database>
        </source>
      <size>
         <width>679</width>
         <height>341</height>
         <depth>3</depth>
         </size>
         <segmented>0</segmented>
       <object>
         <name>Contact Type</name>
         <pose>Unspecified</pose>
         <truncated>1</truncated>
         <difficult>0</difficult>
         <bndbox>
           <xmin>1</xmin>
           <ymin>100</ymin>
           <xmax>678</xmax>
           <ymax>157</ymax>
        </bndbox>
       </object>
       </annotation>

我输入的图片：

如何借助 xml 文件从 table 中提取合同类型？谢谢...

Answer 1

要获得 xmin，您可以使用 xpath() 和 '//annotation/object/bndbox/xmin' 或更短的 '//xmin'

它总是给出列表（即使只有一个元素或没有元素）所以它需要 [0] 来获取第一个元素或 for 循环来处理所有元素。

使用 if list_of_elelemts: ... 你可以运行只有当列表有一些元素时才可以编码。

您也可以使用len()来检查您得到了多少个元素。

text = '''
<annotation>
  <folder>Test Images</folder>
  <filename>FreKa.jpg</filename>
  <path>/home/sumit/Desktop/office_works/Fusion_Code/BIS_Final/Test Images/FreKa.jpg</path>
  <source>
     <database>Unknown</database>
  </source>
  <size>
     <width>679</width>
     <height>341</height>
     <depth>3</depth>
  </size>
  <segmented>0</segmented>
  <object>
     <name>Contact Type</name>
     <pose>Unspecified</pose>
     <truncated>1</truncated>
     <difficult>0</difficult>
     <bndbox>
       <xmin>1</xmin>
       <ymin>100</ymin>
       <xmax>678</xmax>
       <ymax>157</ymax>
     </bndbox>
  </object>
</annotation>
'''

import lxml.etree

tree = lxml.etree.fromstring(text)

print('xmin:', tree.xpath("//annotation/object/bndbox/xmin")[0].text)
print('xmin:', tree.xpath("//bndbox/xmin")[0].text)
print('xmin:', tree.xpath("//object//xmin")[0].text)
print('xmin:', tree.xpath("//xmin")[0].text)

print('xmin:', tree.xpath("//xmin/text()")[0])  # with `text()` instead of `.text`

for item in tree.xpath("//xmin/text()"):
    print('xmin:', item)  # with `text()` instead of `.text`

objects = tree.xpath("//object")
print('len(objects):', len(objects))

other = tree.xpath("//bndbox/other")
if other:
    print('found', len(other), 'elements')
else:
    print('there is no "other" elements')

如何使用 opencv 和 pytesseract python 提取文本？

how to extract text using opencv and pytesseract python?

python

opencv

python-tesseract

image-preprocessing

labelimg