博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
atitit。ocr框架类库大全 attilax总结
阅读量:4329 次
发布时间:2019-06-06

本文共 1594 字,大约阅读时间需要 5 分钟。

 

 

atititocr框架类库大全 attilax总结

 

Tesseract

 

 

闲来无事,发现百度有一个OCR文字识别接口,感觉挺有意思的,拿来研究一下。       

百度服务简介:文字识别是百度自然场景OCR服务,依托百度业界领先的OCR,提供了整图文字检测、识别、整图文字识别、整图文字行定位和单字图像识别等功能。

不多说啦,直接看demo吧!

 

 

java4less

The J4L OCR tools is set of components that can be used to include OCR capabilities in Java applications. That means you can receive faxes, PDF files or scan documents and extract business information from the images. The main 3 components are:

a Java wrapper for the  engine. The OCR engine Tesseract itself is delivered under the Apache 2.0 license and we support a version compiled for windows only.

a PDF to text converter. 

a text document parser.

The document recognition process can therefore be divided in 2 steps:

The component takes an image file (tif, png, jpg....) or a PDF file and returns the text contained in it. The Java wrapper will perform this operation by using Tesseract. Alternatively you can use any other OCR engine. If you are however using a PDF file, you will use our PDF to Text converter.

In the second step, your Java application needs to understand the text returned by the OCR engine or PDF converter. This is done by the document parser. The document parser uses as input as text string (the data) and a xml file that describes the structure of the document and the ouput is a business document either as a Java object or as a XML file

 

 

 

JAVA实现百度OCR文字识别功能 - 张荣珍的专栏 - 博客频道 - CSDN.NET.html

作者:: 绰号:老哇的爪子 ( 全名::Attilax Akbar Al Rapanui 阿提拉克斯 阿克巴 阿尔 拉帕努伊 ) 

汉字名:艾提拉(艾龙)   EMAIL:1466519819@qq.com

转载请注明来源: http://www.cnblogs.com/attilax/

Atiend

 

 

转载于:https://www.cnblogs.com/attilax/p/6021585.html

你可能感兴趣的文章
安装php扩展
查看>>
百度移动搜索主要有如下几类结果构成
查看>>
Python爬虫面试题170道:2019版【1】
查看>>
JavaBean规范
查看>>
第四阶段 15_Linux tomcat安装与配置
查看>>
NAS 创建大文件
查看>>
学习笔记-模块之xml文件处理
查看>>
接口测试用例
查看>>
面试:用 Java 实现一个 Singleton 模式
查看>>
Sybase IQ导出文件的几种方式
查看>>
案例:手动输入一个字符串,打散放进一个列表,小写字母反序 大写字母保持不变...
查看>>
linux 系统下 tar 的压缩与解压缩命令
查看>>
阿里负载均衡,配置中间证书问题(在starcom申请免费DV ssl)
查看>>
转:How to force a wordbreaker to be used in Sharepoint Search
查看>>
MySQL存储过程定时任务
查看>>
Python中and(逻辑与)计算法则
查看>>
POJ 3267 The Cow Lexicon(动态规划)
查看>>
设计原理+设计模式
查看>>
音视频处理
查看>>
tomcat 7服务器跨域问题解决
查看>>