- 5
- 0
- 约1.36万字
- 约 10页
- 2017-08-03 发布于河南
- 举报
文字识别(Character recognition)
The computer accepts a digital image of the manuscript. The Chinese characters on the image may be printed Chinese characters or handwritten Chinese characters, and then the Chinese characters in the images can be identified. For printed characters, first the image document data into original black and white dot matrix by optical method, then converts the text in the image into text format through the recognition software, further processing to word processing software. Among them, character recognition is an important technology of OCR.
Two ways of 1.OCR recognition
As with other information data, graphic information in the computer scanner to capture all are 0, 1 of the two digital recording and recognition, all the information is only 0, 1 holds a string of points or samples. OCR recognition program identifies character information on the page, mainly through the unit pattern matching method and feature extraction method in two ways of character recognition.
Pattern (Matching) is a strict comparison of each character with a file with standard font and font size bitmap. If there is a large database of saved characters in the application, the application selects the appropriate characters for proper matching. Software must use some processing techniques to find the most similar matches, usually by experimenting with different versions of the same character. Some software can scan a page of text and identify each character that defines a new font. Some software uses their own identification technology to do their best to identify characters on the page, and then manually select or directly input the characters that are not recognized.
Extraction (Feature) is the decomposition of each character into many different character features, including diagonals, horizontal lines, and curves. These features are then matched with characters that are understood (recognized). For a simple example, the application recognizes two horizontal lines, and i
您可能关注的文档
- 授权加盟协议书(Authorization agreement).doc
- 数据、信息技术和信息系统(Data, information technology and information systems).doc
- 数据采集初学者(Data acquisition beginners).doc
- 数据车床加工精度(Machining accuracy of data lathe).doc
- 手掌蜕皮症(Palmar molt).doc
- 数据库 汇总(Database summarization).doc
- 数据库挂号(Database registration).doc
- 数据库的考题(Database questions).doc
- 数据库例题(Database example).doc
- 手诊讲座(Department of hand).doc
最近下载
- 个人简历表格填写2021简历模板.docx VIP
- 针灸推拿学习题库(附答案).docx VIP
- 毕业设计(论文)-五边形凸台零件铣削加工.doc VIP
- 2026届山东省淄博市高三上学期期末考试(摸底质量检测)历史试题(含答案).docx VIP
- 常见词组固定搭配.pdf VIP
- 2023年山东泰安中考地理试题及答案.pdf VIP
- 胎动管理专家共识最新2025.pptx
- (小学综合实践课标复习题全.doc VIP
- 0—3岁婴幼儿心理发展与教育 第四章 0-3岁婴幼儿心理发展与教育 课件PPT.pptx VIP
- 0—3岁婴幼儿心理发展与教育 第三章 0-3岁婴幼儿心理发展与教育 课件PPT.pptx VIP
原创力文档

文档评论(0)