Versatile Document Image Content Extraction Lehigh.ppt

  • About 23 pages
  • Published 2017-03-09 (Shanghai)

Versatile Document Image Content Extraction
Henry S. Baird, Michael A. Moll, Jean Nonnemaker, Matthew R. Casey, Don L. Delorenzo

Document Image Content Extraction Problem
  • Given an image of a document
  • Find regions containing handwriting, machine-print text, graphics, line-art, logos, photographs, noise, etc.

Difficulties
  • Vast diversity of document types
  • Arduous data collection
  • How big is a representative training set?
  • Expense of preparing correctly labeled “ground-truthed” samples
  • Lack of consensus on how to evaluate performance

Our Research Goals
  • Versatility First
      • Beware “brittle” or narrow approaches
      • Develop methods that work across the broadest possible spectrum of document and image types
  • Voracious Classifiers
      • Belief that the accuracy of a classifier has more to do with training data than other considerations
      • Want to train on extremely large (and representative) data sets (in reasonable amounts of time)
  • Extremely High Speed Classification
      • Ideally, perform nearly at I/O rates (as fast as images can be read)
      • Too ambitious?

Related Strategies (for the future)
  • Amplification
      • Real ground-truthed training samples are hard to find, expensive to generate, and difficult to ensure coverage
      • Want to use real samples as ‘seeds’ for massive synthetic generation of pseudo-randomly perturbed samples for use in supplementary training
  • Confidence Before Accuracy
      • Confidence is maybe more important than accuracy, since even modest accuracy (across all cases) can be useful
  • Near-Infinite Space
      • Design for best performance in the near future, when main memory will be orders of magnitude larger and faster
  • Data-Driven Design
      • Avoid arbitrary engineering decisions such as choice of features, instead allowing training data to determine this

Document Images
  • Range of document and image types
  • Color, grey-level, black and white
  • Any size or resolution
  • Lots of file formats (TIFF, JPEG, PNG, etc.)
  • Pre-processing step of converting images into three channel color PNG file in HSL (Hue, Saturation, Luminance) co
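
The slides describe the pre-processing step only at a high level, and the text is cut off before any details. As a rough sketch of the color-space conversion alone, the following Python uses the standard-library `colorsys` module. It assumes 8-bit RGB input and rescales each of the H, S, and L channels to 0..255; neither assumption comes from the slides. The function name `to_hsl_channels` is hypothetical, and reading or writing the actual TIFF, JPEG, or PNG files would need an image library that is not shown here.

```python
import colorsys


def to_hsl_channels(rgb_rows):
    """Convert rows of 8-bit (R, G, B) pixels to rows of 8-bit (H, S, L) pixels.

    Assumption: every channel is rescaled to 0..255 so that the result can be
    stored as an ordinary three-channel image.
    """
    hsl_rows = []
    for row in rgb_rows:
        hsl_row = []
        for r, g, b in row:
            # colorsys works on floats in [0, 1] and returns (h, l, s),
            # so the last two values are swapped to get H, S, L order.
            h, l, s = colorsys.rgb_to_hls(r / 255.0, g / 255.0, b / 255.0)
            hsl_row.append((round(h * 255), round(s * 255), round(l * 255)))
        hsl_rows.append(hsl_row)
    return hsl_rows


if __name__ == "__main__":
    # A 1x3 test image: pure red, white, black.
    image = [[(255, 0, 0), (255, 255, 255), (0, 0, 0)]]
    print(to_hsl_channels(image))
```

Grey-level and black-and-white inputs could be handled by the same function by first replicating the single intensity value into all three RGB channels, which yields zero saturation and leaves only the luminance channel informative.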
