This book constitutes the refereed proceedings of the 14th IAPR International Workshop on Document Analysis Systems, DAS 2020, held in Wuhan, China, in July 2020. The 40 full papers presented in this book were carefully reviewed and selected from 57 submissions. The papers are grouped in the following topical sections: character and text recognition; document image processing; segmentation and layout analysis; word embedding and spotting; text detection; and font design and classification. Due to the Corona pandemic the conference was held as a virtual event .
The objective of Document Analysis and Recognition (DAR) is to recognize the text and graphical components of a document and to extract information. This book is a collection of research papers and state-of-the-art reviews by leading researchers all over the world. It includes pointers to challenges and opportunities for future research directions. The main goal of the book is to identify good practices for the use of learning strategies in DAR.
This book constitutes the refereed proceedings of the 7th International Conference on Document Analysis Systems, DAS 2006, held in Nelson, New Zealand, in February 2006. The 33 revised full papers and 22 poster papers presented were carefully reviewed and selected from 78 submissions. The papers are organized in topical sections on digital libraries, image processing, handwriting, document structure and format, tables, language and script identification, systems and performance evaluation, and retrieval and segmentation.
Optical character recognition and document image analysis have become very important areas with a fast growing number of researchers in the field. This comprehensive handbook with contributions by eminent experts, presents both the theoretical and practical aspects at an introductory level wherever possible.
This book constitutes the refereed proceedings of the 15th IAPR International Workshop on Document Analysis Systems, DAS 2022, held in La Rochelle, France, in May 2022. The full papers presented were carefully reviewed and selected from numerous submissions addressing key techniques of document analysis.
This book provides the first comprehensive look at the emerging field of web document analysis. It sets the scene in this new field by combining state-of-the-art reviews of challenges and opportunities with research papers by leading researchers. Readers will find in-depth discussions on the many diverse and interdisciplinary areas within the field, including web image processing, applications of machine learning and graph theories fat content extraction and web mining, adaptive web content delivery, multimedia document modeling and human interactive proofs for web security.
Recently, there has been an increased interest in the research and development of techniques for components of complete document analysis systems. In recognition of this trend, a series of workshops on Document Analysis Systems commenced in 1994, under the leadership of Henry Baird. The first workshop, held in Kaiserslautern, Germany, in October, 1994, was chaired by Andreas Dengel and Larry Spitz. The second workshop on Document Analysis Systems was held in Malvern, PA, USA, in October, 1996, chaired by Jonathan J. Hull and Suzanne Liebowitz Taylor. The DAS workshop has been one of the most prestigious technical meetings, bringing together a large number of scientists and engineers from all over the world to express their innovative ideas and report on their latest achievements in the area of document analysis systems. The papers in this special book edition were rigorously selected from the Third IAPR Workshop on Document Analysis Systems (DAS’98), held in Nagano, Japan, on 4 - 6 November 1998. It is worth mentioning that the papers were chosen for their original and substantial contributions to the workshop theme and this special book edition. From among the 53 papers that were presented by authors from 11 countries at the DAS’98 after critical reviews by at least three experts, we carefully selected 29 papers for this special book edition. Most of the contributions in this edition have been expanded or extensively revised to include helpful discussions, suggestions, or comments made during the workshop.
This four-volume set of LNCS 12821, LNCS 12822, LNCS 12823 and LNCS 12824, constitutes the refereed proceedings of the 16th International Conference on Document Analysis and Recognition, ICDAR 2021, held in Lausanne, Switzerland in September 2021. The 182 full papers were carefully reviewed and selected from 340 submissions, and are presented with 13 competition reports. The papers are organized into the following topical sections: document analysis for literature search, document summarization and translation, multimedia document analysis, mobile text recognition, document analysis for social good, indexing and retrieval of documents, physical and logical layout analysis, recognition of tables and formulas, and natural language processing (NLP) for document understanding.
This book constitutes the refereed proceedings of the 5th International Workshop on Document Analysis Systems, DAS 2002, held in Princeton, NJ, USA in August 2002 with sponsorship from IAPR.The 44 revised full papers presented together with 14 short papers were carefuly reviwed and selected for inclusion in the book. All current issues in document analysis systems are adressed. The papers are organized in topical sections on OCR features and systems, handwriting recognition, layout analysis, classifiers and learning, tables and forms, text extraction, indexing and retrieval, document engineering, and new applications.