Automatically Ordering Events and Times in Text

Automatically Ordering Events and Times in Text

Author: Leon R.A. Derczynski

Publisher: Springer

Published: 2016-10-18

Total Pages: 220

ISBN-13: 3319472410

DOWNLOAD EBOOK

The book offers a detailed guide to temporal ordering, exploring open problems in the field and providing solutions and extensive analysis. It addresses the challenge of automatically ordering events and times in text. Aided by TimeML, it also describes and presents concepts relating to time in easy-to-compute terms. Working out the order that events and times happen has proven difficult for computers, since the language used to discuss time can be vague and complex. Mapping out these concepts for a computational system, which does not have its own inherent idea of time, is, unsurprisingly, tough. Solving this problem enables powerful systems that can plan, reason about events, and construct stories of their own accord, as well as understand the complex narratives that humans express and comprehend so naturally. This book presents a theory and data-driven analysis of temporal ordering, leading to the identification of exactly what is difficult about the task. It then proposes and evaluates machine-learning solutions for the major difficulties. It is a valuable resource for those working in machine learning for natural language processing as well as anyone studying time in language, or involved in annotating the structure of time in documents.


Innovations in Bio-Inspired Computing and Applications

Innovations in Bio-Inspired Computing and Applications

Author: Ajith Abraham

Publisher: Springer

Published: 2019-05-21

Total Pages: 536

ISBN-13: 3030166813

DOWNLOAD EBOOK

This book highlights recent research on bio-inspired computing and its various innovative applications in Information and Communication Technologies. It presents 50 high-quality papers from the 9th International Conference on Innovations in Bio-Inspired Computing and Applications (IBICA 2018) and 7th World Congress on Information and Communication Technologies (WICT 2018), which was held at Toc H Institute of Science and Technology (TIST) on December 17–19, 2018. IBICA-WICT 2018 was a premier conference and brought together researchers, engineers and practitioners whose work involved bio-inspired computing, computational intelligence and their applications in information security, real-world contexts etc. Including contributions by authors from 22 countries, the book offers a valuable reference guide for all researchers, students and practitioners in the fields of Computer Science and Engineering.


Text, Speech and Dialogue

Text, Speech and Dialogue

Author: Vaclav Matousek

Publisher: Springer Science & Business Media

Published: 2003-06-02

Total Pages: 438

ISBN-13: 354020024X

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 6th International Conference on Text, Speech and Dialogue, TSD 2003, held in Ceské Budejovice, Czech Republic in September 2003. The 60 revised full papers presented together with 2 invited contributions were carefully reviewed and selected from 121 submissions. The papers present a wealth of state-of-the-art research and development results in the field of natural language processing with an emphasis on text, speech, and spoken language ranging from theoretical and methodological issues to applications in various fields, such as web information retrieval, the semantic web, algorithmic learning, and dialogue systems.


Formal Representation and the Digital Humanities

Formal Representation and the Digital Humanities

Author: Paola Cotticelli-Kurras

Publisher: Cambridge Scholars Publishing

Published: 2018-12-14

Total Pages: 268

ISBN-13: 1527523349

DOWNLOAD EBOOK

What do linguistics, philology and even cultural studies have in common? There can be many answers for this question; certainly, however, they all have to deal with the new technologies and methods that go by the name of “Digital Humanities”. Today, all human sciences are facing new challenges both from the methodological point of view and from their very scientific contents. Accordingly, the number of research fields and approaches represented in this volume is large, reflecting the complexity of the problems of formalization, computation and digitalization of data and resources. The future of human sciences will be marked by the ever-increasing importance of formal models and computational tools, and the effective communication among the specialists of different fields is crucial for the scientific success of every single area of research. This collection of cutting-edge, high-quality papers is a fundamental step towards a better definition of the role the “Digital Humanities” will play in the next years.


Clinical Text Mining

Clinical Text Mining

Author: Hercules Dalianis

Publisher: Springer

Published: 2018-05-14

Total Pages: 192

ISBN-13: 3319785036

DOWNLOAD EBOOK

This open access book describes the results of natural language processing and machine learning methods applied to clinical text from electronic patient records. It is divided into twelve chapters. Chapters 1-4 discuss the history and background of the original paper-based patient records, their purpose, and how they are written and structured. These initial chapters do not require any technical or medical background knowledge. The remaining eight chapters are more technical in nature and describe various medical classifications and terminologies such as ICD diagnosis codes, SNOMED CT, MeSH, UMLS, and ATC. Chapters 5-10 cover basic tools for natural language processing and information retrieval, and how to apply them to clinical text. The difference between rule-based and machine learning-based methods, as well as between supervised and unsupervised machine learning methods, are also explained. Next, ethical concerns regarding the use of sensitive patient records for research purposes are discussed, including methods for de-identifying electronic patient records and safely storing patient records. The book’s closing chapters present a number of applications in clinical text mining and summarise the lessons learned from the previous chapters. The book provides a comprehensive overview of technical issues arising in clinical text mining, and offers a valuable guide for advanced students in health informatics, computational linguistics, and information retrieval, and for researchers entering these fields.


Annotating, Extracting and Reasoning about Time and Events

Annotating, Extracting and Reasoning about Time and Events

Author: Frank Schilder

Publisher: Springer

Published: 2007-10-06

Total Pages: 146

ISBN-13: 3540759891

DOWNLOAD EBOOK

This state-of-the-art survey comprises a selection of the material presented at the International Dagstuhl Seminar on Annotating, Extracting and Reasoning about Time and Events, held in Dagstuhl Castle, Germany, in April 2005. The seminar centered around an emerging de facto standard for time and event annotation: TimeML. It features nine papers that detail current research and discuss open problems concerning annotation, temporal reasoning, and event identification.


Natural Language Processing

Natural Language Processing

Author: Yue Zhang

Publisher: Cambridge University Press

Published: 2021-01-07

Total Pages: 487

ISBN-13: 1108349773

DOWNLOAD EBOOK

With a machine learning approach and less focus on linguistic details, this gentle introduction to natural language processing develops fundamental mathematical and deep learning models for NLP under a unified framework. NLP problems are systematically organised by their machine learning nature, including classification, sequence labelling, and sequence-to-sequence problems. Topics covered include statistical machine learning and deep learning models, text classification and structured prediction models, generative and discriminative models, supervised and unsupervised learning with latent variables, neural networks, and transition-based methods. Rich connections are drawn between concepts throughout the book, equipping students with the tools needed to establish a deep understanding of NLP solutions, adapt existing models, and confidently develop innovative models of their own. Featuring a host of examples, intuition, and end of chapter exercises, plus sample code available as an online resource, this textbook is an invaluable tool for the upper undergraduate and graduate student.


Inducing Event Schemas and Their Participants from Unlabeled Text

Inducing Event Schemas and Their Participants from Unlabeled Text

Author: Nathanael William Chambers

Publisher: Stanford University

Published: 2011

Total Pages: 159

ISBN-13:

DOWNLOAD EBOOK

The majority of information on the Internet is expressed in written text. Understanding and extracting this information is crucial to building intelligent systems that can organize this knowledge, but most algorithms focus on learning atomic facts and relations. For instance, we can reliably extract facts like "Stanford is a University" and "Professors teach Science" by observing redundant word patterns across a corpus. However, these facts do not capture richer knowledge like the way detonating a bomb is related to destroying a building, or that the perpetrator who was convicted must have been arrested. A structured model of these events and entities is needed to understand language across many genres, including news, blogs, and even social media. This dissertation describes a new approach to knowledge acquisition and extraction that learns rich structures of events (e.g., plant, detonate, destroy) and participants (e.g., suspect, target, victim) over a large corpus of news articles, beginning from scratch and without human involvement. As opposed to early event models in Natural Language Processing (NLP) such as scripts and frames, modern statistical approaches and advances in NLP now enable new representations and large-scale learning over many domains. This dissertation begins by describing a new model of events and entities called Narrative Event Schemas. A Narrative Event Schema is a collection of events that occur together in the real world, linked by the typical entities involved. I describe the representation itself, followed by a statistical learning algorithm that observes chains of entities repeatedly connecting the same sets of events within documents. The learning process extracts thousands of verbs within schemas from 14 years of newspaper data. I present novel contributions in the field of temporal ordering to build classifiers that order the events and infer likely schema orderings. I then present several new evaluations for the extracted knowledge. Finally, I apply Narrative Event Schemas to the field of Information Extraction, learning templates of events with sets of semantic roles. Most Information Extraction approaches assume foreknowledge of the domain's templates, but I instead start from scratch and learn schemas as templates, and then extract the entities from text as in a standard extraction task. My algorithm is the first to learn templates without human guidance, and its results approach those of supervised algorithms.


Recent Advances in Natural Language Processing III

Recent Advances in Natural Language Processing III

Author: Nicolas Nicolov

Publisher: John Benjamins Publishing

Published: 2004-11-30

Total Pages: 418

ISBN-13: 9027294682

DOWNLOAD EBOOK

This volume brings together revised versions of a selection of papers presented at the 2003 International Conference on “Recent Advances in Natural Language Processing”. A wide range of topics is covered in the volume: semantics, dialogue, summarization, anaphora resolution, shallow parsing, morphology, part-of-speech tagging, named entity, question answering, word sense disambiguation, information extraction. Various ‘state-of-the-art’ techniques are explored: finite state processing, machine learning (support vector machines, maximum entropy, decision trees, memory-based learning, inductive logic programming, transformation-based learning, perceptions), latent semantic analysis, constraint programming. The papers address different languages (Arabic, English, German, Slavic languages) and use different linguistic frameworks (HPSG, LFG, constraint-based DCG). This book will be of interest to those who work in computational linguistics, corpus linguistics, human language technology, translation studies, cognitive science, psycholinguistics, artificial intelligence, and informatics.