This book constitutes the refereed proceedings of the 13th Information Retrieval Societies Conference, AIRS 2017, held in Jeju, Korea, in November 2017. The 17 full papers presented were carefully reviewed and selected from numerous submissions. The final program of AIRS 2017 is divided in the following tracks: IR Infrastructure and Systems; IR Models and Theories; Personalization and Recommendation; Data Mining for IR; and IR Evaluation.
The first monograph to provide a coherent and organized survey on this topic. It puts together the various research pieces of the puzzle, provides a comprehensive and structured overview of diverse proposed methods, and lists several application scenarios where effective verbose query processing can make a significant difference.
This book constitutes the refereed proceedings of the 14th Information Retrieval Societies Conference, AIRS 2018, held in Taipei, Taiwan, in November 2018. The 8 full papers presented together with 9 short papers and 3 session papers were carefully reviewed and selected from 41 submissions. The scope of the conference covers applications, systems, technologies and theory aspects of information retrieval in text, audio, image, video and multimedia data.
Many information retrieval (IR) systems suffer from a radical variance in performance when responding to users' queries. Even for systems that succeed very well on average, the quality of results returned for some of the queries is poor. Thus, it is desirable that IR systems will be able to identify "difficult" queries so they can be handled properly. Understanding why some queries are inherently more difficult than others is essential for IR, and a good answer to this important question will help search engines to reduce the variance in performance, hence better servicing their customer needs. Estimating the query difficulty is an attempt to quantify the quality of search results retrieved for a query from a given collection of documents. This book discusses the reasons that cause search engines to fail for some of the queries, and then reviews recent approaches for estimating query difficulty in the IR field. It then describes a common methodology for evaluating the prediction quality of those estimators, and experiments with some of the predictors applied by various IR methods over several TREC benchmarks. Finally, it discusses potential applications that can utilize query difficulty estimators by handling each query individually and selectively, based upon its estimated difficulty. Table of Contents: Introduction - The Robustness Problem of Information Retrieval / Basic Concepts / Query Performance Prediction Methods / Pre-Retrieval Prediction Methods / Post-Retrieval Prediction Methods / Combining Predictors / A General Model for Query Difficulty / Applications of Query Difficulty Estimation / Summary and Conclusions
Commercial Web search engines such as Google, Yahoo, and Bing are used every day by millions of people across the globe. With their ever-growing refinement and usage, it has become increasingly difficult for academic researchers to keep up with the collection sizes and other critical research issues related to Web search, which has created a divide between the information retrieval research being done within academia and industry. Such large collections pose a new set of challenges for information retrieval researchers. In this work, Metzler describes highly effective information retrieval models for both smaller, classical data sets, and larger Web collections. In a shift away from heuristic, hand-tuned ranking functions and complex probabilistic models, he presents feature-based retrieval models. The Markov random field model he details goes beyond the traditional yet ill-suited bag of words assumption in two ways. First, the model can easily exploit various types of dependencies that exist between query terms, eliminating the term independence assumption that often accompanies bag of words models. Second, arbitrary textual or non-textual features can be used within the model. As he shows, combining term dependencies and arbitrary features results in a very robust, powerful retrieval model. In addition, he describes several extensions, such as an automatic feature selection algorithm and a query expansion framework. The resulting model and extensions provide a flexible framework for highly effective retrieval across a wide range of tasks and data sets. A Feature-Centric View of Information Retrieval provides graduate students, as well as academic and industrial researchers in the fields of information retrieval and Web search with a modern perspective on information retrieval modeling and Web searches.
This book constitutes the refereed proceedings of the 5th Asia Information Retrieval Symposium, AIRS 2009, held in Sapporo, Japan, in October 2009. The 18 revised full papers and 20 revised poster papers presented were carefully reviewed and selected from 82 submissions. All current aspects of information retrieval - in theory and practice - are addressed; working with text, audio, image, video and multimedia data.
This book constitutes the refereed proceedings of the 30th annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR.
This book constitutes the thoroughly refereed post-conference proceedings of the 4th Asia Information Retrieval Symposium, AIRS 2008, held in Harbin, China, in May 2008. The 39 revised full papers and 43 revised poster papers presented were carefully reviewed and selected from 144 submissions. All current issues in information retrieval are addressed: applications, systems, technologies and theoretical aspects of information retrieval in text, audio, image, video and multi-media data. The papers are organized in topical sections on IR models image retrieval, text classification, chinese language processing, text processing, application of IR, machine learning, taxonomy, IR methods, information extraction, summarization, multimedia, Web IR, and text clustering.
This book constitutes the refereed proceedings of the 9th Information Retrieval Societies Conference, AIRS 2013, held in Singapore, in December 2013. The 27 full papers and 18 poster presentations included in this volume were carefully reviewed and selected from 109 submissions. They are organized in the following topical sections: IR theory, modeling and query processing; clustering, classification and detection; natural language processing for IR; social networks, user-centered studies and personalization and applications.
This book constitutes the refereed proceedings of the Second International Conference on the Theory of Information Retrieval, ICTIR 2009, held in Cambridge, UK, in September 2009. The 18 revised full papers, 14 short papers, and 11 posters presented together with one invited talk were carefully reviewed and selected from 82 submissions. The papers are categorized into four main themes: novel IR models, evaluation, efficiency, and new perspectives in IR. Twenty-one papers fall into the general theme of novel IR models, ranging from various retrieval models, query and term selection models, Web IR models, developments in novelty and diversity, to the modeling of user aspects. There are four papers on new evaluation methodologies, e.g., modeling score distributions, evaluation over sessions, and an axiomatic framework for XML retrieval evaluation. Three papers focus on the issue of efficiency and offer solutions to improve the tractability of PageRank, data cleansing practices for training classifiers, and approximate search for distributed IR. Finally, four papers look into new perspectives of IR and shed light on some new emerging areas of interest, such as the application and adoption of quantum theory in IR.