Entity-Oriented Search

Entity-Oriented Search

Author: Krisztian Balog

Publisher: Springer

Published: 2018-10-02

Total Pages: 358

ISBN-13: 3319939351

DOWNLOAD EBOOK

This open access book covers all facets of entity-oriented search—where “search” can be interpreted in the broadest sense of information access—from a unified point of view, and provides a coherent and comprehensive overview of the state of the art. It represents the first synthesis of research in this broad and rapidly developing area. Selected topics are discussed in-depth, the goal being to establish fundamental techniques and methods as a basis for future research and development. Additional topics are treated at a survey level only, containing numerous pointers to the relevant literature. A roadmap for future research, based on open issues and challenges identified along the way, rounds out the book. The book is divided into three main parts, sandwiched between introductory and concluding chapters. The first two chapters introduce readers to the basic concepts, provide an overview of entity-oriented search tasks, and present the various types and sources of data that will be used throughout the book. Part I deals with the core task of entity ranking: given a textual query, possibly enriched with additional elements or structural hints, return a ranked list of entities. This core task is examined in a number of different variants, using both structured and unstructured data collections, and numerous query formulations. In turn, Part II is devoted to the role of entities in bridging unstructured and structured data. Part III explores how entities can enable search engines to understand the concepts, meaning, and intent behind the query that the user enters into the search box, and how they can provide rich and focused responses (as opposed to merely a list of documents)—a process known as semantic search. The final chapter concludes the book by discussing the limitations of current approaches, and suggesting directions for future research. Researchers and graduate students are the primary target audience of this book. A general background in information retrieval is sufficient to follow the material, including an understanding of basic probability and statistics concepts as well as a basic knowledge of machine learning concepts and supervised learning algorithms.


WSDM'15

WSDM'15

Author: Xueqi Cheng

Publisher:

Published: 2015

Total Pages: 466

ISBN-13: 9781450333177

DOWNLOAD EBOOK


Trustworthy Online Controlled Experiments

Trustworthy Online Controlled Experiments

Author: Ron Kohavi

Publisher:

Published: 2020-02-14

Total Pages: 292

ISBN-13: 1108601375

DOWNLOAD EBOOK

This practical guide for students, researchers and practitioners offers real world guidance for data-driven decision making and innovation.


Machine Learning and Knowledge Discovery in Databases

Machine Learning and Knowledge Discovery in Databases

Author: Albert Bifet

Publisher: Springer

Published: 2015-08-28

Total Pages: 365

ISBN-13: 3319234617

DOWNLOAD EBOOK

The three volume set LNAI 9284, 9285, and 9286 constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2015, held in Porto, Portugal, in September 2015. The 131 papers presented in these proceedings were carefully reviewed and selected from a total of 483 submissions. These include 89 research papers, 11 industrial papers, 14 nectar papers, 17 demo papers. They were organized in topical sections named: classification, regression and supervised learning; clustering and unsupervised learning; data preprocessing; data streams and online learning; deep learning; distance and metric learning; large scale learning and big data; matrix and tensor analysis; pattern and sequence mining; preference learning and label ranking; probabilistic, statistical, and graphical approaches; rich data; and social and graphs. Part III is structured in industrial track, nectar track, and demo track.


Data Science for Economics and Finance

Data Science for Economics and Finance

Author: Sergio Consoli

Publisher: Springer Nature

Published: 2021

Total Pages: 357

ISBN-13: 3030668916

DOWNLOAD EBOOK

This open access book covers the use of data science, including advanced machine learning, big data analytics, Semantic Web technologies, natural language processing, social media analysis, time series analysis, among others, for applications in economics and finance. In addition, it shows some successful applications of advanced data science solutions used to extract new knowledge from data in order to improve economic forecasting models. The book starts with an introduction on the use of data science technologies in economics and finance and is followed by thirteen chapters showing success stories of the application of specific data science methodologies, touching on particular topics related to novel big data sources and technologies for economic analysis (e.g. social media and news); big data models leveraging on supervised/unsupervised (deep) machine learning; natural language processing to build economic and financial indicators; and forecasting and nowcasting of economic variables through time series analysis. This book is relevant to all stakeholders involved in digital and data-intensive research in economics and finance, helping them to understand the main opportunities and challenges, become familiar with the latest methodological findings, and learn how to use and evaluate the performances of novel tools and frameworks. It primarily targets data scientists and business analysts exploiting data science technologies, and it will also be a useful resource to research students in disciplines and courses related to these topics. Overall, readers will learn modern and effective data science solutions to create tangible innovations for economic and financial applications.


Linguistics across Disciplinary Borders

Linguistics across Disciplinary Borders

Author: Steven Coats

Publisher: Bloomsbury Publishing

Published: 2024-01-11

Total Pages: 265

ISBN-13: 135036228X

DOWNLOAD EBOOK

This volume highlights the ways in which recent developments in corpus linguistics and natural language processing can engage with topics across language studies, humanities and social science disciplines. New approaches have emerged in recent years that blur disciplinary boundaries, facilitated by factors such as the application of computational methods, access to large data sets, and the sharing of code, as well as continual advances in technologies related to data storage, retrieval, and processing. The “march of data” denotes an area at the border region of linguistics, humanities, and social science disciplines, but also the inevitable development of the underlying technologies that drive analysis in these subject areas. Organized into 3 sections, the chapters are connected by the underlying thread of linguistic corpora: how they can be created, how they can shed light on varieties or registers, and how their metadata can be utilized to better understand the internal structure of similar resources. While some chapters in the volume make use of well-established existing corpora, others analyze data from platforms such as YouTube, Twitter or Reddit. The volume provides insight into the diversity of methods, approaches, and corpora that inform our understanding of the “border regions” between the realms of data science, language/linguistics, and social or cultural studies.


Machine Learning for Algorithmic Trading

Machine Learning for Algorithmic Trading

Author: Stefan Jansen

Publisher: Packt Publishing Ltd

Published: 2020-07-31

Total Pages: 822

ISBN-13: 1839216786

DOWNLOAD EBOOK

Leverage machine learning to design and back-test automated trading strategies for real-world markets using pandas, TA-Lib, scikit-learn, LightGBM, SpaCy, Gensim, TensorFlow 2, Zipline, backtrader, Alphalens, and pyfolio. Purchase of the print or Kindle book includes a free eBook in the PDF format. Key FeaturesDesign, train, and evaluate machine learning algorithms that underpin automated trading strategiesCreate a research and strategy development process to apply predictive modeling to trading decisionsLeverage NLP and deep learning to extract tradeable signals from market and alternative dataBook Description The explosive growth of digital data has boosted the demand for expertise in trading strategies that use machine learning (ML). This revised and expanded second edition enables you to build and evaluate sophisticated supervised, unsupervised, and reinforcement learning models. This book introduces end-to-end machine learning for the trading workflow, from the idea and feature engineering to model optimization, strategy design, and backtesting. It illustrates this by using examples ranging from linear models and tree-based ensembles to deep-learning techniques from cutting edge research. This edition shows how to work with market, fundamental, and alternative data, such as tick data, minute and daily bars, SEC filings, earnings call transcripts, financial news, or satellite images to generate tradeable signals. It illustrates how to engineer financial features or alpha factors that enable an ML model to predict returns from price data for US and international stocks and ETFs. It also shows how to assess the signal content of new features using Alphalens and SHAP values and includes a new appendix with over one hundred alpha factor examples. By the end, you will be proficient in translating ML model predictions into a trading strategy that operates at daily or intraday horizons, and in evaluating its performance. What you will learnLeverage market, fundamental, and alternative text and image dataResearch and evaluate alpha factors using statistics, Alphalens, and SHAP valuesImplement machine learning techniques to solve investment and trading problemsBacktest and evaluate trading strategies based on machine learning using Zipline and BacktraderOptimize portfolio risk and performance analysis using pandas, NumPy, and pyfolioCreate a pairs trading strategy based on cointegration for US equities and ETFsTrain a gradient boosting model to predict intraday returns using AlgoSeek's high-quality trades and quotes dataWho this book is for If you are a data analyst, data scientist, Python developer, investment analyst, or portfolio manager interested in getting hands-on machine learning knowledge for trading, this book is for you. This book is for you if you want to learn how to extract value from a diverse set of data sources using machine learning to design your own systematic trading strategies. Some understanding of Python and machine learning techniques is required.


Dynamic Provisioning for Community Services

Dynamic Provisioning for Community Services

Author: Li Qi

Publisher: Springer Science & Business Media

Published: 2013-08-17

Total Pages: 185

ISBN-13: 3642345131

DOWNLOAD EBOOK

Dynamic Provisioning for Community Services outlines a dynamic provisioning and maintenance mechanism in a running distributed system, e.g. the grid, which can be used to maximize the utilization of computing resources and user demands. The book includes a complete and reliable maintenance system solution for the large-scale distributed system and an interoperation mechanism for the grid middleware deployed in the United States, Europe, and China. The experiments and evaluations have all been practically implemented for ChinaGrid, and the best practices established can help readers to construct reliable distributed systems. This book is intended for researchers, developers, and graduate students in the fields of grid computing, service-oriented architecture and dynamic maintenance for large distributed systems. Li Qi is an Associate Professor and the Deputy Director of the R&D Center for the Internet of Things at the Third Research Institute of Ministry of Public Security (TRIMPS), China. Hai Jin is a Professor and the Director of Department of Computer Science, Huazhong University of Science and Technology, China.


Text Analytics for Corpus Linguistics and Digital Humanities

Text Analytics for Corpus Linguistics and Digital Humanities

Author: Gerold Schneider

Publisher: Bloomsbury Publishing

Published: 2024-05-02

Total Pages: 241

ISBN-13: 1350370835

DOWNLOAD EBOOK

Do you want to gain a deeper understanding of how big tech analyses and exploits our text data, or investigate how political parties differ by analysing textual styles, associations and trends in documents? Or create a map of a text collection and write a simple QA system yourself? This book explores how to apply state-of-the-art text analytics methods to detect and visualise phenomena in text data. Solidly based on methods from corpus linguistics, natural language processing, text analytics and digital humanities, this book shows readers how to conduct experiments with their own corpora and research questions, underpin their theories, quantify the differences and pinpoint characteristics. Case studies and experiments are detailed in every chapter using real-world and open access corpora from politics, World English, history, and literature. The results are interpreted and put into perspective, pitfalls are pointed out, and necessary pre-processing steps are demonstrated. This book also demonstrates how to use the programming language R, as well as simple alternatives and additions to R, to conduct experiments and employ visualisations by example, with extensible R-code, recipes, links to corpora, and a wide range of methods. The methods introduced can be used across texts of all disciplines, from history or literature to party manifestos and patient reports.