Hands-On Data Science for Librarians

Hands-On Data Science for Librarians

Author: Sarah Lin

Publisher: CRC Press

Published: 2023-05-09

Total Pages: 199

ISBN-13: 1000863174

DOWNLOAD EBOOK

Librarians understand the need to store, use and analyze data related to their collection, patrons and institution, and there has been consistent interest over the last 10 years to improve data management, analysis, and visualization skills within the profession. However, librarians find it difficult to move from out-of-the-box proprietary software applications to the skills necessary to perform the range of data science actions in code. This book will focus on teaching R through relevant examples and skills that librarians need in their day-to-day lives that includes visualizations but goes much further to include web scraping, working with maps, creating interactive reports, machine learning, and others. While there’s a place for theory, ethics, and statistical methods, librarians need a tool to help them acquire enough facility with R to utilize data science skills in their daily work, no matter what type of library they work at (academic, public or special). By walking through each skill and its application to library work before walking the reader through each line of code, this book will support librarians who want to apply data science in their daily work. Hands-On Data Science for Librarians is intended for librarians (and other information professionals) in any library type (public, academic or special) as well as graduate students in library and information science (LIS). Key Features: Only data science book available geared toward librarians that includes step-by-step code examples Examples include all library types (public, academic, special) Relevant datasets Accessible to non-technical professionals Focused on job skills and their applications


Data Science for Librarians

Data Science for Librarians

Author: Yunfei Du

Publisher: Libraries Unlimited

Published: 2020-03-26

Total Pages: 0

ISBN-13: 1440871213

DOWNLOAD EBOOK

More data, more problems -- A new strand of librarianship -- Data creation and collection -- Data for the academic librarian -- Research data services and the library ecosystem -- Data sources -- Data curation (archiving/preservation) -- Data storage, management, and retrieval -- Data analysis and visualization -- Data ethics and policies -- Data for public libraries and special libraries -- Conclusion: library, information, and data science.


A Hands-On Introduction to Data Science

A Hands-On Introduction to Data Science

Author: Chirag Shah

Publisher: Cambridge University Press

Published: 2020-04-02

Total Pages: 459

ISBN-13: 1108472443

DOWNLOAD EBOOK

An introductory textbook offering a low barrier entry to data science; the hands-on approach will appeal to students from a range of disciplines.


Data Management

Data Management

Author: Margaret E. Henderson

Publisher: Rowman & Littlefield

Published: 2016-10-25

Total Pages: 215

ISBN-13: 144226439X

DOWNLOAD EBOOK

Libraries organize information and data is information, so it is natural that librarians should help people who need to find, organize, use, or store data. Organizations need evidence for decision making; data provides that evidence. Inventors and creators build upon data collected by others. All around us, people need data. Librarians can help increase the relevance of their library to the research and education mission of their institution by learning more about data and how to manage it. Data Management will guide readers through: Understanding data management basics and best practices. Using the reference interview to help with data management Writing data management plans for grants. Starting and growing a data management service. Finding collaborators inside and outside the library. Collecting and using data in different disciplines.


Data Science for Librarians

Data Science for Librarians

Author: Yunfei Du

Publisher: Bloomsbury Publishing USA

Published: 2020-03-26

Total Pages: 169

ISBN-13:

DOWNLOAD EBOOK

This unique textbook intersects traditional library science with data science principles that readers will find useful in implementing or improving data services within their libraries. Data Science for Librarians introduces data science to students and practitioners in library services. Writing for academic, public, and school library managers; library science students; and library and information science educators, authors Yunfei Du and Hammad Rauf Khan provide a thorough overview of conceptual and practical tools for data librarian practice. Partially due to how quickly data science evolves, libraries have yet to recognize core competencies and skills required to perform the job duties of a data librarian. As society transitions from the information age into the era of big data, librarians and information professionals require new knowledge and skills to stay current and take on new job roles, such as data librarianship. Such skills as data curation, research data management, statistical analysis, business analytics, visualization, smart city data, and learning analytics are relevant in library services today and will become increasingly so in the near future. This text serves as a tool for library and information science students and educators working on data science curriculum design.


Hands on Data Science for Biologists Using Python

Hands on Data Science for Biologists Using Python

Author: Yasha Hasija

Publisher: CRC Press

Published: 2021-04-08

Total Pages: 299

ISBN-13: 1000345483

DOWNLOAD EBOOK

Hands-on Data Science for Biologists using Python has been conceptualized to address the massive data handling needs of modern-day biologists. With the advent of high throughput technologies and consequent availability of omics data, biological science has become a data-intensive field. This hands-on textbook has been written with the inception of easing data analysis by providing an interactive, problem-based instructional approach in Python programming language. The book starts with an introduction to Python and steadily delves into scrupulous techniques of data handling, preprocessing, and visualization. The book concludes with machine learning algorithms and their applications in biological data science. Each topic has an intuitive explanation of concepts and is accompanied with biological examples. Features of this book: The book contains standard templates for data analysis using Python, suitable for beginners as well as advanced learners. This book shows working implementations of data handling and machine learning algorithms using real-life biological datasets and problems, such as gene expression analysis; disease prediction; image recognition; SNP association with phenotypes and diseases. Considering the importance of visualization for data interpretation, especially in biological systems, there is a dedicated chapter for the ease of data visualization and plotting. Every chapter is designed to be interactive and is accompanied with Jupyter notebook to prompt readers to practice in their local systems. Other avant-garde component of the book is the inclusion of a machine learning project, wherein various machine learning algorithms are applied for the identification of genes associated with age-related disorders. A systematic understanding of data analysis steps has always been an important element for biological research. This book is a readily accessible resource that can be used as a handbook for data analysis, as well as a platter of standard code templates for building models.


Hands-on Scikit-Learn for Machine Learning Applications

Hands-on Scikit-Learn for Machine Learning Applications

Author: David Paper

Publisher: Apress

Published: 2019-11-16

Total Pages: 247

ISBN-13: 1484253736

DOWNLOAD EBOOK

Aspiring data science professionals can learn the Scikit-Learn library along with the fundamentals of machine learning with this book. The book combines the Anaconda Python distribution with the popular Scikit-Learn library to demonstrate a wide range of supervised and unsupervised machine learning algorithms. Care is taken to walk you through the principles of machine learning through clear examples written in Python that you can try out and experiment with at home on your own machine. All applied math and programming skills required to master the content are covered in this book. In-depth knowledge of object-oriented programming is not required as working and complete examples are provided and explained. Coding examples are in-depth and complex when necessary. They are also concise, accurate, and complete, and complement the machine learning concepts introduced. Working the examples helps to build the skills necessary to understand and apply complex machine learning algorithms. Hands-on Scikit-Learn for Machine Learning Applications is an excellent starting point for those pursuing a career in machine learning. Students of this book will learn the fundamentals that are a prerequisite to competency. Readers will be exposed to the Anaconda distribution of Python that is designed specifically for data science professionals, and will build skills in the popular Scikit-Learn library that underlies many machine learning applications in the world of Python. What You'll LearnWork with simple and complex datasets common to Scikit-Learn Manipulate data into vectors and matrices for algorithmic processing Become familiar with the Anaconda distribution used in data scienceApply machine learning with Classifiers, Regressors, and Dimensionality Reduction Tune algorithms and find the best algorithms for each dataset Load data from and save to CSV, JSON, Numpy, and Pandas formats Who This Book Is For The aspiring data scientist yearning to break into machine learning through mastering the underlying fundamentals that are sometimes skipped over in the rush to be productive. Some knowledge of object-oriented programming and very basic applied linear algebra will make learning easier, although anyone can benefit from this book.


A Hands-On Introduction to Data Science

A Hands-On Introduction to Data Science

Author: Chirag Shah

Publisher: Cambridge University Press

Published: 2020-04-02

Total Pages: 460

ISBN-13: 1108673902

DOWNLOAD EBOOK

This book introduces the field of data science in a practical and accessible manner, using a hands-on approach that assumes no prior knowledge of the subject. The foundational ideas and techniques of data science are provided independently from technology, allowing students to easily develop a firm understanding of the subject without a strong technical background, as well as being presented with material that will have continual relevance even after tools and technologies change. Using popular data science tools such as Python and R, the book offers many examples of real-life applications, with practice ranging from small to big data. A suite of online material for both instructors and students provides a strong supplement to the book, including datasets, chapter slides, solutions, sample exams and curriculum suggestions. This entry-level textbook is ideally suited to readers from a range of disciplines wishing to build a practical, working knowledge of data science.


Data Science

Data Science

Author: John D. Kelleher

Publisher: MIT Press

Published: 2018-04-13

Total Pages: 282

ISBN-13: 0262535432

DOWNLOAD EBOOK

A concise introduction to the emerging field of data science, explaining its evolution, relation to machine learning, current uses, data infrastructure issues, and ethical challenges. The goal of data science is to improve decision making through the analysis of data. Today data science determines the ads we see online, the books and movies that are recommended to us online, which emails are filtered into our spam folders, and even how much we pay for health insurance. This volume in the MIT Press Essential Knowledge series offers a concise introduction to the emerging field of data science, explaining its evolution, current uses, data infrastructure issues, and ethical challenges. It has never been easier for organizations to gather, store, and process data. Use of data science is driven by the rise of big data and social media, the development of high-performance computing, and the emergence of such powerful methods for data analysis and modeling as deep learning. Data science encompasses a set of principles, problem definitions, algorithms, and processes for extracting non-obvious and useful patterns from large datasets. It is closely related to the fields of data mining and machine learning, but broader in scope. This book offers a brief history of the field, introduces fundamental data concepts, and describes the stages in a data science project. It considers data infrastructure and the challenges posed by integrating data from multiple sources, introduces the basics of machine learning, and discusses how to link machine learning expertise with real-world problems. The book also reviews ethical and legal issues, developments in data regulation, and computational approaches to preserving privacy. Finally, it considers the future impact of data science and offers principles for success in data science projects.


The Accidental Data Scientist

The Accidental Data Scientist

Author: Amy L. Affelt

Publisher: Information Today

Published: 2015

Total Pages: 0

ISBN-13: 9781573875110

DOWNLOAD EBOOK

Amy Affelt, author of The Accidental Data Scientist, notes that "Librarians and information professionals have always worked with data in order to meet the information needs of their constituents, thus 'Big Data' is not a new concept for them." With The Accidental Data Scientist, Amy Affelt shows information professionals how to leverage their skills and training to master emerging tools, techniques, and vocabulary; create mission-critical Big Data research deliverables; and discover rewarding new career opportunities by embracing their inner Data Scientist.