This textbook is a practical guide to language test development for classroom use, educational program evaluation, placement at instructional levels, measurement and research. It has served for many years as a classic classroom textbook at both graduate and undergraduate university levels. This new edition makes it available once again at low cost for student use. It provides useful information about reliability and validity estimation and about test and questionnaire development for a variety of educational and evaluation purposes. It includes information useful for the development of both paper-and-pencil and computer-adaptive measurement instruments.
Leading experts describe the state of the art in developing and constructing psychometric tests. This latest volume in the series Psychological Assessment – Science and Practice describes the current state of the art in test development and construction. The past 10-20 years have seen substantial advances in the methods used to develop and administer tests. In this volume many of the world's leading authorities collate these advances and provide information about current practices, thus equipping researchers and students to successfully construct new tests using the best modern standards and techniques. The first section explains the benefits of considering the underlying theory, such as factor analysis and item response theory, when designing tests. The second section looks at item format and test presentation. The third discusses model testing and selection, while the fourth goes into statistical methods that can find group-specific bias. The final section discusses topics of special relevance such as multi-trait multi-state analyses and development of screening instruments.
With the current push toward educational reform, there is great potential for innovation and change, particularly in large scale testing. One area where change is possible is in cognitive diagnostic assessment. Researchers in educational measurement and cognitive psychology are finally in a position to design tests targeted specifically for providing valuable information about students' cognitive strengths and weaknesses. This self-contained volume organizes what is known about cognitive diagnostic assessment in education, including its conceptual and philosophical basis, methods, and applications. The complete list of topics includes educational demand, philosophical rationale, construct validity, cognitive methods, test construction, statistical models, and unresolved issues (e.g., how to best translate diagnostic information into teaching practices). Leighton and Gierl present a comprehensive and up-to-date examination of cognitive diagnostic assessment in education.
Constructing test items for standardized tests of achievement, ability, and aptitude is a task of enormous importance. The interpretability of a test's scores flows directly from the quality of its items and exercises. Concomitant with score interpretability is the notion that including only carefully crafted items on a test is the primary method by which the skilled test developer reduces unwanted error variance, or errors of measurement, and thereby increases a test score's reliability. The aim of this entire book is to increase the test constructor's awareness of this source of measurement error, and then to describe methods for identifying and minimizing it during item construction and later review. Persons involved in assessment are keenly aware of the increased attention given to alternative formats for test items in recent years. Yet, in many writers' zeal to be 'curriculum-relevant' or 'authentic' or 'realistic', the items are often developed seemingly without conscious thought to the interpretations that may be garnered from them. This book argues that the format for such alternative items and exercises also requires rigor in their construction and even offers some solutions, as one chapter is devoted to these alternative formats. This book addresses major issues in constructing test items by focusing on four ideas. First, it describes the characteristics and functions of test items. A second feature of this book is the presentation of editorial guidelines for writing test items in all of the commonly used item formats, including constructed-response formats and performance tests. A third aspect of this book is the presentation of methods for determining the quality of test items. Finally, this book presents a compendium of important issues about test items, including procedures for ordering items in a test, ethical and legal concerns over using copyrighted test items, item scoring schemes, computer-generated items and more.
Language Assessment: Principles and Classroom Practices is designed to offer a comprehensive survey of essential principles and tools for second language assessment. Its first and second editions have been successfully used in teacher-training courses, teacher certification curricula, and TESOL master of arts programs. As the third in a trilogy of teacher education textbooks, it is designed to follow H. Douglas Brown's other two books, Principles of Language Learning and Teaching (sixth edition, Pearson Education, 2014) and Teaching by Principles (fourth edition, Pearson Education, 2015). References to those two books are made throughout the current book. Language Assessment features uncomplicated prose and a systematic, spiraling organization. Concepts are introduced with practical examples, understandable explanations, and succinct references to supportive research. The research literature on language assessment can be quite complex and can assume that readers have technical knowledge and experience in testing. By the end of Language Assessment, however, readers will have gained access to this not-so-frightening field. They will have a working knowledge of a number of useful, fundamental principles of assessment and will have applied those principles to practical classroom contexts. They will also have acquired a storehouse of useful tools for evaluating and designing practical, effective assessment techniques for their classrooms.
The United States is formally represented around the world by approximately 14,000 Foreign Service officers and other personnel in the U.S. Department of State. Roughly one-third of them are required to be proficient in the local languages of the countries to which they are posted. To achieve this language proficiency for its staff, the State Department's Foreign Service Institute (FSI) provides intensive language instruction and assesses the proficiency of personnel before they are posted to a foreign country. The requirement for language proficiency is established in law and is incorporated in personnel decisions related to job placement, promotion, retention, and pay. A Principled Approach to Language Assessment: Considerations for the U.S. Foreign Service Institute evaluates the different approaches to assessing foreign language proficiency that FSI could potentially use. This report considers the key assessment approaches in the research literature that are appropriate for language testing, including, but not limited to, assessments that use task-based or performance-based approaches, adaptive online test administration, and portfolios.
The field of language testing and assessment has recognized the importance of language assessment literacy (LAL) and of its theoretical and practical underpinnings, an area that is gradually coming to prominence. This book addresses issues that promote the concept of LAL for language research, teaching, and learning, covering a range of topics. It brings together 14 chapters based on high-stakes and classroom-based studies authored by academics, professionals and researchers in the field. The text examines diverse issues through a multifaceted approach, presenting high-quality contributions that fill a gap in a research area that has long been in need of theoretical and empirical attention.
In the United States, the nomenclature of adult education includes adult literacy, adult secondary education, and English for speakers of other languages (ESOL) services provided to undereducated and limited English proficient adults. Those receiving adult education services have diverse reasons for seeking additional education. With the passage of the Workforce Investment Act (WIA), the assessment of adult education students became mandatory, regardless of their reasons for seeking services. The law does allow the states and local programs flexibility in selecting the most appropriate assessment for the student. The purpose of the NRC's workshop was to explore issues related to efforts to measure learning gains in adult basic education programs, with a focus on performance-based assessments.
"Psychological Testing by Theresa J. B. Kline is an accessible, easy-to-read book that effectively communicates the current concepts, trends, and controversies in the field of psychological testing. Readers are provided with an in-depth analysis of psychometrics in a format that will keep their attention and that they will be able to relate to the significance of psychological testing across numerous areas such as schools, businesses, clinical settings, military, or government." -Todd L. Chmielewski, PsycCRITIQUES, December 7, 2005 VOL. 50, NO. 49, ARTICLE 12 Psychological Testing: A Practical Approach to Design and Evaluation offers a fresh and innovative approach to students and faculty in the fields of testing, measurement, psychometrics, research design, and related areas of study. Author Theresa J.B. Kline guides readers through the process of designing and evaluating a test, while ensuring that the test meets the highest professional standards. The author uses simple, clear examples throughout and fully details the required statistical analyses. Topics include—but are not limited to—design of item stems and responses; sampling strategies; classical and modern test theory; IRT program examples; reliability of tests and raters; validation using content, criterion-related, and factor analytic approaches; test and item bias; and professional and ethical issues in testing. With the student in mind, Kline has created features that ease them into more difficult ideas, always stressing the practical use of theoretical concepts. 
Features include: a step-by-step approach to designing a test, including construct identification, construct operationalization, collecting data, item assessment, and reliability and validity techniques; examples of data analyses with printouts and interpretation; up-to-date coverage of psychometric topics, such as difference scores, change scores, translation, computer adaptive testing, reliability and validity generalization, professional and ethical guidelines, and references; IRT program outputs (dichotomous and multiple response); coverage of traditional topics in the context of how they would be used, such as standard errors and confidence intervals; and sampling approaches and their strengths and weaknesses, as well as response rates and missing data management.
Psychological Testing is perfectly suited as a main text for upper-level undergraduate and graduate Testing or Psychometrics courses in departments of Psychology, Education, Sociology, Management, and in the Human Services disciplines. Professional researchers, educators, and consultants will also want to add this to their libraries for up-to-date coverage of test design and evaluation techniques.
"Professor Kline's attempts to de-mystify complex measurement concepts are beautifully simplified and illustrated in her countless illustrations of practical and relevant problems for the mathematically-challenged student. This book is also a must-have for those who simply do not have the desire for the theoretical jargon used in similar textbooks but are interested in the important conceptual and practical aspects of measurement as they apply in their disciplines." -Arturo Olivarez, Jr., Texas Tech University
"Kline's Psychological Testing provides a well-written treatment of the critical issues in designing and evaluating psychometric instruments. This book will be very useful to advanced undergraduate students, graduate students, and researchers." -Richard Block, Montana State University