Found 20 similar documents (search time: 0 ms)
1.
The validity of an intelligence test is discussed. "The Lowry Reasoning Test Combination has been found to be relatively free of social status bias and to measure intellectual function. It is easily administered and simply scored and does not depend upon a high level of verbal ability. Variance in concept difficulty is obtained by altering combinations of constructs while keeping the verbal material on a uniformly simple level. Wherever such a discriminative and effective selection device is needed the present writers would recommend that the Lowry test be tried." (PsycINFO Database Record (c) 2010 APA, all rights reserved)
2.
van der Maas Han L. J.; Molenaar Dylan; Maris Gunter; Kievit Rogier A.; Borsboom Denny 《Canadian Metallurgical Quarterly》2011,118(2):339
This article analyzes latent variable models from a cognitive psychology perspective. We start by discussing work by Tuerlinckx and De Boeck (2005), who proved that a diffusion model for 2-choice response processes entails a 2-parameter logistic item response theory (IRT) model for individual differences in the response data. Following this line of reasoning, we discuss the appropriateness of IRT for measuring abilities and bipolar traits, such as pro versus contra attitudes. Surprisingly, if a diffusion model underlies the response processes, IRT models are appropriate for bipolar traits but not for ability tests. A reconsideration of the concept of ability that is appropriate for such situations leads to a new item response model for accuracy and speed based on the idea that ability has a natural zero point. The model implies fundamentally new ways to think about guessing, response speed, and person fit in IRT. We discuss the relation between this model and existing models as well as implications for psychology and psychometrics. (PsycINFO Database Record (c) 2011 APA, all rights reserved)
3.
The authors describe the initial development of the Wagner Assessment Test (WAT), an instrument designed to assess critical thinking, using the 5-faceted view popularized by the Watson-Glaser Critical Thinking Appraisal (WGCTA; G. B. Watson & E. M. Glaser, 1980). The WAT was designed to reduce the degree of successful guessing relative to the WGCTA by increasing the number of response alternatives (i.e., 80% of WGCTA items are 2-alternative, multiple-choice), a change that was hypothesized to result in more desirable test information and standard-error functions. Analyses using the 3-parameter logistic item response theory (IRT) model in a sample of undergraduates (N = 407) supported this prediction, even when the WAT item pool was shortened to match the length of the WGCTA. Convergent validity between full-pool IRT score estimates was r = .69. Implications for subsequent research on IRT-based measurement of critical thinking are discussed. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
4.
Chernyshenko Oleksandr S.; Stark Stephen; Drasgow Fritz; Roberts Brent W. 《Canadian Metallurgical Quarterly》2007,19(1):88
The main aim of this article is to explicate why a transition to ideal point methods of scale construction is needed to advance the field of personality assessment. The study empirically demonstrated the substantive benefits of ideal point methodology as compared with the dominance framework underlying traditional methods of scale construction. Specifically, using a large, heterogeneous pool of order items, the authors constructed scales using traditional classical test theory, dominance item response theory (IRT), and ideal point IRT methods. The merits of each method were examined in terms of item pool utilization, model-data fit, measurement precision, and construct and criterion-related validity. Results show that adoption of the ideal point approach provided a more flexible platform for creating future personality measures, and this transition did not adversely affect the validity of personality test scores. (PsycINFO Database Record (c) 2011 APA, all rights reserved)
5.
Dere Jessica; Ryder Andrew G.; Kirmayer Laurence J. 《Canadian Metallurgical Quarterly》2010,42(2):134
Despite the rapid growth of the acculturation research literature in recent years, few studies have examined acculturation among community samples of immigrants in Canada. The present study used a bidimensional approach to examine acculturation among Anglophone Caribbean (n = 109), Vietnamese (n = 97), and Filipino (n = 109) first-generation immigrant adults living in a diverse urban community in Montreal, Quebec, Canada. Heritage and mainstream cultural orientations were independently assessed in 3 domains of acculturation: loyalty, behaviour, and situated identity. Across the 3 domains and the 3 groups, the 2 cultural orientations were largely independent, though in the Vietnamese and Filipino samples heritage group loyalty was positively related to mainstream group loyalty. Overall, results support a bidimensional model of acculturation and suggest the value of separately assessing different acculturation domains. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
6.
This study demonstrated the application of an innovative item response theory (IRT) based approach to evaluating measurement equivalence, comparing a newly developed Spanish version of the Posttraumatic Stress Disorder Checklist-Civilian Version (PCL-C) with the established English version. Basic principles and practical issues faced in the application of IRT methods for instrument evaluation are discussed. Data were derived from a study of the mental health consequences of community violence in both Spanish speakers (n = 102) and English speakers (n = 284). Results of differential item functioning (DIF) analyses revealed that the 2 versions were not fully equivalent on an item-by-item basis in that 6 of the 17 items displayed uniform DIF. No bias was observed, however, at the level of the composite PCL-C scale score, indicating that the 2 language versions can be combined for scale-level analyses. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
7.
Jane J. Serrita; Oltmanns Thomas F.; South Susan C.; Turkheimer Eric 《Canadian Metallurgical Quarterly》2007,116(1):166
The authors examined gender bias in the diagnostic criteria for Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; American Psychiatric Association, 2000) personality disorders. Participants (N=599) were selected from 2 large, nonclinical samples on the basis of information from self-report questionnaires and peer nominations that suggested the presence of personality pathology. All were interviewed with the Structured Interview for DSM-IV Personality (B. Pfohl, N. Blum, & M. Zimmerman, 1997). Using item response theory methods, the authors compared data from 315 men and 284 women, searching for evidence of differential item functioning in the diagnostic features of 10 personality disorder categories. Results indicated significant but moderate measurement bias pertaining to gender for 6 specific criteria. In other words, men and women with equivalent levels of pathology endorsed the items at different rates. For 1 paranoid personality disorder criterion and 3 antisocial criteria, men were more likely to endorse the biased items. For 2 schizoid personality disorder criteria, women were more likely to endorse the biased items. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
8.
Carlson Mike; Wilcox Rand; Chou Chih-Ping; Chang Megan; Yang Frances; Blanchard Jeanine; Marterella Abbey; Kuo Ann; Clark Florence 《Canadian Metallurgical Quarterly》2011,23(2):558
Reverse-scored items on assessment scales increase cognitive processing demands and may therefore lead to measurement problems for older adult respondents. In this study, the objective was to examine possible psychometric inadequacies of reverse-scored items on the Center for Epidemiologic Studies Depression Scale (CES-D) when used to assess ethnically diverse older adults. Using baseline data from a gerontologic clinical trial (n = 460), we tested the hypotheses that the reversed items on the CES-D (a) are less reliable than nonreversed items, (b) disproportionately lead to intraindividually atypical responses that are psychometrically problematic, and (c) evidence improved measurement properties when an imputation procedure based on the scale mean is used to replace atypical responses. In general, the results supported the hypotheses. Relative to nonreversed CES-D items, the 4 reversed items were less internally consistent, were associated with lower item-scale correlations, and were more often answered atypically at an intraindividual level. Further, the atypical responses were negatively correlated with responses to psychometrically sound nonreversed items that had similar content. The use of imputation to replace atypical responses enhanced the predictive validity of the set of reverse-scored items. Among older adult respondents, reverse-scored items are associated with measurement difficulties. It is recommended that appropriate correction procedures such as item readministration or statistical imputation be applied to reduce the difficulties. (PsycINFO Database Record (c) 2011 APA, all rights reserved)
9.
This article describes a general item response theory-based factor analytic procedure that allows assessment of the equivalence between 2 administrative modes of a questionnaire: paper and pencil, and Internet based. The theoretical relations between the present procedure and other methods used in previous empirical research are shown, and the advantages of the procedure are discussed. An empirical application based on 2 personality questionnaires is given, and the results are compared with the results of using traditional procedures for assessing equivalence. The substantive implications of the results, as well as suggestions for further research and methodology, are discussed. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
10.
Most item response theory models assume conditional independence, and it is known that interactions between items affect the estimated item discrimination. In this article, this effect is further investigated from a theoretical perspective and by means of simulation studies. To this end, a parametric model for item interactions is introduced. Next, it is shown that ignoring a positive interaction results in an overestimation of the discrimination parameter in the two-parameter logistic model (2PLM), whereas ignoring a negative interaction leads to an underestimation of the parameter. Furthermore, it is demonstrated that in some cases the item characteristic curves of the 2PLM and of an item involved in an interaction are quite similar, indicating that the 2PLM can provide a good fit to data with interactions. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
11.
Rodebaugh Thomas L.; Woods Carol M.; Heimberg Richard G.; Liebowitz Michael R.; Schneier Franklin R. 《Canadian Metallurgical Quarterly》2006,18(2):231
The widely used Social Interaction Anxiety Scale (SIAS; R. P. Mattick & J. C. Clarke, 1998) possesses favorable psychometric properties, but questions remain concerning its factor structure and item properties. Analyses included 445 people with social anxiety disorder and 1,689 undergraduates. Simple unifactorial models fit poorly, and models that accounted for differences due to item wording (i.e., reverse scoring) provided superior fit. It was further found that clients and undergraduates approached some items differently, and the SIAS may be somewhat overly conservative in selecting analogue participants from an undergraduate sample. Overall, this study provides support for the excellent properties of the SIAS's straightforwardly worded items, although questions remain regarding its reverse-scored items. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
12.
Elementary decision theory is applied to the problems of evaluating discrete tests or test items used to classify people into several categories, and choosing which of several treatments is best for persons falling within each response category. The technique explicitly considers the base rates of various criterion groups and the relative seriousness of different types of errors of classification, as well as the proportion of each criterion group falling in each response category. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
13.
The present study took a critical look at a central construct in couples research: relationship satisfaction. Eight well-validated self-report measures of relationship satisfaction, including the Marital Adjustment Test (MAT; H. J. Locke & K. M. Wallace, 1959), the Dyadic Adjustment Scale (DAS; G. B. Spanier, 1976), and an additional 75 potential satisfaction items, were given to 5,315 online participants. Using item response theory, the authors demonstrated that the MAT and DAS provided relatively poor levels of precision in assessing satisfaction, particularly given the length of those scales. Principal-components analysis and item response theory applied to the larger item pool were used to develop the Couples Satisfaction Index (CSI) scales. Compared with the MAT and the DAS, the CSI scales were shown to have higher precision of measurement (less noise) and correspondingly greater power for detecting differences in levels of satisfaction. The CSI scales demonstrated strong convergent validity with other measures of satisfaction and excellent construct validity with anchor scales from the nomological net surrounding satisfaction, suggesting that they assess the same theoretical construct as do prior scales. Implications for research are discussed. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
14.
Chorpita Bruce F.; Reise Steven; Weisz John R.; Grubbs Kathleen; Becker Kimberly D.; Krull Jennifer L. 《Canadian Metallurgical Quarterly》2010,78(4):526
Objective: To support ongoing monitoring of child response during treatment, we sought to develop a brief, easily administered, clinically relevant, and psychometrically sound measure. Method: We first developed child and caregiver forms of a 12-item Brief Problem Checklist (BPC) interview by applying item response theory and factor analysis to Youth Self-Report (YSR; Achenbach & Rescorla, 2001) and Child Behavior Checklist (CBCL; Achenbach & Rescorla, 2001) data for a sample of 2,332 youths. These interviews were then administered weekly via telephone to an ethnically diverse clinical sample of 184 boys and girls 7–13 years of age and their caregivers participating in outpatient treatment, to examine psychometric properties and feasibility. Results: Internal consistency and test–retest reliability were excellent, and factor analysis yielded 1 internalizing and 1 externalizing factor. Validity tests showed large and significant correlations with corresponding scales on paper-and-pencil administrations of the CBCL and YSR as well as with diagnoses obtained from a structured diagnostic interview. Discriminant validity of the BPC interviews was supported by low correlations with divergent criteria. Longitudinal data for the initial 6 months of treatment demonstrated that the BPC significantly predicted change on related measures of child symptoms. Estimates obtained from random coefficient growth models showed generally higher slope reliabilities for the BPC given weekly relative to the CBCL and YSR given every 3 months. Conclusions: Given their combination of brevity and psychometric strength, the child and caregiver BPC interviews appear to be a promising strategy for efficient, ongoing assessment of clinical progress during the course of treatment. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
15.
Change in cognitive functioning associated with ApoE genotype in a community sample of older adults.
Hofer Scott M.; Christensen Helen; Mackinnon Andrew J.; Korten Alisa E.; Jorm Anthony F.; Henderson Alexander S.; Easteal Simon 《Canadian Metallurgical Quarterly》2002,17(2):194
The influence of a genetic risk factor, apolipoprotein E (apoE) ε4 variant, was assessed in older adults aged 70 to 94 on 3 occasions over 7 years. The results of latent growth curve analyses are reported for individuals genotyped for apoE at the 2nd measurement occasion (n = 601) and for a subsample of individuals without probable or definite dementia during the 1st or 2nd occasion (n = 434). ApoE-ε4 status was a significant predictor of level and change in memory performance and change in speed performance in the full sample, and of initial level and change in memory performance in the nondemented subsample. These results support previous findings that apoE-ε4 is associated with accelerated memory deterioration in individuals without clinical dementia. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
16.
"Two forms of a 20-item test of creativity were developed through analyses of item response data of 345 engineering students at Purdue University. Three scores were developed for the test: Fluency score, Flexibility score, and Originality score. Investigations of the validity, reliability, interscorer agreement, relationships with other tests, and 'face validity' of the Creativity scores were made with 64 product development engineers and process engineers in a large automobile accessories manufacturing company." Significant validity was found (PsycINFO Database Record (c) 2010 APA, all rights reserved) 相似文献
17.
Reviews the book, "Introduction to testing and the use of test results in public schools," by Arthur E. Traxler, Robert Jacobs, Margaret Selover, and Agatha Townsend (see record 1954-01580-000). This book is designed to serve as a "practical, down-to-earth handbook for schools beginning the use of objective tests, for teacher discussion groups, for in-service training programs, for persons who have had experience with tests but who desire to brush up on the simpler fundamentals of testing, and for introductory classes in tests and measurements." This brief, nontechnical book should be distinctly useful to the groups of readers toward whom it is directed. Despite its title, the revision seems equally appropriate for public and independent schools. From the standpoint of the former, the more detailed discussions of test selection and program planning included in the revised edition should be of particular interest. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
18.
There is considerable confusion about ethnic and racial identity, multicultural constructs, and the tools available to assess them. The conceptualization and measurement of the constructs in the field also are complicated by the increasing observation that human beings have multiple, intertwined identities that influence one another in ways that are not fully understood. Measurement problems are compounded by the growing popularity of identity to the extent that theory, construct clarity, and appropriate statistical analyses are ignored. The problems could influence counselors who are confronted with their client's identity distortions and confusions. To work through a client's uncertainty about his or her identity, counselors should understand the origins of identity constructs and how the client frames his or her identity problems and confusion. Given the state of pandemonium in ethnic and racial identity, it is essential that considerations are given to the historical developments of the constructs and what they mean for contemporary research and development. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
19.
Beevers Christopher G.; Strong David R.; Meyer Björn; Pilkonis Paul A.; Miller Ivan W. 《Canadian Metallurgical Quarterly》2007,19(2):199
Despite a central role for dysfunctional attitudes in cognitive theories of depression and the widespread use of the Dysfunctional Attitude Scale, form A (DAS-A; A. Weissman, 1979), the psychometric development of the DAS-A has been relatively limited. The authors used nonparametric item response theory methods to examine the DAS-A items and develop a briefer version of the scale. Using DAS-A data obtained from depressed participants enrolled in 2 large depression treatment studies (N = 367), the authors developed a 9-item DAS form (DAS-SF1). In addition, because 2 versions of the DAS are needed for certain study designs, they also developed a 2nd short version (DAS-SF2). These short forms were highly correlated with the original 40-item DAS-A (rs ranged from .91 to .93), exhibited change similar to that of the DAS-A over the course of treatment, were moderately correlated with related self-report assessments, predicted concurrent depression severity, and predicted change in depression from before to after treatment. Taken together, the authors believe the DAS-SF1 and DAS-SF2 provide an efficient and accurate assessment of dysfunctional attitudes among depressed individuals. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
20.
This study examined the psychometric properties of the Readiness and Motivation Interview (RMI), a symptom-specific measure of readiness and motivation for change in the eating disorders. For 4 symptom domains, the RMI assesses the extent to which individuals are in precontemplation, contemplation, and action/maintenance, and the extent to which change is made for internal versus external reasons. Ninety-nine individuals with eating disorders completed the RMI and measures to assess convergent, divergent, and criterion validity. RMI profiles revealed differences in readiness and motivation across symptom domains. The RMI demonstrated good reliability and construct validity, and RMI scores predicted anticipated difficulty of recovery activities, completion of recovery activities, decision to enroll in an intensive symptom-reduction program, and treatment dropout. The RMI may have important clinical applications by providing much-needed information on client readiness for action-oriented treatment. (PsycINFO Database Record (c) 2010 APA, all rights reserved)