Katon W,Sullivan M D.Depression and chronic medical illness.Journal of Clinical Psychiatry,1990,51 Suppl:3-11.

Kelderman H,Rijkes C P M.Loglinear multidimensional IRT models for polytomously scored items.Psychometrika,1994,59(2):149-176.

Kim H,Plake B S.Monte Carlo simulation comparison of two-stage testing and computerized adaptive testing.Paper presented at the meeting of the National Council on Measurement in Education,1993.

Kim J,Chung H,Dodd B G,Park R.Panel Design Variations in the Multistage Test Using the Mixed-Format Tests.Educational and Psychological Measurement,2012,72(4):574-588.

Kim S.A comparative study of IRT fixed parameter calibration methods.Journal of Educational Measurement,2006,43(4):355-381.

Kingsbury G G.Adaptive item calibration:A process for estimating item parameters within a computerized adaptive test.In the GMAC conference on computerized adaptive testing,2009.

Kingsbury G G.Item review and adaptive testing.Paper presented at the annual meeting of the National Council on Measurement in Education,1996.

Kolen M J,Brennan R L.Test equating,scaling,and linking:Methods and practices(2nd ed.).New York,NY:Springer-Verlag,2004.

Laux L,Glanzmann P,Schaffner P,Spielberger C D.Das State-Trait-Angstinventar.Beltz Test GmbH,G?ttingen,1981.

Lee Y H,Ip E H,Fuh C D.A Strategy for Controlling Item Exposure in Multidimensional Computerized Adaptive Testing.Educational and Psychological Measurement,2008,68(2):215-232.

Leighton J P,Gierl M J,Hunka S M.The attribute hierarchy method for cognitive assessment:A variation on Tatsuoka’s rule-space approach.Journal of Educational Measurement,2004,41(3):205-237.

Leung C K,Chang H H,Hau K T.Item selection in computerized adaptive testing:Improving the a-stratified design with the Sympson-Hetter algorithm.Applied Psychological Measurement,2002,26(4):376-392.

Lewis C,Sheehan K.Using Bayesian Decision Theory to Design a Computerized Mastery Test.Applied Psychological Measurement,1990,1990(2):i-8.

Li Y H,Schafer W D.Trait parameter recovery using multidimensional computerized adaptive testing in reading and mathematics.Applied Psychological Measurement,2005,29(1):3-25.

Linden W J v d,Glas G A W.Computerized adaptive testing:Theory and practice.New York:Kluwer Academic Publishers,2000.

Linden W J,Adema J J.Simultaneous assembly of multiple test forms.Journal of Educational Measurement,1998,35(3):185-198.

Liu H Y,You X F,Wang W Y,Ding S L,Chang H H.The Development of Computerized Adaptive Testing with Cognitive Diagnosis for an English Achievement Test in China.Journal of Classification,2013,30(2):152-172.

Lord F M.Estimating norms by item-sampling.Educational and Psychological Measurement,1962,22(2):259-267.

Lord F M,Novick M R.Birnbaum A.Statistical Theories of Mental Test Scores.American Educational Research Journal,1968,6(1):112.

Luecht R M.Multidimensional computerized adaptive testing in a certification or licensure context.Applied Psychological Measurement,1996,20(4):389-404.

Luecht R M.Computer-assisted test assembly using optimization heuristics.Applied Psychological Measurement,1998,22(3):224-236.

Luecht R M.Exposure control using adaptive multistage item bundles.Adaptive Testing,2003.

Luecht R M,Nungester R J.Some practical examples of computer-adaptive sequential testing.Journal of Educational Measurement,1998,35(3):229-249.

Luecht R M,Brumfield T,Breithaupt K.A testlet assembly design for adaptive multistage tests.Applied Measurement in Education,2006,19(3):189-202.

MacCallum R C,Browne M W,Sugawara H M.Power analysis and determination of sample size for covariance structure modeling.Psychological Methods,1996,1(2):130-149.

Macready G B,Dayton C M.The use of probabilistic models in the assessment of mastery.Journal of Educational Statistics,1977,2(2):99-120.

Makransky G,Glas C A W.An automatic online calibration design in adaptive testing.Journal of Applied Testing Technology,2010,11(1):29.

Mao X Z,Xin T.The application of the monte carlo approach to cognitive diagnostic computerized adaptive testing with content constraints.Applied Psychological Measurement,2013,37(6):482-496.

Maris E.Estimating multiple classification latent class models.Psychometrika,1999,64(2):187-212.

Masters G N.A rasch model for partial credit scoring.Psychometrika,1982,47(2),149-174.

Marveled J M,Glas C A,Landeghem G V,Damme J V.Application of multidimensional item response theory models to longitudinal data.Educational and Psychological Measurement,2006,66(1):5-34.

Masters G N,Wright B D.The essential process in a family of measurement models.Psychometrika,1984,49(4):529-544.

McBride J R.Research antecedents of applied adaptive testing.Washington,DC,US:American Psychological Association,1997:47-57.

McDonald R P.Future directions for item response theory.International Journal of Educational Research,1989,13(2):205-220.

McGlohen M,Chang H H.Combining computer adaptive testing technology with cognitively diagnostic assessment.Behavior Research Methods,2008,40(3):808-821.

McGrath R E,Terranova R,Pogge D L,Kravic C.Development of a short form for the MMPI-2 based on scale elevation congruence.Assessment,2003,10(1):13-28.

McKinley R L,Reckase M D.The Use of the General Rasch Model with Multidimensional Item Response Data.Goodness of Fit,1982:38.

Mead A D.An Introduction to Multistage Testing.Applied Measurement in Education,2006,19(3):185-187.

Meijer R R,Nering M L.Trait level estimation for nonfitting response vectors.Applied Psychological Measurement,1997,21(4):321-336.

Meijer R R,Nering M L.Computerized adaptive testing:Overview and introduction.Applied Psychological Measurement,1999,23(3):187-194.

Meyer T D,Hautzinger M.Allgemeine Depressions-Skala(ADS).Diagnostica,2001.

Michael C E,David B F,David T.Multistage Computerized Adaptive Testing With Uniform Item Exposure.Applied Measurement in Education,2012,25(2):118-141.

Mislevy R J.Foundations of a new test theory.Ets Research Report Series,1982,1982(2):i-32.

Mulder J,van der Linden W J.Multidimensional adaptive testing with optimal design criteria for item selection.Psychometrika,2009,74(2):273-296.

Mulder J,van der Linden W J.Multidimensional adaptive testing with Kullback-Leibler information item selection.New-York:Springer Science Business Media,2009.

Muraki E.A generalized partial credit model:Application of an EM algorithm.Applied Psychological Measurement,1992,1992(1):i-30.

Muraki E,Carlson J E.Full-information factor analysis for polytomous item responses.Applied Psychological Measurment,1995,19(1):73-90.

Muraki E,Engelhard G.Full-Information Item Factor Analysis:Applications of EAP Scores.Applied Psychological Measurement,1985,9(4):417-430.

Muthén L K,Muthén B O.Mplus user’s guide:The comprehensive modeling program for applied researchers,2000.

Muthny F A.Lebenszufriedenheit bei koronarer Herzkrankheit:ein Vergleich mit anderen lebensbedrohlichen Erkrankungen.Lebensqualit?t bei kardiovaskul?ren Erkrankungen.Grundlagen,Messverfahren und Ergebnisse.Hogrefe,G?ttingen,1991:196-210.

Nagelkerke N J D.A note on a general definition of the coefficient of determination.Biometrika,1991,78(3):691-692.

Nichols P D.A Framework for Developing Cognitively Diagnostic Assessments.Review of Educational Research,1994,64(4):575-603.

Nunnally J C.Psychometric theory.McGraw-Hill,New York,1978.

Olea J O,Revuelta J,Ximénez M C,Abad F J.Psychometric and psychological effects of review on computerized fixed and adaptive tests.Psicológica,2000,21(1):157-173.

Olsen L R,Jensen D V,Noerholm V,Martiny K,Bech P.The internal and external validity of the Major Depression Inventory in measuring severity of depressive states.Psychological Medicine,2003,33(2):351-356.

Orlando M,Sherbourne C D,Thissen D.Summed-score linking using item response theory:application to depression measurement.Psychological Assessment,2000,12(3):354-359.

Osman A,Downs W R,Barrios F X,Kopper B A,Gutierrez P M,et al.Factor structure and psychometric characteristics of the beck depression inventory-II.Journal of Psychopathology and Behavioral Assessment,1997,19(4):359-376.

Owen R J.A bayesian sequential procedure for quantal response in the context of adaptive mental testing.Journal of the American Statistical Association,1975,70:351-356.

Papanastasiou E C.A ‘rearrangement procedure’ for scoring adaptive tests with review options.Paper presented at the the National Council of Measurement in Education,New Orleans,LA,2002.

Papanastasiou E C,Reckase M D.A ‘rearrangement procedure’ for scoring adaptive tests with review options.International Journal of Testing,2008:387-407.

Patsula L N,Hambleton R K.A comparative study of ability estimates from computer-adaptive testing and multi-stage testing.Paper presented at the annual meeting of the National Council on Measurement in Education,1999.

Quellmalz E S,Pellegrino J W.Technology and Testing.Science,2009,323(5910):75-79.

Quilty L C,Zhang K A,Bagby R M.The latent symptom structure of the Beck Depression Inventory-II in outpatients with major depression.Psychological Assessment,2010,22(3):603-608.

Ramsay J O.TestGraf A Program for the Graphical Analysis of Multiple Choice Test and Questionnaire Data.Montreal:McGill University,1995.

Reckase M D.Unifactor latent trait models applied to multifactor tests:Results and implications.Journal of Educational and Behavioral Statistics,1979,4(3):207-230.

Reckase M D.The difficulty of test items that measure more than one ability.Applied Psychological Measurement,1985,9(4):401-412.

Reckase M D.Multidimensional item response theory.New York:Springer,2009.

Reckase M D,McKinley R L.Some Latent Trait Theory in a Multidimensional Latent Space.Iowa City,IA:American College Service,1982.

Reckase M D,Mckinley R L.The discriminating power of items that measure more than one dimension.Applied Psychological Measurement,1991,15(4):361-373.

Reise S P,Morizot J,Hays R D.The role of the bifactor model in resolving dimensionality issues in health outcomes measures.Quality of Life Research,2007,16(1):19-31.

Robin F,Steffen M.Test Design for the GRE Revised General Test.In Wendler C,Bridgeman B(Eds.).The Research Foundation for the GRE revised General Test:A Compendium of Studies.Princeton,NJ:Educational Testing Service,2014:132-143.

Robin F,Steffen M,Liang L.The Multistage Test Implementation of the GRE Revised General Test.Educational Testing Service,2014.

Rose M,Hess V,Scholler G,Br?hler E,Klapp B F.Mobile computerized psychometrical diagnostics - results concerning economic benefit and test reliability.PPmP - Psychotherapie · Psychosomatik · Medizinische Psychologie,1999,49:202-207.

Samejima F.Estimation of latent ability using a response pattern of graded scores.Psychometrika,1968,1968(1):i-169.

Samejima F.Normal ogive model on the continuous response level in the multidimensional latent space.Psychometrika,1974,39(1):111-121.

Santor D A,Coyne J C.Examining symptom expression as a function of symptom severity:item performance on the hamilton rating scale for depression.Psychological Assessment,2001,13(1):127-139.

Schoeneich F,Rose M,Danzer G,Thier P,Weber C,Klapp B F.Narzissmusinventar-90(NI-90).Psychother Psych Med,2000,50(9/10):396-405.

Scholler G,Fliege H,Klapp B F.Fragebogen zu Selbstwirksamkeit,Optimismus und Pessimismus:Restrukturierung,Itemselektion und Validierung eines Instruments an Untersuchungen Klinischer Stichproben.Psychotherapie Psychosomatik Medizinische Psychologie,1999,49(8):275-283.

Segall D O.Multidimensional Adaptive Testing.Psychometrika,1996,61(2):331-354.

Segall D O.Calibrating CAT pools and online pretest items using MCMC methods.Paper presented at the annual meeting of the National Council on Measurement in Education,2003.

Segall D O.Principles of multidimensional adaptive testing.New York:Springer Science Business Media,2010.

Seo D G.Application of the Bifactor Model to Computerized Adaptive Testing.Dissertations & Theses-Gradworks,2011:228.

Seo D G.Application of the Bifactor Model to Computerized Adaptive Testing.Dissertations & Theses-Gradworks,2011:228.

Shannon C E.A mathematical theory of communication.ACM SIGMOBILE Mobile Computing and Communications Review,2001,5(1):3-55.

Shannon C E.A Mathematical Theory of Communication.Bell System Technical Journal,1948,27(3):379-423.

Silvey S D.Optimal Design.London:Chapman and Hall,1980.

Snow R E,Mandinach E B.Integrating assessment and instruction:A research and development agenda.ETS Research Report Series,1991,1991(1):i-176.

Steven L,Wise Sara J.Examinee Judgments of Changes in Item Difficulty:Implications for Item Review in Computerized Adaptive Testing.Applied Psychological Measurement,1999,12(2):185-198.

Stocking M L.Scale drift in on-line calibration.Princeton,NJ:Educational Testing Service,1988,1988(1):i-122.

Stocking M L.An alternative method for scoring adaptive tests.Journal of Educational and Behavioral Statistics,1996,21(4):365-389.

Stocking M L,Lewis C.Controlling item exposure conditional on ability in computerized adaptive testing.Journal Educational and Behavioral Statistics,1998,23(1):57-75.

Stocking M L,Steffen M,Eignor D R.An exploration of potentially problematic adaptive tests.Princeton,NJ:Educational Testing Service,2002.

Stocking M L.Revising answers to item responses in computerized adaptive tests:A comparison of three models.Applied Psychological Measurement,1997,21(2):129-142.

Su Y H.A Comparison of Constrained Item Selection Methods in Multidimensional Computerized Adaptive Testing.Applied Psychological Measurement,2016,40(5).

Swaminathan H,Rogers H J.Detecting differential item functioning using logistic regression procedures.Journal of Educational Measurement,1990,27(4):361-370.

Swanson L,Stocking M L.A model and heuristic for solving very large item selection problems.Applied Psychological Measurement,1993,17(2):151-166.

Sympson J B,Hetter R D.Controlling item-exposure rates in computerized adaptive testing.Proceedings of the 27th Annual Meeting of the Military Testing Association,1985.

Takane Y,Leeuw J D.On the relationship between item response theory and factor analysis of discretized variables.Psychometrika,1987,52(3):393-408.

Tam S S.A comparison of methods for adaptive estimation of a multidimensional trait.New York:Columbia University,1992.

Tambs K,Moum T.How well can a few questionnaire items indicate anxiety and depression?.Acta Psychiatrica Scandinavica,1993,87(5):364-367.

Tatsuoka C.Data analytic methods for latent partially ordered classification models.Journal of the Royal Statistical Society,2002,51(3):337-350.

Tatsuoka C,Ferguson T.Sequential classification on partially ordered sets.Journal of the Royal Statistics,2003,65(1):143-157.

Tatsuoka K K.Rule space:An approach for dealing with misconceptions based on item response theory.Journal of Educational Measurement,1983,20(4):345-354.

Tatsuoka K K.Toward an integration of item-response theory and cognitive error diagnoses.In Frederiksen N,Glaser R,Lesgold A,Shafto M C(Eds.).Diagnostic Monitoring of Skill and Knowledge Acquisition,1990:543-588.

Tatsuoka K K.Boolean algebra applied to determination of universal set of knowledge states.ETS Research Report Series,1991,1991(2):i-36.

Tatsuoka K K,Tatsuoka M M.Computerized cognitive diagnostic adaptive testing:effect on remedial instruction as empirical validation.Journal of Educational Measurement,1997,34(1):3-20.

Tatsuoka K K.Architecture of knowledge structures and cognitive diagnosis:a statistical pattern recognition and classification approach.Cognitively Diagnostic Assessments,1995:327-359.

Tay,PoH Hua.On-the-Fly Assembled Multistage Adaptive Testing.Applied Psychological Measurement,2015,39(2):104-118.

Thissen D,Wainer H.Test scoring.Mahwah,NJ:Lawrence Erlbaum Associates Publishers,2001.

U.S.House of Representatives.No Child Left Behind Act of 2001,2001.

van der Linden W J.Optimal assembly of psychological and educational tests.Applied Psychological Measurement,1998,22(3):195-211.

van der Linden W J.Multidimensional adaptive testing with a minimum error-variance criterion.Journal of Educational and Behavioral Statistics,1999,24(4):398-412.

van der Linden W J.Linear models for optimal test design.New York:Springer,2005.

van der Linden W J,Chang H H.Implementing content constraints in alpha-stratified adaptive testing using a shadow test approach.Applied Psychological Measurement,2003,27(2):107-120.

van der Linden W J,Glas C A W.Computerized adaptive testing:Theory and practice.Boston,MA:Kluwer.Academic Publishers,2010.

van der Linden W J,Hambleton R K.Handbook of modern item response theory.New York:Springer-Verlag,1997.

van der Linden W J,Hambleton R K.Handbook of Modern Item Response Theory.Berlin:Springer,1996.

van der Linden W J,Jeon M.Modeling answer changes on test items.Journal of Educational and Behavioral Statistics,2012,37(1):180-199.

van der Linden W J,Ren H.Optimal Bayesian Adaptive Design for Test-Item Calibration.Psychometrika,2015,80(2):263-288.

Veldkamp B P,van der Linden W J.Multidimensional adaptive testing with constraints on test content.Psychometrika,2002,67(4):575-588.

Vispoel W P,Clough S J,Bleiler T.A closer look at using judgments of item difficulty to change answers on computerized adaptive tests.Journal of Educational Measurement,2005,42(4):331-350.

Vispoel W P,Clough S J,Bleiler T,Hendrickson A B,Ihrig D.Can examinees use judgments of item difficulty to improve proficiency estimates on computerized adaptive vocabulary tests?.Journal of Educational Measurement,2002,39(4):311-330.

Vispoel W P,Henderickson A B,Bleiler T.Limiting Answer Review and Change on Computerized Adaptive Vocabulary Tests:Psychometric and Attitudinal Results.Journal of Educational Measurement,2000,37(1):21-38.

Vispoel W P,Rocklin T R,Wang R,Bleiler T.Can examinees use a review option to obtain positively biased ability estimates on a computerized adaptive test?.Journal of Educational Measurement,1999,36:141-157.

Waddell D L,Blankenship J C.Answer changing:A meta-analysis of the prevalence and patterns.Journal of Continuing Education in Nursing,1994,25(4):155-158.

Wainer H.Computerized Adaptive Testing:A Primer,2nd ed.Hillsdale,NJ:Erlbaum,2000.

Wainer H,Kiely G L.Item clusters and computerized adaptive testing:A case for testlets.Journal of Educational Measurement,1987,24(3):185-201.

Wainer H,Mislevy R J.Item Response Theory,Item Calibration and Proficiency Estimation.Bioresource Technology,1990.

Wainer H,Bradlow E T,Du Z.Computerized adaptive testing:Theory and practice.Kluwer Academic Publishers,2002:245-269.

Wainer H,Dorans N J,Green B F,Steinberg L,Flaugher R,Mislevy R J,Thissen D.Computerized adaptive testing:A primer(second edition).Quality of Life Research,2001,10(8):733-734.

Wainer H.Some practical considerations when converting a linearly administered test to an adaptive format.Educational Measurement:Issues and Practice,1993,12(1):15-20.

Wang C.Multidimensional computerized adaptive testing:Early development and recent advancements.In Cheng Y,Chang H H(Eds.).Advances in modern international testing:Transition from summative to formative assessment.Charlotte,NC:Information Age,2014.

Wang C.On Latent Trait Estimation in Multidimensional Compensatory Item Response Models.Psychometrika,2015,80(2):428-449.

Wang C,Chang H H.Kullback-Leibler information in multidimensional adaptive testing:theory and application.University of Illinois at Urbana-Champaign,2009.

Wang C,Chang H H.Item selection in multidimensional computerized adaptive testing-gaining information from different angles.Psychometrika,2011,76(3):363-384.

Wang C,Chang H H,Boughton K A.Kullback-Leibler information and its applications in multidimensional adaptive testing.Psychometrika,2011,76(1):13-39.

Wang C,Chang H H,Boughton K A.Deriving stopping rules for multidimensional computerized adaptive testing.Applied Psychological Measurement,2013,37(2):99-122.

Wang C,Chang H H,Douglas J.Combining CAT with cognitive diagnosis:A weighted item selection approach.Behavior Research Methods,2012,44(1):95-109.

Wang C,Chang H H,Huebner A.Restrictive stochastic item selection methods in cognitive diagnostic computerized adaptive testing.Journal of Educational Measurement,2011,48(3):255-273.

Wang S.The accuracy of ability estimation methods for computerized adaptive testing using the generalized partial credit model.University of Pittsburgh,1999.

Wang S D,Wang T Y.Precision of Warm’s Weighted Likelihood Estimates for a Polytomous Model in Computerized Adaptive Testing.Applied Psychological Measurement,2001,25(4):317-331.

Wang S,Zheng Y,Zheng C,Su Y H,Li P.An Automated Test Assembly Design for a Large-Scale Chinese Proficiency Test.Applied Psychological Measurement,2016,40(3):233-237.

Wang W C.Multidimensional Rasch models:Theories and applications.In Cheng Y,Chang H H(Eds.).Advances in modern international testing:Transition from summative to formative assessment.Charlotte,NC:Information Age,2014.

Wang W C,Chen P H.Implementation and measurement efficiency of multidimensional computerized adaptive testing.Applied Psychological Measurement,2004,28(5):295-316.

Ward L C.Comparison of factor structure models for the Beck Depression Inventory-II.Psychological Assessment,2006,18(1):81-88.

Ware J J,Bjorner J B,Kosinski M.Practical implications of item response theory and computerized adaptive testing:a brief summary of ongoing studies of widely used headache impact scales.Medical Care,2000,38(9 Suppl):73-82.

Warm T A.Weighted likelihood estimation of ability in item response theory.Psychometrika,1989,54(3):427-450.

Weiss D J,Gibbons R D.Computerized adaptive testing with the bifactor model.In Weiss D J(Ed.).Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing,2007.

Weissman A.IRT-Based Multistage Testing.(in):Yan D I,Davier A v,Lewis C(Eds.).Computerized Multistage Testing:Theory and Applications.New Jersey,Princeton:Educational Testing Service,2014:153-168.

Whitely S E.Multicomponent latent trait models for ability tests.Psychometrika,1980,45(4):479-494.

Wise S L,Finney S J,Enders C K,Freeman S A,Severance D D.Examinee judgments of changes in item difficulty:Implications for item review in computerized adaptive testing.Applied Measurement in Education,1999,12(2):185-198.

Xu X,Douglas J.A simulation study to compare CAT strategies for cognitive diagnosis.Paper presented at the Annual Meeting of the American Educational Research Association,2003.

Yan D,Davier A A v,Lewis C.Computerized Multistage Testing:Theory and Applications.Boca Raton:CRC Press,2014.

Yang X,Embretson S.Construct validity and cognitive diagnostic assessment.In Leighton J P,Gierl M J(Eds.).Cognitive Diagnostic Assessment for Education:Theory and Applications.Cambridge:Cambridge University Press,2007.

Yang Y B,Sun Y F,Zhang Y,Jiang Y,et al.Bifactor Item Response Theory Model of Acute Stress Response.PLoS One,2013,8(6).

Yao L H.Multidimensional CAT item selection methods for domain scores and composite scores:Theory and applications.Psychometrika,2012,77(3):495-523.

Yao L H.Comparing the performance of five multidimensional CAT selection procedures with different stopping rules.Applied Psychological Measurement,2013,37(1):3-23.

Yao L H.Multidimensional item response theory for score reporting.In Cheng Y,Chang H H(Eds.).Advances in modern international testing:Transition from summative to formative assessment.Charlotte,NC:Information Age,2014.

Yao L H.Multidimensional CAT item selection methods for domain scores and composite scores with item exposure control and content constrains.Journal of Educational Measurement,2014,51(1):18-38.

Yao L H,Schwarz R D.A multidimensional partial credit model with associated item and test statistics:An application to mixed-format tests.Applied Psychological Measurement,2006,30(6):469-492.

Yen W M.Scaling performance assessments:Strategies formanaging local item dependence.Journal of Educational Measurement,1993,30:187-214.

Yen Y C,Ho R G,Liao W W,Chen L J.Reducing the Impact of Inappropriate Items on Reviewable Computerized Adaptive Testing.Educational Technology & Society,2012,15(2):231-243.

Yi Q,Chang H H.a-Stratified CAT design with content blocking.British Journal of Mathematical and Statistical Psychology,2003,56(2):359-378.

Ying Z L,Wu C F J.An asymptotic theory of sequential designs based on maximum likelihood recursions.Statistica Sinica,1997,7(1):75-92.

Zhang B,Stone C A.Evaluating item fit for multidimensional item response models.Educational and Psychological Measurement,2008,68(2):181-196.

Zheng C.Some practical item selection algorithms in cognitive diagnostic computerized adaptive testing:smart diagnosis for smart learning.University of Illinois at Urbana-Champaign,2015.

Zheng Y.New Methods of Online Calibration for Item Bank Replenishment.University of Illinois at Urbana-Champaign,2014.

Zheng Y.Exploring online calibration of polytomous items in computerized adaptive testing.Paper presented at the 80th Annual Meeting of the Psychometric Society,2015.

Zheng Y.Online calibration of polytomously scored items.Applied Psychological Measurement,2016,40(6):434-450.

Zheng Y,Chang C H,Chang H H.Content-balancing strategy in bifactor computerized adaptive patient-reported outcome measurement.Quality of Life Research,2012,22(3):491-499.

Zheng Y,Chang C H,Chang H H.Content-balancing strategy in bifactor computerized adaptive patient-reported outcome measurement.Quality of Life Research,2013,22(3):491-499.

Zheng Y,Chang H H.On-the-Fly Assembled Multistage Adaptive Testing.Applied Psychological Measurement,2015,39(2):104-118.

Zheng Y,Wang C,Culbertson M J,Chang H H.Overview of Test Assembly Methods in Multistage Testing.In Yan D l,Davier A V,Lewis C(Eds.).Computerized multistage testing:Theory and applications,Chapman and Hall/CRC,2014:87-99.

Zhu R C.Implementation of Optimal Design for Item Calibration in Computerized Adaptive Testing(cat).University of Illinois at Urbana-Champaign,2006.

Zumbo B D.A Handbook on the Theory and Methods of Differential Item Functioning(DIF):Logistic Regression Modeling as a Unitary Framework for Binary and Likert-Type(Ordinal)Item Scores.Ottawa National Defense Headquarters,1999.