Estimate-based goodness-of-fit test for large sparse multinomial distributions

Cited 7 time in webofscience Cited 0 time in scopus
  • Hit : 717
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Sung-Hoko
dc.contributor.authorChoi, Hyemiko
dc.contributor.authorLee, Sangjinko
dc.date.accessioned2013-03-11T06:49:59Z-
dc.date.available2013-03-11T06:49:59Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2009-02-
dc.identifier.citationCOMPUTATIONAL STATISTICS DATA ANALYSIS, v.53, no.4, pp.1122 - 1131-
dc.identifier.issn0167-9473-
dc.identifier.urihttp://hdl.handle.net/10203/98575-
dc.description.abstractThe Pearson's chi-squared statistic (X(2)) does not in general follow a chi-square distribution when it is used for goodness-of-fit testing for a multinomial distribution based on sparse contingency table data. We explore properties of [Zelterman, D., 1987. Goodness-of-fit tests for large sparse multinomial distributions. J. Amer. Statist. Assoc. 82 (398), 624-629] D(2) statistic and compare them with those of X(2) and compare the power of goodness-of-fit test among the tests using D(2), X(2), and the statistic (L,) which is proposed by [Maydeu-Olivares, A., Joe, H., 2005. Limited- and full-information estimation and goodness-of-fit testing in 2(n) contingency tables: A unified framework. J. Amer. Statist. Assoc. 100 (471), 1009-1020] when the given contingency table is very sparse. We show that the variance of D(2) is not larger than the variance of X(2) under null hypotheses where all the cell probabilities are positive, that the distribution of D(2) becomes more skewed as the multinomial distribution becomes more asymmetric and sparse, and that, as for the L(r) statistic, the power of the goodness-of-fit testing depends on the models which are selected for the testing. A simulation experiment strongly recommends to use both D(2) and L, for goodness-of-fit testing with large sparse contingency table data. (C) 2008 Elsevier B.V. All rights reserved.-
dc.languageEnglish-
dc.publisherELSEVIER SCIENCE BV-
dc.subjectCONTINGENCY-TABLES-
dc.subjectCHI-SQUARE-
dc.subjectLIMITED-INFORMATION-
dc.subjectSTATISTICS-
dc.titleEstimate-based goodness-of-fit test for large sparse multinomial distributions-
dc.typeArticle-
dc.identifier.wosid000263626700028-
dc.identifier.scopusid2-s2.0-58549099646-
dc.type.rimsART-
dc.citation.volume53-
dc.citation.issue4-
dc.citation.beginningpage1122-
dc.citation.endingpage1131-
dc.citation.publicationnameCOMPUTATIONAL STATISTICS DATA ANALYSIS-
dc.identifier.doi10.1016/j.csda.2008.10.011-
dc.contributor.localauthorKim, Sung-Ho-
dc.contributor.nonIdAuthorChoi, Hyemi-
dc.contributor.nonIdAuthorLee, Sangjin-
dc.type.journalArticleArticle-
dc.subject.keywordPlusCONTINGENCY-TABLES-
dc.subject.keywordPlusCHI-SQUARE-
dc.subject.keywordPlusLIMITED-INFORMATION-
dc.subject.keywordPlusSTATISTICS-
Appears in Collection
MA-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 7 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0