論文の公開元へ

書き出し

Refer/BibIX

RIS

BibTeX

TSV

Asymptotic properties of Cucconi test statistics

西野拓哉 Takuya Nishino 東京理科大学 DOI:info:doi/10.20604/00003571

2021.06.09

概要

The statistical testing hypothesis is one of the most important techniques used in nonparametric statistics. Various nonparametric statistics have been proposed and discussed for several years. The nonparametric test is a method used to evaluate a statistical hypothesis without assuming a specific distribution function. In general, researchers assume a normal distribution when they analyze experimental data. However, it is difficult to assume normality if there is a large number of outliers. The nonparametric test is beneficial when normal or specific distributions cannot be clearly assumed. There are various nonparametric test statistics, most of which are based on ranking method. Triggered by Wilcoxon (1945), researchers studied the rank- based mathod. From the late 1940s to the 1950s, many researchers studied the association with nonparametric test statistics and parametric test statistics. As a result of them, from the viewpoint of the asymptotic efficiency, nonparametric test statistics are shown to be effective as same as parametric test statistics. Since the 1960s, the nonparametric test statistics have become the most important task and have been extremely valuable. In the late 1960s, many theories underlying the nonparametric test statistics were constructed. With the development of information technology, the nonparametric test statistics have developed remarkably. A study of the statistics of non-parametric tests is actively performed, and the importance is confirmed to the present day.

If it is assumed that population distributions may differ only in location, many nonparametric tests may be used, such as Wilcoxon (1945), Mann and Whitney (1947). There are also many tests for the scale problem, such as Mood (1954), Ansari and Bradley (1960). If the scale parameters change, the test statistic for the location parameter is not useful. Similarly, if the location parameters change, the test statistic for the scale parameter is not useful. To resolve the dilemma, for example, the Lepage test (Lepage, 1971) is well-known in determining the two- sample location-scale problem. It combines the Wilcoxon (1945) and Ansari and Bradley (1960) test statistics. After the Lepage test was developed, many researchers studied combinations of test statistics, such as Pettitt (1976). In addition, many researchers investigated Lepage- type tests, such as Bu¨ning and Thadewald (2000), Neuh¨auser (2000), Bu¨ning (2002), Murakami (2007). In contrast, the test statistics for the location-scale problem were suggested by Cucconi (1968). The structure of the Cucconi test is based on the Mahalanobis distance between two rank-sum test statistics. Although the Cucconi test was developed earlier than the Lepage test, little is known about it. The explanation was published in Italian in a paper by Cucconi (1968). Marozzi (2009) pioneered the Cucconi test and determined its advantages. First, convergence to the limiting distribution is excellent when the sample sizes are almost the same in comparison with the exact critical values of the Cucconi test. Second, the Cucconi test is more powerful than the Lepage and four Podgor-Gastwirth (Podgor and Gastwithe, 1994) tests. Third, less computing is required compared with the Lepage test. Recently, the Cucconi test has been applied in various fields, including hydrology (Rutkowska and Banasik, 2016) and psychology (Marmolejo-Ramos et al., 2017). Moreover, the Cucconi test is highly valued in industrial quality control, and several control charts have been based on this test statistics; see Chowdhury et al. (2014), Mukherjee and Marozzi (2017a), Mukherjee and Marozzi (2017b).

Because we sometimes conduct analyses to determine the presence of various cumbersome data, we require techniques that correspond to these situations. Censored data is one of the most significant categories that is frequently observed in survival analysis. Censoring can be divided into types. If the event of interest has already occurred (or will occur), and the data are included this information, we call it left-censoring (or right-censoring) . For the cause to be generated, two types require classification. Type-I censored data are obtained by setting a fixed time to run the units to determine whether they survive or fail. In addition, Type-II censoring occurs if the number for taking the data is fixed. We note the testing hypothesis of Type-I left-censored and right-censored data. Epstein (1954) established the test statistics for right-censored at a fixed point and small samples under the exponential distributions based on the maximum likelihood estimation. In nonparametric test, several researchers established the two-sample nonparametric significance test for censored samples. Halperin (1960) proposed the test statistics for right-censored data based on the Mann-Whitney test. Sugiura (1963) suggested Wilcoxon-type left-censored test statistics. By focusing on the kernel function, which is used in comparing the magnitude of two observation values, Gehan (1965) proposed the single-censored test statistic. As in the other tests, the log-rank test (Peto and Peto, 1972) and a class of distance test (Pepe and Fleming, 1989) were presented.

Additionally, the nonparametric one-way layout analysis of variance (ANOVA) plays an im- portant role in biometry. The extension of the Cucconi test to multisample location-scale prob- lems was proposed by Marozzi (2014), who showed that the multisample Cucconi test was more powerful than the multisample Lepage test suggested by Rubl´ık (2005). Because the derivation of the critical value of the multisample Cucconi test is dependent on the permutation method, the amount of calculation required is enormous. However, asymptotic and limiting distributions are unknown. More recently, Murakami (2016a) presented test statistics based on the all-pair Cucconi test for multiple comparisons. To challenge the assumption of population distribution functions, many researchers have applied the tied ranking method to various nonparametric tests; see Hemelrijk (1952), Putter (1955), Paul and Mielke (1967).

In this paper, we focus on the versatility of the Cucconi test in (i) a two-sample case and (ii) a multisample case. The results of this paper are based on Nishino and Murakami (2018), Nishino and Murakami (2019a), Nishino and Murakami (2019b) and Nishino and Murakami (2020). The paper is organized as follows. First, we discuss the two-sample case. In Chapter 2, we propose a generalized two-sample Cucconi test and investigate its properties and specifications based on Nishino and Murakami (2019b). In Chapter 3, we suggest the Cucconi test for use with specific censored data, and derive the limiting distribution. Moreover, we confirm the empirical power and analyze the actual data. This chapter is based on Nishino and Murakami (2019a). Then we discuss the multisample case. In Chapter 4, we derive the null and non- null limiting distributions of the multisample Cucconi test based on Nishino and Murakami (2018). In Chapter 5, we propose the generalized multisample Cucconi test statistics for not only continuous but also discrete populations based on Nishino and Murakami (2020). Finally, in Chapter 6, we conclude the paper.

論文の公開元へ

参考文献

R. P. Agarwal, N. Elezovi´c, and J. Pecaric. On some inequalities for beta and gamma functions via some classical inequalities. Journal of Inequalities and Applications, 5: 593–613, 2005.

A. R. Ansari and R. A. Bradley. Rank-sum tests for dispersions. The Annals of Mathematical Statistics, 31: 1174–1189, 1960.

H. Bu¨ning. Robustness and power of modified Lepage, Kolmogorov-Smirnov and Cram´e r-von Mises two-sample tests. Journal of Applied Statistics, 29: 907–924, 2002.

H. Bu¨ning and T. Thadewald. An adaptive two-sample location-scale test of Lepage type for symmetric distributions. Journal of Statistical Computation and Simulation, 65: 287–310, 2000.

S. Chowdhury, A. Mukherjee, and S Chakraborti. A new distribution-free control chart for joint monitoring of location and scale parameters of continuous distributions. Quality and Reliability Engineering International, 30: 191–204, 2014.

O. Cucconi. Un nuovo test non parametrico per il confront tra due gruppi campionari. Giornale degli Econmisti Annali di Econmia, 27: 225–248, 1968.

S. S. Dragomir. Some integral inequalities of Gru¨ss type. RGMIA Research Report Collection, 1, 1998.

S. S. Dragomir, R. P. Agarwal, and N. S. Barnett. Inequalities for beta and gamma functions via some classical and new integral inequalities. Journal of Inequalities and Applications, 5: 103–165, 2000.

B. S. Duran, W. S. Tsai, and T. O. Lewis. A class of location-scale nonparametric tests. Biometrika, 63: 173–176, 1976.

C. van Eeden. Note on the consistency of some distribution-free tests for dispersion. Journal of the American Statistical Association, 59: 105–119, 1964.

B. Epstein. Truncated life tests in the exponential case. The Annals of Mathematical Statistics, 25: 555–564, 1954.

E. A. Gehan. A generalized Wilcoxon test for comparing arbitrarily singly-censored samples. Biometrika, 52: 203–224, 1965.

M. N. Goria. Some locally most powerful generalized rank tests. Biometrika, 67: 497–500, 1980.

G. Gru¨ss. U¨ ber das maximum des absoluten Betrages von 1 ∫ bf (x)g(x)dx 1b− (b − a)2 a f (x)dx ba ab g(x)dx. Mathematische Zeitschrift, 39: 215–226, 1935.

M. Halperin. Extension of the Wilcoxon-Mann-Whitney test to samples censored at the same fixed point. Journal of the American Statistical Association, 55: 125–138, 1960.

J. Hemelrijk. Note on Wilcoxon’s two-sample test when ties are present. The Annals of Mathe- matical Statistics, 23: 133–135, 1952.

W. H. Kruskal. A nonparametric test for the several sample problem. The Annals of Mathe- matical Statistics, 23: 525–540, 1952.

Y. Lepage. A combination of Wilcoxon’s and Ansari-Bradley’s statistics. Biometrika, 58: 213– 217, 1971.

H. B. Mann and D. R. Whitney. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics, 18: 50–60, 1947.

F. Marmolejo-Ramos, J. C. Correa, G. Sakarkar, G. Ngo, S. Ruiz-Fernandez, N. Butcher, and Y. Yamada. Placing joy, surprise and sadness in space: a cross-linguistic study. Psychological Research, 81: 750–763, 2017.

M. Marozzi. The Lepage location-scale test revisited. Far East Journal of Theoretical Statistics, 24: 137–155, 2008.

M. Marozzi. Some notes on the location-scale Cucconi test. Journal of Nonparametric Statistics, 21: 629–647, 2009.

M. Marozzi. The multisample Cucconi test. Statistical Methods and Applications, 23: 209–227, 2014.

M. Marozzi. Multivariate tests based on interpoint distances with application to magnetic resonance imaging. Statistical Methods in Medical Research, 25: 2593–2610, 2016.

A. M. Mood. On the asymptotic efficiency of certain nonparametric two-sample tests. The Annuals of Mathematical Statistics, 25: 514–522, 1954.

A. Mukherjee and M. Marozzi. A distribution-free phase-II cusum procedure for monitoring service quality. Total Quality Management Business Excellence, 28: 1227–1263, 2017a.

A. Mukherjee and M. Marozzi. Distribution-free Lepage type circular-grid charts for joint mon- itoring of location and scale parameters of a process. Quality and Reliability Engineering International, 33: 241–274, 2017b.

H. Murakami. Lepage type statistic based on the modified Baumgartner statistic. Computational Statistics & Data Analysis, 51: 5061–5067, 2007.

H. Murakami. Approximations to the distribution of a combination of the Wilcoxon and Mood statistics: a numerical comparison. Journal of the Japanese Society of Computational Statis- tics, 24: 1–11, 2011.

H. Murakami. All-pairs multiple comparisons based on the Cucconi test. Advances in Statistical Analysis, 100: 355–368, 2016a.

H. Murakami. A moment generating function of a combination of linear rank tests and its asymptotic efficiency. Test, 25: 674–691, 2016b.

M. Neuh¨auser. An exact two-sample test based on the Baumgartner-weiß-Schindler statistic and a modification of Lepage’s test. Communications in Statistics-Theory and Methods, 29: 67–78, 2000.

T. Nishino and H. Murakami. The null and non-null limiting distributions of the modified multisample Cucconi test. Statistics, 52: 1344–1358, 2018.

T. Nishino and H. Murakami. The Cucconi statistic for Type-I censored data. Metrika, 82: 903–929, 2019a.

T. Nishino and H. Murakami. The generalized Cucconi test statistic for the two-sample problem. Journal of the Korean Statistical Society, 48: 593–612, 2019b.

T. Nishino and H. Murakami. The generalized multisample Cucconi test statistic for the location and scale parameters. Journal of Statistical Computation and Simulation, 90: 2291–2305, 2020.

W. Paul and J. R. Mielke. Note on some squared rank tests with existing ties. Technometrics, 9: 312–314, 1967.

M. S. Pepe and T. R. Fleming. Weighted Kaplan-Meier statistics: a class of distance tests for censored survival data. Biometrics, 45: 497–507, 1989.

R. Peto and J. Peto. Asymptotically efficient rank invariant test procedures. Journal of the Royal Statistical Society. Series A (General), 135: 185–198, 1972.

A. N. Pettitt. A two-sample Anderson-Darling rank statistic. Biometrika, 63: 161–168, 1976.

M. J. Podgor and J. L. Gastwithe. On non-parametric and generalized tests for the two-sample problem with location and scale change alternatives. Statistics in Medicine, 13: 747–758, 1994.

J. Putter. The treatments of ties in some nonparametric tests. The Annals of Mathematical Statistics, 26: 368–386, 1955.

C. R. Rao and S. K. Mitra. Generalized inverse of matrices and its applications. Wiley New York, 1971.

F. Rubl´ık. The multisample version of the Lepage test. Kybernetika, 41: 713–733, 2005.

F. Rubl´ık. On the asymptotic efficiency of the multisample location-scale rank tests and their adjustment for ties. Kybernetika, 43: 279–306, 2007.

A. Rutkowska and K. Banasik. The Cucconi test for location-scale alternatives in application to asymmetric hydrological variables. Communiation in Statistics–Simulations and Compu- tation, 45: 1–15, 2016.

N. Sugiura. On a generalization of the Wilcoxon test for censored data. Osaka Mathematical Journal, 15: 257–268, 1963.

F. Wilcoxon. Individual comparisons by ranking methods. Biometrics, 1: 80–83, 1945.

参考文献をもっと見る

分野

大学

学位論文種類・取得年

言語

Asymptotic properties of Cucconi test statistics

概要

関連論文

A Geometry-Based Multiple Testing Correction for Contingency Tables by Truncated Normal Distribution

Profile analysis and tests for mean vectors with two-step monotone missing data

Automated sleep stage scoring employing a reasoning mechanism and evaluation of its explainability

Exploratory assessment of treatment-dependent random-effects distribution using gradient functions

Improving the Efficiency of Hedge Trading Using Higher-Order Standardized Weather Derivatives for Wind Power

参考文献

分野

大学

学位論文種類・取得年

言語

コピーが完了しました

URLをコピーしました

Asymptotic properties of Cucconi test statistics

概要

関連論文

A Geometry-Based Multiple Testing Correction for Contingency Tables by Truncated Normal Distribution

Profile analysis and tests for mean vectors with two-step monotone missing data

Automated sleep stage scoring employing a reasoning mechanism and evaluation of its explainability

Exploratory assessment of treatment-dependent random-effects distribution using gradient functions

Improving the Efficiency of Hedge Trading Using Higher-Order Standardized Weather Derivatives for Wind Power

参考文献