Akaike, H. (1998). Information theory and an extension of the maximum likelihood principle. in Selected papers of hirotugu akaike: Springer, Berlin, pp. 199–213.
Anderson, T. W. (1971). The statistical analysis of time series: John Wiley & Sons, New York.
Belloni, A. and Chernozhukov, V. (2011). ℓ1-penalized quantile regression in high-dimensional sparse models. The Annals of Statistics, Vol. 39, pp. 82–130.
Ben-Tal, A., El Ghaoui, L., and Nemirovski, A. (2009). Robust optimiza- tion: Princeton University Press.
Beran, J. (1992). Statistical methods for data with long-range dependence.Statistical Science, pp. 404–416.
Birge, J. R. and Louveaux, F. (2011). Introduction to stochastic program- ming : Springer Science & Business Media, Berlin.
Birr, S., Volgushev, S., Kley, T., Dette, H., and Hallin, M. (2017). Quantile spectral analysis for locally stationary time series. Journal of the Royal Statistical Society Series B.
Brillinger, D. R. (1981). Time series: data analysis and theory : Holden- Day, San Francisco.
Brockwell, P. J. and Davis, R. A. (2009). Time Series: Theory and Meth- ods: Springer, Berlin.
Calafiore, G. and Dabbene, F. (2006). Probabilistic and randomized meth- ods for design under uncertainty : Springer, Berlin.
Candes, E. and Tao, T. (2007). The Dantzig selector: Statistical estimation when p is much larger than n. The Annals of Statistics, pp. 2313–2351.
Chen, W. and Sim, M. (2009). Goal-driven optimization. Operations Re- search, Vol. 57, pp. 342–357.
Cheng, R., Miamee, A. G., and Pourahmadi, M. (1998). Some extremal problems in Lp(ω). Proceedings of the American Mathematical Society, Vol. 126, pp. 2333–2340.
Cox, D. R. (1961). Prediction by exponentially weighted moving averages and related methods. Journal of the Royal Statistical Society: Series B, Vol. 23, pp. 414–422.
Dehling, H. and Taqqu, M. S. (1989). The empirical process of some long- range dependent sequences with an application to U-statistics. The An- nals of Statistics, pp. 1767–1783.
Dette, H., Hallin, M., Kley, T., Volgushev, S. et al. (2015). Of copu- las, quantiles, ranks and spectra: An L1-approach to spectral analysis. Bernoulli, Vol. 21, pp. 781–831.
Duren, P. L. (1970). Theory of Hp spaces: Academic Press New York.
Dzhaparidze, K. (1986). Parameter estimation and hypothesis testing in spectral analysis of stationary time series: Springer, Berlin.
Franke, J. (1985). Minimax-robust prediction of discrete time series. Zeitschrift fu¨r Wahrscheinlichkeitstheorie und Verwandte Gebiete, Vol. 68, pp. 337–364.
Granger, C. and Joyeux, R. (1980). An introduction to long-memory time series models and fractional differencing. Journal of Time Series Anal- ysis, Vol. 1, pp. 15–29.
Grenander, U. and Rosenblatt, M. (1954). An extension of a theorem of G. Szego¨ and its application to the study of stochastic processes. Transac- tions of the American Mathematical Society, Vol. 76, pp. 112–126.
Hallin, M., Taniguchi, M., Serroukh, A., and Choy, K. (1999). Local asymp- totic normality for regression models with long-memory disturbance. The Annals of Statistics, Vol. 27, pp. 2054–2080.
Hannan, E. J. (1970). Multiple Time Series: Wiley, New York.
Hastie, T., Tibshirani, R., and Friedman, J. (2009a). The elements of sta- tistical learning: data mining, inference, and prediction: Springer Sci- ence & Business Media, Berlin.(2009b). Overview of supervised learning. in The elements of sta- tistical learning : Springer, Berlin, pp. 9–41.
Hjort, N. L. and Jones, M. C. (1996). Locally parametric nonparametric density estimation. Annals of Statistics, pp. 1619–1647.
Hosoya, Y. (1978). Robust linear extrapolations of second-order stationary processes. The Annals of Probability, Vol. 6, pp. 574–584.(1982). Harmonizable stable processes. Zeitschrift fu¨r Wahrscheinlichkeitstheorie und Verwandte Gebiete, Vol. 60, pp. 517–533.
Hunt, R. A. (1968). On the convergence of Fourier series. in Orthogonal expansions and their continuous analogues, pp. 235–255.
Hurvich, C. M. and Tsai, C.-L. (1989). Regression and time series model selection in small samples. Biometrika, Vol. 76, pp. 297–307.
Jankov´a, J., Shah, R. D., Bu¨hlmann, P., and Samworth, R. J. (2019). Goodness-of-fit testing in high-dimensional generalized linear models. arXiv preprint arXiv:1908.03606.
Kato, H., Taniguchi, M., and Honda, M. (2006). Statistical analysis for multiplicatively modulated nonlinear autoregressive model and its appli- cations to electrophysiological signal analysis in humans. IEEE Trans- actions on Signal Processing, Vol. 54, pp. 3414–3425.
Katznelson, Y. (2004). An introduction to harmonic analysis: Cambridge University Press.
Kaul, A. (2014). Lasso with long memory regression errors. Journal of Statistical Planning and Inference, Vol. 153, pp. 11–26.
Kedem, B. (1994). Time series analysis by higher order crossings : IEEE press New York.
Kharin, Y. (2013). Robustness in statistical pattern recognition, Vol. 380: Springer Science & Business Media, Berlin.
Kiefer, J., Wolfowitz, J. et al. (1952). Stochastic estimation of the maximum of a regression function. The Annals of Mathematical Statistics, Vol. 23,pp. 462–466.
Kim, Y., Kwon, S., and Choi, H. (2012). Consistent model selection criteria on high dimensions. The Journal of Machine Learning Research, Vol. 13, pp. 1037–1057.
Kock, A. B. and Callot, L. (2015). Oracle inequalities for high dimensional vector autoregressions. Journal of Econometrics, Vol. 186, pp. 325–344.
Koenker, R. (2005). Quantile Regression, No. 38: Cambridge university press.
Koenker, R. and Bassett, J. G. (1978). Regression quantiles. Econometrica: journal of the Econometric Society, Vol. 46, pp. 33–50.
Koenker, R. and Hallock, K. F. (2001). Quantile regression. Journal of Economic Perspectives, Vol. 15, pp. 143–156.
Koenker, R. and Zhao, Q. (1994). L-estimatton for linear heteroscedastic models. Journaltitle of Nonparametric Statistics, Vol. 3, pp. 223–235.
Kolmogorov, A. (1941a). Interpolation and extrapolation of stationary ran- dom sequences. Izv. Akad. Nauk SSSR Ser. Mat., Vol. 5, pp. 3–14.
(1941b). Stationary sequences in Hilbert space. Byull. Moskov.Gos. Univ. Mat., Vol. 2, pp. 1–40.
Koul, H. L. and Mukherjee, K. (1994). Regression quantiles and related processes under long range dependent errors. Journal of multivariate analysis, Vol. 51, pp. 318–337.
Ku¨nsch, H. (1987). Statistical aspects of self-similar processed. In: Prokhorov, Yu., Sazanov, V.V. (Eds.), Proceedings of the First World Congress of the Bernoulli Society, pp. 67–74.
Liu, R. Y., Singh, K. et al. (1992). Moving blocks jackknife and bootstrap capture weak dependence. Exploring the limits of bootstrap, Vol. 225, p. 248.
Liu, Y. (2017). Robust parameter estimation for stationary processes by an exotic disparity from prediction problem. Statistics & Probability Letters, Vol. 129, pp. 120–130.
Luati, A., Proietti, T., and Reale, M. (2012). The variance profile. Journal of the American Statistical Association, Vol. 107, pp. 607–621.
Mallows, C. L. (2000). Some comments on Cp. Technometrics, Vol. 42,pp. 87–94.
Mandelbrot, B. and Taqqu, M. (1979). Robust R/S analysis of long run serial correlation. 42nd Internat. Statist. Inst., Manila, pp. 1–38.
Mandelbrot, B. B. and Van Ness, J. W. (1968). Fractional Brownian mo- tions, fractional noises and applications. SIAM review, Vol. 10, pp. 422– 437.
Medeiros, M. C. and Mendes, E. F. (2016). ℓ1-regularization of high- dimensional time-series models with non-Gaussian and heteroskedastic errors. Journal of Econometrics, Vol. 191, pp. 255–271.
Meinshausen, N., Bu¨hlmann, P. et al. (2006). High-dimensional graphs and variable selection with the lasso. The annals of statistics, Vol. 34, pp. 1436–1462.
Miamee, A. and Pourahmadi, M. (1988). Best Approximations in Lp(dµ) and Prediction Problems of Szeg¨o, Kolmogorov, Yaglom, and Nakazi. Journal of the London Mathematical Society, Vol. 2, pp. 133–145.
Mutapcic, A. and Boyd, S. (2009). Cutting-set methods for robust convex optimization with pessimizing oracles. Optimization Methods & Soft- ware, Vol. 24, pp. 381–406.
Nakazi, T. (1984). Two problems in prediction theory. Studia Mathematica, Vol. 1, pp. 7–14.
Newey, W. K. (1991). Uniform convergence in probability and stochastic equicontinuity. Econometrica: Journal of the Econometric Society, pp. 1161–1167.
Osborne, M. R., Presnell, B., and Turlach, B. A. (2000). On the lasso and its dual. Journal of Computational and Graphical statistics, Vol. 9, pp. 319–337.
Politis, D. N. and Romano, J. P. (1992). A general resampling scheme for triangular arrays of α-mixing random variables with application to the problem of spectral density estimation. Annals of Statistics, pp. 1985– 2007.
Pourahmadi, M., Inoue, A., and Kasahara, Y. (2007). A prediction problem in L2(ω). Proceedings of the American Mathematical Society, Vol. 135,pp. 1233–1239.
Pourahmadi, M. (1984). On minimality and interpolation of harmonizable stable processes. SIAM Journal on Applied Mathematics, Vol. 44, pp. 1023–1030.
Proietti, T. and Luati, A. (2015). The generalised autocovariance function.Journal of Econometrics, Vol. 186, pp. 245–257.
Rajput, B. S. and Sundberg, C. (1994). On some extremal problems in Hp and the prediction of Lp-harmonizable stochastic processes. Probability Theory and Related Fields, Vol. 99, pp. 197–210.
Rozanov, Y. (1967). Stationary random processes: Holden Day. San Fran- cisco.
S, D. and G, L. (2012). Large sample inference for long memory processes: World Scientific Publishing Company, Singapore.
Schilder, M. (1970). Some structure theorems for the symmetric stable laws.
The Annals of Mathematical Statistics, Vol. 41, pp. 412–421.
Shapiro, A., Dentcheva, D., and Ruszczyn´ski, A. (2009). Lectures on stochastic programming: modeling and theory : SIAM.
Shimotsu, K. and Phillips, P. C. (2006). Local Whittle estimation of frac- tional integration and some of its variants. Journal of Econometrics, Vol. 130, pp. 209–233.(2016). From Statistical Decision Theory to Robust Optimization: A Maximin Perspective on Robust Decision-Making. Robustness Analy- sis in Decision Aiding, Optimization, and Analytics. International Series in Operations Research & Management Science, Vol. 241, pp. 59–87.
Sniedovich, M. (2010). A bird’s view of info-gap decision theory. The Jour- nal of Risk Finance, Vol. 11, pp. 268–283.
Stein, C. M. (1981). Estimation of the mean of a multivariate normal dis- tribution. The annals of Statistics, pp. 1135–1151.
Surgailis, D., Koul, H. L., and Giraitis, L. (2012). Large sample infer- ence for long memory processes: World Scientific Publishing Company, London.
Tang, L., Zhou, Z., and Wu, C. (2012). Efficient estimation and variable selection for infinite variance autoregressive models. Journal of Applied Mathematics and Computing, Vol. 40, pp. 399–413.
Taniguchi, M. (1980). On estimation of the integrals of certain functions of spectral density. Journal of Applied Probability, pp. 73–83.(1981a). An estimation procedure of parameters of a certain spec- tral density model. Journal of the Royal Statistical Society. Series B, Vol. 43, pp. 34–40.(1981b). Robust regression and interpolation for time series. Jour- nal of Time Series Analysis, Vol. 2, pp. 53–62.(1987). Minimum contrast estimation for spectral densities of sta- tionary processes. J. Roy. Statist. Soc. Ser. B, Vol. 49, pp. 315–325.
Taniguchi, M., Hirukawa, J., and Tamaki, K. (2007). Optimal statistical inference in financial engineering : CRC Press.
Taniguchi, M. and Kakizawa, Y. (2012). Asymptotic theory of statistical inference for time series: Springer Science & Business Media, Berlin.
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), Vol. 58, pp. 267–288.
Tikhonov, A. (1963). Solution of incorrectly formulated problems and the regularization method. Soviet Meth. Dokl., Vol. 4, pp. 1035–1038.
Tjøstheim, D. and Hufthammer, K. O. (2013). Local Gaussian correlation: A new measure of dependence. Journal of Econometrics, Vol. 172, pp. 33–48.
Verdu, S. and Poor, H. V. (1984). On minimax robustness: A general approach and applications. Information Theory, IEEE Transactions on, Vol. 30, pp. 328–340.
Wald, A. (1939). Contributions to the theory of statistical estimation and testing hypotheses. The Annals of Mathematical Statistics, Vol. 10, pp. 299–326.(1945). Statistical decision functions which minimize the maxi- mum risk. Annals of Mathematics, pp. 265–280.
Wang, H., Li, G., and Tsai, C.-L. (2007). Regression coefficient and au- toregressive order shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 69, pp. 63–78.
Yaglom, A. M. (1962). An introduction to the theory of stationary random functions: Prentice-Hall.
Yajima, Y. (1991). Asymptotic properties of the LSE in a regression model with long-memory stationary errors. The Annals of Statistics, Vol. 19,pp. 158–177.
Ye, J. (1998). On measuring and correcting the effects of data mining and model selection. Journal of the American Statistical Association, Vol. 93, pp. 120–131.
Yuan, M. and Lin, Y. (2007). Model selection and estimation in the Gaus- sian graphical model. Biometrika, Vol. 94, pp. 19–35.
Zou, H., Hastie, T., and Tibshirani, R. (2007). On the “degrees of free- dom” of the lasso. The Annals of Statistics, Vol. 35, pp. 2173–2192.