      Is Open Access

      Online Updating of Statistical Inference in the Big Data Setting

      Preprint


          Abstract

          We present statistical methods for big data arising from online analytical processing, where large amounts of data arrive in streams and require fast analysis without storage/access to the historical data. In particular, we develop iterative estimating algorithms and statistical inferences for linear models and estimating equations that update as new data arrive. These algorithms are computationally efficient, minimally storage-intensive, and allow for possible rank deficiencies in the subset design matrices due to rare-event covariates. Within the linear model setting, the proposed online-updating framework leads to predictive residual tests that can be used to assess the goodness-of-fit of the hypothesized model. We also propose a new online-updating estimator under the estimating equation setting. Theoretical properties of the goodness-of-fit tests and proposed estimators are examined in detail. In simulation studies and real data applications, our estimator compares favorably with competing approaches under the estimating equation setting.
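          The online-updating idea for the linear model can be illustrated with a minimal sketch. It is not the authors' algorithm, only the textbook observation that the least-squares estimate depends on the data through the cumulative sums X'X and X'y, so each arriving block can be folded into those sums and then discarded. The class name `OnlineLeastSquares`, the helper `solve`, and the singularity tolerance `1e-12` are hypothetical choices made for this illustration.

```python
class OnlineLeastSquares:
    """Accumulates X'X and X'y block by block; raw data are not retained.

    Illustrative sketch only (hypothetical names), not the paper's estimator.
    """

    def __init__(self, p):
        self.p = p                                   # number of covariates
        self.xtx = [[0.0] * p for _ in range(p)]     # running sum of X'X
        self.xty = [0.0] * p                         # running sum of X'y

    def update(self, X, y):
        """Fold one block (list of rows X, list of responses y) into the sums."""
        for row, yi in zip(X, y):
            for j in range(self.p):
                self.xty[j] += row[j] * yi
                for k in range(self.p):
                    self.xtx[j][k] += row[j] * row[k]

    def coef(self):
        """Least-squares estimate based on all blocks seen so far."""
        return solve(self.xtx, self.xty)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [list(A[i]) + [b[i]] for i in range(n)]      # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:               # assumed tolerance
            raise ValueError("cumulative X'X is singular")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):                     # back substitution
        s = M[i][n] - sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / M[i][i]
    return x
```

          In this sketch a block whose design matrix is rank deficient, for example because a rare-event covariate is zero throughout the block, can still be passed to `update`; only the cumulative X'X has to be invertible when `coef` is called. Storage is fixed at one p-by-p matrix and one length-p vector regardless of how many blocks arrive.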


                Author and article information

                Journal
                1505.06354

                General statistics, Mathematical modeling & Computation
