eScholarship Repository eScholarship Repository California Digital Library
eScholarship > UCLASTAT > PAPERS > Paper 2003102301

Statistics Papers

Statistics Website

Policies

Search Statistics

Submit a Paper

Notify me of new papers

institute_logo

Department of Statistics, UCLA
University of California, Los Angeles

Statistics Papers  •  Statistics Website  •  Policies  •  Search Statistics  •  Submit a Paper

Approximating the Distribution of Pareto Sums
Ilya Zaliapin, UCLA Institute of Geophysics and Planetary Physics
Yan Y. Kagan, UCLA Institute of Geophysics and Planetary Physics
Federic P. Schoenberg, UCLA Department of Statistics

Download the Paper (557 K, PDF file) - October 23, 2003 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:

Heavy tailed random variables (rvs) have proven to be an essential element in modeling a wide variety of natural and human induced processes, and the sums of heavy tailed rvs represent a particularly important construct in such models. Oriented toward both geophysical and statistical audiences, this paper discusses the appearance of the Pareto law in seismology and addresses the problem of the statistical approximation for the sums of independent rvs with common Pareto distribution F(x)=1 - xα for 1/2 < α < 2. Such variables have infinite second moment which prevents one from using the Central Limit Theorem to solve the problem. This paper presents five approximation techniques for the Pareto sums and discusses their respective accuracy. The main focus is on the median and the upper and lower quantiles of the sum?s distribution. Two of the proposed approximations are based on the Generalized Central Limit Theorem, which establishes the general limit for the sums of independent identically distributed rvs in terms of stable distributions; these approximations work well for large numbers of summands. Another approximation, which replaces the sum with its maximal summand, has less than 10% relative error for the upper quantiles when α < 1. A more elaborate approach considers the two largest observations separately from the rest of the observations, and yields a relative error under 1% for the upper quantiles and less than 5% for the median. The last approximation is specially tailored for the lower quantiles, and involves reducing the non-Gaussian problem to its Gaussian equivalent; it too yields errors less than 1%. Approximation of the observed cumulative seismic moment in California illustrates developed methods.

SUGGESTED CITATION:
Ilya Zaliapin, Yan Y. Kagan, and Federic P. Schoenberg, "Approximating the Distribution of Pareto Sums" (October 23, 2003). Department of Statistics, UCLA. Department of Statistics Papers. Paper 2003102301.
http://repositories.cdlib.org/uclastat/papers/2003102301

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress