quantile function - RDocumentation (2024)

Description

The generic function quantile produces sample quantiles corresponding to the given probabilities. The smallest observation corresponds to a probability of 0 and the largest to a probability of 1.

Usage

quantile(x, …)# S3 method for defaultquantile(x, probs = seq(0, 1, 0.25), na.rm = FALSE, names = TRUE, type = 7, …)

Arguments

numeric vector whose sample quantiles are wanted, or an object of a class for which a method has been defined (see also ‘details’). NA and NaN values are not allowed in numeric vectors unless na.rm is TRUE.

probs

numeric vector of probabilities with values in $[0,1]$. (Values up to 2e-14 outside that range are accepted and moved to the nearby endpoint.)

na.rm

logical; if true, any NA and NaN's are removed from x before the quantiles are computed.

Types

quantile returns estimates of underlying distribution quantiles based on one or two order statistics from the supplied elements in x at probabilities in probs. One of the nine quantile algorithms discussed in Hyndman and Fan (1996), selected by type, is employed.

All sample quantiles are defined as weighted averages of consecutive order statistics. Sample quantiles of type $i$ are defined by: $$Q_{i}(p) = (1 - \gamma)x_{j} + \gamma x_{j+1}$$ where $1 \le i \le 9$, $\frac{j - m}{n} \le p < \frac{j - m + 1}{n}$, $x_{j}$ is the $j$th order statistic, $n$ is the sample size, the value of $\gamma$ is a function of $j = \lfloor np + m\rfloor$ and $g = np + m - j$, and $m$ is a constant determined by the sample quantile type.

Discontinuous sample quantile types 1, 2, and 3

For types 1, 2 and 3, $Q_i(p)$ is a discontinuous function of $p$, with $m = 0$ when $i = 1$ and $i = 2$, and $m = -1/2$ when $i = 3$.

Type 1: Inverse of empirical distribution function. $\gamma = 0$ if $g = 0$, and 1 otherwise.
Type 2: Similar to type 1 but with averaging at discontinuities. $\gamma = 0.5$ if $g = 0$, and 1 otherwise.
Type 3: SAS definition: nearest even order statistic. $\gamma = 0$ if $g = 0$ and $j$ is even, and 1 otherwise.

Continuous sample quantile types 4 through 9

For types 4 through 9, $Q_i(p)$ is a continuous function of $p$, with $\gamma = g$ and $m$ given below. The sample quantiles can be obtained equivalently by linear interpolation between the points $(p_k,x_k)$ where $x_k$ is the $k$th order statistic. Specific expressions for $p_k$ are given below.

Type 4: $m = 0$. $p_k = \frac{k}{n}$. That is, linear interpolation of the empirical cdf.
Type 5: $m = 1/2$. $p_k = \frac{k - 0.5}{n}$. That is a piecewise linear function where the knots are the values midway through the steps of the empirical cdf. This is popular amongst hydrologists.
Type 6: $m = p$. $p_k = \frac{k}{n + 1}$. Thus $p_k = \mbox{E}[F(x_{k})]$. This is used by Minitab and by SPSS.
Type 7: $m = 1-p$. $p_k = \frac{k - 1}{n - 1}$. In this case, $p_k = \mbox{mode}[F(x_{k})]$. This is used by S.
Type 8: $m = (p+1)/3$. $p_k = \frac{k - 1/3}{n + 1/3}$. Then $p_k \approx \mbox{median}[F(x_{k})]$. The resulting quantile estimates are approximately median-unbiased regardless of the distribution of x.
Type 9: $m = p/4 + 3/8$. $p_k = \frac{k - 3/8}{n + 1/4}$. The resulting quantile estimates are approximately unbiased for the expected order statistics if x is normally distributed.

Further details are provided in Hyndman and Fan (1996) who recommended type 8. The default method is type 7, as used by S and by R < 2.0.0.

Details

A vector of length length(probs) is returned; if names = TRUE, it has a names attribute.

NA and NaN values in probs are propagated to the result.

The default method works with classed objects sufficiently like numeric vectors that sort and (not needed by types 1 and 3) addition of elements and multiplication by a number work correctly. Note that as this is in a namespace, the copy of sort in base will be used, not some S4 generic of that name. Also note that that is no check on the ‘correctly’, and so e.g.quantile can be applied to complex vectors which (apart from ties) will be ordered on their real parts.

There is a method for the date-time classes (see "POSIXt"). Types 1 and 3 can be used for class "Date" and for ordered factors.

References

Becker, R. A., Chambers, J. M. and Wilks, A. R. (1988) The New S Language. Wadsworth & Brooks/Cole.

Hyndman, R. J. and Fan, Y. (1996) Sample quantiles in statistical packages, American Statistician 50, 361--365. 10.2307/2684934.

Examples

Run this code

# NOT RUN {quantile(x <- rnorm(1001)) # Extremes & Quartiles by defaultquantile(x, probs = c(0.1, 0.5, 1, 2, 5, 10, 50, NA)/100)### Compare different typesquantAll <- function(x, prob, ...) t(vapply(1:9, function(typ) quantile(x, prob=prob, type = typ, ...), quantile(x, prob, type=1)))p <- c(0.1, 0.5, 1, 2, 5, 10, 50)/100signif(quantAll(x, p), 4)## for complex numbers:z <- complex(re=x, im = -10*x)signif(quantAll(z, p), 4)# }

Run the code above in your browser using DataLab

quantile function - RDocumentation (2024)

FAQs

What does quantile function in R return? ›

quantile returns estimates of underlying distribution quantiles based on one or two order statistics from the supplied elements in x at probabilities in probs .

Learn More ›

What does 75% quantile mean? ›

75th Percentile - Also known as the third, or upper, quartile. The 75th percentile is the value at which 25% of the answers lie above that value and 75% of the answers lie below that value.

Keep Reading ›

What is the 90% quantile? ›

Quartiles are essentially ranking mechanisms. The 90th percentile is that position in a data set which has 90% of data points below it, and 10% above it. The upper quartile is that position in the data set which has 75% of values below it and 25% above it.

Know More ›

What does the quantile function tell you? ›

Intuitively, the quantile function associates with a range at and below a probability input the likelihood that a random variable is realized in that range for some probability distribution.

Learn More ›

How do you interpret quantile regression? ›

The short answer is that you interpret quantile regression coefficients just like you do ordinary regression coefficients. The long answer is that you interpret quantile regression coefficients almost just like ordinary regression coefficients. We can illustrate this with a couple of examples using the hsb2 dataset.

Get More Info ›

What is the normal quantile function? ›

The quantile function of a normal distribution is equal to the inverse of the distribution function since the latter is continuous and strictly increasing. However, as we explained in the lecture on normal distribution values, the distribution function of a normal variable has no simple analytical expression.

Show Me More ›

What is a good quantile score? ›

For example, a student's Quantile measure should be at 1350Q by high school graduation to handle the math needed in college and most careers. A student Quantile measure helps you to know: Which skills and concepts students are ready to learn.

Read The Full Story ›

What is the 95% quantile? ›

It is also called the median. A quantile is called a percentile when it is based on a 0-100 scale. The 0.95-quantile is equivalent to the 95-percentile and is such that 95 % of the sample is below its value and 5 % is above.

Learn More ›

What is the 99 quantile in a normal distribution? ›

The 99th percentile in a normal distribution is 2.3263 standard deviations above the mean; 99 is 49 more than 50—thus 49 points above the mean; 49/2.3263 = 21.06.

Tell Me More ›

Is quantile the same as percentile in R? ›

Percentiles are given as percent values, values such as 95%, 40%, or 27%. Quantiles are given as decimal values, values such as 0.95, 0.4, and 0.27. The 0.95 quantile point is exactly the same as the 95^th percentile point. R does not work with percentiles, rather R works with quantiles.

Explore More ›

Is there a 100th percentile? ›

In some cases, percentiles are defined as the points that divide the data. Because 99 divisions are needed to do this, percentiles are generally numbered 1 through 99. In other cases, the 0th percentile is defined as the minimum point in the data and the 100th percentile as the maximum.

Discover More Details ›

What do quantiles tell you? ›

Quantiles are values that split sorted data or a probability distribution into equal parts. In general terms, a q-quantile divides sorted data into q parts. The most commonly used quantiles have special names: Quartiles (4-quantiles): Three quartiles split the data into four parts.

quantile function - RDocumentation (2024)

Description

Usage

Arguments

Types

Details

References

See Also

Examples

FAQs

What does quantile function in R return? ›

Is there a 100th percentile? ›