jaydow的个人博客分享 http://blog.sciencenet.cn/u/jaydow

博文

空间自相关与Moran's I指数

已有 25894 次阅读 2015-9-29 20:31 |个人分类:概率&统计|系统分类:科研笔记| 空间自相关

    空间自相关与Moran's I指数

    空间自相关(spatial autocorrelation)是指一些变量在同一个分布区内的观测数据之间潜在的相互依赖性。Tobler(1970)曾指出“地理学第一定律:任何东西与别的东西之间都是相关的,但近处的东西比远处的东西相关性更强”。

    在地理统计学科中应用较多,现已有多种指数可以使用,但最主要的有两种指数,即Moran的I指数Geary的C指数



下面介绍了:相关 -> 时间维自相关 -> 空间自相关


摘自wikipedia: https://en.wikipedia.org/wiki/Autocorrelation

 

相关

Different fields of study define autocorrelation differently, and not all of these definitions are equivalent. In some fields, the term is used interchangeably with autocovariance.

In statistics, the autocorrelation of a random process describes the correlation between values of the process at different times, as a function of the two times or of the time lag. Let X be some repeatable process, and i be some point in time after the start of that process. (i may be an integerfor a discretetime process or a real number for a continuoustime process.) Then Xi is the value (or realization) produced by a given run of the process at time i. Suppose that the process is further known to have defined values for mean μi and variance σi2 for all times i. Then the definition of the autocorrelation between times s and t is


R(s,t) = .frac{.operatorname{E}[(X_t - .mu_t)(X_s - .mu_s)]}{.sigma_t.sigma_s}., ,

where "E" is the expected value operator. Note that this expression is not well-defined for all time series or processes, because the variance may be zero (for a constant process) or infinite (for processes with distribution lacking well-behaved moments, such as certain types of power law). If the function R is well-defined, its value must lie in the range [−1, 1], with 1 indicating perfect correlation and −1 indicating perfect anti-correlation.

时间自相关

If Xt is a widesense stationary process then the mean μ and the variance σ2 are timeindependent, and further the autocorrelation depends only on the lag between t and s: the correlation depends only on the timedistance between the pair of values but not on their position in time. This further implies that the autocorrelation can be expressed as a function of the timelag, and that this would be an even function of the lagτ = st. This gives the more familiar form


R(.tau) = .frac{.operatorname{E}[(X_t - .mu)(X_{t+.tau} - .mu)]}{.sigma^2}, .,

and the fact that this is an even function can be stated as


R(.tau) = R(-.tau)..,

It is common practice in some disciplines, other than statistics and time series analysis, to drop the normalization by σ2 and use the term "autocorrelation" interchangeably with "autocovariance". However, the normalization is important both because the interpretation of the autocorrelation as a correlation provides a scalefree measure of the strength of statistical dependence, and because the normalization has an effect on the statistical properties of the estimated autocorrelations.

空间自相关

Spatial autocorrelation statistics measure and analyze the degree of dependency among observations in a geographic space. Classic spatial autocorrelation statistics include Moran's I, Geary's C, Getis's G and the standard deviational ellipse. These statistics require measuring a spatial weights matrix that reflects the intensity of the geographic relationship between observations in a neighborhood, e.g., the distances between neighbors, the lengths of shared border, or whether they fall into a specified directional class such as "west". Classic spatial autocorrelation statistics compare the spatial weights to the covariance relationship at pairs of locations. Spatial autocorrelation that is more positive than expected from random indicate the clustering of similar values across geographic space, while significant negative spatial autocorrelation indicates that neighboring values are more dissimilar than expected by chance, suggesting a spatial pattern similar to a chess board.

Spatial autocorrelation statistics such as Moran's I and Geary's C are global in the sense that they estimate the overall degree of spatial autocorrelation for a dataset. The possibility of spatial heterogeneity suggests that the estimated degree of autocorrelation may vary significantly across geographic space. Local spatial autocorrelation statistics provide estimates disaggregated to the level of the spatial analysis units, allowing assessment of the dependency relationships across space. G statistics compare neighborhoods to a global average and identify local regions of strong autocorrelation. Local versions of the I and C statistics are also available.



Moran I 指数

In statistics, Moran's I is a measure of spatial autocorrelation developed by Patrick Alfred Pierce Moran.Spatial autocorrelation is characterized by a correlation in a signal among nearby locations in space. Spatial autocorrelation is more complex than onedimensional autocorrelation because spatial correlation is multidimensional (i.e. 2 or 3 dimensions of space) and multi-directional.

Moran's I is defined as

 I = .frac{N} {.sum_{i} .sum_{j} w_{ij}} .frac {.sum_{i} .sum_{j} w_{ij}(X_i-.bar X) (X_j-.bar X)} {.sum_{i} (X_i-.bar X)^2}

where N is the number of spatial units indexed by i and j; X is the variable of interest; .bar X is the mean of X; and w_{ij} is an element of a matrix of spatial weights.

The expected value of Moran's I under the null hypothesis of no spatial autocorrelation is

 E(I) = .frac{-1} {N-1}

Its variance equals

 .operatorname{Var}(I) = .frac{NS_4-S_3S_5} {(N-1)(N-2)(N-3)(.sum_{i} .sum_{j} w_{ij})^2} - (E(I))^2

where

 S_1 = .frac {1} {2} .sum_{i} .sum_{j} (w_{ij}+w_{ji})^2  S_2 = .sum_{i} ( .sum_{j} w_{ij} + .sum_{j} w_{ji})^2  S_3 = .frac {N^{-1} .sum_{i} (x_i - .bar x)^4} {(N^{-1} .sum_{i} (x_i - .bar x)^2)^2}   S_4 = (N^2-3N+3)S_1 - NS_2 + 3 (.sum_{i} .sum_{j} w_{ij})^2  S_5 = (N^2-N) S_1 - 2NS_2 + 6(.sum_{i} .sum_{j} w_{ij})^2

[3] Negative values indicate negative spatial autocorrelation and the inverse for positive values. Values range from −1 (indicating perfect dispersion) to +1 (perfect correlation). A zero value indicates a random spatial pattern. For statistical hypothesis testing, Moran's I values can be transformed to Z-scores in which values greater than 1.96 or smaller than −1.96 indicate spatial autocorrelation that is significant at the 5% level.

Moran's I is inversely related to Geary's C, but it is not identical. Moran's I is a measure of global spatial autocorrelation, while Geary's C is more sensitive to local spatial autocorrelation.






https://wap.sciencenet.cn/blog-659252-924379.html

上一篇:多元随机变量四大分布
下一篇:SAR图像时间维样本是否服从某种分布?
收藏 IP: 59.172.176.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-6-10 03:33

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部