Spatial Sampling & Interpolation

June Skeeter
19 May 202119:02

Summary

TLDRThe video script delves into the intricacies of spatial sampling, a critical process in gathering and analyzing geographical data. It explains the concept of a sample frame and the importance of unbiased sampling where every element has an equal chance of selection. However, due to the complexities of spatial data, biased sampling is sometimes necessary, especially when proximity influences relationships among objects. The script outlines various sampling methods, including random, systematic, stratified, cluster, and transect sampling, each with its advantages and limitations. It also touches on the challenges of implementing random sampling in practice and the trade-offs between representativeness and resource investment. Furthermore, the script introduces spatial interpolation as a technique to estimate values between sampled points, leveraging the spatial autocorrelation found in continuous fields. This method is particularly useful for estimating weather conditions, elevation, and adjusting raster image resolutions. The inverse distance weighting example illustrates how spatial interpolation can be applied effectively.

Takeaways

  • πŸ“ **Spatial Sampling Definition**: Selecting points from within an area, known as the sample frame, to gather data.
  • βš–οΈ **Bias in Sampling**: A sample is biased if elements have unequal chances of selection, while unbiased sampling gives every element an equal chance.
  • 🌐 **Spatial Autocorrelation**: Objects close to each other are more likely to be related, which is important in spatial sampling.
  • πŸ”’ **Random Sampling**: A straightforward method where x and y coordinates are randomly generated within the sample frame.
  • πŸ”„ **Repeated Sampling**: Taking multiple random samples to increase representativeness, but it's more time and resource-intensive.
  • πŸ“ **Systematic Sampling**: Imposing a regular grid on the sample space to ensure evenness, but it may miss features with regular patterns.
  • 🀝 **Combined Sampling**: Using a combination of systematic and random sampling to balance structure and randomness.
  • πŸͺ **Cluster Sampling**: Intensively sampling features within selected clusters, useful for focusing on specific areas.
  • 🚢 **Transect Sampling**: Focusing sampling efforts on a specific area of interest, efficient but requires good pre-existing understanding.
  • 🌳 **Sample Quantity**: The number of samples needed depends on the spatial homogeneity or heterogeneity of the population.
  • πŸ“ˆ **Spatial Interpolation**: Estimating values between known samples for continuous fields, using patterns observed in the dataset.
  • πŸ” **Inverse Distance Weighting**: A spatial interpolation technique that estimates values based on distance from known points, with closer samples weighted more.

Q & A

  • What is spatial sampling?

    -Spatial sampling is the process of selecting points from within an area, known as the sample frame, to gather spatial data for analysis.

  • Why is it impossible to capture and describe everything about the world at once?

    -The world is essentially infinitely complex, making it impossible to capture and describe everything due to the vast amount of information and variables involved.

  • What are the two main types of sampling that can be applied to spatial data?

    -The two main types of sampling are unbiased (where each element has an equal chance of being selected) and biased (where some elements have a greater or lower chance of being selected).

  • Why might we intentionally design our samples to be biased in spatial sampling?

    -In some cases, such as geography, objects that are close to each other are more likely to be related, so a biased sample that focuses on these relationships can be more useful than a completely random one.

  • What are the advantages of a random sample?

    -A random sample is fairly easy to define and implement in theory, providing a good range of possible values in a distribution, and it helps to avoid bias.

  • What are the potential drawbacks of random sampling?

    -Random sampling might result in oversampling of large homogeneous areas or missing smaller elements, leading to an unrepresentative sample of the population.

  • What is systematic sampling and how does it differ from random sampling?

    -Systematic sampling involves imposing a regular grid on the sample space to ensure evenness. It differs from random sampling by introducing a structured bias, which can make the sample more spatially representative but may miss some features with regular patterns.

  • How can we combine the benefits of systematic and random sampling?

    -By defining a regularly spaced grid and then taking random samples within them, we can induce some randomness into the sampling method, which helps to alleviate issues with overlapping periodic features.

  • What is cluster sampling and when is it beneficial?

    -Cluster sampling involves intensive sampling of features in clusters around selected locations. It is beneficial when focusing on certain areas, such as shopping centers for a survey, as it allows for efficient targeting of the desired population.

  • What is transect sampling and how is it used?

    -Transect sampling allows for focusing efforts on a specific area of interest, making it an efficient way to sample a key feature without sampling outside of the focus area. It is commonly used along linear features like roads or rivers.

  • How does the number of samples required to represent a population relate to the spatial homogeneity of the area?

    -The number of samples required is a function of how similar the units of a population are. More homogenous areas require fewer samples, while areas with high spatial heterogeneity need more samples to adequately represent the population.

  • What is spatial interpolation and why is it used?

    -Spatial interpolation is a technique used to estimate values of a continuous field at places where measurements are not available. It is used to fill in gaps between samples, making estimations between weather stations, estimating elevation, and changing the resolution of raster images.

  • How does inverse distance weighting work in spatial interpolation?

    -Inverse distance weighting is a method where values are interpolated based on their distance from known points, with closer samples given more weight than distant ones. The further away an object is, the less weight it has in the interpolation process.

Outlines

00:00

πŸ“ Understanding Spatial Sampling

The first paragraph introduces the concept of spatial sampling, which is the process of selecting points within an area known as the sample frame. It discusses the complexity of the world and the need to focus on subsets of data. The narrator explains the importance of unbiased sampling where every element has an equal chance of being selected, contrasting it with biased sampling that is sometimes intentionally designed in spatial contexts. The advantages of random sampling are highlighted, including its ease of implementation, but the potential drawbacks, such as oversampling or missing critical elements, are also noted. The paragraph concludes by mentioning the possibility of repeated sampling to improve representativeness, despite its resource-intensive nature.

05:01

πŸ” Biased and Systematic Sampling Methods

The second paragraph delves into alternative sampling methods used in geography, focusing on biased and systematic sampling. It contrasts the imperfections of systematic sampling, which may miss regularly patterned features, with the potential adjustments that can be made to compensate. The paragraph explains the concept of a systematic sample, which involves imposing a regular grid on the sample space for evenness, and discusses its suitability for areas with few features and abrupt boundaries. The limitations of systematic sampling for periodic features are also highlighted. The narrator then suggests combining systematic and random sampling to address issues with overlapping periodic features, and introduces stratified random sampling and cluster sampling as additional methods, emphasizing their utility in specific contexts.

10:02

🌳 Efficient Sampling Techniques

The third paragraph discusses efficient sampling techniques like cluster sampling and transect sampling. Cluster sampling involves intensive sampling around selected locations, which is beneficial when focusing on specific areas. Transect sampling allows for focused efforts on an area of interest, which is efficient but requires a good understanding of the spatial structure. The paragraph also addresses the question of how many samples are needed, explaining that it depends on the homogeneity of the population. It emphasizes the importance of knowing the study area to determine the best sampling method, balancing effective coverage with the cost of data collection.

15:03

πŸ”— Spatial Interpolation for Data Estimation

The fourth paragraph explores spatial interpolation, a technique used to estimate values between sampled points in a dataset. It explains that interpolation is an 'intelligent guesswork' that makes reasonable estimates of continuous fields where measurements are absent. The narrator distinguishes between linear interpolation in one dimension and spatial interpolation in two or three dimensions. The concept of inverse distance weighting is introduced as a method of spatial interpolation that gives more weight to closer samples, aligning with Tobler's First Law of Geography. The paragraph concludes with a mention of the practical applications of spatial interpolation, such as estimating weather conditions between stations, measuring elevation, and changing the resolution of raster images.

Mindmap

Keywords

πŸ’‘Spatial Sampling

Spatial sampling refers to the process of selecting points within a certain area, known as the sample frame, to collect data. It is a crucial part of gathering spatial data and is integral to the video's theme of understanding how to effectively collect and analyze geographic information. The script discusses how the complexity of the world necessitates sampling due to the impossibility of capturing everything at once. An example from the script is the selection of areas within a frame to either focus on a subset of data or to manage overly complex areas.

πŸ’‘Sample Frame

A sample frame is the defined area from which spatial data is collected through sampling. It is a fundamental concept in the video as it sets the boundaries within which the sampling process occurs. The script mentions the sample frame in the context of selecting areas for data collection, emphasizing that not all areas within the frame are selected, but rather a subset is chosen based on the study's requirements.

πŸ’‘Biased Sample

A biased sample occurs when some elements within the sample frame have a greater or lower chance of being selected than others. This concept is significant in the video's discussion on scientific sampling, where the goal is often to achieve an unbiased sample. However, the video also notes that in spatial sampling, sometimes a biased sample is intentionally designed to focus on specific areas of interest. An example given is that if objects close to each other are more likely to be related, a biased sample might be more appropriate.

πŸ’‘Unbiased Sample

An unbiased sample is one where every element of interest within the sample frame has an equal chance of being selected. This ensures that the sample is representative of the entire population. The video contrasts this with a biased sample and discusses the theoretical ease of implementing an unbiased sample through random selection. However, it also points out the practical challenges, such as the possibility of missing certain elements or oversampling homogeneous areas.

πŸ’‘Random Sampling

Random sampling is a method where sample points are selected randomly within the sample frame, giving each location an equal chance of being chosen. The video explains that while this method is straightforward in theory, it can lead to issues like oversampling or missing smaller elements. An example from the script is the potential for random samples to miss certain tree species if they are not adequately represented within the randomly selected sample points.

πŸ’‘Systematic Sampling

Systematic sampling is an approach where a structured pattern, such as a grid, is imposed on the sample space to ensure evenness in the selection of sample points. This method is highlighted in the video as a common practice in geography, as it can be more efficient and spatially representative than random sampling. However, the video also notes its limitations, such as the potential to miss features with regular patterns or clusters.

πŸ’‘Stratified Random Sampling

Stratified random sampling involves dividing the population into different strata or layers and then taking a random sample from each stratum. The video discusses this as an alternative approach to sampling that can be more representative of the entire population. It is particularly useful when there are distinct subgroups within the population that need to be separately analyzed. An example provided is using grid methods with random intervals to ensure a more balanced representation.

πŸ’‘Cluster Sampling

Cluster sampling is a technique where intense sampling is conducted within selected clusters or groups of features. This method is beneficial when the focus is on specific areas or features that are naturally clustered. The video uses the example of conducting a survey of shoppers' opinions within shopping centers rather than in less relevant areas like city parks, highlighting the efficiency and relevance of cluster sampling in such contexts.

πŸ’‘Transect Sampling

Transect sampling involves focusing sampling efforts along a linear feature, such as a road or river. This method is efficient for studying features that are aligned in a linear pattern. The video emphasizes that it requires a good understanding of the spatial structure of the area being studied. An example given is defining transects across a stream to measure its profile, which is more manageable than random sampling across the entire area.

πŸ’‘Spatial Interpolation

Spatial interpolation is a technique used to estimate values of a continuous field at locations where direct measurements are not available. It is a key concept in the video as it allows for the estimation of data points between known samples. The video explains that this method is particularly useful for fields like temperature, precipitation, and elevation, where there is a strong spatial autocorrelation. An example used is estimating the average temperature across Canada by interpolating between values at different weather stations.

πŸ’‘Inverse Distance Weighting

Inverse Distance Weighting (IDW) is a spatial interpolation method where values are estimated based on their distance from known points, with closer points having more influence on the interpolated value. This concept is introduced in the video as a way to fill in gaps between sample points, leveraging the principle that closer samples are more similar. The video provides a visual example of how IDW can be used to interpolate elevation data across space, given a few sample points.

Highlights

Spatial sampling involves selecting points from within an area, known as the sample frame, to gather data.

The world is infinitely complex, necessitating the collection of a subset of information or samples to understand it.

Scientific sampling requires each element in the sample frame to have a pre-specified chance of selection to avoid bias.

In some cases, spatial sampling is intentionally designed to be biased to account for the spatial relationships between objects.

Random sampling is theoretically easy to define but can lead to oversampling of large homogeneous areas or missing smaller elements.

Repeated random sampling can lead to a more representative sample but is more time-consuming and resource-intensive.

Systematic sampling involves imposing a regular grid on the sample space to ensure evenness, but may miss features with a regular pattern.

Combining systematic and random sampling can help alleviate issues with overlapping periodic features.

Stratified random sampling involves spacing grids at random intervals to maintain some randomness in the sampling method.

Cluster sampling focuses on intensive sampling of features in clusters around selected locations, useful for targeting specific areas.

Transect sampling allows for focused efforts on specific areas of interest, efficient for linear features like roads or rivers.

The number of samples required depends on the spatial homogeneity of the population; more heterogeneity requires more samples.

Knowledge of the study area is crucial for determining the best sampling method to balance effective coverage with cost.

Spatial interpolation is a technique used to estimate values between known samples, particularly useful for continuous fields.

Inverse distance weighting is a spatial interpolation method that estimates values based on distance from known points.

Spatial interpolation exploits Tobler's First Law of Geography, giving more weight to closer samples.

Continuous fields like temperature, precipitation, and elevation exhibit strong spatial autocorrelation, making spatial interpolation effective.

Spatial interpolation is used for estimating weather conditions between stations, measuring elevation, and changing raster image resolutions.

The choice of sampling method depends on the features being studied and the resources available for data collection and analysis.

Transcripts

play00:00

all right now i'm going to talk a bit

play00:03

about spatial sampling

play00:05

so how do we collect and gather spatial

play00:08

data

play00:09

and then um i'll loop back around to

play00:12

talking a little bit about how we can

play00:14

exploit a little bit of the special

play00:16

nature of spatial data

play00:19

you can think of sampling as the process

play00:22

of

play00:22

selecting points from within an area

play00:25

this area

play00:26

is also sometimes referred to as the

play00:28

sample frame

play00:30

we select some areas from within the

play00:33

frame but we discard

play00:35

other right so as i've talked about a

play00:38

bit already

play00:38

the world is essentially infinitely

play00:41

complex there's no way

play00:43

that we can capture and describe

play00:47

everything about it all at once so

play00:49

depending on

play00:50

the task at hand maybe we focus on just

play00:53

one subset and collect all the

play00:54

information we can about it

play00:57

or maybe the reality or the

play01:00

area that we're looking at is too

play01:02

complex and we have to just

play01:04

stick to looking at a subset so we take

play01:06

samples

play01:07

of that area and use that to work with

play01:12

but how do we choose which points we can

play01:14

keep and determine

play01:16

the quality of our data

play01:22

scientific sampling requires that each

play01:24

element in a sample frame has

play01:26

some pre-specified chance of selection

play01:31

if some elements have either a greater

play01:33

or lower

play01:34

chance of being selected then our sample

play01:37

is said to be biased

play01:39

and if every element of interest has an

play01:41

equal chance of being selected then our

play01:43

sample is said to be

play01:44

unbiased now biased

play01:47

isn't necessarily a bad thing in the

play01:50

context of spatial sampling sometimes we

play01:53

explicitly design our samples to be

play01:55

biased

play01:57

even though again in many of the

play02:00

hard sciences you want an unbiased

play02:03

sample

play02:05

sometimes that's just not a feasible

play02:07

option in geography

play02:10

because again objects that are close to

play02:12

each other are more likely to be related

play02:14

things that are farther away are more

play02:16

likely to be

play02:18

less related in theory

play02:22

a completely random that is unbiased

play02:25

sampling process is best with this

play02:28

each location has an equal chance of

play02:30

being selected

play02:32

one of the advantages of a random sample

play02:35

is that it's

play02:35

fairly easy to do at least in theory

play02:39

i'll explain why it's not always

play02:41

easy in practice but it's easy to define

play02:43

you create your sample frame that's the

play02:46

area that you're looking at

play02:47

and then you just randomly generate x

play02:49

and y coordinates

play02:50

within the sample thing typically

play02:53

speaking this

play02:54

provides a good range of possible values

play02:58

in a distribution

play03:04

however sorry the dog is chewing on a

play03:07

squeaky toy

play03:09

however there is a chance that all

play03:11

samples will be taken from the same type

play03:13

of elements within a certain population

play03:16

so for instance we might only sample

play03:18

from urban areas

play03:20

and miss forested areas and it's often

play03:23

difficult to implement this

play03:25

in practice so if you look at the figure

play03:28

down here

play03:29

you can see some of the drawbacks of

play03:32

random sampling

play03:34

so all the blue circles are the samples

play03:37

we take

play03:38

and with random samples you might end up

play03:42

by chance

play03:42

over sampling large elements like a

play03:45

building or

play03:46

crop field so these are homogeneous

play03:48

areas where you don't need a lot of

play03:50

samples to describe what they are

play03:53

or you might completely miss smaller

play03:55

elements so

play03:56

for instance maybe there's some certain

play03:58

tree species in an area that you

play04:00

need to get samples from

play04:03

if you rely on a random sample maybe

play04:04

you'll miss all those trees

play04:06

and then you won't include that in your

play04:10

abstraction of reality

play04:15

we have another we have a number of

play04:17

approaches to limit

play04:19

the impact of this we can do repeated

play04:22

sampling

play04:23

so theoretically if you repeat a random

play04:26

sample over and over and over again

play04:28

you'll get a lot more samples and it'll

play04:30

be more representative of the entire

play04:32

population

play04:34

the downside to this is that it's more

play04:36

time consuming and it requires more

play04:38

resources

play04:40

typically speaking collecting data

play04:43

physically

play04:44

going out and doing it is fairly

play04:47

expensive

play04:47

um you have to pay people to collect the

play04:50

data or

play04:51

invest your own time in it it's labor

play04:54

intensive it's a lot of work

play04:56

if you've ever done any field work you

play04:58

know that it's

play04:59

[Music]

play05:00

a lot so typically we don't want to do

play05:04

just random samples you can end up with

play05:07

a lot of redundancy of labor and things

play05:09

like that

play05:10

generally speaking in geography we rely

play05:13

on biased

play05:15

systematic sampling methods instead so

play05:18

this is where we create

play05:19

some sort of sampling design that trades

play05:21

a sampling scheme

play05:23

for randomness so instead of complete

play05:25

randomness we define some structure

play05:27

we induce some bias to our sample but we

play05:30

do that with the intent

play05:32

of making our sample easier more

play05:34

spatially representative things like

play05:36

that

play05:39

so systematic sampling isn't perfect

play05:42

either it may miss some features that

play05:44

have a regular pattern or

play05:45

cluster but we can make some adjustments

play05:48

to that

play05:49

to compensate

play05:52

a systematic sample is just where some

play05:55

sort of regular grid is

play05:57

imposed on the sample space to ensure

play06:00

evenness so this can be a solid strategy

play06:04

for areas that contain

play06:06

only a handful of features with abrupt

play06:08

boundaries like buildings or something

play06:11

but it's not ideal for things that

play06:13

exhibit some degree of periodicity

play06:16

that is like regular intervals like rows

play06:19

because you might end up over sampling

play06:21

or under sampling

play06:23

the feature if all of your sample points

play06:26

match

play06:26

up with the period of whatever

play06:30

entity you're looking at so for instance

play06:33

if you

play06:34

overlay a grid on a road network

play06:38

it's possible that all of these sample

play06:40

points might completely miss

play06:42

the road network it's also possible that

play06:45

the

play06:45

samples might mostly fall on the roads

play06:48

so the purely systematic sample

play06:52

generally speaking isn't the best option

play06:55

here

play06:59

one workaround is you can combine

play07:02

the benefits of the systematic sample

play07:06

with the benefits of a random sample

play07:09

generally speaking the systematic sample

play07:12

is nice because

play07:13

it makes your sampling scheme a bit more

play07:16

organized and regular it's usually

play07:17

easier to collect

play07:21

but you get rid of all the randomness

play07:25

instead you can address this issue

play07:28

by defining a regularly spaced grid and

play07:30

then taking

play07:31

random samples within them so this

play07:34

induces some randomness

play07:37

to the sampling method and will

play07:39

alleviate issues

play07:40

with overlapping periodic features

play07:44

however this goes back to the main issue

play07:47

with

play07:47

one of the main issues with random

play07:49

sampling is that it's

play07:51

time consuming and costly

play07:55

alternatively another form of stratified

play07:58

random sampling is you can

play08:00

rely on the grid method as in your d on

play08:03

the bottom left

play08:04

but you can space the grids

play08:07

at random intervals instead it's

play08:10

essentially doing the same thing

play08:17

another option is cluster sampling

play08:20

so you can do intense sampling of

play08:22

features in

play08:24

clusters around a number of selected

play08:26

locations

play08:28

this is useful if you know that you want

play08:31

to focus

play08:31

on certain areas so for instance

play08:34

shopping centers

play08:38

maybe you want to do a survey of

play08:42

the opinions of shoppers for some

play08:44

certain thing

play08:46

right it might make sense to send

play08:49

canvassers to specific shopping centers

play08:53

to interview people who pass by

play08:57

it wouldn't make sense to send one of

play08:59

your canvassers to a city park to ask

play09:02

them their opinion on

play09:03

something if it relates to shopping it

play09:06

wouldn't be a good idea

play09:08

to have the surveyors go door-to-door

play09:11

right you're not necessarily going to be

play09:13

getting the

play09:14

target population you're looking for

play09:16

that would be a waste of resources to go

play09:18

to a park

play09:18

or to canada's neighborhood when they

play09:20

could simply stand outside of the

play09:22

shopping center and get the target

play09:24

population they're looking for

play09:28

so that's one context where cluster

play09:31

sampling is very beneficial if you

play09:33

are working with things that are

play09:34

specifically clustered and you know you

play09:36

just want to target those things you can

play09:38

ignore the rest of the sampling area

play09:47

another option is to just randomly

play09:50

define

play09:51

the or you can select the cluster

play09:54

locations at random

play09:56

and then intensively sample those

play09:58

clusters

play09:59

so this could be beneficial if you're

play10:01

trying to do a vegetation survey for

play10:03

instance

play10:04

you have a large area and you randomly

play10:07

define

play10:08

vegetation plots and then you take

play10:10

samples within those vegetation plots

play10:12

so again this will this allows you to

play10:15

use some of the randomness and exploit

play10:18

some of the advantages

play10:19

of unbiased sampling while also

play10:22

exploiting some of the

play10:23

time saving advantages of biosampling so

play10:26

you choose a handful of random locations

play10:29

and then intensively sample them

play10:31

rather than sporadically sampling a

play10:33

whole bunch of random

play10:34

locations this is really efficient time

play10:39

wise but it might

play10:40

might not be representative of the

play10:42

broader population that you're

play10:44

interested in

play10:48

another option is transect sampling this

play10:51

allows you

play10:52

to focus your efforts only on a specific

play10:55

area of interest

play10:57

it's really efficient way to sample just

play11:00

the key feature that you're looking at

play11:02

you don't sample outside of the focus

play11:04

area

play11:06

but one drawback

play11:10

is that it requires really good

play11:12

pre-existing understanding

play11:14

of the spatial structure of the object

play11:17

you

play11:17

or whatever you're studying for maximum

play11:19

effectiveness

play11:21

the most common application for this

play11:23

type of sampling method might be

play11:25

along linear features like roads or

play11:28

rivers

play11:28

so you want to get a stream profile you

play11:31

just

play11:32

define transects across the stream

play11:35

and so you just focus on those specific

play11:37

areas you don't need to focus

play11:39

outside of the stream and

play11:42

uh also because of the

play11:45

sort of effort and difficulty involved

play11:48

with like

play11:49

measuring stream depth you have to have

play11:50

somebody walk across that stream it

play11:52

might be easier to just have them walk

play11:55

in lines across the river rather than

play11:57

trying to navigate it and

play11:59

doing random samples at a bunch of

play12:01

different locations

play12:04

and then within the transect sample

play12:07

samples can either be randomly as

play12:09

randomly spaced stratified or some

play12:12

combination of the two approaches

play12:14

so with most of these different types of

play12:17

sampling methods you can induce some

play12:19

degree of randomness to make them a

play12:20

little bit

play12:21

less biased if you want

play12:26

how many samples do we need the number

play12:29

of samples

play12:30

required to adequately represent a

play12:32

population is a function

play12:33

is a function of how similar the units

play12:36

of a population are

play12:40

so if you think about the spatially

play12:42

homogeneous tree farm example

play12:44

if we have a field where all of the

play12:47

trees are the same species

play12:49

and they were planted in the same year

play12:50

they're all given the same amount of

play12:52

fertilizer grown under the same light

play12:54

conditions

play12:55

there's going to be some small degree of

play12:57

variability

play12:58

in tree height so you want to measure

play13:00

the average height of the trees across

play13:02

this field

play13:02

you might need to sample say 30 trees

play13:05

and go around and measure their height

play13:06

you can take the average and get a good

play13:08

idea

play13:08

of the average tree height on this tree

play13:11

farm because

play13:12

it's fairly homogenous they're all

play13:14

essentially the same

play13:16

whereas if you want to measure the

play13:17

average tree height in a natural

play13:19

landscape with a high degree of spatial

play13:21

heterogeneity

play13:22

you're going to need more samples you've

play13:24

got different tree species

play13:27

the trees all sprout in different years

play13:30

especially if it's like an old growth

play13:32

forest

play13:33

you've got different soil conditions a

play13:35

variety of factors that are going to

play13:37

lead to trees of different heights

play13:39

spread across the landscape so you're

play13:40

going to need a lot more samples to

play13:42

determine the average tree height in

play13:44

this landscape

play13:45

than this landscape and

play13:48

due to spatial heterogeneity because

play13:51

structure can vary widely across the

play13:53

landscape

play13:53

it's important to have a bit of

play13:55

knowledge about your study area this is

play13:56

going to help you determine what the

play13:58

best

play13:58

sampling method is because the goal is

play14:01

to maximize returns for minimal effort

play14:04

need to balance effective coverage with

play14:07

the cost of collecting the data

play14:08

and i mean cost not just in money but in

play14:11

time and

play14:12

in effort so as is often going to be the

play14:16

case in this course there's no one

play14:18

right answer the specific type of

play14:21

sampling method that you're going to use

play14:23

will oftentimes depend on the features

play14:25

that you're looking at and the resources

play14:27

you have

play14:28

to conduct whatever analysis you're

play14:30

working on

play14:32

so i want to now i'm going to

play14:36

briefly take a moment to talk about one

play14:37

of the techniques that we can use

play14:39

to exploit the special nature of spatial

play14:42

data

play14:43

what if we have a data set where we took

play14:45

some samples

play14:46

but we want to know what falls between

play14:48

the samples

play14:50

we can do something called spatial

play14:51

interpolation to figure out

play14:53

what might go this is a really simple

play14:56

one-dimensional example so you've got a

play14:58

string of numbers one

play15:00

two four and five we didn't sample the

play15:02

middle one

play15:03

we want to know what's there i think we

play15:06

can all

play15:06

assume that it's going to be a three

play15:10

if you take that to two dimensions

play15:12

things get a little bit more complicated

play15:14

but again you can fill in the blanks

play15:17

using the patterns observed in the data

play15:20

set

play15:22

so this is known as spatial

play15:25

interpolation

play15:26

and the process of or

play15:30

the process of filling in blanks in

play15:31

general is known as interpolation so if

play15:34

you do it over

play15:35

one dimension one two three four five

play15:38

that's known as linear interpolation but

play15:41

if you do it over two or three

play15:43

dimensions then we're going to call it

play15:44

spatial interpolation essentially it's

play15:48

just

play15:48

intelligent guesswork where we attempt

play15:50

to make a reasonable estimate of

play15:52

values of a continuous field at a place

play15:57

where we don't have measurements

play16:01

spatial interpolation only makes sense

play16:04

for a

play16:04

continuous field so something

play16:08

where it varies across space like

play16:11

temperature or precipitation or

play16:13

elevation

play16:14

with categorical data like land cover

play16:17

it can be a pretty problematic uh thing

play16:19

to do and it doesn't work very well

play16:22

the spatial interpolation works really

play16:24

well with rainfall

play16:26

temperature pressure most weather

play16:28

observations in general

play16:30

so you can use it to make estimations

play16:33

between

play16:34

weather stations so canada

play16:37

has a network of weather stations spread

play16:39

across the country

play16:40

some more dense and populated areas some

play16:43

more sparse especially in the north

play16:46

but we can use the information from

play16:48

these weather stations to

play16:50

estimate the average temperature at all

play16:53

spaces

play16:54

at all locations across the country by

play16:57

interpolating between the values at

play17:00

weather stations

play17:01

it can be used to measure or estimate

play17:04

elevation between measured locations

play17:07

and it's also used when we change

play17:09

resolution of raster images

play17:14

so all spatial interpolation methods

play17:18

incorporate distance to known samples

play17:21

and if this sounds familiar that's

play17:22

because they exploit tobler's first law

play17:25

of geography

play17:26

closer samples are going to be given

play17:28

more weight than

play17:29

distant ones and a threshold is usually

play17:32

set to determine the maximum distance to

play17:34

take samples from

play17:37

most continuous fields tend to exhibit

play17:39

very strong

play17:40

spatial autocorrelation so it's pretty

play17:43

reasonable to assume

play17:45

that values that are missing are likely

play17:47

to be similar

play17:48

to those that are around them in the

play17:50

field

play17:53

so here's just a visual example of how

play17:56

you might go about doing spatial

play17:58

interpolation

play17:59

with elevation data there is a tool

play18:02

called

play18:03

inverse distance weighting where

play18:06

values are interpolated

play18:09

based on their distance from known

play18:12

points

play18:13

where the farther away

play18:16

the object is the less weight it has

play18:19

that is it is inversely weighted with

play18:22

its distance

play18:24

so you can see a one-dimensional and

play18:27

two-dimensional example

play18:28

of inverse distance weighting where on

play18:31

the left

play18:32

using it to just interpolate the

play18:35

values between points on a line and on

play18:37

the right we're using it to

play18:39

estimate elevation across space given

play18:42

a small number of samples

play18:46

so i will discuss or i'll show an

play18:49

example

play18:50

later on in the term of specifically how

play18:52

to do inverse distance weighting

play18:54

in arcgis pro but i just wanted to

play18:57

introduce this concept because it kind

play18:59

of ties in nicely with what i've already

play19:00

talked about

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Spatial SamplingData QualityUnbiased SamplingBiased SamplingGeographySampling MethodsSpatial DataRandom SamplingSystematic SamplingCluster SamplingTransect SamplingInterpolation