Abstract
Information on street trees is critical for management of this important resource. Sampling of street tree populations provides an efficient means to obtain street tree population information. Long-term repeat measures of street tree samples supply additional information on street tree changes and can be used to report damages from catastrophic events. Analyses of several street tree populations reveal that a 2%–3% sample of block segments with known length within a city will likely produce estimates on the total number of trees with a standard error around 10% of the total population estimate (relative standard error of 10%). Ratio estimates of number trees per length of street sampled reduced the number of block segments needed to attain a 10% relative standard error. Communities with a small tree population, or analyses of specific subsets of the population (e.g., individual species information), will likely need a higher proportion of block segments sampled to attain the same relative precision. This paper presents a simple means to sample street tree populations to aid in street tree management and presents information on how many block segments need to be sampled to achieve a desired sampling precision. Results can be used to develop simple, cost-efficient, and accurate means to sample street tree populations.
Understanding and quantifying the structure of the street tree resource (e.g., species composition, dbh distribution, health) is critical to basic street tree management. The most appropriate procedure to aid in street tree management is a comprehensive inventory (or census) of the street tree population. These inventories provide data on various tree characteristics and location that are essential to day-to-day management (e.g., Tate 1985; Smiley and Baker 1988). In cases where street tree inventories cannot be conducted (e.g., limited budgets), street tree sampling can provide a relatively inexpensive alternative for acquiring street tree data on overall street tree composition, health, and/or maintenance needs. However, street tree sampling does not provide the essential data (e.g., comprehensive data on specific tree locations and conditions) that inventories can provide. Rather, sampling provides general overall statistics on the street tree population characteristics, such as species composition, tree sizes, health, and/or maintenance needs. Sampling can also provide a useful means to detect changes and essential information needed for budget planning.
One of the goals of sampling is to produce a more precise estimate (e.g., number of trees) at minimal cost. One way to assess precision is to compare the standard error (variability) of the estimate to the estimate itself. Dividing the standard error of an estimate by the estimate value produces a percent relative standard error (RSE). The lower the percent relative standard error, the higher the precision of the estimate. Increasing the sample size often increases precision, but sample design also affects precision.
Various methods of street tree sampling have been reported in the literature. Mohai et al. (1978) and Valentine et al. (1978) describe a method that entails drawing approximately 50 to 100 random samples of streets with a length of two to three city blocks, and systematically sampling every xth tree of a species to produce a sample size of about 100 trees. This method can provide an efficient and accurate estimate of street tree populations, but requires information on species and population distribution such that sampling interval per species can be selected to reach a sample size of 100 trees.
Jaenson et al. (1992) proposed a sampling method that stratifies the cities into zones and randomly samples blocks such that about 2,000 to 2,300 trees are sampled. This method can provide accurate estimates of the street tree population, but requires the extra step of pre-sampling of the zones to determine how to distribute the sample and requires that a certain number of trees be sampled. This methodology was used to estimate street tree population structure and functions in Davis, California, U.S., and Lisbon, Portugal (Maco and McPherson 2003; Soares et al. 2011).
Various studies have investigated simple random samples of street tree populations. Sun and Bassuk (1991) recommended a minimum sample size of between 5% and 50% of the population based on computer simulations of hypothetical tree populations. Percent of the population needing to be sampled varied with total population size and species diversity. Alvarez et al. (2005) compared simple random and stratified random sampling (i.e., random sampling within predefined zones) of street trees in a neighborhood of Piracicaba, Brazil. Random sampling of 20 blocks produced a relative standard error (ratio of standard error to total) of 47%. Stratification of the 20 blocks into four zones revealed relative standard errors for the zones that varied between 32% and 70%. The advantage of simple random sampling lies in its simplicity, but more complex designs can reduce standard errors of the estimate (e.g., Cochran 1977).
The purpose of this paper is to demonstrate a simple random sampling technique of trees within block segments (i.e., road segments between intersections) for assessing street tree populations. In particular, this paper will use street tree data from several cities to: a) determine the sample size (percent of block segments) needed to attain a 10% relative standard error for the street tree population total and b) illustrate how information on length of block segment can improve the precision of the population estimates. This simple method does not require pre-sampling or pre-existing data and will allow for easy and accurate estimates of the street tree population that can be stratified into various zones of the city if desired. All that is required for this sampling is a listing of the individual block segments that make up the study area. If the length of these block segments are also known, then ratio estimates can be used. The ease of this procedure should increase efficiency in collecting street tree population data and improve street tree management.
METHODS
The approach used in this paper is a simple random sampling of block segments to determine street tree population characteristics. Complete street tree inventory data (100% of the street tree population) from six U.S. cities were obtained from the Davey Resource Group: Buffalo, New York (2010 population = 261,310 people, 105.15 km2, 2,485 people/km2); Lansing, Michigan (114,297 people; 87.80 km2, 1,301 people/km2); Livonia, Michigan (96,942 people; 92.46 km2, 1,048 people/km2); Parkersburg, West Virginia (31,492 people; 29 km2, 1,086 people/km2); Syracuse, New York (145,170 people; 65 km2, 2,233 people/km2); and Wilmington, Delaware (70,851 people; 27.97 km2, 2,533 people/km2). These cities were selected based on available complete street tree inventory data with geographic coordinates so that tree locations could be located on street maps in a geographic information system (GIS) to aid in sample selection and analysis.
Number and length of block segments in each city were determined using TIGER/Line files (U.S. Census Bureau 2013) within a GIS. Block segments were randomly selected, and all trees on the segment (both sides of the road) were assigned to that block segment. Each block segment was then considered a sampling unit or plot. To determine street tree population characteristics and total, plot calculations were done with and without knowing the block segment length. The following text illustrates the calculations. For these examples, suppose the goal of the sampling was to determine the total number of street trees within a city.
Simple Random Sampling (Block Segment Lengths Unknown)
In a city there are a total of N block segments with the number of trees labeled yi for the i-th block segment (i=1, 2, …, N). The total number of trees in the population is:
1
where Y is the population parameter of interest (the total number of trees).
Now suppose a simple random sample of n block segments is selected without replacement (i.e., the plot is not put back in the sample pool after being selected and cannot be selected again). One estimator of the total number of trees () is:
2
where
3
is the mean number of trees per block segment in the sample.
The estimate of the standard error of the total number of trees is calculated as
4
where
5
The standard error is a measurement of how well the sample represents the population. The smaller the standard error, the more representative the sample is of the overall population (e.g., Cochran 1977; Thompson 2002).
Simple Random Sampling (Block Segment Lengths Known)
When the block lengths (where xi is the length x for the i-th block segment) are known, a ratio estimator of trees per road distance can be used to calculate population totals and standard errors. In this case, population total is calculated as
6
where
7
is the mean of the block segment lengths in the whole population, and
8
is the mean block segment length in the sample. The subscript R in stands for ratio estimator.
As the number of street trees is likely correlated with the length of the block segment (i.e., longer block segments tend to have more trees), the ratio estimator should be better than the simpler estimator . There can be cases when this is not true (Cochran 1977), but these cases are likely rare with street tree populations. If block information can be easily obtained, they would likely be beneficial in reducing the standard error of the estimate.
The estimate of the standard error of has a similar form to and is given by
9
where
10
Note that the standard error formula for is an approximation, but is usually very good (Cochran 1977; Schaeffer et al. 1986; Levy and Lemeshow 1991; Thompson 2002). Block length information can be obtained from city GIS shapefiles or from Census Tiger/Line data.
Sample Size
To help determine the effect of sample size on the standard error for street tree population estimates, various analyses of the six street tree populations were performed. Analyses were performed for the total street tree population and subpopulations (e.g., number of trees in one species) as standard error estimates tend to increase with subpopulation estimates. The goals of the analyses were to determine the number of block segments that need to be sampled to reach a relative standard error of 10% in each city. The RSE is the ratio of the standard error to the total. For example, a street tree population with 1,000 trees that has a standard error of 300, which has a RSE of 30%. Analyses were conducted for the total number of street trees, total number of removals, and total number of one species (Norway maple, Acer platanoides) in each city. For each block segment in the city, the number of trees, removals, and Norway maples were calculated, as well as the average and standard deviation of trees per block in each city (Table 1). To show how RSE varies with sample size, the number of block segments was varied within the formula for standard error with the known population variance for each city.
Variation in Sample Estimates
To illustrate how random samples can yield varying estimates of population totals, five block segments in Syracuse, New York, were randomly selected 30 different times to estimate the total street tree population. Then 10 block segments were randomly selected 30 different times, followed by 15 block segments, 20 block segments, and so on, until 100 block segments were randomly selected 30 times. Individual sample estimates along with average street tree population estimates and standard errors from the 30 samples were graphed and contrasted against the actual street tree population total. Plotting the 30 samples gives a visual idea of how much the individual estimates will vary. A larger sample size is not needed as RSE does not need to be estimated and is calculated using the exact formula for RSE for any sample size.
Ratio Estimates
Street tree data from one city with street length information (Syracuse, New York) were used to illustrate how the RSE may differ between simple random sample estimates and ratio estimates using known block lengths. For each block segment in the city, the number of trees was counted, as well as the average and standard error of trees per block using simple random and ratio estimates based on block length. To illustrate how RSE varies with sample size between the simple random and ratio estimate approaches, the number of block segments sampled was varied within the formula for standard error with the known population variance for each city.
RESULTS
For five out of the six cities analyzed, the percent of block segments needing to be sampled to attain a 10% RSE was between 2.2% and 4.6%, with total number of block segments sampled between 89 and 185 (Table 1; Table 2). However, for the smallest city analyzed, Parkersburg, West Virginia, nearly 32% of the block segments need to be sampled to attain a 10% RSE, which required 279 block segments to be sampled.
When analyzing subsets of the population (e.g., removals, Norway maples), the number and percent of block segments needing to be sampled to attain a 10% RSE increased. For removals, the percent of block segments needing to be sampled ranged between 9.7% and 58.8%; for Norway maple it ranged between 6.0% and 72.8% (Table 1).
Sample size affected the variability of the estimate. Relative standard error decreased at differing rates among the cities as sample size increased (Figure 1). Testing of various sample sizes in Syracuse, New York (Figure 2), illustrates that as sample size increases, the estimate tends to have reduced variability and cluster more tightly around the population total. With a sample size of five block segments, sample estimates ranged between 2,588 and 81,972 trees; estimates from sampling of 100 block segments ranged between 25,751 and 39,189 trees. The average of 30 selections of all block segment sample sizes (n = 5 to 100) tends to be close to the population total (Figure 2), with the estimates, on average, only being 1,161 trees (3.3%) from true population total (34,717 trees). The minimum difference was 122 trees (0.4%; with 80 street segments) and the maximum difference was 2,759 trees (7.9%; with 15 street segments). However, as sample size increases, the certainty of the population estimate also increases (i.e., one is less likely to get an estimate that is considerably different from the true answer).
Using block lengths for a ratio estimate in Syracuse, New York, reduced the number of plots needed to attain a 10% RSE compared with simple random sampling by 29% (from 185 block segments (3.6% of block segments) to 131 block segments (2.6%) (Figure 3). The reduction in relative standard error decreased with each additional plot. The most substantial reduction in RSE occurred in the first 50 plots, with at least a one percent reduction in RSE for the first 17 street segments sampled (Figure 4).
DISCUSSION
Street tree sampling is a good option, when 100% inventories are not feasible, to provide data to aid in street tree management. With the exception of Parkersburg, West Virginia, which had a substantially smaller number of block segments, trees per block segment, and total tree population, a sample size of about 3% of the block segments (with unknown block length), or between 89 to 185 block segments in a city produced a relative standard error of 10% (Table 1). Adding block length information to a sample increased precision and reduced the sample size needed to gain desired precision, making sampling more efficient (Figure 3). Increasing the number of block segments sampled also reduced the variability of the estimate (Figure 2).
Urban foresters could begin by sampling 3% of the block segments to try and attain an estimate of total number of trees with a relative standard error of 10%. If the data are not adequate (e.g., 10% RSE is not attained), then an additional block segments could be sampling to help reach the desired precision level. However, the decreasing reduction in RSE with additional plots (Figure 4) illustrates that at some point the cost of an increased sample outweighs the gain in precision by taking this additional sample (e.g., Stanovick et al. 2002). Desired sampling precision and costs need to be considered in determining the sampling size.
To increase precision to a 5% relative standard error for most cities (excluding Parkersburg, West Virginia), an average of 11%, or 462 block segments, were sampled (Table 2). To increase precision to a one-percent relative standard error, an average of about 75%, or 3,240 block segments, were sampled. At this point of precision, it would likely be best to complete a whole street tree inventory with locations and attain 100% precision. Full inventories can provide more essential data for street tree management than street tree samples.
In cities with small tree populations (e.g., less than 2,000 trees) or for sub-classes within a larger tree population (e.g., individual species), a larger proportion of block segments will be needed to attain a desired relative standard error. However, smaller street tree populations may have a lower absolute standard error than a larger street tree population with a lower RSE. For example, in Lansing, Michigan, a 10% RSE produces a standard error of 5,790 trees (Table 1). In Parkersburg, West Virginia, a 10% RSE equates to a standard error of only 173 trees. Thus, Parkersburg could a have a RSE of 100% (1,734 trees) and still produce an absolute standard error less than Lansing, Michigan. An urban forester must decide what level of precision is desired by the sample and whether the precision is relative to the total (RSE) or an absolute number in terms of trees (standard error).
Simple random sampling or ratio estimates are only one of many possible ways of sampling street tree populations. Although relatively simple, there are other means of sampling that could produce lower relative standard errors. For example, sampling with probability proportional to size, where larger blocks have a higher chance of being selected (compared with equal probabilities of selection as in simple random sampling) could produce lower standard errors, but require additional work in selecting samples (Cochran 1977).
Other enhancements to existing methods (Cochran 1977) could also maximize reductions in standard error for a fixed cost. These techniques consider that getting to a particular randomly selected block is the biggest cost and sampling neighboring blocks can be done at little additional cost. While these approaches can add complexity to the sample design, they can be more cost-effective and produce lower standard errors than simple random sampling. The advantage of simple random sampling lies in its simplicity, but more complex designs can potentially reduce costs and standard errors of the estimate.
Stratification of the population in subareas for analysis can reduce standard error if the areas have similar population structure (e.g., similar species and/or sizes). Most stratification for street trees is done based on geography (e.g., management units, neighborhoods). This type of stratification may or may not reduce standard errors, but will provide information on subareas that may be useful management (e.g., neighborhood analyses).
Having block length information adds increased precision to the estimate and is relatively easy to obtain via TIGER/Line files (U.S. Census Bureau 2013). A GIS-based procedure is also available at no cost through the i-Tree software suite (www.itreetools.org) to help randomly select block segments within cities. To aid in the analysis of population or sub-class population totals and standard error, an Excel® spreadsheet calculator is available online (Street Tree Calculator 2015), or data can be run through i-Tree.
In establishing a street tree sample, the following types of information should be considered: a) funding—is there funding to establish a complete street tree inventory, and if not, how much is available for sampling; b) variables—what type of information is required from the sample (e.g., number of trees, tree health, maintenance needs, tree sizes, planting sites); c) precision—what is an acceptable standard error for the types of information desired; d) what types of data are available (e.g., any existing street tree data to aid in developing sample design, GIS layer of roads); and e) sample design—will simple random sampling or ratio estimates meet the desired precision, or should other sample designs be developed? Adequate sample preparation and thought can help the sampling process meet the desired objectives of the user in the most cost-effective manner.
If an urban forester would like to sample a street tree population using this procedure, potential step-by-step procedures are as follows:
Obtain a list of all block segments and their lengths (e.g., from the city GIS department). If lengths are not available, then obtain a list of all block segments. Block segments should have address information so they can be located in the field.
Load the list into a spreadsheet and randomly select 3% of the block segments
Measure all trees on selected block segments (both sides of the street)
Summarize data for each block segment in a spreadsheet (e.g., number of trees per block, number of 2.5 cm Acer rubrum, number of dead trees)
Transfer data into i-Tree spreadsheet to calculate totals and standard errors (Street Tree Calculator 2015) or calculate using equations given in methods
Optional step: Add additional randomly-selected block segments and measure trees on these block segments if desired precision is not reached in step #5. Add these new data to existing data in step #4
Re-measure sampled block segments in the future to calculate change
Street tree sampling can also provide a relatively inexpensive means to monitor change in a street tree population. By following a sample tree population through time, information can be attained on rates of changes and factors associated with tree survival or mortality (e.g., what environmental or species factors lead to varying mortality rates). In addition, re-measurement of street tree samples can be used to generate information on tree losses and/or debris clean-up related to storms or other natural events (e.g., pest outbreaks). As street trees are only a small component of the whole urban forest, urban foresters should also consider sampling the whole urban forest (e.g., through i-Tree protocols). This additional sampling would ascertain information on the structure and values associated with entire urban forest resource, as well as potential threats (e.g., pests, climate change) that may alter urban forest structure in the future.
CONCLUSION
Although 100% street tree inventories provide the most useful data for day-to-day street tree management, simple random sampling of a street tree population can provide an easy means to gather information on the street tree resource. For many cities, a 2%–3% sampling of block segments with known block lengths will provide around a 10% relative standard error for estimates of the total number of trees. Fewer block segments would be needed using ratio estimates. Smaller cities with relatively few trees may require significantly more samples to attain the same relative standard error. For estimates of subpopulation characteristics (e.g., individual species), the relative standard error or estimate uncertainty will increase. The information gained from these sample data can be used to improve urban forest management and budget estimates, as well as provide a means to assess tree damage after catastrophic events and evaluate changes in the street tree population.
Acknowledgments
Thanks to the Davey Resource Group for providing the street tree data used in this study. The use of trade, firm, or corporation names in this publication is for the information and convenience of the reader. Such does not constitute an official endorsement or approval by the United States Department of Agriculture or Forest Service of any product or service to the exclusion of others that may be suitable.
- Copyright © 2015, International Society of Arboriculture. All rights reserved.