The Credible Diversity Plot: A Graphic for the Comparison of Biodiversity

Similar documents
Field Guide to the Freshwater Mussels of Minnesota. Bernard E. Sietman

Mussel Relocation Plan and Mussel Survey Results

Acknowledgements. Introduction

User s Guide for inext Online: Software for Interpolation and

Single Phase Pressure Drop and Flow Distribution in Brazed Plate Heat Exchangers

Changes in the Mussel Community of Ohio Brush Creek

SAMPLE MAT Proceedings of the 10th International Conference on Stability of Ships

Chapter 12 Practice Test

A SURVEY OF 1997 COLORADO ANGLERS AND THEIR WILLINGNESS TO PAY INCREASED LICENSE FEES

Section I: Multiple Choice Select the best answer for each problem.

SWG-11-JM-10. F Limit Reference Points about Preventing Recruitment Overfishing and its Uncertainty

A Cost Effective and Efficient Way to Assess Trail Conditions: A New Sampling Approach

Volunteer. Training Manual. Ecological Inventory and Monitoring

The Mussel Fauna of the Glover River, Oklahoma

Optimal Weather Routing Using Ensemble Weather Forecasts

Conservation Limits and Management Targets

B I O L O G I C A L A S S E S S M E N T

4.1 Why is the Equilibrium Diameter Important?

Module 7 Lecture 1. Swelling and Collapse Behavior

Swinburne Research Bank

SAFETY EFFECTIVENESS OF STOP CONTROL AT ULTRA-LOW VOLUME UNPAVED INTERSECTIONS Corresponding Author:

STANDARD SCORES AND THE NORMAL DISTRIBUTION

Lesson Z-Scores and Normal Distributions

46 Chapter 8 Statistics: An Introduction

Lab 4: Root Locus Based Control Design

LOCK AND DAM 3 FISH PASSAGE IMPROVEMENT

A definition of depth for functional observations

Abalone spatial- and age-structured assessment model and projections for Zones A, B, C and D

Target Shooting by Hunters and Their Use of Shooting Ranges: 1975, 1991, and 2011

Journal of Quantitative Analysis in Sports

One of the most-celebrated feats

Freshwater mussels of the Little Wabash River basin

Outfall Monitoring: Benthic Infauna. James A. Blake and Nancy J. Maciolek AECOM Environment, Marine & Coastal Center Woods Hole, Massachusetts

Ozobot Bit Classroom Application: Boyle s Law Simulation

Presented to the Israel Annual Conference on Aerospace Sciences, 2009 RISK-ANALYSIS A SUPPLEMENT TO DAMAGE-TOLERANCE ANALYSIS

The final set in a tennis match: four years at Wimbledon 1

Assessment Schedule 2016 Mathematics and Statistics: Demonstrate understanding of chance and data (91037)

PSY201: Chapter 5: The Normal Curve and Standard Scores

Status and Distribution of the Bobcat (Lynx rufus) in Illinois

CFD AND EXPERIMENTAL STUDY OF AERODYNAMIC DEGRADATION OF ICED AIRFOILS

An Analysis of Reducing Pedestrian-Walking-Speed Impacts on Intersection Traffic MOEs

Verification of Peening Intensity

CURRENT STATUS OF FRESHWATER MUSSELS (ORDER UNIONOIDA) IN TH E WABASH RIVER DRAINAGE OF INDIAN A

Determining the Effects of Temporal Period and Bait Types for Trapping Small Mammals. at Cooper Farm in Muncie, Indiana. An Honors Thesis (BIO 498)

An Application of Signal Detection Theory for Understanding Driver Behavior at Highway-Rail Grade Crossings

b) (2 pts.) Does the study show that drinking 4 or more cups of coffee a day caused the higher death rate?

Updated and revised standardized catch rate of blue sharks caught by the Taiwanese longline fishery in the Indian Ocean

Bhagwant N. Persaud* Richard A. Retting Craig Lyon* Anne T. McCartt. May *Consultant to the Insurance Institute for Highway Safety

THE BIOLOGY OF THE PRAWN, PALAEMON

Reliability Analysis Including External Failures for Low Demand Marine Systems

Quantifying the Bullwhip Effect of Multi-echelon System with Stochastic Dependent Lead Time

Environmental. Effects of Dredging

Measuring Relative Achievements: Percentile rank and Percentile point

CHAPTER 1 ORGANIZATION OF DATA SETS

Investigation of Suction Process of Scroll Compressors

Evaluation of aerodynamic criteria in the design of a small wind turbine with the lifting line model

Abstract. 1 Introduction

Project on the evaluation of the human dimensions of the target audiences regarding Eastern wolves conservation in La Mauricie National Park of

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

SECTION 2 HYDROLOGY AND FLOW REGIMES

HEADWAY AND SAFETY ANALYSIS OF SPEED LAW ENFORCEMENT TECHNIQUES IN HIGHWAY WORK ZONES

Transposition Table, History Heuristic, and other Search Enhancements

COMPUTATIONAL FLUID DYNAMIC ANALYSIS OF AIRFOIL NACA0015

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

ASSESSING THE RELIABILITY OF FAIL-SAFE STRUCTURES INTRODUCTION

A IMPROVED VOGEL S APPROXIMATIO METHOD FOR THE TRA SPORTATIO PROBLEM. Serdar Korukoğlu 1 and Serkan Ballı 2.

Lower Columbia River Dam Fish Ladder Passage Times, Eric Johnson and Christopher Peery University of Idaho

Level 3 Mathematics and Statistics (Statistics), 2013

Hitting with Runners in Scoring Position

A Study of the Relationship between Horse Slaughter and Reported Cases of Abuse and Neglect

Analysis of the Article Entitled: Improved Cube Handling in Races: Insights with Isight

SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION August 2011 Pohnpei, Federated States of Micronesia

FINAL PERFORMANCE REPORT

A Survey of the Commercially Valuable Mussels of the Wabash and White Rivers of Indiana 1

SEMI-SPAN TESTING IN WIND TUNNELS

The Aging Curve(s) Jerry Meyer, Central Maryland YMCA Masters (CMYM)

COMPARATIVE STUDY OF SOME INDICES AND BODY COEFFICIENTS OF SILURUS GLANIS AND CYPRINUS CARPIO IN DIFFERENT STAGES OF DEVELOPMENT

GRI-GM20 * Selecting Variable Intervals for Taking Geomembrane Destructive Seam Samples Using Control Charts

May 11, 2005 (A) Name: SSN: Section # Instructors : A. Jain, H. Khan, K. Rappaport

COMPUTER-AIDED DESIGN AND PERFORMANCE ANALYSIS OF HAWT BLADES

FISH 415 LIMNOLOGY UI Moscow

Valve Losses in Reciprocating Compressors

Box-and-Whisker Plots

Comparisons of Discretionary Lane Changing Behavior

Naval Postgraduate School, Operational Oceanography and Meteorology. Since inputs from UDAS are continuously used in projects at the Naval

Analysis of Unsignalized Intersection

Variable Face Milling to Normalize Putter Ball Speed and Maximize Forgiveness

A Fair Target Score Calculation Method for Reduced-Over One day and T20 International Cricket Matches

FISH 415 LIMNOLOGY UI Moscow

Chapter 20. Planning Accelerated Life Tests. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

Freshwater Mussel Surveys of the Big Darby Creek System in Central Ohio G. THOMAS WATTERS

Freshwater Mussels of the Spoon River Basin

FORM A PASCO COUNTY ACCESS CONNECTION PERMIT APPLICATION

39 Hurricane Ida Wind Record

Aerodynamic Analysis of a Symmetric Aerofoil

PUV Wave Directional Spectra How PUV Wave Analysis Works

Effect of Depth of Periphery Beams on Behavior of Grid Beams on Grid Floor

Swinburne Research Bank

Discussion on the Selection of the Recommended Fish Passage Design Discharge

Transcription:

The Credible Diversity Plot: A Graphic for the Comparison of Biodiversity Christopher J. Mecklin Department of Mathematics & Statistics Murray State University Murray KY 42071 E-mail: christopher.mecklin@murraystate.edu Key Words: Statistical Ecology, Biodiversity, Entropy, Bayesian Estimation, Bootstrap. Abstract A wide variety of indices exist for the measurement of ecological diversity. Common choices include the Shannon index and the Simpson index. Unfortunately, the arbitrary selection of diversity index can lead to conflicting results. However, both Shannon s and Simpson s measures are special cases of Renyi entropy. Others have developed diversity profiles, which use Renyi entropy to graphically compare the diversity index collected from different locations or from the same location over time. We extend this work by using both the bootstrap and Bayesian methods to estimate Renyi entropy and hence construct a graphic that we refer to as credible diversity profiles. Several examples using our method will be shown. 1. Introduction Biodiversity has been defined to be an average property of a community (Patil and Taillie (1982)), where the property in question is species rareness. In a diverse community, the typical species is rare; that is, the relative abundance of that species make up a small proportion of the population). A wide variety of statistics exist that purport to quantify the ecological diversity of a particular community. We will only mention a few of these indexes here; details on the computation of several indexes not mentioned here can be found in Ludwig and Reynolds (1988); Magurran (1988). The most commonly used measure of diversity is the Shannon index: (Shannon and Weaver (1949)) H = ˆp i ln(ˆp i ) (1) where ˆp i is the observed proportional abundance of species i. Another common species diversity measure is the Simpson index as: (Simpson (1949)) S = 1 ˆp 2 i (2) There is no reason to always prefer the use of either the Shannon or Simpson index (or one of their many competitors) for measuring diversity. Typically, the sample with the highest diversity as estimated by the Shannon index will also have the highest diversity when the Simpson index is used. However, situations exist where the arbitrary choice of diversity index will determine which of two samples appears to have the greater diversity. For instance, the following artificial data set, originally used by Hurlbert (1971) is an instance where the two measures disagree. Sample 1: C 1 = {33, 29, 28, 5, 5, 0} Sample 2: C 2 = {42, 30, 10, 8, 5, 5} Here H 1 = 1.381 < 1.457 = H 2 but S 1 = 0.724 > 0.712 = S 2 1

2. Entropy Patil and Taillie (1979, 1982); Gove et al. (1994); Tóthmérész (1995) studied a variety of parametric diversity index families to graphically display differences in diversity between samples. Tóthmérész suggested use of Renyi s entropy H α = log p α i 1 α, α 0, α 1 (3) Shannon s index is a limiting function of H α as α 1. Tóthmérész suggested plotting diversity profiles using the Renyi s index family. If the curve for a sample lies above the curve of the other sample over the entire range, then the first sample can be said to have higher diversity. However, when the curves intersect, particularly if the intersection occurs when 1 α 2, then the samples are said to be non-comparable. The profiles obtained through this construction are very similar to the β profiles of Patel and Taillie. The artificial data set shown earlier is an example of two communitites that are non-comparable. 3. Parameter Estimation An ecologist would often wish to compare indices of diversity over multiple samples representing either different sampling sites or different intervals of time. It is typical that a species not observed in one sample would be observed in a subsequent sample. Mecklin (2002, 2003) has considered use of the Bayesian paradigm for the estimation of biodiversity. In particular, he utilized Markov chain Monte Carlo methods by running iterations of a Gibbs sampler with the WinBUGS program. Suppose we have a sample with S distinct taxa. The abundance parameter p i, i = 1,, S is estimated for each species by running iterations of a Gibbs sampler (using WinBUGS). For each iteration, the Renyi entropy is computed. The 95% Bayesian credible intervals for the diversity profiles consist of the 2.5% percentile, median, and 97.5% percentile of the simulated posterior distribution of Renyi entropy. We will call the graph obtained from plotting these intervals the Bayesian credible diversity profile. We also considered the use of bootstrapping techiques (Efron and Tibshirani (1993)) for the estimation of biodiversity. In our examples in this paper, we limited ourselves to a simple and possibly simplistic application of the percentile bootstrap. It would certainly be advisable to consider a more sophisticated form of the bootstrap such as bootstrap-t or the bias-corrected accelerated (BCa) bootstrap. The bootstrap diversity profile consists of the 2.5% percentile, median, and 97.5% percentile of the bootstrapped values for Renyi entropy plotted along with the diversity profiles of the samples. When the profiles of the original data lie outside the bootstrap profiles, this indicates a significant difference in diversity; when they lie inside, there is no significant difference. The credible diversity plot, whether based on Bayesian or bootstrapped estimates, can be used as an informal graphical assessment of the comparability or significance of the diversity between samples. 4. Examples We will consider three examples to illustrate the use of credible diversity profiles. 1. Hurlburt s (1971) artificial example (also used by Tóthmérész (1995)), where the diversity profiles appear to be non-comparable. 2. The dinosaur data, used by Sheehan et al. (1991) and Fritsch and Hsu (1999), where the diversity profiles are comparable but there appears to be no significant difference in diversity. 3. The freshwater mussels data, used by Mecklin (2002, 2003), where the diversity profiles are comparable and there appears to be a significant difference in diversity. 4.1 Artificial Example The diversity was non-comparable for the artificial example. This is clearly shown in Figure 1, where the profiles intersect at α 1.6. This intersection of the profiles with 1 α 2 is expected, since the Shannon and Simpson indexes differed for these two artificial samples. 2

Plot of Renyi Entropy (Tothmeresz) 1.0 1.2 1.4 1.6 1.8 Figure 1: Diversity profile of the artificial data Since there were 7 species present in the full version of the artificial example (although only 6 of those species are found in the two samples used here)), we estimated the number of species in the population as Ŝ=7. We then estimated (using the Gibbs sampler) the posterior distributions of the 7 parameters p 1, p 2,, p 7 for each sample. The 95% Bayesian credible diversity profiles heavily overlap each other in this example, as seen in Figure 2. In the 95% bootstrap credible diversity profile, notice that the original profiles lie within the bootstrap profiles. All of these figures indicate no significant difference in diversity between the two artificial samples. Plot of Renyi Entropy (Tothmeresz) 1.2 1.4 1.6 1.8 2.0 Figure 2: Bayesian credible diversity profile of the artificial data 3

Table 1: Number of individuals per family of dinosaur observed Time Interval Dinosaur Family Upper Middle Ceratopsidae 50 53 Hadrosauridae 29 51 Hypsilophodontidae 3 2 Tyrannosauridae 3 3 Ornithomimidae 4 8 Saurornithoididae 1 6 Diversity Profiles (Renyi Entropy) 0.0 0.5 1.0 1.5 2.0 Artificial Sample 1 Artificial Sample 2 Bootstrap 2.5% Bootstrap Median Bootstrap 97.5% 0 1 2 3 4 Figure 3: Bootstrap diversity profile of the artificial data 4.2 Dinosaur Example Fossil data was collected in North Dakota and Montana concerning the diversity of families of dinosaurs for three time intervals in the Cretaceous period. It was concluded that the biodiversity of dinosaurs had remained relatively constant and had not been declining before mass extinction was caused by a cataclysmic event. For this example, I will only consider the data for the upper and middle Cretaceous time periods and will not use the lower Cretaceous. The frequency data is in Table 1. Notice that the dinosaur diversity is slightly higher (but not to a significant degree) for the Middle Cretaceous period using either the Shannon or Simpson index. H U = 1.107 < 1.210 = H M S U = 0.583 < 0.635 = S M Notice that while the original Renyi diversity profiles do not overlap (Figure 4), the 95% Bayesian credible diversity profiles do overlap (Figure 5), and the original profiles are contained within 95% bootstrap credible diversity profiles (Figure 6). 4

Plot of Renyi Entropy (Dinosaur) 0.6 0.8 1.0 1.2 1.4 1.6 1.8 Figure 4: Diversity profile of the dinosaur data Plot of Renyi Entropy (Dinosaur) 0.6 0.8 1.0 1.2 1.4 1.6 1.8 Figure 5: Credible diversity profile of the dinosaur data Diversity Profiles (Renyi Entropy) 0.0 0.5 1.0 1.5 2.0 Upper Cretaceous Middle Cretaceous Bootstrap 2.5% Bootstrap Median Bootstrap 97.5% 0 1 2 3 4 Figure 6: Bootstrap diversity profile of the dinosaur data 4.3 Freshwater Mussel Example This data set consists of the count and diversity of 20 different species of freshwater mussels observed at 5

Table 2: Number of individuals per species of mussel observed Year Mussel Species 1999 2000 Amblema plicata 36 15 Ellipsaria lineolata 14 4 Fusconaia ebena 49 49 Lampsilis cardium 3 0 Lampsilis ovata 2 0 Ligumia recta 0 2 Megalonaias nervosa 3 2 Obliquaria reflexa 10 0 Obovaria olivaria 1 1 Plethobasus cyphyus 3 1 Quadrula apiculata 1 0 Quadrula metanevra 14 5 Quadrula nodulata 10 2 Quadrula pustulosa 18 2 Quadrula quadrula 5 0 Tritogonia verrucosa 3 1 the same site in the Ohio River in western Kentucky over 7 different years between 1990 to 2000. All 7 samples were collected in autumn. The descriptive statistics indicate an apparent drop in unionid mussel diversity between 1999 and 2000. I will focus on the samples collected in 1999 and 2000 in this example. The frequency data for each species for those years is found in Table 2. We used Ŝ = 20 since that was the total number of distinct freshwater mussel taxa that were observed at least once over the ten year period of sampling, although only 16 of these species were seen in 1999 and/or 2000. Notice that the freshwater mussel diversity has noticeably decreased between 1999 and 2000 when using either the Shannon or Simpson index. H 1999 = 2.157 > 1.449 = H 2000 S 1999 = 0.842 > 0.619 = S 2000 Notice that neither the original Renyi diversity profiles (Figure 7) nor the 95% credible diversity profiles (Figure 8) overlap. In the bootstrap diversity profile (Figure 9), the 1999 profile stays almost entirely above the 97.5% bootstrap percentile and the 2000 profile stays below the 2.5% bootstrap percentile. Plot of Renyi Entropy (Mussels) 1.2 1.4 1.6 1.8 2.0 2.2 2.4 2.6 Figure 7: Original diversity profile of the mussel data 6

Plot of Renyi Entropy (Mussels) 1.0 1.5 2.0 2.5 3.0 Figure 8: Credible diversity profile of the mussel data Diversity Profiles (Renyi Entropy) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 1999 2000 Bootstrap 2.5% Bootstrap Median Bootstrap 97.5% 0 1 2 3 4 Figure 9: Bootstrap diversity profile of the dinosaur data 5. Conclusion This paper has presented a graphical method for comparing measures of biodiversity that combines the use of diversity profiles with either Bayesian or bootstrap methods of parameter estimation. As many have pointed out before, the attempt to quantify a multidimensional concept such as biodiversity with a single number is an endeavor that is doomed to failure. It is inevitable that certain situations where the arbitrary choice of diversity index will make a difference in which of two samples is found to have higher diversity. Diversity profiles are often used to alleviate this concern. We feel that it is beneficial to utilize Bayesian methods in estimating diversity when constructing diversity profiles. Advantages of using Bayesian methodology include being able to incorporate both prior knowledge about the biodiversity of the community and the sampling model in the data analysis. The bootstrap approach is an alternative that one might choose to use in this situation if there is a lack of information to formulate reasonable priors. References Efron, B. and R. Tibshirani (1993). An Introduction to the Bootstrap. New York, NY: Chapman and Hall. Fritsch, K. and J. Hsu (1999). Multiple comparison of entropies with application to dinosaur biodiversity. Biometrics 55, 1300 1305. 7

Gove, J., G. Patil, B. Swindel, and C. Taillie (1994). Ecological diversity and forest management, pp. 409 462. Handbook of Statistics, 12 (ed. G.P. Patil and C.R. Rao. Amsterdam: North Holland. Hurlbert, S. (1971). The nonconcept of species diversity: A critique and alternative parameters. Ecology 52, 577 586. Ludwig, J. and J. Reynolds (1988). Statistical Ecology: A Primer on Methods and Computing. New York: Wiley. Magurran, A. (1988). Ecological Diversity and Its Measurement. Princeton, NJ: Princeton University Press. Mecklin, C. (2002). The estimation of species diversity over time. In 2002 Proceedings of the American Statistical Association, Biometrics Section [CD-ROM], Alexandria, VA. American Statistical Association. Mecklin, C. (2003). Bayesian estimation of diversity profiles. In 2003 Proceedings of the American Statistical Association, Section on Statistics & the Environment [CD-ROM], Alexandria, VA. American Statistical Association. Patil, G. and C. Taillie (1979). An overview of diversity, pp. 3 27. Ecological Diversity in Theory and Practice (eds. J.F. Grassle, G.P. Patil, W.K. Smith and C. Taillie. Fairland, MD: International Cooperative Publishing House. Patil, G. and C. Taillie (1982). Diversity as a concept and its measurement. Journal of the American Statistical Association 77, 548 561. Shannon, C. and W. Weaver (1949). The Mathematical Theory of Communication. Urbana, IL: University of Illinois Press. Sheehan, P., D. Fastovsky, R. Hoffmann, C. Berghaus, and D. Gabriel (1991). Sudden extinction of the dinosaurs: Latest Cretaceous, Upper Great Plains, U.S.A. Science 254, 835 838. Simpson, E. (1949). Measurement of diversity. Nature 163, 688. Tóthmérész, B. (1995). Comparison of different methods for diversity ordering. Journal of Vegetation Science 6, 283 290. 8