Environmental Research Letters LETTER • OPEN ACCESS Exploring the relation between remotely sensed vertical canopy structure and tree species diversity in Gabon To cite this article: Suzanne Mariëlle Marselis et al 2019 Environ. Res. Lett. 14 094013 View the article online for updates and enhancements. This content was downloaded from IP address 160.111.254.17 on 11/09/2019 at 20:34 Environ. Res. Lett. 14 (2019) 094013 https://doi.org/10.1088/1748-9326/ab2dcd LETTER Exploring the relation between remotely sensed vertical canopy structure and tree species diversity in Gabon SuzanneMariëlleMarselis1,13, HaoTang1 , JohnArmston1, KatharineAbernethy2,10, AlfonsoAlonso4, Nicolas Barbier5, Pulchérie Bissiengou6, Kathryn Jeffery2, DavidKenfack7, Nicolas Labrière3, Seung-Kuk Lee1, SimonLLewis8,9, HervéMemiaghe10, JohnRPoulsen11, LeeWhite2,9,12 andRalphDubayah1 1 Department ofGeographical Sciences, University ofMaryland, College Park,MD,United States of America 2 Division of Biological andEnvironmental Sciences Tropical Ecology andConservation, University of Stirling, FK9 4LA,United Kingdom 3 Laboratoire Evolution et Diversité Biologique, Toulouse, France 4 Center for Conservation and Sustainability, SmithsonianConservation Biology Institute,Washington,DC,United States of America 5 AMAP, IRD, CNRS, INRA,UnivMontpellier, CIRAD,Montpellier, France 6 Institut de pharmacopée et demédecine traditionnelle (HerbierNational duGabon), CENAREST, Libreville, Gabon 7 Center for Tropical Forest Science—Forest Global EarthObservatory, Smithsonian Tropical Research Institute, Smithsonian Institution, WashingtonDC,United States of America 8 Department ofGeography, University College London, London, UnitedKingdom 9 School of Geography, University of Leeds, Leeds, UnitedKingdom 10 Institut de Recherche en Ecologie Tropicale (IRET), CENAREST, Libreville, Gabon 11 Nicolas School of the Environment, DukeUniversity, Durham,NC,United States of America 12 AgenceNationale des ParcsNationaux (ANPN), Gabon 13 Author towhomany correspondence should be addressed. E-mail:marselis@umd.edu Keywords: biodiversity, radar, lidar, LVIS, ICESat, GEDI, TanDEM-X Supplementarymaterial for this article is available online Abstract Mapping tree species diversity is increasingly important in the face of environmental change and biodiversity conservation.We explore a potential way ofmapping this diversity by relating forest structure to tree species diversity inGabon. First, we test the relation between canopy height, as a proxy for niche volume, and tree species diversity. Then, we test the relation between vertical canopy structure, as a proxy for vertical niche occupation, and tree species diversity.We use large footprint full-waveform airborne lidar data collected across four study sites inGabon (Lopé,Mabounié, Mondah, andRabi) in combinationwith in situ estimates of species richness (S) and Shannon diversity (H′). Linearmodels using canopy height explained 44% and 43%of the variation in S andH′ at the 0.25 ha resolution. Linearmodels using canopy height and the plant area volume density profile explained 71%of this variation.Wedemonstrate applications of thesemodels bymapping S andH′ in Mondah using a simulatedGEDI-TanDEM-X fusion height product, across the four sites usingwall- to-wall airborne lidar data products, and across and between the study sites using ICESat lidar waveforms. Themodeling results are encouraging in the context of developing pan-tropical structure- diversitymodels applicable to data from current and upcoming spaceborne remote sensingmissions. 1. Introduction Spatial information on tree species diversity is impor- tant to enable effective conservation and biodiversity management (Turner et al 2003) and allow for a better understanding of scale dependent relationships between forest composition and productivity (Luo et al 2019). Information on the local presence, absence, and diversity of tree species has traditionally been collected through in situ forest inventories (Rios- Saldaña et al 2018). Yet, those are time-consuming and expensive, which often prevents extensive and spatially representative coverage. Using such inventories in combination with remotely sensed data is one approach to overcome some of these limitations and relationships between remotely sensed environmental OPEN ACCESS RECEIVED 6March 2019 REVISED 26 June 2019 ACCEPTED FOR PUBLICATION 28 June 2019 PUBLISHED 9 September 2019 Original content from this workmay be used under the terms of the Creative CommonsAttribution 3.0 licence. Any further distribution of this workmustmaintain attribution to the author(s) and the title of thework, journal citation andDOI. © 2019TheAuthor(s). Published by IOPPublishing Ltd data (e.g. mean annual temperature, annual precipita- tion, dry season length, etc) and vegetation species have been developed to map Amazonian and Global vascular plant diversity (Ter Steege et al 2003, Mutke and Barthlott 2005). However, these approaches incorporate little information on the vegetation itself and the data products are provided at spatial resolu- tions (typically ∼100 km grid cells) not optimized for conservationmanagement. Huston (1979) pointed out that there is a large body of literature regarding the relation between forest structure and the diversity of different taxa, while only few studies explore the relation between forest struc- ture and tree species diversity itself. It has since been argued on several occasions that vegetation structure could be related to tree species diversity via the occu- pation of vertical niche space. This is the basis of a potential structure-diversity relationship, and three hypotheses have been proposed to explain it (Kohyama 1996, Sheil and Burslem 2003, Marks et al 2016, Gatti et al 2017). First, the forest architecture hypothesis asserts that tree species in the tropics have a higher variance in adult tree height, corresponding to higher species diversity, than in the temperate zone where short-statured stands with trees of uniform adulthood-height demonstrate less diversity (Kohyama 1993, 1996). Second, the gap-size fre- quency hypothesis, proposed by Sheil and Burslem (2003), asserts that canopy gaps provide a range of light conditions to which different species can bemore or less specialized. A changing gap-size frequency dis- tribution would allow different light regimes and thus different species to grow (Denslow 1980). Third, the height-diversity hypothesis asserts that a higher forest volume provides greater niche space and would thus likely result in a higher species diversity. A significant positive relation between tree species diversity and tree height, as a proxy for forest volume, was found, explaining a limited amount of the variance in tree species diversity within the USA and on a global scale (Marks et al 2016, Gatti et al 2017). However, Givnish (2017) objects that the use of just forest height leads to deceptive results since it only provides information on the potential trait space (volume below canopy) that could be occupied, but does not measure the occupa- tion of this trait space. In this study we address this caveat by exploring the relation between tree species diversity and the vertical vegetation structure; here- after referred to as the structure-diversity relationship. Active remote sensing systems, such as lidar and radar provide information on the vertical arrange- ment of the canopy (Bergen et al 2009). Specifically, full-waveform lidar data provide detailed and more direct measurements of the vertical profile of canopy elements, which is available from the previously earth- orbiting GLAS instrument onboard of the ICESat satellite (Tang and Dubayah 2017), NASA’s airborne Land Vegetation and Ice Sensor (LVIS) (Blair et al 1999, Tang et al 2012) and the recently launched Global Ecosystem Dynamics Investigation (GEDI). Such vertical canopy profiles providing information on the amount of plant material (leaves and branches) along the vertical axis have previously been used to distinguish successional vegetation types in a tropical savanna-forest mosaic in Gabon (Marselis et al 2018). We expect that these profiles also provide information on the occupation of available niche space between the forest floor and the top of the canopy and can be rela- ted to tree species diversity. The implication is that mapping tree species diversity in the tropics may then be feasible due to the launch of GEDI, which is collect- ing billions of lidar waveforms between 51.6° latitude north and south of the equator, in December 2018. Data from other spaceborne missions, such as the TanDEM-X radar and the ICESat-2 lidar missions, may aid in the mapping process by gap-filling GEDI data products, and increasing the spatial resolution and extent of these data (Qi and Dubayah 2016, Lee et al 2018). The overall goal of this study is to explore the structure-diversity relationship in a tropical forest and savanna landscape in Gabon. In particular, we seek to understand whether vertical canopy structure explains more of the variance in tropical tree species diversity than canopy height alone and whether we can use this relationship to predict tree species diversity across Gabon using different remote sensing datasets. The remainder of the study is organized as follows: (i) we first create linear models to relate canopy height from lidar waveforms to tree species diversity from field data; (ii) we then use two sets of metrics describ- ing the vertical canopy profile and relate these to the tree species diversity; (iii)we then compare the perfor- mance of these models; (iv) and use the height model to predict tree species diversity from LVIS canopy height and fromGEDI-TanDEM-X fusion height pro- ducts in Mondah; (v) and we apply the vertical struc- ture model to gridded LVIS products and vertical profiles derived from ICESat to predict tree species diversity across and between four study sites. 2.Method In this section, we introduce the four study sites and the datasets used. We explain the processing of the field and lidar data, followed by the model develop- ment for testing the structure-diversity relationship and the application of these models to create spatial predictions of tree species diversity from different active remote sensing data products. 2.1. Field data processing Gabon was chosen as case-study location because of the large availability of field and remote sensing datasets collected during the AfriSAR campaign (Fatoyinbo et al 2017). The four study sites in Gabon are referred to as Lopé, Mondah, Mabounié and Rabi 2 Environ. Res. Lett. 14 (2019) 094013 (figure 1) and each site includes multiple sampling plots. All sites fall within the general classification of tropical terra firme broadleaf forest, but have different species compositions and assemblies, disturbance history, and management regimes. The plots in this analysis are distributed between savanna, successional, degraded and old-growth tropical forest and are situated in climate zones with different annual pre- cipitation and temperature (table 1, see SI1 is available online at stacks.iop.org/ERL/14/094013/mmedia for detailed information). In Lopé twelve stem mapped field plots (nine 1 ha, three 0.5 ha) were established in 2016 as part of the AfriSAR campaign. These data are available through ForestPlots.net (Lopez-Gonzalez et al 2009, 2011, Labrière et al 2018). TheMondah data were also collected during the AfriSAR campaign in 2016 and are publicly available through theNASAOak Ridge National Laboratory Distributed Active Archive Center (ORNLDAAC) (Fatoyinbo et al 2018). The site comprises fifteen 1 ha stem mapped plots, thirteen of which are coincident with the LVIS lidar data. We excluded three of the thirteen plots from the Mondah site that contained a high percentage of unidentified trees (SI1). Twelve 1 ha field plots were established in Mabounié in 2012 (Labrière et al 2018), of which 10 are coincident with the LVIS lidar data and suitable for this study. A 25 ha plot comprises the Rabi site. These data were collected by the Smithsonian Conservation Biology Institute, National Zoological Park, and the Forest Global EarthObservatory (ForestGEO), and are available on request through the ForestGeo website14. From the Rabi plot, we selected the thirteen non- adjacent hectares and considered them as separate plots for the analysis. Subsampling kept this study site fromdominating themodels, as the Rabi site consisted of a total sampled area twice as large as the other sites (figure 2(a)). Field data collection at the four sites was performed by different people and organizations, which led to datasets with varying characteristics (in terms of e.g. plot layout and minimum DBH mea- sured). Details about the study sites and field datasets are described in SI1. For consistency among the sites we only included trees with DBH…10 cm in our analysis. The availability of stem maps (in Lopé, Mondah, and Rabi) and subplots (in Mabounié) enabled testing of the structure-diversity relation at different resolutions. This was necessary because species diversity and plot size are not linearly related (species-area curve, described by MacArthur and Wilson 1967) and no optimal resolution has been identified for the structure-diversity relation. Smaller plots were created by subdividing each original plot into smaller squares or rectangles to create 5 spatial resolutions: 1, 0.5, 0.25, 0.0625 and 0.04 ha (figure 2(b)). Mabounié data were only used at the 1 ha and 0.04 ha resolution, due to the absence of stem maps. At each resolution only plots with at least one identified tree were included in the analysis. This resulted in respectively 41, 64, 128, 481 and 935 plots for the resolutions 1.0 ha through 0.04 ha. Tree species diversity was then quantified for each of the plots at all resolutions using two variables: the Shannon diversity (H′) and tree species richness (S) expressed as the total number of tree species per area (Morris et al 2014). 2.2. Lidar data processing Full waveform lidar data were collected using the LVIS instrument with a 1 km swath width, a 1064 nm wavelength and a nominal footprint diameter of ∼20 m across all study sites as part of the AfriSAR campaign in February andMarch 2016. The following metrics were derived from all lidar waveforms using mature, validated and published algorithms: Relative Height 100 (RH100), Plant Area Index (PAI) at 5 m Figure 1. Field and lidar datasets from four regions (Mondah,Mabounié, Rabi and Lopé) in central andwestGabon are used in this study. LVIS lidar acquisitions are displayed as gridded canopy height (m). Insets showdistribution of field plots across each study site, markers are not to scale offield plot size. 14 https://forestgeo.si.edu/sites/africa/rabi 3 Environ. Res. Lett. 14 (2019) 094013 Table 1. Information for each of the study sites regarding the total sampled area, temperature and precipitation, the tree density, species richness and Shannon diversity calculated including only trees withDBH…10 cm, at the 1 ha resolution. Tree density (trees/ha) Total no. of species Species richness (S) (No. species/ha) Shannon diversity (H′) (H′/ha) Name Total Area (ha) Mean annual temper- ature (°C) Mean annual precipitation (mm/y) Total no. of trees Range Mean std Range Mean Std Range Mean Std Lope 10.5 26–33a 1500a 3140 9–501 308 174 118 2–54 32 21.6 0.64–3.08 2.15 0.90 Mondah 10 25b 3000–3500b 2368 26–453 237 173 139 7–75 32 21.3 1.46–3.58 2.60 0.69 Mabounié 10 26b 2030b 3537 222–444 354 61 183 44–68 54 7.7 2.90–3.61 3.27 0.23 Rabi 13 24–28c 2300c 6056 231–689 466 119 211 55–101 82 13.5 3.35–4.04 3.70 0.18 a White (2001). b Labrière et al (2018). c Lee et al (2006). 4 E nviron.R es.Lett.14 (2019)094013 vertical resolution forming the Plant Area Volume Density (PAVD) profile and cumulative PAI at 5 m vertical resolution (Blair et al 1999, Tang et al 2012, Marselis et al 2018). These footprint-level metrics are publicly available as part of the AfriSAR deliverables through the NASA ORNL DAAC (Tang et al 2018). RH100 was used to represent canopy top height. We chose the PAVD and cumulative PAI profile because they describe the vertical vegetation structure as a biophysical property that can also be derived fromdata from other instruments such as ICESat and GEDI, which enables transferability of the models. Addition- ally, they have previously been used to understand structural and functional differences between vegeta- tion communities (Cuni-Sanchez et al 2016, Decuyper et al 2018, Marselis et al 2018). For each plot, the average of all waveform metrics and the standard deviation of Canopy Height (RH100_sd) were calcu- lated using all waveforms with the footprint center locatedwithin the plot boundary. 2.3.Data analysis The field and lidar data processing resulted in five datasets, one for each spatial resolution, comprised of n samples, with n equivalent to the number of plots at the specific resolution. Each sample had one value for S, H′ and each of the lidar metrics. Four sets of lidar metrics were created to test the structure-diversity relationship (table 2). We built linear models using metric sets 1 through 4 at each of the five spatial resolutions. Variable importance could not be rigor- ously assessed for the individual metrics due to collinearity between themetrics describing the vertical profile, since the existence of plant material higher in the canopy is dependent on the existence of material lower in the canopy. For each model, we performed leave-one-plot-out cross-validation. All plots within a 1 ha plot were kept aside in the cross-validation approach for models built at the 0.5, 0.25, 0.0625 and 0.04 ha resolutions, to avoid overestimation of model performance by the model being trained towards the local variationwithin the 1 ha plot. For eachmodel, we calculated the R-squared (R2) and the root mean squared difference, as a percentage (RMSD%) of the mean observed S and H′, from the cross-validated predictions to evaluate model performance (Piñeiro et al 2008). Bias was evaluated as the intercept of the linear relation between predicted (x-axis) and observed (y-axis) S and H′. An intercept not signifi- cantly different from 0 indicated an unbiased Figure 2. (a)Thirteen 1 ha plots (dark grey)were selected from the 25 haRabi plot and included in this study. (b) Subdivision of 1 ha plots to allow for analyzing the structure-diversity relationship at 5 spatial resolutions led to plots of sizes I. (100×100 m), II. (100×50 m), III. (50×50 m), IV. (25×25 m) andV. (20×20 m). Table 2. Sets ofmetrics used in linearmodels predictingH′ and S. Metric set Model name Explanatory variables 1 Height RH100 2 Height and standard deviation of height RH100,RH100_sd 3 PAVDprofile+height P5a, P10, P15, P20, P25, P30, P35, P40, P45, P50b, RH100 4 Cumulative PAI profile+height CP5c, CP10, CP15, CP20, CP25, CP30, CP35, CP40, CP45, CP50, RH100 a P5=PAI between 0 and 5m aboveground, subsequently P10= the PAI between 5 and 10metc. b P50=PAI between 45 and 50mabove ground, in case vegetation is higher than 50m this value indicates the PAI above 45m. c CP5=Cumulative PAI between 0 and 5m aboveground, subsequently CP10= the PAI between 0 and 10m, etc. 5 Environ. Res. Lett. 14 (2019) 094013 prediction. The slope of the linear relation indicated the consistency of the predicted and observed vari- ables. A slope significantly different from 1 indicated no consistency (Piñeiro et al 2008). We used 95% confidence intervals around theR2 to evaluate whether metric sets 3 and 4 (canopy structure) led to signifi- cantly better diversity predictions than sets 1 and 2 (canopy height). We used the canopy height and standard deviation of canopy height model at the 0.25 ha resolution to predict S andH′ forMondah from two datasets: (1) the canopy height product from LVIS gridded at 50 m and (2) a simulated GEDI-TanDEM-X fusion height pro- duct at 20 m resolution (Lee et al 2018). TanDEM-X is a twin X-band SAR satellite mission from which the HH polarization was used for the fusion product. GEDI simulations were created from the LVIS wave- forms using the GEDI simulator, considering 50% cloud cover (Lee et al 2018, Hancock et al 2019). We resampled the GEDI-TanDEM-X product to 50 m resolution and computed mean and standard devia- tion of forest height for each cell. We used the PAVD (starting at P10) and canopy height model at 0.25 ha resolution to predict S and H′ for all four regions. We applied these models to (1) LVIS data products gridded at 50 m resolution and (2) ICESat waveforms collected within and between the four study sites. We used all ICESat waveforms col- lected between 2004 and 2006. P5 was omitted as it has a low accuracy due to the interference of slope with vegetation close to the surface within the nominal footprint of 50 m. 3. Results We first provide the results of the linear models using canopy height alone to predict S and H′. Then we provide the results of the models using the vertical canopy profile metrics to predict S and H′. Last, we show the spatial predictions of S and H′ created using the calibrated and validatedmodels. 3.1.Modeling results using canopy height The cross-validated linear models using canopy height (set 1) explains a limited percentage of the variance in H′ and S. At the 1.0 ha resolution, 28% and 35% of the variation in H′ and S is explained (R2=0.28 and R2=0.35). At the 0.25 ha resolution R2=0.43 and 0.44 (figure 3 and SI2). Incorporating information on canopy height variation (set 2) increases R2 to 0.58 and 0.52 at the 0.25 ha resolution. However, the R2 confidence intervals comparing the model perfor- mance using metric set 1 and 2 are overlapping at all spatial scales, suggesting that adding information on the variance in canopy height within a plot does not necessarily lead to better predictive models (figure 4, SI2). 3.2.Modeling results using canopy structure Including information on the vertical canopy profile (set 3) increases the explained variance in both H′ and S to 71% at the 0.25 ha resolution, a significant improvement in model performance (figure 4 and SI2). Metric sets 3 and 4, including two different expressions of the vertical canopy profile, generally lead to similar results and only the results of set 3 are discussed in the remainder of the paper (SI2). Our results show the best model performances for the structure-diversity relation (highest R2, lowest RMSD% and unbiased predictions) at the 0.25 ha resolution, for both H′ and S models (figures 4 and 5 and and SI2). Figure 3.Canopy height explains up to 44%of the variation in tree species richness and 43%of the variation in Shannon diversity across the study sites inGabon (cross-validated results). 6 Environ. Res. Lett. 14 (2019) 094013 3.3. Predictivemodeling results The 0.25 ha linear models using metric set 2 (model performance for H′:R2=0.58, RMSD=24%; and S; R2=0.52, RMSD=40%)were used tomapH′ and S in Mondah from (1) LVIS gridded products and (2) GEDI-TanDEM-X gridded products (figure 6). GEDI- TanDEM-X derived predictions for both species rich- ness and Shannon Diversity are consistent with the LVIS predictions (R2=0.68 andR2=0.66) but over- all biased slightly low (figure 6). Spatial predictions of H′ and S for all study sites were created from the LVIS wall-to-wall data products and ICESat waveforms using the 0.25 ha linear models with the PAVD profile starting at P10 to P50 and canopy height (model performance for H′: R2=0.71, RMSD=20%; and S; R2=0.71, RMSD=31%) (figure 7). The ICESat footprints overlapping with the wall-to-wall lidar show ICESat predictions of S and H′ are slightly higher than predictions from LVIS. Shan- non diversity predictions (with R2=0.66 and RMSD=26%) are more accurate than the species richness predictions (with R2=0.62 and RMSD=38%). 4.Discussion Our study presents regional results relating canopy height and vertical canopy structure, derived from active remote sensing data, to tree species diversity in African tropical forest. We first discuss the modeling results and the relationship between canopy structure and tree species diversity. Then we discuss some limitations to our approach and close with an outlook on the applications of the structure-diversitymodels. 4.1.Model performance Canopy height has been hypothesized to be related to tree species diversity by assuming canopy height is a proxy for canopy volume, representing the available niche space. Our study shows a significant relationship between canopy height and the Shannon index and species richness; affirming that canopy height explains a limited percentage of the variation in tree species diversity. These results are in line with previous results found in the USA and globally by Marks et al (2016) and Gatti et al (2017). We expected that including information on the variance in canopy height within a Figure 4 Linearmodels using height alone as a predictive variable generally show the lowestR2 and highest error (RMSD%). Adding variation in canopy height (RH100_sd) to themodel leads to slightly better, but not statistically different,model performance (see overlapping error bars on the R2 plots). Incorporating information on canopy structure significantly improves the S andHmodels at the 0.04–0.25 ha resolutions, when compared to using height alone. Error bars are larger at larger plot size because of smaller sample sizes. 7 Environ. Res. Lett. 14 (2019) 094013 plot might lead to better predictive models as the combination of metrics provides a more accurate proxy for forest volume than canopy height alone. Our models show an increase in R2 across all resolutions, but the improvement is not statistically significant.We then hypothesized that including information about the occupancy of vertical space, implemented by including the PAVD profile in the canopy height model, would lead to better predictions of tree species diversity. The PAVD profile is a direct measure of the degree of occupation of vertical niche space (divided in 5 m height bins) by plant material (PAI index) and thus resembles occupied niche space instead of avail- able niche space. Our results support the hypothesis as we find a statistically significant improvement in model performance at the 0.04–0.25 ha resolutions when including the PAVD profile. At larger plot sizes (0.5 and 1.0 ha) the R2 values increased but not Figure 5.Predicted versus observed species richness and Shannon diversity resulting from themodels usingmetric set 3 (PAVD profiles and height), points are colored by study site. 1.0 ha resolutionmodels for S are significantly biased, 0.25 hamodels show highest accuracy and statistically unbiased predictions for both species richness and Shannon diversity. 8 Environ. Res. Lett. 14 (2019) 094013 Figure 6. Shannon diversity and species richness predicted inMondah using canopy height (RH100) and standard deviation of canopy height (RH100_sd). Predictions created fromgridded LVIS data products and a simulatedGEDI-TanDEM-X fusion product. Points in center panel are colored by density. Figure 7. ICESat data enabled predictions ofH′ and Swithin and between the study sites. ICESat richness predictions were biased slightly high compared to LVIS predictions. Points in center panel are colored by density. 9 Environ. Res. Lett. 14 (2019) 094013 significantly.We tested the structure-diversity relation at different resolutions because neither species rich- ness nor Shannon diversity is linearly related to plot size (MacArthur and Wilson 1967, Hill 1973). Our models performed best at the 0.25 ha resolution (a decrease in performance was found for both smaller and larger plot sizes). We consider two potential explanations for this phenomenon. The first one is related to the quality of measurement: large plots tend to have greater sampling errors caused by a smaller sample size, while smaller plots are less precise due to a higher influence of geolocation error (though mini- mized by Labrière et al (2018), a small misalignment between plot and lidar data may still result in poorer model performance). The second potential explana- tion is the maximum natural variation in the struc- ture-diversity relation at an intermediate resolution. In a small plot only a limited number of trees could possibly grow due to space constraints, reducing the total variability or upper boundary of tree species diversity. By contrast, the amount of different tree species may get close to a certain maximum in a large plot (as species-area curves suggest) while similar canopy structure may be found as in smaller plots. In our study, it is at an intermediate resolution that the largest variation in tree species/plot was found, as indicated by the highest coefficient of variation in tree species richness at the 0.25 ha resolution (SI3). This trend is consistent with the trend of canopy structure with its maximum coefficient of variation at the 0.25 ha resolution. Yet, it is difficult to decouple the two factors due to the limited data sets in this study, and additional research would be needed to better identify the ecological mechanisms behind this phenomenon. Nonetheless, the modeling results for predicting Shannon diversity and species richness follow the same trend—for both diversity variables the PAVD profile with height provides a significantly better model than using height alone at the 0.25 ha resolution—strengthening our confidence in the exis- tence of a structure-diversity relationship across these study sites inGabon. 4.2. Limitations Time lag between field and lidar data collection was minimized, but we had to accept a maximum time-lag of five years (Rabi) which may have affected the strength of the structure-diversity relationship. The new census of the Rabi field data, which is under collection at the time of writing, may provide more insight on tree species diversity change over time and the effect of time-lag in the structure-diversity relation. The diversity metrics used to establish the struc- ture-diversity relation rely heavily on the accuracy of species identification in the field. Tree species identifi- cation in the tropics can be very complex and highly trained local botanists are needed to provide the most accurate identification. In most of our study sites (Rabi, Mabounié and Lopé) the percentage of trees of unknown species was low, but in Mondah this percentage was much higher because the species iden- tification was carried out by botanists with knowledge of vascular plant taxonomy in Gabon, but not specifi- cally for the Mondah region (SI1). Slik et al (2015) showed that higher percentages of unidentified trees can affect model outcomes, which should be kept in mind when evaluating our results and future research could assess the specific impact of unidentified trees on our models, for example by re-inventory of the trees inMondah. In our study, we were not able to include small trees because only two field inventory datasets inclu- ded trees with DBH…1 cm, thus we included only trees with DBH…10 cm to allow for consistency across all study sites. However, Fricker et al (2015) showed that the inclusion of small trees in Barro Col- orado Island (Panama) improved their model perfor- mance (relating species richness to various metrics describing terrain, hydrology and canopy structure) from R2=0.25 when using trees with DBH…10 cm to R2=0.35 when using trees with DBH…1 cm. More information on small trees will be needed to assess their specific impacts on the models created in this study. The plots included in the analysis were distributed across different vegetation types: savanna, forest with different degrees of degradation, successional forest and low-disturbance old-growth tropical forest. It is likely that the encountered structure-diversity rela- tionship is driven partly by this gradient in forest types. More undisturbed old-growth tropical forest plots would be needed to study the structure-diversity rela- tionship within old-growth forest. Additionally, it is yet unclear how the structure-diversity relationship holds or changes across different biogeographic regions, with changing limitations to tree growth, and different climatic niches that may change the canopy structure and the use of vertical niche space. Future research should focus on the transferability of the structure-diversity models to other regions and con- tinents to establish the potential of using this method pan-tropically or potentially across multiple biomes. Lastly, studying how the structure-diversity relation- ship changes between the local-scale, such as in a large plot like BCI (Fricker et al 2015; Wolf et al 2012), and at a regional scale, such as developed here and in Mao et al (2018) and Robinson et al (2018), will be impor- tant to ultimately evaluate the structure-diversity rela- tionship pan-tropically. 4.3. Applications and future research The commissioning of new satellite missions, provid- ing information on vertical canopy structure, is an important incentive to explore the potential applica- tions of the structure-diversity relationship. Here we 10 Environ. Res. Lett. 14 (2019) 094013 demonstrated the application of the developedmodels to wall-to-wall LVIS data products to create predictive maps of diversity and richness in the four study sites. The spatial area covered by these wall-to-wall products is limited due to the airborne nature of the data and data gaps occur as the laser energy does not penetrate clouds, which are often prevalent over Gabon. But, the real strength of our modeling approach lies in its potential to be applied to data from other instruments such as ICESat, GEDI, and TanDEM-X. The ICESat predictions overlapping with the LVIS wall-to-wall products showed S and H′ predictions around the 1:1 line. The scatter around the 1:1 line is partly caused by the model accuracy, the time-lag between these two data products (collected 10–12 years before the LVIS data) and the accuracy of the ICESat data products. But, the major drawback for using ICESat is the sparse sampling, leaving tens of kilometers between tracks over Gabon. On the contrary, the recently launched GEDI lidar mission will provide much denser lidar sampling with between-track spacing of 600 m and along-track spacing of 60 m. However, the nominal footprint size (∼22 m diameter) is close to the smallest resolution tested here and our modelling results showed a higher predictive error compared to the models that can be applied directly to ICESat data. On the other hand, fusion of the GEDI waveforms with other active remote sensing data, such as from TanDEM-X radar, has the potential to change this resolution, fill in data gaps and provide wall-to-wall data products allowing for the application of models developed at different resolutions (Qi and Dubayah 2016, Lee et al 2018). It may then also be possible to build structure-diversity models directly on fused GEDI-TanDEM-X data products when these data become available more widely after GEDI’s data collection. So far, it has been shown that canopy height can be retrieved using GEDI-TanDEM-X fusion, but, if we would be able to extract more information on the vertical canopy profile using GEDI and TanDEM-X data fusion, or fusion with another interferometric sensor such as on the BIOMASS mission, it may be possible to accomplish true wall-to-wall mapping of tree species diversity in the pan-tropics and beyond. 5. Conclusion Canopy height alone, used as a proxy for niche volume, can predict a limited percentage of the variation in tree species diversity across a savanna- tropical forest landscape in Gabon, Africa. Including information on the vertical canopy structure, used as a proxy for vertical niche occupation, derived from large footprint full-waveform lidar data improved these models significantly. This structure-diversity relation- ship and themodels developed here show potential for application to data from active remote sensing satel- lites such as ICESat, TanDEM-X, BIOMASS and GEDI. Further research is encouraged to study the structure-diversity relationship across the tropics to establish the potential of mapping pantropical tree species diversity using measurements of canopy structure. Acknowledgments This work is supported by NASA Headquarters under the NASA Earth and Space Science Fellowship Pro- gram—Grant 80NSSC17K0321, under theNASANew Investigator Program—Grant 80NSSC18K0708, under NASA’s Global Ecosystem Dynamics Invest- igation (GEDI, Grant NNL15AAO3C) and NASA’s AfriSAR campaign. The AfriSAR campaign was con- ducted in Gabon in collaboration with l’Agence gabonaise d’études et d’observation spatiale (AGEOS) and the Agence Nationale des Parcs Nationaux (ANPN). Research permission was granted by the Centre National de la Recherche Scientifique et Technologique (CENAREST—permit numbers AR0011/17/MESRSFC/CENAREST/CG/CST/ CSAR and AR0024/10/MESRDT/CENAREST/CG/ CST/CSAR) and the Agence Nationale des Parcs Nationaux (ANPN). Lidar data collection in Lopé, Mondah, Mabounié and Rabi was carried out by the NASALandVegetation and Ice Sensor (LVIS) team led by Bryan Blair,MichelleHofton andDavid Rabine and hosted by AGEOS (Ghislain Moussavou, Tanguy Gahouma), we express our sincere gratitude to their team for their effort to collect these data. We are grateful to ANPN and AGEOS for the pro- vision of logistical support that facilitated both field work and lidar data collection, specifically to Flore Koumba Pambo, Josue EdzangNdong andDavid Leh- mann from ANPN for their support. The field data collection in Lopé was funded by ESA through the AfriSAR campaign, coordinated by Simon Lewis and guided by Nicolas Labrière, and hosted by ANPN and the University of Stirling at the SEGC field station. We would like to specifically thank Carl Ditougou, Pacôme Dimbonda, Arthur Dibambou, Edmond Dimoto, and Napo Milamizokou. The field data col- lection in Mondah was funded by NASA through the AfriSAR campaign, coordinated and guided by John Poulsen, and hosted by ANPN. The field data collec- tion in Mabounié was led by Nicolas Barbier and per- formed in collaboration with the Missouri Botanical Garden (Tariq Stevart) and Golder Associates and we would specifically like to thank P Ploton, V Droissart, and Y Issembe for their help and expertise. The field data collection in Rabi was funded through Shell Gabon and the Smithsonian Conservation Biology Institute. The data were collected under guidance of Alfonso Alonso, HerveMemiaghe, David Kenfack and Pulcherie Bissiengou. This is contribution No. 171 of the Gabon Biodiversity Program. We express our 11 Environ. Res. Lett. 14 (2019) 094013 sincere gratitude to everyone involved in these field campaigns and for making the data available for this study. ORCID iDs HaoTang https://orcid.org/0000-0001-7935-5848 References BergenKM,Goetz S J, DubayahRO,HenebryGM,Hunsaker CT, ImhoffML,NelsonR F, ParkerGG andRadeloff VC 2009 Remote sensing of vegetation 3D structure for biodiversity and habitat: Review and implications for lidar and radar spacebornemissions J. Geophys. Res. 114 13 Blair J B, RabineDL andHoftonMA1999The Laser vegetation imaging sensor: amedium-altitude, digitisation-only, airborne laser altimeter formapping vegetation and topography ISPRS J. Photogramm. Remote Sens. 54 115–22 Cuni-Sanchez A,White L J T, Calders K, Jeffery K J, Abernethy K, Burt A,DisneyM,GilpinM,Gomez-Dans J L and Lewis S L 2016African savanna-forest boundary dynamics: a 20 year studyPLoSOne 11 23 DecuyperM et al 2018Assessing the structural differences between tropical forest types using terrestrial laser scanning Forest Ecol.Manage. 429 327–35 Denslow J S 1980 Patterns of plant species diversity during succession under different disturbance regimesOecologia 46 18–21 Fatoyinbo L et al 2017The 2016NASAAfriSAR campaign: airborne SAR and lidarmeasurements of tropical forest structure and biomass in support of future satellitemissions IEEE Int. Geo. andRemote Sens. Symp. (IGARSS) (https://doi.org/10.119/ IGARSS.2017.8127949) FatoyinboTE, Saatchi S S, Armston JD, Poulsen J,Marselis SM, PintoN,White L J T and Jeffery K J 2018AfriSAR:Mondah Forest Tree Species, Biophysical, and BiomassData, Gabon, 2016 (OakRidge, TN:ORNLDAAC) FrickerGA,Wolf J A, Saatchi S S andGillespie TW2015 Predicting spatial variations of tree species richness in tropical forests fromhigh-resolution remote sensingEcol. Appl. 25 1776–89 Gatti RC,Di Paola A, Bombelli A, Noce S andValentini R 2017 Exploring the relationship between canopy height and terrestrial plant diversity Plant Ecol. 218 899–908 Givnish T J 2017Tree diversity in relation to tree height: alternative perspectives Ecol. Lett. 20 395–7 Hancock S, Armston JD,HoftonM,Xiaoli S, Duncanson L I, Kellner J R andDubayahR 2019TheGEDI simulator: a large- footprint waveform lidar simulator for calibration and validation of spacebornemissionsEarth Space Sci. 6 294–310 HillMO1973Diversity and evenness: a unifying notation and its consequences Ecology 54 427–32 HustonM1979A general hypothesis of species diversityAm.Nat. 113 81–101 KohyamaT 1993 Size-structured tree populations in gap-dynamic forest–the forest architecture hypothesis for the stable coexistence of species J. Ecol. 81 131 KohyamaT 1996 the role of architecture in enhancing plant species diversityBiodiversity: an Ecological Perspective ed TAbe et al (NewYork: Springer) pp 21–33 LabrièreN et al 2018 In situ reference datasets from the TropiSAR andAfriSAR campaigns in support of upcoming spaceborne biomassmissions IEEE J. Sel. Top. Appl. EarthObs. Remote Sens. 11 3617–27 LeeM,Alonso A,Dallmeier F, Campbell P and Pauwells O 2006The GambaComplex of Protected Areas: an Illustration of Gabon’s biodiversityGamba, Gabon: Biodiversity of an Equatorial African RainforestBulletin of the Biological Society ofWashington edAAlonso et al vol 12 (Washington,DC: Biological Society ofWashington) pp 229–41 Lee S-K, FatoyinboT,QiW,Hancock S, Armston J andDubayahR 2018Gedi andTandem-X Fusion for 3DForest Structure Parameter Retrieval IGARSS 2018–2018 IEEE Int. Geoscience andRemote Sensing Symp. pp 380–2 Lopez-GonzalezG, Lewis S L, BurkitM and PillipsO L 2011 ForestPlots. net: a web application and research tool to manage and analyse tropical forest plot data J. Veg. Sci. 22 610–3 Lopez-GonzalezG, Lewis S L, BurkittM, Baker TR and PhillipsOL 2009 ForestPlots.net database (www.forestplots.net) (Accessed: 1December 2017) LuoW, Liang J, Cazzolla Gatti R, ZhaoX andZhangC 2019 Parameterization of biodiversity–productivity relationship and its scale dependency using georeferenced tree-level data J. Ecol. (https://doi.org/10.1111/1365-2745.13129) MacArthur RHandWilson EO1967The Theory of Island Biogeography (Princeton andOxford: PrincetonUniversity Press) MaoL,Dennett J, Bater CW,Tompalski P, CoopsNC, FarrD, KohlerM,White B, Stadt J J andNielsen S E 2018Using airborne laser scanning to predict plant species richness and assess conservation threats in the oil sands region of Alberta’s boreal forest For. Ecol.Manage. 409 29–37 Marks CO,Muller-LandauHC andTilmanD2016Tree diversity, tree height and environmental harshness in eastern and westernNorth AmericaEcol. Lett. 19 743–51 Marselis SM, TangH, Armston JD, Calders K, LabrièreN and DubayahR 2018Distinguishing vegetation types with airbornewaveform lidar data in a tropical forest-savanna mosaic: A case study in LopéNational Park, GabonRemote Sens. Environ. 216 626–34 Morris EK et al 2014Choosing and using diversity indices: insights for ecological applications from theGerman biodiversity exploratories Ecol. Evol. 4 3514–24 Mutke J and BarthlottW2005 Patterns of vascular plant diversity at continental to global scalesBiol. Skr. 55 521–31 PiñeiroG, Perelman S, Guerschman J P and Paruelo JM2008How to evaluatemodels: observed versus predicted or predicted versus observed? Ecol.Modelling 216 316–22 QiWandDubayahRO2016Combining tandem-X InSAR and simulatedGEDI lidar observations for forest structure mappingRemote Sens. Environ. 187 253–66 Rios-SaldañaCA,Delibes-MateosMand Ferreira CC2018Are fieldwork studies being relegated to second place in conservation science?Glob. Ecol. Conserv. 14 e00389 RobinsonC, Saatchi S, ClarkD, Astaiza JH andHubel A F 2018 Topography and three-dimensional structure can estimate tree diversity along a tropical elevational gradient inCosta RicaRemote Sens. 10 629 Sheil D andBurslemDFRP 2003Disturbing hypotheses in tropical forestsTrends Ecol. Evol. 18 18–26 Slik JWF et al 2015An estimate of the number of tropical tree speciesProc. Natl Acad. Sci. 112 7472–7 Ter SteegeH et al 2003A spatialmodel of treeα-diversity and tree density for theAmazonBiodivers. Conserv. 12 2255–77 TangH, Armston JD,Hancock S,HoftonM, Blair J B and DubayahRO2018AfriSAR: CanopyCover andVertical ProfileMetricsDerived fromLVIS, Gabon, 2016 (OakRidge, TN:ORNLDAAC) TangH andDubayahR 2017 Light-driven growth inAmazon evergreen forests explained by seasonal variations of vertical canopy structure Proc. Natl. Acad. Sci. 2016 16943 TangH,Dubayah R, SwatantranA,HoftonM, Sheldon S, ClarkDB andBlair B 2012Retrieval of vertical LAI profiles over tropical rain forests usingwaveform lidar at La Selva, Costa RicaRemote Sens. Environ. 124 242–50 TurnerW, Spector S, GardinerN, FladelandM, Sterling E and SteiningerM2003Remote sensing for biodiversity science and conservationTrends Ecol. Evol. 18 306–14 White L 2001 Forest-savanna dynamics and the origins of marantaceae forest in the lope reserveGabonAfrican Rain Forest Ecology andConservation: an Interdisciplinary 12 Environ. Res. Lett. 14 (2019) 094013 Perspective ch 11 (NewHaven, CT: YaleUniversity Press) pp 165–82 Wolf J A, Fricker GA,Meyer V,Hubbell S P, Gillespie TWand Saatchi S S 2012 Plant species richness is associatedwith canopy height and topography in a neotropical forestRemote Sens. 4 4010–21 13 Environ. Res. Lett. 14 (2019) 094013