Sample data for simulation in Jing-Jin-Ji region (dataset)

Sample data for simulation in Jing-Jin-Ji region

doi: 10.4121/uuid:1b27dc6b-b77e-4f18-b035-e8a249f595c0

Datacite citation style:

Yi, Sangui (2020): Sample data for simulation in Jing-Jin-Ji region. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:1b27dc6b-b77e-4f18-b035-e8a249f595c0

Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite

Dataset

usage stats

1302

views

372

downloads

categories

keywords

Machine learning, Simulation, Vegetation classification, Vegetation distribution

geolocation

Jing-Jin-Ji region

lat (N): 39.6

lon (E): 117.0

view on openstreetmap

licence

CC0

export as...

RefWorks, BibTeX, Reference Manager, Endnote, DataCite, NLM, DC, CFF

by Sangui Yi

Vegetation distribution simulations could help to understand vegetation distribution patterns and trends, but it is difficult to accurately simulate the distribution of vegetation especially in regions that are heavily affected by human disturbance. Climate, topographic, and spectral data were used as input predictor variables of four machine learning models, including the random forest (RF), decision tree (DT), support vector machine (SVM) and maximum likelihood methods, in three vegetation classification units, including the vegetation group, vegetation type, and formation and subformation, in the Jing-Jin-Ji region, which is one of the most developed regions in China. A total of 2789 vegetation points were used for model training, and 974 vegetation points were used for model assessment. The result showed that the random forest method was the best of the four models and could simulate the distribution of the vegetation in all three classification units well. Kappa coefficients indicated that the random forest method had the highest prediction ability in regard to vegetation type, followed by vegetation group, formation and subformation. Five predictor variables, including 4 climate variables (annual mean temperature, max temperature of warmest month, min temperature of coldest month and annual precipitation) and 1 geospatial variable (elevation), were the most important for three vegetation classification levels. The winter surface albedo of band 4, the slope and the three summer spectral variables (the summer surface albedo of bands 2 and 6 and the summer brightness index) could also increase the accuracy of vegetation classification to some extent.

history

2020-05-11 first online, published, posted

publisher

4TU.Centre for Research Data

format

media types: application/octet-stream, application/pdf, application/vnd.openxmlformats-officedocument.spreadsheetml.sheet, application/zip, text/plain, text/xml

organizations

Key Laboratory of Resource Plants, West China Subalpine Botanical Garden, Institute of Botany, Chinese Academy of Sciences, Xiangshan, Beijing, China

DATA

files (2)

25,151 bytesMD5:311cd15f4d2c8538fb0d81f2a91b80ba README.pdf
4,337,640 bytesMD5:756f7c707624f0c6713085dc4049617c data.zip
download all files (zip)
4,362,791 bytes unzipped