研究目的
Investigating whether utilizing additional wavelengths from MODIS as inputs to statistical and machine learning models can improve satellite-derived estimates of total suspended solids in the Chesapeake Bay.
研究成果
The Random Forest model using multiple MODIS bands modestly outperforms the single-band algorithm, especially under high TSS conditions, but both methods are similarly generalizable. The choice of method depends on the application's objectives, with multispectral approaches offering advantages in accuracy for certain conditions.
研究不足
The multispectral model is more data-intensive and requires more computational resources. The study is limited to the Chesapeake Bay and may not generalize to other estuaries. High TSS values are underestimated due to log-normal distribution of data. Spatial resolution is limited to 1-km for multispectral bands compared to 250-m for red/NIR bands.
1:Experimental Design and Method Selection:
The study uses statistical and machine learning models (Random Forest, SVM, GAM, GLM, NN, MARS, CART, BART) to estimate TSS from MODIS data, comparing with a single-band algorithm. Data is split into 80% training and 20% holdout for cross-validation.
2:Sample Selection and Data Sources:
Satellite data from MODIS Aqua for 2003-2016, processed to Level 2 using SeaDAS. In situ TSS measurements from Chesapeake Bay Program stations, matched with satellite data within 250 meters and same day.
3:List of Experimental Equipment and Materials:
MODIS Aqua satellite sensor, SeaDAS software, R Statistical Computing Environment, in situ water quality monitoring stations.
4:Experimental Procedures and Operational Workflow:
Download and process MODIS data, match with in situ measurements, train models on training set, validate on holdout set, compare performance metrics (MAE, MSE, RMSE).
5:Data Analysis Methods:
Use of R for statistical analysis, partial dependence plots for model interpretation, geographic and temporal cross-validation.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容