Svoboda | Graniru | BBC Russia | Golosameriki | Facebook
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (11,523)

Search Parameters:
Keywords = random forest

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 6209 KiB  
Article
Construction and Optimization of Landslide Susceptibility Assessment Model Based on Machine Learning
by Xiaodong Wang, Xiaoyi Ma, Dianheng Guo, Guangxiang Yuan and Zhiquan Huang
Appl. Sci. 2024, 14(14), 6040; https://doi.org/10.3390/app14146040 (registering DOI) - 10 Jul 2024
Viewed by 54
Abstract
The appropriate selection of machine learning samples forms the foundation for utilizing machine learning models. However, in landslide susceptibility evaluation, discrepancies arise when non-landslide samples are positioned within areas prone to landslides or demonstrate spatial biases, leading to differences in model predictions. To [...] Read more.
The appropriate selection of machine learning samples forms the foundation for utilizing machine learning models. However, in landslide susceptibility evaluation, discrepancies arise when non-landslide samples are positioned within areas prone to landslides or demonstrate spatial biases, leading to differences in model predictions. To address the impact of non-landslide sample selection on landslide susceptibility predictions, this study uses the western region of Henan Province as a case study. Utilizing historical data, remote sensing interpretation, and field surveys, a sample dataset comprising 834 landslide points is obtained. Ten environmental factors, including elevation, slope, aspect, profile curvature, land cover, lithology, topographic wetness index, distance from river, distance from faults, and distance from road, are chosen to establish an evaluation index system. Negative sample sampling areas are delineated based on the susceptibility assessment outcomes derived from the information value model. Two sampling strategies, whole-region random sampling (I) and partition-based random sampling (II), are employed. Random Forest (RF) and Back Propagation Neural Network (BPNN) models are used to forecast and delineate landslide susceptibility in the western region of Henan Province, with prediction accuracy evaluated. The model prediction accuracy is ranked as follows: II-BPNN (AUC = 0.9522) > II-RF (AUC = 0.9464) > I-RF (AUC = 0.8247) > I-BPNN (AUC = 0.8068). Under the Receiver Operating Characteristic (AUC) curve and accuracy, the II-RF and II-BPNN models exhibit increases in the region by 12.17% and 15.61%, respectively, compared to the I-RF and I-BPNN models. Moreover, the II-BPNN model shows improvements over the I-BPNN model with increases in AUC and accuracy by 14.54% and 16.52%, respectively. This indicates enhancements in model performance and predictive capability. In terms of recall and specificity, the II-RF and II-BPNN models demonstrate increases in recall by 15.09% and 17.47%, respectively, and in specificity by 15.80% and 14.99%, respectively. These findings suggest that the optimized models have better predictive capabilities for identifying landslide and non-landslide areas, effectively reducing the uncertainty introduced by point data in landslide risk prediction. Full article
27 pages, 12235 KiB  
Article
Enhancing the Performance of Landslide Susceptibility Mapping with Frequency Ratio and Gaussian Mixture Model
by Wenchao Huangfu, Haijun Qiu, Weicheng Wu, Yaozu Qin, Xiaoting Zhou, Yang Zhang, Mohib Ullah and Yanfen He
Land 2024, 13(7), 1039; https://doi.org/10.3390/land13071039 (registering DOI) - 10 Jul 2024
Viewed by 56
Abstract
A rational landslide susceptibility mapping (LSM) can minimize the losses caused by landslides and enhance the efficiency of disaster prevention and reduction. At present, frequency ratio (FR), information value (IV), and certainty factor (CF) are widely used to quantify the relationships between landslides [...] Read more.
A rational landslide susceptibility mapping (LSM) can minimize the losses caused by landslides and enhance the efficiency of disaster prevention and reduction. At present, frequency ratio (FR), information value (IV), and certainty factor (CF) are widely used to quantify the relationships between landslides and their causative factors; however, it remains unclear which method is the most effective. Moreover, existing landslide susceptibility zoning methods lack full automation; thus, the results are full of uncertainties. To address this, the FR, IV, and CF were used to analyze the relationship between landslides and causative factors. Subsequently, three distinct sets of models were developed, namely random forest models (RF_FR, RF_IV, and RF_CF), support vector machine models (SVM_FR, SVM_IV, and SVM_CF), and logistic regression models (LR_FR, LR_IV, and LR_CF) using the analysis results as inputs. A Gaussian mixture model (GMM) was introduced as a new method for landslide susceptibility zoning, classifying the LSM into five distinct levels. An accuracy evaluation of the models and a rationality analysis of the LSM indicated that the FR is superior to the IV and CF in quantifying the relationship between landslides and causative factors. Additionally, the quantile method was employed as a comparative approach to the GMM, further validating the effectiveness of the GMM. This research contributes to more effective and efficient LSM, ultimately enhancing landslide prevention measures. Full article
23 pages, 11874 KiB  
Article
Spatial Downscaling of Nighttime Land Surface Temperature Based on Geographically Neural Network Weighted Regression Kriging
by Jihan Wang, Nan Zhang, Laifu Zhang, Haoyu Jing, Yiming Yan, Sensen Wu and Renyi Liu
Remote Sens. 2024, 16(14), 2542; https://doi.org/10.3390/rs16142542 - 10 Jul 2024
Viewed by 35
Abstract
Land surface temperature (LST) has a wide application in Earth Science-related fields, and spatial downscaling is an important method to retrieve high-resolution LST data. However, existing LST downscaling methods have difficulties in simultaneously constructing and expressing spatial non-stationarity, spatial autocorrelation, and complex non-linearity [...] Read more.
Land surface temperature (LST) has a wide application in Earth Science-related fields, and spatial downscaling is an important method to retrieve high-resolution LST data. However, existing LST downscaling methods have difficulties in simultaneously constructing and expressing spatial non-stationarity, spatial autocorrelation, and complex non-linearity during the LST downscaling process, which limits the performance of the models. Moreover, there is a lack of research on high-resolution nighttime land surface temperature (NLST) reconstruction based on spatial downscaling, which does not meet the data needs for urban-scale nighttime urban heat island (UHI) studies. Therefore, this study combined Geographically Neural Network Weighted Regression (GNNWR) with Area-to-Point Kriging interpolation (ATPK) to propose a Geographically Neural Network Weighted Regression Kriging (GNNWRK) model for NLST downscaling. To verify the model’s generality and robustness, this study selected four study areas with different landform and climate type for NLST spatial downscaling experiments. The GNNWRK was compared with four benchmark downscaling methods, including TsHARP, Random Forest, Geographically Weighted Regression, and GNNWR. The results show that compared to these four benchmark methods, the GNNWRK method has higher accuracy in NLST downscaling, with a maximum Pearson’s Correlation Coefficient (Pcc) of 0.930 and a minimum Root Mean Square Error (RMSE) of 0.886 K. Moreover, the validation based on MODIS NLST data and ground-measured NLST data also indicates that the GNNWRK model can obtain more accurate, high-resolution NLST with richer and more detailed texture. This enhances the potential of NLST in studying the effects of urban nighttime heat islands at a finer scale. Full article
(This article belongs to the Special Issue Advances in Thermal Infrared Remote Sensing II)
13 pages, 622 KiB  
Article
Investigation of Machine and Deep Learning Techniques to Detect HPV Status
by Efstathia Petrou, Konstantinos Chatzipapas, Panagiotis Papadimitroulas, Gustavo Andrade-Miranda, Paraskevi F. Katsakiori, Nikolaos D. Papathanasiou, Dimitris Visvikis and George C. Kagadis
J. Pers. Med. 2024, 14(7), 737; https://doi.org/10.3390/jpm14070737 (registering DOI) - 10 Jul 2024
Viewed by 49
Abstract
Background: This study investigated alternative, non-invasive methods for human papillomavirus (HPV) detection in head and neck cancers (HNCs). We compared two approaches: analyzing computed tomography (CT) scans with a Deep Learning (DL) model and using radiomic features extracted from CT images with machine [...] Read more.
Background: This study investigated alternative, non-invasive methods for human papillomavirus (HPV) detection in head and neck cancers (HNCs). We compared two approaches: analyzing computed tomography (CT) scans with a Deep Learning (DL) model and using radiomic features extracted from CT images with machine learning (ML) models. Methods: Fifty patients with histologically confirmed HNC were included. We first trained a modified ResNet-18 DL model on CT data to predict HPV status. Next, radiomic features were extracted from manually segmented regions of interest near the oropharynx and used to train four ML models (K-Nearest Neighbors, logistic regression, decision tree, random forest) for the same purpose. Results: The CT-based model achieved the highest accuracy (90%) in classifying HPV status. Among the ML models, K-Nearest Neighbors performed best (80% accuracy). Weighted Ensemble methods combining the CT-based model with each ML model resulted in moderate accuracy improvements (70–90%). Conclusions: Our findings suggest that CT scans analyzed by DL models hold promise for non-invasive HPV detection in HNC. Radiomic features, while less accurate in this study, offer a complementary approach. Future research should explore larger datasets and investigate the potential of combining DL and radiomic techniques. Full article
12 pages, 3074 KiB  
Article
Machine-Learning Approaches in N Estimations of Fig Cultivations Based on Satellite-Born Vegetation Indices
by Karla Janeth Martínez-Macias, Aldo Rafael Martínez-Sifuentes, Selenne Yuridia Márquez-Guerrero, Arturo Reyes-González, Pablo Preciado-Rangel, Pablo Yescas-Coronado and Ramón Trucíos-Caciano
Nitrogen 2024, 5(3), 598-609; https://doi.org/10.3390/nitrogen5030040 (registering DOI) - 10 Jul 2024
Viewed by 75
Abstract
Nitrogen is one of the most important macronutrients for crops, and, in conjunction with artificial intelligence algorithms, it is possible to estimate it with the aid of vegetation indices through remote sensing. Various indices were calculated and those with a correlation of ≥0.7 [...] Read more.
Nitrogen is one of the most important macronutrients for crops, and, in conjunction with artificial intelligence algorithms, it is possible to estimate it with the aid of vegetation indices through remote sensing. Various indices were calculated and those with a correlation of ≥0.7 were selected for subsequent use in random forest, gradient boosting, and artificial neural networks to determine their relationship with nitrogen levels measured in the laboratory. Random forest showed no relationship, yielding an R2 of zero; and gradient boosting and the classical method were similar with 0.7; whereas artificial neural networks yielded the best results with an R2 of 0.93. Thus, estimating nitrogen levels using this algorithm is reliable, by feeding it with data from the Modified Chlorophyll Absorption Ratio Index, Transformed Chlorophyll Absorption Reflectance Index, Modified Chlorophyll Absorption Ratio Index/Optimized Soil Adjusted Vegetation Index, and Transformed Chlorophyll Absorption Ratio Index/Optimized Soil Adjusted Vegetation Index Full article
(This article belongs to the Special Issue Nitrogen Signaling in Plants)
Show Figures

Figure 1

23 pages, 5524 KiB  
Article
Proposing Machine Learning Models Suitable for Predicting Open Data Utilization
by Junyoung Jeong and Keuntae Cho
Sustainability 2024, 16(14), 5880; https://doi.org/10.3390/su16145880 - 10 Jul 2024
Viewed by 144
Abstract
As the digital transformation accelerates in our society, open data are being increasingly recognized as a key resource for digital innovation in the public sector. This study explores the following two research questions: (1) Can a machine learning approach be appropriately used for [...] Read more.
As the digital transformation accelerates in our society, open data are being increasingly recognized as a key resource for digital innovation in the public sector. This study explores the following two research questions: (1) Can a machine learning approach be appropriately used for measuring and evaluating open data utilization? (2) Should different machine learning models be applied for measuring open data utilization depending on open data attributes (field and usage type)? This study used single-model (random forest, XGBoost, LightGBM, CatBoost) and multi-model (stacking ensemble) machine learning methods. A key finding is that the best-performing models differed depending on open data attributes (field and type of use). The applicability of the machine learning approach for measuring and evaluating open data utilization in advance was also confirmed. This study contributes to open data utilization and to the application of its intrinsic value to society. Full article
(This article belongs to the Section Economic and Business Aspects of Sustainability)
Show Figures

Figure 1

22 pages, 18268 KiB  
Article
Enhancement of Comparative Assessment Approaches for Synthetic Aperture Radar (SAR) Vegetation Indices for Crop Monitoring and Identification—Khabarovsk Territory (Russia) Case Study
by Aleksei Sorokin, Alexey Stepanov, Konstantin Dubrovin and Andrey Verkhoturov
Remote Sens. 2024, 16(14), 2532; https://doi.org/10.3390/rs16142532 - 10 Jul 2024
Viewed by 112
Abstract
Crop identification at the field level using remote sensing data is a very important task. However, the use of multispectral data for the construction of vegetation indices is sometimes impossible or limited. For such situations, solutions based on the use of time series [...] Read more.
Crop identification at the field level using remote sensing data is a very important task. However, the use of multispectral data for the construction of vegetation indices is sometimes impossible or limited. For such situations, solutions based on the use of time series of synthetic aperture radar (SAR) indices are promising, eliminating the problems associated with cloudiness and providing an assessment of crop development characteristics during the growing season. We evaluated the use of time series of synthetic aperture radar (SAR) indices to characterize crop development during the growing season. The use of SAR imagery for crop identification addresses issues related to cloudiness. Therefore, it is important to choose the SAR index that is the most stable and has the lowest spatial variability throughout the growing season while being comparable to the normalized difference vegetation index (NDVI). The presented work is devoted to the study of these issues. In this study, the spatial variabilities of different SAR indices time series were compared for a single region for the first time to identify the most stable index for use in precision agriculture, including the in-field heterogeneity of crop sites, crop rotation control, mapping, and other tasks in various agricultural areas. Seventeen Sentinel-1B images of the southern part of the Khabarovsk Territory in the Russian Far East at a spatial resolution of 20 m and temporal resolution of 12 days for the period between 14 April 2021 and 1 November 2021 were obtained and processed to generate vertical–horizontal/vertical–vertical polarization (VH/VV), radar vegetation index (RVI), and dual polarimetric radar vegetation index (DpRVI) time series. NDVI time series were constructed from multispectral Sentinel-2 images using a cloud cover mask. The characteristics of time series maximums were calculated for different types of crops: soybean, oat, buckwheat, and timothy grass. The DpRVI index exhibited the highest stability, with coefficients of variation of the time series that were significantly lower than those for RVI and VH/VV. The main characteristics of the SAR and NDVI time series—the maximum values, the dates of the maximum values, and the variability of these indices—were compared. The variabilities of the maximum values and dates of maximum values for DpRVI were lower than for RVI and VH/VV, whereas the variabilities of the maximum values and the dates of maximum values were comparable for DpRVI and NDVI. On the basis of the DpRVI index, classifications were carried out using seven machine learning methods (fine tree, quadratic discriminant, Gaussian naïve Bayes, fine k nearest neighbors or KNN, random under-sampling boosting or RUSBoost, random forest, and support vector machine) for experimental sites covering a total area of 1009.8 ha. The quadratic discriminant method yielded the best results, with a pixel classification accuracy of approximately 82% and a kappa value of 0.67. Overall, 90% of soybean, 74.1% of oat, 68.9% of buckwheat, and 57.6% of timothy grass pixels were correctly classified. At the field level, 94% of the fields included in the test dataset were correctly classified. The paper results show that the DpRVI can be used in cases where the NDVI is limited, allowing for the monitoring of phenological development and crop mapping. The research results can be used in the south of Khabarovsk Territory and in neighboring territories. Full article
(This article belongs to the Special Issue Remote Sensing in Land Management)
Show Figures

Figure 1

19 pages, 2329 KiB  
Review
Artificial Intelligence and Heart-Brain Connections: A Narrative Review on Algorithms Utilization in Clinical Practice
by Giuseppe Micali, Francesco Corallo, Maria Pagano, Fabio Mauro Giambò, Antonio Duca, Piercataldo D’Aleo, Anna Anselmo, Alessia Bramanti, Marina Garofano, Emanuela Mazzon, Placido Bramanti and Irene Cappadona
Healthcare 2024, 12(14), 1380; https://doi.org/10.3390/healthcare12141380 - 10 Jul 2024
Viewed by 153
Abstract
Cardiovascular and neurological diseases are a major cause of mortality and morbidity worldwide. Such diseases require careful monitoring to effectively manage their progression. Artificial intelligence (AI) offers valuable tools for this purpose through its ability to analyse data and identify predictive patterns. This [...] Read more.
Cardiovascular and neurological diseases are a major cause of mortality and morbidity worldwide. Such diseases require careful monitoring to effectively manage their progression. Artificial intelligence (AI) offers valuable tools for this purpose through its ability to analyse data and identify predictive patterns. This review evaluated the application of AI in cardiac and neurological diseases for their clinical impact on the general population. We reviewed studies on the application of AI in the neurological and cardiological fields. Our search was performed on the PubMed, Web of Science, Embase and Cochrane library databases. Of the initial 5862 studies, 23 studies met the inclusion criteria. The studies showed that the most commonly used algorithms in these clinical fields are Random Forest and Artificial Neural Network, followed by logistic regression and Support-Vector Machines. In addition, an ECG-AI algorithm based on convolutional neural networks has been developed and has been widely used in several studies for the detection of atrial fibrillation with good accuracy. AI has great potential to support physicians in interpretation, diagnosis, risk assessment and disease management. Full article
Show Figures

Figure 1

16 pages, 4010 KiB  
Article
A New Spectral Index for Monitoring Leaf Area Index of Winter Oilseed Rape (Brassica napus L.) under Different Coverage Methods and Nitrogen Treatments
by Hao Liu, Youzhen Xiang, Junying Chen, Yuxiao Wu, Ruiqi Du, Zijun Tang, Ning Yang, Hongzhao Shi, Zhijun Li and Fucang Zhang
Plants 2024, 13(14), 1901; https://doi.org/10.3390/plants13141901 - 10 Jul 2024
Viewed by 123
Abstract
The leaf area index (LAI) is a crucial physiological indicator of crop growth. This paper introduces a new spectral index to overcome angle effects in estimating the LAI of crops. This study quantitatively analyzes the relationship between LAI and multi-angle hyperspectral reflectance from [...] Read more.
The leaf area index (LAI) is a crucial physiological indicator of crop growth. This paper introduces a new spectral index to overcome angle effects in estimating the LAI of crops. This study quantitatively analyzes the relationship between LAI and multi-angle hyperspectral reflectance from the canopy of winter oilseed rape (Brassica napus L.) at various growth stages, nitrogen application levels and coverage methods. The angular stability of 16 traditional vegetation indices (VIs) for monitoring the LAI was tested under nine view zenith angles (VZAs). These multi-angle VIs were input into machine learning models including support vector machine (SVM), eXtreme gradient boosting (XGBoost), and Random Forest (RF) to determine the optimal monitoring strategy. The results indicated that the back-scattering direction outperformed the vertical and forward-scattering direction in terms of monitoring the LAI. In the solar principal plane (SPP), EVI-1 and REP showed angle stability and high accuracy in monitoring the LAI. Nevertheless, this relationship was influenced by experimental conditions and growth stages. Compared with traditional VIs, the observation perspective insensitivity vegetation index (OPIVI) had the highest correlation with the LAI (r = 0.77–0.85). The linear regression model based on single-angle OPIVI was most accurate at −15° (R2 = 0.71). The LAI monitoring achieved using a multi-angle OPIVI-RF model had the higher accuracy, with an R2 of 0.77 and with a root mean square error (RMSE) of 0.38 cm2·cm−2. This study provides valuable insights for selecting VIs that overcome the angle effect in future drone and satellite applications. Full article
(This article belongs to the Special Issue The Application of Spectral Techniques in Agriculture and Forestry)
Show Figures

Figure 1

20 pages, 2441 KiB  
Article
A Novel Method on Recognizing Drum Load of Elastic Tooth Drum Pepper Harvester Based on CEEMDAN-KPCA-SVM
by Xinyu Zhang, Xinyan Qin, Jin Lei, Zhiyuan Zhai, Jianglong Zhang and Zhi Wang
Agriculture 2024, 14(7), 1114; https://doi.org/10.3390/agriculture14071114 - 10 Jul 2024
Viewed by 139
Abstract
The operational complexities of the elastic tooth drum pepper harvester (ETDPH), characterized by variable drum loads that are challenging to recognize due to varying pepper densities, significantly impact pepper loss rates and mechanical damage. This study proposes a novel method integrating complete ensemble [...] Read more.
The operational complexities of the elastic tooth drum pepper harvester (ETDPH), characterized by variable drum loads that are challenging to recognize due to varying pepper densities, significantly impact pepper loss rates and mechanical damage. This study proposes a novel method integrating complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), kernel principal component analysis (KPCA), and a support vector machine (SVM) to enhance drum load recognition. The method consists of three principal steps: the initial experiments with ETDPHs to identify the critical factors affecting drum load and to formulate classification criteria; the development of a CEEMDAN-KPCA-SVM model for ETDPH drum load recognition, where drum spindle torque signals are processed by CEEMDAN for decomposition and reconstruction, followed by feature extraction and dimensionality reduction via KPCA to refine the model’s accuracy and training efficiency; and evaluation of the model’s performance on real datasets, highlighting the improvements brought by CEEMDAN and KPCA, as well as comparative analysis with other machine learning models. The results describe four load conditions—no load (mass of pepper intake (MOPI) = 0 kg/s), low load (0 < MOPI ≤ 0.658 kg/s), normal load (0.658 < MOPI ≤ 1.725 kg/s), and high load (MOPI > 1.725 kg/s)—with the CEEMDAN-KPCA-SVM model achieving 100% accuracy on both training and test sets, outperforming the standalone SVM by 6% and 12.5%, respectively. Additionally, it reduced the training time to 2.88 s, a 10.9% decrease, and reduced the prediction time to 0.0001 s, a 63.6% decrease. Comparative evaluations confirmed the superiority of the CEEMDAN-KPCA-SVM model over random forest (RF) and gradient boosting machine (GBM) in classification tasks. The synergistic application of CEEMDAN and KPCA significantly improved the accuracy and operational efficiency of the SVM model, providing valuable insights for load recognition and adaptive control of ETDPH drum parameters. Full article
(This article belongs to the Section Agricultural Technology)
Show Figures

Figure 1

12 pages, 1318 KiB  
Article
The Prediction of Pectin Viscosity Using Machine Learning Based on Physical Characteristics—Case Study: Aglupectin HS-MR
by Przemysław Siejak, Krzysztof Przybył, Łukasz Masewicz, Katarzyna Walkowiak, Ryszard Rezler and Hanna Maria Baranowska
Sustainability 2024, 16(14), 5877; https://doi.org/10.3390/su16145877 - 10 Jul 2024
Viewed by 163
Abstract
In the era of technology development, the optimization of production processes, quality control and at the same time increasing production efficiency without wasting food, artificial intelligence is becoming an alternative tool supporting many decision-making processes. The work used modern machine learning and physical [...] Read more.
In the era of technology development, the optimization of production processes, quality control and at the same time increasing production efficiency without wasting food, artificial intelligence is becoming an alternative tool supporting many decision-making processes. The work used modern machine learning and physical analysis tools to evaluate food products (pectins). Various predictive models have been presented to estimate the viscosity of pectin. Based on the physical analyses, the characteristics of the food product were isolated, including L*a*b* color, concentration, conductance and pH. Prediction was determined using the determination index and loss function for individual machine learning algorithms. As a result of the work, it turned out that the most effective estimation of pectin viscosity was using Decision Tree (R2 = 0.999) and Random Forest (R2 = 0.998). In the future, the prediction of pectin properties in terms of viscosity recognition may be significantly perceived, especially in the food and pharmaceutical industries. Predicting the natural pectin substrate may contribute to improving quality, increasing efficiency and at the same time reducing losses of the obtained final product. Full article
(This article belongs to the Section Sustainable Food)
Show Figures

Figure 1

18 pages, 11633 KiB  
Article
Mapping the Future: Revealing Habitat Preferences and Patterns of the Endangered Chilean Dolphin in Seno Skyring, Patagonia
by Liliana Perez, Yenny Cuellar, Jorge Gibbons, Elias Pinilla Matamala, Simon Demers and Juan Capella
Biology 2024, 13(7), 514; https://doi.org/10.3390/biology13070514 - 10 Jul 2024
Viewed by 150
Abstract
Species distribution modeling helps understand how environmental factors influence species distribution, creating profiles to predict presence in unexplored areas and assess ecological impacts. This study examined the habitat use and population ecology of the Chilean dolphin in Seno Skyring, Chilean Patagonia. We used [...] Read more.
Species distribution modeling helps understand how environmental factors influence species distribution, creating profiles to predict presence in unexplored areas and assess ecological impacts. This study examined the habitat use and population ecology of the Chilean dolphin in Seno Skyring, Chilean Patagonia. We used three models—random forest (RF), generalized linear model (GLM), and artificial neural network (ANN)—to predict dolphin distribution based on environmental and biotic data like water temperature, salinity, and fish farm density. Our research has determined that the RF model is the most precise tool for predicting the habitat preferences of Chilean dolphins. The results indicate that these dolphins are primarily located within six kilometers of the coast, strongly correlating with areas featuring numerous fish farms, sheltered waters close to the shore with river inputs, and shallow productive zones. This suggests a potential association between dolphin presence and fish-farming activities. These findings can guide targeted conservation measures, such as regulating fish-farming practices and protecting vital coastal areas to improve the survival prospects of the Chilean dolphin. Given the extensive fish-farming industry in Chile, this research highlights the need for greater knowledge and comprehensive conservation efforts to ensure the species’ long-term survival. By understanding and mitigating the impacts of fish farming and other human activities, we can better protect the habitat and well-being of Chilean dolphins. Full article
(This article belongs to the Section Conservation Biology and Biodiversity)
Show Figures

Figure 1

18 pages, 7108 KiB  
Article
Inversion of Soybean Net Photosynthetic Rate Based on UAV Multi-Source Remote Sensing and Machine Learning
by Zhen Lu, Wenbo Yao, Shuangkang Pei, Yuwei Lu, Heng Liang, Dong Xu, Haiyan Li, Lejun Yu, Yonggang Zhou and Qian Liu
Agronomy 2024, 14(7), 1493; https://doi.org/10.3390/agronomy14071493 - 10 Jul 2024
Viewed by 130
Abstract
Net photosynthetic rate (Pn) is a common indicator used to measure the efficiency of photosynthesis and growth conditions of plants. In this study, soybeans under different moisture gradients were selected as the research objects. Fourteen vegetation indices (VIS) and five canopy structure characteristics [...] Read more.
Net photosynthetic rate (Pn) is a common indicator used to measure the efficiency of photosynthesis and growth conditions of plants. In this study, soybeans under different moisture gradients were selected as the research objects. Fourteen vegetation indices (VIS) and five canopy structure characteristics (CSC) (plant height (PH), volume (V), canopy cover (CC), canopy length (L), and canopy width (W)) were obtained using an unmanned aerial vehicle (UAV) equipped with three different sensors (visible, multispectral, and LiDAR) at five growth stages of soybeans. Soybean Pn was simultaneously measured manually in the field. The variability of soybean Pn under different conditions and the trend change of CSC under different moisture gradients were analysed. VIS, CSC, and their combinations were used as input features, and four machine learning algorithms (multiple linear regression, random forest, Extreme gradient-boosting tree regression, and ridge regression) were used to perform soybean Pn inversion. The results showed that, compared with the inversion model using VIS or CSC as features alone, the inversion model using the combination of VIS and CSC features showed a significant improvement in the inversion accuracy at all five stages. The highest accuracy (R2 = 0.86, RMSE = 1.73 µmol m−2 s−1, RPD = 2.63) was achieved 63 days after sowing (DAS63). Full article
Show Figures

Figure 1

13 pages, 2130 KiB  
Article
Exploring Pattern of Relapse in Pediatric Patients with Acute Lymphocytic Leukemia and Acute Myeloid Leukemia Undergoing Stem Cell Transplant Using Machine Learning Methods
by David Shyr, Bing M. Zhang, Gopin Saini and Simon C. Brewer
J. Clin. Med. 2024, 13(14), 4021; https://doi.org/10.3390/jcm13144021 - 10 Jul 2024
Viewed by 122
Abstract
Background. Leukemic relapse remains the primary cause of treatment failure and death after allogeneic hematopoietic stem cell transplant. Changes in post-transplant donor chimerism have been identified as a predictor of relapse. A better predictive model of relapse incorporating donor chimerism has the [...] Read more.
Background. Leukemic relapse remains the primary cause of treatment failure and death after allogeneic hematopoietic stem cell transplant. Changes in post-transplant donor chimerism have been identified as a predictor of relapse. A better predictive model of relapse incorporating donor chimerism has the potential to improve leukemia-free survival by allowing earlier initiation of post-transplant treatment on individual patients. We explored the use of machine learning, a suite of analytical methods focusing on pattern recognition, to improve post-transplant relapse prediction. Methods. Using a cohort of 63 pediatric patients with acute lymphocytic leukemia (ALL) and 46 patients with acute myeloid leukemia (AML) who underwent stem cell transplant at a single institution, we built predictive models of leukemic relapse with both pre-transplant and post-transplant patient variables (specifically lineage-specific chimerism) using the random forest classifier. Local Interpretable Model-Agnostic Explanations, an interpretable machine learning tool was used to confirm our random forest classification result. Results. Our analysis showed that a random forest model using these hyperparameter values achieved 85% accuracy, 85% sensitivity, 89% specificity for ALL, while for AML 81% accuracy, 75% sensitivity, and 100% specificity at predicting relapses within 24 months post-HSCT in cross validation. The Local Interpretable Model-Agnostic Explanations tool was able to confirm many variables that the random forest classifier identified as important for the relapse prediction. Conclusions. Machine learning methods can reveal the interaction of different risk factors of post-transplant leukemic relapse and robust predictions can be obtained even with a modest clinical dataset. The random forest classifier distinguished different important predictive factors between ALL and AML in our relapse models, consistent with previous knowledge, lending increased confidence to adopting machine learning prediction to clinical management. Full article
(This article belongs to the Special Issue Advances in Pediatric Leukemia)
Show Figures

Figure 1

21 pages, 3527 KiB  
Article
Quantifying Predictive Uncertainty and Feature Selection in River Bed Load Estimation: A Multi-Model Machine Learning Approach with Particle Swarm Optimization
by Xuan-Hien Le, Trung Tin Huynh, Mingeun Song and Giha Lee
Water 2024, 16(14), 1945; https://doi.org/10.3390/w16141945 - 10 Jul 2024
Viewed by 169
Abstract
This study presents a comprehensive multi-model machine learning (ML) approach to predict river bed load, addressing the challenge of quantifying predictive uncertainty in fluvial geomorphology. Six ML models—random forest (RF), categorical boosting (CAT), extra tree regression (ETR), gradient boosting machine (GBM), Bayesian regression [...] Read more.
This study presents a comprehensive multi-model machine learning (ML) approach to predict river bed load, addressing the challenge of quantifying predictive uncertainty in fluvial geomorphology. Six ML models—random forest (RF), categorical boosting (CAT), extra tree regression (ETR), gradient boosting machine (GBM), Bayesian regression model (BRM), and K-nearest neighbors (KNNs)—were thoroughly evaluated across several performance metrics like root mean square error (RMSE), and correlation coefficient (R). To enhance model training and optimize performance, particle swarm optimization (PSO) was employed for hyperparameter tuning across all the models, leveraging its capability to efficiently explore complex hyperparameter spaces. Our findings indicated that RF, GBM, CAT, and ETR demonstrate superior predictive performance (R score > 0.936), benefiting significantly from PSO. In contrast, BRM displayed lower performance (0.838), indicating challenges with Bayesian approaches. The feature importance analysis, including permutation feature and SHAP values, highlighted the non-linear interdependencies between the variables, with river discharge (Q), bed slope (S), and flow width (W) being the most influential. This study also examined the specific impact of individual variables on model performance by adding and excluding individual variables, which is particularly meaningful when choosing input variables for the model, especially in limited data conditions. Uncertainty quantification through Monte Carlo simulations highlighted the enhanced predictability and reliability of models with larger datasets. The correlation between increased training data and improved model precision was evident in the consistent rise in mean R scores and reduction in standard deviations as the sample size increased. This research underscored the potential of advanced ensemble methods and PSO to mitigate the limitations of single-predictor models and exploit collective model strengths, thereby improving the reliability of predictions in river bed load estimation. The insights from this study provide a valuable framework for future research directions focused on optimizing ensemble configurations for hydro-dynamic modeling. Full article
(This article belongs to the Special Issue Application of Machine Learning in Hydrologic Sciences)
Show Figures

Figure 1

Back to TopTop