Svoboda | Graniru | BBC Russia | Golosameriki | Facebook
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (5,113)

Search Parameters:
Keywords = ensemble models

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 1946 KiB  
Article
Data-Driven PM2.5 Exposure Prediction in Wildfire-Prone Regions and Respiratory Disease Mortality Risk Assessment
by Sadegh Khanmohammadi, Mehrdad Arashpour, Milad Bazli and Parisa Farzanehfar
Fire 2024, 7(8), 277; https://doi.org/10.3390/fire7080277 (registering DOI) - 7 Aug 2024
Abstract
Wildfires generate substantial smoke containing fine particulate matter (PM2.5) that adversely impacts health. This study develops machine learning models integrating pre-wildfire factors like weather and fuel conditions with post-wildfire health impacts to provide a holistic understanding of smoke exposure risks. Various [...] Read more.
Wildfires generate substantial smoke containing fine particulate matter (PM2.5) that adversely impacts health. This study develops machine learning models integrating pre-wildfire factors like weather and fuel conditions with post-wildfire health impacts to provide a holistic understanding of smoke exposure risks. Various data-driven models including Support Vector Regression, Multi-layer Perceptron, and three tree-based ensemble algorithms (Random Forest, Extreme Gradient Boosting (XGBoost), and Natural Gradient Boosting (NGBoost)) are evaluated in this study. Ensemble models effectively predict PM2.5 levels based on temperature, humidity, wind, and fuel moisture, revealing the significant roles of radiation, temperature, and moisture. Further modelling links smoke exposure to deaths from chronic obstructive pulmonary disease (COPD) and lung cancer using age, sex, and pollution type as inputs. Ambient pollution is the primary driver of COPD mortality, while age has a greater influence on lung cancer deaths. This research advances atmospheric and health impact understanding, aiding forest fire prevention and management. Full article
(This article belongs to the Special Issue Forest Fuel Treatment and Fire Risk Assessment)
Show Figures

Figure 1

20 pages, 18214 KiB  
Article
Optimized Landslide Susceptibility Mapping and Modelling Using the SBAS-InSAR Coupling Model
by Xueling Wu, Xiaoshuai Qi, Bo Peng and Junyang Wang
Remote Sens. 2024, 16(16), 2873; https://doi.org/10.3390/rs16162873 - 6 Aug 2024
Viewed by 241
Abstract
Landslide susceptibility mapping (LSM) can accurately estimate the location and probability of landslides. An effective approach for precise LSM is crucial for minimizing casualties and damage. The existing LSM methods primarily rely on static indicators, such as geomorphology and hydrology, which are closely [...] Read more.
Landslide susceptibility mapping (LSM) can accurately estimate the location and probability of landslides. An effective approach for precise LSM is crucial for minimizing casualties and damage. The existing LSM methods primarily rely on static indicators, such as geomorphology and hydrology, which are closely associated with geo-environmental conditions. However, landslide hazards are often characterized by significant surface deformation. The Small Baseline Subset-Interferometric Synthetic Aperture Radar (SBAS-InSAR) technology plays a pivotal role in detecting and characterizing surface deformation. This work endeavors to assess the accuracy of SBAS-InSAR coupled with ensemble learning for LSM. Within this research, the study area was Shiyan City, and 12 static evaluation factors were selected as input variables for the ensemble learning models to compute landslide susceptibility. The Random Forest (RF) model demonstrates superior accuracy compared to other ensemble learning models, including eXtreme Gradient Boosting, Logistic Regression, Gradient Boosting Decision Tree, and K-Nearest Neighbor. Furthermore, SBAS-InSAR was utilized to obtain surface deformation rates both in the vertical direction and along the line of sight of the satellite. The former is used as a dynamic characteristic factor, while the latter is combined with the evaluation results of the RF model to create a landslide susceptibility optimization matrix. Comparing the precision of two methods for refining LSM results, it was found that the method integrating static and dynamic factors produced a more rational and accurate landslide susceptibility map. Full article
Show Figures

Figure 1

20 pages, 8199 KiB  
Article
A Similarity Clustering Deformation Prediction Model Based on GNSS/Accelerometer Time-Frequency Analysis
by Houzeng Han, Rongheng Li, Tao Xu, Meng Du, Wenxuan Ma and He Wu
Appl. Sci. 2024, 14(16), 6889; https://doi.org/10.3390/app14166889 - 6 Aug 2024
Viewed by 316
Abstract
Structural monitoring is crucial for assessing structural health, and high-precision deformation prediction can provide early warnings for safety monitoring. To address the issue of low prediction accuracy caused by the non-stationary and nonlinear characteristics of deformation sequences, this paper proposes a similarity clustering [...] Read more.
Structural monitoring is crucial for assessing structural health, and high-precision deformation prediction can provide early warnings for safety monitoring. To address the issue of low prediction accuracy caused by the non-stationary and nonlinear characteristics of deformation sequences, this paper proposes a similarity clustering (SC) deformation prediction model based on GNSS/accelerometer time-frequency analysis. First, the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm is used to decompose the original monitoring data, and the time-frequency characteristic correlations of the deformation data are established. Then, similarity clustering is conducted for the monitoring sub-sequences based on their frequency domain characteristics, and clustered sequences are combined subsequently. Finally, the Long Short-Term Memory (LSTM) model is used to separately predict GNSS displacement and acceleration with clustered time series, and the overall deformation displacement is reconstructed based on the predicted GNSS displacement and acceleration-derived displacement. A shake table simulation experiment was conducted to validate the feasibility and performance of the proposed CEEMDAN-SC-LSTM model. A duration of 5 s displacement prediction is analyzed after 153 s of monitoring data training. The results demonstrate that the root mean square error (RMSE) of predicted displacement is 0.011 m with the proposed model, which achieves an improvement of 64.45% and 61.51% in comparison to the CEEMDAN-LSTM and LSTM models, respectively. The acceleration predictions also show an improvement of 96.49% and 95.58%, respectively, the RMSE of the predicted acceleration-reconstructed displacement is less than 1 mm, with a reconstruction similarity of over 99%. The overall displacement reconstruction similarity can reach over 95%. Full article
Show Figures

Figure 1

25 pages, 17857 KiB  
Article
Spatial Prediction of Organic Matter Quality in German Agricultural Topsoils
by Ali Sakhaee, Thomas Scholten, Ruhollah Taghizadeh-Mehrjardi, Mareike Ließ and Axel Don
Agriculture 2024, 14(8), 1298; https://doi.org/10.3390/agriculture14081298 - 6 Aug 2024
Viewed by 241
Abstract
Soil organic matter (SOM) and the ratio of soil organic carbon to total nitrogen (C/N ratio) are fundamental to the ecosystem services provided by soils. Therefore, understanding the spatial distribution and relationships between the SOM components mineral-associated organic matter (MAOM), particulate organic matter [...] Read more.
Soil organic matter (SOM) and the ratio of soil organic carbon to total nitrogen (C/N ratio) are fundamental to the ecosystem services provided by soils. Therefore, understanding the spatial distribution and relationships between the SOM components mineral-associated organic matter (MAOM), particulate organic matter (POM), and C/N ratio is crucial. Three ensemble machine learning models were trained to obtain spatial predictions of the C/N ratio, MAOM, and POM in German agricultural topsoil (0–10 cm). Parameter optimization and model evaluation were performed using nested cross-validation. Additionally, a modification to the regressor chain was applied to capture and interpret the interactions among the C/N ratio, MAOM, and POM. The ensemble models yielded mean absolute percent errors (MAPEs) of 8.2% for the C/N ratio, 14.8% for MAOM, and 28.6% for POM. Soil type, pedo-climatic region, hydrological unit, and soilscapes were found to explain 75% of the variance in MAOM and POM, and 50% in the C/N ratio. The modified regressor chain indicated a nonlinear relationship between the C/N ratio and SOM due to the different decomposition rates of SOM as a result of variety in its nutrient quality. These spatial predictions enhance the understanding of soil properties’ distribution in Germany. Full article
Show Figures

Figure 1

15 pages, 507 KiB  
Article
Automatic Age and Gender Recognition Using Ensemble Learning
by Ergün Yücesoy
Appl. Sci. 2024, 14(16), 6868; https://doi.org/10.3390/app14166868 - 6 Aug 2024
Viewed by 189
Abstract
The use of speech-based recognition technologies in human–computer interactions is increasing daily. Age and gender recognition, one of these technologies, is a popular research topic used directly or indirectly in many applications. In this research, a new age and gender recognition approach based [...] Read more.
The use of speech-based recognition technologies in human–computer interactions is increasing daily. Age and gender recognition, one of these technologies, is a popular research topic used directly or indirectly in many applications. In this research, a new age and gender recognition approach based on the ensemble of different machine learning algorithms is proposed. In the study, five different classifiers, namely KNN, SVM, LR, RF, and E-TREE, are used as base-level classifiers and the majority voting and stacking methods are used to create the ensemble models. First, using MFCC features, five base-level classifiers are created and the performance of each model is evaluated. Then, starting from the one with the highest performance, these classifiers are combined and ensemble models are created. In the study, eight different ensemble models are created and the performances of each are examined separately. The experiments conducted with the Turkish subsection of the Mozilla Common Voice dataset show that the ensemble models increase the recognition accuracy, and the highest accuracy of 97.41% is achieved with the ensemble model created by stacking five classifiers (SVM, E-TREE, RF, KNN, and LR). According to this result, the proposed ensemble model achieves superior accuracy compared to similar studies in recognizing age and gender from speech signals. Full article
(This article belongs to the Special Issue Advances and Applications of Audio and Speech Signal Processing)
Show Figures

Figure 1

16 pages, 948 KiB  
Article
Masketeer: An Ensemble-Based Pseudonymization Tool with Entity Recognition for German Unstructured Medical Free Text
by Martin Baumgartner, Karl Kreiner, Fabian Wiesmüller, Dieter Hayn, Christian Puelacher and Günter Schreier
Future Internet 2024, 16(8), 281; https://doi.org/10.3390/fi16080281 - 6 Aug 2024
Viewed by 175
Abstract
Background: The recent rise of large language models has triggered renewed interest in medical free text data, which holds critical information about patients and diseases. However, medical free text is also highly sensitive. Therefore, de-identification is typically required but is complicated since medical [...] Read more.
Background: The recent rise of large language models has triggered renewed interest in medical free text data, which holds critical information about patients and diseases. However, medical free text is also highly sensitive. Therefore, de-identification is typically required but is complicated since medical free text is mostly unstructured. With the Masketeer algorithm, we present an effective tool to de-identify German medical text. Methods: We used an ensemble of different masking classes to remove references to identifiable data from over 35,000 clinical notes in accordance with the HIPAA Safe Harbor Guidelines. To retain additional context for readers, we implemented an entity recognition scheme and corpus-wide pseudonymization. Results: The algorithm performed with a sensitivity of 0.943 and specificity of 0.933. Further performance analyses showed linear runtime complexity (O(n)) with both increasing text length and corpus size. Conclusions: In the future, large language models will likely be able to de-identify medical free text more effectively and thoroughly than handcrafted rules. However, such gold-standard de-identification tools based on large language models are yet to emerge. In the current absence of such, we hope to provide best practices for a robust rule-based algorithm designed with expert domain knowledge. Full article
(This article belongs to the Special Issue eHealth and mHealth)
Show Figures

Figure 1

12 pages, 5278 KiB  
Article
In 50 Shades of Orange: Germany’s Photovoltaic Power Generation Landscape
by Reinhold Lehneis and Daniela Thrän
Energies 2024, 17(16), 3871; https://doi.org/10.3390/en17163871 - 6 Aug 2024
Viewed by 224
Abstract
Spatiotemporally resolved data on photovoltaic (PV) power generation are very helpful to analyze the multiple impacts of this variable renewable energy on regional and local scales. In the absence of such disaggregated data for Germany, numerical simulations are needed to obtain the electricity [...] Read more.
Spatiotemporally resolved data on photovoltaic (PV) power generation are very helpful to analyze the multiple impacts of this variable renewable energy on regional and local scales. In the absence of such disaggregated data for Germany, numerical simulations are needed to obtain the electricity production from PV systems for a time period and region under study. This manuscript presents how a physical simulation model, which uses open access weather and plant data as input vectors, can be created. The developed PV model is then applied to an ensemble of approximately 1.95 million PV systems, consisting of ground-mounted and rooftop installations, in order to compute their electricity production in Germany for the year 2020. The resulting spatially aggregated time series closely matches the measured PV feed-in pattern of Germany throughout the simulated year. Such disaggregated data can be applied to investigate the German PV power generation landscape at various spatiotemporal levels, as each PV system is taken into account with its technical data and the weather conditions at its geo-location. Furthermore, the German PV power generation landscape is presented as detailed maps based on these simulation results, which can also be useful for many other scientific fields such as energy system modeling. Full article
(This article belongs to the Special Issue Advances in Solar Systems and Energy Efficiency: 2nd Edition)
Show Figures

Figure 1

15 pages, 547 KiB  
Article
An Explainable Deep Learning Approach for Stress Detection in Wearable Sensor Measurements
by Martin Karl Moser, Maximilian Ehrhart and Bernd Resch
Sensors 2024, 24(16), 5085; https://doi.org/10.3390/s24165085 - 6 Aug 2024
Viewed by 243
Abstract
Stress has various impacts on the health of human beings. Recent success in wearable sensor development, combined with advancements in deep learning to automatically detect features from raw data, opens several interesting applications related to detecting emotional states. Being able to accurately detect [...] Read more.
Stress has various impacts on the health of human beings. Recent success in wearable sensor development, combined with advancements in deep learning to automatically detect features from raw data, opens several interesting applications related to detecting emotional states. Being able to accurately detect stress-related emotional arousal in an acute setting can positively impact the imminent health status of humans, i.e., through avoiding dangerous locations in an urban traffic setting. This work proposes an explainable deep learning methodology for the automatic detection of stress in physiological sensor data, recorded through a non-invasive wearable sensor device, the Empatica E4 wristband. We propose a Long-Short Term-Memory (LSTM) network, extended through a Deep Generative Ensemble of conditional GANs (LSTM DGE), to deal with the low data regime of sparsely labeled sensor measurements. As explainability is often a main concern of deep learning models, we leverage Integrated Gradients (IG) to highlight the most essential features used by the model for prediction and to compare the results to state-of-the-art expert-based stress-detection methodologies in terms of precision, recall, and interpretability. The results show that our LSTM DGE outperforms the state-of-the-art algorithm by 3 percentage points in terms of recall, and 7.18 percentage points in terms of precision. More importantly, through the use of Integrated Gradients as a layer of explainability, we show that there is a strong overlap between model-derived stress features for electrodermal activity and existing literature, which current state-of-the-art stress detection systems in medical research and psychology are based on. Full article
Show Figures

Figure 1

12 pages, 2934 KiB  
Article
Streamlined Deep Learning Models for Move Prediction in Go-Game
by Ying-Chih Lin and Yu-Chen Huang
Electronics 2024, 13(15), 3093; https://doi.org/10.3390/electronics13153093 - 5 Aug 2024
Viewed by 229
Abstract
Due to the complexity of search space and move evaluation, the game of Go has been a long-standing challenge for artificial intelligence (AI) to achieve a high level of proficiency. It was not until DeepMind proposed the deep neural network and tree search [...] Read more.
Due to the complexity of search space and move evaluation, the game of Go has been a long-standing challenge for artificial intelligence (AI) to achieve a high level of proficiency. It was not until DeepMind proposed the deep neural network and tree search algorithm AlphaGo in 2014 that an efficient learning algorithm was developed, marking a significant milestone in AI technology. In light of the key technologies in AI Computer Go, this work examines move prediction across different Go rankings and sophisticatedly develops two deep learning models by combining and extending the feature extraction methods of AlphaGo. Specifically, effective modules for neural networks are proposed to guide learning through complicated Go situations based on the Inception module in GoogLeNet and the Convolutional Block Attention Module (CBAM). Subsequently, the two models are combined by ensemble learning to improve generalization, and these streamlined models significantly reduce the number of model parameters to the scale of one hundred thousand. Experimental results show that our models achieve prediction accuracies of 46.9% and 50.8% on two different Go datasets, outperforming conventional models by significant margins. This work not only advances AI development in the Go-game but also offers an innovative approach to related studies. Full article
(This article belongs to the Special Issue AI in Information Processing and Real-Time Communication)
Show Figures

Figure 1

18 pages, 4076 KiB  
Article
Deep Ensemble Learning-Based Sensor for Flotation Froth Image Recognition
by Xiaojun Zhou and Yiping He
Sensors 2024, 24(15), 5048; https://doi.org/10.3390/s24155048 - 4 Aug 2024
Viewed by 296
Abstract
Froth flotation is a widespread and important method for mineral separation, significantly influencing the purity and quality of extracted minerals. Traditionally, workers need to control chemical dosages by observing the visual characteristics of flotation froth, but this requires considerable experience and operational skills. [...] Read more.
Froth flotation is a widespread and important method for mineral separation, significantly influencing the purity and quality of extracted minerals. Traditionally, workers need to control chemical dosages by observing the visual characteristics of flotation froth, but this requires considerable experience and operational skills. This paper designs a deep ensemble learning-based sensor for flotation froth image recognition to monitor actual flotation froth working conditions, so as to assist operators in facilitating chemical dosage adjustments and achieve the industrial goals of promoting concentrate grade and mineral recovery. In our approach, training and validation data on flotation froth images are partitioned in K-fold cross validation, and deep neural network (DNN) based learners are generated through pre-trained DNN models in image-enhanced training data, in order to improve their generalization and robustness. Then, a membership function utilizing the performance information of the DNN-based learners during the validation is proposed to improve the recognition accuracy of the DNN-based learners. Subsequently, a technique for order preference by similarity to an ideal solution (TOPSIS) based on the F1 score is proposed to select the most probable working condition of flotation froth images through a decision matrix composed of the DNN-based learners’ predictions via a membership function, which is adopted to optimize the combination process of deep ensemble learning. The effectiveness and superiority of the designed sensor are verified in a real industrial gold–antimony froth flotation application. Full article
(This article belongs to the Special Issue Deep Learning Technology and Image Sensing: 2nd Edition)
Show Figures

Figure 1

17 pages, 2589 KiB  
Article
Adaptive Evolutionary Computing Ensemble Learning Model for Sentiment Analysis
by Xiao-Yang Liu, Kang-Qi Zhang, Giacomo Fiumara, Pasquale De Meo and Annamaria Ficara
Appl. Sci. 2024, 14(15), 6802; https://doi.org/10.3390/app14156802 (registering DOI) - 4 Aug 2024
Viewed by 332
Abstract
Standard machine learning and deep learning architectures have been widely used in the field of sentiment analysis, but their performance is unsatisfactory if the input texts are short (e.g., social media posts). Specifically, the accuracy of standard machine learning methods crucially depends on [...] Read more.
Standard machine learning and deep learning architectures have been widely used in the field of sentiment analysis, but their performance is unsatisfactory if the input texts are short (e.g., social media posts). Specifically, the accuracy of standard machine learning methods crucially depends on the richness and completeness of the features used to represent the texts, and in the case of short messages, it is often difficult to obtain high-quality features. Conversely, methods based on deep learning can achieve better expressiveness, but these methods are computationally demanding and often suffer from over-fitting. This paper proposes a new adaptive evolutionary computational integrated learning model (AdaECELM) to overcome the problems encountered by traditional machine learning and deep learning models in sentiment analysis for short texts. AdaECELM consists of three phases: feature selection, sub classifier training, and global integration learning. First, a grid search is used for feature extraction and selection of term frequency-inverse document frequency (TF-IDF). Second, cuckoo search (CS) is introduced to optimize the combined hyperparameters in the sub-classifier support vector machine (SVM). Finally, the training set is divided into different feature subsets for sub-classifier training, and then the trained sub-classifiers are integrated and learned using the AdaBoost integrated soft voting method. Extensive experiments were conducted on six real polar sentiment analysis data sets. The results show that the AdaECELM model outperforms the traditional ML comparison methods according to evaluation metrics such as accuracy, precision, recall, and F1-score in all cases, and we report an improvement in accuracy exceeding 4.5%, the second-best competitor. Full article
(This article belongs to the Special Issue Artificial Intelligence in Complex Networks (2nd Edition))
Show Figures

Figure 1

27 pages, 15447 KiB  
Article
High-Resolution Rainfall Estimation Using Ensemble Learning Techniques and Multisensor Data Integration
by Maulana Putra, Mohammad Syamsu Rosid and Djati Handoko
Sensors 2024, 24(15), 5030; https://doi.org/10.3390/s24155030 - 3 Aug 2024
Viewed by 372
Abstract
In Indonesia, the monitoring of rainfall requires an estimation system with a high resolution and wide spatial coverage because of the complexities of the rainfall patterns. This study built a rainfall estimation model for Indonesia through the integration of data from various instruments, [...] Read more.
In Indonesia, the monitoring of rainfall requires an estimation system with a high resolution and wide spatial coverage because of the complexities of the rainfall patterns. This study built a rainfall estimation model for Indonesia through the integration of data from various instruments, namely, rain gauges, weather radars, and weather satellites. An ensemble learning technique, specifically, extreme gradient boosting (XGBoost), was applied to overcome the sparse data due to the limited number of rain gauge points, limited weather radar coverage, and imbalanced rain data. The model includes bias correction of the satellite data to increase the estimation accuracy. In addition, the data from several weather radars installed in Indonesia were also combined. This research handled rainfall estimates in various rain patterns in Indonesia, such as seasonal, equatorial, and local patterns, with a high temporal resolution, close to real time. The validation was carried out at six points, namely, Bandar Lampung, Banjarmasin, Pontianak, Deli Serdang, Gorontalo, and Biak. The research results show good estimation accuracy, with respective values of 0.89, 0.91, 0.89, 0.9, 0.92, and 0.9, and root mean square error (RMSE) values of 2.75 mm/h, 2.57 mm/h, 3.08 mm/h, 2.64 mm/h, 1.85 mm/h, and 2.48 mm/h. Our research highlights the potential of this model to accurately capture diverse rainfall patterns in Indonesia at high spatial and temporal scales. Full article
(This article belongs to the Special Issue Atmospheric Precipitation Sensors)
Show Figures

Figure 1

17 pages, 786 KiB  
Article
A Parallel Approach to Enhance the Performance of Supervised Machine Learning Realized in a Multicore Environment
by Ashutosh Ghimire and Fathi Amsaad
Mach. Learn. Knowl. Extr. 2024, 6(3), 1840-1856; https://doi.org/10.3390/make6030090 - 2 Aug 2024
Viewed by 523
Abstract
Machine learning models play a critical role in applications such as image recognition, natural language processing, and medical diagnosis, where accuracy and efficiency are paramount. As datasets grow in complexity, so too do the computational demands of classification techniques. Previous research has achieved [...] Read more.
Machine learning models play a critical role in applications such as image recognition, natural language processing, and medical diagnosis, where accuracy and efficiency are paramount. As datasets grow in complexity, so too do the computational demands of classification techniques. Previous research has achieved high accuracy but required significant computational time. This paper proposes a parallel architecture for Ensemble Machine Learning Models, harnessing multicore CPUs to expedite performance. The primary objective is to enhance machine learning efficiency without compromising accuracy through parallel computing. This study focuses on benchmark ensemble models including Random Forest, XGBoost, ADABoost, and K Nearest Neighbors. These models are applied to tasks such as wine quality classification and fraud detection in credit card transactions. The results demonstrate that, compared to single-core processing, machine learning tasks run 1.7 times and 3.8 times faster for small and large datasets on quad-core CPUs, respectively. Full article
(This article belongs to the Section Learning)
Show Figures

Figure 1

21 pages, 433 KiB  
Article
FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models
by Josiel Delgadillo, Johnson Kinyua and Charles Mutigwe
Big Data Cogn. Comput. 2024, 8(8), 87; https://doi.org/10.3390/bdcc8080087 - 2 Aug 2024
Viewed by 353
Abstract
Predicting the directions of financial markets has been performed using a variety of approaches, and the large volume of unstructured data generated by traders and other stakeholders on social media microblog platforms provides unique opportunities for analyzing financial markets using additional perspectives. Pretrained [...] Read more.
Predicting the directions of financial markets has been performed using a variety of approaches, and the large volume of unstructured data generated by traders and other stakeholders on social media microblog platforms provides unique opportunities for analyzing financial markets using additional perspectives. Pretrained large language models (LLMs) have demonstrated very good performance on a variety of sentiment analysis tasks in different domains. However, it is known that sentiment analysis is a very domain-dependent NLP task that requires knowledge of the domain ontology, and this is particularly the case with the financial domain, which uses its own unique vocabulary. Recent developments in NLP and deep learning including LLMs have made it possible to generate actionable financial sentiments using multiple sources including financial news, company fundamentals, technical indicators, as well social media microblogs posted on platforms such as StockTwits and X (formerly Twitter). We developed a financial social media sentiment analyzer (FinSoSent), which is a domain-specific large language model for the financial domain that was pretrained on financial news articles and fine-tuned and tested using several financial social media corpora. We conducted a large number of experiments using different learning rates, epochs, and batch sizes to yield the best performing model. Our model outperforms current state-of-the-art FSA models based on over 860 experiments, demonstrating the efficacy and effectiveness of FinSoSent. We also conducted experiments using ensemble models comprising FinSoSent and the other current state-of-the-art FSA models used in this research, and a slight performance improvement was obtained based on majority voting. Based on the results obtained across all models in these experiments, the significance of this study is that it highlights the fact that, despite the recent advances of LLMs, sentiment analysis even in domain-specific contexts remains a difficult research problem. Full article
17 pages, 15935 KiB  
Article
Automatic Weight Redistribution Ensemble Model Based on Transfer Learning to Use in Leak Detection for the Power Industry
by Sungsoo Kwon, Seoyoung Jeon, Tae-Jin Park and Ji-Hoon Bae
Sensors 2024, 24(15), 4999; https://doi.org/10.3390/s24154999 - 2 Aug 2024
Viewed by 235
Abstract
Creating an effective deep learning technique for accurately diagnosing leak signals across diverse environments is crucial for integrating artificial intelligence (AI) into the power plant industry. We propose an automatic weight redistribution ensemble model based on transfer learning (TL) for detecting leaks in [...] Read more.
Creating an effective deep learning technique for accurately diagnosing leak signals across diverse environments is crucial for integrating artificial intelligence (AI) into the power plant industry. We propose an automatic weight redistribution ensemble model based on transfer learning (TL) for detecting leaks in diverse power plant environments, overcoming the challenges of site-specific AI methods. This innovative model processes time series acoustic data collected from multiple homogeneous sensors located at different positions into three-dimensional root-mean-square (RMS) and frequency volume features, enabling accurate leak detection. Utilizing a TL-driven, two-stage learning process, we first train residual-network-based models for each domain using these preprocessed features. Subsequently, these models are retrained in an ensemble for comprehensive leak detection across domains, with control weight ratios finely adjusted through a softmax score-based approach. The experiment results demonstrate that the proposed method effectively distinguishes low-level leaks and noise compared to existing techniques, even when the data available for model training are very limited. Full article
(This article belongs to the Section Industrial Sensors)
Show Figures

Figure 1

Back to TopTop