Svoboda | Graniru | BBC Russia | Golosameriki | Facebook
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,335)

Search Parameters:
Keywords = BERT

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
29 pages, 721 KiB  
Article
Multimodal Affective Communication Analysis: Fusing Speech Emotion and Text Sentiment Using Machine Learning
by Diego Resende Faria, Abraham Itzhak Weinberg and Pedro Paulo Ayrosa
Appl. Sci. 2024, 14(15), 6631; https://doi.org/10.3390/app14156631 (registering DOI) - 29 Jul 2024
Abstract
Affective communication, encompassing verbal and non-verbal cues, is crucial for understanding human interactions. This study introduces a novel framework for enhancing emotional understanding by fusing speech emotion recognition (SER) and sentiment analysis (SA). We leverage diverse features and both classical and deep learning [...] Read more.
Affective communication, encompassing verbal and non-verbal cues, is crucial for understanding human interactions. This study introduces a novel framework for enhancing emotional understanding by fusing speech emotion recognition (SER) and sentiment analysis (SA). We leverage diverse features and both classical and deep learning models, including Gaussian naive Bayes (GNB), support vector machines (SVMs), random forests (RFs), multilayer perceptron (MLP), and a 1D convolutional neural network (1D-CNN), to accurately discern and categorize emotions in speech. We further extract text sentiment from speech-to-text conversion, analyzing it using pre-trained models like bidirectional encoder representations from transformers (BERT), generative pre-trained transformer 2 (GPT-2), and logistic regression (LR). To improve individual model performance for both SER and SA, we employ an extended dynamic Bayesian mixture model (DBMM) ensemble classifier. Our most significant contribution is the development of a novel two-layered DBMM (2L-DBMM) for multimodal fusion. This model effectively integrates speech emotion and text sentiment, enabling the classification of more nuanced, second-level emotional states. Evaluating our framework on the EmoUERJ (Portuguese) and ESD (English) datasets, the extended DBMM achieves accuracy rates of 96% and 98% for SER, 85% and 95% for SA, and 96% and 98% for combined emotion classification using the 2L-DBMM, respectively. Our findings demonstrate the superior performance of the extended DBMM for individual modalities compared to individual classifiers and the 2L-DBMM for merging different modalities, highlighting the value of ensemble methods and multimodal fusion in affective communication analysis. The results underscore the potential of our approach in enhancing emotional understanding with broad applications in fields like mental health assessment, human–robot interaction, and cross-cultural communication. Full article
15 pages, 1314 KiB  
Article
Joint Extraction Method for Hydraulic Engineering Entity Relations Based on Multi-Features
by Yang Liu, Xingzhi Wang, Xuemei Liu, Zehong Ren, Yize Wang and Qianqian Cai
Electronics 2024, 13(15), 2979; https://doi.org/10.3390/electronics13152979 - 28 Jul 2024
Viewed by 325
Abstract
During the joint extraction of entity and relationship from the operational management data of hydraulic engineering, complex sentences containing multiple triplets and overlapping entity relations often arise. However, traditional joint extraction models suffer from a single-feature representation approach, which hampers the effectiveness of [...] Read more.
During the joint extraction of entity and relationship from the operational management data of hydraulic engineering, complex sentences containing multiple triplets and overlapping entity relations often arise. However, traditional joint extraction models suffer from a single-feature representation approach, which hampers the effectiveness of entity relation extraction in complex sentences within hydraulic engineering datasets. To address this issue, this study proposes a multi-feature joint entity relation extraction method based on global context mechanism and graph convolutional neural networks. This method builds upon the Bidirectional Encoder Representations from Transformers (BERT) pre-trained model and utilizes a bidirectional gated recurrent unit (BiGRU) and global context mechanism (GCM) to supplement the contextual and global features of sentences. Subsequently, a graph convolutional network (GCN) based on syntactic dependencies is employed to learn inter-word dependency features, enhancing the model’s knowledge representation capabilities for complex sentences. Experimental results demonstrate the effectiveness of the proposed model in the joint extraction task on hydraulic engineering datasets. The precision, recall, and F1-score are 86.5%, 84.1%, and 85.3%, respectively, all outperforming the baseline model. Full article
(This article belongs to the Topic Artificial Intelligence Models, Tools and Applications)
Show Figures

Figure 1

24 pages, 896 KiB  
Article
Comparative Analysis of SAAS Model and NPC Integration for Enhancing VR Shopping Experiences
by Surasachai Doungtap, Jenq-Haur Wang and Varinya Phanichraksaphong
Appl. Sci. 2024, 14(15), 6573; https://doi.org/10.3390/app14156573 - 27 Jul 2024
Viewed by 227
Abstract
This article examines the incorporation of the Shopping Assistance Automatic Suggestion (SAAS) model into Virtual Reality (VR) environments in order to improve the online shopping experience. The SAAS model employs sophisticated deep learning methods to offer customized product recommendations, which are conveyed by [...] Read more.
This article examines the incorporation of the Shopping Assistance Automatic Suggestion (SAAS) model into Virtual Reality (VR) environments in order to improve the online shopping experience. The SAAS model employs sophisticated deep learning methods to offer customized product recommendations, which are conveyed by non-player characters (NPCs) via voice-based interactions. Our goal is to develop an interactive shopping experience that replicates real-life interactions by integrating AI-powered recommendations with immersive VR technology. We gather and standardize data from several open commerce databases, such as Amazon Product and Customer Reviews. The SAAS model, in conjunction with GPT-3, BERT, and T5, undergoes training and testing to evaluate its effectiveness across multiple criteria. The results demonstrate that the SAAS model surpasses other models in delivering contextually aware and pertinent recommendations. The integration process outlines the specific steps involved in capturing, processing, and transforming user interactions in virtual reality (VR) into vocal suggestions provided by non-player characters (NPCs). This strategy improves customization and utilizes the immersive features of virtual reality to effectively engage people. The results of our research establish a higher standard for e-commerce, with the goal of enhancing the user experience of online purchasing by making it more instinctive, engaging, and pleasurable. Full article
15 pages, 6423 KiB  
Article
Sustainability in Leadership: The Implicit Associations of the First-Person Pronouns and Leadership Effectiveness Based on Word Embedding Association Test
by Qu Yao, Yingjie Zheng and Jianhang Chen
Sustainability 2024, 16(15), 6403; https://doi.org/10.3390/su16156403 - 26 Jul 2024
Viewed by 278
Abstract
The first-person pronoun is an indispensable element of the communication process. Meanwhile, leadership effectiveness, as the result of leaders’ leadership work, is the key to the sustainable development of leaders and corporations. However, due to the constraints of traditional methods and sample bias, [...] Read more.
The first-person pronoun is an indispensable element of the communication process. Meanwhile, leadership effectiveness, as the result of leaders’ leadership work, is the key to the sustainable development of leaders and corporations. However, due to the constraints of traditional methods and sample bias, it is challenging to accurately measure and validate the relationship between first-person pronouns and leadership effectiveness at the implicit level. Word Embedding Association Test (WEAT) measures the relative degree of association between words in natural language by calculating the difference in word similarity. This study employs the word and sentence vector indicators of WEAT to investigate the implicit relationship between first-person pronouns and leadership effectiveness. The word vector analyses of the Beijing Normal University word vector database and Google News word vector database demonstrate that the cosine similarity and semantic similarity of “we-leadership effectiveness” are considerably greater than that of “I-leadership effectiveness”. Furthermore, the sentence vector analyses of the Chinese Wikipedia BERT model corroborate this relationship. In conclusion, the results of a machine learning-based WEAT verified the relationship between first-person plural pronouns and leadership effectiveness. This suggests that when leaders prefer to use “we”, they are perceived to be more effective. Full article
Show Figures

Figure 1

16 pages, 1560 KiB  
Article
SSuieBERT: Domain Adaptation Model for Chinese Space Science Text Mining and Information Extraction
by Yunfei Liu, Shengyang Li, Yunziwei Deng, Shiyi Hao and Linjie Wang
Electronics 2024, 13(15), 2949; https://doi.org/10.3390/electronics13152949 - 26 Jul 2024
Viewed by 240
Abstract
With the continuous exploration of space science, a large number of domain-related materials and scientific literature are constantly generated, mostly in the form of text, which contains rich and unexplored domain knowledge. Natural language processing technology has made rapid development and pre-trained language [...] Read more.
With the continuous exploration of space science, a large number of domain-related materials and scientific literature are constantly generated, mostly in the form of text, which contains rich and unexplored domain knowledge. Natural language processing technology has made rapid development and pre-trained language models provide promising information extraction tools. However, due to the strong professionalism of space science, there are many domain concepts and technical terms. Moreover, Chinese texts have complex language structures and word combinations, which may yield suboptimal performance by general pre-trained models such as BERT. In this work, we investigate how to adapt BERT to Chinese space science and propose the space science-aware pre-trained language model, namely, SSuieBERT. We validate it through downstream tasks such as named entity recognition, relation extraction, and event extraction, which can perform better than general models. To the best of our knowledge, our proposed SSuieBERT is the first pre-trained language model in space science, which can promote information extraction and knowledge discovery from space science texts. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

15 pages, 1178 KiB  
Article
Enhanced Text Classification with Label-Aware Graph Convolutional Networks
by Ming-Yen Lin, Hsuan-Chun Liu and Sue-Chen Hsush
Electronics 2024, 13(15), 2944; https://doi.org/10.3390/electronics13152944 - 25 Jul 2024
Viewed by 226
Abstract
Text classification is an important research field in text mining and natural language processing, gaining momentum with the growth of social networks. Despite the accuracy advancements made by deep learning models, existing graph neural network-based methods often overlook the implicit class information within [...] Read more.
Text classification is an important research field in text mining and natural language processing, gaining momentum with the growth of social networks. Despite the accuracy advancements made by deep learning models, existing graph neural network-based methods often overlook the implicit class information within texts. To address this gap, we propose a graph neural network model named LaGCN to improve classification accuracy. LaGCN utilizes the latent class information in texts, treating it as explicit class labels. It refines the graph convolution process by adding label-aware nodes to capture document–word, word–word, and word–class correlations for text classification. Comparing LaGCN with leading-edge models like HDGCN and BERT, our experiments on Ohsumed, Movie Review, 20 Newsgroups, and R8 datasets demonstrate its superiority. LaGCN outperformed existing methods, showing average accuracy improvements of 19.47%, 10%, 4.67%, and 0.4%, respectively. This advancement underscores the importance of integrating class information into graph neural networks, setting a new benchmark for text classification tasks. Full article
22 pages, 5434 KiB  
Article
A Sustainable Rental Price Prediction Model Based on Multimodal Input and Deep Learning—Evidence from Airbnb
by Hongbo Tan, Tian Su, Xusheng Wu, Pengzhan Cheng and Tianxiang Zheng
Sustainability 2024, 16(15), 6384; https://doi.org/10.3390/su16156384 - 25 Jul 2024
Viewed by 345
Abstract
In the accommodation field, reasonable pricing is crucial for hosts to maximize their profits and is also an essential factor influencing tourists’ tendency to choose. The link between price prediction and findings about the causal relationships between key indicators and prices is not [...] Read more.
In the accommodation field, reasonable pricing is crucial for hosts to maximize their profits and is also an essential factor influencing tourists’ tendency to choose. The link between price prediction and findings about the causal relationships between key indicators and prices is not well discussed in the literature. This research aims to identify comprehensive pricing determinants for sharing economy-based lodging services and utilize them for lodging price prediction. Utilizing data retrieved from InsideAirbnb, we recognized 50 variables classified into five categories: property functions, host attributes, reputation, location, and indispensable miscellaneous factors. Property descriptions and a featured image posted by hosts were also added as input to indicate price-influencing antecedents. We proposed a price prediction model by incorporating a fully connected neural network, the bidirectional encoder representations from transformers (BERT), and MobileNet with these data sources. The model was validated using 8380 Airbnb listings from Amsterdam, North Holland, Netherlands. Results reveal that our model outperforms other models with simple or fewer inputs, reaching a minimum MAPE (mean absolute percentage error) of 5.5682%. The novelty of this study is the application of multimodal input and multiple neural networks in forecasting sharing economy accommodation prices to boost predictive performance. The findings provide useful guidance on price setting for hosts in the sharing economy that is compliant with rental market regulations, which is particularly important for sustainable hospitality growth. Full article
Show Figures

Figure 1

13 pages, 1303 KiB  
Article
Research on Named Entity Recognition Based on Gated Interaction Mechanisms
by Bin Liu, Wanyuan Chen, Jialing Tao, Lei He and Dan Tang
Appl. Sci. 2024, 14(15), 6481; https://doi.org/10.3390/app14156481 - 25 Jul 2024
Viewed by 267
Abstract
Using long short-term memory (LSTM) networks to build a named entity recognition model is important for the task of named entity recognition. However, traditional memory networks lack a direct connection between input information and hidden states, leading to key feature information not being [...] Read more.
Using long short-term memory (LSTM) networks to build a named entity recognition model is important for the task of named entity recognition. However, traditional memory networks lack a direct connection between input information and hidden states, leading to key feature information not being fully learned during training and causing information loss. This paper designs a bidirectional variant of the long short-term memory (BiLSTM) network called Mogrifier-BiGRU, which combines the BERT pre-trained model and the conditional random field (CRF) network model. The Mogrifier gating interaction unit is set with more hyperparameters to achieve deep interaction of gating information, changing the relationship between input and hidden states so that they are no longer independent. By introducing more nonlinear transformations, the model can learn more complex input–output mapping relationships. Then, by combining Bayesian optimization with the improved Mogrifier-BiGRU network, the optimal hyperparameters of the model are automatically calculated. Experimental results show that the model method based on the gating interaction mechanism can effectively combine feature information, improving the accuracy of Chinese-named entity recognition. On the dataset, an F1-score of 85.42% was achieved, which is 7% higher than traditional methods and 10% higher for the accuracy of some entity recognition. Full article
Show Figures

Figure 1

19 pages, 2505 KiB  
Article
Automated Construction Method of Knowledge Graphs for Pirate Events
by Cunxiang Xie, Zhaogen Zhong and Limin Zhang
Appl. Sci. 2024, 14(15), 6482; https://doi.org/10.3390/app14156482 - 25 Jul 2024
Viewed by 221
Abstract
With the development of seaborne trade, international maritime crime is becoming increasingly complex. Detecting maritime threats by fusing the physical movement data from traditional physical sensors is not sufficient. Thus, soft data, including intelligence reports and news articles, need to be incorporated into [...] Read more.
With the development of seaborne trade, international maritime crime is becoming increasingly complex. Detecting maritime threats by fusing the physical movement data from traditional physical sensors is not sufficient. Thus, soft data, including intelligence reports and news articles, need to be incorporated into the situational awareness models of maritime threats. In this regard, this study developed an automated construction method of knowledge graphs for pirate events, which lays a foundation for subsequent maritime threat reasoning and situational awareness. First, a knowledge graph ontology model for pirate events was designed. Secondly, the BERT-BiLSTM-CRF model is proposed for named-entity recognition, and an entity linking algorithm based on distant learning and context attention mechanism is proposed to remove the conceptual ambiguity. Thirdly, based on traditional distant supervision relation extraction, which is based on sentence-level attention mechanism, bag-level and group-level attention mechanism methods are additionally proposed to further enhance the performance of distant supervision relation extraction. The proposed model demonstrated high performance in named-entity recognition, entity linking, and relation extraction tasks, with an overall F1-score of over 0.94 for NER and significant improvements in entity linking and relation extraction compared to traditional methods. The constructed knowledge graphs effectively support maritime threat reasoning and situational awareness, offering a substantial contribution to the field of maritime security. Our discussion highlights the model’s strengths and potential areas for future work, while the conclusion emphasizes the practical implications and the readiness of our approach for real-world applications. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

15 pages, 469 KiB  
Article
Using Enhanced Representations to Predict Medical Procedures from Clinician Notes
by Roberto Móstoles, Oscar Araque and Carlos Á. Iglesias
Appl. Sci. 2024, 14(15), 6431; https://doi.org/10.3390/app14156431 - 24 Jul 2024
Viewed by 296
Abstract
Nowadays, most health professionals use electronic health records to keep track of patients. To properly use and share these data, the community has relied on medical classification standards to represent patient information. However, the coding process is tedious and time-consuming, often limiting its [...] Read more.
Nowadays, most health professionals use electronic health records to keep track of patients. To properly use and share these data, the community has relied on medical classification standards to represent patient information. However, the coding process is tedious and time-consuming, often limiting its application. This paper proposes a novel feature representation method that considers the distinction between diagnoses and procedure codes, and applies this to the task of medical procedure code prediction. Diagnosis codes are combined with text annotations, and the result is then used as input to a downstream procedure code prediction task. Various diagnosis code representations are considered by exploiting a code hierarchy. Furthermore, different text representation strategies are also used, including embeddings from language models. Finally, the method was evaluated using the MIMIC-III database. Our experiments showed improved performance in procedure code prediction when exploiting the diagnosis codes, outperforming state-of-the-art models. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

24 pages, 927 KiB  
Review
Genetic Engineering and Innovative Cultivation Strategies for Enhancing the Lutein Production in Microalgae
by Bert Coleman, Elke Vereecke, Katrijn Van Laere, Lucie Novoveska and Johan Robbens
Mar. Drugs 2024, 22(8), 329; https://doi.org/10.3390/md22080329 - 23 Jul 2024
Viewed by 553
Abstract
Carotenoids, with their diverse biological activities and potential pharmaceutical applications, have garnered significant attention as essential nutraceuticals. Microalgae, as natural producers of these bioactive compounds, offer a promising avenue for sustainable and cost-effective carotenoid production. Despite the ability to cultivate microalgae for its [...] Read more.
Carotenoids, with their diverse biological activities and potential pharmaceutical applications, have garnered significant attention as essential nutraceuticals. Microalgae, as natural producers of these bioactive compounds, offer a promising avenue for sustainable and cost-effective carotenoid production. Despite the ability to cultivate microalgae for its high-value carotenoids with health benefits, only astaxanthin and β-carotene are produced on a commercial scale by Haematococcus pluvialis and Dunaliella salina, respectively. This review explores recent advancements in genetic engineering and cultivation strategies to enhance the production of lutein by microalgae. Techniques such as random mutagenesis, genetic engineering, including CRISPR technology and multi-omics approaches, are discussed in detail for their impact on improving lutein production. Innovative cultivation strategies are compared, highlighting their advantages and challenges. The paper concludes by identifying future research directions, challenges, and proposing strategies for the continued advancement of cost-effective and genetically engineered microalgal carotenoids for pharmaceutical applications. Full article
(This article belongs to the Special Issue Algal Cultivation for Obtaining High-Value Products)
Show Figures

Graphical abstract

7 pages, 967 KiB  
Brief Report
Detecting Intrathoracic Airway Closure during Prehospital Cardiopulmonary Resuscitation Using Quasi-Static Pressure–Volume Curves: A Pilot Study
by Maxim Vanwulpen, Arthur Bouillon, Ruben Cornelis, Bert Dessers and Saïd Hachimi-Idrissi
J. Clin. Med. 2024, 13(14), 4274; https://doi.org/10.3390/jcm13144274 - 22 Jul 2024
Viewed by 402
Abstract
Background: Intrathoracic airway closure frequently occurs during cardiac arrest, possibly impairing ventilation. Previously, capnogram analysis was used to detect this pathophysiological process. In other populations, quasi-static pressure–volume curves obtained during constant low-flow inflations are routinely used to detect intrathoracic airway closure. This study [...] Read more.
Background: Intrathoracic airway closure frequently occurs during cardiac arrest, possibly impairing ventilation. Previously, capnogram analysis was used to detect this pathophysiological process. In other populations, quasi-static pressure–volume curves obtained during constant low-flow inflations are routinely used to detect intrathoracic airway closure. This study reports the first use of quasi-static pressure–volume curves to detect intrathoracic airway closure during prehospital cardiopulmonary resuscitation. Methods: Connecting a pressure and flow sensor to the endotracheal tube enabled the performance of low-flow inflations during cardiopulmonary resuscitation using a manual resuscitator. Users connected the device following intubation and performed a low-flow inflation during the next rhythm analysis when chest compressions were interrupted. Determining the lower inflection point on the resulting pressure–volume curves allowed for the detection and quantification of intrathoracic airway closure. Results: The research device was used during the prehospital treatment of ten cardiac arrest patients. A lower inflection point indicating intrathoracic airway closure was detected in all patients. During cardiac arrest, the median pressure at which the lower inflection point occurred was 5.56 cmH20 (IQR 4.80, 8.23 cmH20). This value varied considerably between cases and was lower in patients who achieved return of spontaneous circulation. Conclusions: In this pilot study, quasi-static pressure–volume curves were obtained during prehospital cardiopulmonary resuscitation. Intrathoracic airway closure was detected in all patients. Further research is needed to determine whether the use of ventilation strategies to counter intrathoracic airway closure could lead to improved outcomes and if the degree of airway closure could serve as a prognostic factor. Full article
(This article belongs to the Special Issue Cardiac Arrest in Intensive Care: Management and Prognosis)
Show Figures

Figure 1

23 pages, 4933 KiB  
Article
Advanced Multimodal Sentiment Analysis with Enhanced Contextual Fusion and Robustness (AMSA-ECFR): Symmetry in Feature Integration and Data Alignment
by Qing Chen, Shenghong Dong and Pengming Wang
Symmetry 2024, 16(7), 934; https://doi.org/10.3390/sym16070934 - 22 Jul 2024
Viewed by 602
Abstract
Multimodal sentiment analysis, a significant challenge in artificial intelligence, necessitates the integration of various data modalities for accurate human emotion interpretation. This study introduces the Advanced Multimodal Sentiment Analysis with Enhanced Contextual Fusion and Robustness (AMSA-ECFR) framework, addressing the critical challenge of data [...] Read more.
Multimodal sentiment analysis, a significant challenge in artificial intelligence, necessitates the integration of various data modalities for accurate human emotion interpretation. This study introduces the Advanced Multimodal Sentiment Analysis with Enhanced Contextual Fusion and Robustness (AMSA-ECFR) framework, addressing the critical challenge of data sparsity in multimodal sentiment analysis. The main components of the proposed approach include a Transformer-based model employing BERT for deep semantic analysis of textual data, coupled with a Long Short-Term Memory (LSTM) network for encoding temporal acoustic features. Innovations in AMSA-ECFR encompass advanced feature encoding for temporal dynamics and an adaptive attention-based model for efficient cross-modal integration, achieving symmetry in the fusion and alignment of asynchronous multimodal data streams. Additionally, the framework employs generative models for intelligent approximation of missing features. It ensures robust alignment of high-level features with multimodal data context, effectively tackling issues of incomplete or noisy inputs. In simulation studies, the AMSA-ECFR model demonstrated superior performance against existing approaches. The symmetrical approach to feature integration and data alignment contributed significantly to the model’s robustness and precision. In simulations, the AMSA-ECFR model demonstrated a 10% higher accuracy and a 15% lower mean absolute error than the current best multimodal sentiment analysis frameworks. Full article
Show Figures

Figure 1

30 pages, 12265 KiB  
Article
Toward Robust Arabic AI-Generated Text Detection: Tackling Diacritics Challenges
by Hamed Alshammari and Khaled Elleithy
Information 2024, 15(7), 419; https://doi.org/10.3390/info15070419 - 19 Jul 2024
Viewed by 431
Abstract
Current AI detection systems often struggle to distinguish between Arabic human-written text (HWT) and AI-generated text (AIGT) due to the small marks present above and below the Arabic text called diacritics. This study introduces robust Arabic text detection models using Transformer-based pre-trained models, [...] Read more.
Current AI detection systems often struggle to distinguish between Arabic human-written text (HWT) and AI-generated text (AIGT) due to the small marks present above and below the Arabic text called diacritics. This study introduces robust Arabic text detection models using Transformer-based pre-trained models, specifically AraELECTRA, AraBERT, XLM-R, and mBERT. Our primary goal is to detect AIGTs in essays and overcome the challenges posed by the diacritics that usually appear in Arabic religious texts. We created several novel datasets with diacritized and non-diacritized texts comprising up to 9666 HWT and AIGT training examples. We aimed to assess the robustness and effectiveness of the detection models on out-of-domain (OOD) datasets to assess their generalizability. Our detection models trained on diacritized examples achieved up to 98.4% accuracy compared to GPTZero’s 62.7% on the AIRABIC benchmark dataset. Our experiments reveal that, while including diacritics in training enhances the recognition of the diacritized HWTs, duplicating examples with and without diacritics is inefficient despite the high accuracy achieved. Applying a dediacritization filter during evaluation significantly improved model performance, achieving optimal performance compared to both GPTZero and the detection models trained on diacritized examples but evaluated without dediacritization. Although our focus was on Arabic due to its writing challenges, our detector architecture is adaptable to any language. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

12 pages, 1800 KiB  
Article
Research on Public Service Request Text Classification Based on BERT-BiLSTM-CNN Feature Fusion
by Yunpeng Xiong, Guolian Chen and Junkuo Cao
Appl. Sci. 2024, 14(14), 6282; https://doi.org/10.3390/app14146282 - 18 Jul 2024
Viewed by 399
Abstract
Convolutional neural networks (CNNs) face challenges in capturing long-distance text correlations, and Bidirectional Long Short-Term Memory (BiLSTM) networks exhibit limited feature extraction capabilities for text classification of public service requests. To address the abovementioned problems, this work utilizes an ensemble learning approach to [...] Read more.
Convolutional neural networks (CNNs) face challenges in capturing long-distance text correlations, and Bidirectional Long Short-Term Memory (BiLSTM) networks exhibit limited feature extraction capabilities for text classification of public service requests. To address the abovementioned problems, this work utilizes an ensemble learning approach to integrate model elements efficiently. This study presents a method for classifying public service request text using a hybrid neural network model called BERT-BiLSTM-CNN. First, BERT (Bidirectional Encoder Representations from Transformers) is used for preprocessing to obtain text vector representations. Then, context and process sequence information are captured through BiLSTM. Next, local features in the text are captured through CNN. Finally, classification results are obtained through Softmax. Through comparative analysis, the method of fusing these three models is superior to other hybrid neural network model architectures in multiple classification tasks. It has a significant effect on public service request text classification. Full article
Show Figures

Figure 1

Back to TopTop