Applied Computer Systems Feed

Hierarchical Text Classification: Fine-tuned GPT-2 vs BERT-BiLSTM

Sat, 15 Mar 2025 00:00:00 GMT

Hierarchical Text Classification (HTC) is a specialised task in natural language processing that involves categorising text into a hierarchical structure of classes. This approach is particularly valuable in several domains, such as document organisation, sentiment analysis, and information retrieval, where classification schemas naturally form hierarchical structures. In this paper, we propose and compare two deep learning-based models for HTC. The first model involves fine-tuning GPT-2, a large language model (LLM), specifically for hierarchical classification tasks. Fine-tuning adapts GPT-2’s extensive pre-trained knowledge to the nuances of hierarchical classification. The second model leverages BERT for text preprocessing and encoding, followed by a BiLSTM layer for the classification process. Experimental results demonstrate that the fine-tuned GPT-2 model significantly outperforms the BERT-BiLSTM model in accuracy and F1 scores, underscoring the advantages of using advanced LLMs for hierarchical text classification.

Comparison of Language Models for English-Latvian Semantic Search

Fri, 07 Feb 2025 00:00:00 GMT

In this study, ten language models are explored and compared in an English-Latvian semantic information retrieval setting, where the indexed collection of documents is written in English while the query documents are written in Latvian. To date, no comparable research has been done for the Latvian language. A dataset of 77736 pairs of articles from Latvian and English Wikipedia was created, transformed into embedding vectors, and used for retrieval experiments with brute force search, the Hierarchical Navigable Small World method, and the Inverted File Indexing method. The LaBSE language model achieved the best performance for short texts, while a version of Sentence-BERT and E5-large performed best for long texts.
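
The brute force search mentioned above can be illustrated with a minimal sketch. It assumes cosine similarity as the ranking score and uses tiny hand-made vectors in place of real model embeddings; the function names and the example vectors are hypothetical and are not taken from the study.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def brute_force_search(query_vec, doc_vecs, top_k=3):
    """Score the query against every indexed vector and keep the best top_k."""
    scored = [(cosine_similarity(query_vec, v), i) for i, v in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(i, score) for score, i in scored[:top_k]]

# Hypothetical 3-dimensional embeddings standing in for English documents
english_docs = [[0.9, 0.1, 0.0], [0.1, 0.8, 0.1], [0.0, 0.2, 0.9]]
# Hypothetical embedding of a Latvian query produced by the same multilingual model
latvian_query = [0.85, 0.15, 0.05]

results = brute_force_search(latvian_query, english_docs, top_k=2)
for doc_index, score in results:
    print(doc_index, round(score, 3))
```

Brute force search compares the query with every document, so it is exact but scales linearly with collection size; approximate indexes such as Hierarchical Navigable Small World or Inverted File Indexing trade some accuracy for speed.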

Monocular Depth Estimation: A Review on Hybrid Architectures, Transformers and Addressing Adverse Weather Conditions

Fri, 24 Jan 2025 00:00:00 GMT

Monocular depth estimation is one of the essential tasks in computer vision as it can provide depth information from 2D images and is extremely beneficial for applications such as autonomous driving, robot navigation, etc. Monocular depth estimation has significantly improved over the past couple of years and deep learning-based methods have surpassed traditional and machine learning-based methods. Deep learning-based methods have further been enhanced using transformer and hybrid approaches. This paper first discusses the sensors used for depth estimation and their limitations. Then, we briefly discuss the evolution of depth estimation. Then we dive into the deep learning methods including transformer and CNN-transformer hybrid methods and their limitations. Later, we discuss several methods addressing challenging weather conditions. Finally, we discuss the current trends, challenges and future directions of the transformer and hybrid methods.

Peering into the Heart: A Comprehensive Exploration of Semantic Segmentation and Explainable AI on the MnMs-2 Cardiac MRI Dataset

Tue, 21 Jan 2025 00:00:00 GMT

Accurate and interpretable segmentation of medical images is crucial for computer-aided diagnosis and image-guided interventions. This study explores the integration of semantic segmentation and explainable AI techniques on the MnMs-2 Cardiac MRI dataset. We propose a segmentation model that achieves competitive dice scores (nearly 90 %) and Hausdorff distance (less than 70), demonstrating its effectiveness for cardiac MRI analysis. Furthermore, we leverage two explainable AI techniques, Grad-CAM and Feature Ablation, to visualise the regions of interest guiding the model predictions for a target class. This integration enhances interpretability, allowing us to gain insights into the model decision-making process and build trust in its predictions.
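
As a small illustration of the dice score reported above, the following sketch computes the standard Dice coefficient, 2|A∩B| / (|A| + |B|), for two binary masks. The masks are flattened, made-up examples, and the function name is hypothetical; this is not the evaluation code of the study.

```python
def dice_score(pred, target):
    """Dice coefficient for two flattened binary masks of equal length."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    if total == 0:
        # Both masks are empty: treat this as perfect agreement (a common convention)
        return 1.0
    return 2.0 * intersection / total

# Made-up masks: 1 marks a pixel labelled as the target class
predicted_mask = [1, 1, 0, 0, 1]
reference_mask = [1, 0, 0, 0, 1]

dice = dice_score(predicted_mask, reference_mask)
print(dice)
```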

Method for Creating Domain-Specific Dataset Ontologies from Text in Uncontrolled English

Tue, 21 Jan 2025 00:00:00 GMT

Automated understanding of activities in enterprises is challenging due to a lack of domain specifications and a lack of domain ontologies. The goal of this research is to develop a method to extract elements of domain-specific processes from textual documents in unstructured English and form domain dataset ontologies. In order to achieve the goal, the related work on discourse analysis and business process modelling has been considered. The prominent technologies for implementation of the proposed method are machine learning, including classification algorithms and natural language processing using a large language model. The first experimental results are presented, and further research is discussed. Potentially, the method proposed can be implemented as a part of an assisting tool for system analysts and can support an analysis of the domain-specific information by providing contextual information from this and potentially related domains.

A Comparative Analysis of Automated Machine Learning Libraries for Electricity Price Forecasting

Fri, 06 Dec 2024 00:00:00 GMT

Reliable and accurate electricity price forecasting algorithms can be used to inform efficient energy consumption schedules and maximise profits for electricity traders. Operating within Ireland’s Integrated Single Electricity Market (I-SEM), traders can buy and sell electricity at fluctuating hourly rates whose day-ahead prices are published at approximately 13:00 GMT day-1. Access to electricity price predictions earlier than this publication time allows stakeholders an expanded timeframe to facilitate energy cost-aware scheduling.

While many studies have been conducted to espouse various machine learning and statistical approaches to electricity price forecasting, these models tend to be bespoke and require in-depth knowledge regarding model implementation. The problem of requiring such expertise is not unique to time series forecasting, and research into mitigating such limitations exists in the form of Automated Machine Learning (AutoML). AutoML aims to derive effective models while automating various steps typically required for machine learning experimentation, such as pre-processing, model selection, validation, etc.

Given the increasing proliferation of AutoML tools and frameworks, this paper applies eight Python-based AutoML libraries to day-ahead electricity price forecasting on an excerpt of I-SEM data. These libraries are compared across a series of error metrics and training times to produce an empirical benchmark that can be utilised to select high-performing AutoML tools for further price forecasting research and other forms of time series forecasting. AutoKeras is found to produce accurate forecasts but requires careful configuration to avoid long runtimes. PyCaret, Ludwig, FLAML and FEDOT also generate favourable results while being significantly easier to configure.

Age Prediction from Facial Images Using Deep Learning Architecture

Fri, 06 Dec 2024 00:00:00 GMT

Predicting age and gender through images is a common computer vision problem with many practical applications. However, this problem faces many difficulties because a person’s age can be affected by genetics, living environment, diet, health, gender, and other factors. Therefore, the accuracy of the prediction model may decrease due to the enormous diversity and variability in the data. In this study, we use three models, Unet, MobileNets, and EfficientNets, to test the performance of predicting a person’s age and gender from their photos. In addition, we adjust the learning rate parameter to find optimal performance. The best results for gender prediction are achieved by the Unet model, with the highest accuracy of 97.22 %, while the MobileNets model gives the best age prediction results with MAE = 2.248; a learning rate of 0.001 gives optimal performance for the models in our study.

Detection of Arabic and Algerian Fake News

Fri, 06 Dec 2024 00:00:00 GMT

In an era characterised by the rapid dissemination of information through digital platforms, the proliferation of fake news has emerged as a pressing global concern. Misinformation, deliberately fabricated or misleading content presented as factual news, poses significant threats to public discourse, trust, and decision-making processes. The research highlights the significance of fake news detection in the Arabic language, with a specific focus on the Algerian dialect. The Arabic language exhibits great diversity and complexity, making the detection of false information all the more crucial. The rapid spread of fake news through social media platforms has a significant impact on individuals and society as a whole. To address this challenge, this paper presents TruthGuardian, an innovative solution that combines machine learning and deep learning techniques with a voting system for the final decision. This solution enables fast and accurate identification of fake news in the Arabic language, with emphasis on the Algerian dialect. It provides reliable and effective results in countering misinformation.

Low-Cost Embedded System Design for Smart Home Automation Using RF Technology

Fri, 06 Dec 2024 00:00:00 GMT

Applications for smart home technologies can be described as the adaptation of control systems used in industrial areas to people’s modern living environments. Home automation systems are the systematic and seamless adaptation of smart home technologies to the individual’s personal needs and desires. Smart homes encompass a range of technologies designed to cater to the needs of residents, simplifying their daily routines and contributing to a more comfortable and secure way of life. Nowadays, most buildings are constructed with expensive, built-in smart home technology. However, when it comes to old buildings without smart home technology, integrating this technology into those structures can be challenging due to the requirement for intrusive overhead wall modifications necessary for installing the technological infrastructure. In this study, a radio frequency based low-cost embedded system is designed to bring intelligent technology to existing homes or old buildings. Various sub-units have been designed and established to be situated in different places of the house. Physical events and situations that occur in the house are sensed by the sub-units and transmitted to the main control unit by means of a specially developed communication protocol via radio frequency signals. This removes the cable clutter and creates a more flexible and easy installation environment.

Unveiling Trends of Chatbot and Conversational Agents: A Bibliometric Study

Fri, 06 Dec 2024 00:00:00 GMT

Recent years have seen remarkable growth and diversification in the study of chatbots and conversational agents. This research employs bibliometric and network-analytical methodology to thoroughly investigate the latest trends and themes in chatbot technology, a topic that has gained prominence in contemporary research discourse. The primary aim of this paper is to examine the evolution and prevailing trends of the chatbot field and to provide an extensive overview of it. Using the Web of Science core collection database, this study evaluates articles published from 1980 to 2024 by scanning over 7327 journal articles, ultimately focusing on 2622 key articles from prominent journals, institutions, and authors in the field. Key findings indicate a consistent recent increase in the publication count related to chatbots. The study also distinguishes critical areas such as advancements in artificial intelligence, machine learning, and natural language processing and underscores the importance of quantitatively assessing their impact and applications in diverse areas. Additionally, it sheds light on the collaboration among researchers, institutions, and nations in the development of this field. Furthermore, an analysis of written abstracts indicates a concentrated effort on enhancing user interactions and the technological progression of chatbots. The findings of this study provide insight into various sectors related to the development of chatbot technology in digital communication and AI advancement. Therefore, this bibliometric analysis offers a unique and in-depth view of the evolving chatbot research landscape, serving as a valuable guide for future research and strategic planning in this rapidly advancing area.

Alzheimer’s Disease Detection: A Comparative Study of Machine Learning Models and Multilayer Perceptron

Thu, 15 Aug 2024 00:00:00 GMT

The intersection of Artificial Intelligence (AI) and medical science has shown great promise in recent years for addressing complex medical challenges, including the early detection of Alzheimer’s disease (AD). Alzheimer’s disease presents a significant challenge in healthcare, and despite advancements in medical science, a cure has yet to be found. Early detection and accurate prediction of AD progression are crucial for improving patient outcomes. This study comprehensively evaluates four Machine Learning (ML) models and one Perceptron Model for early detection of AD using the Open Access Series of Imaging Studies (OASIS) dataset. The evaluated models include Logistic Regression, Random Forest, XGBoost, CatBoost, and a Multi-layer Perceptron (MLP). This study assesses the performance of each model on metrics such as accuracy, precision, recall, and AUC ROC. The MLP model emerges as the top performer, achieving an impressive accuracy of 95 %, highlighting its efficacy in accurately predicting AD status based on biomarker indicators. While other models, such as Logistic Regression (85 %), Random Forest (87 %), XGBoost (83 %), and CatBoost (89 %), demonstrate considerable accuracy, they are outperformed by the MLP model.

ANN Approach for SCARA Robot Inverse Kinematics Solutions with Diverse Datasets and Optimisers

Thu, 15 Aug 2024 00:00:00 GMT

In the pursuit of enhancing the efficiency of the inverse kinematics of SCARA robots with four degrees of freedom (4-DoF), this research delves into an approach centered on the application of Artificial Neural Networks (ANNs) to optimise and, hence, solve the inverse kinematics problem. While analytical methods hold considerable importance, tackling the inverse kinematics for manipulator robots, like the SCARA robots, can pose challenges due to their inherent complexity and computational intensity. The main goal of the present paper is to develop efficient ANN-based solutions of the inverse kinematics that minimise the Mean Squared Error (MSE) in the 4-DoF SCARA robot inverse kinematics. Employing three distinct training algorithms – Levenberg-Marquardt (LM), Bayesian Regularization (BR), and Scaled Conjugate Gradient (SCG) – and three generated datasets, we fine-tune the ANN performance. Utilising diverse datasets featuring fixed step size, random step size, and sinusoidal trajectories allows for a comprehensive evaluation of the ANN adaptability to various operational scenarios during the training process. The utilisation of ANNs to optimise inverse kinematics offers notable advantages, such as heightened computational efficiency and precision, rendering them a compelling choice for real-time control and planning tasks. Through a comparative analysis of different training algorithms and datasets, our study yields valuable insights into the selection of the most effective training configurations for the optimisation of the inverse kinematics of the SCARA robot. Our research outcomes underscore the potential of ANNs as a viable means to enhance the efficiency of SCARA robot control systems, particularly when conventional analytical methods encounter limitations.
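
To make the dataset generation and the MSE criterion more concrete, the sketch below builds a small fixed-step dataset of (pose, joint values) pairs from the forward kinematics of a generic 4-DoF SCARA arm. The link lengths, joint ranges, and sign conventions are hypothetical assumptions for illustration and are not taken from the paper; an ANN would then be trained to map each pose back to its joint values.

```python
import math

# Hypothetical link lengths in metres (not from the paper)
L1, L2 = 0.25, 0.15

def forward_kinematics(theta1, theta2, d3, theta4):
    """Pose (x, y, z, phi) of a generic SCARA arm: two revolute joints,
    one prismatic joint d3 along z, and one revolute wrist joint."""
    x = L1 * math.cos(theta1) + L2 * math.cos(theta1 + theta2)
    y = L1 * math.sin(theta1) + L2 * math.sin(theta1 + theta2)
    z = -d3                        # assumed convention: extension moves the tool down
    phi = theta1 + theta2 + theta4 # tool orientation about the z axis
    return (x, y, z, phi)

def fixed_step_dataset(steps=5):
    """Sweep theta1 and theta2 with a fixed step; each sample pairs a pose
    (network input) with the joint values that produced it (network target)."""
    samples = []
    for i in range(steps):
        for j in range(steps):
            theta1 = -math.pi / 2 + i * math.pi / (steps - 1)
            theta2 = 0.1 + j * (math.pi / 2 - 0.1) / (steps - 1)
            joints = (theta1, theta2, 0.05, 0.0)
            samples.append((forward_kinematics(*joints), joints))
    return samples

def mean_squared_error(predicted, target):
    """MSE between a predicted and a reference joint vector."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)

dataset = fixed_step_dataset()
print(len(dataset), dataset[0])
```

A random-step or sinusoidal dataset could be produced in the same way by changing how theta1 and theta2 are sampled.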

Determination of Ataxia with EfficientNet Models in Person with Early MS using Plantar Pressure Distribution Signals

Thu, 15 Aug 2024 00:00:00 GMT

Multiple Sclerosis (MS) is a central nervous system disease that causes ataxia and balance disorders. In ataxia, the first symptom is usually seen as gait disturbance. In gait ataxia, symptoms can be clinically defined by shortened stride length and irregular strides. Evaluation of gait disturbance in clinical cases is important for the detection of the first stage of ataxia. With the increasing amount of data, high-performance models can be produced, especially in the field of healthcare, with machine learning, deep learning and artificial intelligence methods. This study aimed to identify ataxia in persons with Multiple Sclerosis (PwMS) by analysing images that encompass plantar pressure distribution signals. A total of 105 images, each containing plantar pressure distribution signals, were utilized to extract features through pre-trained EfficientNet architectures. Then the feature vectors obtained were classified by SVM, k-NN, and ANN methods. As a result of this study, the best classification performance was obtained with the SVM classifier, with 88.09 % Acc, 80.55 % Sen, 93.75 % Spe and 85.29 % F1 Score. The results show that the study will help the clinician in the detection of PwMS ataxia and will be a pioneer for future studies.
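
The four reported measures (Acc, Sen, Spe, F1 Score) follow from a binary confusion matrix. The sketch below uses the standard definitions; the counts are made up for illustration and are not the confusion matrix of the study.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard binary classification measures from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)   # recall on the positive (ataxia) class
    specificity = tn / (tn + fp)   # recall on the negative class
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {"acc": accuracy, "sen": sensitivity, "spe": specificity, "f1": f1}

# Illustrative counts only (hypothetical, not from the paper)
metrics = classification_metrics(tp=8, fp=1, tn=9, fn=2)
for name, value in metrics.items():
    print(name, round(100 * value, 2))
```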

Definition of a Set of Use Case Patterns for Application Systems: A Prototype-Supported Development Approach

Thu, 15 Aug 2024 00:00:00 GMT

UML diagrams are a basis for planning development in most software projects. They are used for representing different artefacts during software development and project structure. The use case diagram is one of the diagrams in the Unified Modelling Language (UML), and it allows describing the dynamic flow of the system. There are many tools for creating this diagram before starting the actual coding process, and the diagram needs to be specific and easily understandable. Meanwhile, creating a UML use case diagram from scratch for a complex system can be time-consuming and confusing, and this process needs to be optimised. The authors of the paper attempt to solve this problem. Therefore, in this research paper a new definition for UML use case diagrams is introduced, where the main question is whether it is possible to formalise use case modelling by introducing pre-defined use case patterns. This is academic research and discussion based on an analysis of the use case diagram templates that advanced UML tools contain. The solution to this research question contains an initial set of UML use case patterns, created by analysing the existing use case diagram templates. Moreover, in order to validate the work, the pre-defined patterns were demonstrated on a developed prototype. The operation principle of the prototype focuses on giving the user the ability to construct a use case diagram by combining pre-defined patterns. If implemented correctly, the prototype can be useful for the development/management process. It will reduce the time spent on use case diagram creation and help avoid creating anti-patterns.

Analysing the Analysers: An Investigation of Source Code Analysis Tools

Thu, 15 Aug 2024 00:00:00 GMT

The primary expectation from a software system revolves around its functionality. However, as the software development process advances, equal emphasis is placed on the quality of the software system for non-functional attributes like maintainability and performance. Tools are available to aid in this endeavour, assessing the quality of a software system from multiple perspectives.

This study aims to perform a comprehensive analysis of a particular set of source code analysis tools by examining diverse perspectives found in the literature and documentation. Given the vast array of programming languages available today, selecting appropriate source code analysis tools presents a significant challenge. Therefore, this analysis aims to provide general insights to aid in selecting a more suitable analysis tool tailored to specific requirements.

Seven prominent static analysis tools, namely SonarQube, Coverity, CodeSonar, Snyk Code, ESLint, Klocwork, and PMD, were chosen based on their prevalence in the literature and recognition in the software development community. To systematically categorise and organise their distinctive features and capabilities, a taxonomy was developed. This taxonomy covers crucial dimensions, including input support, technology employed, extensibility, user experience, rules, configurability, and supported languages.

The comparative analysis highlights the distinctive strengths of each tool. SonarQube stands out as a comprehensive solution with a hybrid approach supporting static and dynamic code evaluations, accommodating multiple languages and integrating with popular Integrated Development Environments (IDEs). Coverity excels in identifying security vulnerabilities and defects, making it an excellent choice for security-focused development. CodeSonar prioritises code security and safety, offering a robust analysis. Snyk Code and ESLint, focusing on JavaScript, emphasise code quality and standards adherence. Klocwork is exceptional in defect detection and security analysis for C, C++, and Java. Lastly, PMD specialises in Java, emphasising code style and best practices.

Generative Artificial Intelligence Use in Optimising Software Engineering Process: A Systematic Literature Review

Thu, 15 Aug 2024 00:00:00 GMT

Generative AI is only a few years old but is already being applied in Software Engineering (SE). This literature review examines the most popular SE sub-fields in which it is applied and the research methods that are typically used. In total, 117 studies published from 2020 onwards have been assessed, and the literature review has shown that the most active research is ongoing in the code generation area. The research method is often not clearly defined by researchers, but the majority of the methods can be assumed to be experiments. It is concluded that researchers often do not define the research method used, with exceptions such as literature reviews or opinion surveys. However, different validation methods are highly valued and applied thoroughly.

Knowledge Elicitation Using the Delphi Technique in Developing Diagnosis Systems

Thu, 15 Aug 2024 00:00:00 GMT

Knowledge elicitation is important in designing knowledge-based diagnosis systems. Various approaches such as interviews and questionnaires have been used to elicit knowledge from experts. These approaches elicit knowledge from individual experts separately. Medical practitioners have diverse knowledge and experience in the diagnosis and management of a particular disease. A major challenge is in producing a harmonised diagnosis from different practitioners, which will inform the level of agreement among them on the treatment of Sickle Cell Disease (SCD). Therefore, it is important to elicit and integrate knowledge from different medical practitioners in developing an effective diagnosis system. Thus, the Delphi technique was employed in this study to elicit domain knowledge in developing SCD diagnosis systems in African Traditional Medicine (ATM), since there is no gold standard for achieving diagnosis in ATM. A kappa value of 0.487 was achieved. This implies that the herb sellers moderately agree on the ranking of the SCD symptoms. Therefore, to build an effective SCD diagnosis system, further work should be done by conducting more Delphi rounds to ensure that a high level of consensus is reached. The Delphi technique used in this study helped in the area of requirement elicitation of SCD diagnosis in ATM, which could be used in the development of an SCD diagnosis system.

An Immense Approach of High Order Fuzzy Time Series Forecasting of Household Consumption Expenditures with High Precision

Thu, 15 Aug 2024 00:00:00 GMT

Fuzzy Time Series (FTS) models are experiencing an increase in popularity due to their effectiveness in forecasting and modelling diverse and intricate time series data sets. Essentially, these models use membership functions and fuzzy logic relation functions to produce predicted outputs through a defuzzification process. In this study, we suggest using a Second Order Type-1 FTS (S-O T-1 FTS) forecasting model for the analysis of time series data sets. The suggested method was compared to the state-of-the-art First Order Type 1 FTS method. The suggested approach demonstrated superior performance compared to the First Order Type 1 FTS method when applied to household consumption data from the Magene Regency in Indonesia, as measured by absolute percentage error rate (APER).
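
The idea of a second-order fuzzy time series can be sketched in a heavily simplified form. The code below assumes equal-width intervals with crisp assignment instead of membership functions, builds second-order relations of the form (A_i, A_j) -> A_k, and defuzzifies by averaging interval midpoints. It also assumes that APER is the mean absolute error relative to the actual value, expressed in percent. The series and all function names are hypothetical; this is not the model of the study.

```python
from collections import defaultdict

def fuzzify(value, lower, width, n_intervals):
    """Index of the equal-width interval containing the value (crisp assignment)."""
    idx = int((value - lower) // width)
    return min(max(idx, 0), n_intervals - 1)

def second_order_forecast(series, lower, upper, n_intervals):
    """In-sample forecasts from second-order relations (A_i, A_j) -> A_k."""
    width = (upper - lower) / n_intervals
    midpoints = [lower + (i + 0.5) * width for i in range(n_intervals)]
    labels = [fuzzify(v, lower, width, n_intervals) for v in series]

    # Group the observed next states by the two preceding fuzzy states
    groups = defaultdict(set)
    for t in range(2, len(labels)):
        groups[(labels[t - 2], labels[t - 1])].add(labels[t])

    # Defuzzify: average the midpoints of all states reachable from the pair
    forecasts = []
    for t in range(2, len(series)):
        next_states = groups[(labels[t - 2], labels[t - 1])]
        forecasts.append(sum(midpoints[k] for k in next_states) / len(next_states))
    return forecasts

def aper(actual, forecast):
    """Assumed definition: mean absolute percentage error, in percent."""
    return 100.0 * sum(abs(a - f) / a for a, f in zip(actual, forecast)) / len(actual)

# Made-up series, not the household consumption data from the study
series = [10, 20, 30, 20, 30, 40]
forecasts = second_order_forecast(series, lower=0, upper=50, n_intervals=5)
print(forecasts, aper(series[2:], forecasts))
```

A first-order variant would key the groups on a single preceding state; using two preceding states lets the relation distinguish histories that end in the same state.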

Analysis of the Compressed Video with HEVC under Optical Link Transmission

Thu, 15 Aug 2024 00:00:00 GMT

We study the feasibility of video transmission over optical fibre to optimise bandwidth with the implementation of HEVC codec features. We use simulation (Matlab and the OptiSystem software). Different values of the CRF are used to evaluate its impact on the visual quality and the size of the encoded file, as well as its influence on the video transmission performance. The simulation results show that by adjusting the CRF, the encoders can optimise the compression of the video data to reduce the file size while preserving an acceptable visual quality. This makes it possible to adapt the transmission to the bandwidth constraints of the optical fibre, by choosing higher CRF values to reduce the size of the files and save bandwidth, or lower values to maintain optimal quality when the bandwidth is sufficient. In addition, from the optical fibre point of view, the dispersion weakens and the eye opens, and it is observed that the length of the fibres is inversely proportional to the signal transmission quality. Thus, the judicious use of different CRF values can contribute to efficient and high-quality video transmission via optical fibre.

Adaptation of the Automotive Product Development Process for AI Development

Thu, 15 Aug 2024 00:00:00 GMT

Artificial Intelligence (AI) functionalities are increasingly being used in vehicle applications. While current product development models take the increasing proportion of software into account, the special requirements of artificial intelligence developments are hardly ever explicitly considered. The new requirements result both from increasing standardisation and regulation and from the iterative and explorative approach inherent in AI model development. This paper identifies the key adaptations to the standard automotive product development process that are required to cover the requirements of AI development. The adapted development model was trialled in two vehicle developments, the most important lessons learnt of which are summarised in this paper.