Data Augmentation and Class Imbalance Compensation Using CTGAN to Improve Gas Detection Systems

Research output: Contribution to Book/Report typesContribution to conference proceedingspeer-review

5 Citations (Scopus)

Abstract

The use of sensors in gas detection systems for environmental monitoring is largely affected by sensor drift over time which reduces accurate classification. This drift can be minimized by using machine learning models trained on sensor data. Here, two different machine learning models are trained on the Gas Sensor Array Drift Dataset. However, this dataset, which has been collected over three years, suffers not only from drift but also from class imbalance. As a result, machine learning models cannot perform properly on this dataset. To address these problems, this paper introduces an innovative methodology for data compensation and augmentation using Conditional Tabular Generative Adversarial Networks (CTGAN). By employing this methodology, we can counteract the class imbalance and limit drift by bringing diversity to the dataset, which in turn improves the accuracy of machine learning models for gas detection systems. With class imbalance compensation, Multi-Layer Perceptron (MLP) and Support Vector Machines (SVM) achieved an improvement in classification accuracy in five batches, up to 20% for certain batches. Through data augmentation, they reached higher accuracy across six batches, with certain batches exceeding a 10% improvement. These achievements highlight the effectiveness and reliability of the use of synthetic data generation in tabular data for sensors.

Original languageEnglish
Title of host publicationI2MTC 2024 - Instrumentation and Measurement Technology Conference
Subtitle of host publicationInstrumentation and Measurement for Sustainable Future, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350380903
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Instrumentation and Measurement Technology Conference, I2MTC 2024 - Glasgow, United Kingdom
Duration: 20 May 202423 May 2024

Publication series

NameConference Record - IEEE Instrumentation and Measurement Technology Conference
ISSN (Print)1091-5281

Conference

Conference2024 IEEE International Instrumentation and Measurement Technology Conference, I2MTC 2024
Country/TerritoryUnited Kingdom
CityGlasgow
Period20/05/2423/05/24

!!!Keywords

  • CTGAN
  • Data augmentation
  • Data balancing
  • Gas detection
  • Gas sensor

Fingerprint

Dive into the research topics of 'Data Augmentation and Class Imbalance Compensation Using CTGAN to Improve Gas Detection Systems'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this