Title: Machine Learning Approach-based Big Data Imputation Methods for Outdoor Air Quality forecasting
Authors: D, Narasimhan; M, Vanitha
Keywords: Air quality;Big data analytics;Classification;Ensemble;Multiple imputation
Issue Date: Mar-2023
Publisher: NIScPR-CSIR, India
Abstract: Missing data is a common problem in ambient air quality databases, and it is considerably worse in small towns and cities. It is also a significant concern for environmental epidemiology: these settings have high pollution exposure levels, and gaps in the datasets obstruct health investigations that could later inform local and international policy. When a substantial number of observations contain missing values, the effective sample size shrinks and the standard errors increase, which may significantly affect the final result. In general, the performance of missing value imputation algorithms depends on the size of the database and the percentage of missing values within it. This paper proposes and demonstrates an ensemble-imputation-classification framework that reconstructs air quality information in order to forecast air quality, using a dataset from Beijing, China. Various single and multiple imputation procedures are used to fill in the missing records. An ensemble of diverse classifiers is then applied to the imputed data to determine the air pollution level. The proposed model aims to reduce the error rate and improve accuracy. Extensive testing on datasets with actual missing values shows that the proposed methodology, which combines multiple imputation with ensemble techniques, significantly improves the accuracy of the air quality forecasting model compared with conventional single imputation techniques.
Page(s): 338-347
ISSN: 0022-4456 (Print); 0975-1084 (Online)
Appears in Collections: JSIR Vol.82(03) [March 2023]
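
The impute-then-ensemble idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the random-draw imputation, the nearest-centroid classifier, the function names, and the toy readings (two hypothetical pollutant columns with "good"/"poor" labels) are all assumptions chosen to keep the example dependency-free. Each of the m imputed copies of the data trains one classifier, and the final label is a majority vote.

```python
import random
import statistics
from collections import Counter

MISSING = None  # marker for a missing reading (assumption for this sketch)


def impute_once(rows, rng):
    """Fill each missing cell with a random draw from that column's observed values."""
    n_cols = len(rows[0])
    observed = [[r[j] for r in rows if r[j] is not MISSING] for j in range(n_cols)]
    return [
        [r[j] if r[j] is not MISSING else rng.choice(observed[j]) for j in range(n_cols)]
        for r in rows
    ]


def fit_centroids(rows, labels):
    """Nearest-centroid classifier: store one mean vector per class."""
    centroids = {}
    for label in sorted(set(labels)):
        members = [r for r, y in zip(rows, labels) if y == label]
        centroids[label] = [statistics.fmean(col) for col in zip(*members)]
    return centroids


def predict_centroid(centroids, x):
    """Return the class whose centroid is closest to x (squared Euclidean distance)."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(c, x))
    return min(centroids, key=lambda label: dist2(centroids[label]))


def fit_ensemble(rows, labels, m=5, seed=0):
    """Create m imputed copies of the data and fit one classifier on each."""
    rng = random.Random(seed)
    return [fit_centroids(impute_once(rows, rng), labels) for _ in range(m)]


def predict_ensemble(models, x):
    """Majority vote over the classifiers trained on the imputed copies."""
    votes = Counter(predict_centroid(c, x) for c in models)
    return votes.most_common(1)[0][0]


# Made-up toy readings, NOT taken from the Beijing dataset.
rows = [
    [12.0, 20.0], [15.0, MISSING], [MISSING, 18.0],     # labelled "good"
    [150.0, 80.0], [MISSING, 95.0], [170.0, MISSING],   # labelled "poor"
]
labels = ["good", "good", "good", "poor", "poor", "poor"]

models = fit_ensemble(rows, labels, m=5, seed=0)
low_reading = predict_ensemble(models, [10.0, 15.0])
high_reading = predict_ensemble(models, [160.0, 90.0])
```

A practical pipeline would more likely use a model-based multiple imputation method and several different classifier types; the sketch only shows how the imputation and voting stages fit together.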

Files in This Item:
File: JSIR 82(03) 338-347.pdf
Size: 1.7 MB
Format: Adobe PDF

Items in NOPR are protected by copyright, with all rights reserved, unless otherwise indicated.