Term of Award

Fall 2022

Degree Name

Master of Science, Computer Science (M.S.C.S.)

Document Type and Release Option

Thesis (open access)

Copyright Statement / License for Reuse

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Department

Department of Computer Sciences

Committee Chair

Weitian Tong

Committee Member 1

Lixin Li

Committee Member 2

Felix Hamza-Lup

Abstract

Forecasting time series data is a big challenge due to the temporal and multivariate dependencies in the data combined with complex geospatial relations. In this project, we present a new approach named as TSPPM25 (Transformer-based Spatiotemporal Prediction of PM2.5) for forecasting PM2.5, a key air quality indicator. It is based on the state-of-the-art Transformer neural network and various data embedding techniques. By performing attention calculations among features over time steps as well across geospatial locations, TSPPM25 mimics cognitive attention and selectively enhances essential parts of the input data while diminishing other parts. TSPPM25 can effectively capture temporal and spatial relations in tandem with multiple influencing meteorological features as well as satellite-derived Aerosol Optical Depth. Experiments demonstrate its effectiveness by comparing it with a cutting-edge ensemble deep learning model from Zhang et al. [Information Sciences 544:427–445 (2021)]. Our TSPPM25 model outperforms Zhang et al.’s model under the same experimental setting on a well-researched benchmark dataset. As Zhang et al.’s model is restricted to univariate PM2.5 prediction, our TSPPM25 model bypasses this restriction and further improves the prediction accuracy when considering more influencing meteorological features. Moreover, our TSPPM25 model can maintain high prediction accuracy over longer periods of time as compared to the Long-Short Term Memory (LSTM) and Bidirectional LSTM models (BiLSTM). Our TSPPM25 model’s spatial learning capabilities are evaluated on a novel robust California PM2.5 dataset including Aerosol Optical Depth. Our model effectively predicts PM2.5 across a multitude of stations and is better at predicting far outlier data points than the LSTM, BiLSTM, and Linear Regression models. Our TSPPM25 model is also more stable than LSTM and BiLSTM models by maintaining higher accuracy on more complex periodic PM2.5 as determined by Fourier pattern analysis.

Research Data and Supplementary Material

Yes

Cali_statistics.xlsx (19 kB)
510_Cali_1_k0.csv (3 kB)
510_Cali_2_k0.csv (3 kB)
510_Cali_3_k0.csv (3 kB)
510_Cali_4_k0.csv (3 kB)
510_Cali_5_k0.csv (3 kB)
510_Cali_6_k0.csv (3 kB)
510_Cali_7_k0.csv (3 kB)
510_Cali_8_k0.csv (3 kB)
510_Cali_9_k0.csv (3 kB)
510_Cali_10_k0.csv (3 kB)
510_Cali_11_k0.csv (3 kB)
510_Cali_13_k0.csv (3 kB)
510_Cali_12_k0.csv (3 kB)
510_Cali_15_k0.csv (3 kB)
510_Cali_16_k0.csv (3 kB)
510_Cali_17_k0.csv (3 kB)
510_Cali_18_k0.csv (3 kB)
510_Cali_20_k0.csv (3 kB)
510_Cali_21_k0.csv (3 kB)
510_Cali_22_k0.csv (3 kB)
510_Cali_23_k0.csv (3 kB)
510_Cali_25_k0.csv (3 kB)
510_Cali_26_k0.csv (3 kB)
510_Cali_27_k0.csv (3 kB)
510_Cali_30_k0.csv (3 kB)
510_Cali_31_k0.csv (3 kB)
510_Cali_32_k0.csv (3 kB)
510_Cali_33_k0.csv (3 kB)
510_Cali_34_k0.csv (3 kB)
510_Cali_36_k0.csv (3 kB)
510_Cali_37_k0.csv (3 kB)
510_Cali_38_k0.csv (3 kB)
510_Cali_39_k0.csv (3 kB)
510_Cali_43_k0.csv (3 kB)
510_Cali_44_k0.csv (3 kB)
510_Cali_45_k0.csv (3 kB)
510_Cali_47_k0.csv (3 kB)
510_Cali_52_k0.csv (3 kB)
510_Cali_53_k0.csv (3 kB)
510_Cali_54_k0.csv (3 kB)
510_Cali_58_k0.csv (3 kB)
510_Cali_60_k0.csv (3 kB)
510_Cali_61_k0.csv (3 kB)
510_Cali_63_k0.csv (3 kB)
510_Cali_64_k0.csv (3 kB)
510_Cali_65_k0.csv (3 kB)
510_Cali_66_k0.csv (3 kB)
510_Cali_70_k0.csv (3 kB)
510_Cali_71_k0.csv (3 kB)
510_Cali_72_k0.csv (3 kB)
510_Cali_73_k0.csv (3 kB)
510_Cali_74_k0.csv (3 kB)
510_Cali_76_k0.csv (3 kB)
510_Cali_77_k0.csv (3 kB)
510_Cali_78_k0.csv (3 kB)
510_Cali_88_k0.csv (3 kB)
510_Cali_96_k0.csv (3 kB)
510_Cali_97_k0.csv (3 kB)
510_Cali_100_k0.csv (3 kB)
510_Cali_102_k0.csv (3 kB)
510_Cali_105_k0.csv (3 kB)
510_Cali_106_k0.csv (3 kB)
510_Cali_107_k0.csv (3 kB)
510_Cali_110_k0.csv (3 kB)
510_Cali_111_k0.csv (3 kB)
510_Cali_112_k0.csv (3 kB)
510_Cali_113_k0.csv (3 kB)
510_Cali_114_k0.csv (4 kB)
510_Cali_1_k1.csv (12 kB)
510_Cali_2_k1.csv (6 kB)
510_Cali_4_k1.csv (6 kB)
510_Cali_6_k1.csv (6 kB)
510_Cali_8_k1.csv (6 kB)
510_Cali_8_k1.csv (6 kB)
510_Cali_10_k1.csv (6 kB)
510_Cali_12_k1.csv (6 kB)
510_Cali_13_k1.csv (6 kB)
510_Cali_16_k1.csv (6 kB)
510_Cali_17_k1.csv (6 kB)
510_Cali_18_k1.csv (6 kB)
510_Cali_19_k1.csv (6 kB)
510_Cali_20_k1.csv (6 kB)
510_Cali_23_k1.csv (6 kB)
510_Cali_25_k1.csv (6 kB)
510_Cali_27_k1.csv (6 kB)
510_Cali_34_k1.csv (6 kB)
510_Cali_30_k1.csv (6 kB)
510_Cali_36_k1.csv (6 kB)
510_Cali_39_k1.csv (6 kB)
510_Cali_43_k1.csv (6 kB)
510_Cali_44_k1.csv (6 kB)
510_Cali_52_k1.csv (6 kB)
510_Cali_53_k1.csv (6 kB)
510_Cali_58_k1.csv (6 kB)
510_Cali_61_k1.csv (6 kB)
510_Cali_64_k1.csv (6 kB)
510_Cali_66_k1.csv (6 kB)
510_Cali_74_k1.csv (6 kB)
510_Cali_76_k1.csv (6 kB)
510_Cali_88_k1.csv (6 kB)
510_Cali_96_k1.csv (6 kB)
510_Cali_100_k1.csv (6 kB)
510_Cali_105_k1.csv (6 kB)
510_Cali_110_k1.csv (6 kB)
510_Cali_111_k1.csv (6 kB)
510_Cali_114_k1.csv (6 kB)
510_Cali_1_k3.csv (12 kB)
510_Cali_3_k3.csv (12 kB)
510_Cali_2_k3.csv (12 kB)
510_Cali_3_k1.csv (6 kB)
510_Cali_5_k1.csv (6 kB)
510_Cali_7_k1.csv (6 kB)
510_Cali_9_k1.csv (6 kB)
510_Cali_11_k1.csv (6 kB)
510_Cali_15_k1.csv (6 kB)
510_Cali_4_k3.csv (12 kB)
510_Cali_4_k3.csv (12 kB)
510_Cali_5_k3.csv (12 kB)
510_Cali_6_k3.csv (12 kB)
510_Cali_7_k3.csv (12 kB)
510_Cali_8_k3.csv (12 kB)
510_Cali_9_k3.csv (12 kB)
510_Cali_10_k3.csv (12 kB)
510_Cali_11_k3.csv (12 kB)
510_Cali_12_k3.csv (12 kB)
510_Cali_13_k3.csv (12 kB)
510_Cali_15_k3.csv (12 kB)
510_Cali_16_k3.csv (12 kB)
510_Cali_17_k2.csv (9 kB)
510_Cali_18_k3.csv (12 kB)
510_Cali_19_k3.csv (12 kB)
510_Cali_20_k3.csv (12 kB)
510_Cali_23_k3.csv (12 kB)
510_Cali_25_k3.csv (12 kB)
510_Cali_26_k3.csv (12 kB)
510_Cali_27_k3.csv (12 kB)
510_Cali_17_k3.csv (12 kB)
510_Cali_30_k3.csv (12 kB)
510_Cali_36_k3.csv (12 kB)
510_Cali_39_k3.csv (12 kB)
510_Cali_43_k3.csv (12 kB)
510_Cali_44_k2.csv (9 kB)
510_Cali_44_k3.csv (12 kB)
510_Cali_52_k3.csv (12 kB)
510_Cali_53_k3.csv (12 kB)
510_Cali_58_k3.csv (12 kB)
510_Cali_61_k3.csv (12 kB)
510_Cali_64_k3.csv (12 kB)
510_Cali_64_k2.csv (9 kB)
510_Cali_66_k3.csv (12 kB)
510_Cali_74_k3.csv (12 kB)
510_Cali_76_k2.csv (9 kB)
510_Cali_76_k3.csv (12 kB)
510_Cali_88_k2.csv (9 kB)
510_Cali_88_k3.csv (12 kB)
510_Cali_96_k3.csv (12 kB)
510_Cali_105_k2.csv (9 kB)
510_Cali_105_k3.csv (12 kB)
510_Cali_110_k3.csv (12 kB)
510_Cali_111_k3.csv (12 kB)
510_Cali_114_k3.csv (12 kB)
510_Cali_1_k5.csv (18 kB)
510_Cali_2_k5.csv (18 kB)
510_Cali_3_k5.csv (18 kB)
510_Cali_4_k5.csv (18 kB)
510_Cali_5_k5.csv (18 kB)
510_Cali_6_k5.csv (18 kB)
510_Cali_7_k5.csv (18 kB)
510_Cali_8_k5.csv (18 kB)
510_Cali_9_k5.csv (18 kB)
510_Cali_10_k5.csv (18 kB)
510_Cali_11_k5.csv (18 kB)
510_Cali_12_k5.csv (18 kB)
510_Cali_13_k5.csv (18 kB)
510_Cali_15_k5.csv (18 kB)
510_Cali_16_k5.csv (18 kB)
510_Cali_17_k5.csv (18 kB)
510_Cali_18_k5.csv (18 kB)
510_Cali_19_k5.csv (18 kB)
510_Cali_20_k5.csv (18 kB)
510_Cali_23_k5.csv (18 kB)
510_Cali_25_k5.csv (18 kB)
510_Cali_26_k5.csv (18 kB)
510_Cali_27_k5.csv (18 kB)
510_Cali_30_k5.csv (18 kB)
510_Cali_36_k5.csv (18 kB)
510_Cali_39_k5.csv (18 kB)
510_Cali_43_k5.csv (18 kB)
510_Cali_44_k5.csv (18 kB)
510_Cali_52_k5.csv (17 kB)
510_Cali_53_k5.csv (18 kB)
510_Cali_58_k5.csv (18 kB)
510_Cali_61_k5.csv (18 kB)
510_Cali_64_k5.csv (18 kB)
510_Cali_66_k5.csv (18 kB)
510_Cali_74_k5.csv (18 kB)
510_Cali_76_k5.csv (18 kB)
510_Cali_88_k5.csv (18 kB)
510_Cali_96_k5.csv (18 kB)
510_Cali_96_k5.csv (18 kB)
510_Cali_105_k5.csv (18 kB)
510_Cali_110_k5.csv (18 kB)
510_Cali_111_k5.csv (18 kB)
510_Cali_114_k5.csv (18 kB)
traffic-collision-data-from-2010-to-present.csv (178034 kB)
Correlations.xlsx (63 kB)
Example_Fourier_Analysis.xlsx (111 kB)
Outlier Comparison results.txt (1 kB)
PA510_raw.xlsm (3728 kB)
PM2.5_Station_Selection.txt (1 kB)
Removed_stations.txt (3 kB)
5cities_geocoord.csv (1 kB)
5citiesKey.txt (1 kB)
Data_XuhuiClean.csv (2506 kB)
DataComparison_V3.xlsx (1054 kB)
5cities_OG_norm.csv (25323 kB)
5cities_OG_norm_knn.csv (2927 kB)
5cities_OG_norm_knn_C2.csv (2616 kB)
5cities_OG_norm_knn_imputation.csv (3147 kB)
5cities_OG_norm_MIC1.csv (4256 kB)
5cities_OG_norm_MIC2.csv (3109 kB)
5cities_OG_norm_old.csv (25323 kB)
5cities_OG_norm_reduce.csv (12344 kB)
5cities_OG_norm_reduce_noprecip.csv (11566 kB)
5cities_OG_norm_reduce_noprecip_imputation.csv (12715 kB)
5cities_OG_replication.csv (34 kB)
5cities_OG_replication.csv (34 kB)
5cities_OG_replication_multi.csv (94 kB)
5cities_OG_replication_multi_unnorm.csv (47 kB)
Reformed_5cities_OG_replication_unnorm_DEWP.csv (28 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMIlws.csv (38 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMIlws.csv (38 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMIPRES.csv (40 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMIPRESlws.csv (43 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMIPRESTEMP.csv (44 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMITEMP.csv (38 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMITEMP.csv (38 kB)
Reformed_5cities_OG_replication_unnorm_DEWPHUMITEMPlws.csv (41 kB)
Reformed_5cities_OG_replication_unnorm_DEWPPRES.csv (33 kB)
Reformed_5cities_OG_replication_unnorm_DEWPPRESlws.csv (37 kB)
Reformed_5cities_OG_replication_unnorm_DEWPPRESTEMP.csv (37 kB)
Reformed_5cities_OG_replication_unnorm_DEWPPRESTEMPlws.csv (40 kB)
Reformed_5cities_OG_replication_unnorm_DEWPTEMP.csv (31 kB)
Reformed_5cities_OG_replication_unnorm_DEWPTEMPlws.csv (34 kB)
Reformed_5cities_OG_replication_unnorm_HUMI.csv (31 kB)
Reformed_5cities_OG_replication_unnorm_HUMIlws.csv (34 kB)
Reformed_5cities_OG_replication_unnorm_HUMIPRES.csv (37 kB)
Reformed_5cities_OG_replication_unnorm_HUMIPRESTEMP.csv (40 kB)
Reformed_5cities_OG_replication_unnorm_HUMITEMP.csv (35 kB)
Reformed_5cities_OG_replication_unnorm_PRES.csv (30 kB)
Reformed_5cities_OG_replication_unnorm_PRESLWS.csv (33 kB)
Reformed_5cities_OG_replication_unnorm_PRESLWS.csv (33 kB)
Reformed_5cities_OG_replication_unnorm_PRESTEMP.csv (34 kB)
Reformed_5cities_OG_replication_unnorm_PRESTEMPlws.csv (37 kB)
Reformed_5cities_OG_replication_unnorm_TEMP.csv (28 kB)

Share

COinS