Graduate student: 劉繼遠
Chi-Yuan Liu
Thesis title: 以深度卷積自動編碼器分辨異常軌道號誌紀錄
Anomalous Railway Signaling Logs Detection with Deep Convolutional Autoencoder
Advisor: 楊振雄
Cheng-Hsiung Yang
Oral defense committee: 陳金聖
Chin-Sheng Chen
Chang-Shi Wu
Yong-Lin Kuo
Cheng-Hsiung Yang
Degree: Master's
Department: College of Engineering - Graduate Institute of Automation and Control
Year of publication: 2021
Academic year of graduation: 109
Language: Chinese
Number of pages: 83
Chinese keywords: 異常檢測, 離群值檢測, 深度學習, 集成學習, 軌道號誌
English keywords: anomaly detection, outlier detection, deep learning, ensemble learning, railway signaling
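In every test set used in the experiments, anomalous samples account for less than 0.2% of all samples. With a false positive rate of about 20%, the deep convolutional autoencoder reaches 92% accuracy. When the proportion of samples flagged as anomalous is held at about 0.3% by a rule of thumb, it reaches about 80% accuracy.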

This thesis uses a deep convolutional autoencoder (DCAE) to analyze the event logs of a railway signaling system. Working from discretized information such as train location, speed, and acceleration, the DCAE classifies anomalies in which slip/slide occurs. Several experiments are carried out to optimize the model and to determine whether the DCAE can classify anomalies with high accuracy despite the lack of detailed information such as axle speeds and the minimum and maximum bounds on acceleration.
The event logs of the railway signaling system are not labelled. The meaning of each data value is therefore explained first, followed by the conditions used to label anomalous (slip/slide) samples. Samples labelled as anomalies are removed from the training set so that the DCAE learns only from normal samples, an approach referred to here as semi-supervised learning. The data are normalized before being fed to the DCAE for training and prediction, and the principle for selecting a normalization method is also explained.
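The preparation steps described above can be sketched as follows. This is a minimal illustration, not the thesis's pipeline: the abstract does not say which normalization method was selected, so min-max scaling is an assumption here, and the function names, the label convention (0 for normal, 1 for anomalous), and the toy values are all hypothetical.

```python
def split_normal_only(samples, labels):
    """Keep only samples labelled normal (0) so the model trains on normal data."""
    return [s for s, y in zip(samples, labels) if y == 0]


def fit_min_max(rows):
    """Record the (min, max) of each feature column of the training rows."""
    return [(min(col), max(col)) for col in zip(*rows)]


def apply_min_max(rows, bounds):
    """Scale each feature to [0, 1] using bounds fitted on the training set.

    Min-max scaling is an assumed choice; constant columns map to 0.0.
    """
    scaled = []
    for row in rows:
        scaled.append([
            (v - lo) / (hi - lo) if hi > lo else 0.0
            for v, (lo, hi) in zip(row, bounds)
        ])
    return scaled


# Toy data: two features per sample, last sample labelled anomalous.
samples = [[0.0, 10.0], [5.0, 20.0], [10.0, 30.0], [50.0, 90.0]]
labels = [0, 0, 0, 1]

train = split_normal_only(samples, labels)
bounds = fit_min_max(train)
train_scaled = apply_min_max(train, bounds)
```

Fitting the bounds on the training set only, and reusing them at prediction time, keeps anomalous values from stretching the scale the model was trained on.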
The DCAE reconstructs its input based on what it has learned from the training set, so anomalies can be classified by measuring the error between the input and the output of the DCAE. Two error measures are used: the mean absolute error (MAE) and the Mahalanobis distance. The MAE is intuitive: the larger the difference between input and output, the larger the error. The Mahalanobis distance is commonly used in statistics to find anomalies (outliers) and is suitable for multivariate data; the farther a sample lies from the cluster, the larger the error.
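The two error measures can be sketched with NumPy as below. Several things here are assumptions rather than details from the thesis: the Mahalanobis distance is applied to per-feature residual vectors (input minus reconstruction), the reference cluster is estimated from residuals of normal samples, and the reconstruction `x_hat` is a noisy stand-in for the output of a trained DCAE. The function names are hypothetical.

```python
import numpy as np


def mae_scores(x, x_hat):
    """Per-sample mean absolute error between input and reconstruction."""
    x = np.asarray(x, dtype=float)
    x_hat = np.asarray(x_hat, dtype=float)
    # Average over every axis except the sample axis.
    return np.mean(np.abs(x - x_hat), axis=tuple(range(1, x.ndim)))


def mahalanobis_scores(residuals, ref_residuals):
    """Mahalanobis distance of each residual vector from the reference cluster."""
    mu = ref_residuals.mean(axis=0)
    cov = np.atleast_2d(np.cov(ref_residuals, rowvar=False))
    cov_inv = np.linalg.pinv(cov)  # pseudo-inverse tolerates a singular covariance
    d = residuals - mu
    sq = np.einsum("ij,jk,ik->i", d, cov_inv, d)
    return np.sqrt(np.maximum(sq, 0.0))  # clip tiny negatives from rounding


# Stand-in data: 200 samples, 4 features, small reconstruction noise.
rng = np.random.default_rng(0)
x = rng.random((200, 4))
x_hat = x + rng.normal(0.0, 0.01, size=x.shape)
x_hat[-1, 2] += 0.5  # one sample is reconstructed poorly

mae = mae_scores(x, x_hat)
residuals = x - x_hat
# Reference statistics come from the well-reconstructed samples only.
maha = mahalanobis_scores(residuals, residuals[:-1])
```

In this toy setup both scores are largest for the poorly reconstructed sample. The Mahalanobis score additionally accounts for how the residuals of different features vary together, which is why it suits multivariate data.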
In each test set used in the experiments, anomalies make up less than 0.2% of the samples. The DCAE achieves 92% accuracy if a false positive rate of approximately 20% is acceptable. When the share of samples flagged as anomalous is instead held at around 0.3% by an empirical rule, the accuracy of the DCAE is around 80%.
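One way to hold the share of flagged samples at a fixed level is to set the threshold at a quantile of the anomaly scores. The sketch below is only an illustration under assumptions: the thesis's actual threshold rule and performance metrics are not given in the abstract, the scores are synthetic, and the function names are hypothetical. It reports the true positive rate and false positive rate, which may differ from the "accuracy" figure quoted above.

```python
import numpy as np


def threshold_by_flag_rate(scores, flag_rate):
    """Threshold chosen so that roughly `flag_rate` of the scores exceed it."""
    return float(np.quantile(scores, 1.0 - flag_rate))


def detection_rates(scores, labels, threshold):
    """Return (true positive rate, false positive rate) at a given threshold."""
    flagged = np.asarray(scores) > threshold
    labels = np.asarray(labels).astype(bool)
    tp = int(np.sum(flagged & labels))
    fp = int(np.sum(flagged & ~labels))
    tpr = tp / max(int(labels.sum()), 1)
    fpr = fp / max(int((~labels).sum()), 1)
    return tpr, fpr


# Synthetic scores: 998 normal samples and 2 anomalies (0.2% contamination).
rng = np.random.default_rng(1)
scores = np.concatenate([rng.random(998), [5.0, 6.0]])
labels = np.concatenate([np.zeros(998, dtype=int), [1, 1]])

threshold = threshold_by_flag_rate(scores, 0.003)  # flag about 0.3% of samples
n_flagged = int(np.sum(scores > threshold))
tpr, fpr = detection_rates(scores, labels, threshold)
```

Because more samples are flagged than there are anomalies, some false positives are unavoidable at this setting; lowering the threshold trades a higher false positive rate for a better chance of catching every anomaly.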

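Abstract (Chinese)
Abstract (English)
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Preface
1.2 Literature Review
1.3 Research Motivation
1.4 Thesis Outline
Chapter 2 Railway Signaling Systems
2.1 Automatic Train Control and the CBTC System in the IEEE Standard
2.2 Signaling System Architecture
2.3 Automatic Train Protection
2.4 Automatic Train Supervision (ATS) and Automatic Train Operation (ATO)
2.5 Track Network Diagram Representation
Chapter 3 Deep Learning Networks
3.1 Autoencoder
3.2 Deep Convolutional Autoencoder
3.3 Identifying Anomalous Samples
3.3.1 Mean Absolute Error
3.3.2 Mahalanobis Distance
3.3.3 Determining the Threshold
Chapter 4 Event Log Characteristics and Preprocessing
4.1 Event Log Contents
4.2 Event Preprocessing
4.3 Labeling Known Anomalous Samples
4.4 Selecting Data Samples
4.5 Converting String and Enumerated Data
4.6 Normalization
Chapter 5 Experiments and Results
5.1 Performance Metrics
5.2 Experimental Procedure and Results
5.2.1 Neural Network Capacity
5.2.2 Loss Function
5.2.3 Activation Function
5.2.4 Learning Rate
5.2.5 Batch Normalization and Dropout
Chapter 6 Conclusion
6.1 Analysis of Experimental Results
6.2 Conclusions and Future Work
References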

