研究生: |
吳昆輝 Kun-Hui Wu |
---|---|
論文名稱: |
嵌入式裝置應用機器學習CPU 動態頻率管理 Machine Learning for CPU Dynamic frequency Management Technique on Embedded Devices |
指導教授: |
楊振雄
Chen-Hsiung Yang |
口試委員: |
陳金聖
CHIN-SHENG CHEN 吳常熙 Chang-Shi Wu 郭永麟 Yong-Lin Kuo 楊振雄 Chen-Hsiung Yang |
學位類別: |
碩士 Master |
系所名稱: |
工程學院 - 自動化及控制研究所 Graduate Institute of Automation and Control |
論文出版年: | 2018 |
畢業學年度: | 106 |
語文別: | 英文 |
論文頁數: | 88 |
中文關鍵詞: | 機器學習 、線性回歸 、電源管理 、多核心系統 |
外文關鍵詞: | Machine Learning, Power Management, Multi-Core Systems, Linear Regression |
相關次數: | 點閱:634 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
有2 個理由讓我們相信在不久的未來,使用機器學習來有效控制嵌入式裝置會變的越來越
重要。其一為近5 年Intel 處理器的發展,顯示出製程已經接近物理極限。這意味著過往透過
製程的改善所得到的能源效益已達到上限。其二為近期的能源學家指出,ICT 產業在2020 年
耗電會佔全球20%。假使能有效達到5%的省電效果,未來相當於省掉1%的耗電量。
我們透過Intel 與ARM 公開的許多資訊,利用電源特性來收集在CPU 在SPLASH2 不同
benchmark 的狀況。透過收集的這些數據,我們設計了一個適用於資料中心的分散式架構與一
個省電演算法,這個架構使用script language 完成,除了可以跨處理器更是可以跨作業糸統。
其中我們使用線性迴歸做為主要的機器學習演算法。這個演算法用來預測合適的省電時機,
在演算法的驗證部份,我們使用了史丹佛大學所研發的SPLASH2 做實際驗證,透過SPLASH2
的各種情境可以看出來,我們所設計的演算法及架構可以達到5.34%的省電效果。
There are two reasons why we believe that in the near future, the use of machine learning to
effectively control embedded devices will become more and more important.One is the development
of Intel processors in the past five years, showing that the process is close to the physical limit.This
means that the energy efficiency obtained through the improvement of the process has closed to
upper limit.Second, recent energy scientists point out that the ICT industry will consume 20% of the
world's electricity consumption by 2020.If we can achieve 5% power saving effect, it is equivalent to
saving 1% of power consumption from ICT industry.
We use the power features of Intel and ARM to collect the different benchmarks from
SPLASH2 in the CPU .Through the collection of these data, we designed a decentralized architecture
for the data center and a power-saving algorithm. This architecture is implemented using a script
language, which can be cross-processed across processors.
We use linear regression as the main machine learning algorithm.This algorithm is used to
predict the appropriate power-saving timing. In the verification part of the algorithm, we use the
SPLASH2 developed by Stanford University to do the actual verification. Through the various
scenarios of SPLASH2, we can design the algorithm and The architecture can achieve 5.34% power
saving effect.
Reference
[1] B.E. Outlook, BP Global, 2017 Edition, (2017).
[2] A.S. Andrae, T. Edler, On global electricity usage of communication technology: trends to
2030, Challenges, 6 (2015) 117-157.
[3] T. Bisson, S.A. Brandt, D.D. Long, NVCache: Increasing the effectiveness of disk
spin-down algorithms with caching, in: Modeling, Analysis, and Simulation of Computer
and Telecommunication Systems, 2006. MASCOTS 2006. 14th IEEE International
Symposium on, IEEE, 2006, pp. 422-432.
[4] T. Bostoen, S. Mullender, Y. Berbers, Power-reduction techniques for data-center storage
systems, ACM Computing Surveys (CSUR), 45 (2013) 33.
[5] D.P. Helmbold, D.D. Long, B. Sherrod, A dynamic disk spin-down technique for mobile
computing, in: Proceedings of the 2nd annual international conference on Mobile
computing and networking, ACM, 1996, pp. 130-142.
[6] A.K. Singh, C. Leech, B.K. Reddy, B.M. Al-Hashimi, G.V. Merrett, Learning-based
run-time power and energy management of multi/many-core systems: current and future
trends, Journal of Low Power Electronics, 13 (2017) 310-325.
[7] A. Cohen, F. Finkelstein, A. Mendelson, R. Ronen, D. Rudoy, On estimating optimal
performance of cpu dynamic thermal management, IEEE Computer Architecture Letters, 2
(2003) 6-6.
[8] R.R. Schaller, Moore's law: past, present and future, IEEE spectrum, 34 (1997) 52-59.
[9] A. Shah, The PC upgrade cycle slows to every five to six years, in,
https://www.pcworld.com/article/3078010/hardware/the-pc-upgrade-cycle-slows-to-every-fiv
e-to-six-years-intels-ceo-says.html.
[10] Intel Chips Timeline, in,
https://www.intel.com/content/www/us/en/history/history-intel-chips-timeline-poster.html.
[11] A.L. Shimpi, The Haswell Review, in,
https://www.anandtech.com/show/7003/the-haswell-review-intel-core-i74770k-i54560k-teste
d/5.
[12] K. Levermore, A brief history of the microchip, in,
http://dialoguereview.com/brief-history-microchip/.
[13] C. Auth, C. Allen, A. Blattner, D. Bergstrom, M. Brazier, M. Bost, M. Buehler, V.
Chikarmane, T. Ghani, T. Glassman, A 22nm high performance and low-power CMOS
technology featuring fully-depleted tri-gate transistors, self-aligned contacts and high density
MIM capacitors, in: VLSI technology (VLSIT), 2012 symposium on, IEEE, 2012, pp.
75
131-132.
[14] T.P. Morgan, CAVIUM IS TRULY A CONTENDER WITH ONE-TWO ARM SERVER
PUNCH, in,
https://www.nextplatform.com/2017/11/27/cavium-truly-contender-one-two-arm-server-punc
h/.
[15] 16.12 Release Notes, in,
https://platforms.linaro.org/documentation/Reference-Platform/Platforms/Enterprise/Release
Notes-16.12.md/.
[16] Designing Systems Without a Suspend Supply in,
https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/systems-with
out-suspend-supply-paper.pdf.
[17] S. Fischer, Technical overview of the 45nm next generation intel core microarchitecture
(penryn), in: Intel Developer Forum, 2007.
[18] F.P. Miller, A.F. Vandome, J. McBrewster, Advanced Configuration and Power Interface:
Open Standard, Operating System, Power Management, Cross-Platform, Intel Corporation,
Microsoft, Toshiba,... Sleep Mode, Hibernate (OS Feature), Synonym, (2009).
[19] D. Wu, Performance compares between C and python, in,
https://github.com/DesmondWu/dvfs.
[20] D. Katz, R. Gentile, Dynamic Power Management Optimizes Performance vs. Power in
Embedded Applications of BlackfinTM DSPs, Analog Dialogue, (2002) 36-04.
[21] M.T. Jones, Access the Linux kernel using the/proc filesystem, IBM developerWorks,
(2006).
[22] Reference Software Platform, in,
https://www.96boards.org/blog/reference-software-platform-16-06-release/.
[23] C. Lomont, Introduction to x64 assembly, Recuperado de: https://software. intel.
com/en-us/articles/introduction-to-x64-assembly Programación en Python Referencias,
(2012).
[24] ARM® Developer Suite Assembler Guide, in,
http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0068b/BABJAJIB.html.
[25] H.-C. Hsieh, K.-D. Chang, L.-F. Wang, J.-L. Chen, H.-C. Chao, ScriptIoT: A script
framework for and internet-of-things applications, IEEE Internet of Things Journal, 3 (2016)
628-636.
[26] R.S. Sutton, A.G. Barto, Reinforcement learning: An introduction, MIT press Cambridge,
1998.
[27] C.J. Watkins, P. Dayan, Q-learning, Machine learning, 8 (1992) 279-292.
76
[28] G.-Y. Pan, B.-C.C. Lai, S.-Y. Chen, J.-Y. Jou, A learning-on-cloud power management
policy for smart devices, in: Computer-Aided Design (ICCAD), 2014 IEEE/ACM
International Conference on, IEEE, 2014, pp. 376-381.
[29] J.M. Arnold, D.A. Buell, E.G. Davis, Splash 2, in: Proceedings of the fourth annual
ACM symposium on Parallel algorithms and architectures, ACM, 1992, pp. 316-322.
[30] J. Neter, W. Wasserman, M.H. Kutner, Applied linear regression models, (1989).
[31] J.C.M.F.M. Schoffstall, C. Davin, RFC 1157: Simple network management protocol
(SNMP), IETF, April, (1990).
[32] K. McCloghrie, M. Rose, RFC 1156," Management Information Base for Network
Management of TCP/IP-based internets, (1990).
[33] D. Stirling, F. Al-Ali, Zero configuration networking, Crossroads, 9 (2003) 19-23.
[34] S. Cheshire, B. Aboba, E. Guttman, Dynamic configuration of IPv4 link-local addresses
(RFC 3927), May 2005, in.
[35] D. Plummer, An Ethernet address resolution protocol (RFC 826), Network Working
Group, (1982).
[36] Nanopi-M1, in, http://www.nanopi.org/NanoPi-M1_Feature.html.
[37] Allwinner H3 Datasheet, in,
http://wiki.friendlyarm.com/wiki/index.php/File:Allwinner_H3_Datasheet_V1.2.pdf.
[38] A. Cortex, a7 mpcore technical reference manual, 2013, URL: http://infocenter. arm.
com/help/index. jsp.