吳安宇Wu, An-Yeu臺灣大學:電子工程學研究所許展誠Hsu, Chan-chengChan-chengHsu2010-07-142018-07-102010-07-142018-07-102009U0001-2107200914024300http://ntur.lib.ntu.edu.tw//handle/246246/189201本論文中,為了提高晶片內網路之可容錯性並降低其在容錯情況下的效能損失,我們提出兩種晶片內網路架構:1) 20-path router with BIST/SD/FI (20PR):內建自我測試/診斷/錯誤隔離電路的路由器設計。2) Surrounding Test Ring (STR),一個由外部對晶片內網路進行測試與診斷的架構。它們除了具有自我測試/診斷(Built-in Self-Test/Self-Diagnosis)和錯誤隔離(Fault-Isolation)的功能以外,還可以使用路由器中未損壞的部份以降低容錯情況下的效能損失,如此的架構可以讓系統運用其特性重新分配工作到無錯誤的路徑上以維持系統的正常運作。我們的實驗中,20PR內建的自我測試診斷電路可以在117個週期時間中測試完畢,而STR可在144~376個週期中測試完畢。使用20PR的晶片內網路須付出15.17%的額外硬體成本,而使用STR的則需付出8.48%~13.3%。而在效能的方面,在我們的實驗中,與傳統將整個錯誤路由器關閉的作法,需重新配置的封包在20PR中降低了75.68%~83.29%,而在STR中降低了68.33%~79.31%。而系統的延遲在20PR中降低了7.25%~24.57%,在STR中則降低了4.86%~23.6%。實驗的結果呈現出來我們提出的可容錯晶片內網路架構可以有效的減少錯誤晶片內網路的效能損失。In this thesis, to improve fault-tolerance and reduce performance degradation in faulty on-chip networks, two on-chip network (OCN) architectures are proposed: 1) 20-path router (20PR), a router embedded with Built-in Self-Test/Self-Diagnosis (BIST/BISD) and Fault-Isolation (FI) circuits. 2) Surrounding Test Ring (STR), an external test architecture which externally perform test and diagnosis of the on-chip network. They embed BIST/SD and FI circuits that detect, locate, and isolate the impacts of the faulty FIFOs and MUXs in the faulty routers. Moreover, 20PR and STR apply undamaged datapaths in faulty routers to reduce performance degradation. The operation system can remap the tasks onto undamaged datapaths the proposed architectures found to maintain system function.n our experiments, the BIST/SD of the 20PR can be executed in 117 constant test cycles and the STR can be executed in 144~376 test cycles. The overhead of the OCN using 20PRs increases 15.17%, while the OCNs with STRs increase 8.48%~13.3%. The experiments also show the performance improved over prior approaches which completely disable faulty routers. The remapped packets are reduced by 75.68%~83.29% for 20PR and 68.33%~79.31% for STR comparing to traditional approaches. The system latencies are also reduced by 7.25%~24.57% for 20PR and 4.86%~23.6% for STR comparing to traditional approaches. The experiment shows proposed fault-tolerant OCN architectures can perform graceful degradation in faulty mesh OCNs.摘要 Ibstract IIIontents Vist of Figures VIIist of Tables XIhapter 1 Introduction 1.1 Motivation & Goal 1.1.1 On-chip Communication Trends 1.1.2 Design Trends in Very Deep Submicron (VDSM) Era 4.1.3 Fault-tolerant On-chip Networks 6.1.4 Goal 8.2 Thesis Organization 11hapter 2 Review of Related Works 13.1 Fault-tolerant OCN Routing/Router Architecture 13.1.1 Fault-tolerant OCN Routing 14.1.2 Fault-tolerant OCN Router Architectures 15.2 On-chip Network Router Testing 17.2.1 Structural Approaches 18.2.2 Functional Approaches 20.3 Summary 22hapter 3 20-path Router for 2D-mesh Based OCN with BIST/SD 23.1 20-path Router Model for 2D-mesh Based OCNs 23.2 Design of 20-path router with BIST/SD/FI 25.2.1 Architecture of Generic Router 25.2.2 Proposed Embedded BIST and BISD Circuit 26.2.3 Proposed Fault Isolation Circuit 28.3 Experiments on coverage of test patterns 30.4 Summary 31hapter 4 A Scalable BIST/SD Architecture for mesh-based OCN 33.1 Issues and Limitations of BIST Architectures in 20PR 33.2 Surrounding Test Ring (STR) 34.2.1 Test and Diagnosis Algorithms of STR 36.2.2 Implementation of STR 45.3 Summary 49hapter 5 Implementations & Experiments 51.1 Hardware Overhead & Test Cycle 51.2 Simulation of System Performance 54.2.1 Simulation on Faulty OCN Routers 55.2.2 Simulation on Faulty Links 59.3 Comparisons 61hapter 6 Conclusions & Future Works 63.1 Conclusions 63.2 Future Works 64eference 651241211 bytesapplication/pdfen-US晶片內網路容錯設計內建自我測試內建自我診斷On-chip NetworkFault-tolerant DesignBuilt-in Self-testBuilt-in Self-diagnosis適用於二維格狀多處理器晶片系統之可容錯晶片內網路架構Fault-tolerant On-chip Network Architecture for 2D-mesh Based Chip Multiprocessor Systemsthesishttp://ntur.lib.ntu.edu.tw/bitstream/246246/189201/1/ntu-98-R96943012-1.pdf