Repository logo
  • English
  • 中文
Log In
Have you forgotten your password?
  1. Home
  2. College of Electrical Engineering and Computer Science / 電機資訊學院
  3. Electronics Engineering / 電子工程學研究所
  4. Architecture and Algorithm Design for Image Coding and Scalable Video Application
 
  • Details

Architecture and Algorithm Design for Image Coding and Scalable Video Application

Date Issued
2009
Date
2009
Author(s)
Pan, Chia-Ho
URI
http://ntur.lib.ntu.edu.tw//handle/246246/189127
Abstract
Multimedia applications are more and more popular in our life as the rapid progress of image sensor, display devices, communication, VLSI manufacture, computing engines, and image/video coding standards. Many advanced multimedia applications require image and video compression technology with higher compression ratio and better visual quality. High quality, high compression rates of digital image/video, and low computational cost are important factors in many areas of consumer electronics. These requirements usually involve computationally intensive algorithms imposing trade-offs between quality, computational resources, and throughput. Hence, the researches of hardware-oriented algorithms and VLSI architectures push the progress of multimedia applications. This dissertation has two main purposes: to propose VLSI architectures for efficient implementation of the image coding systems and to provide algorithm designs of scalable video application systems for the emerging requirement in real multimedia applications.n the first part, we describes system analysis and architecture design of JPEG XR encoder. We proposed two chip implementations for JPEG XR image coding.Firstly, a 4:4:4 lossless/lossy symbol-based JPEG XR encoder is implemented on a 3.222 mm$^{2}$ with 90nm CMOS technology dissipating 95.7 mW at 0.9 V and 62.5 MHz. It is capable of processing 34.1 Mega samples within one second for lossless/lossy coding. The timing schedule and pipelining of color conversion, pre-filter, PCT and quantization modules are well designed. In order to prevent accessing the coefficients from off-chip memory, n on-chip SRAM is designed to buffer the coefficients. We use well arranged sub-pipeline timing schedule for the implementation of the entropy encoding module to increase the throughput about 3 times. This design is dedicated for the DSC and digital frame application. The another chip design is channel parallel JPEG XR encoder. An five-stage block pipelined architecture with proposed system scheduling supports real-time 4:4:4 full-HD(1920x1080p) lossless/lossy processing ability. We analysis the dependency of RLE and Flexbits modules and adopt multi-symobl architecture to reduce the processing cycles. It is implemented on 9.61 mm$^{2}$ with 0.18 um CMOS technology and 81 MHz. The 187 Mega samples/sec throughput are achieved by proposed system scheduling, high degree of parallelism, reducing memory access, and algorithmic optimization. In addition, the processing ability is six times larger than our first work. Our proposed architecture is the worldwide first reported JPEG XR single-chip encoder. he second part of this dissertation describes two algorithm designs for scalable video application which is emerging recently. When transmitting video over a heterogeneous network, it is required to satisfy the different constraints due to the preferences and equipment selections of different users. More than one video parameters include spatial frame size, temporal frame rate, and visual quality resolution are utilized to provide better scalability in scalable video application. It is difficult to find the relationship between the various video parameter settings and user preferences. In this part, we propose a multidimensional adaptation selection scheme to match the preferences of the video parameters for each user. This scheme characterizes the relationship between spatial, temporal and SNR scalabilities according to the subjectiveness of each user. An objectivity-derived emulation scheme is used in video adaptor to realize the selection of multidimensional adaptation. Therefore, our proposed video adaptor provides more appropriate adjustments of the video parameters for each user. After objectivity-derived scheme is derived, optimization fitting of proposed model for each user is the next important things. We proposed soft-decision optimization scheme to overcome the uncertainty of the user, which not discussed in presented literatures. Besides, our proposed video adaptor identifies the key frames in sequence to utilize the bandwidth in a more efficient way and achieve better subjective visual quality. The proposed method improves the average accuracy prediction rate from 75% to 94% in overall available adaptation bandwidth of test sequences. The experimental results show that the video adaptor provides high consistence of quality between the adjusted video stream and the expectation of users. Because we analyze the user preference according to the compress-domain data, this scheme can be used in video proxy or gateway without much computation overhead. o satisfy the urgent of providing real-time video service over error prone network in scalable video transmission systems, how to protect the video streaming to have better visual quality is also an important issue. We propose a way to protect the video header information in application layer without modifying standardized syntax. Because we will not modify the syntax of the existed standard and the redundant bits can be embedded in the bitstream, this scheme can be used in combination with any video codecs. Our method can be applied for the environment of video streaming system that we practically used today, since the effort that we made is confined in the application layer. Beside, we also consider channel condition of wireless transmission and propose a way to reduce redundant bits used in channel coding. By doing this, the bitstream can be simply transmitted over practical network such as mobile TV in scalable video streaming application and the reconstructed picture quality outperforms the original one.n brief, we believe that with the technologies proposed in this dissertation can be realized in many real practical systems. We sincerely hope that our research contributions can create a new era for digital multimedia life.
Subjects
Scalable Video Coding(SVC)
APEC
Type
thesis
File(s)
Loading...
Thumbnail Image
Name

ntu-98-D91943003-1.pdf

Size

23.32 KB

Format

Adobe PDF

Checksum

(MD5):03c5c2a402a062aa2f8af627e7bc4015

臺大位居世界頂尖大學之列,為永久珍藏及向國際展現本校豐碩的研究成果及學術能量,圖書館整合機構典藏(NTUR)與學術庫(AH)不同功能平台,成為臺大學術典藏NTU scholars。期能整合研究能量、促進交流合作、保存學術產出、推廣研究成果。

To permanently archive and promote researcher profiles and scholarly works, Library integrates the services of “NTU Repository” with “Academic Hub” to form NTU Scholars.

總館學科館員 (Main Library)
醫學圖書館學科館員 (Medical Library)
社會科學院辜振甫紀念圖書館學科館員 (Social Sciences Library)

開放取用是從使用者角度提升資訊取用性的社會運動,應用在學術研究上是透過將研究著作公開供使用者自由取閱,以促進學術傳播及因應期刊訂購費用逐年攀升。同時可加速研究發展、提升研究影響力,NTU Scholars即為本校的開放取用典藏(OA Archive)平台。(點選深入了解OA)

  • 請確認所上傳的全文是原創的內容,若該文件包含部分內容的版權非匯入者所有,或由第三方贊助與合作完成,請確認該版權所有者及第三方同意提供此授權。
    Please represent that the submission is your original work, and that you have the right to grant the rights to upload.
  • 若欲上傳已出版的全文電子檔,可使用Open policy finder網站查詢,以確認出版單位之版權政策。
    Please use Open policy finder to find a summary of permissions that are normally given as part of each publisher's copyright transfer agreement.
  • 網站簡介 (Quickstart Guide)
  • 使用手冊 (Instruction Manual)
  • 線上預約服務 (Booking Service)
  • 方案一:臺灣大學計算機中心帳號登入
    (With C&INC Email Account)
  • 方案二:ORCID帳號登入 (With ORCID)
  • 方案一:定期更新ORCID者,以ID匯入 (Search for identifier (ORCID))
  • 方案二:自行建檔 (Default mode Submission)
  • 方案三:學科館員協助匯入 (Email worklist to subject librarians)

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science