DATE 2006 Session 5B: Timing and Noise Analysis

Similar documents
Impact of Local Interconnects on Timing and Power in a High Performance Microprocessor

Fig. 1: Typical PLL System

AUTOMATIONWORX. User Manual. UM EN IBS SRE 1A Order No.: INTERBUS Register Expansion Chip IBS SRE 1A

A BIST Scheme for Testing and Repair of Multi-Mode Power Switches

Lecture 10: Efficient Design of Current-Starved VCO

Analog Design and System Integration Lab. Graduate Institute of Electronics Engineering National Taiwan University

Design of Low-Power CMOS Cell Structures Using Subthreshold Conduction Region

PHASE-LOCKED loops (PLLs) are one of the key building

A low supply voltage and wide-tuned CMOS Colpitts VCO

Texas A&M University Electrical Engineering Department. ELEN 665 RF Communication Circuits Laboratory Fall 2010

>Talko. Have your modular speak out the CV, the LPC, 8 bits, vintage way. Page 1

Low-Power 1-bit CMOS Full Adder Using Subthreshold Conduction Region

The Do s and Don ts of Pressure Transducers

AND BIOMEDICAL SYSTEMS Rahul Sarpeshkar

Robust Subthreshold Circuit Design to Manufacturing and Environmental Variability. Masanori Hashimoto a

Standard Products ACT15530 CMOS Manchester Encoder / Decoder May 16, 2005

TOTAL IONIZING DOSE TEST REPORT

Adaptive Mode Control: A Static-Power-Efficient Cache Design

Evaluation of heavy wire bond tools with ceramic tip technology

Transitioning from the CS4362 to the CS4362A or CS4365

Multi-band LC VCO GHz PMCC_VCOMB12G

Slew-Aware Clock Tree Design for Reliable Subthreshold Circuits Jeremy R. Tolbert, Xin Zhao, Sung Kyu Lim, and Saibal Mukhopadhyay

GLAST Large Area Telescope:

Design Methodology for Fine-Grained Leakage Control in MTCMOS

An Energy Efficient Sensor for Thyroid Monitoring through the IoT

5GHz Band SPDT Switch + LNA GaAs MMIC

What is a stepper motor?

VHDL implementation of Neuron based classification for future artificial intelligence applications

SSO Simulation with IBIS

TOTAL IONIZING DOSE TEST REPORT

Verilog-AMS-PAM: Verilog-AMS integrated with Parasitic-Aware Metamodels for Ultra-Fast and Layout-Accurate Mixed- Signal Design Exploration

Total Ionizing Dose Test Report. No. 16T-RT3PE3000L-CG896-QMLPK

Post-Synthesis Sleep Transistor Insertion for Leakage Power Optimization in Clock Tree Networks

SNJB College of Engineering Department of Computer Engineering

Section 1. Objective What is an electronic component? 2. What are passive and active components? 3. What are discrete components and ICs?

Spiking Inputs to a Winner-take-all Network

Rolls,E.T. (2016) Cerebral Cortex: Principles of Operation. Oxford University Press.

Bulk CMOS Device Optimization for High-Speed and Ultra-Low Power Operations

Project: Feedback Systems for Alternative Treatment of Obstructive Sleep Apnea

Course Syllabus. Operating Systems, Spring 2016, Meni Adler, Danny Hendler and Amnon Meisels 1 3/14/2016

PFL780 PCB Fault Locator

Best Practices in PROFIBUS Network Diagnostics

A 2.4 GHZ FULLY INTEGRATED LC VCO DESIGN USING 130 NM CMOS TECHNOLOGY

Modulation Coding for Flash Memories

Test Equipment Depot Washington Street Melrose, MA TestEquipmentDepot.com

VLSI Design, Fall Design of Adders Design of Adders. Jacob Abraham

SiS9255. Projected Capacitive. Touch-Screen Micro Processor. Data sheet. Rev. 1.2 August 18, 2015

Recap DVS. Reduce Frequency Only. Reduce Frequency and Voltage. Processor Sleeps when Idle. Processor Always On. Processor Sleeps when Idle

Viability of Low Temperature Deep and Ultra Deep Submicron Scaled Bulk nmosfets on Ultra Low Power Applications

Implementation of a Multi bit VCO based Quantizer using Frequency to Digital Converter

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF MECHANICAL ENGINEERING QUESTION BANK. Subject Name: ELECTRONICS AND MICRIPROCESSORS UNIT I

When designing with these products it is important that the Designer take note of the following considerations:

DC Unbalance Tolerance Testing in PSE s

Mike Davies Director, Neuromorphic Computing Lab Intel Labs

Efficient Feature Extraction and Classification Methods in Neural Interfaces

Avaya 2500 Series Analog Telephones Voluntary Product Accessibility Template (VPAT)

Comparative Analysis of Voltage Controlled Oscillator using CMOS

Shannon Expansion Based Supply-Gated Logic for Improved Power and Testability

Statistical Analysis of MUX-based Physical Unclonable Functions

MPS Baseline proposal

A Multiple-Crystal Interface PLL With VCO Realignment to Reduce Phase Noise

Fault Injection Attacks

Line Coding. Acknowledgments:

Convolutional Coding: Fundamentals and Applications. L. H. Charles Lee. Artech House Boston London

BHARATH INSTITUTE OF HIGHER EDUCATION AND RESEARCH ODD SEMESTER UNIVERSITY EXAMINATION TIME TABLE / NOV

Mechanical Simulation of Probing on SMART POWER POA devices

IN RECENT years, the demand for power-sensitive designs

MAE 298, Lecture 10 May 4, Percolation and Epidemiology on Networks

Design Considerations and Clinical Applications of Closed-Loop Neural Disorder Control SoCs

>Talko _. Die Stimme der Maschinen. Page 1

Symbolic CTL Model Checking

Unified Padring Design Flow

Fan Out Simple to Complex John Hunt

CSA Z32 TESTING GUIDELINE AND PROCEDURES

Digital. hearing instruments have burst on the

Reduction of Subthreshold Leakage Current in MOS Transistors

SUBTHRESHOLD CIRCUIT DESIGN AND OPTIMIZATION. Sungil Kim

Timing-Based Delay Test for Screening Small Delay Defects

Heat Drain Device for Ultrasound Imaging Probes ESAOTE SPA

OIML R 122 Annex C RECOMMENDATION. Edition 1999 (E) ORGANISATION INTERNATIONALE INTERNATIONAL ORGANIZATION

EKG Sensor Product Number: ENEKG189

ERA: Architectures for Inference

Avaya G450 Branch Gateway, R6.2 Voluntary Product Accessibility Template (VPAT)

Development of an Electronic Glove with Voice Output for Finger Posture Recognition

HIGH-VOLTAGE SUPER-JUNCTION SOI-LDMOSFETS WITH REDUCED DRIFT LENGTH

Analog Devices Welcomes Hittite Microwave Corporation NO CONTENT ON THE ATTACHED DOCUMENT HAS CHANGED

Total Ionizing Dose Test Report. No. 18T-RT4G150-LG1657-K04J0

Feasibility Study for: IEEE GbE Electrical Backplane / Copper Cabling Study Group. Fort Lauderdale, FLA. Vittal Balasubramanian

STUDY ON APPLICATION OF STRAIN MEASURING TECHNOLOGY IN BOARD LEVEL ASSEMBLY PROCESS

Path to High-Quality Films on Continuous Substrates

Safety Issue of Static Magnetic Field (1)

Power Blackouts and the Domino Effect: Real-Life examples and Modeling

PoDL- Decoupling Network Presentation Andy Gardner. May 2014

Stacked Die Advanced Interconnect Technologies

Avaya G450 Branch Gateway, Release 7.1 Voluntary Product Accessibility Template (VPAT)

CSE 599E Lecture 2: Basic Neuroscience

Hoare Logic and Model Checking. LTL and CTL: a perspective. Learning outcomes. Model Checking Lecture 12: Loose ends

Statistical Analysis of MUX-based Physical Unclonable Functions

International Journal of Information Technology Convergence and Services (IJITCS) Vol.2, No.5, October 2012

TIMELY: RTT-based Congestion Control for the Datacenter

Transcription:

DATE 2006 Session 5B: Timing and Noise Analysis Bus Stuttering : An Encoding Technique To Reduce Inductive Noise In Off-Chip Data Transmission Authors: Brock J. LaMeres, Agilent Technologies Sunil P. Khatri, Texas A&M University March 8, 2006 Bus Stuttering 1

Agenda Problem Motivation Our Solution Experimental Results March 8, 2006 Bus Stuttering 2

Why is IC Packaging Important? All Electronic Circuitry Resides in a Package - The package serves many purposes: 1) Protection of devices 2) Density Translation 3) Thermal Dissipation 4) Manufacturing Standardization Packaging Limits System Performance March 8, 2006 Bus Stuttering 3

Why is packaging limiting performance? IC Design/Fabrication is Outpacing Package Technology - We re seeing exponential increase in IC transistor performance - >1.3 Billion transistors on 1 die [Fall IDF-05] March 8, 2006 Bus Stuttering 4

Why is packaging limiting performance? Packages Have Been Designed for Mechanical Performance - Electrical performance was not primary consideration - IC s limited electrical performance - Package performance was not the bottleneck March 8, 2006 Bus Stuttering 5

Why is packaging limiting performance? VLSI Performance Exceeds Package Performance - Packages optimized for mechanical reliability, but still used due to cost - IC performance far exceeds package performance On-Chip -fic > 4GHz - large signal counts - exponential scaling Package -fpkg < 2GHz - limited signal counts - linear scaling March 8, 2006 Bus Stuttering 6

Why is packaging limiting performance? Package Interconnect Contains Parasitic Inductance - Long interconnect paths Wire Bond Inductance (~2.8nH) - Large return loops - L = Φ Area March 8, 2006 Bus Stuttering 7

Why is packaging limiting performance? Package Parasitics Limit Performance - Excess inductance causes package noise - Noise limits how fast the package can transmit date 1) Supply Bounce 2) Signal Coupling March 8, 2006 Bus Stuttering 8

Why is packaging limiting performance? Aggressive Package Design Helps, but is expensive - 95% of VLSI design-starts are wire bonded - Goal: Extend the life of current packages QFP Wire Bond : 4.5nH $0.22 / pin BGA Wire Bond : 3.7nH $0.34 / pin *** BGA Flip-Chip : 1.2nH $0.63 / pin March 8, 2006 Bus Stuttering 9

Our Solution Encode Off-Chip Data to Avoid Inductive Cross-talk Avoid the following cases: 1) Excessive switching in the same direction = reduce ground/power bounce 2) Excessive X-talk on a signal when switching = reduce edge degradation 3) Excessive X-talk on signal when static = reduce glitching March 8, 2006 Bus Stuttering 10

Our Solution This results in: 1) A subset of vectors is transmitted that avoids inductive X-talk. 2) The off-chip bus can now be ran at a higher data rate. 3) The subset of vectors running faster can achieve a higher throughput over the original set of vectors running slower (including overhead). Throughput of less vectors at higher data-rate Throughput of more vectors at lower data-rate March 8, 2006 Bus Stuttering 11

Bus Stuttering CODEC Intermediate States are Inserted Between Noise Causing Transitions - Stutter states limit the number of simultaneously switching signals - The source synchronous clock is gated during stutter state No Encoding Un-encoded: Core A A A B B B C C C A A A B B B C C C Package B C Vector Sequence Causes Noise Limit Violation w/ Encoding A B C A B stutter C Encoded: Core A A B B C C Encoder A B stutter C A B stutter C Package B C Vector Sequence is eliminated using Stutter March 8, 2006 Bus Stuttering 12

Bus Stuttering CODEC Noise Sources Simultaneous Switching Noise Supply Bounce Induced Self Voltage V self di n i = L i dt Glitching Coupling onto Non-Switching Signals Edge Degradation Coupling onto Switching Signals Data Dependent Delay V couple di k k = M1 k 1 dt March 8, 2006 Bus Stuttering 13

Bus Stuttering CODEC - Constraints For Each Possible Noise Source on the Bus, a Constraint is written: 1) v j 0 = VDD - P bnc > (L/2) (# of v ij pins =1) < Pbnc 2) v j 1 = 1 k1 (v 2j ) + k2 (v 3j ) > P 1 3) v j 1 = -1 k1 (v 2j ) + k2 (v 3j ) < P -1 4) v j 1 = 0 - P 0 < k1 (v 2j ) + k2 (v 3j ) < P 0 Each Constraint is Evaluated to Find Illegal Transitions: v 1 j = 1 = rising v 1 j v 2 j v 3 j v 1 j = 0 = static v 1 j = -1 = falling 1 0 1 1-1 0 1-1 -1 1-1 1 1 1 0 1 1-1 1 1 1 violates user-defined glitch parameter violates user-defined supply bounce parameter March 8, 2006 Bus Stuttering 14

Bus Stuttering CODEC - Algorithm Constraints are Evaluated and a Legal Directed Graph is Created G Directed Graph is Used to Map Transitions Between any Two Vectors - A transition path (which may include stutters) exists between any two vectors if: There exists at least two outgoing edges for each vector v s G (including self-edge) There exists at least two incoming edges for each vector v d G (including self-edge) March 8, 2006 Bus Stuttering 15

Bus Stuttering CODEC - Construction Multiple Stutter States can be used - between 0 and 2 (Wbus-1) stutters can be inserted between any two vectors - experimental results show that for segments up to 8 bits, more than 3 stutters is rare Overhead - Overhead increases as segments sizes increase - Still useful since segments greater than 8 bits are rarely used (SPG=8:1:1) Overhead = 2 ( W bus 1) k = 1 (#_ Trans _ Re quiring _ k _ stutters) k 2 (2 W ) bus March 8, 2006 Bus Stuttering 16

Bus Stuttering CODEC Physical Results Circuit Implementation - 32 pipeline stages used - pipeline reset after 32 idle states (similar to SRIO, HT, and PCI Express) - protocol inherently handles pipeline overflow March 8, 2006 Bus Stuttering 17

Bus Stuttering CODEC Physical Results SPICE Simulations - 3 bit segment (SPG=3:1:1) - fixed di/dt - Maximum noise reduced by limiting simultaneously switching signals Ground Bounce Glitching Edge Degradation March 8, 2006 Bus Stuttering 18

Bus Stuttering CODEC Physical Results TSMC 0.13um Synthesis Results - RTL design, synthesized and mapped - Segment sizes 2 8 implemented - Logic, delay, and area evaluated March 8, 2006 Bus Stuttering 19

Bus Stuttering CODEC Physical Results Xilinx FPGA, 0.35um Implementation Results - RTL design implemented - Xilinx, VirtexIIPro, FPGA March 8, 2006 Bus Stuttering 20

Bus Stuttering CODEC Physical Results Xilinx FPGA, 0.35um Implementation Results - RTL design, implemented - Logic operation verified - Noise Reduced from 16% to 4% (4 bit, SPG=4:1:1) March 8, 2006 Bus Stuttering 21

Conclusion Packaging Performance is the Largest System Bottleneck Stutter Encoding Avoids Worst-Case Noise Patterns Performance Improved Even After Considering Encoding Overhead March 8, 2006 Bus Stuttering 22