A Fast-Lock Mixed-Mode DLL with Wide-Range Operation and Multiphase Outputs

Kuo-Hsing Cheng and Yu-Lung Lo
Department of Electrical Engineering
National Central University, Jung-Li, Taoyuan, Taiwan R.O.C.
Email: {cheng,yllo}@ee.ncu.edu.tw

Abstract

This paper describes a fast-lock mixed-mode delay-locked loop (MMDLL) for wide-range operation and multiphase outputs. The architecture of the proposed DLL uses the mixed-mode time-to-digital converter (TDC) scheme for frequency range selector, a start-up circuit and coarse tune circuit to offer the faster lock time. And the multi-controlled delay cell for voltage-controlled delay line (VCDL) was used to provide the wide locked range and low-jitter performance. The charge pump circuit is implemented by digital controlled scheme to reach bandwidth tracking. The chip has been fabricated using the TSMC 0.25-µm single-poly five-metal CMOS process with a 2.5 V power supply voltage. From the measurement results, this DLL can operate correctly when the input clock frequency is changed from 32 to 320 MHz and generate ten-phase clocks within just one clock cycle. Moreover, the proposed DLL can solve the problem of false locking associated with conventional DLL’s and wide-range operation. At 320 MHz, the measured peak-to-peak jitter and root-mean-squared jitter are 37.2 ps and 2.492 ps, respectively. Furthermore, the locking time is less than 22 clock cycles based on the HSPICE simulation results. The DLL occupies smaller area (0.32×0.22 mm²) and dissipates less power (15 mW) than other wide-range DLL’s [1]–[7].

1. Introduction

For high-speed and high-integration density VLSI systems, the phase-locked loops (PLL’s) and delay-locked loops (DLL’s) have been widely adopted to solve clock signal skew and jitter in microprocessors, memory interfaces and communication IC’s. Generally, if there is no need for frequency multiplication, DLL’s would be more attractive than PLL’s because they are more stable and easier to implement together with digital circuits and exhibit better jitter accumulation characteristics than PLL’s. Therefore, the DLL’s offer better jitter performance than PLL’s [1]–[2]. In many DLL applications, such as high speed clock/data recovery (CDR), DDR [7] and multiphase clock generation, the multiphase VCDL output is usually used to implement the circuit function.

Conventional DLL’s may suffer from harmonic locking over wide operating range. Therefore, various wide-range DLL’s architectures [1]–[7] are proposed to overcome the problem of false locking. However, such DLL’s resulted in complex architectures that faced such problems as increased area, added power consumption and degradation of jitter performance. According to the previous works, a mixed-mode architecture, which increases the lock range having no degradation of jitter performance with a relatively small overhead in area and power, is proposed in this paper.

Figure 1: Block diagram of the proposed DLL.
2. **Architecture**

2.1 **Architecture and Operating Principle of Proposed DLL**

The architecture of the proposed DLL is shown in Fig. 1. It has two major blocks, the frequency range selector and DLL core. The frequency range selector can detect the input frequency range and generate digital control signals (S1, S0) to switch the delay time range of the multi-controlled VCDL and charge pump for tuning the loop bandwidth. The digital-mode control scheme for range selector and the flat gain of the VCDL not only offer the faster lock time and wide locked range but also provide a low-jitter performance for various frequency application. The other block is DLL core that consists of PD, CP, LF, start-up circuit, coarse tune circuit and the multi-controlled delay line for VCDL. The start-up circuit sets the initial control voltage of the loop filter to VDD. On the same time, the coarse tune circuit is activated to generate a large discharge current until the output clock (out_clk) of the delay line is close to the input reference clock (ref clk). Therefore, the coarse-tuning scheme can offer the faster lock time in this DLL.

[Diagram of DLL architecture]

2.2 **Circuits Description**

2.2.1 **Frequency Range Selector**

A time-to-digital converter (TDC) is a unit able to perform a time interval measurement and convert the result with a digital readout. In this DLL, the mixed-mode TDC scheme was applied as the frequency range selector. It generates 2-bit control signals (S1, S0) and thus switch the delay time range of the multi-controlled delay cell and the current of the charge pump to achieve adaptive bandwidth as shown in Fig. 2. The frequency range selector can choose the delay time range in the VCDL, according to the various input reference clock. The architecture consists of a ten-stage delay line, an encoder and a control unit. The circuit of the ten-stage delay line is similar to the multi-controlled delay cell which is used to avoid the process variation. The encoder and the control unit are composed of only latches and simple logic gates.

While the DLL is activated, the ref_clk will be propagated in the delay cell of the ten-stage delay line. The outputs (o1 – o10) of the delay line will be AND with the ref_clk. The outputs (Q1,Q3,Q5,Q10) of the encoder are the digital value to describe the time interval in which the ref_clk is high, therefore, the frequency range of the input reference clock is determined. Four different operating ranges of this DLL are selected by the 2-bit control signals (S1, S0). For example, when the reference clock is 100 MHz, the outputs of the encoder are bit streams of “1110” and the 2-bit control signals (S1, S0) is (0, 1). When the input reference clock is 250 MHz, the outputs of the encoder are bit streams of “1000”. Then the 2-bit control signals (S1, S0) is (1, 1). Based on the operating of frequency range selector, a wide operating range of this DLL is achieved.

[Diagram of frequency range selector]

2.2.2 **Multi-Controlled Delay Cell**

In PLL’s, a voltage-controlled oscillator (VCO) with a larger gain (Kvco) is more prone to high spurs and phase

[Diagram of multi-controlled delay cell]
noise. One means of achieving simultaneously a large tuning range and small $K_{VCO}$ is to divide a single wide-range tuning curve into several narrow-range sections with sufficiently frequency. In this DLL, a multi-controlled delay cell is proposed for the flat gain of the VCDL, to provide a wide operating frequency range and low-jitter performance. The multi-controlled delay cell is employed as the basic half delay cell element (HDCE) and two HDCEs compose a delay cell (DC), as shown in Fig. 3.

The multi-controlled delay cell comprises four operating frequency ranges. Figure 4 shows the principle of operating range of the proposed DLL. The delay time ($T_{VCDL_{max}}$, $T_{VCDL_{min}}$ and $T_{d_1}$) of the multi-controlled delay cell is determined by three control signals $V_{ctrl}$, $S0$ and $S1$. In range1, the control signals ($S1$, $S0$) are (0,0), the operating range is between $T_{VCDL_{max}}$ and $T_{d_1}$. In range2, the $S0$ is high and the $S1$ is 0; the operating range is between $T_{d_2}$ and $T_{d_3}$, and so on. The mixed-mode wide-range DLL must design the boundary ($T_b$) of the delay line to determine various frequency ranges. To avoid the false locking with process, voltage supply, temperature, and loading (PVTL) variations, the operating range will overlap for each other. If the overlapping region is too small so the points $T_{b_1}$, $T_{b_2}$ and $T_{b_3}$ disappear, then the gain of VCDL can be increased to make these points appear. From simulation results of VCDL transfer function reveal that $K_{VCDL1} > K_{VCDL2} > K_{VCDL3} > K_{VCDL4}$. Where the gain ($K_{VCDL}$) of VCDL in range1 is $K_{VCDL1}$; $K_{VCDL}$ is in range2 is $K_{VCDL2}$, and so on.

2.2.3 Coarse Tune Circuit

In this DLL, the coarse tune circuit is proposed to offer faster locking time and provide better jitter performance as shown in Fig. 5. When the input reference clock is activated, the output pulse of start-up circuit is used to set the initial control voltage of the loop filter to VDD. The out_clk of the delay line leads before the ref_clk. The dn and dn1 signals are activated and the Vctrl greatly decreases. When the out_clk is close to the ref_clk, the dn1 signals will disappear. The PD up and dn signals are then used to fine-tune this DLL.

The coarse tune circuit consists of a D-type flip-flop (D_FF) and an adjustable pulse width circuit. The adjustable pulse width circuit output (out_apw) is generated to reset the out_clk, when the out_clk leads before the ref_clk. A schematic of the adjustable pulse width circuit is shown in Fig. 5(b). The adjustable pulse width is controlled by the ref_clk and control voltage (Vctrl) of filter. The adjustable pulse width circuit consists of three stages VCDL and a AND gate. The delay between ref clk and int clk of the three stages VCDL is controlled by Vctrl. When the Vctrl increases that cause the delay between ref_clk and int_clk of the three stages VCDL decreasing. After the AND gate, the narrow pulse width is generated.

Figure 6: The schematic of the digital controlled charge pump and loop filter.
Table 1: The charge pump currents versus the KVCDL.

<table>
<thead>
<tr>
<th>Region</th>
<th>Control Signals</th>
<th>Condition</th>
<th>Current (Ich)</th>
<th>Gain (KVCDL)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Range 1</td>
<td>S0=0, S0=0</td>
<td>TVCDL.max &lt; 2×Td1</td>
<td>Ich1=Id1</td>
<td>KVCDL1</td>
</tr>
<tr>
<td>Range 2</td>
<td>S0=0, S0=1</td>
<td>Td2 &lt; 2×Td3</td>
<td>Ich2=Id1+Id2</td>
<td>KVCDL2</td>
</tr>
<tr>
<td>Range 3</td>
<td>S0=1, S0=0</td>
<td>Td4 &lt; 2×Td5</td>
<td>Ich3=Id1+Id3</td>
<td>KVCDL3</td>
</tr>
<tr>
<td>Range 4</td>
<td>S0=1, S0=1</td>
<td>Td6 &lt; 2×TVCDL.min</td>
<td>Ich4=Id1+Id2+Id3</td>
<td>KVCDL4</td>
</tr>
</tbody>
</table>

Figure 7: Microphotograph of the proposed DLL.

2.2.4 Digital Controlled Charge Pump

To design of a suitable DLL depends on various considerations, such as bandwidth requirement, jitter performance and lock time. From the analysis of the DLL small-signal AC model indicates that the fast acquisition time in the initial state and the jitter performance in the locked state are two important concerns. The wide-range operation DLL has various gains (KVCDL) of VCDL, and the loop bandwidth changes with different KVCDL. Furthermore, the current of the charge pump also influences the loop bandwidth. If the loop bandwidth is wide, then the lock time is short, but the input noise is larger. However, when the loop bandwidth is narrow, then the lock time is longer, but the input noise is smaller. Therefore, the loop bandwidth of DLL must be chosen carefully. Equation 1 shows the ratio of the loop bandwidth to the input operating frequency is [8]

\[
\frac{\omega_B}{\omega_{REF}} = \frac{I_{CH} \cdot KVCDL}{2\pi \cdot C}
\]

Figure 8: Clock waveforms at 32 MHz. (a) Output clock and phase 2. (b) Output clock and phase 5.

Figure 9: The jitter histogram. (a) When operation frequency at 32 MHz and (b) at 320 MHz.

Figure 10: Shmoo plot of input frequency versus supply voltage.
3. Experiment and Comparison Results

The proposed DLL has been realized in a 0.25-μm standard CMOS technology and the chip area is 0.823×0.823 mm²; this area includes the areas of the input and output digital buffers and I/O pads. The active area of DLL is 0.32×0.22 mm² and the loop filter consumes ~50% of the total active area. A microphotograph of this DLL is shown in Fig. 7. Figure 8 (a) shows the waveforms of output clock and phase 2 at 32MHz, respectively, and have a 72° phase difference. Figure 8 (b) shows the waveforms of output clock and phase 5, which are an inversion of each other with a 180° phase difference. Figure 9 (a) shows the measured jitter histogram for the DLL output clock at 32 MHz. The peak-to-peak jitter is 4.77ps with quiet supply, 3.297ps with quiet supply and 3.290ps with supply 2.3 V, respectively. From the simulation results, the maximum locking time is 22 clock cycles at 160 MHz.

The Shmoo plot of the proposed DLL is obtained to investigate the dependence, as shown in Fig. 10. The y-axis represents the supply voltage VDD from 1.1 to 2.5 V, and the x-axis represents the input frequency from 32 to 320 MHz. Even in the low supply voltage condition, the proposed DLL can still remain in the locked state. The measurements show that the proposed DLL has low power and a wide operating frequency range.

<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>Process</td>
<td>0.25-μm CMOS</td>
<td>0.25-μm CMOS</td>
<td>0.35-μm FPM CMOS</td>
<td>0.18-μm DRAM</td>
<td>0.25-μm 1P3M CMOS</td>
</tr>
<tr>
<td>Supply Voltage</td>
<td>2.5 V</td>
<td>2.5 V</td>
<td>2.5 V</td>
<td>2.5 V</td>
<td>2.5 V</td>
</tr>
<tr>
<td>Operating Range</td>
<td>150 ~ 600MHz</td>
<td>62.5 ~ 250MHz</td>
<td>6 ~ 130MHz</td>
<td>6 ~ 130MHz</td>
<td>32 ~ 320MHz</td>
</tr>
<tr>
<td>Lock Time</td>
<td>NA</td>
<td>NA</td>
<td>NA</td>
<td>22 clock cycles (Max.)</td>
<td></td>
</tr>
<tr>
<td>RMS Jitter</td>
<td>6.68ps with quiet supply</td>
<td>6.79ps with quiet supply</td>
<td>4.77ps with quiet supply</td>
<td>4.77ps with quiet supply</td>
<td></td>
</tr>
<tr>
<td>Peak-to-Peak Jitter</td>
<td>5ps with quiet supply</td>
<td>25ps with quiet supply</td>
<td>25ps with quiet supply</td>
<td>49.2ps with quiet supply</td>
<td></td>
</tr>
<tr>
<td>Power Consumption</td>
<td>6.6mW (500MHz)</td>
<td>12.6mW (350MHz)</td>
<td>13.2mW (350MHz)</td>
<td>326mW (320MHz)</td>
<td></td>
</tr>
<tr>
<td>Active Area</td>
<td>0.13 mm²</td>
<td>0.06 mm²</td>
<td>0.27 mm²</td>
<td>0.39 mm²</td>
<td></td>
</tr>
</tbody>
</table>

The comparison results are according to various wide-range DLL architectures, as shown in Table 2. The parameters are captured from the references [1], [3], [5] and [7], where the ADL DLL is reference [1], the DLL of the reference [3] is AAAM DLL, the AWR DLL is reference [5], the ALJ DLL is reference [7]. From the comparison results, the proposed DLL can achieve wide operating frequency than the the ADL DLL [1], AAAM DLL [3] and the ALJ DLL [7], and has faster lock time than the AWR DLL [5]. Furthermore, the area cost and power consumption of the prototype chip are much smaller than those of other wide-range DLL’s [1] – [7].

4. Conclusions

In this paper, a fast-lock mixed-mode delay-locked loop for wide-range operation and multiphase outputs with a single clock cycle is proposed. The mixed-mode TDC architecture for the frequency range selector can automatically choose one of the delay lines in the VCDL according to various frequencies. The multi-controlled delay cell element for VCDL was switched by the frequency range selector, providing a wide operating frequency range and low-jitter performance. The coarse tune circuit is proposed to reduce the lock time and keep a better jitter performance in this DLL. The charge pump circuit is implemented by the digital control scheme to achieve adaptive bandwidth. Consequently, this DLL can operate correctly when the input clock frequency is changed from 32 to 320 MHz and generate ten-phase clocks.

Acknowledgment

The authors thank National Chip Implementation Center, Taiwan, R.O.C., for chip implementation.

References