A Modified Scan-D Flip-flop Design to Reduce Test Power.

Dr. Sunil P Khatri, Assistant Professor          Sivakumar Ganesan, Graduate Student
Department of ECE, Texas A&M University.

Abstract— Power consumption in scan based testing is high due to the toggling of the combinational logic during the scan shift. In this paper, we present a modified Scan Flip-flop architecture with a minimal area overhead which completely eliminates the switching power dissipation and also reduces the leakage power in the combination circuit during the shift phase of a scan based test. This also enables us to increase the shift frequency since power is no longer a limiting factor during scan shift and help in achieving test time reduction.

Index Scan based test. Scan Flip-flop, scan per vector.

I. INTRODUCTION

Testing of VLSI circuits can lead to generation of excessive heat which can damage the chips under test. In scan based testing methodology, high-performance CMOS circuits consume significant dynamic power during testing because of enhanced switching activity. The power and energy consumption of digital systems may increase significantly during testing. This extra power consumption during test application may reduce circuit reliability. The consumed energy directly corresponds to the switching activity generated in the circuit during test application. This has an impact on battery lifetime during remote self-testing. Test-per-scan techniques involve applying test patterns from a loaded scan chain to a circuit under test (CUT) every \( m+1 \) clock cycles, where \( m \) is the number of flip-flops in the scan chain. Most of the power-dissipation in the CUT will occur during the shifting of bits into the scan chain. The two types of power consumption that are of importance, during test mode of a circuit, are peak power and average power.

- The average power consumption is given by the ratio between the total energy and the test time. This parameter is more important than the instantaneous energy as hot spots and reliability problems may be caused by sustained high power consumption.

- The peak power consumption corresponds to the highest switching activity generated in the circuit under test, during one clock cycle. The peak power determines the thermal and electrical limits of components and the system packaging requirements. If the peak power exceeds certain limits, the correct functioning of the entire circuit is no longer guaranteed.

The main motivation for considering power consumption during test application is that power and energy of a digital system can be considerably higher in test mode than in system mode. The reason is that test patterns can cause many combinational nodes switch while a power saving system mode only activates a few modules at the same time. Another reason is that successive functional input vectors applied to a given circuit during system mode have a significant correlation. There is no specific correlation between the consecutive test patterns generated by an ATPG tool (for external testing) or that created by an LFSR (for BIST) for testing such circuits. This can cause considerably larger switching activity in the circuit during test than during its normal operation. Since power dissipation in CMOS circuits is proportional to switching activity, this excessive switching activity during test can cause several problems. In particular, it is shown that this increased power may be responsible for increased cost, reduced reliability, harder performance verification and technology related problems. Reduced power dissipation during test application is thus becoming an equally important figure of merit in today’s VLSI circuits design and is expected to become one of the major objectives in the near future. Practiced solutions include:

- Over designing the power supply, package and cooling to tolerate the increased current during testing. Breaks are inserted into the test process for avoiding hot spots.
- Testing with reduced operation frequency.
- Partitioning of the system under test and appropriate test planning.

The first solution increases both hardware costs and test application time. The second proposal uses less hardware, but the reduced system frequency increases test application time and may lead to a loss of defect coverage as dynamic faults may be masked. Moreover, this solution reduces the power consumption at the expense of a longer test time, but does not reduce the total energy consumption during test. Test partitioning and test planning allows the detection of dynamic faults, but increases hardware overhead and test time during BIST.
II. BACKGROUND

The two components of power dissipated in a CMOS circuit are:

i) Static dissipation due to leakage current through the channel and the tunneling current through the gate oxide

ii) Dynamic dissipation due to switching transient currents and charging and discharging of load capacitances.

Power consumption can be reduced by applying the test patterns at a slower frequency. Average dynamic power consumed due to charging and discharging of load capacitances is given by $1/2CV^2f^*N_G$ where $C$ is the output capacitance, $V$ is the voltage at which the test is run, $f$ is the frequency of operation during the test and $N_G$ is the total number of Gate output transitions[3]. Reducing $f$ obviously reduces the power consumption. This method of power reduction causes an unacceptable increase in test time. As designs get more complex, the test pattern volume grows, resulting in increased test time. Reducing frequency during test only worsens the problem.

With the shrinking of technology, the power due to leakage currents dominates the power due to active switching. The following graph from Cadence shows the trend in leakage current with reduction in gate length.

![Figure 1: Power Trends in Recent Technologies](image)

This clearly shows that power contribution due to leakage currents is a major factor and needs to be reduced.

III. PREVIOUS WORK

Previous work on low-power test-per-scan IC design can be classified into two broad categories: circuit modifications and test vector alterations [4], [5], [6]. Circuit modification techniques can be further divided into two categories, CUT and scan D flip-flop (DFF) modifications. Circuit partitioning divides the CUT into a number of smaller modules; each can be tested pseudo-exhaustively to reduce the average power consumption. Previous work also describes a toggle suppression technique that reduces circuit activity and average power consumption with the use of a modified D-Flip-flop that supplies constant 0 values to the CUT during the scan operation. One approach uses an additional slave latch in a normal master slave Flip-flop and uses the additional slave latch to maintain a constant value at the Q output which feeds to the combinational cloud during scan shift [1]. The design of such a Flip-flop with additional slave latch is shown in Figure 2.

![Figure 2: Scan D-Flip-flop with Latch as gating logic](image)

The additional overhead in this scheme is an extra latch. Another approach uses an AND gate instead of a latch to maintain a constant value at the Q output during the scan shift operation [2]. This approach compared to the approach of [1] has a reduced area penalty. The structure of such a D-Flip-flop, gated by an AND gate is shown in Figure 3.

![Figure 3: Scan D-Flip-flop with AND gate as gating logic](image)

IV. PROPOSED TECHNIQUE.

The current proposed technique also gate the Q output during the scan shift operation. This method uses a transmission logic gate followed by a pull-up or pull down transistor to gate the Q output and maintain it to a constant value of 1 or 0. Using this scheme, the combinational logic can be set to a desired low leakage input pattern, thereby minimizing the leakage power.

The two main advantages of this approach over the other previous approach are:

- Reduced area overhead (only 3 transistors)
- The ability to control the leakage of the combinational logic by setting the inputs of the combinational logic to a low leakage state during scan shift.

The structure of the D-Flip flop with pull-up and pull-down are as shown in the Figure 4 and Figure 5:
We clearly see that the above structures have a less area compared to a latch or an AND gate approach, along with the additional benefit of allowing us to park the combinational circuit at its lowest leakage input.

V. EXPERIMENTAL RESULTS

To demonstrate the modified D-Flip-flop’s functionality, the circuits of Figure-4 and Figure-5 was created in the 65nm technology and simulated in Spectre to verify its functionality.

In order to demonstrate the reduction in leakage power with input values compared to the input of all 0’s (which is the input state of the combinational logic if the technique of [1] or [2] are used), we developed standard cells for the 65nm technology in Cadence. Their leakage power was calculated using Spectre. The results are shown in Table 1, 2 and 3.

<table>
<thead>
<tr>
<th>Benchmark Circuit</th>
<th>All “0” pattern</th>
<th>Random pattern</th>
<th>% Saving</th>
</tr>
</thead>
<tbody>
<tr>
<td>ALU2</td>
<td>5719</td>
<td>5333.6</td>
<td>6.73894</td>
</tr>
<tr>
<td>apex6</td>
<td>12578</td>
<td>12578</td>
<td>0</td>
</tr>
<tr>
<td>apex7</td>
<td>3929</td>
<td>3818</td>
<td>2.825146</td>
</tr>
<tr>
<td>C17</td>
<td>108</td>
<td>108</td>
<td>0</td>
</tr>
<tr>
<td>C432</td>
<td>4008.4</td>
<td>3370</td>
<td>15.92655</td>
</tr>
<tr>
<td>C449</td>
<td>10966</td>
<td>10946</td>
<td>0.182382</td>
</tr>
<tr>
<td>C880</td>
<td>7733</td>
<td>6445</td>
<td>16.65589</td>
</tr>
<tr>
<td>C1355</td>
<td>10966</td>
<td>10946</td>
<td>0.182382</td>
</tr>
<tr>
<td>C1908</td>
<td>10608</td>
<td>9554</td>
<td>9.935897</td>
</tr>
<tr>
<td>C3540</td>
<td>26785</td>
<td>23057</td>
<td>13.91824</td>
</tr>
<tr>
<td>C5315</td>
<td>39816</td>
<td>33397</td>
<td>16.12166</td>
</tr>
<tr>
<td>C6288</td>
<td>64561</td>
<td>51808</td>
<td>19.75341</td>
</tr>
<tr>
<td>C7552</td>
<td>46740</td>
<td>44684</td>
<td>4.398802</td>
</tr>
<tr>
<td>Dala</td>
<td>15165</td>
<td>13954</td>
<td>7.985493</td>
</tr>
<tr>
<td>Decod</td>
<td>243.5</td>
<td>243.5</td>
<td>0</td>
</tr>
<tr>
<td>I1</td>
<td>787.1</td>
<td>549.6</td>
<td>30.17406</td>
</tr>
<tr>
<td>I2</td>
<td>5094.9</td>
<td>1573.6</td>
<td>69.11421</td>
</tr>
<tr>
<td>I3</td>
<td>1053</td>
<td>618.6</td>
<td>41.25356</td>
</tr>
</tbody>
</table>

Table 4: Leakage power in benchmark designs.

VI. CONCLUSION

We present an approach which apart from eliminating the switching power in the combinational logic during scan shift also minimizes the leakage power. Also this approach has a low area overhead compared to previous schemes.

VII. REFERENCES

[4] V. Dabholkar, S. Chakravarthy, I. Pomeranz “Techniques for minimizing power dissipation in scan and combinational circuits during test application”