### Microscopic Bit-Level Wear-Leveling for NAND Flash Memory

Yong Song<sup>1</sup>, Woomin Hwang<sup>1</sup>, Ki-Woong Park<sup>2</sup>, and Kyu Ho Park<sup>1</sup>

<sup>1</sup> CORE Lab., Dept. of Electrical Engineering, KAIST, Korea
<sup>2</sup> Dept. of Computer Hacking and Information Security, Daejeon University, Korea {ysong, whwang, bpat'k@core.kaist.ac.kr, woongbak@dju.kr

Abstract. By microscopically observing widely used data files, we identified the cosiderable room of file time improvement in NAND finahmemory, which is due to the discovery of a non-uniformity in bit-level data patterns. In an attempt to exploit the discovery, we propose a nove bit-level was-leveling scheme. Instead of considering only the view of page-level or block-level, we incorporate the non-uniformity in data encoding patterns into wear-leveling scheme. Because of its orthogonality to the existing block-level was-incorporate the non-uniformity in data enadopted over the existing solutions without considerable overhead and extend NAND finably life span up to 36% in case of SLC.

#### 1 Introduction

During the last decade, NAND flash memory has become a de facto solid state storage technology. In the NAND flash memory-based storage systems, it is essential to reliably store data for years, however, NAND flash has endurance problem that each NAND flash cell is worn-out by program/rease cycling and eventually loses its capability to store data. Thus, the endurance problem of NAND flash has been considered in many previous researches and they proposed wear-leveling algorithms, which even out the wearing of different blocks of NAND flash memory.

To enlarge NAND fiash's life span, many researches proposed their algorithms for wear-leveling which tries to make wear-out of different blocks of finds memory even [34] [34] [34]. To expand lifetime of NAND fiash blocks, commonly, the previous algorithms take page-granularity approaches and use block crase count as a wear-out indicator. This is based on an implcit assumption all cells in same page or same block have the same probability to be worro-out by P/E cycle. To identify whether the assumption is right or not, we obtained data patterns of widely used field acta frequently written to disk or accounting for large part files. After analyzing the data, we found out a case that the bir-level data patterns in out uniform. Base on our observation, we build a simple statistical wear-out model of NAND fiash cell and we propose bit-level wear-leveling schemes. The evaluation aboves that it expands life span of NAND fiash up to 30%.



Fig. 1. (a) Bitwise counts of programmed state of Linux source codes, (b)Bitwise count of programmed state of DB workloads

#### 2 Motivation

In case of SLC NAND flash, a cell stores a bit and a page is composed of a series of NAND cells and a block contains multiple pages. To store data on them, a block erase operation should be performed prior to write data, which makes all cells in the block changed to the erased state, representing logic '1'. If logical value of '0' is written to a cell when page writing operation of NAND flash, a state transition of the cell occurs from the erase state to the programmed state, inforcing 0''. On the other hands, if logical value of '1' is written to a cell, there is no state transition of the cell and it remains in the erased state. Because a state transition from erase state to programmed state makes a cell worm out each cell can experience different wear-out progress according to the number of its actual programs.

A problem arises from that the previous block-based wear-leveling algorithms treat all cells comprising a block together as if they experience the same number of state transitions between the erase state and the programmed state, thus having the same degree of vear-out. If we assume that the physical endurance of each cell is same, among all cells comprising a block, the cell experiencing the highest number of programs is to be completely worn out most rapidly. If the number of completely worn-out cells is bigger than the number of correctable bit the life time of the page is ended. Even though other cells still have their life time left and they can operate normally, the page is not used any more. This leads to the underestimation of the life time of the target page. It therefore makes the block.

Outstanding frequency of programs in a specific bit position appears signifcatly in text-based workloads. We selected Linux source codes as a representative text-based workloads we selected Linux source codes as a terpresentative text-based workloads and obtained their bit-level data patterns by traversing Linux source code tree. Figure [La] shows the accumulated counts of the case of logical U' at each bit position per tyte, which requires the state transition of the cell from erase state to the programmed state. We can see that the most sigmifant bit (MSB) position in a brite(character) experimense the highest number of the state transiton than other bit positions. This is because ASCII code for human readable data does not use MSB.

In order to observe data patterns of files to be frequently written to disk, we performed DBT2[1] benchmarks for SQLite[8], an open-source database. The test creates database workloads and simulate heavy user loads for OLTP. We log all of the contents synchronously written to disk through the test. We can see that the bit-level data pattern of the result obtained from the benchmark test is similar to the previous one, as shown in Figure 1(b). Consequently, the difference in degree of wear-out of each cell would be apart more as the ratio of ASCII characters increases and it provides us rooms for prolonging lifetime of the target NAND block.

## 3 Proposed Scheme

In this section, we present a bit-level wear-leveling scheme based on a simplified statistical model to span the life time of NAND flash memory, which is derived from the observation of the data non-uniformity as described in Section 2. First, we define the fundamental wear-out model of SLC cell, in accordance with the writing pattern of the data. Then, we present the proposed bit-level wear-leveling schemes to make the degree of wear-out of cells uniform.

**Our Assumption**: Our model is basically derived from the concept of the charge-to-breakdown,  $Q_{BD}[2]$ . The  $Q_{BD}$  is defined to amounts of charges crossing the tunnel oxide in a NAND flash cell until the oxide is broken down. It is from the following physical characteristics: the oxide of each NAND flash cell is little by little worn-out by repeated program/erase cycling, and the cell eventually loses its capability to store data. Base on the above, the following assumption is obtained. The degree of wear-out of a cell after a P/E cycling is proportional to the amount of charge passing the oxide in the cell and the number of injected and ejected charges during a cycling is always same, and threshold voltage for sensing stored data is also proportional to the amount of stored charges. This assumption is a base of the wear-out modeling for a SLC NAND cell.

SLC can store only one bit value per cell, which basically is a threashold voltage level. Typically, in NAND flash, a logical bit value '1' of an SLC cell denotes the erased state, and a logical bit value '0' denotes the programmed state.  $W_{slc}$  is defined to the degree of worn-out of the oxide by charge injection/ejection during a P/E cycling. In case of SLC, a cell having a bit value of '0(Programmed)' is worn out by  $W_{slc}$  in a P/E cycling. And a cell having a bit value of '1(Erased)' is not worn-out because there is no charge tunneling the oxide during the cycling. We simply assume that the degree of wear-out of a cell not in programmed state through the oxide. Based on the above modeling principles for SLC, the degree of wear-out of each cell is simply represented to the counts of the case that it has a bit value of 0.

In order to even out the usage of each cell, the technique for distributing the erasures and re-writes evenly across the cells is necessary. To achieve it, we



Fig. 2. Our proposed schemes for expanding lifetime of NAND flash cells



Fig. 3. (a) Page-wide view of wear-out for Linux source code, (b) Page-wide view of wear-out for DB workload

propose a simple bit-level shifting scheme for SLC as shown in Figure 2. To make the frequency of programmed state of each cell statistically even, every page-level program and erase cycling, a byte of data is written from the pivot bit in the byte and the rest of bits are written from the LSB bit position. The location of pivot bit is shifted by one bit. In practical implementation of this scheme, we can set the location of pivot bit as the value of block erase counts modulo 8.

# 4 Evaluation

We evaluated our bit-level wear leveling scheme based on the proposed simple wear-out model for SLC NAND flash. The evaluation is based on the previously obtained data patterns for different file data workloads, Linux source code files,



**Fig. 4.** (a) Results of our wear-leveling scheme for Linux source code, (b) Results of our wear-leveling scheme for DB workload

which are the most good reference for docuement file data composed of mostly ASCII code-based data, and DB update data, which is frequently update to the storage and have big influence on NAND flash's wear-out.

To simulate and visualize the page write operations onto NAND flash memory, we examine data pattern of the two workloads in page-level. We assume that the page size of NAND flash is 2KB. Figure 3(a) and Figure 3(b) show pagewide view of data pattern of each workload for SLC. As previously described, the probability for each bit to be programmed is remarkably different and it is neccessary to be wear-leveled. The reason why the counts of lower indexed bytes in a page are slightly higher than the others in case of Linux source codes, is beacause the size of many files in Linux source tree is lower than the page size and we obtained the data by only accumulating the count of bit value having logic 0. In case of smaller sized file than the size of one page, the dummy data contents in higher offset than the size of files, had better being filled with logic '1' for NAND flash's endurance.

The results in Figure 4(a) and Figure 4(b) are obtained by applying our wearleveling scheme, bit-level shifting, on the workloads in case of SLC. By applying the proposed scheme, the estimiated wear-out counts of each cell become nearly uniform. With comparing the maximum count, in case Linux source code, the degree of wear-out becomes 36% lower and 15% lower in case of DB workload. Additionally, in case of DB workloads, lower offset of bytes in a page tends to be worn out more than the others in spite of bit-level wear leveling.

### 5 Conclusion

We introduce our observation that bit-level data patterns of widely-used file data are not uniform and, based on the breakdown mechanism of oxide, we also propose simple wear-out models for SLC NAND flash. To expand endurance of NAND flash, we emphasize the consideration of bit-level wear-leveling scheme and proposed simple approaches. We expect that our solutions can be adopted in the controller-level together with other wear-leveling scheme or other encoding schemes.

# References

- 1. DBT2 benchmark, http://sourceforge.net/apps/mediawiki/osdldbt/index.php.
- BIN, Z. W., MING, X., ZHENG, X., AND KANG, Z. A. Charge-to-breakdown ( *Q<sub>BD</sub>*): a method to monitor the ultrathin tunnel oxide in E<sup>2</sup>PROM. In Solid- State and Integrated Circuit Technology, 1998. Proceedings. 1998 5th International Conference on (1998), pp. 287 –290.
- CHANG, L.-P., AND HUANG, L.-C. A low-cost wear-leveling algorithm for blockmapping solid-state disks. SIGPLAN Not. 46, 5 (Apr. 2011), 31–40.
- 4. CHANG, L.-P., AND KUO, T.-W. Efficient management for large-scale flash-memory storage systems with resource conservation. TOS 1, 4 (2005), 381–418.
- 5. CHANG, Y.-H., HSIEH, J.-W., AND KUO, T.-W. Improving flash wear-leveling by proactively moving static data. *IEEE Trans. Computers* 59, 1 (2010), 53–65.
- HUSSAIN, S., AND MANSOOR, A. Flash modelling for wearleveling algorithms. In High Capacity Optical Networks and Enabling Technologies (HONET), 2011 (dec. 2011), pp. 267 –272.
- MURUGAN, M., AND DU, D. H.-C. Rejuvenator: A static wear leveling algorithm for NAND flash memory with minimized overhead. In *MSST* (2011), A. Brinkmann and D. Pease, Eds., IEEE Computer Society, pp. 1–12.
- 8. SQLITE. Sqlite official home page. http://sqlite.org.