Академический Документы
Профессиональный Документы
Культура Документы
ABSTRACT
Modern embedded processors include both scratchpad automotive, industry, and medical applications. These
memory (SPM) and cache memory in their applications are safety-critical
critical requiring high reliable
architectures. SPM’s are prone to soft errors like processors. Scratchpad memory (SPM) is widely used
Single event upsets (SEUs) and Single event multiple in modern embedded processors to overcome the
upsets (SEMUs). For correcting
cting soft errors, we use limitations of cache memory.
Error Correcting Codes (ECC) like single single-error
Cache memory, also called CPU memory, is random
correction double-error
error detection (SEC
(SEC-DED) and
access memory (RAM
RAM) that a
SEC-DED double-adjacent
adjacent error correction (SEC
(SEC-
computer microprocessor can access more quickly
DED-DAEC)
DAEC) and parity duplication approach. These
than it can access regular RAM. This memory is
approaches are used only error correctio
correction for less
typically integrated
egrated directly with the CPU chip or
number of corrections and thus increases speed.
placed on a separate chip that has a
separate bus interconnect with the CPU. The basic
This paper proposes a duplication scheme, cachecache-
purpose of cache memory is to
assisted duplicated SPM(CADS) to correct SEUs and
store program instructions that are frequently re- re
SEMUs in data SPM lines detected by a low low-cost
referenced by software during operation.
error detecting code. This duplication aapproach does
Fast access to these instructions increases
incre the overall
error detection in SPM which is faster than ECC
speed of the software program.
providing multibit error correction.
As the microprocessor processes data, it looks first in
The extension is to include DMA with ring the cache memory; if it finds the instructions there, it
descriptors. The descriptor-based
based model provides the does not have to do a more time-consuming
time reading
most flexibility in managing a system's DMA faster of data from larger memory or other
data transfers
rs to or from Scratchpad memory. data storage devices.
Most programs use very few resources once they
Keywords: Cache memory, Scratchpad memory, error have been opened and operated for a time, mainly
correcting codes, soft errors, replicas and DMA because frequently re-referenced
referenced instructions tend to
be cached.. This explains why measurements
I. INTRODUCTION of system performance in computers with
A wide range of modern embedded processors slower processors but larger caches tend to be faster
includes both Scratchpad memory (SPM) and cache than Measurements of system performance in
memory in their architectures to fulfill the application computers with faster processors but more limited
requirements of predictability,, performance, and cache space.
energy budget. Examples of such processors are ARM
Cortex-R
R Series and SH7785, which are used in
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1391
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
2456
decoder as many times as the bit planes in the memory SPM Reliability Enhancement
modules are used.
An data duplication
plication plot utilizes dead blocks of SPM
Each output line from the row and column decoders to keep a duplicate of live blocks. Dead blocks are
chooses (enacts)
nacts) a succession of bit cells that have a distinguished at compiler level utilizing a algorithm
place with planes of back to back bits in a memory that analyzes the examples of getting to SPM lines. At
word. In such memory module, the full network of bit runtime, the redundant blocks are made if a free block
blo
cells has three-dimensional structure. exists in SPM. This plan experiences two noteworthy
constraints. To begin with, because of high usage of
SPM lines and lack of dead blocks, a huge division of
SPM blocks stay unprotected. Second, refreshing the
imitation on each write operation to SPM will w force
high execution overhead.
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1393
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
the block has beenmodified (a dirty block) after
data has been written into the cache. The write
back updating takes more time efficient, as the
block cells modification is done many times while
being in the cache and are updated in the main
memory only once.
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1394
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
and single-event multiple upsets (SEMUs) are two of non-cacheable SPM lines. In particular, CADS
types of soft errors in SPM and cache as ON-chip duplicates all dirty SPM lines in cache memory
SRAM memories Correcting soft errors in ON-chip considering the fact that clean SPM lines have
memories (SPM or cache) can be categorized into two inherently a copy in lower memory hierarchy. To this
approaches . The first approach is the use of error- aim, we propose a cache controller circuitry that is
correcting codes (ECCs), e.g., single-error correction capable of storing a copy of non-cacheable SPM lines
double-error detection (SEC-DED) and SEC-DED in cache memory. To reduce the energy consumption
double-adjacent-error correction (SEC-DED-DAEC), overhead of CADS, we further propose a buffered
to detect and correct errors. All of the ON-chip CADS (BCADS) technique in which a mini buffer is
memories can be protected using this approach. inserted between SPM and cache to minimize the
extra cache accesses for updating the replicas.
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1395
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
updating the replica in the plan in and CADS are more difficult to implement and most DMA-
52% and 43%, separately. controllers do not implement device to device
transfers.
To beat this overhead and decrease the cache
accessing because of SPM line duplication, we Advantages:
improve the CADS architecture by inserting a buffer
The Cache assisted duplicated SPM is the best method
between the SPM and the cache. This improved
to resolve soft errors in SPMs with very little area
architecture is named BCADS. This architecture
over head and performance overheads and no change
lessens the cache access by putting away most as of
in SPM access policies.
late updated SPM line. The span of the support is
viewed as the same as the extent of a solitary cache Applications:
line, e.g., 32 B in our arrangement. Utilizing this In embedded processors for use in applications in IOT
buffer, we can keep the replica of eight 32-b SPM and in DSP processors where processing happens on
words dwelling in a similar cache line. large arrays of data.
For every write operation to the SPM, the write address Future scope:
is compared with the address of data line stored in the
buffer. If the replica is already in the buffer, which can Implement a Robust ECC scheme using less number
be interpreted as a buffer hit, the buffer entry will be of redundant bits to detect more number of errors.
updated. Otherwise, on a buffer miss, after writing References:
back the buffer entry to the cache, the buffer will be
allocated to the new data written to the SPM. For the 1) (2015). The ARM Corporation. [Online].
write-back data line from the buffer to the cache, if Available:
the replica line is already allocated in the cache, the http://www.arm.com/products/processors/cortex-r
line will be rewritten. Otherwise, a new cache line is 2) (2015). The Renesas Electronics Corporation.
selected for replication before writing back the buffer [Online]. Available:
entry. http://www.renesas.com/products/mpumcu/superh
BCADS can significantly reduce the extra cache /sh7780/sh7785/index.jsp
accesses by utilizing the locality of references in SPM 3) W. Zhang, S. Gurumurthi, M. Kandemir, and A.
access. The SPM lines are mainly allocated to data Sivasubramaniam, “ICR: In-cache replication for
arrays accessed in loops, which have a highly enhancing data cache reliability,” in Proc. Int.
localized access pattern. Therefore, it is highly Conf. Dependable Syst. Netw. (DSN), Jun. 2003,
probable that consecutive requests for updating the pp. 291–300.
replicas refer to the same cache line. By taking
4) M. Manoochehri, M. Annavaram, and M. Dubois,
advantage of such a locality, the buffer will be able to
“CPPC: Correctable parity protected cache,” in
catch the majority of replica update requests.
Proc. 38th Annu. Int. Symp. Comput. Archit.
DMA: (ISCA), Jun. 2011, pp. 223–234.
A direct memory access (DMA) is an operation in 5) L. A. D. Bathen and N. D. Dutt, “Embedded
which data is copied (transported) from one resource RAIDs-on-chip for busbased chip-
to another resource in a computer system without the multiprocessors,” ACM Trans. Embedded
involvement of the CPU. The task of a DMA- Comput. Syst., vol. 13, no. 4, pp. 83:1–83:36,
controller (DMAC) is to execute the copy operation of 2014.
data from one resource location to another. The copy
6) A. Dixit and A. Wood, “The impact of new
of data can be performed from: - I/O-device to
technology on soft error rates,” in Proc. IEEE Int.
memory - memory to I/O-device - memory to memory
Rel. Phys. Symp. (IRPS), Apr. 2011, pp. 5B.4.1–
- I/O-device to I/O-device A DMAC is an
5B.4.7.
independent (from CPU) resource of a computer
system added for the concurrent execution of DMA- 7) N. N. Sadler and D. J. Sorin, “Choosing an error
operations. The first two operation modes are ’read protection scheme for a microprocessor’s L1 data
from’ and ’write to’ transfers of an I/O-device to the cache,” in Proc. Int. Conf. Comput. Design
main memory, which are the common operation of a (ICCD), Oct. 2007, pp. 499–505.
DMA-controller. The other two operations are slightly
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1396
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
8) J. Kim, N. Hardavellas, K. Mai, B. Falsafi, and J. 18) J. L. Henning, “SPEC CPU2006 benchmark
Hoe, “Multi-bit error tolerant caches using two- descriptions,” ACM SIGARCH Comput. Archit.
dimensional error coding,” in Proc. 40th Annu. News, vol. 34, no. 4, pp. 1–17, 2006.
IEEE/ACM Int. Symp. Microarchit. (MICRO),
19) M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M.
Dec. 2007, pp. 197–209.
Austin, T. Mudge, and R. B. Brown, “MiBench: A
9) H. Farbeh and S. G. Miremadi, “PSP-cache: A free, commercially representative embedded
low-cost fault-tolerant cache memory benchmark suite,” in Proc. IEEE Int. Workshop
architecture,” in Proc. Design, Autom. Test Eur. Workload Characterization, Dec. 2001, pp. 3–14.
(DATE), Mar. 2014, pp. 1–4.
20) H. Farbeh, M. Fazeli, F. Khosravi, and S. G.
10) A. Neale and M. Sachdev, “A new SEC-DED Miremadi, “Memory mapped SPM: Protecting
error correction code subclass for adjacent MBU instruction scratchpad memory in embedded
tolerance in embedded memory,” IEEE Trans. systems against soft errors,” in Proc. 9th Eur.
Device Mater. Rel., vol. 13, no. 1, pp. 223–230, Dependable Comput. Conf. (EDCC), May 2012,
Mar. 2013. pp. 218–226.
11) A. Sánchez-Macián, P. Reviriego, and J. A. 21) H. Sayadi, H. Farbeh, A. M. H. Monazzah, and S.
Maestro, “Enhanced detection of double and triple G. Miremadi “A data recomputation approach for
adjacent errors in Hamming codes through reliability improvement of scratchpad memory in
selective bit placement,” IEEE Trans. Device embedded systems,” in Proc. IEEE Int. Symp.
Mater. Rel., vol. 12, no. 2, pp. 357–362, Jun. Defect Fault Tolerance VLSI Nanotechnol. Syst.
2012. (DFT), Oct. 2014, pp. 228–233.
12) F. Li, G. Chen, M. Kandemir, and I. Kolcu, 22) P. R. Panda, N. D. Dutt, and A. Nicolau,
“Improving scratch-pad memory reliability “Efficient utilization of scratchpad memory in
through compiler-guided data block duplication,” embedded processor applications,” in Proc. Eur.
in Proc. IEEE/ACM Int. Conf. Comput.-Aided Design Test Conf. (ED&TC), Mar. 1997, pp. 7–
Design (ICCAD), Nov. 2005, pp. 1002–1005. 11.
13) W. Zhang, “Replication cache: A small full 23) LEON2 Processor User’s Manual, Version 1.0.30,
associative cache to improve data cache XST ed., Aeroflex Gaisler, Gothenburg, Sweden,
reliability,” IEEE Trans. Comput., vol. 54, no. 12, Jul. 2005.
pp. 1547–1555, Dec. 2005.
24) J. Lee et al., “FaCSim: A fast and cycle-accurate
14) A. M. H. Monazzah, H. Farbeh, S. G. Miremadi, architecture simulator for embedded systems,” in
M. Fazeli, and H. Asadi, “FTSPM: A fault- Proc. Conf. Lang., Compil., Tools Embedded Syst.
tolerant scratchpad memory,” in Proc. 43rd Annu. (LCTES), Jun. 2008, pp. 89–100.
IEEE/IFIP Int. Conf. Dependable Syst. Netw.
25) C. A. Argyrides, P. Reviriego, D. K. Pradhan, and
(DSN), Jun. 2013, pp. 1–10.
J. A. Maestro, “Matrix-based codes for adjacent
15) A. Dutta and N. A. Touba, “Multiple bit upset error correction,” IEEE Trans. Nucl. Sci., vol. 57,
tolerant memory using a selective cycle avoidance no. 4, pp. 2106–2111, Aug. 2010.
based SEC-DED-DAEC code,” in Proc. 25th IEEE
26) N. Muralimanohar, R. Balasubramonian, and N. P.
VLSI Test Symp. (VTS), May 2007, pp. 349–354.
Jouppi, “CACTI 6.0: A tool to model large
16) J. Hong, J. Kim, and S. Kim, “Exploiting same tag caches,” HP Inc., Palo Alto, CA, USA Tec. Rep.
bits to improve the reliability of the cache HPL-2009-85, 2009.
memories,” IEEE Trans. Very Large Scale Integr.
27) L. Li, V. Degalahal, N. Vijaykrishnan, M.
(VLSI) Syst., vol. 23, no. 2, pp. 254–265, Feb.
Kandemir, and M. J. Irwin, “Soft error and energy
2015.
consumption interactions: A data cache
17) S. Wang, J. Hu, and S. G. Ziavras, “Replicating perspective,” in Proc. Int. Symp. Low Power
tag entries for reliability enhancement in cache tag Electron. Design (ISLPED), Aug. 2004, pp. 132–
arrays,” IEEE Trans. Very Large Scale Integr. 137.
(VLSI) Syst., vol. 20, no. 4, pp. 643–654, Apr.
2012.
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1397
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
28) Design compiler user guide, Ver. E-2010.12, 38) L. Wu, Y. Ding, and W. Zhang, “Characterizing
Synopsys Inc., Mountain View, CA, USA, Dec. energy consumption of real-time and media
2010. benchmarks on hybrid SPM-caches,” in Proc.
IEEE 11th Int. Conf. High Perform. Comput.
29) J. Suh, M. M. Annavaram, and M. Dubois,
Commun. (HPCC), Aug. 2014, pp. 526–533.
“MACAU: A Markov model for reliability
evaluations of caches under single-bit and multi- 39) W. Zhang and L. Wu, “Exploiting hybrid SPM-
bit upsets,” in Proc. IEEE Int. Sym. High Perform. cache architectures to reduce energy consumption
Comput. Archit. (HPCA), Feb. 2012, pp. 1–12. for embedded computing,” in Proc. IEEE Int.
Conf. High Perform. Comput. Commun. (HPCC),
30) D. H. Yoon and M. Erez, “Memory mapped ECC:
Aug. 2014, pp. 340–347.
Low-cost error protection for last level caches,” in
Proc. Int. Symp. Comput. Archit. (ISCA), 2009, 40) W. Zhang and Y. Ding, “Hybrid SPM-cache
pp. 116–127. architectures to achieve high time predictability
and performance,” in Proc. IEEE 24th Int. Conf.
31) D. H. Yoon and M. Erez, “Flexible cache error
Appl.-Specific Syst., Archit. Process. (ASAP),
protection using an ECC FIFO,” in Proc.
Jun. 2013, pp. 297–304.
IEEE/ACM Int. Conf. High Perform. Comput.
Netw. Storage, Anal. (SC), Nov. 2009, pp. 1–12. Authors profile:
65.
ASAPU HARIKA Received her
32) W. Ma, X. Cui, and C.-L. Lee, “Enhanced error bachelor’s degree in 2015 in electronics
correction against multiple-bit-upset based on and communication engineering from
BCH code for SRAM,” in Proc. IEEE 10th Int. Stanley College of Engineering and
Conf. ASIC (ASICON), Oct. 2013, pp. 1–4. Technology for Women, Abids,
33) L.-J. Saiz-Adalid, P. Gil, J.-C. Baraza-Calvo, J.-C. Hyderabad, India, Which is affiliated to Osmania
Ruiz, D. Gil-Tomás, and J. Gracia-Morán, University, India. Her areas of interest include VLSI
“Modified Hamming codes to enhance short burst design. She is pursuing his M-Tech in VLSI
error detection in semiconductor memories (short SYSTEM DESIGN from CMR Institute of
paper),” in Proc. 10th Eur. Dependable Comput. Technology.
Conf. (EDCC), May 2014, pp. 62–65.
34) L. Delshadtehrani, H. Farbeh, and S. G. Miremadi,
“In-scratchpad memory replication: Protecting Mr.A Sai Kumar Goud is working
scratchpad memories in multicore embedded as Associate professor in ECE
systems against soft errors,” ACM Trans. Design department of CMRIT. He has an
Autom. Electron. Syst., vol. 20, no. 4, pp. 61:1– academic experience of 16 years In
61:28, 2015. Teaching as well as research. He
worked in various Reputed Engineering colleges as
35) A. Chakraborty, H. Homayoun, A. Khajeh, N.
Assistant professor, Associate professor HOD, of
Dutt, A. Eltawil, and F. Kurdahi, “E < MC2: Less
ECE. He has qualified for UGC-NET. He is ratified as
energy through multi-copy cache,” in Proc. Int.
Associate professor by JNTUH. His areas of research
Conf. Compil., Archit. Synth. Embedded Syst.
interest are VLSI, Communications & signal
(CASES), Oct. 2010, pp. 237–246.
processing. He is a life member of ISTE and
36) S. Kang and A. G. Dean, “Leveraging both data published several National and international journals.
cache and scratchpad memory through synergetic
data allocation,” in Proc. IEEE 18th Real-Time Mr. S Pradeep Kumar Reddy B.E,
Embedded Technol. Appl. Symp. (RTAS), Apr. M.Tech (Ph.D), Working as
2012, pp. 119–128. Associate Professor in CMR institute
of technology.
37) G. Wang, L. Ju, Z. Jia, and X. Li, “Data allocation
for embedded systems with hybrid on-chip And has 15 Years of Experience in
scratchpad and caches,” in Proc. IEEE 10th Int. teaching field. His area of interest is Image
Conf. High Perform. Comput. Commun. (HPCC), processing.
Nov. 2013, pp. 366–373.
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 4 | May-Jun 2018 Page: 1398