## Spin Electronics \_

# One Trillion True Random Bits Generated with a Field Programmable Gate Array Actuated Magnetic Tunnel Junction

Andre Dubovskiy<sup>1</sup>, Troy Criss<sup>1</sup>, Ahmed Sidi El Valli<sup>1</sup>, Laura Rehm<sup>1</sup>, Andrew D. Kent<sup>1\*</sup>, and Andrew Haas<sup>1</sup>

<sup>1</sup>Department of Physics, New York University, New York, NY 10003 USA \*Fellow, IEEE

Received xxxx 00, 0000, revised xxxx 00, 0000, published xxxx 00, 0000, current version xxxx 00, 0000 (Dates will be inserted by IEEE; "published" is the date the accepted preprint is posted on IEEE Xplore<sup>®</sup>; "current version" is the date the typeset version is posted on Xplore<sup>®</sup>).

Abstract—Large quantities of random numbers are crucial in a wide range of applications. We have recently demonstrated that perpendicular nanopillar magnetic tunnel junctions (pMTJs) can produce true random bits when actuated with short pulses. However, our implementation used high-end and expensive electronics, such as a high bandwidth arbitrary waveform generator and analog-to-digital converter, and was limited to relatively low data rates. Here, we significantly increase the speed of true random number generation (TRNG) of our stochastic actuated pMTJs (SMART-pMTJs) using Field Programmable Gate Arrays (FPGAs), demonstrating the generation of over 10<sup>12</sup> bits at rates exceeding 10Mb/s. The resulting bitstreams pass the NIST Statistical Test Suite for randomness with only one XOR operation. In addition to a hundred-fold reduction in the setup cost and a thousand-fold increase in bitrate, the advancement includes simplifying and optimizing random bit generation with a custom-designed analog daughter board to interface an FPGA and SMART-pMTJ. The resulting setup further enables FPGA at-speed processing of MTJ data for stochastic modeling and cryptography.

Index Terms— Spin electronics, magnetic tunnel junction, FPGA, probabilistic computing, p-bits.

#### I. INTRODUCTION

Probabilistic computing architectures have been proposed to efficiently solve real-world problems with uncertain or complex input data [Feynman 1982]. Applications include cryptography [McInnes 1991], neuromorphic systems [Schuman 2017], and Monte Carlo simulations [Harrison 2010]. The acquisition of probabilistic bits, or p-bits, needed for these uses can be a technical challenge of its own. In fact, certain particle physics collision simulations spend more than half of the computational time simply generating random numbers [Misra 2023]. These random numbers are also often pseudo-random, produced in software from a seed with chaotic but deterministic algorithms.

True random number generators (TRNGs) for use in probabilistic applications promise a significant improvement in operational speed and efficiency. Existing CMOS solutions typically require large numbers of transistors [Yang 2014; Nguyen 2020] or have practical limitations in terms of size, speed, and operating conditions [Zhun 2001; Herrero-Collantes 2017]. Thus the development of TRNG devices that are compact, fast, energy-efficient, and reliable is an important goal.

Magnetic noise offers a promising source of true random numbers. This is because nanometer scale ferromagnetic elements can function as two-state systems, with their magnetization states "up" or "down," separated by an energy barrier denoted as  $E_b$ . When the energy barrier is on a similar scale as the thermal energy kT



Fig. 1. A custom-designed circuit interfaces an FPGA to a pMTJ. A DAC is used to apply a reset (R)-verify (V)-write (W)-measure (M) pulse sequence, and a transimpedance amplifier converts the current flow through the MTJ into a voltage input. The current flow during the measurement pulse is used to create a sequence of 0s and 1s that are queued and sent to a computer. A schematic of the pMTJ is shown on the right. It consists of a reference layer (RL), a tunnel barrier (MgO), and a free layer (FL). The orientation of the free layer's magnetic moment can be parallel (P) or anti-parallel (AP) to that of the reference layer, resulting in low and high resistance states, respectively.

— where k is the Boltzmann constant and T is the operating temperature of the device — the magnetization fluctuates between these two states, a phenomenon referred to as superparamagnetism.

A device known as a magnetic tunnel junction (MTJ) can transform the magnetization fluctuations into easily detectable two-level

1949-307X © 2024 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.

Corresponding authors: Andre Dubovskiy (dubovski@nyu.edu), Andrew D. Kent (andy.kent@nyu.edu) and Andrew Haas (andy.haas@nyu.edu) Digital Object Identifier 10.1109/LMAG.2020.Doi Number



Fig. 2. The FPGA controls voltages from the DACs via analog switches based on the pulse timings. These voltages are added and output to the junction.

electrical signals. Additionally, MTJs can be seamlessly incorporated with complementary metal-oxide semiconductor (CMOS) technology, as highlighted in references [Prenat 2007; Matsunaga 2008; Zhao 2008; Kent2015; Deng 2016; Kumar 2019; Barla 2020; Barla 2020]. Notably, there has been an extensive exploration into the use of MTJs with a superparamagnetic layer for the purpose of probabilistic computing, as detailed in references [Vodenicarevic 2017; Borders 2019; Kaiser 2019; Parks 2018; Hayakawa 2021; Safranski 2021].

In contrast to superparamagnetic MTJs, Rehm *et al.* showed that stable perpendicularly magnetized MTJs (pMTJs) ( $E_b \approx 39kT$ ) actuated by pulses to be an excellent alternative approach to TRNG [Rehm 2023]. Using a stochastic write pulse with a switching probability P of close to 50% to generate each bit results in a stream of true random bits, much like a series of coin flips results in a stream of heads or tails. The generated bit stream can be much less sensitive to device operating temperature, device-to-device variations, and other operating conditions [Rehm 2023; Morshed 2023]; the device was denoted a stochastic magnetic actuated random transducer, or SMART device.

The key advancement we will describe here is the simplification, optimization, and dramatic acceleration of the random bit generation from a SMART device with the help of a field programmable gate array (FPGA) and a custom-designed analog daughter board to interface the FPGA with the SMART-pMTJ.

## **II. EXPERIMENTAL SETUP**

To generate random bits with MTJs, we used a custom FPGA daughter board and associated FPGA (Intel Cyclone 10 GX FPGA, qDK-DEV-10CX220-A) as illustrated schematically in Fig. 1. First, we create and send voltage pulses across the junction. The pulses are created from a custom voltage source by toggling analog switches (part # SN74AUC2G66DCTR) controlled by the FPGA with a cyclic state machine in Verilog. Since the FPGA itself can only output boolean signals, we made an analog daughter board that creates constant voltage sources with an 8-channel, 12-bit digital-to-analog converter (part # DAC7578SPWR) to adjust the amplitudes of the pulses. Four voltages are inverted with an additional amplifier stage to create negative outputs. All eight outputs' adjusted voltages are

then added with an operational amplifier and combined into one output that gets sent to the MTJ, as shown in Fig. 2.

While outputting the voltage pulses, we measure the junction current. A transimpedance amplifier (part # OPA699ID) on the daughter board converts the current flowing through the junction to a voltage that the FPGA can measure. This circuit virtually grounds one terminal of the MTJ and transforms a  $\simeq 20\mu A$  current to the order of 2V. This allows the FPGA to determine if the junction is in a high or low resistance state, recorded as bit 0 or 1. The high and low resistance states correspond to approximately  $2k\Omega$  and  $1k\Omega$ . The probabilistic bit is the result of a write pulse that switches the junction state approximately 50% of the time, i.e., a stochastic write. A measure pulse is used to determine if the junction switched to a parallel (P) state, corresponds to a bit 1, or the junction remained in an anti-parallel (AP) state, a bit 0. The amplified signal is sampled by the FPGA at a rate of 468.75MHz. The firmware of the FPGA selects one sample per pulse cycle of the experiment during the measure pulse to collect one random bit per pulse cycle. Finally, the random bits from the FPGA are sent to a computer over a 10Gb/s Ethernet User Datagram Protocol (UDP) connection to record bit stream and for further data analysis.

## **III. MEASUREMENT PROCEDURE**

Generating a single random bit or one proverbial coin flip consists of the following process:

- 1) Resetting the MTJ to a known state. The AP state here.
- 2) Verifying that the MTJ has been reset to the AP state.
- 3) A stochastic write, a short pulse of a specific voltage and duration.
- 4) Measuring the resulting junction state, AP (bit 0) or P (bit 1).

The corresponding output voltage sequence is shown in Fig. 3. The reset step (step 1) pulse amplitude was set such that we recorded zero reset errors during the verify phase (step 2), which consists of a lower amplitude output voltage used to determine the junction state. The pulse amplitude for the stochastic write (step 3) is set specifically so that the probability of changing states is as close to 50% as possible. The verify and measure pulses (which are both read pulses) were set high enough to supply a sufficient current to discern the state of the junction but low enough that they do not disturb the junction's state.

To generate one trillion random bits, we had to flip two trillion proverbial coins. The factor of two is necessary to debias the resulting bitstream with an XOR operation, as explained in the next section. The pulse sequence described above — with a frequency of 10.9MHz — was applied for 50 hours, resulting in a  $2 \times 10^{12}$  random bits.

## **IV. RESULTS AND ANALYSIS**

The switching probability for bins of  $10^7$  bits was computed and is plotted in Fig. 4. Over the course of the data acquisition, there is a drift from the intended switching probability of 50%. This drift



Fig. 3. A single coin flip. This scope trace shows the actuating pulses provided by the custom board at a rate of 10.8MHz.

is greater than expected based on changes in the temperature in the lab during the experiment, as we have measured and characterized the dP/dT in Ref. [Rehm 2023], and we are still investigating its origin. Nonetheless, these variations can be remedied with one XOR operation.



Fig. 4. Probability of switching vs bin index for a total of 2 trillion bits which corresponds to a 50-hour experiment.

The XOR operation was performed post-experiment by splitting the bitstream into two halves (the chronological first and second terabits) and XORing them. The results are shown in Fig. 5 for the first  $2 \times 10^{11}$  bits in the stream. This operation made the probability of switching significantly closer to 50% while, of course, reducing the number of total bits by a factor of two. We characterize the probability bias by  $\epsilon$ , its deviation from 0.5, i.e.,  $P = 0.5 + \epsilon$ . For the first data set  $\epsilon = 0.027$  and the second data set  $\epsilon = -0.0015$ . The XORed data has a significantly reduced bias of  $\epsilon = 0.0004$ .

We tested the resulting bitstreams using the NIST statistical test suite [Bassham 2010]. This suite encompasses a range of tests, both frequency and non-frequency-related. Frequency-related assessments gauge the occurrence of 'ones' within the dataset, while nonfrequency tests are designed to identify specific patterns. The results



Fig. 5. Switching probability after one XOR operation of the first and second data set of the bitstream.

Table 1. NIST Statistical Test Suite for randomness results on raw data and XOR'd data. The criteria for passing each test across multiple trials is a 95% or higher success rate [Bassham 2010].

| Test name / XOR stages                | 0       | 1       |
|---------------------------------------|---------|---------|
| Frequency (Monobit)                   | 5/100   | 98/100  |
| Frequency within a Block              | 69/100  | 100/100 |
| Run                                   | 0/100   | 100/100 |
| Longest Run of Ones in a Block        | 22/100  | 100/100 |
| Binary Matrix Rank                    | 99/100  | 100/100 |
| Discrete Fourier Transform (Spectral) | 97/100  | 100/100 |
| Non-Overlapping Template Matching     | 104/148 | 142/148 |
| Overlapping Template Matching         | 67/100  | 98/100  |
| Maurer's Universal Statistical        | 0/100   | 100/100 |
| Linear Complexity                     | 99/100  | 99/100  |
| Serial                                | 198/200 | 200/200 |
| Approximate Entropy                   | 52/100  | 98/100  |
| Cumulative Sums                       | 10/200  | 196/200 |

are shown in Table I. The raw data passes only 4 out of 13 tests. In contrast, the XORed data passes all NIST tests.

## V. CONCLUSION

This demonstration that an FPGA with a custom board can efficiently control and sample the state of MTJ devices advances the field and, as we have emphasized, the application of stochastic MTJs. Our FPGA-custom board solution can be used directly in applications that require random numbers, reducing computational loads. For example, in generating and conditioning random bits, the XOR operation could be implemented directly on the firmware of the FPGA to debias the bits in real-time rather than after the fact in the software. Further, several NIST tests could be implemented on the FPGA to verify the bits as they are being produced. There are also possibilities for fast in-situ tuning of device operating parameters to reduce the effect of environmental impacts and device-to-device variations. Finally, the speed of our setup, scale, and low hardware cost will enable a broader range of experimental studies of MTJs, including faster device characterization and, thus, optimization.

#### **VI. ACKNOWLEDGMENTS**

We acknowledge support from the DOE Office of Science (ASCR/BES), Microelectronics Co-Design project COINFLIPS, and the Office of Naval Research (ONR) under Award No. N00014-23-1-2771.

## REFERENCES

- R. P. Feynman, "Simulating physics with computers," *International Journal of Theoretical Physics*, vol. 21, no. 6-7, pp. 467–488, 1982. Available: https://doi.org/10.1007/BF02650179
- J. L. McInnes and B. Pinkas, "On the impossibility of private key cryptography with weakly random keys," in *Advances in Cryptology-CRYPTO*' 90, A. J. Menezes and S. A. Vanstone, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 1991, pp. 421–435. Available: https://link.springer.com/content/pdf/10.1007/3-540-38424-3\_ 31.pdf
- C. D. Schuman, T. E. Potok, R. M. Patton, J. D. Birdwell, M. E. Dean, G. S. Rose, and J. S. Plank, "A survey of neuromorphic computing and neural networks in hardware," *CoRR*, vol. abs/1705.06963, 2017. Available: http://arxiv.org/abs/1705. 06963
- R. L. Harrison, "Introduction to Monte Carlo Simulation," AIP Conference Proceedings, vol. 1204, no. 1, pp. 17–21, 2010. Available: https://aip.scitation.org/doi/abs/10. 1063/1.3295638
- S. Misra, L. C. Bland, S. G. Cardwell, J. A. C. Incorvia, C. D. James, A. D. Kent, C. D. Schuman, J. D. Smith, and J. B. Aimone, "Probabilistic Neural Computing with Stochastic Devices," *Advanced Materials*, vol. 35, no. 37, p. 2204569, 2023. Available: https://onlinelibrary.wiley.com/doi/abs/10.1002/adma.202204569
- K. Yang, D. Fick, M. B. Henry, Y. Lee, D. Blaauw, and D. Sylvester, "16.3 A 23Mb/s 23pJ/b fully synthesized true-random-number generator in 28nm and 65nm CMOS," in 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2014, pp. 280–281. Available: https://ieeexplore.ieee. org/document/6757434
- N. Nguyen, G. Kaddoum, F. Pareschi, R. Rovatti, and G. Setti, "A fully CMOS true random number generator based on hidden attractor hyperchaotic system," *Nonlinear Dynamics*, vol. 102, no. 4, pp. 2887–2904, 2020. Available: https: //link.springer.com/article/10.1007/s11071-020-06017-3
- H. Zhun and C. Hongyi, "A truly random number generator based on thermal noise," ASICON 2001. 2001 4th International Conference on ASIC Proceedings (Cat. No.01TH8549), pp. 862–864, 2001. Available: https://api.semanticscholar.org/ CorpusID:60475908
- M. Herrero-Collantes and J. C. Garcia-Escartin, "Quantum random number generators," *Rev. Mod. Phys.*, vol. 89, p. 015004, Feb 2017. Available: https://link.aps.org/doi/ 10.1103/RevModPhys.89.015004
- G. Prenat, M. El Baraji, W. Guo, R. Sousa, L. Buda-Prejbeanu, B. Dieny, V. Javerliac, J.-P. Nozieres, W. Zhao, and E. Belhaire, "CMOS/magnetic hybrid architectures," in 2007 14th IEEE International Conference on Electronics, Circuits and Systems. IEEE, 2007, pp. 190–193. Available: https://ieeexplore.ieee.org/document/4510962 https://ieeexplore.ieee.org/document/4510962
- S. Matsunaga, J. Hayakawa, S. Ikeda, K. Miura, H. Hasegawa, T. Endoh, H. Ohno, and T. Hanyu, "Fabrication of a nonvolatile full adder based on logic-in-memory architecture using magnetic tunnel junctions," *Applied Physics Express*, vol. 1, no. 9, p. 091301, 2008. Available https://iopscience.iop.org/article/10.1143/APEX. 1.091301/pdf
- W. Zhao, E. Belhaire, C. Chappert, F. Jacquet, and P. Mazoyer, "New non-volatile logic based on spin-MTJ," *Physica Status Solidi* (a), vol. 205, no. 6, pp. 1373–1377, 2008. Available: https://doi.org/10.1002/pssa.200778135
- A. D. Kent and D. C. Worledge, "A new spin on magnetic memories," *Nature Nanotechnology*, vol. 10, no. 3, pp. 187–191, 2015. Available: https://doi.org/10. 1038/nnano.2015.24
- E. Deng, G. Prenat, L. Anghel, and W. Zhao, "Non-volatile magnetic decoder based on MTJs," *Electronics Letters*, vol. 52, no. 21, pp. 1774–1776, 2016. Available: https://doi.org/10.1049/el.2016.245
- S. D. Kumar and H. Thapliyal, "Exploration of non-volatile MTJ/CMOS circuits for DPA-resistant embedded hardware," *IEEE Transactions on Magnetics*, vol. 55, no. 12, pp. 1–8, 2019. Available: https://ieeexplore.ieee.org/document/8903621

- P. Barla, D. Shet, V. K. Joshi, and S. Bhat, "Design and Analysis of LIM Hybrid MTJ/CMOS Logic Gates," in 2020 5th International Conference on Devices, Circuits and Systems (ICDCS), 2020, pp. 41–45. Available: https://ieeexplore.ieee. org/document/9075774
- P. Barla, V. K. Joshi, and S. Bhat, "A novel low power and reduced transistor count magnetic arithmetic logic unit using hybrid STT-MTJ/CMOS circuit," *IEEE Access*, vol. 8, pp. 6876–6889, 2020. Available: https://ieeexplore.ieee.org/document/ 8949515
- D. Vodenicarevic, N. Locatelli, A. Mizrahi, J. S. Friedman, A. F. Vincent, M. Romera, A. Fukushima, K. Yakushiji, H. Kubota, S. Yuasa, S. Tiwari, J. Grollier, and D. Querlicz, "Low-energy truly random number generation with superparamagnetic tunnel junctions for unconventional computing," *Phys. Rev. Applied*, vol. 8, p. 054045, Nov 2017. Available: https://link.aps.org/doi/10.1103/PhysRevApplied.8. 054045
- W. A. Borders, A. Z. Pervaiz, S. Fukami, K. Y. Camsari, H. Ohno, and S. Datta, "Integer factorization using stochastic magnetic tunnel junctions," *Nature*, vol. 573, no. 7774, pp. 390–393, 2019. Available: https://doi.org/10.1038/s41586-019-1557-9
- J. Kaiser, A. Rustagi, K. Y. Camsari, J. Z. Sun, S. Datta, and P. Upadhyaya, "Subnanosecond fluctuations in low-barrier nanomagnets," *Phys. Rev. Applied*, vol. 12, p. 054056, Nov 2019. Available: https://link.aps.org/doi/10.1103/PhysRevApplied. 12.054056
- B. Parks, M. Bapna, J. Igbokwe, H. Almasi, W. Wang, and S. A. Majetich, "Superparamagnetic perpendicular magnetic tunnel junctions for true random number generators," *AIP Advances*, vol. 8, no. 5, p. 055903, 2018. Available: https: //doi.org/10.1063/1.5006422
- K. Hayakawa, S. Kanai, T. Funatsu, J. Igarashi, B. Jinnai, W. A. Borders, H. Ohno, and S. Fukami, "Nanosecond random telegraph noise in in-plane magnetic tunnel junctions," *Phys. Rev. Lett.*, vol. 126, p. 117202, Mar 2021. Available: https://link. aps.org/doi/10.1103/PhysRevLett.126.117202
- C. Safranski, J. Kaiser, P. Trouilloud, P. Hashemi, G. Hu, and J. Z. Sun, "Demonstration of nanosecond operation in stochastic magnetic tunnel junctions," *Nano Letters*, vol. 21, no. 5, pp. 2040–2045, 03 2021. Available: https://doi.org/10.1021/acs. nanolett.0c04652
- L. Rehm, C. C. M. Capriata, S. Misra, J. D. Smith, M. Pinarbasi, B. G. Malm, and A. D. Kent, "Stochastic magnetic actuated random transducer devices based on perpendicular magnetic tunnel junctions," *Phys. Rev. Appl.*, vol. 19, p. 024035, 2023. Available: https://link.aps.org/doi/10.1103/PhysRevApplied.19.024035
- L. Rehm, M. G. Morshed, S. Misra, A. Shukla, S. Rakheja, M. Pinarbasi, A. W. Ghosh, and A. D. Kent, "Temperature-resilient random number generation with stochastic actuated magnetic tunnel junction devices," *Applied Physics Letters*, vol. 124, no. 5, 2024. Available: https://doi.org/10.1063/5.0186810
- M. G. Morshed, L. Rehm, A. Shukla, Y. Xie, S. Ganguly, S. Rakheja, A. D. Kent, and A. W. Ghosh, "Reduced sensitivity to process, voltage and temperature variations in activated perpendicular magnetic tunnel junctions based stochastic devices," *arXiv* preprint arXiv:2310.18781, 2023. Available: https://arxiv.org/abs/2310.18781
- L. Bassham, A. Rukhin, J. Soto, J. Nechvatal, M. Smid, S. Leigh, M. Levenson, M. Vangel, N. Heckert, and D. Banks, "Statistical Test Suite for Random and Pseudorandom Number Generators for Cryptographic Applications," 2010-09-16 2010. Available: https://tsapps.nist.gov/publication/get\_pdf.cfm?pub\_id=906762