Hardware Implementation Study of Particle Tracking Algorithm on FPGAs

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Hardware Implementation Study of Particle Tracking Algorithm on FPGAs. / Gabrielli, Alessandro; Alfonsi, Fabrizio; Annovi, Alberto; Camplani, Alessandra; Cerri, Alessandro.

In: Electronics, Vol. 10, No. 20, 2546, 18.10.2021.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Gabrielli, A, Alfonsi, F, Annovi, A, Camplani, A & Cerri, A 2021, 'Hardware Implementation Study of Particle Tracking Algorithm on FPGAs', Electronics, vol. 10, no. 20, 2546. https://doi.org/10.3390/electronics10202546

APA

Gabrielli, A., Alfonsi, F., Annovi, A., Camplani, A., & Cerri, A. (2021). Hardware Implementation Study of Particle Tracking Algorithm on FPGAs. Electronics, 10(20), [2546]. https://doi.org/10.3390/electronics10202546

Vancouver

Gabrielli A, Alfonsi F, Annovi A, Camplani A, Cerri A. Hardware Implementation Study of Particle Tracking Algorithm on FPGAs. Electronics. 2021 Oct 18;10(20). 2546. https://doi.org/10.3390/electronics10202546

Author

Gabrielli, Alessandro ; Alfonsi, Fabrizio ; Annovi, Alberto ; Camplani, Alessandra ; Cerri, Alessandro. / Hardware Implementation Study of Particle Tracking Algorithm on FPGAs. In: Electronics. 2021 ; Vol. 10, No. 20.

Bibtex

@article{66f7f5f41ec445b9a05f5ccc9c387513,
title = "Hardware Implementation Study of Particle Tracking Algorithm on FPGAs",
abstract = "In recent years, the technological node used to implement FPGA devices has led to very high performance in terms of computational capacity and in some applications these can be much more efficient than CPUs or other programmable devices. The clock managers and the enormous versatility of communication technology through digital transceivers place FPGAs in a prime position for many applications. For example, from real-time medical image analysis to high energy physics particle trajectory recognition, where computation time can be crucial, the benefits of using frontier FPGA capabilities are even more relevant. This paper shows an example of FPGA hardware implementation, via a firmware design, of a complex analytical algorithm: The Hough transform. This is a mathematical spatial transformation used here to facilitate on-the-fly recognition of the trajectories of ionising particles as they pass through the so-called tracker apparatus within high-energy physics detectors. This is a general study to demonstrate that this technique is not only implementable via software-based systems, but can also be exploited using consumer hardware devices. In this context the latter are known as hardware accelerators. In this article in particular, the Xilinx UltraScale+ FPGA is investigated as it belongs to one of the frontier family devices on the market. These FPGAs make it possible to reach high-speed clock frequencies at the expense of acceptable energy consumption thanks to the 14 nm technological node used by the vendor. These devices feature a huge number of gates, high-bandwidth memories, transceivers and other high-performance electronics in a single chip, enabling the design of large, complex and scalable architectures. In particular the Xilinx Alveo U250 has been investigated. A target frequency of 250 MHz and a total latency of 30 clock periods have been achieved using only the 17 & DIVIDE; 53% of LUTs, the 8 & DIVIDE; 12% of DSPs, the 1 & DIVIDE; 3% of Block Rams and a Flip Flop occupancy range of 9 & DIVIDE; 28%.",
keywords = "Hough transform, particle physics, hardware accelerators, particle tracking algorithms, FPGAs",
author = "Alessandro Gabrielli and Fabrizio Alfonsi and Alberto Annovi and Alessandra Camplani and Alessandro Cerri",
year = "2021",
month = oct,
day = "18",
doi = "10.3390/electronics10202546",
language = "English",
volume = "10",
journal = "Electronics",
issn = "1450-5843",
publisher = "Faculty of Electrical Engineering Banja Luka",
number = "20",

}

RIS

TY - JOUR

T1 - Hardware Implementation Study of Particle Tracking Algorithm on FPGAs

AU - Gabrielli, Alessandro

AU - Alfonsi, Fabrizio

AU - Annovi, Alberto

AU - Camplani, Alessandra

AU - Cerri, Alessandro

PY - 2021/10/18

Y1 - 2021/10/18

N2 - In recent years, the technological node used to implement FPGA devices has led to very high performance in terms of computational capacity and in some applications these can be much more efficient than CPUs or other programmable devices. The clock managers and the enormous versatility of communication technology through digital transceivers place FPGAs in a prime position for many applications. For example, from real-time medical image analysis to high energy physics particle trajectory recognition, where computation time can be crucial, the benefits of using frontier FPGA capabilities are even more relevant. This paper shows an example of FPGA hardware implementation, via a firmware design, of a complex analytical algorithm: The Hough transform. This is a mathematical spatial transformation used here to facilitate on-the-fly recognition of the trajectories of ionising particles as they pass through the so-called tracker apparatus within high-energy physics detectors. This is a general study to demonstrate that this technique is not only implementable via software-based systems, but can also be exploited using consumer hardware devices. In this context the latter are known as hardware accelerators. In this article in particular, the Xilinx UltraScale+ FPGA is investigated as it belongs to one of the frontier family devices on the market. These FPGAs make it possible to reach high-speed clock frequencies at the expense of acceptable energy consumption thanks to the 14 nm technological node used by the vendor. These devices feature a huge number of gates, high-bandwidth memories, transceivers and other high-performance electronics in a single chip, enabling the design of large, complex and scalable architectures. In particular the Xilinx Alveo U250 has been investigated. A target frequency of 250 MHz and a total latency of 30 clock periods have been achieved using only the 17 & DIVIDE; 53% of LUTs, the 8 & DIVIDE; 12% of DSPs, the 1 & DIVIDE; 3% of Block Rams and a Flip Flop occupancy range of 9 & DIVIDE; 28%.

AB - In recent years, the technological node used to implement FPGA devices has led to very high performance in terms of computational capacity and in some applications these can be much more efficient than CPUs or other programmable devices. The clock managers and the enormous versatility of communication technology through digital transceivers place FPGAs in a prime position for many applications. For example, from real-time medical image analysis to high energy physics particle trajectory recognition, where computation time can be crucial, the benefits of using frontier FPGA capabilities are even more relevant. This paper shows an example of FPGA hardware implementation, via a firmware design, of a complex analytical algorithm: The Hough transform. This is a mathematical spatial transformation used here to facilitate on-the-fly recognition of the trajectories of ionising particles as they pass through the so-called tracker apparatus within high-energy physics detectors. This is a general study to demonstrate that this technique is not only implementable via software-based systems, but can also be exploited using consumer hardware devices. In this context the latter are known as hardware accelerators. In this article in particular, the Xilinx UltraScale+ FPGA is investigated as it belongs to one of the frontier family devices on the market. These FPGAs make it possible to reach high-speed clock frequencies at the expense of acceptable energy consumption thanks to the 14 nm technological node used by the vendor. These devices feature a huge number of gates, high-bandwidth memories, transceivers and other high-performance electronics in a single chip, enabling the design of large, complex and scalable architectures. In particular the Xilinx Alveo U250 has been investigated. A target frequency of 250 MHz and a total latency of 30 clock periods have been achieved using only the 17 & DIVIDE; 53% of LUTs, the 8 & DIVIDE; 12% of DSPs, the 1 & DIVIDE; 3% of Block Rams and a Flip Flop occupancy range of 9 & DIVIDE; 28%.

KW - Hough transform

KW - particle physics

KW - hardware accelerators

KW - particle tracking algorithms

KW - FPGAs

U2 - 10.3390/electronics10202546

DO - 10.3390/electronics10202546

M3 - Journal article

VL - 10

JO - Electronics

JF - Electronics

SN - 1450-5843

IS - 20

M1 - 2546

ER -

ID: 284621733