?
Deploying Elbrus VLIW CPU ecosystem for materials science calculations: performance and problems
P. 149–159.
Modern Elbrus-4S and Elbrus-8S processors show floating point performance comparable to the popular Intel processors in the field of high-performance computing. Tasks oriented to take advantage of the VLIW architecture show even greater efficiency on Elbrus processors. In this paper the efficiency of the most popular materials science codes in the field of classical molecular dynamics and quantum-mechanical calculations is considered. A comparative analysis of the performance of these codes on Elbrus processor and other modern processors is carried out
Serzhantova M. V., Titov M. A., Obvertkin I. V., Journal of Physics: Conference Series 2019 No. 1353 Article 012020
This article presents a research on Hexagonal Boron Nitride (h-BN) monolayer cell strain effect 2 % and 4 %. Structure of h-BN with nitrogen vacancy, with boron vacancy and with divacancy was considered for this. The calculations were carried out within framework of the density functional formalism with gradient corrections and using the VASP package. ...
Added: February 20, 2023
Sergei Valentinovich Fedorenko, IEEE Access 2021 Vol. 9 P. 38673–38686
A novel method for finding roots of polynomials over finite fields has been proposed.
This method is based on the cyclotomic discrete Fourier transform algorithm.
The improvement is achieved by using the normalized cyclic convolutions,
which have a small complexity and allow matrix decomposition,
as well as methods of adapting the truncated normalized cyclic convolutions calculation.
For small values of ...
Added: April 15, 2021
Zlotnik A.A., Zlotnik I.A., Computational Mathematics and Mathematical Physics 2020 Vol. 60 No. 2 P. 240–257
We present direct logarithmically optimal in theory and fast in practice algorithms to implement the tensor products finite element method (FEM) based on the tensor products of the 1D high-order FEM spaces on multi-dimensional rectangular parallelepipeds for solving the $N$-dimensional Poisson type equation $-\Delta u+\alpha u=f$ ($N\geq 2$) with the Dirichlet boundary conditions. They are based ...
Added: May 19, 2020
Kondratyuk N., Smirnov G., Agarkov A. et al., , in: Supercomputing. RuSCDays 2019. Communications in Computer and Information ScienceVol. 1129: Supercomputing. RuSCDays 2019.: Springer, 2019. P. 597–609.
8 of top 10 supercomputers of Top500 list published in November 2018 consist of computing nodes with hybrid architectures that require special programming techniques. 5 systems among these are based on Nvidia GPUs. In this paper, we consider the benchmark results of the brand new hybrid supercomputer installed in March 2019 in NRU HSE. This ...
Added: December 11, 2019
Guschina O., Shevgunov T., Efimov E. et al., , in: Advances in Intelligent Systems and Computing* 2. Vol. 1047: Proceedings of 3rd Computational Methods in Systems and Software 2019.: Springer, 2019. P. 167–175.
This paper deals with the technique known as the periodic synchronous averaging. The exact analytical expression for the fast Fourier transform (FFT) representing the digital spectrum of the signal undergoing periodic synchronous averaging is derived using the general signal and spectral framework. This formula connects the coefficient of Fourier series of the original continuous-time signal ...
Added: December 1, 2019
Fedorenko Sergei Valentinovich, IEEE Signal Processing Letters 2019 Vol. 26 No. 9 P. 1320–1324
An effective calculation of the Reed-Solomon code syndrome is proposed. The method is based on the use of the partial normalized cyclic convolutions in the partial inverse cyclotomic discrete Fourier transform. The method is the best of the known algorithms, in terms of multiplicative complexity. ...
Added: September 4, 2019
Stegailov V., Timofeev A., , in: Supercomputing. RuSCDays 2018. Communications in Computer and Information Science, vol 965. Springer, Cham.: Springer, 2019. P. 543–553.
Modern Elbrus-4S and Elbrus-8S processors show floating point performance comparable to the popular Intel processors in the field of high-performance computing. Tasks oriented to take advantage of the VLIW architecture show even greater efficiency on Elbrus processors. In this paper the efficiency of the most popular materials science codes in the field of classical molecular ...
Added: March 10, 2019
Stegailov V., Дергунов Д. О., Timofeev A., , in: Communications in Computer and Information ScienceVol. 910: Parallel Computational Technologies.: Springer, 2018. P. 92–103.
Modern Elbrus-4S and Elbrus-8S processors provide a level of floating-point performance close to that of widespread x86_64 CPUs that are predominantly used in high-performance computing (HPC). The uniqueness of the software ecosystem of Elbrus processors requires special attention in the case of their deployment for execution of mainstream computational codes. In this paper, we consider ...
Added: March 10, 2019
Ефимов Е. Н., Shevgunov T., Жуков Д. М., Электросвязь 2018 № 12 С. 43–47
Описывается технический прототип программно-аппаратной системы анализа циклостационарных сигналов, построенной на базе программно-определяемого радио. В качестве аппаратной платформы используется чип REALTEK RTL2832U; программное обеспечение (ПО) выполнено с помощью свободного ПО – фреймворка Qt и предназначено для работы на компьютере под управлением операционной системы семейства GNU\Linux. Приведено описание основных функциональных блоков системы, рассмотрены варианты реализации алгоритма оценки ...
Added: February 5, 2019
Sergei Valentinovich Fedorenko, IEEE Signal Processing Letters 2016 Vol. 23 No. 6 P. 824–827
A novel method for computing the discrete Fourier transform (DFT) over a finite field based on the Goertzel-Blahut algorithm is described. The novel method is currently the best one for computing the DFT over even extensions of the characteristic two finite field, in terms of multiplicative complexity. ...
Added: January 26, 2018
Stegailov V., Vecher V., , in: Communications in Computer and Information ScienceBook 793: Supercomputing.: Switzerland: Springer, 2017. P. 430–441.
Nowadays, the wide spectrum of Intel Xeon processors is available. The new Zen CPU architecture developed by AMD has extended the number of options for x86_64 HPC hardware. This large number of options makes the optimal CPU choice for HPC systems not a straightforward procedure. Such a co-design procedure should follow the requests from the ...
Added: November 29, 2017
Shevgunov T., Ефимов Е. Н., Жуков Д. М., Электросвязь 2017 № 6 С. 50–57
Предложен алгоритм 2N-БФП, предназначенный для оценивания циклической спектральной плотности мощности цифровых сигналов на основе их реализаций конечной длительности. Алгоритм основан на методе усреднения во времени циклических периодограмм, в котором для достижения необходимого разрешения по циклической частоте применена спектральная интерполяция, выполняемая с использованием алгоритма БПФ. Работа алгоритма проиллюстрирована на примере двух сигналов: АМ-радиосигнала и сигнала с ...
Added: July 10, 2017
Zlotnik A.A., Zlotnik I.A., Doklady Mathematics 2017 Vol. 95 No. 2 P. 129–135
A new fast direct algorithm for implementing a finite element method (FEM) of order on rectangles as applied to boundary value problems for Poisson-type equations is described that extends a well-known algorithm for the case of difference schemes or bilinear finite elements (n = 1). Its core consists of fast direct and inverse algorithms for ...
Added: February 28, 2017