Performance and Scalability of Materials Science and Machine Learning Codes on the State-of-Art Hybrid Supercomputer Architecture
8 of top 10 supercomputers of Top500 list published in November 2018 consist of computing nodes with hybrid architectures that require special programming techniques. 5 systems among these are based on Nvidia GPUs. In this paper, we consider the benchmark results of the brand new hybrid supercomputer installed in March 2019 in NRU HSE. This system gives us the possibility to estimate the performance of several widely used material science and machine learning codes that we discuss in this work within the framework of the results available for older HPC systems.
A measurement of the charm-mixing parameter yCP using D0 → KþK−, D0 → πþπ−, and D0 → K−πþ decays is reported. The D0 mesons are required to originate from semimuonic decays of B− and B0 mesons. These decays are partially reconstructed in a data set of proton-proton collisions at center-of-mass energies of 7 and 8 TeV collected with the LHCb experiment and corresponding to an integrated luminosity of 3 fb−1. The yCP parameter is measured to be ð0.57 0.13ðstatÞ 0.09ðsystÞÞ%, in agreement with, and as precise as, the current world-average value.
Online social networks have become an essential communi- cation channel for the broad and rapid sharing of information. Currently, the mechanics of such information-sharing is captured by the notion of cascades, which are tree-like networks comprised of (re)sharing actions. However, it is still unclear what factors drive cascade growth. Moreover, there is a lack of studies outside Western countries and platforms such as Facebook and Twitter. In this work, we aim to investigate what fac- tors contribute to the scope of information cascading and how to predict this variation accurately. We examine six machine learning algorithms for their predictive and interpretative capabilities concerning cascades’ structural metrics (width, mass, and depth). To do so, we use data from a leading Russian-language online social network VKontakte capturing cascades of 4,424 messages posted by 14 news outlets during a year. The results show that the best models in terms of predictive power are Gradient Boosting algorithm for width and depth, and Lasso Regression algorithm for the mass of a cascade, while depth is the least predictable. We find that the most potent factor associated with cascade size is the number of reposts on its origin level. We examine its role along with other factors such as content features and characteristics of sources and their audiences.
Proceedings of Machine Learning Research: Volume 97: International Conference on Machine Learning, 9-15 June 2019, Long Beach, California, USA
The Fifth HCT Information Technology Trends (ITT 2018) is a major international research conference for the presentation of innovative ideas, approaches, technologies, research findings and outcomes, best practices and case studies, national and international projects, institutional standards and policies on Emerging Technologies for Artificial Intelligence. ITT 2018 will provide an outstanding forum for researchers, practitioners, students, policy makers, and users to exchange ideas, techniques and tools, raise awareness and share experiences related to all practical and theoretical aspects of Emerging Technologies for Artificial Intelligence, so as to develop solutions related to communications, computer science and engineering, control systems as well as interdisciplinary research and applications.
Nowadays, the wide spectrum of Intel Xeon processors is available. The new Zen CPU architecture developed by AMD has extended the number of options for x86_64 HPC hardware. This large number of options makes the optimal CPU choice for HPC systems not a straightforward procedure. Such a co-design procedure should follow the requests from the end-users community. Modern computational materials science studies are among the major consumers of HPC resources worldwide. The VASP code is perhaps the most popular tool for these research. In this work, we discuss the benchmark metric and results based on a VASP test model that give us the possibility to compare different CPUs and to select best options with respect to time-to-solution and energy-to-solution criteria.
L’ouvrage d’Adrian Mackenzie, professeur au Département de sociologie à l’Université de Lancaster, est d’un genre inédit au sein de la littérature émergente, mais encore peu étendue en sciences humaines et sociales, qui explore le fonctionnement du machine learning (ML). Les avancées spectaculaires de cette branche de l’intelligence artificielle (IA) depuis quelques années ont éclipsé les autres approches en la matière et ont soudainement transformé l’IA en un problème social et politique. Plusieurs auteurs ont déjà insisté sur la nécessité de focaliser le regard sur les outils de l’IA, en pointant les limites des travaux qui ne traitent que des effets sociaux des « algorithmes ». Comme le fait remarquer l’anthropologue des sciences et des techniques Nick Seaver, la plupart des travaux sur le sujet s’agitent au sujet des « algorithmes » ou le « big data », en insistant sur leurs effets néfastes, voire catastrophiques, pour la société sans jamais préciser exactement ce qu’ils sont. Le transfert des connaissances et des perspectives entre les spécialistes en IA et en SHS (d’ailleurs dans les deux sens) est pourtant indispensable pour en proposer une critique informée et efficace.
A search for CP violation in the Cabibbo-suppressed D0 → K+K−π+π− decay mode is performed using an amplitude analysis. The measurement uses a sample of pp collisions recorded by the LHCb experiment during 2011 and 2012, corresponding to an integrated luminosity of 3.0 fb−1. The D0 mesons are reconstructed from semileptonic b-hadron decays into D0μ−X final states. The selected sample contains more than 160 000 signal decays, allowing the most precise amplitude modelling of this D0 decay to date. The obtained amplitude model is used to perform the search for CP violation. The result is compatible with CP symmetry, with a sensitivity ranging from 1% to 15% depending on the amplitude considered.
Modern Elbrus-4S and Elbrus-8S processors show floating point performance comparable to the popular Intel processors in the field of high-performance computing. Tasks oriented to take advantage of the VLIW architecture show even greater efficiency on Elbrus processors. In this paper the efficiency of the most popular materials science codes in the field of classical molecular dynamics and quantum-mechanical calculations is considered. A comparative analysis of the performance of these codes on Elbrus processor and other modern processors is carried out.