SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

Bankevich A.; Nurk S.; Antipov D.; Gurevich A.; Dvorkin M.; Kulikov A.; Lesin V.; S. I. Nikolenko; Pham S.; Prjibelski A.; Pyshkin A.; A. Sirotkin; Vyahhi N.; Tesler G.; Alekseyev M.; Pevzner P.

?

SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

Journal of Computational Biology. 2012. Vol. 19. No. 5. P. 455–477.

Bankevich A., Nurk S., Antipov D., Gurevich A., Dvorkin M., Kulikov A., Lesin V., Nikolenko S. I., Pham S., Prjibelski A., Pyshkin A., Sirotkin A., Vyahhi N., Tesler G., Alekseyev M., Pevzner P.

The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V−SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.

Research target: Earth Sciences Biology Computer Science

Priority areas: IT and mathematics

Language: English

Full text

Text on another site

Keywords: bioinformatics биоинформатика genome assembly сборка геномов

Комплексный (мультимодельный) прогноз порывов ветра у поверхности земли с заблаговременностью до 6 суток по территории России, Беларуси, Казахстана и государств Центральной Азии

Багров А. Н., Gordin V. A., Светлова Н. А. et al., Метеорология и гидрология 2026 № 1 С. 48–56

A technology for operational complex forecasting (postprocessing)1 of squall wind gusts at the points of weather station location in described. The technology was implemented at the Hydrometeorological Research Center of Russia. It uses the results of forecasts of several forecast models and the complex forecast of average wind. The tests have shown an increase in ...

Added: February 17, 2026

Оценка корреляционных функций и мультимодельный прогноз геопотенциала и температуры в тропосфере и нижней стратосфере

Gordin V. A., Смирнов М. А., Метеорология и гидрология 2025 № 12 С. 18–33

Statistical evaluation of three-dimensional auto- and cross-correlation functions for increments from the first guess was performed to interpolate the complex forecast (postprocessing) of geopotential height and temperature to regular grid points. The forecast fields from the ICON model were used as the first guess. Positive definiteness was provided in the evaluation. The verification of the ...

Added: February 17, 2026

(Re)defining the human chromatome: an integrated meta-analysis of localization, function, abundance, physical properties, and domain composition of chromatin proteins

Yan K., Yu H., Chen S. et al., Nucleic Acids Research 2026 Vol. 54 No. 2 Article gkaf1489

The full complement of chromatin-associated proteins—collectively referred to as the chromatome—enables genome functioning in eukaryotes by participating in a wide range of physico-chemical processes. These include mediating diverse specific and nonspecific intermolecular interactions, catalyzing in situ synthesis and modification of macromolecules, facilitating ATP-dependent chromatin remodeling, etc. Despite considerable progress in epigenomics and the structural characterization ...

Added: February 17, 2026

Управление правами на данные дистанционного зондирования Земли (ДЗЭ): теория и практика

Kalyatin V., Аношин М. И., Егорова И. Н. et al., Экономика космоса 2025

В статье раскрыты основные подходы к правовой охране и защите прав на данные дистанционного зондирования Земли из космоса, используемые в Российской Федерации, а также различные варианты структурирования сделок по передаче данных ДЗЗ потребителям, включая опыт защиты данных ДЗЗ в иностранных юрисдикциях. Отдельное внимание в статье уделяется правовым конструкциям сделок с данными ДЗЗ как в рамках ...

Added: February 17, 2026

Continuous software monitoring backed by process mining: a systematic literature review

Evgenii V. Stepanov, Mitsyuk A. A., International Journal of Data Science and Analytics 2026 Vol. 22 P. 1–29

Software systems are monitored constantly, as it is the only way to ensure their well-functioning. There are several approaches for software monitoring: starting with debugging and profiling of simple programs, and ending with large distributed systems which are monitored by a complex logging infrastructure. As a result of such a monitoring, aggregated numbers (i.e., the ...

Added: February 16, 2026

Learning to hear broken motors: Signature-guided data augmentation for induction motor diagnostics

Ali S., Khizhik A., Ryzhikov A. et al., Engineering Applications of Artificial Intelligence 2026 No. 170 Article 114137

The application of machine learning algorithms in the intelligent diagnosis of three-phase engine has the potential to significantly enhance diagnostic performance and accuracy. Traditional methods largely rely on signature analysis, which, despite being a standard practice, can benefit from the integration of advanced machine learning techniques. In our study, we innovate by combining machine learning ...

Added: February 16, 2026

The Fourteenth International Conference on Learning Representations (ICLR 2026)

International Conference on Learning Representations, 2026.

The Fourteenth International Conference on Learning Representations ...

Added: February 16, 2026

Операционная система Linux. Дистрибьюция программного обеспечения

Silakov D., Юрайт, 2025.

В курсе рассматривается операционная система Linux как платформа для разработки, сборки и распространения программного обеспечения. Предложены как классические подходы к доставке приложений с помощью пакетов, так и современные альтернативы, основанные на использовании контейнеров. Интерактивная комбинация теории, контрольных тестов и практических заданий обеспечивает эффективное и интересное погружение в учебный процесс как для студентов, так и для ...

Added: February 15, 2026

Качество программного кода. Позаботьтесь о долгой жизни ваших программных продуктов

Silakov D., Системный администратор 2025 № 10 С. 42–47

Понятие «качество программного продукта» включает в себя не только полноту и корректность реализации требуемого функционала, но и простоту поддержки и модификации программы. Как же обезопасить себя и коллег от кошмара поддержки нечитаемого кода? ...

Added: February 15, 2026

Искусственный интеллект в решении актуальных социальных и экономических проблем ХХI века : сборник статей по материалам Десятой всероссийской научно-практической конференции с международным участием

Yasnitsky L., Plotnikova E. G., Radionova M. V. et al., Пермский государственный национальный исследовательский университет, 2025.

Представлены материалы Десятой всероссийской научно-практической конференции с международным участием «Искусственный интеллект в решении актуальных социальных и экономических проблем ХХI века», которая проводилась 9–10 октября 2025 г. в Перми, ПГНИУ. Сборник предназначен для научных и педагогических работников, преподавателей, аспирантов, магистрантов, студентов и всех, кто интересуется и занимается проблемами развития и применения методов искусственного интеллекта. ...

Added: February 15, 2026

Special Issue Sensing Technology for Smart Cities: Data, Analytics, and Visualizations

Kharlamov A. A., Pilgun M., [б.и.], 2024.

The analysis of large volumes of data collected from heterogeneous sources is increasingly important for the development of megacities, the advancement of smart city technologies, and ensuring a high quality of life for citizens. This study aimed to develop algorithms for analyzing and interpreting social media data to assess citizens’ opinions in real time and ...

Added: February 15, 2026

Программные инструментальные средства для разработки мероприятий по снижению брака серийного производства

Yasnitsky L., Голдобин М. А., Мезенцев А. С., Прикладная математика и вопросы управления 2025 № 2 С. 99–116

Представлен обзор современных методов и основанных на них программных инструментах, применяемых для математического моделирования серийных производственных процессов с целью снижения брака и повышения качества производимых изделий. Перечисляются группы работ, нацеленных на обнаружение и классификацию дефектов, работ, в которых решаются задачи прогнозирования образования дефектов и определения значимости параметров, работ направленных на поиск оптимального сочетания технологических параметров изготовления изделий, ...

Added: February 15, 2026

Управление жизненным циклом информационных систем

Zaramenskikh E., М.: Образовательная платформа Юрайт, 2025.

В курсе рассматривается история и современное состояние информационных систем, а также все этапы их жизненного цикла — от подготовительного этапа до утилизации. Подробно разбирается теория и практика управления жизненным циклом информационных систем, самые разные методологии структурного анализа и моделирования бизнес-процессов, классические и гибкие процессы разработки информационных систем и предназначенные для этого программные инструменты, а также ...

Added: February 15, 2026

Grain-size and geochemical evidence for sediment transport mechanisms in the northeastern part of the East Siberian Sea and on the adjacent continental slope

Aliev R., Journal of Marine Systems 2025 Vol. 252 Article 104140

Grain-size analyses, end-member modeling, X-ray fluorescence, and radionuclide activity measurements were conducted on sediment minicores collected from the middle-outer shelf of the East Siberian Sea (ESS) and the upper part of the adjacent continental slope to elucidate the sedimentation mechanisms in this poorly studied region. The grain-size data demonstrate that clayey silt and silt strongly ...

Added: February 14, 2026

Total conditional complexity of certain objects

Vereshchagin N., Information and Computation 2026 Vol. 308 P. 1–12

The fine approach to measure information dependence is based on the total conditional complexity CT( y |x), which is defined as the minimal length of a total program that outputs y on the input x. It is known that the total conditional complexity can be much larger than the plain conditional complexity. Such strings x, y are defined ...

Added: February 14, 2026

Diffusion models for synthetic tabular data generation

Hushchyn M., Telesheva E., Doklady Mathematics 2025 No. 527 P. 388–399

he problem of generating high-quality synthetic data is crucial for many data science tasks. A generated dataset can cut the costs on the augmentation of the existing data with additional instances, for example, in physics, or help with its privacy protection, for instance, in banking. However, generating a tabular dataset is challenging, as the data ...

Added: February 12, 2026

Живое наследие памяти: Пётр Владимирович Боярский

Plusnin J., Институт наследия, 2023.

This collection presents materials from a roundtable discussion dedicated to the multifaceted legacy of the outstanding scientist Pyotr Vladimirovich Boyarsky. The discussion took place in April 2023 at the D.S. Likhachev Russian Research Institute of Cultural and Natural Heritage (Heritage Institute), as part of the "Living Heritage of Memory" project. The publication includes welcome addresses from ...

Added: February 12, 2026

Не-эволюционный взгляд на поведение учёного

Plusnin J., Управление наукой: теория и практика 2025 Т. 7 № 2 С. 199–209

The problem of professional motivations of scientists’ activity, their style of behavior in science and the preceding choice of a life path is discussed from the perspective of the concept of invariance of psychobiological bases of behavior. The author substantiates the assertion that the nature of the scientist (their personality type, behavior style and motivational ...

Added: February 12, 2026

Real-Bogus Classification for ZTF Data Releases: Two Approaches

Semenikhin n., Kornilov M., Lavrukhina A. et al., Communications in Computer and Information Science 2026 Vol. 2641 P. 211–219

We considered two fundamentally different approaches to real-bogus classification within the Zwicky Transient Facility survey data. The first approach is based on neural networks that take sequences of object images as input. The second approach uses features extracted from light curves and classical machine learning methods. Several models for both approaches were tested. Quality metrics ...

Added: February 12, 2026

Proglacial successions of springtail assemblages (Collembola) along retreating glaciers in Kabardino-Balkaria, Greater Caucasus, Russia

Антипова М. Д., Бушуева И. С., Бабенко А. Б., Nature conservation research. Zapovednaâ nauka 2026 Vol. 11 No. 1 P. 71–92

Since the end of the Little Ice Age, glacier retreat has been recorded globally, with its rates steadily increasing. Glacier forelands serve as convenient areas for studying the patterns of biotic community formation during primary succession. Collembola (hereinafter – springtails) typically play key roles in primary successions, being among the first colonists of territories newly ...

Added: February 12, 2026

Multimodal graph, surface, and language-based model for protein protein interaction prediction

Arteaga Moreano B. D., Poptsova M., Scientific Reports 2026 No. 16 Article 4772

Accurate prediction of protein-protein interactions (PPIs) is fundamental to understanding biological processes and disease mechanisms. While deep learning offers a powerful alternative to costly experimental methods, existing approaches often overlook critical protein-surface information and rely on simplistic feature fusion techniques, thereby limiting performance. To address this, we introduce GSMFormer-PPI, a novel multimodal framework that integrates ...

Added: February 4, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Серия cs.SI "Social and Information Networks ". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Prediction of protein-protein interactions using point transformer and spherical Convex Hull graphs

David Arteaga, Poptsova M., Computational and Structural Biotechnology Journal 2026 Vol. 31 P. 82–93

Accurate predictions and large-scale identification of protein-protein interactions (PPIs) are crucial for understanding their inherent biological mechanisms and protein functions in virtually all biological processes. Nowadays, graph-based deep learning models have made significant contributions in modeling proteins with physicochemical and geometric features. However, most of these models rely on conventional graph construction methods, such as ...

Added: December 22, 2025