Workshop of the 5th International Conference on Learning Representations (ICLR)

?

Workshop of the 5th International Conference on Learning Representations (ICLR)

2017.

The performance of machine learning methods is heavily dependent on the choice of data representation (or features) on which they are applied. The rapidly developing field of representation learning is concerned with questions surrounding how we can best learn meaningful and useful representations of data. We take a broad view of the field and include topics such as deep learning and feature learning, metric learning, compositional modeling, structured prediction, reinforcement learning, and issues regarding large-scale learning and non-convex optimization. The range of domains to which these techniques apply is also very broad, from vision to speech recognition, text understanding, gaming, music, etc.

Semantic embeddings for program behaviour patterns

Chistyakov A., Lobacheva E., Kuznetsov A. et al., , in : Workshop of the 5th International Conference on Learning Representations (ICLR). : [б.и.], 2017. P. 1-4.

In this paper, we propose a new feature extraction technique for program execution logs. First, we automatically extract complex patterns from a program's behavior graph. Then, we embed these patterns into a continuous space by training an autoencoder. We evaluate the proposed features on a real-world malicious software detection task. We also find that the ...

Added: October 31, 2018

Language: English

Text on another site

Keywords: deep learning

Workshop of the 5th International Conference on Learning Representations (ICLR)

Fault detection in Tennessee Eastman process with temporal deep learning models

Lomov I., Lyubimov M., Makarov I. et al., Journal of Industrial Information Integration 2021 Vol. 23 Article 100216

Automated early process fault detection and prediction remains a challenging problem in industrial processes. Traditionally it has been done by multivariate statistical analysis of sensor readings and, more recently, with the help of machine learning methods. The quality of machine learning models strongly depends on feature engineering, that in turn heavily relies on expertise of ...

Added: March 21, 2021

Эмоциональный анализ постов в ВКонтакте: классификатор или регрессор

Kolmogorova A., Калинин А. А., В кн. : Компьютерная лингвистика и интеллектуальные технологии: по материалам международной конференции «Диалог 2022», выпуск 21. Вып. 21.: Изд-во РГГУ, 2022. С. 311-322.

The article summarizes the results of two tasks in machine learning paradigm: the task of classification according to the criterion of dominating emotion on the data of social networks posts in Russian and the regression task using the same data. The experiments are conducted on the data set collected from VKontakte social network and consisted of 3879 posts ...

Added: March 18, 2024

Lost in Conversation: A Conversational Agent Based on the Transformer and Transfer Learning

Golovanov S., Tselousov A., Rauf Kurbanov et al., , in : The NeurIPS '18 Competition: From Machine Learning to Intelligent Conversations. : Springer, 2020. P. 295-315.

Added: February 20, 2021

A Deep Learning Method Study of User Interest Classification

Malafeev A., Nikolaev K., , in : Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information Science. Vol. 1086.: Springer, 2020. P. 154-159.

In this paper, a deep learning method study is conducted to solve a new multiclass text classification problem, identifying user interests by text messages. We used an original dataset of almost 90 thousand forum text messages, labeled for ten interests. We experimented with different modern neural network architectures: recurrent and convolutional, as well as simpler ...

Added: November 7, 2019

Semantic embeddings for program behaviour patterns

Chistyakov A., Lobacheva E., Kuznetsov A. et al., , in : Workshop of the 5th International Conference on Learning Representations (ICLR). : [б.и.], 2017. P. 1-4.

Added: October 31, 2018

Traffic4cast at NeurIPS 2021 - Temporal and Spatial Few-Shot Transfer Learning in Gridded Geo-Spatial Processes

Eichenberger C., Neun M., Martin H. et al., , in : Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track. : PMLR, 2022. P. 97-112.

Added: October 11, 2022

Unet-boosted classifier – мультизадачная архитектура для малых выборок на примере классификации МРТ снимков головного мозга

Sobyanin K., Kulikova S., Информатика и автоматизация (Труды СПИИРАН) 2024 Т. 23 № 4 С. 1022-1046

The problem of training deep neural networks on small samples is especially relevant for medical problems. The paper examines the impact of pixel-wise marking of significant objects in the image, over the true class label, on the quality of the classification. To achieve better classification results on small samples, we propose a multitasking architecture -- ...

Added: June 29, 2024

User-controllable Multi-texture Synthesis with Generative Adversarial Networks

Alanov A., Kochurov M., Volkhonskiy D. et al., , in : Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2020). Vol. 4.: SciTePress, 2020. P. 214-221.

We propose a novel multi-texture synthesis model based on generative adversarial networks (GANs) with a user-controllable mechanism. The user control ability allows to explicitly specify the texture which should be generated by the model. This property follows from using an encoder part which learns a latent representation for each texture from the dataset. To ensure ...

Added: November 8, 2020

Bayesian Sparsification of Recurrent Neural Networks

Lobacheva E., Chirkova N., Vetrov D., / International Conference on Machine Learning. Series 1 "Workshop on Learning to Generate Natural Language". 2017.

Recurrent neural networks show state-of-the-art results in many text analysis tasks but often require a lot of memory to store their weights. Recently proposed Sparse Variational Dropout (Molchanov et al., 2017) eliminates the majority of the weights in a feed-forward neural network without significant loss of quality. We apply this technique to sparsify recurrent neural ...

Added: October 19, 2017

On Embeddings for Numerical Features in Tabular Deep Learning

Gorishniy Y., Ivan Rubachev, Babenko A., , in : Thirty-Sixth Conference on Neural Information Processing Systems : NeurIPS 2022. : Curran Associates, Inc., 2022. Ch. 1. P. 24991-25004.

Added: January 28, 2023

Weight Averaging Improves Knowledge Distillation under Domain Shift

Berezovskiy V., Morozov N., , in : The 2nd Workshop and Challenges for Out-of-Distribution Generalization in Computer Vision. ICCV 2023. : [б.и.], 2023.

Knowledge distillation (KD) is a powerful model compression technique broadly used in practical deep learning applications. It is focused on training a small student network to mimic a larger teacher network. While it is widely known that KD can offer an improvement to student generalization in i.i.d setting, its performance under domain shift, i.e. the ...

Added: November 20, 2023

Nucleus segmentation: towards automated solutions

Hollandi R., Moshkov N., Paavolainen L. et al., Trends in Cell Biology 2022

Single nucleus segmentation is a frequent challenge of microscopy image processing, since it is the first step of many quantitative data analysis pipelines. The quality of tracking single cells, extracting features or classifying cellular phenotypes strongly depends on segmentation accuracy. Worldwide competitions have been held, aiming to improve segmentation, and recent years have definitely brought ...

Added: January 21, 2022

Z-flipon variants reveal the many roles of Z-DNA and Z-RNA in health and disease

Umerenkov D., Herbert A., Konovalov Dmitrii et al., Life Science Alliance 2023 Vol. 6 No. 7 Article e202301962

Identifying roles for Z-DNA remains challenging given their dynamic nature. Here, we perform genome-wide interrogation with the DNABERT transformer algorithm trained on experimentally identified Z-DNA forming sequences (Z-flipons). The algorithm yields large performance enhancements (F1 = 0.83) over existing approaches and implements computational mutagenesis to assess the effects of base substitution on Z-DNA formation. We ...

Added: June 9, 2023

Method of Critical Set construction for Successive Cancellation List Decoder of Polar Codes Based on Deep Learning of Neural Networks

Kotov F., Ivanov F., Timokhin I., , in : 2023 XVIII International Symposium Problems of Redundancy in Information and Control Systems (REDUNDANCY). : IEEE, 2023. P. 64-69.

The Successive Cancellation List (SCL) algorithm is a widely used decoding technique in communication systems. However, constructing the critical set for SCL decoding is a challenging task, as it requires a large number of computations and can lead to significant decoding delays. In this paper, a new approach to critical set construction for SCL decoding ...

Added: December 9, 2023

Speech decoding from a small set of spatially segregated minimally invasive intracranial EEG electrodes with a compact and interpretable neural network

Petrosyan A., Voskoboynikov A., Sukhinin D. et al., Journal of Neural Engineering 2022 Vol. 19 No. 6 Article 066016

Objective. Speech decoding, one of the most intriguing brain-computer interface applications, opens up plentiful opportunities from rehabilitation of patients to direct and seamless communication between human species. Typical solutions rely on invasive recordings with a large number of distributed electrodes implanted through craniotomy. Here we explored the possibility of creating speech prosthesis in a minimally ...

Added: December 9, 2022

Deep Learning for Non-Invasive Cortical Potential Imaging

Razorenova A., Yavich N., Malovichko M. et al., , in : Machine Learning in Clinical Neuroimaging and Radiogenomics in Neuro-oncology. Third International Workshop, MLCN 2020, and Second International Workshop, RNO-AI 2020. Lecture Notes in Computer Science. Vol. 12449: Machine Learning in Clinical Neuroimaging and Radiogenomics in Neuro-oncology.: Springer, 2020. Ch. 5. P. 45-55.

Electroencephalography (EEG) is a well-established non-invasive technique to measure the brain activity, albeit with a limited spatial resolution. Variations in electric conductivity between different tissues distort the electric fields generated by cortical sources, resulting in smeared potential measurements on the scalp. One needs to solve an ill-posed inverse problem to recover the original neural activity. In this article, ...

Added: December 10, 2020

Training restricted Boltzmann machines to generate human-like eye movements

Krasovskaya S., Zhulikov G., MacInnes W., , in : European Conference on Visual Perception 2017 Abstract Book. : [б.и.], 2017. Ch. 2. P. 18-18.

Approximately twenty years ago, Laurent Itti and Christof Koch created a saliency map of visual attention in an attempt to recreate the work of biological pyramidal neurons by mimicking neurons with centre-surround receptive fields. The Saliency Model launched many studies that contributed to the understanding of layers of vision and the sphere of visual attention. ...

Added: October 15, 2018

Recognition of DNA Secondary Structures as Nucleosome Barriers with Deep Learning Methods

Pavlov F., Poptsova M., , in : 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). : Seul : IEEE, 2020. P. 2800-2805.

Added: March 29, 2021

Interpretable Feature Generation in ECG Using a Variational Autoencoder

Kuznetsov V. V., Moskalenko V. A., Gribanov D. et al., Frontiers in Genetics 2021 Article 638191

We propose a method for generating an electrocardiogram (ECG) signal for one cardiac cycle using a variational autoencoder. Our goal was to encode the original ECG signal using as few features as possible. Using this method we extracted a vector of new 25 features, which in many cases can be interpreted. The generated ECG has ...

Added: October 29, 2021

ABC: A Big CAD Model Dataset For Geometric Deep Learning

Koch S., Matveev A., Jiang Z. et al., , in : Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019). : IEEE, 2019. P. 9601-9611.

We introduce ABC-Dataset, a collection of one million Computer-Aided Design (CAD) models for research of geometric deep learning methods and applications. Each model is a collection of explicitly parametrized curves and surfaces, providing ground truth for differential quantities, patch segmentation, geometric feature detection, and shape reconstruction. Sampling the parametric descriptions of surfaces and curves allows ...

Added: November 26, 2019

The Deep Weight Prior

Atanov A., Ashukha A., Struminsky K. et al., , in : Proceedings of the 7th International Conference on Learning Representations (ICLR 2019). : ICLR, 2019. P. 1-17.

Bayesian inference is known to provide a general framework for incorporating prior knowledge or specific properties into machine learning models via carefully choosing a prior distribution. In this work, we propose a new type of prior distributions for convolutional neural networks, deep weight prior (DWP), that exploit generative models to encourage a specific structure of ...

Added: September 2, 2019

Differentiable Rendering with Reparameterized Volume Sampling

Morozov N., Rakitin D., Oleg Desheulin et al., , in : Neural Fields across Fields: Methods and Applications of Implicit Neural Representations. ICLR 2023 Workshop. : [б.и.], 2023. Ch. 8.

In view synthesis, a neural radiance field approximates underlying density and radiance fields based on a sparse set of scene pictures. To generate a pixel of a novel view, it marches a ray through the pixel and computes a weighted sum of radiance emitted from a dense set of ray points. This rendering algorithm is ...

Added: July 18, 2023

Определение заболеваний маниока методами компьютерного зрения

Терещенко С. Н., Perov A., Осипов А. Л., Siberian Journal of Life Sciences and Agriculture 2021 Т. 13 № 1 С. 144-155

Background. Development of a convolutional neural network model for detecting cassava diseases from a mobile phone photo. Materials and methods. The material for the research was taken images with various types of cassava diseases, published in open access of the Kaggle platform. Research methods: theory of design and development of information systems, programming, methods of augmentation and extension ...

Added: November 17, 2021

Интерфейс мозг-компьютер: опыт построения, использования и возможные пути повышения рабочих характеристик

Volkova K., Dagaev N., Киселёв А. С. et al., Журнал высшей нервной деятельности им. И.П. Павлова 2017 Т. 67 № 4 С. 504-520

Brain-computer interfaces find application in a number of different areas and have the potential to be used for research as well as for practical purposes. The clinical use of BCI includes current studies on neurorehabilitation ([Frolov et al., 2013; Ang et al., 2010]), and there is the prospect of using BCI to restore movement and ...

Added: October 19, 2017