Big Data in Bioinformatics
Sequencing of the human genome began in 1990 with the launch of the Human Genome Project. It took about 10 years of work by many scientific teams to produce a draft sequence of human DNA. Modern sequencing technologies can deliver the genome of a specific person in a few days. We discuss the successes of modern bioinformatics associated with the emergence of high-throughput sequencing platforms, which not only expanded the capabilities of many areas of biology and related sciences, but also gave rise to the phenomenon of big data. The article substantiates the need to develop new technologies and methods for the storage, management, analysis, and visualization of big data. Modern bioinformatics faces not only the problem of big data, but also a huge variety of processing and presentation methods and the coexistence of many software tools and data formats. We discuss ways to solve these problems, in particular by drawing on experience with big data from other fields, such as network analysis and business data analysis. Non-relational database management systems can help solve the problem of storing big data while keeping search query execution times acceptable. New programming technologies, such as generic programming and visual programming, are designed to address the diversity of genomic data formats and to let researchers quickly create their own data-processing scripts.