• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Article

Автоматическое извлечение текстовых и числовых веб-данных для целей социальных наук

The paper describes the procedures of automatic data extraction from web pages (web scraping), its advantages and limitations, as well as gives an overview of the basic minimum of competencies for web scraping: in particular, programming using Python and navigating through a web pages’ code. A detailed illustration is also given based on a fragment of the data collection process from a recent relevant Russian study.