The paper presents a quantitative research of characters' direct speech patterns in Leo Tolstoy's War and Peace. Tolstoy was known to put a lot of emphasis on the language in which his characters express themselves, and conscious modification of their speech is acknowledged as part of the author's literary technique. In an attempt to measure the scope and intensity of such modification, we extracted speech activity instances from the text of War and Peace, associated them with speakers and identified some distinctive features. We then used these features to train a classifier to recognize the speaker according to the speech. Our hypothesis was that if Tolstoy’s characters actually possessed any unique speech characteristics, the classifier would be able to predict the speaker with some tolerable accuracy.
In this paper, we describe our work on an ongoing project titled “Tolstoy Digital”. Our chief objective is to convert the 90-volume collected works of Leo Tolstoy into a proper “digital edition” with help of TEI.
The significance of memoir complex associated with the large Russian historians of the XX century, as an important historical source.
The paper argues that we should rethink the relation between facts and scholarship in the humanities. This thesis should not be misunderstood as an argument for unreflective positivism. But new technological developments in the 'digital humanities' suggest that the collection of facts in machine-readable form (e. g. as 'nanopublications') facilitates new strategies for interpreting, visualizing or archiving information in the humanities. The paper discusses a concrete application of these insights in the history of philosophy, namely the use of nanopublications as an instrument in 'collective doxography'.