Evaluating the Performance of a New Text Rhythm Analysis Tool
The paper assesses and evaluates the performance of the ProseRhythmDetector (PRD) Text Rhythm Analysis Tool. The research is a case study of 50 English and 50 Russian fictional texts (approximately 88,000 words each) from the 19th to the 21st century. The paper assesses the PRD tool accuracy in detecting stylistic devices containing repetition in their structure such as diacope, epanalepsis, anaphora, epiphora, symploce, epizeuxis, anadiplosis, and polysyndeton. The article ends by discussing common errors, analysing disputable cases and highlighting the use of the tool for author and idiolect identification.
Balint, M., & Trausan-Matu, S. (2016). A critical comparison of rhythm in music and natural language. Annals of the Academy of Romanian Scientists, Series on Science and Technology of Information, 9(1), 43–60.
Boychuk, E., & Belyaeva, O. (2019). La téchnique de stylométrie réalisée à la base de l’analyse informatique du rythme du texte. 10-ièmes Journées Internationales de Linguistique de Corpus (JLC). Université Grenoble-Alpes, 26-28.11.19. 163-167.
Boychuk E., Vorontsova I., Shliakhtina E., Lagutina K., Belyaeva O. (2020) Automated Approach to Rhythm Figures Search in English Text. In Wil M. P. van der Aalst, V. Batagelj, D. I. Ignatov, M. Khachay, V. Kuskova, A. Kutuzov, S. O. Kuznetsov, I. A. Lomazova, N. Loukachevitch, A. Napoli, P. M. Pardalos, M. Pelillo, A. V. Savchenko, E. Tutubalina (Eds.), Analysis of Images, Social Networks and Texts. AIST 2019, Communications in Computer and Information Science, vol 1086 (pp. 107-119). Springer. https://doi.org/10.1007/978-3-030-39575-9_11
Dubremetz, M., & Nivre, J. (2018). Rhetorical Figure Detection: Chiasmus, Epanaphora, Epiphora. Frontiers in Digital Humanities, 5(10). 1-16. https://doi.org/10.3389/fdigh.2018.00010
Dumalus, A., & Fernandez, P. (2011). Authorship attribution using writers rhythm based on lexical stress. Proceedings of the 11th Philippine Computing Science Congress. 82–88
Hou, R., & Huang, C. (2020). Robust stylometric analysis and author attribution based on tones and rimes. Natural Language Engineering, 26(1), 49-71. https://doi.org/10.1017/S135132491900010X
Lagutina, K., Lagutina, N., Boychuk, E., Vorontsova, I., Shliakhtina, E., Belyaeva, O., Paramonov, I. (2019). A Survey on Stylometric Text Features. 25th Conference of Open Innovations Association (FRUCT), Helsinki, Finland, 2019, 184-195. https://doi.org/10.23919/FRUCT48121.2019.8981504
Larionov, V., Petryakov, V., Poletaev, A., Lagutina, K., Manakhova, A., Lagutina, N. and Boychuk, E., (2020). ProseRhythmDetector. K.D. Ushinsky Yaroslavl State Pedagogical University, Yaroslavl, Russia. https://github.com/text-processing/prose-rhythm-detector
Plecháč, P., Bobenhausen, K., Hammerich, B. (2018). Versification and authorship attribution. A pilot study on Czech, German, Spanish, and English poetry. Studia Metrica et Poetica, 5(2), 29–54. https://doi.org/10.12697/smp.2018.5.2.02
Copyright (c) 2020 Elena Boychuk, Ksenia Lagutina, Inna Vorontsova, Elena Mishenkina, Olga Belyayeva
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
All published articles in the ESNBU are licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This license lets others remix, tweak, and build upon your work non-commercially, and although their new works must also acknowledge you and be non-commercial, they don’t have to license their derivative works on the same terms.
In other words, under the CC BY-NC 4.0 license users are free to
Share — copy and redistribute the material in any medium or format
Adapt — remix, transform, and build upon the material
Under the following terms:
Attribution (by) - All CC licenses require that others who use your work in any way must give you credit the way you request, but not in a way that suggests you endorse them or their use. If they want to use your work without giving you credit or for endorsement purposes, they must get your permission first.
NonCommercial (nc) - You let others copy, distribute, display, perform, and modify and use your work for any purpose other than commercially unless they get your permission first.
If the article is to be used for commercial purposes, we suggest authors be contacted by email.
If the law requires that the article be published in the public domain, authors will notify ESNBU at the time of submission, and in such cases the article shall be released under the Creative Commons 1 Public Domain Dedication waiver CC0 1.0 Universal.
Copyright for articles published in ESNBU are retained by the authors, with first publication rights granted to the journal. Authors retain full publishing rights and are encouraged to upload their work to institutional repositories, social academic networking sites, etc. ESNBU is not responsible for subsequent uses of the work. It is the author's responsibility to bring an infringement action if so desired by the author.
Occasionally ESNBU may co-publish articles jointly with other publishers, and different licensing conditions may then apply.