Analysis of Poetic Corpora with Litcon
DOI:
https://doi.org/10.5944/rhd.vol.7.2022.34586
Keywords:
Stylistic Analysis, Content Analysis, Literature, Software, Tools
Abstract
Research in corpus analysis has grown considerably in recent years, and so has the range of tools and resources available for analyzing and exploring textual corpora. After a brief review of the available corpus analysis programs, this article presents Litcon, a new software tool developed with special attention to the study of poetic texts in verse. Litcon is cross-platform and includes the common tools of corpus analysis software, such as the generation of concordances, word frequency lists, and keywords, as well as functions for contrasting corpora, tagging text and analyzing Part-of-Speech tagged text, and creating random samples. Litcon and its tools are presented through a case study of the poetry of Fernando de Herrera.
License
Copyright (c) 2022 Laura Hernández-Lorenzo

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
RHD provides immediate free access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge. RHD does not charge authors fees for the submission, processing, or publication of articles.
Users can read, download, distribute, print, search, partially reproduce, or link to the texts without requesting prior permission from the editor or the author.
RHD is free from the moment each issue is published, and its contents are distributed under a Creative Commons Attribution-NonCommercial 4.0 International license, which gives users free and open access and meets the definition of open access in the Budapest Declaration in favor of open access. This means that the contents can be copied, used, disseminated, transmitted, and exhibited publicly, provided that the authorship and the original source of publication are cited (journal, publisher, and URL of the work), that they are not used for commercial purposes, and that the existence and specifications of this license are mentioned.
The authors retain the copyright and grant the journal the right of first publication of the work. The authors are free to distribute their work published in the journal in other media, such as an institutional repository or inclusion in a book.