A generative syntax for the computational modeling of Spanish nominal ellipsis
Published 2023-09-30
Keywords
- grammar
- nominal ellipsis
- generative grammar
- Partial Identity Theory
- computational linguistics
Copyright (c) 2023 RASAL Lingüística
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Abstract
This paper analyzes Spanish nominal ellipsis within the general principles of Generative Grammar (Chomsky, 1965, 1995) and Partial Identity Theory (Saab, 2008, 2019). Building on these accounts, nominal ellipsis is represented in a computational grammar that can not only identify the construction but also restore the elided nominal element. Although a computational component is presented, the main focus is on the grammatical structure and on how it serves as a suitable tool for solving some problems in the automatic identification of these and similar constructions. Nominal ellipsis, however, has not been studied as thoroughly as verbal ellipsis. To verify the computational representation of this syntactic phenomenon, two tools were created in NooJ (Silberztein, 2005, 2016): an electronic dictionary and a computational grammar. Both were evaluated on a 423,000-word corpus from the medical domain. The computational grammar was not trained on, or previously adjusted to, any other corpus. A manual review identified 355 cases of ellipsis, 335 of which the grammar detected. The results were 99.11% precision, 94.36% recall (coverage), and an F-measure of 96.68%.
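The evaluation figures in the abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration, not the authors' evaluation script: it assumes that the reported 99.11% figure is precision, that coverage means recall (335 detected out of 355 manually identified cases), and that the F-measure is the balanced harmonic mean of the two. All names are hypothetical.

```python
# Figures reported in the abstract.
GOLD_CASES = 355             # ellipsis cases found in the manual review
DETECTED_CASES = 335         # of those, cases the grammar detected
REPORTED_PRECISION = 0.9911  # assumption: the 99.11% figure is precision


def recall(true_positives: int, gold_total: int) -> float:
    """Share of manually identified cases that the grammar detected."""
    return true_positives / gold_total


def f_measure(precision: float, recall_value: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    return 2 * precision * recall_value / (precision + recall_value)


r = recall(DETECTED_CASES, GOLD_CASES)
f = f_measure(REPORTED_PRECISION, r)

print(f"recall    = {r:.4f}")
print(f"F-measure = {f:.4f}")
```

Under these assumptions the recall is 335/355, which is about 0.9437 when rounded (94.36% when truncated, as in the abstract), and the F-measure agrees with the reported 96.68%.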
References
- Ackema, P., and Szendroi, K. (2002). Determiner sharing as an instance of dependent ellipsis. Journal of Comparative Germanic Linguistics 5, 3–34.
- Aelbrecht, L. (2010). The syntactic licensing of ellipsis. John Benjamins Publishing Company.
- Arregi, K., and Centeno, N. (2005). Determiner sharing and cyclicity in Wh-movement. In R. Gess and E. J. Rubin (Eds.), Theoretical and experimental approaches to Romance linguistics (pp. 1–19). John Benjamins.
- Bosque, I., and Demonte, V. (Eds.). (1999). Gramática descriptiva de la lengua española. Espasa.
- Bosque, I., and Gutiérrez, J. (2016). Fundamentos de Sintaxis Formal. Akal.
- Bresnan, J. (1973). Syntax of the Comparative Clause Construction in English. Linguistic Inquiry 4 (3), 275-343.
- Brucart, J. (1987). La elisión sintáctica en español. Bellaterra.
- Brucart, J. (1999). La elipsis. In I. Bosque and V. Demonte (Eds.), Gramática descriptiva de la lengua española (pp. 2787-2863). Espasa.
- Brucart, J. M., and MacDonald, J. E. (2012). Empty Categories and Ellipsis. In The Handbook of Hispanic Linguistics (pp. 579-601). Wiley Blackwell.
- Brucart, J. M., and Fernández-Sánchez, J. (2018). Ellipsis (in Romance languages) [PDF file]. http://docs.wixstatic.com/ugd/d72a87_e042fdaabcb64f09a137aba5763a16cc.pdf
- Centeno, N. (2012). Determiner sharing in Spanish: DET y DETFoc. In R. Campos-Astorkiza and J. Franco (Eds.), Papers in linguistics by the BIDE generation, Anuario del Seminario de Filología Vasca “Julio de Urquijo” XLVI-1 (2012), 109-123 [PDF file]. https://addi.ehu.es/bitstream/handle/10810/49505/12597-46273-1PB.pdf?sequence=1
- Chomsky, N. (1957). Syntactic Structures. Mouton.
- Chomsky, N. (1965). Aspects of the Theory of Syntax. MIT Press.
- Chomsky, N. (1995). The Minimalist Program. MIT Press.
- Chung, S., Ladusaw, W., and McCloskey, J. (1995). Sluicing and Logical Form. Natural Language Semantics, 3, 239-282.
- Chung, S. (2013). Syntactic identity in sluicing: How much and why. Linguistic Inquiry 44(1), 1-44.
- Cinque, G. (1993). On the Evidence for Partial N-movement in the Romance DP. University of Venice Working Papers in Linguistics 3 (2), 21-40.
- Contreras, H., and Masullo, P. (2000). Motivating Merge. In M. Leonetti, O. Fernández, and V. Escandell Vidal (Eds.), Current Issues in Generative Grammar. Universidad de Alcalá, Madrid.
- Correas, G. (1967). Arte Kastellana. Universidad de Santiago de Compostela.
- Embick, D. (2000). Features, Syntax, and Categories in the Latin Perfect. Linguistic Inquiry 31(2), 185-230.
- Embick, D. (2007). Linearization and Local Dislocation: Derivational Mechanics and Interactions. Linguistic Analysis 33(3-4), 2-35.
- Embick, D., and Noyer, R. (2001). Movement Operations after Syntax. Linguistic Inquiry 32(4), 555-595.
- Embick, D., and Halle, M. (2004). Word Formation: Aspects of the Latin Conjugation in Distributed Morphology. Mouton de Gruyter.
- Embick, D., and Marantz, A. (2008). Architecture and blocking. Linguistic Inquiry 39(1), 1-53.
- Fiengo, R. (1977). On trace theory. Linguistic Inquiry 8, 35-61.
- Fiengo, R., and May, R. (1994). Indices and Identity. MIT Press.
- Fox, D. (2000). Economy and Semantic Interpretation. MIT Press.
- Gallego, A. (2011). Sobre la elipsis. Madrid: Arco.
- Grimshaw, J. (1990). Argument Structure. MIT Press.
- Gruet-Skrabalova, H. (2016). Ellipsis in sequences of multiple wh-words. Linguistica Brunensia, Masarykova univerzita, Brno 64(1), 57-69.
- Gutiérrez, J., and Bosque, I. (2009). Fundamentos de sintaxis formal. Akal.
- Halle, M. (1997). Distributed Morphology: Impoverishment and Fission. MIT Working Papers in Linguistics 30, 425-449.
- Halle, M., and Marantz, A. (1993). Distributed Morphology and the Pieces of Inflection. In K. Hale and S. Keyser (Eds.), The view from Building 20 (pp. 111-176). MIT Press.
- Hankamer, J., and Sag, I. (1976). Deep and Surface Anaphora. Linguistic Inquiry 7(3), 391-426.
- Hardt, D. (1993). Verb phrase ellipsis: Form, meaning and processing [Doctoral dissertation, University of Pennsylvania].
- Hardt, D. (1997). An empirical approach to VP ellipsis. Computational Linguistics, 23(4), 525–541.
- Hardt, D., and Rambow, O. (2001). Generation of VP ellipsis: A corpus-based approach. In Proceedings of ACL.
- Harley, H., and Noyer, R. (1999). Distributed Morphology. Glot International 4(4), 3-9.
- Kehler, A. (2002). Coherence, reference, and the theory of grammar. Stanford: CSLI Publications.
- Kester, P. M., and Sleeman, P. (2002). N-Ellipsis in Spanish. In H. Broekhuis and P. Fikkert (Eds.), Linguistics in the Netherlands 2002 (pp. 107-116). Benjamins.
- Kornfeld, L. (2012). Núcleos complejos y división del trabajo lingüístico [Master's thesis, Universidad Nacional de Comahue].
- Kornfeld, L., and Saab, A. (2005, August 18-20). Hacia una tipología de las anáforas nominales en español. III Encuentro de Gramática Generativa, Neuquén.
- Laca, B. (1999). Presencia y ausencia del determinante. In I. Bosque and V. Demonte (Eds.), Gramática descriptiva de la lengua española (3 vols.). Espasa-Calpe.
- Lappin, S. (2005). A sequenced model of anaphora and ellipsis resolution. In A. Branco, T. McEnery, and R. Mitkov (Eds.), Anaphora processing: Linguistic, cognitive, and computational modelling (pp. 3–16). Amsterdam: John Benjamins.
- Lappin, S., and Leass, H. (1994). A Syntactically Based Algorithm for Pronominal Anaphora Resolution. Computational Linguistics, 20, 535-561.
- Lappin, S., and Shih, H. (1996). A generalized reconstruction algorithm for ellipsis resolution. In Proceedings of COLING-96 (pp. 687–692). University of Copenhagen.
- Lasnik, H. (1999). A Note on Pseudogapping. In Minimalist Analysis (pp. 151-174). Malden MA: Blackwell.
- Lechner, W. (2019). Comparative deletion. In J. van Craenenbroeck and T. Temmerman (Eds.), The Oxford Handbook of Ellipsis (pp. 19-45). Oxford University Press.
- Lechner, W. (2004). Ellipsis in Comparatives. Mouton de Gruyter.
- Lobeck, A. (1995). Ellipsis: Functional Heads, Licensing and Identification. Oxford University Press.
- Longobardi, G. (1994). Reference and Proper Names: A Theory of N-movement in Syntax and Logical Form. Linguistic Inquiry 25(4), 609-665.
- Marantz, A. (1997). No Escape from Syntax: Don’t try Morphological Analysis in the Privacy of Your Own Lexicon. In A. Dimitriadis, L. Siegel, C. Surek-Clark, and A. Williams (Eds.), Proceedings of the 21st Penn Linguistics Colloquium. Philadelphia: UPenn Working Papers in Linguistics, 201-225.
- Marantz, A. (1995). ‘Cat’ as a Phrasal Idiom: Consequences of Late Insertion in Distributed Morphology. MIT.
- McShane, M., and Babkin, P. (2016). Detection and Resolution of Verb Phrase Ellipsis. Linguistic Issues in Language Technology, 13(1), 1–34.
- Mel’čuk, I. (2006). Aspects of the Theory of Morphology. Mouton de Gruyter.
- Merchant, J. (1999). The Syntax of Silence: Sluicing, Islands and Identity in Ellipsis [Doctoral dissertation, University of California, Santa Cruz].
- Merchant, J. (2001). The syntax of silence: Sluicing, islands, and the theory of ellipsis. Oxford University Press.
- Merchant, J. (2013). Ellipsis phenomena (with J. van Craenenbroeck). In M. den Dikken (Ed.), The Cambridge Handbook of Generative Syntax (pp. 701-745). Cambridge University Press.
- Mitkov, R. (2001). Outstanding issues in anaphora resolution. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing (pp. 110–125). Springer.
- Mitkov, R. (2002). Anaphora Resolution. Longman.
- Nielsen, L. (2004). Verb phrase ellipsis detection using automatically parsed text. In COLING ’04: Proceedings of the 20th international conference on Computational Linguistics (pp. 1093–1099). Geneva.
- Ntelitheos, D. (2004). Syntax of elliptical and discontinuous nominals [Master's thesis, UCLA].
- Panagiotidis, P. (2002). Pronouns, clitics and empty nouns. John Benjamins.
- Panagiotidis, P. (2003). Empty Nouns. Natural Language and Linguistic Theory 21, 381-432.
- Peral, J., and Ferrández, A. (2000). Generation of Spanish Zero-Pronouns into English. In D. N. Christodoulakis (Ed.), Natural Language Processing (pp. 252–260). Springer.
- Pereltsvaig, A. (2013). Noun phrase structure in article-less Slavic languages: DP or not DP? Language and Linguistics Compass 7(3), 201–219.
- Raposo, E. (2018). Nominal ellipsis and prepositional modifiers in Portuguese: A phase-theoretic approach. Probus 30(2), 277-303.
- Real Academia Española (2009). Nueva Gramática de la Lengua Española. Espasa.
- Rello, L. (2010). Elliphant: A machine learning method for identifying subject ellipsis and impersonal constructions in Spanish [Master's thesis, University of Wolverhampton and Universitat Autònoma de Barcelona].
- Rello, L., and Ilisei, I. (2009). A Rule-Based Approach to the Identification of Spanish Zero Pronouns. In Proceedings of Student Research Workshop, RANLP (pp. 60–65).
- Ruda, M. (2016). NP Ellipsis (Effects) in Polish and Hungarian: FFs on Fs, Agree, and Chain Reduction. The Linguistic Review 33, 649–677.
- Saab, A. (2003, August 7-9). Identidad Morfológica Estricta e Inserción Tardía. II Encuentro de Gramática Generativa, Instituto de Lenguas Vivas Juan Ramón Fernández.
- Saab, A. (2008). Hacia una teoría de la identidad parcial en la elipsis [Doctoral dissertation, Universidad Nacional de Buenos Aires].
- Saab, A. (2012). On the notion of partial pro-drop in Romance languages and beyond. In M. Kato and F. Ordóñez (Eds.), The Morphosyntax of Portuguese and Spanish in Latin America (pp. 49-77). Oxford University Press.
- Saab, A. (2019). Nominal Ellipsis. In J. van Craenenbroeck and T. Temmerman (Eds.), The Oxford Handbook of Ellipsis. Oxford University Press.
- Sag, I. (1976). Deletion and Logical Form [Doctoral dissertation, MIT].
- Silberztein, M. (2005). NooJ: A Linguistic Annotation System for Corpus Processing. Demo. In Proceedings of the HLT/EMNLP2005 Conference.
- Silberztein, M. (2016). Formalizing Natural Languages: The NooJ Approach. Wiley-ISTE.
- van Craenenbroeck, J. (2010a). The syntax of ellipsis: Evidence from Dutch dialects. Oxford University Press.
- van Craenenbroeck, J. (2010b). Invisible last resort: A note on clefts as the underlying source for sluicing. Lingua, 120, 1714–1726.
- Williams, E. (1977). Discourse and Logical Form. Linguistic Inquiry, 8(1), 101-139.
- Zdravkovska-Adamova, B. (2017). Ellipsis in the Macedonian noun phrase. SEEU Review 12(2), 82-107.