Estructuración y clasificación automática de información: aplicación a una colección de textos médicos
by Jorge Morato
J. Morato, J.A. Moreiro, M. Velasco, J. Llorens. Estructuración y clasificación automática de información: aplicación a una colección de textos médicos. Revista Interamericana de Bibliotecología, Vol 24, No. 1 (2001)
This paper describes a tool that uses a multidimensional approach to structure and classify texts. The aim is to study the different sections of a document. The module was developed using filtering algorithms (N-grams) and classification algorithms (K-means and Chen). Documents were structured using linguistic and typographic markers together with statistical tools. To evaluate the method, full-text medical documents were collected from Medline, and a comparison tool, MeSH, was incorporated. A statistical and comparative analysis confirmed the need for, and the validity of, this type of approach. Finally, the paper proposes integrating the method into a module that optimizes the assignment of weights in the design of document classification and retrieval tools.
Phonetic Annotation of Paite: Description of Paite Sound System Using PRAAT and WavePad Softwares
by Atanu Saha
The article describes the documentation of the Paite language, spoken in the state of Manipur in India and in the bordering country of Myanmar in South East Asia. The language is considered endangered, as cited in the UNESCO Atlas of the World's Languages in Danger. The work is a primary investigation into the sound system of the language and discusses the transcription of the phonemes with the help of the PRAAT software and the preparation of the metadata in MS Excel format.
Documenting the sound of a language
by Atanu Saha
This article was published at http://www.7sisters.in/index.html in May 2011.
The phonologist and the design of documentary fieldwork: Assuming a role in data production from the outset
by Erich Round
Paper presented at the 14th Manchester Phonology Meeting, 27 May, 2006.
Phonology and fieldwork can most often be found interfacing within a methodological virtuous circle: the findings of fieldwork provide input to phonological analysis and theory, which in turn provide insightful questions to take back to the field. In the case of endangered languages, however, this process can be cut off before even a single full cycle has been completed. In light of this, there is an important role to be played by the phonologist in designing fieldwork strategies which ensure (i) that the initial production of data is as rich as possible, even in the absence of input from advanced phonological analysis, and (ii) that such data is delivered in a ‘user friendly’ format for those who will provide the subsequent theoretical analysis, thereby allowing the virtuous circle to be completed without unnecessary delay. This presentation reports on a recent attempt to implement such ideas within a documentation project carried out with the last speakers of the moribund Australian (non-Pama-Nyungan) language, Kayardild.
Already in a highly precarious position, Kayardild is unlikely to survive much longer than five or ten more years. When it ceases to be spoken, the entire Tangkic language family will have become extinct, and while this window of five or so years provides invaluable time for research, it is not long. In response to this, features were built into the design of a documentation project run in 2005 with a view both to practical feasibility and to the production of data in a form as outlined above. Primary among these was the enrichment of interlinear text glosses through the addition of two tiers of prosodic information; secondary was the adoption of a phonologically shrewd approach to vocabulary documentation. Neither of these strategies required any particularly advanced phonological training on the part of the field researcher – that is, they should be relatively easy to incorporate into other projects – and despite their simplicity, they appear to have proven successful.
In the presentation, then, I discuss the precise nature of the rhythmic and intonational transcriptions made for Kayardild, outline how they have already proven useful, and comment on how the methods could be extended to other field projects. I also offer some observations on mundane but nevertheless important details which can impact on the effectiveness of phonological/phonetic data collection.
The talk should be of interest to any phonologist in a position to offer advice to fieldworkers on the collection of phonological data – that is, to most of us.
Expressing Location In Tlacolula Valley Zapotec
Lillehaugen, Brook Danielle. 2006. Expressing Location in Tlacolula Valley Zapotec. Ph.D. dissertation: UCLA.
Hacia Una Tipología De Locativos De Partes
Lillehaugen, Brook Danielle and Pamela Munro. 2008. "Hacia una tipología de locativos de partes." In Memorias: IX Encuentro Internacional de Lingüística en el Noroeste, Tomo 2. Rosa María Ortiz Ciscomani, Ed. Hermosillo, Sonora: Editorial Unison. pp. 231 – 252.
Partes del cuerpo y la codificación semántica de ENTIDAD y LUGAR en el zapoteco del valle de Tlacolula
in Lingüística Mexicana, vol. III, Núm. 2, 2006.
Body Parts and the Encoding of THING and PLACE In Zapotec
Lillehaugen, Brook Danielle and John O. Foreman. 2009. "Body parts and the encoding of THING and PLACE in Zapotec." In Studies in Role and Reference Grammar. Lilián Guerrero, Sergio Ibáñez-Cerda, Valeria A. Belloro, eds. México: Universidad Nacional Autónoma de México. pp. 203-230.
