|
Teacher
|
LANCIONI GIULIANO
(syllabus)
The course will offer an introduction to Arabic corpus linguistics and will present its problems, metodology and results. Students will be introduced to issues in annotation of linguistic texts in Standard and spoken Arabic, with a special focus on mixed and informal texts. A theoretical section will be accompanied by an empirical application on some of the reference open source tools for the semi-automatic analysis of Arabi texts. Students will also be introduced to principles and applications of standards in the linguistic annotation of texts (TEI, Universal Dependencies)
(reference books)
Bassiouney, R., 2009. Arabic sociolinguistics: topics in diglossia, gender. identity, and politics. Georgetown University Press, Washington, DC.
Boella, M., Romani, F.R., Al-Raies, A., Solimando, C., Lancioni, G., 2011. The SALAH Project: Segmentation and Linguistic Analysis of ḥadīṯ Arabic Texts, in: Information Retrieval Technology, Lecture Notes in Computer Science. Presented at the Asia Information Retrieval Symposium, Springer, Berlin, Heidelberg, pp. 538–549.
Lancioni, G., Joosse, N.P., 2016. The Arabic Diatessaron Project: Digitalizing, Encoding, Lemmatization. JRMDC 5, 205–227.
Pasha, A., Al-Badrashiny, M., Diab, M., Kholy, A.E., Eskander, R., Habash, N., Pooleery, M., Rambow, O., Roth, R., 26-31. MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic, in: Chair), N.C. (Conference, Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., Piperidis, S. (Eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14). European Language Resources Association (ELRA), Reykjavik, Iceland.
Ryding, K.C., 1991. Proficiency Despite Diglossia: A New Approach for Arabic. The Modern Language Journal 75, 212–218.
TEI Consortium (Ed.), 2018. TEI P5: Guidelines for Electronic Text Encoding and Interchange. 5.0. TEI Consortium.
|