ABOUT THE COURSE: This is a course on language documentation, a sub-field of linguistics. The growing global recognition of indigenous knowledge systems has brought an urge to document the dying languages. There are different projects going on all over the world including India where scholars from different fields are working together to save and document the dying languages. The aim of this course is to introduce the students to the issues related to language endangerment and various aspects of language documentation focused on the Indian scenario. The course along with making the students aware of the theories related to language documentation will also train them in data collection, data processing and creating various applications with the available data.INTENDED AUDIENCE: UG & PG Students of Linguistics & NLP, Research Scholars of Linguistics and related disciplinesPREREQUISITES: Not required but it will be helpful for the students if they have completed an introductory course in linguistics.INDUSTRY SUPPORT: Companies working on NLP and smaller languages, e.g. Google, Microsoft, Facebook, Flipkart etc.
Tools and Technologies of Language Documentation
Indian Institute of Technology, Kharagpur and NPTEL via Swayam
-
32
-
- Write review
Overview
Syllabus
Week 1:
1. Introduction:
1.1. What is Language Endangerment?
1.2. Language vitality and shift.
1.3. Linguistic diversity, biodiversity and wellbeing.
1.4. UNESCO and language endangerment.
Week 2:
2. What is Language Documentation?
2.1. What to document?
2.2. How to document?
2.3. How to collect ethno-linguistic data?
2.3. How to make tools for data collection?
2.3.1 Preparing picture books & videos
Week 3:
3. Different ways of data collection:
3.1. Designing quantitative questionnaire
3.2. Designing qualitative questionnaire
Week 4:
4. In the Field:
4.1. What are the dos and don’ts of the field?
4.2. How to create metadata in the field?
4.3. How to get prepared with the equipment?
4.3. How to be ethical in the field?
Week 5:
5. After the Field:
5.1. How to clean the field data?
5.2. How to create metadata after the field?
5.3. How to process the data?
5.3.1. Sound Slicing
Week 6:
5.3.2. IPA Transcription
5.3.3. Glossing Rules
Week 7:
6. How is language documentation useful in language revitalization?
6.1. Creating digital dictionaries
Week 8:
6.2. Designing pedagogical materials
6.3. Creating language games & pedagogical Apps
7. Case studies of Language Documentation from India and other parts of the world
1. Introduction:
1.1. What is Language Endangerment?
1.2. Language vitality and shift.
1.3. Linguistic diversity, biodiversity and wellbeing.
1.4. UNESCO and language endangerment.
Week 2:
2. What is Language Documentation?
2.1. What to document?
2.2. How to document?
2.3. How to collect ethno-linguistic data?
2.3. How to make tools for data collection?
2.3.1 Preparing picture books & videos
Week 3:
3. Different ways of data collection:
3.1. Designing quantitative questionnaire
3.2. Designing qualitative questionnaire
Week 4:
4. In the Field:
4.1. What are the dos and don’ts of the field?
4.2. How to create metadata in the field?
4.3. How to get prepared with the equipment?
4.3. How to be ethical in the field?
Week 5:
5. After the Field:
5.1. How to clean the field data?
5.2. How to create metadata after the field?
5.3. How to process the data?
5.3.1. Sound Slicing
Week 6:
5.3.2. IPA Transcription
5.3.3. Glossing Rules
Week 7:
6. How is language documentation useful in language revitalization?
6.1. Creating digital dictionaries
Week 8:
6.2. Designing pedagogical materials
6.3. Creating language games & pedagogical Apps
7. Case studies of Language Documentation from India and other parts of the world
Taught by
Prof. Bornini Lahiri, Prof. Dripta Piplai (Mondal)