Tagarela
TAGARELA
Detmar Meurers & Ramon Ziai
Berlin. October 15, 2009

categories (Long 1991, 1996; Ellis 1994; Schmidt 1995; Lyster 1998; Lightbown & Spada 1999; Norris & Ortega 2000; Schulz 2002)
- What role does the system play in teaching?
  → Self-guided activities accompanying teaching
- What should the system interfaces look like?
  → Similar to traditional workbook exercises, plus audio.
Feedback on Wrong Word
[Figure: system architecture diagram. Recoverable labels:]
- Web Interface: Student Input, Feedback Message
- Expert Module
- Analysis Manager
- Linguistic Analysis sub-modules
  - Form Analysis: tokenizer, parser
  - correct answer, token matcher, canonic matcher, pos matcher
- Strategic Analysis sub-modules: task strategies, task appropriateness, transfer
- Feedback Generation: Feedback selection, Ranking
- Instruction Model
- Activity Model
- Student Model: Personal information, Interaction Preferences, Language Competence
The three models
- model of domain knowledge (linguistic knowledge)
- learner model
- instruction/activity model
- What is the point of learner and activity models?

NLP analysis modules in TAGARELA
- Form Analysis:
  - tokenizer (cliticization, contractions, abbreviations)
  - lexical/morphological lookup: returns multiple analyses based on CURUPIRA lexicon (Martins et al. 2006)
  - disambiguator: finite state disambiguation rules narrow down lexical information, in the spirit of Constraint Grammar
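The lookup-then-disambiguate step listed under Form Analysis can be sketched roughly as follows. This is only an illustration, not TAGARELA's implementation: the toy lexicon, the tag names, and both rules are assumptions made for the example, whereas the real system draws its analyses from the CURUPIRA lexicon and uses finite state rules.

```python
# Toy lexicon (hypothetical entries and tags): each form lists all its analyses
# as (lemma, tag) pairs.
LEXICON = {
    "a": [("a", "DET"), ("a", "PREP"), ("a", "PRON")],
    "casa": [("casa", "NOUN"), ("casar", "VERB")],
}

def lookup(token):
    """Return every analysis listed for the token (an empty list if unknown)."""
    return list(LEXICON.get(token.lower(), []))

def disambiguate(readings):
    """Narrow down readings with two illustrative context rules."""
    result = [list(r) for r in readings]
    # Rule 1 (assumed): a possible determiner directly before a possible noun
    # keeps only its determiner reading.
    for i in range(len(result) - 1):
        has_det = any(tag == "DET" for _, tag in result[i])
        next_noun = any(tag == "NOUN" for _, tag in result[i + 1])
        if has_det and next_noun:
            result[i] = [r for r in result[i] if r[1] == "DET"]
    # Rule 2 (assumed): directly after an unambiguous determiner, discard verb
    # readings, but never remove the last remaining reading.
    for i in range(1, len(result)):
        prev = result[i - 1]
        if len(prev) == 1 and prev[0][1] == "DET":
            kept = [r for r in result[i] if r[1] != "VERB"]
            if kept:
                result[i] = kept
    return result

def analyze(tokens):
    """Look up every token, then apply the disambiguation rules."""
    return disambiguate([lookup(t) for t in tokens])
```

Rules of this kind only remove readings and never delete the last one, which is the general idea behind Constraint Grammar style disambiguation.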
- Allow the analysis manager to flexibly employ NLP modules relevant to a particular activity.
- To support a flexible control structure, the data structures serving as input and as output for the analysis modules need to be uniform and explicit.
- NLP analysis = a process of enriching the learner input with annotations (parallel to XML-based corpus annotation)
- In addition to the information obtained by analyzing the input, we need information about the activity.

Activities can be characterized and differ in:
- task specification
  - e.g.: listen, read, write, comment, complete
- level
  - e.g.: basic, intermediate, advanced
- expected input
  - e.g.: word, phrase, sentence
- generally: improve the required skills and abilities
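One way to picture the annotation-based setup is a single data structure that every module both reads and enriches. The sketch below is a minimal illustration under that assumption; the names `Annotation`, `AnnotatedInput`, `tokenize`, `mark_capitalized`, and `run` are hypothetical and do not come from TAGARELA.

```python
from dataclasses import dataclass, field

@dataclass
class Annotation:
    layer: str   # name of the annotation layer, e.g. "token"
    begin: int   # character offset where the span starts
    end: int     # character offset just past the span
    value: str

@dataclass
class AnnotatedInput:
    text: str
    annotations: list = field(default_factory=list)

    def layer(self, name):
        """Return a new list with all annotations of one layer."""
        return [a for a in self.annotations if a.layer == name]

def tokenize(doc):
    """Add one 'token' annotation per whitespace-separated chunk."""
    pos = 0
    for chunk in doc.text.split():
        begin = doc.text.index(chunk, pos)
        end = begin + len(chunk)
        doc.annotations.append(Annotation("token", begin, end, chunk))
        pos = end
    return doc

def mark_capitalized(doc):
    """Toy second module: reads the token layer and adds a layer of its own."""
    for tok in doc.layer("token"):
        if tok.value[:1].isupper():
            doc.annotations.append(
                Annotation("capitalized", tok.begin, tok.end, tok.value))
    return doc

def run(modules, text):
    """Apply the given modules in order to the same annotated input."""
    doc = AnnotatedInput(text)
    for module in modules:
        doc = module(doc)
    return doc
```

Because every module has the same signature, a manager can choose any subset of modules per activity, for example `run([tokenize, mark_capitalized], "O Amazonas fica no região norte.")`.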
Where it matters for processing
- Which linguistic properties (incl. errors) of the learner input can actually be observed in a given activity?
- [...] to select as likely error cause (or other relevant aspect)?
- Which of the identified properties is most likely to [...]
- Which of the identified errors should be the focus of the feedback, given the activity and its specific pedagogical goals?
- Feedback Strategy: Which strategy does it choose? E.g.:
  - explicit feedback on form for FIBs
  - scaffolding for reading comprehension (i.e., encouraging [...]

Property identification in TAGARELA
- Rephrasing: as above + syntactic processing and basic content assessment (correct answer, token matcher)
- Reading: as above + all content analysis modules
- Why not always run everything?
  - “Don’t guess what you know.”
  - [...] types of variation, and the potential errors NLP needs to detect,
    - the more specific information we can diagnose
    - with higher reliability
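The cumulative "as above + ..." pattern suggests that the activity model determines which modules run. The following sketch shows one possible encoding. The `kind` field, the baseline module list for fill-in-the-blank activities, and the reading of "all content analysis modules" as the four matchers named in the architecture diagram are assumptions, since the slides do not spell them out.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Activity:
    task: str            # e.g. "listen", "read", "write", "comment", "complete"
    level: str           # e.g. "basic", "intermediate", "advanced"
    expected_input: str  # e.g. "word", "phrase", "sentence"
    kind: str            # hypothetical label: "fib", "rephrasing", "reading"

# Assumed baseline for the simplest activity type ("as above" in the slides).
BASE = ["tokenizer", "lexical lookup", "disambiguator"]
# Rephrasing: baseline + syntactic processing and basic content assessment.
REPHRASING = BASE + ["parser", "correct answer", "token matcher"]
# Reading: everything above + the remaining content analysis modules (assumed).
READING = REPHRASING + ["canonic matcher", "pos matcher"]

MODULES_BY_KIND = {"fib": BASE, "rephrasing": REPHRASING, "reading": READING}

def select_modules(activity):
    """Return only the modules relevant to this activity, in run order."""
    return list(MODULES_BY_KIND[activity.kind])
```

Running fewer modules for constrained activities follows the "Don't guess what you know" principle: when the activity already limits what the learner can produce, there is nothing to gain from analyses that could only introduce noise.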
- The system was used by beginning Portuguese students at The Ohio State University.
Identifying tokens (II)

Properties of Portuguese
Tokenization
- Certain Portuguese words are syntactically complex.
  - Contraction: preposition + determiner/pronoun
  - Encliticization:
    - comprá-lo = comprar (to buy) + o (it)
    - comprei-a = comprei (bought) + a (it)
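To make the two phenomena concrete, here is a small sketch of how a tokenizer might expand such words into their syntactic parts. It is an assumption-laden illustration rather than TAGARELA's tokenizer: the contraction table is deliberately tiny, and the enclitic rule only restores infinitives before -lo/-la forms (as in comprá-lo), leaving other verb forms untouched.

```python
import re
import unicodedata

# Tiny illustrative table of preposition + determiner contractions.
CONTRACTIONS = {
    "no": ("em", "o"),
    "na": ("em", "a"),
    "do": ("de", "o"),
    "da": ("de", "a"),
}

# Verb form, hyphen, then one enclitic pronoun.
ENCLITIC = re.compile(r"^(\w+)-(l?[oa]s?|me|te|se|lhes?|nos|vos)$")

# Infinitives drop their final -r before -lo/-la and mark the vowel instead.
INFINITIVE_ENDINGS = {"á": "ar", "ê": "er", "i": "ir"}

def split_token(surface):
    """Return the syntactic parts of one surface token, lowercased."""
    form = unicodedata.normalize("NFC", surface).lower()
    if form in CONTRACTIONS:
        return list(CONTRACTIONS[form])
    match = ENCLITIC.match(form)
    if not match:
        return [form]
    verb, clitic = match.groups()
    is_l_form = clitic.startswith("l") and not clitic.startswith("lh")
    if is_l_form and verb[-1] in INFINITIVE_ENDINGS:
        verb = verb[:-1] + INFINITIVE_ENDINGS[verb[-1]]
        clitic = clitic[1:]  # lo -> o, la -> a
    return [verb, clitic]
```

With this sketch, `split_token("comprá-lo")` yields the parts `comprar` and `o`, and `split_token("comprei-a")` yields `comprei` and `a`, mirroring the two examples on the slide.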
- Learner input: O Amazonas fica no região norte.
- The system needs to connect the surface form provided by the student with the system analysis of this input.
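A common way to keep surface form and analysis connected is to store character offsets on every analysis token. The sketch below assumes such a representation; the `Token` fields and the helper names are hypothetical. In the learner input, the contraction "no" (em + o) precedes the feminine noun "região", so the problematic part is the determiner inside the contraction, while feedback still has to point at the word the student actually typed.

```python
from dataclasses import dataclass

@dataclass
class Token:
    begin: int     # offset of the first character in the learner input
    end: int       # offset just past the last character
    analysis: str  # hypothetical analysis label, here simply a lemma

def surface(text, token):
    """Recover the learner's original wording for an analysis token."""
    return text[token.begin:token.end]

def feedback_span(text, token):
    """Quote the learner's own wording when reporting on a token."""
    return f'"{surface(text, token)}" (characters {token.begin}-{token.end})'

learner_input = "O Amazonas fica no região norte."

# The contraction "no" yields two analysis tokens that share one surface span.
em = Token(16, 18, "em")
o = Token(16, 18, "o")
```

Both `surface(learner_input, em)` and `surface(learner_input, o)` return the typed form `no`, so a diagnosis attached to the determiner can still be shown to the learner in terms of their own input.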
Example Token Representation
[Figure: attribute-value structure of a Token with begin 0 and end 2]

Interpreting tokens: Accents (I)
- meaning
  - avô (grandfather) – avó (grandmother)
Mismatches in the interpretation of accents
- Learners perceive the unaccented and accented [...]
⇒ The student did not include the main verb.
⇒ The lack of an accent is a spelling error.

Addressing the Interpretation of Accents
- Treat accented and unaccented characters parallel to common L1-transfer phonological confusions.
- Where this is the case, provide dedicated feedback alerting the learner to this confusion.
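Detecting that a learner form differs from a target form only in its accents can be sketched with Unicode normalization. This is a minimal illustration, not the system's actual matcher: the function names and the three result labels are assumptions, and the accent stripping also removes other diacritics such as the cedilla and the tilde.

```python
import unicodedata

def strip_accents(word):
    """Remove all combining marks (accents, tilde, cedilla) from a word."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def compare(learner, target):
    """Classify the relation between a learner form and a target form."""
    learner = unicodedata.normalize("NFC", learner)
    target = unicodedata.normalize("NFC", target)
    if learner == target:
        return "match"
    if strip_accents(learner) == strip_accents(target):
        return "accent mismatch"
    return "different word"
```

An "accent mismatch" result is the case where dedicated feedback about the accent confusion would apply, for instance when the learner types `avo` or `avô` where `avó` was expected.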
- [...] by the system is inadequate.
- We discussed two such mismatches for Portuguese tokens in TAGARELA:
  - identification of tokens: contraction, encliticization
  - interpretation of tokens: accented characters

- provide opportunities for students to practice their listening, reading, and writing skills
- provide individualized feedback to the learner
- foster learner awareness of language forms and categories
- provide contextualized activities integrating meaning and form
References
Amaral, L. (2007). Designing Intelligent Language Tutoring Systems: integrating Natural Language Processing technology into foreign language teaching. Ph.D. thesis, The Ohio State University.
Amaral, L. & D. Meurers (2005). Towards Bridging the Gap between the Needs of Foreign Language Teaching and NLP in ICALL. In A. Pedros-Gascon (ed.), Proceedings of the 8th Annual Symposium on Hispanic and Luso-Brazilian Literatures, Linguistics, and Cultures.
Amaral, L. & D. Meurers (2006). Where does ICALL Fit into Foreign Language Teaching? 23rd Annual Conference of the Computer Assisted Language Instruction Consortium (CALICO), May 19, 2006. University of Hawaii. URL http://purl.org/net/icall/handouts/calico06-amaral-meurers.pdf.
Amaral, L. & D. Meurers (2007a). Conceptualizing Student Models for ICALL. In C. Conati & K. F. McCoy (eds.), User Modeling 2007: Proceedings of the Eleventh International Conference. Wien, New York, Berlin: Springer, Lecture [...]
[...] Language Learning. Computer-Assisted Language Learning 21(4), 323–338. URL http://purl.org/dm/papers/amaral-meurers-call08.html.
Amaral, L. & D. Meurers (2009). Little Things With Big Effects: On the Identification and Interpretation of Tokens for Error Diagnosis in ICALL. CALICO Journal 27(1).
Bailey, S. & D. Meurers (2006). Exercise-driven selection of content matching methodologies. Peer reviewed conference presentation. EUROCALL’06. September 6, 2006. University of Granada.
Bailey, S. & D. Meurers (2008). Diagnosing meaning errors in short answers to reading comprehension questions. In J. Tetreault, J. Burstein & R. D. Felice (eds.), Proceedings of the 3rd Workshop on Innovative Use of NLP for Building Educational Applications, held at ACL 2008. Columbus, Ohio: Association for Computational Linguistics, pp. 107–115. URL http://aclweb.org/anthology-new/W/W08/W08-0913.pdf.
Bick, E. (2000). The Parsing System “Palavras”: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. Aarhus University Press.
Karlsson, F., A. Voutilainen, J. Heikkilä & A. Anttila (eds.) (1995). Constraint Grammar: A Language-Independent System for Parsing Unrestricted Text. No. 4 in Natural Language Processing. Berlin and New York: Mouton de [...]
Long, M. H. (1991). Focus on form: A design feature in language teaching methodology. In K. D. Bot, C. Kramsch & R. Ginsberg (eds.), Foreign language research in cross-cultural perspective, Amsterdam: John Benjamins, pp. [...]
[...] error types and learner repair in immersion classroom. Language Learning 48, 183–218.
Martins, R., R. Hasegawa & M. das Graças Nunes (2006). Curupira: a functional parser for Brazilian Portuguese. In Computational Processing of the Portuguese Language, 6th International Workshop, PROPOR. Lecture Notes in Computer Science 2721. Faro, Portugal: Springer. URL [...]
[...] 417–528.
Schmidt, R. (1995). Consciousness and foreign language: A tutorial on the role of attention and awareness in learning. In R. Schmidt (ed.), Attention and awareness in foreign language learning, Honolulu: University of Hawaii Press, pp. 1–63.
Schulz, R. A. (2002). Hilft es die Regel zu wissen um sie anzuwenden? Das [...]
Ziai, R. (2009). A Flexible Annotation-Based Architecture for Intelligent Language Tutoring Systems. Master’s thesis, Universität Tübingen, Seminar für Sprachwissenschaft.