University of Southern California

CSCI 544 — Applied Natural Language Processing

Final paper: NLP prototype design proposal

Updates

Due: November 26, 2020 (extended from November 24)

Every student will receive a personalized submission link through Crowdmark. Do not share the link with others: it is linked to your email. The completed assignments will be accepted only through the online system.

Overview

The prototype design proposal is an in-depth activity, where students describe the design for a future natural language processing application. The proposal can be on any aspect of natural language processing, for any human language. You will formulate a research question, identify potential resources and methods to address the question, describe a procedure for evaluating the prototype.

Proposal structure

The proposal should be a document of about 1500 words, written in English in good academic style. Proposals that substantially exceed this length (above 1600 words) will be penalized. The structure of the document should be as follows.

If you need to give examples of text in languages other than English, please use the following multi-line format, to make the examples readable to English speakers. Below is an example for how to present a sentence in Hindi.

किसने दवाई को खरीदा (the original text in its native script)
kisne davaaii kokhariidaa (a transcription into Latin script)
whoERG medicine ACCbought (a word-by-word gloss)
‘Who bought the medicine?’ (a translation into English)

The explanations on the right (in parentheses) are part of the instructions: they do not need to be repeated with the example. The second line (transcription into Latin script) is not needed if the language natively uses a version of the Latin script.

Grading

The grade for the assignment will be broken down as follows.

The prototype design proposal counts for 20% of the overall course grade.

Notes

The following are edited versions of responses to student questions about the assignment.