Damir Cavar's Homepage

Logo

Damir Cavar is a Natural Language Processing and AI, and Classical and Quantum Computing scientist

curriculum vitae

publications

talks

research

teaching

code

blog

View My GitHub Profile

CSCI-B 659 Topics in Artificial Intelligence - Advanced Natural Language Processing

LING-L 645 Advanced Natural Language Processing

This is the course page for Topics in Artificial Intelligence / Advanced Natural Language Processing (NLP) by Damir Cavar.

Damir Cavar

– August 2025 –


Course Arrangements

Meeting time: Tuesday and Thursday, 11:10 AM - 12:25 PM

Course website: Assignments, slides, and other material will be posted on Canvas.

Credits: 3

Instructor: Associate Professor, Dr. Damir Cavar

Office: Ballantine Hall (BH) 511

Office hours: Thursday 1-2 PM and by arrangement

Course Description

Symbolic, statistical, and neural methods are at the core of Computational Linguistics and Natural Language Processing (NLP) in research and applications. This course introduces advanced techniques for NLP based on statistical modeling and machine learning algorithms, including neural network Deep Learning approaches, and transformers, bringing them together with symbolic and knowledge-based systems. We aim to bridge research and insights from language and linguistic disciplines and the application of NLP and linguistic technologies from the computer and information science perspective.

This course will cover fundamental notions in probability and information theory, focusing on the concepts needed for common NLP tasks. We will discuss N-gram models, exemplified by an approach to document classification or Part-of-Speech (PoS) tagging. In the next step, we will extend to probabilistic methods and to sentiment analysis. We will study advanced neural network approaches (Deep Learning) for NLP, used for various speech and language processing tasks.

Additionally, we will cover concrete topics such as information extraction and graph-based knowledge representations used for text classification, natural language understanding, dialog systems (AIs), or information retrieval, and how to use various NLP methods in the context of such systems. There is space to focus in part on topics of interest related to the choice of concrete applications of NLP methods.

We are discussing advanced hybrid NLP methods, covering symbolic, statistical, and neural network methods in the context of particular tasks. All the methods we use apply to a range of tasks in NLP. The mission is to teach students techniques, algorithms, and existing environments to enable them to develop their own strategies to analyze linguistic phenomena using language data, to apply NLP in the domain of information extraction from unstructured data, or to research in the field of AI, psycholinguistic or cognitive language faculty, verbal behavior, and general speech and language technologies.

Learning Outcomes

Crucial aspects of course outcomes are:

This course provides an essential platform for further work in NLP.

Assessment

Grades are based on the following schema:

Coding and Computational Experiments

Students are encouraged to bring their laptops or other computational devices to class.

The readings and exercises will be accompanied by practical examples using:

Literature

Main textbook

We will be using the most recent 3rd edition of the textbook and additional material shared on Canvas.

Additional books and articles

General Information and Notes

Participation in the NLP-Lab Projects

Students are welcome to participate in NLP-Lab meetings and projects after consultation with the instructor. See for more details: https://nlp-lab.org/

Disclaimer

This syllabus is subject to change and likely will change. All critical changes will be made in writing, with ample time for adjustment.


(C) 2025 by Damir Cavar