Skip to content

VincentCCL/MTAT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

202 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repo contains the scripts that were created for the course Machine Translation Advanced Topics.

Most of the scripts were vibe-coded with ChatGPT with lots of testing and back-and-forth conversation.

Chapters in the course

You can find the slides of each chapter in the slides directory.

Chapter 1. Introduction and MT pre-neural history

The course starts with an introduction to MT and a description of what happened before the NMT paradigm.

Chapter 2. Data Preparation

Chapter 3. MT Evaluation

Translation through Python with commercial engines and evaluation with most common metrics

Chapter 4. RNN Language Modeling

Before we start on MT, we explain RNN language modeling with a toy example and later expand it to a larger language model.

Chapter 5. RNN Machine Translation

Chapter 6. Subwording and Transformers

Chapter 7. Pretrained Encoder-Decoders

Current work

We are in the process of integrating and testing all the different transformer encoder-decoder scripts from chapters 6 and 7 into a single script. Current version (unfinished) is in [code/mtat.py]

About

Scripts for the course Machine Translation Advanced Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors