Large Language Models Independent Study

The report for this quarter and the previous quarter are uploaded as a single document in the name "Independent_Study_Compilation.pdf".

All the code files related to the model are present in the folder "SumGenToBT". Due to the large model size, it wasn't possible to upload the checkpoints and the final weights of the model on Github. However, those files are present on the VM, and can be shared if and when required.

"SumGenToBT/sumgen/base_finetuned" contains all the output files generated by the model post-training. The model was trained using the "run.sh" file present in "sumgen" folder, and that was the point I was stuck at. Ideally, what we need to do to get it working is first perform java-to-english translation, send the results for processing (binarization), and get those outputs into the model for english-to-python translation. The code was doing that impicitly somewhere, which I wasn't able to find out and that's why this part had me stymied. Another thought was to just use the evaluation code, but the evaluation code was using the back-translation, and the input was given accordingly. Deeper understanding of what was happening behind the curtains was required to pin-point exactly what we should be adding in our explicit call.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Reading Material		Reading Material
SumGenToBT		SumGenToBT
GPT_2.ipynb		GPT_2.ipynb
GPT_2.pdf		GPT_2.pdf
Independent_Study_Compilation.pdf		Independent_Study_Compilation.pdf
Initial Reading List.pdf		Initial Reading List.pdf
README.md		README.md
Research Paper Notes.pdf		Research Paper Notes.pdf
Research Papers Preliminary Information.pdf		Research Papers Preliminary Information.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Large Language Models Independent Study

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Large Language Models Independent Study

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages