MLE Linear Regression (July 2024, Python):
Brief explanation of how we can derive the Ordinary Least Squares formula from a normal (gaussian) distribution Maximum Likelihood Estimation.
[link]
NN.c (July 2024, C):
Feedforward neural network & other ML models implemented from scratch in pure C.
[link]
Transformers Learn to Associate Efficiently Tokenizable Code Representations (May/June/July 2024, Python, PyTorch, Transformers):
My second research internship at the LaBRI (Bordeaux Computer Science Laboratory). This time, I focused on a project at the crossroads of Deep Learning and Software Engineering. My work consisted in finding ways to train and use Large Language Models more effectively for Software Engineering. I learned how to train Transformers from scratch, as well as how to fine-tune or internally modify pre-trained models and conduct experiment on them. If you understand French, I gave a quick talk about my work at the end of my internship, and this presentation was filmed.
[presentation] [link]
FuncSkeleton (June 2024, Python):
Python package to generate the CFG "skeleton" of a function, for Machine Learning purposes. It is a tool built during my internship that I used to create a dataset from CodeSearchNet.
[link]
MMA Predictions (April 2024, Python, Pandas, Sklearn, PyTorch):
Using various machine learning algorithms on UFC fights data to try to predict outcomes such as the winner, the result or the finish round.
[link] [paper]
Thrift Store (April 2024, Java):
Concurrent thrift store simulation in Java.
[link]
Three-Address Code Compiler (February 2024, Java):
Compiler taking an hypothetical (simplified) programming language source code file and converting it to three-address code for a given interpreter.
[link]
Mail.tm (January 2024, Python):
Simple wrapper around the Mail.tm API in Python to interact easily with it.
[link]
Feistel Network Framework (September 2023, Python):
Feistel network framework implementation where users can provide their own round and key derivation functions.
[link]
Market Exchange (August 2023, Go):
Market exchange simulation project where users can place orders via an API.
[link]
CRT (July 2023, OCaml):
Simple Chinese Remainder Theorem implementation for solving systems of modular congruences.
[link]
LZSS (May 2023, C):
Lempel–Ziv–Storer–Szymanski algorithm implementation in C for file compression & decompression.
[link]
Busy Beaver (April 2023, Go):
Simple 2-state Turing machine implementation in Go to experiment on the busy beaver problem. It is as an appendix for a small article (in french) that I wrote as part of an assignment for my university.
[link]
Advent of OCaml (March 2023, OCaml):
My attempt to Advent of Code 2023 using only OCaml.
[link]
Huffman Coding Calculator (March 2023, C++):
Huffman coding gain calculator for text files, it encodes a given file and computes the difference of size.
[link]