Projects — Charles Vielzeuf

Ongoing projects that are my main focus right now.

Genome language models for epigenomic predictions and distal regulation

April 2026 – Now

At RIKEN AIP, Statistical Genetics Team, I am looking at large-context DNA sequence models such as Evo2 on NVIDIA GH200 for epigenomic readouts and distal regulation, combining Evo2HiC (sequence and Hi-C) with enhancer–promoter interaction models such as EPInformer.

Evo 2 — DNA double helix and biological sketches

Overview slide: genome language models, Evo2, and Hi-C integration

Transcription factors binding to DNA regulatory sites

Slides: view.

Supervised by R. Sakurai and G. Tamiya.

ROADEF Challenge proposed by Orange Research

April 2026 – Now

Several-month optimization challenge proposed by ROADEF (French Operations Research Society) and Orange Research.

The T-Adaptive Segment Routing (T-ASR) problem consists of routing a set of traffic demands on an IP/MPLS network under planned intervention scenarios (with unavailable links). Segment Routing, makes it possible to deviate traffic from shortest paths by adding waypoints, using Networktools as a baseline.

Per-thread proxy MLU during parallel local search (best solid, current dotted)

Subject accessible here.
Challenge website: here.

coolDPLP — GPU-agnostic primal-dual LP solver in Julia

April 2026 – Now

Julia implementation of a primal-dual linear programming (PDLP) solver with GPU-agnostic kernels: JuliaDecisionFocusedLearning/CoolPDLP.jl.
I am currently focusing on the crossover step — moving from an interior-point solution to a basic feasible solution at a vertex of the feasible polytope. A first, simple version uses snapshots of intermediate iterates; a more advanced phase, inspired by the NVIDIA cuOpt simplex crossover, is planned next.

CharlesGPT — home-made documented chatbot

April 2026 – Now

End-to-end project to explore how a small generative model works in practice: web chat UI, FastAPI backend, local GPT-2 inference, token-level metrics (top‑k, embeddings, perplexity) and adjustable sampling parameters.

CharlesGPT demo: tokenized chat, sampling parameters, and collapsible metrics panels

Here for a CharlesGPT demo.

Project page →

Planned projects I want to work on once I have the time. Feel free to ask questions or propose collaborations — send a message.

modularyNN — GPU acceleration

Planned

Extend modularyNN with GPU kernels for benchmarking, testing, and hands-on practice alongside the existing from-scratch CPU implementation.

Repository: charles-vzf/modularyNN.

Lean formalization — shift-commutation equivalence for cellular automata

Planned

Working through Lean tutorials and aiming to formalize the Curtis–Hedlund–Lyndon equivalence from my TIPE project: a global map on a configuration space is a cellular automaton if and only if it is continuous and commutes with the shift action (σ^v ∘ F = F ∘ σ^v).

KIRO — higher scores and stronger heuristics

Planned

Build on past KIRO participations (see completed entry): push for a better score, with new move operators and column generation ideas.

Repository from earlier work: Projet_RO_RTE.

Forum engagement bot — RL experiment

Planned

Documented side project: scrape content from an active online community (e.g. image-editing forum), then train a reinforcement learning agent to post edited images and optimize engagement — comments first, then broader reach. Survey similar open-source projects on GitHub; run in an isolated environment and document progress here.

Contact form — human challenge anti-spam

Planned

The contact form has no CAPTCHA, CSRF token, or advanced spam filtering yet. Plan: design an original challenge that is easy for humans but hard for bots — not a standard Turnstile/reCAPTCHA widget — possibly powered by a small model whose prompts or scoring improve over time via reinforcement learning on real user responses.

List of selected projects, some of them have their dedicated page you can access.

COPER latents — clinical EHR trajectories (MIMIC-III)

March 2026 – April 2026

Research at RIKEN AIP, Statistical Genetics Team on MIMIC-III time series: Perceiver- and transformer-style models, including Continuous Patient State Perceiver (COPER) (Chauhan et al., 2024), for patient trajectory embeddings; Markov decision process head-level experiments for interpretability.

On the ICU sepsis MDP (Komorowski et al., AI Clinician), I ran a small RL algorithm benchmark (Q-learning, SARSA, DQN, PPO, SAC) and compared several reward shaping schemes (sparse survival, SOFA-based signals, COPER mortality score).

Preview of MDP / COPER MIMIC research poster

Repository: COPER-latents. Slides: view. Poster: view.

Presented at the AIP Research Exchange Event, 1 June 2026, Nihombashi.

Identifying codes in graphs: study and coding experiments

December 2025 to February 2026

2-month research project on identifying codes in graphs, based on the paper Charon et al., Minimum Sizes of Identifying Codes in Graphs Differing by One Vertex.

Under the supervision of Prof. O. Hudry.

Poster: view.
Coding experiments and implementations: Identifying_codes_graphs.

Participation in the 2026 Land-Doig MIP Competition

Spring 2026

Participation in the 2026 Land-Doig MIP Competition on GPU-accelerated primal heuristics for mixed-integer linear programming, alongside the Mixed Integer Programming Workshop 2026. I implemented large neighborhood search in Julia with KernelAbstractions.jl, a GPU-agnostic kernel framework.

Workshop: mixedinteger.org/2026. Competition & tutorial: here.

CPSolver — Constraint Satisfaction Problem solver

November 2025 – February 2026

A constraint programming solver developed as part of a Parisian Master of Operations Research Constraint Programming class (course page).

Under Prof. D. Savourey.

╔══════════════════════════════════════════════════════════════════════════════╗
║                                                                              ║
║    ██████╗██████╗      ███████╗ ██████╗ ██╗    ██╗ ╔██ ███████╗██████╗       ║
║   ██╔════╝██╔══██╗     ██╔════╝██╔═══██╗██║    ██║ ║██ ██╔════╝██╔══██╗      ║
║   ██║     ██████╔╝ ██  ███████╗██║   ██║██║    █║   ║█ █████╗  ██████╔╝      ║
║   ██║     ██╔═══╝      ╚════██║██║   ██║██║    ║█   █║ ██╔══╝  ██╔══██╗      ║
║   ╚██████╗██║          ███████║╚██████╔╝╚██████╚█████╝ ███████╗██║  ██║      ║
║    ╚═════╝╚═╝          ╚══════╝ ╚═════╝  ╚════╝ ╚═══╝  ╚══════╝╚═╝  ╚═╝      ║
║                                                                              ║
║              Constraint Satisfaction Problem Solver                          ║
╚══════════════════════════════════════════════════════════════════════════════╝

Main solver components include the DIMACS CSP parser, constraint propagation with AC-3 and forward checking, variable/value ordering heuristics (such as MRV and LCV), and a backtracking search engine to enumerate solutions.

Repository: charles-vzf/CPSolver.

(Un)capacitated Facility Location Problem

November-December 2025

The (Un)capacitated Facility Location Problem is a classic optimization problem in operations research and logistics. The problem involves determining the optimal locations for facilities (such as warehouses, distribution centers, or service centers) to serve a set of customers while minimizing total costs.
We developed a multithreaded simulated annealing heuristic based on several initialization and perturbation strategies, reviewed by Prof. A. Plateau.

View report (in french) and code.

Experiments for propagation of chaos in mean field underdamped regime for wide neural networks

April 2025 to August 2025

5-month internship at RIKEN AIP (Deep Learning Theory Team).
Under the supervision of Prof. T. Suzuki.
Studying Propagation of Chaos for Mean-Field Langevin Dynamics and its Application to Model Ensemble. Discussing hypothesis, and providing experimental validation, see code, and report: view.

Mean-field Langevin dynamics internship report — page 1

Mean-field Langevin dynamics internship report — page 34

ModularyNN — modular neural network framework (Python)

April 2025 to August 2025

Lightweight, composable neural network building blocks for experiments and teaching: customizable layers and training loops.

from scratch (no advanced libraries).

Repository: modularyNN.

Technical support for Ardian startup studio

July 2024

3-day Ardian Startup Studio hackathon.
Technical support for prototyping ideas for internal use at Ardian France; demonstrated feasibility and identified long-term limitations.
Yearly event in Paris (see Artefact press release).

KIRO Inter School Operational Research Challenge

2022–2025

3 participations with strong results on optimization problems from RTE, Renault, and Pelico. See KIRO challenge.

KIRO hackathon poster preview KIRO objective function extract and annotations KIRO wind farm optimization illustration

Repository from first participation (extended): Projet_RO_RTE.

AI and machine learning for theme park attraction attendance prediction

May 2024

One-day challenge by Eleven Strategy at Ecole des Ponts; ranked 2nd. With Mathis Wauquiez and Célestin Hans as teammates.
See our slides: view.

Simulation and learning for gas storage real-time refueling optimization

January 2024 to June 2024

6-month project at École des Ponts (mathematical engineering and computer science department), in partnership with Air Liquide R&D, on optimizing hydrogen tank refueling for user convenience and cost efficiency.

High-pressure refueling heats hydrogen tanks and can approach material limits. The aim was to keep refueling time reasonable while holding the tank below 85 °C and limiting energy use. We used a 0D/1D thermal model for the tank walls, coupled simulations with optimization solvers, and developed a real-time scheme to compute optimal injection temperature profiles — with better cost results than previous naive methods.

Poster: view.
You can see our report here: view.

Differential and Riemannian geometry — general relativity and black hole trajectory simulation

January 2023 to June 2023

6-month research project (1st year, Ecole des Ponts).
Reproducing results from generic simulations. Supervised by Prof. Éric Cancès.
You can see our report here: view.

Study of the torsion of a parallelepiped in 3D Modeling with Cast3M

This project models and visualizes the 3D torsion behavior of a parallelepiped using Cast3M, a structural analysis and fluid mechanics simulation code, to analyze its thermomechanical response under deformation (see Cast3M). Supervised by Prof. L. Dormieux.

Report: view PDF (in French).

Modeling granular media using cellular automata

December 2020 to July 2022

TIPE (Travail d’initiative personnelle encadré) during two years of mathematical and physics preparatory classes (MP).

After a general study of cellular automata — discrete dynamical systems with local evolution rules — I built a simplified solid/liquid erosion model reproducing transport and crater formation. Open-source simulations in Golly with custom rules, combining hashing techniques with macro-cell methods so large runs remain tractable. Brief discussions with A. Trevorrow.

See the short description and the slides (both in French).

Project page →

Eratosthenes: history & science overview (3rd c. BC)

2020

Cross-disciplinary high-school group project between history and science, centred on Eratosthenes of Cyrene: the sieve of Eratosthenes for prime numbers and his method for estimating Earth’s circumference with surprisingly good accuracy once measurement uncertainty is taken into account—impressive given that many civilizations did not yet treat Earth as round long afterwards.

Portrait of Eratosthenes (historical engraving)

Report (French): view PDF.

TI-83 Premium CE coding

2018

Early prgm management, game design, recursivity, and small calculator utilities on TI-83 Premium CE.
Download programs directly: Snake, DDJump, Guitare (easy tab visualization), Labyrinth.