Kislay Aditya Oj

Computer Science and Engineering | MS · IIT Bombay | BTech · IIT Kanpur

About

I'm Kislay, currently pursuing an MS by Research in Computer Science & Engineering at IIT Bombay, where I'm part of the CFILT Lab. I completed my B.Tech from IIT Kanpur in 2025. I have a strong interest in reinforcement learning, machine learning and language technologies, and enjoy working on problems that involve learning theory, decision making and statistics.

Outside of academics, I really enjoy chess, I follow top games closely and play competitively (currently around 1900, though I have touched 2000 before… we don't talk about the rating drop, also it's chess com rating not FIDE, i'm not a genius). I also follow Formula 1, sketch in my free time, and enjoy reading books. Lately, I've been learning to play the piano as well. You can find me on X, check out my occasional blogs in the notes section, or browse through my projects to see what I've been working on.

Research

Theoretical Reinforcement Learning

I work on theoretical reinforcement learning with a focus on bandit problems involving hidden structure, such as latent user states or clustered populations. My research studies how offline models and shared structure can be combined with online exploration to design sample efficient algorithms with provable regret guarantees.

Mechanistic Interpretability

I investigate how neural networks, especially LLMs, implement internal computation mechanisms rather than just observing input output behavior. This includes studying representation geometry, transformer circuit structure, and causal mechanisms behind reasoning and figurative language, while exploring SVD-based decomposition of internal representations as a promising direction for understanding how such computations arise.

Reinforcement Learning for Adaptive LLM Agents

I study how reinforcement learning ideas can be used to help large language model (LLM) agents improve from feedback during real use. This work focuses on test time adaptation using reflection, memory, and personalization rather than model fine tuning. As part of the Flipkart–IIT Bombay collaboration, my goal is to build LLM agents that become more reliable and user aware over time.

Publications

working on it

Contact

Email: kislay@cse.iitb.ac.in

Lab: CFILT Lab, Room 401, Computer Centre
Department of Computer Science
Indian Institute of Technology, Bombay

Google Scholar · GitHub · Twitter · LinkedIn

Notes

Scattered thoughts and reflections. Click title to expand.

Attending my first Conference
December 2025
Volunteering at AACL 2025 gave me a behind the scenes view of how conferences actually run, the chaos, coordination, people and conversations that never show up in papers or schedules.
Book this week #1 - The Alchemist
Paul Cohelo · December 2025
A short and simple story about chasing dreams and learning from the journey. Predictable at times, but still a solid and meaningful read, especially for beginners.
Tier List #2 - Movies
September 2025
Personal thoughts on movies and TV shows based on impact rather than objectivity. Some unforgettable, some enjoyable, some overhyped, all filtered through mood, timing and questionable taste.
Tier List #1 - Anime
September 2025
A very subjective take on anime I've watched over the years, ranked less by technical quality and more by how much they stuck with me. Strongly biased by characters, long term impact and pure vibes.

← Back to all notes

Attending my first Conference

December 2025

So last 6 months was full of me receiving insights about research, phd applications, conferences and etc. and generally trying to understand where I fit into all of this. So when AACL 2025 was announced, happening at our own campus and organized largely through our lab, there was no way I was going to miss the chance to volunteer. Getting to observe and work closely with researchers and organizers felt like an opportunity that does not come often.

Preparations started months in advance. There were regular meetings, planning sessions, and constant coordination to make sure things would run smoothly. I did not attend every meeting, but I stayed updated and could see how much effort people like Sushma ma'am, Deepak sir, Gajanan sir and Dhiren sir were putting in to get everything right. Watching that process itself was a learning experience.

The conference ran from December 19 to 24, right in the middle of winter break. I went home briefly and then came back to campus on the 19th, which turned out to be an adventure in itself. Flights were chaotic, accommodation was uncertain and things were generally messy. Somehow, everything worked out. At this point, the lab honestly feels more like home than my hostel room.

I was initially assigned to the registration desk, but later moved to the tech team, handling audio, microphones, Zoom and cameras for oral presentations and talks. On the first day, I was completely lost. I picked up my volunteer tee and ID, and mostly followed Sharath around trying to understand what was going on. The day started with the opening ceremony and keynote in the main auditorium, after which we moved to Room 21 for oral presentations.

That first day, I mostly observed. We had a detailed run-of-the-show sheet that helped keep things on track. There were some technical hiccups, but nothing major. Most of the time, I was just passing microphones around and making sure Zoom was running. Other rooms were not as lucky and faced serious audio and connectivity issues, so we were grateful ours went smoothly. Between sessions, there were coffee and lunch breaks. The food was surprisingly good, apparently from Gulmohar. I liked the atmosphere of people discussing work and interests with each other. I did not talk much myself, as I was unsure of what I would even say given how early I am in my research.

Day 2 went much more smoothly. It was mostly me, Vijendra and Sharath handling Room 21. That day also included CFILT Day, where alumni gathered to remember Prof. Pushpak Bhattacharyya and share memories. The highlight was the cultural event and the gala dinner. The dinner was great, and I managed to talk to a few professors. I still wonder how many of those conversations will actually turn into email replies, but it was enjoyable nonetheless.

Day 3 was rough. Only Sharath showed up early in the morning, and he had to handle three rooms alone. Sessions started late, and things were chaotic. I had assumed he would set things up and I would join later, but after seeing how bad it got, I made it a point to come early for the remaining days. The rest of the day went better, and we ended with the closing ceremony, award distribution and a photoshoot.

The real volunteering challenge began after that. The next day had eleven parallel workshops, each needing tech support. By then, my intern phase was over and I was assigned to handle an entire room on my own. My room, Room 14, had delayed setup, microphone issues and general confusion. Thankfully, the workshop presenters were understanding. Attendance was not very high, but we managed to complete the sessions without major issues.

By Day 4 evening, we were exhausted. Me, Vijendra, and Sharath went to Versova beach to unwind and later visited a cat cafe. It was a much needed break and one of the nicest moments of the week.

Day 5 was easily the best. I deliberately chose to be in Room 21 because Prof. Ashutosh Modi was running a workshop there. He is the reason I got into research and was my professor at IIT Kanpur. To my surprise, he remembered me. I spent most of the day with his students, who were all extremely sharp and we even discussed a potential project together. I knew Prof. Modi was chill, but this was on another level. I did have a small moment of regret about not staying back at IIT Kanpur, but that is life.

Overall, volunteering at AACL was an intense but rewarding experience. From waking up at 6 in the morning to working late into the night, I got to see a conference from an organizer's perspective. The academic exposure was great, but the networking and informal conversations were the real highlight. By the final day, I had spoken to students from different countries and learned how similar and different research culture can be across places. Next time, I would like to be on the other side of the conference as an attendee. This was fun and a great experience. Kudos to the entire team.

← Back to all notes

Book this week #1 - The Alchemist

Paul Cohelo · December 2025

The Alchemist is about a boy who wants to follow his dreams and the different obstacles, experiences, and lessons he encounters along the way. The story is straightforward and easy to follow, and most of its ideas are conveyed through simple events rather than complex plot twists.

I think this book works especially well as a starter read. It doesn't demand much from the reader in terms of attention or background, and the lessons are communicated clearly. In many places, the book feels less like a story meant to surprise you and more like a lesson you're supposed to reflect on while reading. Because of this, it helps to keep an open mind rather than treating it as something to rush through.

From a broader view, the story is fairly predictable, particularly if you are familiar with similar themes in anime or other coming-of-age narratives. This can make parts of the book feel slow or unexciting. Still, considering how old the book is, that predictability is not too surprising and doesn't completely take away from the experience.

One thing I liked is that the book is short and well-paced. You can finish it in one sitting without feeling tired or overwhelmed. Even if the ideas are not entirely new, the book presents them in a calm and readable way. Overall, I would recommend The Alchemist. It's not a deep or complex book, but it's an easy and thoughtful read that can be enjoyable, especially if you're just getting into reading.

Here are some quotes I liked from the book:

"It's the possibility of having a dream come true that makes life interesting."

"If you start out by promising what you don't even have yet, you'll lose your desire to work towards getting it."

"I weep for Narcissus, but I never noticed that Narcissus was beautiful. I weep because, each time he knelt beside my banks, I could see, in the depths of his eyes, my own beauty reflected."

Rating: ★★★★★★☆☆☆☆ (6/10)

Projects

Software · Datasets · Tools

Analysis of EBMs — Boltzmann Machines
Energy-Based Models · Python · 2025
Experimental study of energy-based models (Boltzmann machines) investigating the effect of Contrastive Divergence (CD) steps on sample quality. Includes analysis scripts, configurations, and result outputs for varying CD schedules and sampling strategies.
GitHub
Know.Study.Help
Python · Flask · Web · 2025
A lightweight web platform for academic management and study workflows. The repo contains a Flask backend (app.py / server.py), HTML templates and static assets, and utilities to manage study-related content and pages. Built as a small full-stack project for course/organisational use.
GitHub
Transliteration (Roman → Devanagari)
Transliteration · LSTM / Transformer · Python · 2025
Code and experiments for Hindi transliteration (Roman to Devanagari). The repo includes LSTM and transformer checkpoints, sampling and evaluation scripts, and a short paper/report describing the approach. A demo script is provided for quick sampling; some LLM-based inference paths require an NVIDIA API key (noted in the README).
GitHub
Prompt Tuning
Prompt tuning · Experimental · Python · 2025
Research-and-development code exploring prompt / prefix tuning approaches for LLMs. Contains experimental code, notebooks and utilities used to run prompt tuning experiments and compare prompt-based interventions across models.
GitHub
LLM-Based Scraper for Amazon Order Information
Python · Selenium · LLMs · 2024
Built an automation pipeline to extract structured order history data from Amazon using Selenium. Integrated an open-source LLM (GPT-Neo) to process and structure raw HTML into JSON/CSV formats, with support for swapping in stronger models. Focused on robustness, security, and scalability.
GitHub
Curiosity-Driven Exploration via Self-Supervised Prediction
Reinforcement Learning · PyTorch · 2024
Implemented the Intrinsic Curiosity Module (ICM) across DQN, A3C, and PPO to study exploration in sparse-reward environments. Demonstrated how intrinsic rewards improve learning efficiency and policy performance across different RL algorithms.
GitHub
Exploration and Analysis of Deep Reinforcement Learning Methods
Deep RL · PyTorch · 2024
A comprehensive study and implementation of classical and deep RL algorithms including bandits, Monte Carlo, TD methods, SARSA, Q-learning, DQN variants, PPO, TD3, and DDPG. Includes experiments, comparisons, and visual analysis.
GitHub
Computer Vision using C
C · Python · Computer Vision · 2023
Implemented core image processing and computer vision algorithms in C with Python bindings. Covered filtering, convolutions, edge detection, hybrid images, and color space transformations, with visual validation through experiments.
GitHub
Tour De OAAR – Astronomy Club, IIT Kanpur
Mentorship · Python · Web APIs · 2023
Mentored a team of students on Python programming and automation projects for the campus observatory. Guided development of tools such as weather monitoring systems using APIs and JSON, alongside teaching astronomy fundamentals and telescope operations.
GitHub