Md Sifat Hossain

CS researcher & software engineer working on large language models, agentic AI systems, and autonomous software engineering.

Open to PhD positions for Fall 2027.

Email · GitHub · LinkedIn · CV (PDF)

Google Scholar · 16 citations · h-index 1

About

I am a Software Engineer at Therap BD Ltd and a recent Computer Science graduate from the University of Dhaka. My research lives at the intersection of large language models, autonomous software engineering, and agentic AI - building systems that can reason about, write, and refine code through structured multi-model feedback.

During my undergraduate years I worked at the Data Mining Research Lab (DU) with Md. Fahim Arefin and Prof. Tarannum Shaila Zaman (UMBC), where I co-authored two papers studying how state-of-the-art LLMs solve ICPC-level competitive programming problems and how persistent multi-model feedback loops can make autonomous code generation substantially more reliable.

I am actively applying for PhD programs starting Fall 2027, where I hope to continue working on reliable, reasoning-capable AI systems for software engineering.

Research interests

Large Language Models for Code Generation
Agentic AI Systems & Multi-Model Feedback
Autonomous Software Engineering
Cross-Language Program Repair & Polyglot Debugging
Competitive Programming as an LLM Benchmark
AI Alignment & RLHF

Publications

* denotes equal contribution. See the publications page for full details and BibTeX.

A-ProS: Towards Reliable Autonomous Programming Through Multi-Model Feedback
Anika Tabassum*, Md Sifat Hossain*, Md. Fahim Arefin, Tariqul Islam, Tarannum Shaila Zaman
ACM Transactions on Software Engineering and Methodology (TOSEM), 2026 · * Equal contribution
TOSEM 2026 · arXiv · PDF · Project Page · Code
LLM-ProS: Analyzing Large Language Models’ Performance in Competitive Problem Solving
Md Sifat Hossain, Anika Tabassum, Md. Fahim Arefin, Tarannum Shaila Zaman
LLM4Code 2025 Workshop, ICSE 2025 - 47th International Conference on Software Engineering, Ottawa, Canada
ICSE 2025 (LLM4Code) · arXiv · PDF · Code

Research Experience

Research Assistant · Data Mining Research Lab, University of Dhaka

Aug 2023 - Present

Dhaka, Bangladesh

Supervisor: Md. Fahim Arefin (DU CSE), in collaboration with Prof. Tarannum Shaila Zaman (UMBC, Information Systems).

Designed and implemented LLM-ProS, a novel evaluation framework for benchmarking LLM performance on ICPC World Finals problems. Curated a 166-problem dataset (2011-2024), built automated submission pipelines via Codeforces Gym, and analyzed five state-of-the-art models across correctness, resource utilization, and chain-of-thought reasoning. Published at ICSE 2025 (LLM4Code).
Extended this work into A-ProS, an autonomous multi-model agentic framework that separates solution generation from specialized debugging feedback under a 2×3 factorial design. Developed the full orchestration pipeline, Codeforces browser automation (Selenium + Playwright), verdict capture, and SQLite logging. Ran ablations on persistent vs. stateless context and trust calibration (ECE) across critic models. Accepted at ACM TOSEM 2026.
Completed undergraduate thesis “A Hybrid LLM Feedback Framework for Automated Competitive Programming Workflows,” proposing an iterative test-driven benchmarking pipeline integrating OpenAI o3-mini with specialist LLMs (DeepSeek, Qwen) for error diagnosis and code refinement via Codeforces-based validation.
(Ongoing) Polyglot Codebase Debugging and Cross-Language Program Repair with LLMs and Agentic AI - investigating the 40-60 point performance gap of LLM agents on cross-boundary bugs in Java+TypeScript and Python+Go enterprise stacks. Designing a localize-then-edit agent built on SCIP-based unified cross-language call graphs and REST/gRPC schema-aware reasoning, extending IBM iSWE-style AST tooling across language boundaries. Targeting an empirical taxonomy paper (MSR/ICSE) and a full systems evaluation on SWE-PolyBench and xCodeEval for ICSE/FSE 2027.
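The A-ProS work above reports trust calibration for critic models using ECE (expected calibration error). As a rough illustration only, the sketch below assumes the common equal-width binned definition: the weighted average, over confidence bins, of the gap between accuracy and mean confidence. The function name, the `n_bins` default, and the binning rule are assumptions for this example and may differ from what the paper uses.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Binned ECE: sum over bins of (bin share) * |accuracy - mean confidence|.

    Assumption: equal-width bins on [0, 1], lower edge inclusive, with a
    confidence of exactly 1.0 placed in the last bin.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    total = len(confidences)
    if total == 0:
        return 0.0

    conf_sum = [0.0] * n_bins  # sum of confidences per bin
    hits = [0] * n_bins        # number of correct predictions per bin
    count = [0] * n_bins       # number of predictions per bin

    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        conf_sum[idx] += conf
        hits[idx] += int(ok)
        count[idx] += 1

    ece = 0.0
    for c_sum, h, n in zip(conf_sum, hits, count):
        if n == 0:
            continue  # empty bins contribute nothing
        ece += (n / total) * abs(h / n - c_sum / n)
    return ece
```

A critic that claims 0.9 confidence on four judgments but is right on only three of them would score an ECE of about 0.15 under this definition; lower values indicate that stated confidence tracks actual accuracy more closely.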

RLHF Data Researcher & Pod Lead · Turing Enterprises Inc.

Jul 2025 - Dec 2025

Remote

Contributed to Reinforcement Learning from Human Feedback (RLHF) data creation supporting large-scale AI model alignment research, focusing on dataset quality, consistency, and annotation methodology for code and reasoning tasks.
Led a team of 10, establishing quality control protocols that ensured annotation reliability across diverse task domains.
Coordinated cross-functional team activities and maintained consistency standards critical to downstream model training, directly supporting AI alignment objectives.

Machine Learning Research Intern · Brainwave Matrix Solutions

Aug 2024 - Sep 2024

Remote, India

Developed a fraud detection model applying anomaly detection and supervised learning on imbalanced datasets, achieving 85% precision. Investigated model behavior under class imbalance and precision-recall trade-offs.
Automated model training and deployment pipelines using Docker and Jenkins, enabling reproducible ML experimentation and continuous integration of model updates.
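The precision-recall trade-off mentioned in the fraud detection work above can be illustrated with a small threshold sweep. This is a minimal sketch, not the internship code: the function name and the toy scores and labels used in the usage note are hypothetical.

```python
from typing import Sequence, Tuple


def precision_recall(
    scores: Sequence[float],
    labels: Sequence[int],
    threshold: float,
) -> Tuple[float, float]:
    """Precision and recall when flagging every score >= threshold as fraud.

    labels use 1 for fraud (the rare positive class) and 0 for legitimate.
    """
    tp = fp = fn = 0
    for score, label in zip(scores, labels):
        flagged = score >= threshold
        if flagged and label == 1:
            tp += 1
        elif flagged and label == 0:
            fp += 1
        elif not flagged and label == 1:
            fn += 1
    # Guard against division by zero when nothing is flagged or no positives exist.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall
```

With toy scores `[0.9, 0.8, 0.4, 0.3, 0.2, 0.1]` and labels `[1, 0, 1, 0, 0, 0]`, raising the threshold from 0.35 to 0.85 lifts precision while recall falls, which is the trade-off that matters most when positives are rare.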

Projects

Research & data

Polyglot Codebase Debugging (Ongoing)

LLM Agents · SCIP · tree-sitter · Java · TypeScript · Python · Go

Ongoing research on cross-language fault localization and program repair for polyglot codebases - targeting the Java+TypeScript and Python+Go enterprise stacks where LLM agents drop from ~80% to 10-33% resolution. Building a localize-then-edit agent that unifies SCIP-based cross-language call graphs with REST/gRPC schema reasoning, extending IBM iSWE-style AST tooling across language boundaries. Empirical taxonomy of cross-language bug root causes plus systems evaluation on SWE-PolyBench and xCodeEval; targeting ICSE / FSE 2027.

A-ProS

Python · OpenAI · DeepSeek · Selenium · Playwright · SQLite

Reference implementation of A-ProS (TOSEM 2026) - an autonomous agentic framework separating solution generation (GPT-4 / GPT-5) from specialized debugging feedback (DeepSeek-R1, Llama-3.3, Codestral) under a 2×3 factorial design. Persistent multi-model feedback loops achieve 2.2-2.3× greater gains than stateless baselines on 367 ICPC and Codeforces problems.

Code on GitHub →
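The persistent-versus-stateless distinction in the A-ProS description above can be sketched as a small refinement loop. This is an illustrative outline under stated assumptions, not the A-ProS implementation: `refine`, the callable signatures, and the `"ACCEPTED"` verdict string are hypothetical, and "stateless" is taken here to mean that the generator sees only the most recent critic feedback.

```python
from typing import Callable, List, Tuple

# Hypothetical interfaces, assumed for this sketch:
Generate = Callable[[str, List[str]], str]  # (problem, feedback so far) -> code
Critique = Callable[[str, str], str]        # (code, verdict) -> feedback text
Judge = Callable[[str], str]                # code -> verdict string


def refine(
    problem: str,
    generate: Generate,
    critique: Critique,
    judge: Judge,
    max_attempts: int = 5,
    persistent: bool = True,
) -> Tuple[str, int]:
    """Run a generate -> judge -> critique loop; return (final verdict, attempts used)."""
    history: List[str] = []
    verdict = "NOT_ATTEMPTED"
    for attempt in range(1, max_attempts + 1):
        code = generate(problem, history)
        verdict = judge(code)
        if verdict == "ACCEPTED":
            return verdict, attempt
        feedback = critique(code, verdict)
        if persistent:
            # Persistent context: every earlier piece of feedback stays visible.
            history = history + [feedback]
        else:
            # Stateless context (assumed meaning): only the latest feedback is kept.
            history = [feedback]
    return verdict, max_attempts
```

Separating the generator from the critic behind plain callables keeps the loop independent of any particular model, so the same loop can be run once per generator and critic pairing.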

Hybrid Feedback Loop - LLM Benchmark Pipeline

Python · Selenium · BeautifulSoup · SQLite3

Data and orchestration pipeline underlying LLM-ProS (ICSE 2025) and the broader A-ProS benchmark - scraping 166 ICPC World Finals problems, normalizing LaTeX/HTML, structuring statements / I/O specs / constraints / samples, automating Codeforces Gym submissions, and capturing per-attempt verdict + runtime + memory in SQLite. Extended to Codeforces, forming the 367-problem benchmark used in A-ProS.

Code on GitHub →
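The per-attempt logging described for the benchmark pipeline above (verdict, runtime, and memory in SQLite) might look roughly like the following. The table name, column names, verdict strings, and helper functions are all assumptions made for this sketch rather than the pipeline's actual schema.

```python
import sqlite3
from typing import Optional

# Hypothetical schema: one row per submission attempt.
SCHEMA = """
CREATE TABLE IF NOT EXISTS attempts (
    id         INTEGER PRIMARY KEY AUTOINCREMENT,
    problem_id TEXT    NOT NULL,
    model      TEXT    NOT NULL,
    attempt_no INTEGER NOT NULL,
    verdict    TEXT    NOT NULL,
    runtime_ms INTEGER,
    memory_kb  INTEGER,
    UNIQUE (problem_id, model, attempt_no)
)
"""


def open_log(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the attempt log; defaults to an in-memory database."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def record_attempt(
    conn: sqlite3.Connection,
    problem_id: str,
    model: str,
    attempt_no: int,
    verdict: str,
    runtime_ms: Optional[int] = None,
    memory_kb: Optional[int] = None,
) -> None:
    """Insert one attempt; the `with` block commits on success, rolls back on error."""
    with conn:
        conn.execute(
            "INSERT INTO attempts "
            "(problem_id, model, attempt_no, verdict, runtime_ms, memory_kb) "
            "VALUES (?, ?, ?, ?, ?, ?)",
            (problem_id, model, attempt_no, verdict, runtime_ms, memory_kb),
        )


def solve_rate(conn: sqlite3.Connection, model: str) -> float:
    """Fraction of attempted problems with at least one ACCEPTED verdict."""
    solved, attempted = conn.execute(
        "SELECT COUNT(DISTINCT CASE WHEN verdict = 'ACCEPTED' THEN problem_id END), "
        "       COUNT(DISTINCT problem_id) "
        "FROM attempts WHERE model = ?",
        (model,),
    ).fetchone()
    return solved / attempted if attempted else 0.0
```

Storing one row per attempt rather than one row per problem keeps the full retry history available, so questions such as solve rate after a given number of attempts can be answered later with a query instead of a rerun.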

TikTok Scraper

Python · Selenium · Requests · BeautifulSoup · SQLite3

Scraper extracting video descriptions and author metadata for specified keywords and tags, enabling structured analysis across 5,000+ videos. Selenium for dynamic rendering, BeautifulSoup for parsing, SQLite3 for storage - the same scraping architecture later applied in the LLM benchmark pipeline.

Code on GitHub →

Software

Smart Event Ticketing System

Java · Spring MVC · Hibernate/JPA · PostgreSQL · JSP

Multi-role event management platform with pessimistic locking to guarantee transactional consistency and prevent concurrent booking conflicts at scale. Real-time event filtering with asynchronous data retrieval for responsive search.

JobGenie

React.js · FastAPI · MongoDB

Job search platform with automated CV generation and personalized job matching, integrating live job scraping to fetch and rank relevant listings.

Code on GitHub →

OyeAmigo

Kotlin · Android SDK

Personality-based social networking Android app with null-safe Kotlin architecture, reducing crash rates and improving runtime stability.

Code on GitHub →

Education

University of Dhaka

Jan 2020 - Feb 2025

Bachelor of Science in Computer Science and Engineering · CGPA 3.13 / 4.00

Undergraduate Thesis

A Hybrid LLM Feedback Framework for Automated Competitive Programming Workflows

Proposed a novel test-driven iterative benchmarking framework integrating multiple LLMs (OpenAI o3-mini, DeepSeek, Qwen) with Codeforces-based validation to evaluate and improve automated code generation and error correction in competitive programming contexts.

Relevant coursework

Data Structures & Algorithms
Object-Oriented Programming
Software Design Patterns
Artificial Intelligence
Machine Learning
Natural Language Processing
Theory of Computation
Compiler Design
Operating Systems
Database Management Systems
Software Engineering
Probability & Statistics

Industry Experience

Software Engineer · Therap BD Ltd

Apr 2025 - Present

Dhaka, Bangladesh

Develop and maintain scalable features for Therap’s EHR SaaS platform (used across all 50 US states) using Java, Spring, Hibernate, JSP, and Oracle DB, with a focus on correctness and reliability under HIPAA constraints.
Build and containerize full-stack modules using React.js and Docker, deployed on WebLogic Server, contributing to platform stability and consistent delivery across environments.

Software Engineer (Part-time) · Zeroxa DT

Mar 2023 - Jun 2024

Remote, London, UK

Built and deployed scalable web applications for 5+ clients using React.js and FastAPI, significantly reducing average page load times through targeted performance optimization.
Architected CI/CD pipelines with automated testing and deployment workflows on AWS (EC2, S3, RDS), accelerating release cycles while maintaining production code quality.

Achievements

Honors

Zelf Hackathon 2.0 - Honorable Mention - Scraping Engineer track.

Competitive programming

Codeforces - Max rating 1603 (Expert) · 1000+ problems solved.
CodeChef - 4★ · max rating 1921.
AtCoder - Handle sifat_sif · regular contest participant.
LeetCode - Handle sifat_sif.
ICPC Dhaka Regional Onsite - Top-35 of 220+ teams (2023) · Top-49 of 309 teams (2024).
BUET Inter-University Programming Contest 2023 - 5th of 102 teams.
Samsung R&D BD Coding Contest 2024 - Final Round qualifier · 55th of 908 in Round 1.
Meta Hacker Cup 2024 - Round 2 qualifier · global rank 2,166.
NCPC 2023 - 44th of 198 teams.

Technical Skills

Languages

Python
Java
C++
JavaScript
TypeScript
Kotlin

Frameworks & Libraries

Spring Boot
Hibernate
FastAPI
React.js
Next.js
JSP
Selenium
Playwright

Databases

PostgreSQL
Oracle DB
MongoDB
SQLite

Research & ML

PyTorch
HuggingFace Transformers
OpenAI API
Pandas
NumPy
Jupyter
LaTeX

Tools

Git
Linux
Bash
Docker
Jenkins
Nginx
JUnit5

Mentees

Beyond the named mentees below, I have mentored 30+ students one-on-one through the Bangladesh Higher Secondary Certificate (HSC, classes 11-12) curriculum, and guided 10+ students preparing for Bangladesh public and private university admission tests.

Ilhaan S Abdullah

University of California, Riverside (UCR)

B.S. in Computer Science with Business Applications

Guided Ilhaan through the theoretical foundations and applied problem-solving for a broad slice of the UCR CS undergraduate core - from assembly and discrete structures through algorithms, theory of computation, operating systems, networks, security, and AI. Focus was on building durable intuition (why an algorithm works, what an OS abstraction is really protecting) rather than answer patterns.

Courses guided

CS 061 · Machine Organization and Assembly Language Programming
CS 100 · Software Construction
CS 105 · Computer Systems and C Programming
CS 111 · Discrete Structures
CS 141 · Intermediate Data Structures and Algorithms
CS 150 · Automata and Formal Languages
CS 153 · Design of Operating Systems
CS 164 · Computer Networks
CS 165 · Computer Security I
CS 170 · Introduction to Artificial Intelligence

Md Mahi

BRAC University, Dhaka

B.Sc. in Computer Science

Mentored Mahi since class 9 - starting with training for the Bangladesh Mathematical Olympiad, then progressing into competitive programming (data structures, algorithms, and problem-solving under contest constraints) and foundational Python. Continuing to advise on university-level CS coursework and project work.

Courses guided

Olympiad · Bangladesh Mathematical Olympiad training
CP · Competitive Programming (algorithms, data structures)
Python · Introductory & intermediate Python programming

Leadership & Service

Vice President, ICT and Graphics · Notre Dame English Club, Notre Dame College, Dhaka

2018 - 2019

Dhaka, Bangladesh

Organized the 6th National English Carnival, a national-level academic competition with 10,000+ participants across 30+ events; previously coordinated the 5th edition (5,000+ participants).
Owned judge coordination and external communications, managing scheduling and briefing for judges drawn from industry, academia, and senior education across parallel event tracks.
Led sponsorship outreach, securing corporate partnerships that funded event operations and prize pools.

Contact

The fastest way to reach me is by email at sifatb910@gmail.com. I am currently applying for PhD positions for Fall 2027; I would be happy to discuss research fit or share my CV, thesis, and code on request.
