Hi, I'm Saqlain
Engineering AI products at scale and creating practical solutions across domains.

About

Focusing on designing and refining AI systems with a strong emphasis on clarity, stability, and rigorous reasoning. My work covers machine learning, NLP, and language models, where I take initial concepts and develop them into reliable, well-structured systems. I examine prototypes from the ground up, isolate weak points, and rebuild their logic until the behavior holds under varied conditions. I work with models, data, and algorithms in a direct, methodical way that keeps the system predictable and controlled. Outside technical work, I spend my time playing cricket or training in martial arts (Black Belt, Dan 1). I write clean code, question assumptions without delay, and depend on solid algorithmic structure.

Work Experience

I
Research Intern - Machine Learning
Working with the NMCAD Lab on an eVTOL project as part of the machine learning team, contributing to data driven models for improving system performance and operational efficiency.
P
Designed and deployed RESTful APIs for a compliance platform, enhancing performance and reducing response latency by over 40%. Developed a RAG based chatbot leveraging hybrid retrieval techniques to deliver precise responses for ABPI case and clause related queries, improving overall information accuracy.
B
Research Intern
Built a clinical RAG pipeline using LLMs and a FAISS vector store to extract symptoms and answer medical queries with reduced hallucinations. Developed a live streaming ASR system using open source models with a substantial accuracy gain.

Open Source Contributions

Added Differential Diffusion to Kolors

HuggingFace Diffusers 🤗
Python
Diffusion Models
Image Generation
PyTorch

Optimized ALBERT Test Model Size

HuggingFace Transformers 🤗
Python
Transformers
Model Optimization
NLP

Projects

Katha - Hindi SLM

Katha is a Hindi story generation model based on Llama-3.2-1B, fine-tuned with LoRA adapters on the TinyStories-Hindi dataset. It creates coherent and engaging short stories, completes Hindi text prompts, and assists with creative writing in Devanagari script. The project leverages Transformers, PEFT, and LoRA for efficient adaptation and was trained on high-performance GPUs. Katha is optimized for natural, creative Hindi storytelling and text completion.

Python
Transformers
PEFT
LoRA Adapters

Syndata - Synthetic Data Generation Platform

Syndata is a synthetic data generation platform for creating Q&A pairs from PDF documents. It is useful for evaluation tasks and data generation, enabling users to quickly generate high-quality question-answer datasets from their own documents.

Python
Next.js
FastAPI

Retro Reels - Shortform Content Generation Pipeline

Built a fully automated content pipeline that generates and uploads shortform videos using LLMs and YouTube API. Integrated GitHub Actions to schedule daily creation and publishing workflows for consistent automation and automated the generation of video scripts, titles, hashtags, and metadata for optimized reach.

Python
LLMs
GitHub Actions
YouTube API

Presently - Web to Presentation Video Tool

Developed a tool to transform web content into presentation videos with synchronized narration and music. Automated slide creation, narration, and audio generation using Gemini and text-to-speech models and streamlined content curation through web scraping and AI-driven summarization for presentation flow.

Python
MoviePy
Gemini
python-pptx

PACLI - Personal Assistant CLI

Built a command-line LLM assistant to manage and modify calendar events via natural language. Integrated an Agentic RAG framework for reasoning over contextual information about scheduled events and automated the discovery and scheduling of upcoming Codeforces contests based on user intent.

Python
LangChain
LLMs
Agentic AI

ReTweetify - Turkish Hate Speech Classification

Fine-tuned multilingual BERT for Turkish hate speech classification using parameter-efficient LoRA adapters. Implemented token-level explainability using SHAP to visualize word contributions in predictions and built a rewriting pipeline to paraphrase hateful text while retaining tone and semantic meaning.

Python
Transformers
BERT
LoRA
SHAP

Hackathons

Solutions built as part of hackathons
Precision Budget Intelligence
Team project
A personal finance platform that manages user budgets by extracting expenses and categories from uploaded bills, auto-updating all records in the database. It includes an insight-driven chatbot for financial guidance, monthly overspend detection, and goal tracking to keep budgets aligned.
NextJS
FastAPI
MongoDB
Gemini Ecosystem
Smart Ticket Triage Platform
Team project
A ticketing system where users file routine requests. An agent monitors each ticket, identifies those that match predefined automation paths, and prepares a resolution plan. Administrators view these flagged tickets in a dashboard. When an admin approves, the agent executes the resolution directly. This removes repetitive IT work and shortens handling time for standard issues.
NextJS
FastAPI
MongoDB
Strands Framework
Agentic AI
Artisan Connect & Digital Market Hub
Team project
A platform that provides artisans real-time updates on upcoming exhibitions and venues, offers a marketing dashboard for generating posters and campaign content, supports profile creation and discovery, enables connections with local artisans and users, and includes a RAG-based chatbot that answers queries about government schemes relevant to artisans.
ReactJS
FastAPI
MongoDB
Gemini Ecosystem

Research and Publications

BiasNet: A Contrastive GNN Based Framework for Classifying Political Stance in News

FTNCT 2025•Paper (Coming Soon)
Accepted
We introduce BiasNet, a framework using Graph Neural Networks and contrastive learning to classify political stance in news articles. By modeling relationships between articles, our approach effectively distinguishes left, right, and neutral biases, outperforming traditional baselines. This method supports media monitoring and fact-checking by detecting subtle bias in news content.
Graph Neural Networks
Contrastive Learning
Political Bias
News Classification

Reading Between the Lines: LLM-Powered Topic Modelling and Graph-Based Insights from Research Abstracts

ICIVC 2025•Paper (Coming Soon)
Accepted
This study presents an innovative approach to research abstract analysis by combining advanced topic modeling with large language models (LLMs) and graph-based analytics. We employ BERTopic for topic extraction, LLama2 for semantic enhancement, and graph neural networks to uncover hidden patterns and relationships within academic literature. Our methodology demonstrates significant improvements in identifying emerging research trends, cross-disciplinary connections, and knowledge evolution patterns compared to traditional approaches.
Topic Modelling
BERTopic
LLama2
Graph Analytics
LLM

Insights & Updates

Skills

Python
Java
C
JavaScript
SQL
R
TensorFlow
PyTorch
Pandas
NumPy
Scikit-learn
Keras
🤗HuggingFace
Peft
Langchain
Langgraph
MLflow
AWS
Azure
Docker
Git/GitHub
FastAPI
MySQL
MongoDB
FAISS
Pinecone
Qdrant
Neo4j

Education

Bachelor of Technology, Computer Science and Engineering
Specialization: Artificial Intelligence and Machine Learning

Achievements

ACM ICPC Asia West Regionalist

Qualified for the ACM ICPC Asia West Regional Contest.

Knight at LeetCode

Achieved the title of Knight at LeetCode with a rating of 2000+.

First Runner-Up, Flipr Hackathon 26.1

Secured the First Runner-Up position in the ML/AI track at Flipr Hackathon 26.1 for building an AI powered personal finance tracker.

Black Belt Dan 1 in Karate

Attained the rank of Black Belt Dan 1 in Karate, reflecting dedication, discipline, and expertise in martial arts.