Projects

A collection of my research work, academic projects, and side experiments.

🔬

Research Projects

Bayesian Neural Network Initialization

Research

Developed a novel neural network initialization algorithm using co-occurrence statistics and product quantization. Achieved 81% accuracy on MNIST with only 20% of the data and 65% on CIFAR-10 without gradient descent.

PyTorchNeural NetworksBayesian Methods

Semantic Shift Analysis of PubMedBERT

Ongoing

Yearly continual pretraining of PubMedBERT on a longitudinal PubMed corpus followed by embedding-space, neighborhood, and concept-proximity analyses to quantify semantic shift and evolving biomedical knowledge over time.

NLPBERTBiomedical

BioBERT Semantic Shift Detection

Research

Fine-tuning BioBERT to detect semantic shift in medical text (PubMed, 1975-2025), including corpus preparation, vocabulary drift checks, and evaluation on shift-sensitive examples.

NLPBERTMedical AI

Internalized Self-Stigma Analysis

Research

Longitudinal analysis of internalized self-stigma among Reddit users using GPT API pipelines and prompt engineering for social media analytics.

NLPSocial Media AnalyticsGPT
🎓

Academic Projects

Agentic AI for Information Collection

2025

Chain-of-thought LLM agent leveraging OpenAI's function-calling to orchestrate Wikipedia API tools and deliver citation-backed responses.

LLMAgentsOpenAI

Adversarial Attacks on Vision Models

2025

Comprehensive white-box (PGD) and black-box adversarial attacks on ResNet-18 and YOLOv5 to quantify vulnerabilities.

Computer VisionSecurityPyTorch

Stock Market Trading Bot

Reinforcement learning based stock trading bot using Proximal Policy Optimization (PPO) for various market conditions.

Reinforcement LearningPPOFinance

LSTM Lyrics Generation from Scratch

2025

Implemented Long Short-Term Memory algorithm from scratch using Python and NumPy for song lyrics generation.

Deep LearningNumPyNLP

Classical ML in C++ from Scratch

2024

Logistic Regression and SVMs implemented in C++ using Eigen Linear Algebra Library for CIFAR-10 classification.

C++ML from ScratchEigen

Deep Learning Sentiment Analysis

Published

COVID-19 news video sentiment analysis using LSTM, Bi-LSTM, CNN, and GRU. Published at ICITA-2021.

Deep LearningNLPResearch
📁

Earlier Projects

Dry Beans Classification

2021

Machine learning models for classifying dry bean species using R, with EDA using Power BI.

RPower BIClassification

LAN-Based Communication System

2021

Multi-client chat application using Java Swing, TCP socket programming, and Oracle SQL.

JavaNetworkingSQL

RSA Encryption in C

2020

Implementation of the RSA cryptographic algorithm in C for encrypting and decrypting information.

CCryptographySecurity

UAV Payload Delivery System

2020

Quadcopter built using Ardupilot with a payload system for airdropping packages via servo motor and Arduino.

ArduinoC++Hardware

Want to see more?

Check out my GitHub for more projects and code samples.

© 2026 Milan Varghese. All rights reserved.

Built with Astro & TailwindCSS