Gaurab Chhetri - Building things that make a difference.

Gaurab Chhetri

Building things that make a difference.

Austin, TX9:35 PM

Undergraduate Researcher in Computer Science. I work on AI, data mining, and full-stack web development, and I like building end-to-end systems that mix machine learning with solid engineering. I believe in the mantra: “Do what you want, not what you can!”

emailMe viewResume

experience

October 2024 - Present

Undergraduate Researcher

AIT Lab - Texas State University, San Marcos, TX

Engineered and deployed 25+ data-driven web tools for AI and transportation research, processing 150k+ mobility and crash records with reproducible pipelines in Python and JavaScript/ TypeScript.
Implemented ML workflows for crash analytics and safety prediction, contributing to 5+ published papers.
Developed and optimized the AIT Lab website (Next.js + Tailwind), achieving 90+ Lighthouse performance and SEO scores.

Applied AI/MLWeb DevelopmentData Engineering

featuredProjects

HackathonAdaptly - The AI-Adaptive UI Library for React & Next.js

Adaptly - The AI-Adaptive UI Library for React & Next.js

Adaptly brings intelligence to modern web dashboards. It's a TypeScript-first library that lets your UI understand what users mean, not just what they click.

TypeScriptJavaScript

learnMore

PersonalBhanai - A Custom Programming Language with a Nepali Touch

Bhanai - A Custom Programming Language with a Nepali Touch

Bhanai is a simple and intuitive programming language with a Nepali touch. It leverages Node.js under the hood for execution, allowing users to create `.bhn` files and run them seamlessly.

JavaScript

learnMore

ResearchCognitiveSky - Scalable Sentiment and Narrative Analysis for Decentralized Social Media

CognitiveSky - Scalable Sentiment and Narrative Analysis for Decentralized Social Media

CognitiveSky is an open-source research infrastructure for analyzing mental health narratives on Bluesky, combining real-time ingestion, NLP pipelines, and an interactive Next.js dashboard. Accepted for presentation at HICSS 2026. Preprint available on arXiv, final version will appear in official proceedings.

PythonTypeScript

learnMore

PersonalVidXiv - ArXiv Paper to Video Generator

VidXiv - ArXiv Paper to Video Generator

VidXiv automatically converts research papers from ArXiv into engaging narrated videos with scene-by-scene breakdowns, AI-generated scripts, and export-ready MP4s for YouTube or Shorts.

Python

learnMore

viewAllProjects

education

Expected Graduation: May 2028

Bachelor of Science in Computer Science

Texas State University, San Marcos

I am currently pursuing a Bachelor of Science in Computer Science at Texas State University, where I am learning and gaining hands-on experience in various aspects of computer science, in software development, data structures, algorithms, and web technologies.

Computer ScienceSoftware DevelopmentData StructuresAlgorithmsWeb TechnologiesFull Stack DevelopmentAI & Machine LearningData ScienceResearch

recentDevLogs

November 26, 2025

Why I Prefer Free Tier Infrastructure and What It Teaches You

Sharing how using free tier infrastructure taught me to optimize software and systems for highest possible efficiency.

September 16, 2025

What I learned building my first CLI tool

A behind-the-scenes look at building my first CLI tool, optimize-images-cli. From handling image formats and performance challenges to improving user experience, here are the key lessons I learned as a developer.

August 30, 2025

What Four Hackathons Taught Me About AI, Teamwork, and Sleep Deprivation

Reflections on four hackathons - lessons on AI, teamwork, creativity under pressure, and why sleep deprivation sometimes sparks innovation.

June 25, 2025

Do What You Want, Not What You Can - My Mantra for Building Meaningful Projects

Discover why building projects you truly want to create leads to more meaningful work than simply doing what you can. A personal reflection on coding, hackathons, and purpose-driven software development.

viewAllDevLogs

researchPublications

September 14, 2025 | arXiv preprint arXiv:2509.11444

CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media

Gaurab Chhetri, Anandi Dutta, Subasish Das

The emergence of decentralized social media platforms presents new opportunities and challenges for real-time analysis of public discourse. This study introduces CognitiveSky, an open-source and scalable framework designed for sentiment, emotion, and narrative analysis on Bluesky, a federated Twitter or X.com alternative. By ingesting data through Bluesky's Application Programming Interface (API), CognitiveSky applies transformer-based models to annotate large-scale user-generated content and produces structured and analyzable outputs. These summaries drive a dynamic dashboard that visualizes evolving patterns in emotion, activity, and conversation topics. Built entirely on free-tier infrastructure, CognitiveSky achieves both low operational cost and high accessibility. While demonstrated here for monitoring mental health discourse, its modular design enables applications across domains such as disinformation detection, crisis response, and civic sentiment analysis. By bridging large language models with decentralized networks, CognitiveSky offers a transparent, extensible tool for computational social science in an era of shifting digital ecosystems.

paper repository

September 14, 2025 | arXiv preprint arXiv:2509.11443

A Transformer-Based Cross-Platform Analysis of Public Discourse on the 15-Minute City Paradigm

Gaurab Chhetri, Darrell Anderson, Boniphace Kutela, Subasish Das

This study presents the first multi-platform sentiment analysis of public opinion on the 15-minute city concept across Twitter, Reddit, and news media. Using compressed transformer models and Llama-3-8B for annotation, we classify sentiment across heterogeneous text domains. Our pipeline handles long-form and short-form text, supports consistent annotation, and enables reproducible evaluation. We benchmark five models (DistilRoBERTa, DistilBERT, MiniLM, ELECTRA, TinyBERT) using stratified 5-fold cross-validation, reporting F1-score, AUC, and training time. DistilRoBERTa achieved the highest F1 (0.8292), TinyBERT the best efficiency, and MiniLM the best cross-platform consistency. Results show News data yields inflated performance due to class imbalance, Reddit suffers from summarization loss, and Twitter offers moderate challenge. Compressed models perform competitively, challenging assumptions that larger models are necessary. We identify platform-specific trade-offs and propose directions for scalable, real-world sentiment classification in urban planning discourse.

paper repository

September 14, 2025 | arXiv preprint arXiv:2509.11449

Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models

Shriyank Somvanshi, Pavan Hebli, Gaurab Chhetri, Subasish Das

This study presents a deep tabular learning framework for predicting crash severity in electric vehicle (EV) collisions using real-world crash data from Texas (2017-2023). After filtering for electric-only vehicles, 23,301 EV-involved crash records were analyzed. Feature importance techniques using XGBoost and Random Forest identified intersection relation, first harmful event, person age, crash speed limit, and day of week as the top predictors, along with advanced safety features like automatic emergency braking. To address class imbalance, Synthetic Minority Over-sampling Technique and Edited Nearest Neighbors (SMOTEENN) resampling was applied. Three state-of-the-art deep tabular models, TabPFN, MambaNet, and MambaAttention, were benchmarked for severity prediction. While TabPFN demonstrated strong generalization, MambaAttention achieved superior performance in classifying severe injury cases due to its attention-based feature reweighting. The findings highlight the potential of deep tabular architectures for improving crash severity prediction and enabling data-driven safety interventions in EV crash contexts.

paper

August 26, 2025 | arXiv preprint arXiv:2508.19239

Model Context Protocols in Adaptive Transport Systems: A Survey

Gaurab Chhetri, Shriyank Somvanshi, Md Monzurul Islam, Shamyo Brotee, Mahmuda Sultana Mimi, Dipti Koirala, Biplov Pandey, Subasish Das

The rapid expansion of interconnected devices, autonomous systems, and AI applications has created severe fragmentation in adaptive transport systems, where diverse protocols and context sources remain isolated. This survey provides the first systematic investigation of the Model Context Protocol (MCP) as a unifying paradigm, highlighting its ability to bridge protocol-level adaptation with context-aware decision making. Analyzing established literature, we show that existing efforts have implicitly converged toward MCP-like architectures, signaling a natural evolution from fragmented solutions to standardized integration frameworks. We propose a five-category taxonomy covering adaptive mechanisms, context-aware frameworks, unification models, integration strategies, and MCP-enabled architectures. Our findings reveal three key insights: traditional transport protocols have reached the limits of isolated adaptation, MCP's client-server and JSON-RPC structure enables semantic interoperability, and AI-driven transport demands integration paradigms uniquely suited to MCP. Finally, we present a research roadmap positioning MCP as a foundation for next-generation adaptive, context-aware, and intelligent transport infrastructures.

paper repository

viewAllPublications

<Gaurab />

pages

connect

Undergraduate Researcher

Bachelor of Science in Computer Science

Why I Prefer Free Tier Infrastructure and What It Teaches You

What I learned building my first CLI tool

What Four Hackathons Taught Me About AI, Teamwork, and Sleep Deprivation

Do What You Want, Not What You Can - My Mantra for Building Meaningful Projects

CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media

A Transformer-Based Cross-Platform Analysis of Public Discourse on the 15-Minute City Paradigm

Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models

Model Context Protocols in Adaptive Transport Systems: A Survey