Rishi G. | BASIS Independent Schools

Rishi G. 2026 | BASIS Independent Fremont

Project Title: Building an MLOps Platform for Predicting Opinion Shifts
BASIS Independent Advisor: Mr. Dievendorf
Internship Location: Stanford University, 616 Jane Stanford Way, Stanford, CA 94305 (Hybrid)
Onsite Mentor: Mr. Samuel Tong

Deliberative discussions play a critical role in shaping public opinion, yet current machine-learning models for predicting opinion change remain difficult to reproduce, scale, and compare across studies. Although prior research has shown that transformer-based and neural network architectures can capture linguistic and behavioral signals associated with opinion shifts, these studies are typically limited by incomplete data pipelines, inconsistent preprocessing, and a lack of standardized experimental infrastructure. This project proposes and implements a cloud-based Machine Learning Operations (MLOps) platform designed to automate, scale, and improve the reproducibility of machine-learning experiments for predicting opinion shifts within deliberative conversations. Using large-scale deliberation datasets provided by the Deliberative Democracy Lab, the platform integrates data preprocessing, feature engineering, text-embedding storage, model training, evaluation, and version tracking into a unified and automated pipeline built on Amazon SageMaker and DynamoDB. Structured features are managed through SageMaker Feature Store, while high-dimensional textual embeddings are stored and queried using a custom vector database architecture. PyTorch-based models, including neural and transformer-based architectures, are trained and evaluated through SageMaker Pipelines. The platform quantitatively evaluates model performance using accuracy, stability, and robustness metrics, supported by cross-validation and bootstrapping to mitigate overfitting and small-sample bias. Results are benchmarked against prior deliberation-prediction studies to assess whether automated MLOps workflows improve both predictive performance and experimental reproducibility. The final outcome is an MLOps system and a research paper that demonstrates how scalable infrastructure can quantitatively advance the study of opinion dynamics and cognitive science in structured dialogue.

My Posts

Week 3: Data Preparation

March 13, 2026

Hi everyone, welcome back! This week focused on preparing the dataset so it can be reliably used for analysis later in the project. The goal was to take a large and messy collection of survey and participant files and transform them into a cleaner, more organized dataset. The first step was selecting the right source […]

Week 2: Structuring the Pipeline

March 6, 2026

Hi everyone, welcome back! This week involved a mix of reading and reconsidering how the technical workflow should actually be structured. One of the main sources I worked through was Continuous Delivery: Reliable Software Releases Through Build, Test, and Deployment Automation by Jez Humble and David Farley. Even though this book is about software engineering, […]

Week 1: Starting with Data Preparation

February 27, 2026

Hi everyone, welcome back! This is the first official week of the project! When planning this project out, I decided to dedicate the first two weeks to something that doesn’t always get much attention in machine learning: the data. Before any models can be trained or evaluated, the datasets have to be cleaned, structured, and […]

Week 0: Entering a World of Opinions

February 6, 2026

Ask someone why people change their minds, and you will get answers ranging from “good arguments” to “group pressure” to “emotions.” Ask how to measure that change, and things suddenly get much more complicated. Public opinion shapes elections, policy, and social movements. However, the tools we use to study opinion change often lag behind the […]

Page 1 of 1