Bhagyesh Rathi

SOFTWAREDEVELOPER& AI/MLENGINEER

About Me

I am a Software Developer and AI/ML Engineer with a passion for building intelligent, scalable systems. Currently pursuing my Master's in Artificial Intelligence at San Jose State University, my focus lies at the intersection of robust backend engineering and cutting-edge machine learning.

At Rakuten, I engineered high-impact microservices, implemented secure OAuth 2.0 architectures, and orchestrated GCP deployments with Kubernetes. Whether it's developing interactive RAG pipelines, optimizing distributed systems, or training predictive models, I thrive on turning complex technical challenges into seamless user experiences.

When I'm not writing code, you can find me exploring the latest advancements in LLMs or refining my problem-solving skills.

Experience

Software Engineer Intern

Rakuten · San Mateo, CA

May 2023 — Aug 2023
  • Engineered Kotlin microservices for Social Authentication (Google, Apple, Facebook), driving a 40% adoption rate and a 37% increase in conversion rate.
  • Developed robust REST APIs utilizing OpenID Connect and OAuth 2.0 to implement secure token-based authentication and authorization flows
  • Architected scalable infrastructure on Google Cloud Platform (GCP), orchestrating Docker containers with Kubernetes via automated GitLab CI/CD pipelines
  • Established comprehensive system observability by integrating OpenTelemetry with GCP Cloud Trace and structured Log4j logging for real-time performance insights
KotlinSpring BootGCPDockerKubernetesOAuth2.0CI/CD

Software Engineer Co-op

Rakuten · San Mateo, CA

Aug 2023 — Dec 2023
  • Architected and deployed a core internal SDK that abstracted complex service-to-service communications, successfully published to Artifactory via GitLab CI/CD
  • Engineered secure event listeners using Kotlin to intercept and process critical compliance signals (consent revocation, account deletion) from OAuth providers
  • Implemented robust security protocols by validating and decoding JSON Web Tokens (JWT) to securely authenticate inter-service requests
KotlinJWTGitLab CI/CDJaCoCoSonarQube

RESEARCH INTERESTS

Geometric Consistency: Latent Space Pruning for Chain-of-Thought Reasoning (In Progress)

This paper proposes and evaluates a novel unsupervised method, Geometric Consistency, designed to enhance reasoning reliability by filtering outliers within the latent vector space.

PythonSentence-Transformers

AutoML (In Progress)

Automated Machine Learning

PythonScikit-LearnFlaskReact

Projects

RAG Based Interactive Resume

Implemented a production-grade RAG deployment on a user-facing portfolio site

  • Ingestion: Parsed and chunked resume PDF into semantic segments
  • Embedding: Embedded text chunks using a transformer model to generate dense vector representations
  • Storage & Retrieval: Stored vectors in a Pinecone vector database and implemented retrieval to compare user queries against stored vectors
  • Generation: Passed retrieved context + query to Vertex AI (Gemini) LLM to generate accurate, context-aware responses
PythonTypescriptNextJSVercelPineconeVertex AI

FullStack AutoML platform and code generator

  • Engineered an AutoML system using Scikit-Learn & Flask that automates preprocessing, task detection, and parallel model training/evaluation
  • Designed a responsive React frontend with real-time interactive ROC/Scatter plots and confusion matrix for visualizations of top model
  • Reduced training latency by 40% implementing Stratified Sampling and dynamic model switching to handle large datasets efficiently
  • Developed a context manager to profile real-time CPU/RAM usage & a leaderboard to sort models by accuracy, time, or resource efficiency
  • Built a transpiler engine to enable one-click downloads for both serialized models (.pkl) and their reproduction code
PythonScikit-LearnFlaskReact

Bank Churn Data Analysis and Prediction using ML

Data analysis and prediction using ML

  • Analyzed data of 10,000 account holders at a Multinational Bank by doing exploratory data analysis with Pandas
  • Constructed a streamlined pipeline for training 5 machine learning models to predict customer churn using Scikit-Learn
  • Implemented XGBoost, Random Forest, KNN, SVM, and Naive Bayes models and compared them
  • Utilized N-fold cross-validation, F1-score, confusion matrix to evaluate the performance of each model
PythonScikit-LearnPandasSeabornMatplotlib

Skin Cancer Detection using CNN

CNN-based image classification for skin cancer detection

  • Developed a CNN using Scikit-Learn to classify skin lesion images into 7 cancer categories, achieving 80% accuracy
  • Preprocessed data with resizing, normalization, one-hot encoding, and oversampling to address class imbalance
  • Optimized model performance using the Adam optimizer, learning rate annealing, and hyperparameter tuning
PythonTensorFlowScikit-LearnPandasSeaborn

Skills & Technologies

Java
Python
JavaScript
Kotlin
SQL
React
NextJS
Spring
Spring Boot
Flask
Scikit-Learn
TensorFlow
PyTorch
Pandas
NumPy
Seaborn
LangChain
Hugging Face
REST APIs
Distributed Systems
Microservices
CI/CD
Java
Python
JavaScript
Kotlin
SQL
React
NextJS
Spring
Spring Boot
Flask
Scikit-Learn
TensorFlow
PyTorch
Pandas
NumPy
Seaborn
LangChain
Hugging Face
REST APIs
Distributed Systems
Microservices
CI/CD
Java
Python
JavaScript
Kotlin
SQL
React
NextJS
Spring
Spring Boot
Flask
Scikit-Learn
TensorFlow
PyTorch
Pandas
NumPy
Seaborn
LangChain
Hugging Face
REST APIs
Distributed Systems
Microservices
CI/CD
Java
Python
JavaScript
Kotlin
SQL
React
NextJS
Spring
Spring Boot
Flask
Scikit-Learn
TensorFlow
PyTorch
Pandas
NumPy
Seaborn
LangChain
Hugging Face
REST APIs
Distributed Systems
Microservices
CI/CD
Agile
Unit/Integration Testing
Monitoring & Alerting
Retrieval-Augmented Generation (RAG)
MongoDB (NoSQL)
MySQL
PostgreSQL
Redis (GCP MemoryStore)
Pinecone (Vector DB)
GCP
Docker
Kubernetes
AWS
Vercel
Vertex AI
Git
GitHub
GitLab
Postman
Jira
Confluence
Ollama
Agile
Unit/Integration Testing
Monitoring & Alerting
Retrieval-Augmented Generation (RAG)
MongoDB (NoSQL)
MySQL
PostgreSQL
Redis (GCP MemoryStore)
Pinecone (Vector DB)
GCP
Docker
Kubernetes
AWS
Vercel
Vertex AI
Git
GitHub
GitLab
Postman
Jira
Confluence
Ollama
Agile
Unit/Integration Testing
Monitoring & Alerting
Retrieval-Augmented Generation (RAG)
MongoDB (NoSQL)
MySQL
PostgreSQL
Redis (GCP MemoryStore)
Pinecone (Vector DB)
GCP
Docker
Kubernetes
AWS
Vercel
Vertex AI
Git
GitHub
GitLab
Postman
Jira
Confluence
Ollama
Agile
Unit/Integration Testing
Monitoring & Alerting
Retrieval-Augmented Generation (RAG)
MongoDB (NoSQL)
MySQL
PostgreSQL
Redis (GCP MemoryStore)
Pinecone (Vector DB)
GCP
Docker
Kubernetes
AWS
Vercel
Vertex AI
Git
GitHub
GitLab
Postman
Jira
Confluence
Ollama

Education

Masters of Science in Artificial Intelligence

Expected: May 2027

San Jose State University · San Jose, CA

GPA: 3.9

Relevant Coursework:

Machine LearningDeep LearningAI Threat IntelligenceNatural Language Processing (NLP)AI and Data Engineering

Bachelors of Science in Computer Science

August 2020 — May 2024

San Jose State University · San Jose, CA

Magna Cum Laude

Relevant Coursework:

Java OOPData Structures and AlgorithmsSoftware EngineeringRelational DatabasesData Visualization

Get in Touch

I'm always open to discussing new opportunities, collaborations, or interesting projects.

Ask AI