Loading...

Nirbhay Singh Naruka

Machine Learning Engineer | Generative AI Specialist | Data Engineer

I architect intelligent systems that transform data into business value. Specializing in Generative AI and ML pipelines, I build production-ready solutions like fine-tuned LLMs and RAG systems. My work bridges cutting-edge tech (TensorFlow, LangChain, Hugging Face) with measurable outcomes - from 70% cost savings at National Public Radio to Forbes 30 Under 30 recognition. With an MS in Business Analytics and Fortune 500 experience, I solve complex problems where data meets decision-making.

Professional Journey

An interactive timeline of my career progression. Click on a company to see the details.

Education & Academic Foundations

My education provides a strong base. It combines deep theory with practical skills, shaping my data-driven approach.

Cal Poly Logo

Master of Science in Business Analytics

California Polytechnic State University-San Luis Obispo, CA

(August 2023 - June 2024) • GPA: 3.8/4.0

This program honed my analytical mindset and cutting-edge technical skills for success in today's data analytics marketplace.

Key Learning Areas & Acquired Skills:

  • Advanced Machine Learning & Data Mining: Predictive analytics, model evaluation (e.g., ROC-AUC, F1-Score), deployment strategies, classification, regression, clustering.
  • Generative AI & Text Mining: Social media analytics, natural language processing (NLP), large language model (LLM) applications, prompt engineering, retrieval-augmented generation (RAG).
  • Data Management & Big Data Technologies: Relational database management systems, data warehousing, data marts, distributed data environments (NoSQL), SQL for data quality and integration, BigQuery.
  • Advanced Statistics & Econometrics: Hypothesis testing, linear and multiple regression, ANOVA, time series analysis, quantitative marketing analytics, causal inference.
  • Cloud Services & Applications for Business Analytics: AWS (SageMaker, S3, Lambda), GCP, cloud-based data solutions, MLOps practices.
  • Optimization & Prescriptive Analytics: Linear programming, simulation, decision-making under uncertainty.
  • Data Visualization & Communication: Storytelling with data, dashboard design (Looker, Power BI), effective presentation of insights.
  • Business Metrics & Analytics: Problem framing, data collection, cleaning, and analysis, ethical considerations in data, business process optimization.
  • Team Science & Innovation: Collaborative problem-solving, design thinking, cross-functional team leadership.
SRMIST Logo

Bachelor of Technology in Computer Science and Engineering

SRM Institute of Science and Technology (SRMIST), India

(July 2017 - May 2021)

This program provided a strong theoretical and practical foundation in computer science, instilling problem-solving abilities.

Key Learning Areas & Acquired Skills:

  • Core Programming & Data Structures: C/C++, Java, Python, advanced data structures (arrays, linked lists, trees, graphs) and algorithms (sorting, searching, dynamic programming).
  • Object-Oriented Design & Software Engineering: Principles of OOP, software development lifecycle (SDLC), project management, version control (Git).
  • Database Management Systems: Relational database concepts, SQL, database design, normalization, query optimization.
  • Operating Systems: Process management, memory management, file systems, concurrency, Linux fundamentals.
  • Computer Networks: Network protocols (TCP/IP), network architecture, data communication, cybersecurity basics.
  • Computer Organization & Architecture: CPU design, memory hierarchy, assembly language, digital logic.
  • Theory of Computation: Formal languages, automata theory, computability, complexity.
  • Web Technologies: HTML, CSS, JavaScript, basic web development frameworks (e.g., Flask concepts).
  • Mobile Application Development: Android development principles (Java/Kotlin concepts), Dart/Flutter for cross-platform.
  • Artificial Intelligence & Machine Learning: Introduction to AI concepts, basic search algorithms, supervised and unsupervised learning paradigms.
  • Mathematics for Computing: Discrete mathematics, calculus, linear algebra, probability, and statistics.

Technical Skillset

A visual overview of my core competencies, followed by a detailed breakdown of all my skills, filterable by category.

🧠 Machine Learning & AI

My expertise in building intelligent systems, from predictive models to generative AI.

TensorFlow, Keras, Scikit-learn, PyTorch, Hugging Face, LangChain, LLMs (GPT, BERT), Classification, Regression, Clustering, RAG.

💻 Programming & Databases

Foundational skills in coding and managing structured and unstructured data.

Python, SQL (BigQuery, PostgreSQL), R, JavaScript, NoSQL (MongoDB), Firebase, C/C++.

☁️ Cloud & Data Engineering

Building scalable data infrastructure and deploying solutions on leading cloud platforms.

AWS (S3, SageMaker, Lambda), GCP, ETL Pipelines, Data Modeling, Docker, CI/CD (GitHub Actions).

📊 BI & Visualization Tools

Transforming data into clear, actionable insights through powerful dashboards and reports.

Looker, Power BI, Tableau, Matplotlib, Seaborn, Plotly, Streamlit.

Comprehensive Skill Breakdown

Projects & Publications

Explore my work. This section highlights key projects and publications, from research to hands-on development.

Stock Analysis Model

Publication - ASIANCON 2021

I developed a predictive model for stock analysis. Using machine learning, this model improved investment decision-making. My research focused on analyzing prospective stocks to enhance outcomes.

Machine Learning Python

Churn Prediction

Project at Salesforce

Led development of HR churn prediction system using TensorFlow/Keras, integrating real-time GenAI insights from employee communications I improved accuracy by 20%. I deployed it via a React/Flask interface, enabling real-time analysis for HR.

View on GitHub →

Generative AI & LLM Workflows

Personal & Academic Projects

I built several Generative AI projects. This included fine-tuning GPT-2 for text generation. I also developed Retrieval-Augmented Generation (RAG) systems for document Q&A, using LangChain and modern APIs.

Read on Medium →

Consumer Behavior Analysis

Project at Claritas

Applied K-means clustering and Random Forest models to segment consumer behavior from diverse datasets, improving marketing personalization and strategy. Improved ETL pipeline efficiency by 30% using SQL and AWS Lambda.

Data Analysis AWS

My Personal Website

Portfolio

A curated collection of my professional work, projects, and detailed case studies showcasing my skills in action.

Explore My Work →

Forbes 30 Under 30 Showcase

Achievement at BAKUP

Contributed as a Full Stack Developer to a multi-language website with integrated APIs, which was showcased at the Forbes 30 Under 30 event in Detroit, 2019.

Full Stack Dev

MovieLens Recommendation System

Academic Project

Built a movie recommendation engine using collaborative filtering with TensorFlow and Keras, focusing on embedding layers and dot product scoring.

Recommender Systems TensorFlow

Financial Fraud Detection

Academic Project

Created a lightweight binary classifier using Neural Networks for imbalanced fraud datasets, employing Dropout and SMOTE for robust performance.

Classification Neural Networks

Heart Attack Risk Classification

Academic Project

Developed models (Logistic Regression, Decision Tree, KNN) to classify heart attack risk, providing actionable insights on risk factors.

Healthcare ML Classification

House Price Estimation

GSB 544 Project

Developed a property pricing model pipeline using XGBoost, RandomForest, Ridge, and Lasso regressions with extensive feature engineering.

Regression XGBoost

Recurrent Neural Networks (RNNs)

Academic Project

Built models for sequence prediction and text classification using TensorFlow and Keras (LSTM layers), applied to airline satisfaction and text datasets.

Deep Learning Time Series

CNNs for Image Classification

Academic Project

Trained convolutional models for high-accuracy image classification tasks using Keras and TensorFlow, achieving over 90% accuracy on MNIST.

Vision AI CNN

SRM Docker

Academic Project

Developed a Docker-based solution for SRM University, likely related to streamlining development or deployment environments.

Docker DevOps

BAKUP Website

Company Project

Contributed to the development of the BAKUP multi-language website with integrated APIs, a project showcased at Forbes 30 Under 30.

Full Stack Dev APIs

ToDo-Notes-App

Personal Project

Developed a mobile ToDo and Notes application, demonstrating proficiency in mobile app development principles.

Mobile App Android

ScrapSrm-Academia

Academic Project

A project focused on scraping academic data from SRM University, likely for analysis or utility purposes.

Web Scraping Data Collection

E-Commerce Grocery

Personal Project

Developed an e-commerce platform for groceries, showcasing full-stack development skills and database integration.

E-commerce Web Dev

Course App

Personal Project

Developed a web application for managing courses, demonstrating skills in user interface design and data management.

Web App UI/UX

CareIndia App

Academic Project

Developed a mobile application for "CareIndia", likely focused on social impact or healthcare, showcasing mobile development skills.

Mobile App Social Impact

App - Car Rental Service

Personal Project

Developed a mobile application for a car rental service, demonstrating full-stack mobile development capabilities.

Mobile App Full Stack

Website - Car Rental Service

Personal Project

Developed a web-based platform for a car rental service, showcasing web development and database integration.

Web Dev Database

Research & Publications

My contributions to academic research and technical discourse.

Prospective Stock Analysis Model to improve the investment chances using Machine Learning

Published at Asian Conference on Innovation in Technology (ASIANCON) 2021

This research focused on developing a predictive model using machine learning techniques to analyze prospective stocks, aiming to enhance investment decision-making and improve overall investment outcomes. It demonstrates a practical application of ML in financial analysis.

View Publication →

Medium Blog Articles

Ongoing Technical Contributions

  • Exploring Digital Art in the Cloud: Analyzing r/place 2023 Dataset with AWS SageMaker and S3.
  • SQL Simplified: Learn to Communicate with Your Database.
  • The Data Paradox: Emphasizing the importance of extracting meaningful information from vast amounts of data.
  • Cyber Threat Analysis (Two-part series): Leveraging AWS services (SageMaker, Lambda, CloudWatch) for automated and efficient data analysis in a cloud environment.
Read My Blog →
Contact Person

Let's Connect

I'm always open to discussing new projects, creative ideas, or opportunities to be part of an ambitious team.

Aimethods Logo
View My Business Solutions