Brijesh Kumar

Data Scientist

I'm Brijesh Kumar, a data science enthusiast who loves turning complex problems into simple, intelligent solutions. I explore how AI, analytics, and real-world data can work together to drive innovation and make technology more human.

Projects
19
Total
Awards Wins
7
Followers
2800+
Tech Stack
38
Domains
6
<AI/>
{ data }
ML
[]
()
//
ML
Brijesh Kumar portrait

Skills

Explore my technical skills organized by category

Skills

Web Development

Building robust full-stack architectures using Next.js, FastAPI, and Spring Boot, with specialized expertise in secure Stripe payment integrations and AWS cloud deployment.

  • HTMLHTML
  • CSSCSS
  • JavaScriptJavaScript
  • TypeScriptTypeScript
  • ReactReact
  • Next.jsNext.js
  • Tailwind CSSTailwind CSS
  • Node.jsNode.js
  • ExpressExpress
  • MongoDBMongoDB
  • FirebaseFirebase
  • VercelVercel
  • Google Cloud PlatformGoogle Cloud Platform
  • AWAWS

Experience

  • Experience
    Current Role
    Machine Learning and Data Analyst Researcher

    ASU Biodesign Institute

    📍 Tempe, AZ
    Apr 2025 – Present

    Leading ML research at ASU's cutting-edge Biodesign Institute, where genomic data meets cloud-scale machine learning to transform autism therapy outcomes.

    Tech Stack

    Python
    R
    SQL
    XGBoost
    Random Forest
    ARIMA
    SARIMAX
    Jupyter
    Git
    Power BI
    Matplotlib

    Key Achievements

    • Analyzed 10TB microbiome data using Python, R, SQL; performed EDA & PCA to engineer features, improving prediction accuracy by 27%.
    • Built healthcare prediction models using logistic regression, XGBoost, random forest; cross-validation tuning improved AUROC by 0.21.
    • Built longitudinal forecasts using ARIMA, SARIMAX in Python to model treatment response trends, improving outcome stability by 18%.
    • Automated reproducible analyses using RMarkdown, Jupyter, Git, standardizing workflows and reducing reanalysis effort by 35%.
    • Designed healthcare dashboards with Power BI, Matplotlib, visualizing microbiome shifts that guided three data-driven research decisions.
  • Experience
    Software Engineer

    AWL Metaverse Pvt. Ltd

    📍 India
    Mar 2024 – Dec 2024

    Engineered scalable ML and NLP pipelines on AWS, building intelligent systems that automate clinical workflows and deliver real-time analytics.

    Tech Stack

    Python
    AWS SageMaker
    Transformers
    spaCy
    RAG
    Power BI
    SQL
    NLP

    Key Achievements

    • Built end-to-end Python ML pipelines on AWS SageMaker with monitoring, improving model AUC by 0.08 and cutting false alerts 20%.
    • Developed clinical NLP pipelines using Transformers, spaCy in RAG systems, automating summaries and cutting clinician effort 50–70%.
    • Created analytics dashboards using Power BI, SQL, Python for real-time monitoring, reducing operational incident resolution time by 60%.
  • Experience
    Software Developer

    Quicket Solutions

    📍 India
    May 2023 – Mar 2024

    Architected high-performance payment infrastructure with modern cloud technologies, delivering enterprise-grade reliability and blazing-fast semantic search.

    Tech Stack

    React
    Spring Boot
    PostgreSQL
    pgvector
    AWS EC2
    RDS
    CloudFront

    Key Achievements

    • Built Quicket Pay with React+Spring Boot+PostgreSQL; normalized schema and indexed queries reduced lookup latency 30% at scale.
    • Implemented vector embeddings in PostgreSQL with pgvector for semantic search; similarity queries on 100K+ records in under 200ms.
    • Deployed AWS infrastructure with EC2+RDS multi-AZ+CloudFront; caching and failover improved throughput 40%, hit 99.9% uptime.

Featured Projects

Click the arrow or press space to explore each project

JobHunt: AI-Powered Career Platform

JobHunt: AI-Powered Career Platform

A full-stack career platform utilizing RAG and vector embeddings to deliver personalized resume-job matching, increasing application rates by 40%. Engineered resilient microservices for auth and search handling 180 RPS with zero errors, and implemented automated Java web scraping to expand job databases by 3x.

Tech Stack

Next.js
Spring Boot
MongoDB
Java
RAG
Vector Embeddings
Microservices

Tags

GenAI
Full Stack
Automation

Real-Time Fashion Recommender

Real-Time Fashion Recommender

A high-performance recommendation engine using CLIP and ResNet50 for vector-search, boosting user engagement by 32%. Features a real-time React+TypeScript swipe UI backed by an optimized FastAPI service with ANN, achieving sub-150ms p95 latency for 10k+ users with 99.99% uptime.

Tech Stack

Python
FastAPI
TypeScript
React
CLIP
ResNet50
PostgreSQL
AWS

Tags

Computer Vision
Deep Learning
E-commerce

Nexus: AI-Powered Smart Contact Manager

Nexus: AI-Powered Smart Contact Manager

An intelligent contact management system leveraging Spring Boot and Kafka for microservices orchestration. Integrated AWS Bedrock and FAISS for RAG-based contact enrichment, raising match accuracy by 70%. Optimized with Redis read-through caching to deliver sub-3s network summaries.

Tech Stack

Next.js
Java
Spring Boot
Kafka
AWS Bedrock
FAISS
Redis
Docker

Tags

Microservices
GenAI
Cloud Engineering

SlideSage AI

SlideSage AI

An AI-powered learning assistant that transforms dense concepts into short, dialogue-based explainer videos with quizzes, streaks, and badges. If a student doesn’t understand a concept, SlideSage instantly regenerates fresh analogies until it clicks — making learning adaptive, fun, and addictive.

Tech Stack

Next.js
TailwindCSS
shadcn/ui
Groq
Pexels API
TypeScript
Vercel

Tags

AI
EdTech
Video Generation

Doctor Knowledge Engine (RAG)

Doctor Knowledge Engine (RAG)

A hybrid RAG healthcare system combining semantic and keyword search with Sentence-BERT, improving top-k precision by 75%. Deployed a local LLaMA2 model with strict context-only guardrails to achieve 100% test-query accuracy, served via a robust Flask API on AWS ECS.

Tech Stack

Python
Flask
LangChain
ChromaDB
LLaMA2
Docker
AWS ECS

Tags

NLP
Healthcare AI
LLMs

Spotify Top Songs Analytics Dashboard

Spotify Top Songs Analytics Dashboard

An end-to-end data pipeline ingesting tracks via Spotify API, automating ETL to reduce manual prep by 80%. Utilized Tableau and Python for EDA to identify viral trends and trained Scikit-learn models predicting playlist inclusion with 0.92 R2 score.

Tech Stack

Power BI
Python
SQL
DAX
Spotify API
Scikit-learn
Tableau

Tags

Data Analytics
Visualization
Machine Learning

Education

Arizona State University

Currently Pursuing
Tempe, AZ
Jan 2025 – Present
4.00 / 4.00
CGPA
18 / 18
Credits Earned
7
Projects
2
Hackathons Won

M.S. in Computer Science

Key Coursework:

  • Statistical Machine Learning
  • Semantic Web Mining
  • Data Mining
  • Data Processing

Vellore Institute of Technology

Completed
Chennai, India
Aug 2019 – May 2023
4.00 / 4.00
CGPA
160 / 160
Credits Earned
12
Projects
3
Hackathons Won

B.Tech in Computer Science (AI/ML Specialization)

Key Coursework:

  • Machine Learning
  • Deep Learning
  • Natural Language Processing
  • Data Structures and Algorithms

Awards & Recognition

Hacktoberfest Super Contributor - DigitalOcean & GitHub
Hacktoberfest Super Contributor

DigitalOcean & GitHub

November 2025

Recognized as a Super Contributor for exceptional open-source contributions during Hacktoberfest 2024. Made significant impact across multiple repositories, demonstrating technical excellence and collaborative spirit in the global developer community.

Finalist — Code for Good 2024 - JPMorgan Chase & Co.
Finalist — Code for Good 2024

JPMorgan Chase & Co.

November 2024

Selected as a finalist among thousands of participants for developing innovative tech solutions addressing real-world social challenges. Earned the opportunity for direct interviews with JPMorgan Chase engineering teams, recognizing technical prowess and social impact focus.

Winner — CrozeAI Hackathon - CrozeAI
Winner — CrozeAI Hackathon

CrozeAI

August 2023

Won the CrozeAI Hackathon by architecting an intelligent routing algorithm for NYC bike navigation. Implemented advanced pathfinding algorithms that optimized route selection based on safety, efficiency, and real-time traffic patterns, delivering seamless urban mobility solutions.

Top 0.2% — CodeKaze Programming Contest - Coding Ninjas
Top 0.2% — CodeKaze Programming Contest

Coding Ninjas

March 2024

Achieved All India Rank 315 out of 150,000+ participants and secured College Rank 3 at VIT in the prestigious CodeKaze programming competition. Demonstrated exceptional problem-solving skills and algorithmic expertise in highly competitive coding challenges.

Publications

Early Dementia Detection and Classification of Stage by Efficient Segmentation and Artificial Neural Network
Early Dementia Detection and Classification of Stage by Efficient Segmentation and Artificial Neural Network

Authors: Brijesh Kumar, et al.

Journal: Springer Nature Conference Proceedings (2024)

View Publication

Contact

Let's Connect

Feel free to reach out for collaborations, opportunities, or just to say hi!

Location

Tempe, Arizona

Follow Me

Send Me a Message

I'd love to hear from you!