Hello, I'm

Indra Prasad Sapkota

AI Researcher & Computer Engineer

Exploring the frontiers of Deep Learning, Large Language Models, and Computer Vision. Building intelligent systems that make a difference.

Indra Prasad Sapkota
Scroll Down

About Me

Indra Prasad Sapkota

AI Researcher & Engineer

I am a final-year Computer Engineering student with a strong focus on deep learning, large language models, and computer vision. I have hands-on experience re-implementing state-of-the-art LLM models (LLaMA-3 and Pali-Gemma) and computer vision applications along with deployment and backend technologies.

I aim to pursue a career in AI research and engineering, working on scalable and impactful intelligent systems.

Pokhara, Nepal
+977 9845242839
bishal.sap21@gmail.com

Technical Skills

AI/ML
Deep Learning Computer Vision LLM RAG MLOps
Frameworks
PyTorch TensorFlow Huggingface Langchain Langgraph
Development
JavaScript TypeScript React Node.js React Native
Backend & Tools
MongoDB SQL Docker Django Git
Download Resume

Education

Apr 2022 — Apr 2026

Bachelor in Computer Engineering

Pashchimanchal Campus, IOE, Tribhuvan University

Currently in the 4th year of my course with an aggregate of around 84%.

Deep Learning Focus Computer Vision LLM Research
Aug 2018 — Aug 2021

High School

SOS Hermann Gmeiner Secondary School

Graduated with a 3.7 GPA with A+ in Mathematics and Physics.

A+ in Mathematics A+ in Physics

Projects

Explore my work across AI Research, Software Development, and Engineering

All Tags LLM Computer Vision Deep Learning Transformer NLP Multimodal YOLO TTS PyTorch Huggingface React Node.js Next.js WebSocket Mobile Full Stack
AI Research

LLaMA 3.1 8B Implementation

Re-implemented every model element of LLM from tokenizer to transformer model with grouped attention according to LLaMA paper from scratch. Tested inference with open source weights from Huggingface.

LLM PyTorch Transformer Deep Learning
View Project
AI Research

Vision LLM (Pali-Gemma)

Re-implemented Google Pali-Gemma vision LLM model from scratch with SigLIP encoder for image embeddings and Gemma language model for conditional generation.

Vision LLM Computer Vision PyTorch Multimodal
View Project
AI Research

Swarlekha - Zero Shot Voice Cloning

Developed a zero-shot voice cloning and text-to-speech model for English and Nepali using Resemble AI Chatterbox model with T3 language model and S3Gen audio model.

Voice Cloning TTS Deep Learning NLP
View Project
AI Engineering Jun 2024

uTECHsil - Object Detection

Built traditional items detection system as part of YANTRA Hackathon using YOLOv8 with custom scraped dataset. Achieved mAP50 of ~0.9 and deployed using Flask and React.

mAP50: ~0.9 Hackathon Project
YOLOv8 Flask React Real-time
View Project
Development

Sambad - Real-time Chat Platform

Full-featured real-time chat application with server creation, channels, WebSocket messaging, audio/video calls using LiveKit, and complete authentication system.

Real-time Messaging Video/Audio Calls Server Management
Next.js MySQL WebSocket LiveKit
View Project
Development

Marketplace - E-commerce

Developed a full-stack web app using MERN stack with complete authentication system, Stripe API payment integration, and modern web development tools.

MERN Stripe MongoDB React

Achievements

1st Place

Project Demonstration - Innosphere

Pokhara University

June 2025

Demonstrated the PlantCare cauliflower disease detection app and secured first place.

2nd Place

Yantra Hackathon

Object Detection Project

June 2024

Built uTechsil object detection model and achieved second place.

3rd Place

Global IME Bank National AI/ML Hackathon

Revenue Prediction Model

April 2025

Built a revenue prediction model using real-world dataset in the national level hackathon.

Mentor

AI Bootcamp

ICES Collaboration

August 2025

Teaching students in college who are starting in the field of AI.

Extra-Curricular Activities

Apr 2025

Global IME Bank National AI/ML Hackathon

Achieved third position in national level AI/ML hackathon where we built a revenue prediction model using real world dataset.

Hackathon AI/ML National Level
Dec 2024

ANAIS V By NAAMI

Annual AI School organized by NAAMI where we learned about AI technologies from industry experts and professors on topics like Graph Neural Networks, 3D Reconstruction, and Large Language Models.

AI School GNN LLM
Jun 2024

Yantra Hackathon - Second Place

Built a simple object detection model (uTechsil) during this hackathon and achieved second place.

Hackathon Object Detection 2nd Place
Jun 2023

ANAIS IV By NAAMI

Annual AI School organized by NAAMI where we learned about AI technologies from industry experts and professors on topics like Geometric Deep Learning, Computer Vision, and NLP.

AI School Computer Vision NLP
Dec 2022

Mentorship Program by ICES

Built the marketplace project (e-commerce web app with features from JWT authentication to Stripe payment integration) during this program and presented to the audiences.

Mentorship Web Development MERN Stack

Certificates & Courses

DeepLearning.AI Coursera

Machine Learning Specialization

Oct 2023 — Feb 2024
  • Supervised Machine Learning: Regression and Classification
  • Advanced Learning Algorithms
  • Unsupervised Learning, Recommenders, Reinforcement Learning
DeepLearning.AI Coursera

Deep Learning Specialization

Jul 2024 — Oct 2024
  • Neural Networks and Deep Learning
  • Improving Deep Neural Networks: Hyperparameter Tuning
  • Structuring Machine Learning Projects
  • Convolutional Neural Networks
  • Sequence Models

Publications

Research papers and academic contributions

Published

Cauliflower Disease Detection using YOLO Models

Presented at the IEEE International Conference on ICT and Photonics

A research paper on detecting diseases in cauliflower plants using YOLO object detection models for agricultural applications.

Research Interests

Large Language Models Computer Vision Multimodal Learning Speech Synthesis Neural Architecture MLOps

Get In Touch

Feel free to reach out for collaborations, research opportunities, or just a chat!