Computer Engineering

Sangam Parajuli

Computer Engineering student at IOE Thapathali Campus, Tribhuvan University. Passionate about machine learning, speech processing, and building meaningful software.

About

I am a Computer Engineering student at the Institute of Engineering (IOE), Thapathali Campus, Tribhuvan University, with a strong foundation in programming, machine learning, and web development. My interests span artificial intelligence, speech processing, and software engineering.

Proficient in Python, C, C++, and C#, I have experience building applications ranging from machine learning models and web applications to game development. I worked as an AI Research Intern at Wiseyak, focusing on multilingual ASR and TTS systems supporting English, Nepali, Hindi, and Maithili.


Education

Bachelor's in Computer Engineering

Institute of Engineering (IOE), Thapathali Campus, Tribhuvan University

May 2022 – May 2026

Kathmandu, Nepal


Experience

AI Research Intern

Wiseyak

Dec 2025 – March 2026 Internship Certificate

Contributed to the Speech team's work on multilingual end-to-end Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems supporting English, Nepali, Hindi, and Maithili.

  • Led data refinement pipelines to ensure high-quality, clean training corpora for multilingual ASR model development.
  • Performed domain classification to organize and categorize speech data across diverse topics, improving model generalization and robustness.
  • Drove continuous improvement of ASR models through iterative experimentation, error analysis, and fine-tuning across all supported languages.
  • Collaborated cross-functionally to align speech system capabilities with real-world product requirements.

Skills

Programming Languages

Python C C++ C# JavaScript SQL

Machine Learning & AI

TensorFlow Detectron2 PaddleOCR OpenCV NumPy pandas Matplotlib scikit-learn

Web Development

Flask Django HTML5 CSS3 Bootstrap Tailwind CSS SQLite

Tools & Platforms

Git GitHub Unity Engine Qt Framework Linux REST APIs

Projects

Nepali and English Multilingual Audio to Audio

Python NVIDIA NeMo PyTorch Conformer ASR GPT-2 RAG FAISS ONNX INT8 Quantization
  • Designed and implemented a fully modular bilingual (English + Nepali) conversational AI pipeline, decoupling ASR, language modeling, RAG-based retrieval, translation, and text-to-speech into separately trainable and swappable components.
  • Fine-tuned Conformer-based ASR models for both languages, achieving 6.14% WER on LibriSpeech (English) and 31.40% WER and 11.51% CER on the Nepali FLEURS test set, surpassing all Whisper variants while using fewer parameters.
  • Built a custom decoder-only transformer language model (GPT-2 architecture) with a FAISS-backed Retrieval-Augmented Generation system trained on recent Nepali news, reducing hallucination and improving factual grounding (BERTScore-F1: 0.9303, ROUGE-L: 0.5862).
  • Applied ONNX export and INT8 dynamic quantization to compress the Nepali ASR model from 493 MB to 134 MB (71.5% reduction) with negligible performance change, enabling deployment on resource-constrained hardware.
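
As background on the WER and CER figures quoted above, here is a minimal sketch of how these metrics are commonly computed: the edit distance between reference and hypothesis, divided by the reference length, counted over words for WER and over characters for CER. This is not the evaluation code used in the project. The function names `edit_distance`, `wer`, and `cer` are illustrative, no text normalization is applied, and treating spaces as characters in CER is an assumption (conventions vary).

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate; spaces count as characters (an assumption)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

For example, `wer("the cat sat on the mat", "the cat sit on mat")` counts one substitution and one deletion against six reference words, giving 2/6 ≈ 0.333.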

Graphical Content Recognition with Editable Charts and Layout Retention

Python Detectron2 (R50_FPN_3x) PaddleOCR OpenCV TensorFlow
  • Converts static graphical documents (PDFs and images) into editable digital formats while preserving their original layout and structure.
  • Created a custom dataset to train a machine learning model that detects and processes text, tables, bar graphs, and pie charts.
  • Reconstructs each element in an editable form using tailored computer vision and OCR algorithms.

Sathi: A Minimalist Social Media

Python Django SQLite Bootstrap HTML5 CSS3 JavaScript
  • A minimalistic family- and friend-focused social network prioritizing meaningful connections.
  • Users share updates, photos, views, and comments within a trusted circle.
  • Built with Django backend, SQLite database, and Bootstrap responsive frontend.

CMMS (Class Marking and Management System)

C++ Qt Framework
  • A spreadsheet-style desktop application for managing class data in rows and columns.
  • Supports calculations on cell data and saving/loading files in CSV format.
  • Built with C++ and the Qt Framework, covering data input, manipulation, and CSV export.

Farty Rocket

Game Project
C# Unity Engine
  • A humorous side-scrolling arcade game inspired by Flappy Bird.
  • Players navigate a rocket propelled by strategic flatulence through obstacles.
  • Features 2D physics, collision detection, and engaging game mechanics.

Certifications


Get In Touch

I'm always open to discussing new opportunities, collaborations, or just having a conversation about technology and research.