Skip to content
Open to AI engineering and applied research opportunities

AI/ML Engineer and Applied AI Researcher

Shaurav Khadka

Building trustworthy AI systems for messy real-world data.

AI/ML engineer with 5+ years across software development and technical operations, plus production-oriented AI R&D experience. I build systems across NLP and RAG, temporal graph learning, computer vision with ROS2 deployment, reinforcement learning, and document intelligence.

From research idea to dependable implementation.

Scroll
Focus
  • Generative AI
  • Retrieval-Augmented Generation
  • Temporal Graph Learning
  • Natural-Language Processing
  • Semantic Search
  • Computer Vision
  • Robotics
  • Reinforcement Learning
  • Document Intelligence
  • AI Reliability

5+

Years across development and technical operations

Web systems, digital operations, and applied AI evaluation.

95.24%

Robot-image accuracy after Sim2Real adaptation

Improved from 2.38% through deployment-specific fine-tuning.

1,925

AirRaid PPO mean reward

Improved from 300 using frame skipping and frame stacking.

0.68

Precision@5 and Recall@5 in RedditPulse retrieval

Evaluated with 384-dimensional embeddings and FAISS.

Selected Work

Selected AI Systems

A focused selection of end-to-end systems, applied research projects, and production-oriented evaluations. The numbers shown below are verified project outcomes, not decorative metrics.

Generative AI and NLP

01

RedditPulse: Social-Media Intelligence Platform

Co-developed a seven-module NLP and LLM prototype over 1,989 filtered posts and 1,588 comments from 10 subreddits. The platform integrates preprocessing, NER, TF-IDF, sentiment modelling, zero-shot topic classification, sentence-transformer embeddings, FAISS retrieval, grounded insight generation, and a Streamlit dashboard. Retrieval evaluation reached Precision@5 = 0.68 and Recall@5 = 0.68.

Collaborative academic prototype.

  • Python
  • NLP
  • RoBERTa
  • RAG
  • FAISS
  • Streamlit

AI Reliability and Document Intelligence

02

Production AI Reliability and Document Intelligence

Built a repeatable OCR-evaluation workflow for veterinary-insurance invoices across multiple OCR configurations, transformation rules, confidence scores, and error codes. Supported adversarial fraud-detection evaluation and produced recommendations on traceability, reproducibility, validation dependencies, latency, and cost.

Production-oriented AI/ML R&D internship work.

  • Document Intelligence
  • OCR
  • AWS S3
  • Azure Document Intelligence
  • Python
  • AI Reliability

Computer Vision and Robotics

03

Fine-Grained Vision and ROS2 Robot Deployment

Co-developed a pasta-recognition and vision-to-action robot pipeline. A ResNet50 model achieved 90.69% accuracy across 21 subclasses; separate three-class experiments reached 99.21%. Deployment-specific fine-tuning and augmentation improved robot-image accuracy from 2.38% to 95.24%, with predictions mapped to ROS2 navigation commands.

Collaborative computer-vision deployment project.

  • ROS2
  • Computer Vision
  • PyTorch
  • ResNet50
  • Transfer Learning
  • Sim2Real

Reinforcement Learning

04

Reinforcement-Learning Benchmark Suite

Implemented and compared agents across Taxi-v3, LunarLander-v3, AirRaidNoFrameskip-v4, and custom MountainCar-v0 wrappers. Frame skipping and frame stacking improved AirRaid PPO mean reward from 300 to 1,925. A momentum-position reward design achieved a 100% final MountainCar success rate.

  • Reinforcement Learning
  • PPO
  • DQN
  • Q-Learning
  • Gymnasium
  • Stable-Baselines3

Temporal Graph Learning

05

Temporal GNN for Blockchain Fraud Detection

Modelled blockchain transactions as dynamic graphs and built a TGAT-style architecture with temporal encodings, attention-based message passing, and dual-task learning for fraud detection. Developed an end-to-end graph-learning pipeline and benchmarked temporal modelling against an XGBoost baseline.

  • PyTorch
  • NetworkX
  • TGAT
  • Temporal GNNs
  • XGBoost
  • Fraud Detection

Generative AI and Retrieval

06

LLM-Powered Financial Advisory and Multilingual AI Prototype

Built a retrieval-augmented decision-support prototype combining document retrieval, semantic search, vector embeddings, and context-aware LLM responses. Extended conversational workflows with multilingual language detection, translation support, and cross-lingual context management.

Research prototype. Not financial advice.

  • LLMs
  • RAG
  • Semantic Search
  • Vector Embeddings
  • Multilingual NLP
  • Conversational AI

Capabilities

Technical Capabilities

My work spans the complete AI-development lifecycle: problem definition, data collection, preprocessing, modelling, experimentation, evaluation, error analysis, system integration, deployment, and technical communication.

Operating Method

From ambiguity to dependable systems.

A disciplined engineering loop for turning research ideas into real-world AI systems that can be understood, tested, and improved.

  1. 01

    Define

    Frame the real problem, constraints, evidence, and failure modes before modelling.

  2. 02

    Build

    Create modular pipelines that connect data, models, retrieval, and interfaces.

  3. 03

    Evaluate

    Measure performance, inspect edge cases, and challenge assumptions beyond headline accuracy.

  4. 04

    Integrate

    Translate experiments into maintainable systems with operational context.

  5. 05

    Communicate

    Document trade-offs, limitations, and decisions clearly enough to act on.

Experience

Experience Built Across Systems and Operations

A practical foundation spanning production-oriented AI evaluation, software delivery, digital operations, and client-facing problem solving.

  1. TRUUTH

    AI/ML R&D Intern

    Feb 2026 — May 2026

    Sydney, NSW, Australia

    Document intelligence, OCR evaluation, adversarial fraud-detection analysis, and production-AI reliability recommendations across traceability, reproducibility, latency, cost, and validation dependencies.

    • Python
    • AWS S3
    • Azure Document Intelligence
    • Fraud Detection
  2. Ingleburn Convenience Store

    Operations & Digital Support Assistant · Part-time

    Oct 2024 — Present

    Ingleburn, NSW, Australia

    Maintain accurate transaction and inventory records, troubleshoot POS and basic network issues, support business email administration, and maintain digital business operations under time pressure.

    • Digital Operations
    • Data Accuracy
    • POS Support
    • Customer Service
  3. Picpoint Nepal

    Software Developer

    Aug 2022 — May 2024

    Kathmandu, Nepal

    Maintained web applications and databases, implemented security enhancements, monitored performance, and resolved production issues across client-facing digital systems.

    • Web Applications
    • Databases
    • Security
    • Production Support
  4. Threadscript

    Web Developer

    Jul 2020 — Jul 2022

    Kathmandu, Nepal

    Built web prototypes, integrated APIs, coordinated iterative delivery, and resolved usability and technical issues through structured debugging.

    • Web Development
    • API Integration
    • Prototyping
    • Debugging

Foundation

Education and Selected Credentials

Formal study, applied leadership programs, and focused technical learning.

Education

Macquarie University

Master of Information Technology · Artificial Intelligence

2024 — Present · Sydney, NSW, Australia

Relevant work: NLP and LLM systems, graph machine learning, advanced computer vision and action, reinforcement learning, AI governance, and an industry AI/ML R&D internship.

Education

London Metropolitan University · Islington College

BSc Computer Science · First Class Honours

2017 — 2021 · Kathmandu, Nepal

Ranked among the top 10 students in the cohort. Built recommendation, trip-planning, and database-backed systems across Python, PHP/MySQL, Oracle, C#, Java, and GUI development.

Selected Credentials

  • Global Leadership Program

    Macquarie University

    Structured co-curricular leadership development focused on global engagement and professional growth.

  • MQ Incubator × KPMG Design Thinking

    MQ Incubator × KPMG

    Entrepreneurship, innovation, and human-centred problem solving.

  • UPG Sustainability Leadership · 2024

    UPG

    Selected as one of 500 participants from a global applicant pool.

  • CS50x Computer Science

    Harvard University

  • Elements of AI

    University of Helsinki

  • Ethics and Governance of AI for Health

    World Health Organization

Independent Publishing

Books & Independent Publishing

Explore my authored and collaborative publishing catalogue: technology and well-being, children’s storytelling, illustration, editing, and creative production.

The catalogue below includes the seven distinct works listed on my Goodreads Author profile. Purchase links lead to Amazon Australia where a verified listing is available; Goodreads links provide the public catalogue record.

07

Works

Amazon

Buy

Goodreads

Discover

Featured authored publication

The Digital Equilibrium

Navigating Technological Advancement for Optimal Well-Being

An independent authored work exploring how technological progress can be balanced with human well-being and intentional living.

Catalogue

Complete publication catalogue

Authored and collaborative publishing work, presented with direct reading and purchase paths.

Illustrator · Editor

02

Children Stories

Bal Katha

A children’s-story collection created with Gokul Khadka, with illustration and editorial contribution by Shaurav Khadka.

Illustrator · Creative contributor

03

Joyful Stories

Joyful Stories

An illustrated story collection listed in the Goodreads Author catalogue.

Illustrator · Editor

04

Joyful Stories

Mazzako Katha

A colourful illustrated collection designed for younger readers and created with Gokul Khadka.

Illustrator · Creative contributor

05

Joyful Stories

Mazzako Katha · Alternate edition

An alternate catalogue edition of the illustrated Mazzako Katha collection.

Illustrator · Editor

06

Words of Wisdom

Amritvani

A collaborative illustrated publication centred on devotional reflections and words of wisdom.

Illustrator · Creative contributor

07

2 in 1 Joyful, Children Stories

Combined children’s-story edition

A combined illustrated edition bringing together children’s stories in a single collection.

About

Building Across Disciplines

I am an AI/ML engineer and applied AI researcher focused on building trustworthy systems for messy real-world data.

My work spans generative AI, NLP, semantic search, temporal graph learning, computer vision, robotics, reinforcement learning, document intelligence, and production AI reliability. I care about the full path from research idea to dependable implementation: defining the problem clearly, working with imperfect data, testing assumptions, evaluating limitations, integrating components, and communicating results honestly.

My background also includes software development, technical operations, leadership programs, and independent publishing. That range matters: useful AI systems are not only model artefacts. They are products, workflows, decisions, explanations, and responsibilities.

Stack

Tools and Technologies

  • Python
  • SQL
  • Java
  • C#
  • PHP
  • pandas
  • NumPy
  • Matplotlib
  • scikit-learn
  • TensorFlow
  • PyTorch
  • NetworkX
  • Hugging Face Transformers
  • Sentence Transformers
  • FAISS
  • Streamlit
  • Jupyter
  • Gymnasium
  • Stable-Baselines3
  • ROS2
  • Docker
  • Linux
  • AWS S3
  • boto3
  • Azure Document Intelligence
  • JSON
  • MySQL
  • Oracle
  • Git

Writing

Engineering Notes

Short technical reflections on applied AI systems, research decisions, evaluation, and real-world deployment constraints.

Lab Note

Evaluating AI reliability beyond headline accuracy

Why confidence calibration, error-code analysis, and traceability often matter more than a single accuracy number when systems leave the demo.

Lab Note

What sparse rewards teach us about system design

Lessons from reward shaping in MountainCar and continuous-control tasks — and how sparse feedback reshapes how we structure learning systems.

Lab Note

From keyword matching to semantic retrieval

Comparing TF-IDF baselines with transformer embeddings and vector search, and the practical trade-offs of moving to semantic retrieval.

Contact

Let’s build something useful.

I am open to AI engineering opportunities, applied research collaborations, and conversations about trustworthy real-world AI systems.