Hey, I'm Adarsh Raj 👋

`< DATA & AI ENGINEER />`

Architecting enterprise-scale data and AI systems at GSK

🧠 About Me

I'm a Data & AI Engineer with 2+ years of experience at GSK (GlaxoSmithKline), building intelligent, enterprise-scale systems across metadata management, data governance, and AI-powered search.

I own the full engineering lifecycle — from designing semantic search with OpenAI & Azure AI Search, to shipping Apache Spark pipelines on Databricks, to building FastAPI microservices and enterprise integrations with ServiceNow, Collibra, and SailPoint.

🔭 Currently building AI-powered metadata and governance platforms at GSK
🤖 Passionate about LLMs, semantic search, and intelligent automation
🛠️ I own end-to-end: architecture → backend → DevOps → delivery
🌱 Exploring Databricks Genie (NLQ) and NLP-driven analytics
🏆 Eliminated 90% manual effort and saved ~200 hours/year through automation
📍 Based in Bengaluru, India — Silicon Valley of India

📊 Impact at a Glance

Metric	Result
Manual effort eliminated	90% (~200 hrs/year saved)
Data sync time reduction	99.6% (30 min → 7 sec)
Metadata retrieval improvement	87.5% (8 hrs → 1 hr)
Tutorial views	50K+ developers reached

🚀 Enterprise Projects

🔍 Rover — Semantic Search & Conversational AI Platform

OpenAI · Azure AI Search · FastAPI · Python

Designed and implemented enterprise-scale semantic search and conversational query capabilities for the Rover metadata platform at GSK. Metadata assets are now discoverable via natural language, powered by OpenAI embeddings and Azure AI Search.

⚡ ServiceNow–Collibra Governance Automation

ServiceNow · Collibra · Azure Function Apps · GraphQL · Python

End-to-end integration automating governance workflows between ServiceNow and Collibra — eliminating 90% of manual effort (~200 hrs/year) and achieving a 99.6% reduction in data sync time (30 min → 7 sec).

🔥 Spark Metadata Ingestion & Cataloguing Pipelines

Apache Spark · Databricks · Azure Data Factory · Delta Lake

Owned development and optimisation of Spark-based ingestion pipelines on Databricks, achieving significant runtime reductions and automating metadata cataloguing across enterprise systems. Integrated DQ outputs into Collibra for transparent governance reporting.

🛠️ Tech Stack

AI & Search

Data Engineering

Backend & APIs

DevOps & Cloud

Enterprise Integrations

🎓 Education

B.Tech — Computer Science & Information Science Engineering M.S. Ramaiah University of Applied Sciences, Bengaluru | 2020–2024 | CGPA: 8.2/10.0

🏅 Certifications

Certification	Issuer	Link
Career Essentials in Generative AI	Microsoft & LinkedIn	Link
Machine Learning with Python	Cognitive Class	Link
SQL (Advanced)	HackerRank	Link
Introduction to Cybersecurity	Cisco	Link
Python Masterclass	Udemy	Link
Build Smarter, Scalable Al Agents with UiPath	GeeksforGeeks	Link

🏆 Achievements

🥇 Best Project Award — out of 250 students at M.S. Ramaiah University
🎯 Rank 5 on GeeksforGeeks at university level
🏁 Rank 281 in GFG Coding Contest 119
📺 50K+ views on web development tutorial

📈 GitHub Stats

"Turning data into intelligence, one integration at a time"

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hey, I'm Adarsh Raj 👋

`< DATA & AI ENGINEER />`

🧠 About Me

📊 Impact at a Glance

🚀 Enterprise Projects

🔍 Rover — Semantic Search & Conversational AI Platform

⚡ ServiceNow–Collibra Governance Automation

🔥 Spark Metadata Ingestion & Cataloguing Pipelines

🛠️ Tech Stack

AI & Search

Data Engineering

Backend & APIs

DevOps & Cloud

Enterprise Integrations

🎓 Education

🏅 Certifications

🏆 Achievements

📈 GitHub Stats

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Hey, I'm Adarsh Raj 👋

< DATA & AI ENGINEER />

🧠 About Me

📊 Impact at a Glance

🚀 Enterprise Projects

🔍 Rover — Semantic Search & Conversational AI Platform

⚡ ServiceNow–Collibra Governance Automation

🔥 Spark Metadata Ingestion & Cataloguing Pipelines

🛠️ Tech Stack

AI & Search

Data Engineering

Backend & APIs

DevOps & Cloud

Enterprise Integrations

🎓 Education

🏅 Certifications

🏆 Achievements

📈 GitHub Stats

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

`< DATA & AI ENGINEER />`

Packages