Data Scientist, Engineer I
Samsung Research & Developement Noida · Full-time
July 2023 - present · 1 yr 2 mos
Noida, Uttar Pradesh, India
• Developed an AI model with 93% accuracy for predicting suspects in Samsung's big data services using defect data, integrated MLOps for continuous training, and reduced manual effort by 35%.
• Implemented a Gen-AI multi-modal system for generating social media captions with hashtags and emojis in Samsung Gallery, using RAG & LLM, and developed an award-winning API for on-device integration.
• Developed a multi-modal AI for real-time age-appropriate content blocking on smartphones, covering audio, video, and text formats, with on-device integration and award recognition.
• Developed an application for on-device TTS, STT, TTT feature verification comparing S24 on-device AI model with Open-Source Models. Generated detailed comparison Excel reports reducing manual effort by 90% and improving accuracy by 20%.
• Developed an application for multilingual relevant text extraction from images via OCR with LLM. Compared results with on-device gallery AI capture feature verification, generating Excel reports. Reduced manual efforts by 70% and enhanced ROI by 50%.
• Implemented video-to-audio conversion and automated S3 bucket uploads, streamlining training of videos in multiple designated categories for MLOps pipeline in Amazon Sage Maker. Generated Excel reports with detailed analysis and corrected data improved model performance by 30% and resulted in a 15% ROI enhancement.
Skills: 🧑💻⌨️
Python
Generative AI
Natural Language Processing (NLP)
Named Entity Recognition (NER)
LangChain
RAG
Large Language Models (LLM)
Massive Multitask Language Understanding (MMLU)
General Language Understanding Evaluation (GLUE)
Transformer Models
FastAPI
Computer Vision
TensorFlow
Long Short-term Memory (LSTM)
Transfer Learning
PyTorch
Audio Processing
Text-to-Speech
Speech To Text
Multilingual Speech Processing
Pandas
Vector Databases