6+ years of software work experience with domain expertise in ML foundation model modelling training and inference, distributed cloud infrastructure for ML infrastructure and big data pipelines. Demonstrated experience in technical leadership, establishing technical standards and best practices.
Download my resume hereA collection of projects authored by Arjun, and likely shared out with the community as an open source project.
Express Wiki gives you a personal online wikipidea powered by AI so you can read and organize your wikipidea.
Google Chrome Extension that generates a caption for an image using deep learning network RCNN
A collection of efforts to which I contributed, but did not create. Contributing back to Open Source projects is a strong passion of mine, and requires a considerate approach to learn norms, standards and approach for each community for a successful merge!
Worked and experimented with CLR algorithm which is implemented in Keras that uses cyclical learning rates to train deep neural networks.
Won Aurora hackathon 2021 for developing novel prefetch algorithm for data read ahead for Aurora MySQL 8.0 engine.
Won prestigious kudos award 2020 for my contributions at CoreTech Adobe among team of 100+ builders.
Winner Adobe shark tank event for best product idea pitch amongst 100+ new hires.
Best B.Tech. Project Award received from director IIT Roorkee amongst 1816 students.
Secured All India Rank of 401 among ~1,50,000 participants for Kishore Vaigyanik Protsahan Yojana
All India Senior School Certificate Examination 2015 certificate received from education minister India.
Building the next generation of Palmyra LLMs and optimizing training and inference techniques for them.
Lead for SageMaker HyperPod Distributed Training Recipes that automate the end-to-end training loop, including loading datasets, applying distributed training techniques, and managing checkpoints for faster recovery from faults. Lead for SageMaker Distributed Model Parallel library, which makes large scale distributed deep learning training easier and performant. Optimized multi-GPU training of LLM’s like Transformers, Llama and Mistral by leading 5-person team. Developed Aurora zero-ETL integration with Amazon Redshift and Amazon SageMaker to enables near real-time analytics and machine learning (ML) using data from Aurora.
Developed a novel prefetch algorithm for data read-ahead by incorporating clustering algorithms based on data access patterns, increasing SQL read throughput by 30%.
Developed Genshop which is Adobe's in-house deep learning python library for Generative models primarily GANs. Worked on CPU (single and multithreaded) and GPU workflows of Adobe Color Engine (ACE) and Adobe Font Engine(CoolType).
Interned with Big Data Experience Lab, Adobe Research on the project to personalized highlighting based on reader feedback. Incorporated information retrieval and reinforcement learning techniques for keyword extraction.
Researched on action recognition with feature fusion which involved combining traditional computer vision techniques with deep learning to improve the accuracy of action recognition.