My professional experience spans multiple roles and diverse
workplaces. I have 4+ years of experience across healthcare
analytics, consumer technology, marketplaces, financial
technology, and enterprise software.
At a Glance
Data Scientist Intern
Mathematica
Multivariate data modeling and causal inference for healthcare data
Random Forest and Decision Tree-based methods for predicting estimates using pruned estimators
Dimensionality reduction and explainable variable selection using Sparse PCA (PCA with lasso penalty)
Framework for database creation in Redshift from multi-source raw data using Pandas, Boto3, and Step Functions
Modeling standalone and crosswalk datasets in Redshift and RDS for a medical insurance pricing database
Deep Learning Researcher (part-time)
Adobe Research
Learning optimal image compression algorithms for computer vision pipelines
Generalized latent representations of salient features for object detection, segmentation, and classification
Latency and memory reduction using an autoencoder-based approach with information from pretrained hyperpriors
GPU training and testing with modular code using PyTorch Lightning
Software Engineer II
Uber
Batch analytics at scale (100PB+) using Spark and Hive query engines on HDFS and S3
Real-time streaming pipelines and analytics using Apache Flink engine on Kafka topics
Query performance and resource optimizations using Spark and SQL best practices
Python and Java application development to support monitoring and governance use cases
Writing PRDs and proposal documentation for adding new features and integrating data resources from new acquisitions
Data modeling for efficient querying, fault tolerance, and memory optimization using advanced SQL and design patterns
Software Engineer
SAP Labs
Workflow orchestration and CI/CD using Git and Jenkins
Infrastructure provisioning, configuration management, and deployments using Terraform, Ansible, and Chef
Analytics data lake and warehouse solutions using Hadoop HDFS, Spark, Hive, and Elasticsearch
Automated testing suite for application features using Java and Selenium
Health, server, and API traffic monitoring using Splunk, Zabbix, Kibana, and Grafana
More details in the resume