Kartik Gupta

My professional experience spans a range of roles and workplaces. I have 4+ years of experience across healthcare analytics, consumer technology, marketplaces, financial technology, and enterprise software.

At a Glance

Data Scientist Intern

Mathematica

Multivariate data modeling and causal inference for healthcare data

Random Forest and Decision Tree based prediction methods using pruned estimators

Dimensionality reduction and explainable variable selection using Sparse PCA (PCA with lasso penalty)

Framework for database creation in Redshift from multi-source raw data using Pandas, Boto3, and Step Functions

Modeling standalone and crosswalk datasets in Redshift and RDS for a medical insurance pricing database

Deep Learning Researcher (part-time)

Adobe Research

Learning optimal image compression algorithms for computer vision pipelines

Generalized latent representations of salient features for object detection, segmentation, and classification

Latency and memory reduction using an autoencoder-based approach with information from pretrained hyperpriors

GPU training and testing with modular code using PyTorch Lightning

Software Engineer II

Uber

Batch analytics at scale (100PB+) using Spark and Hive query engines on HDFS and S3

Real-time streaming pipelines and analytics using Apache Flink engine on Kafka topics

Query performance and resource optimizations using Spark and SQL best practices

Python and Java application development for supporting monitoring and governance use cases

Writing PRDs and proposal documentation for adding new features and integrating data resources from new acquisitions

Data modeling for efficient querying, fault tolerance, and memory optimization using advanced SQL and design patterns

Software Engineer

SAP Labs

Workflow orchestration and CI/CD using Git and Jenkins

Infrastructure provisioning, configuration management, and deployments using Terraform, Ansible, and Chef

Analytics data lake and warehouse solutions using Hadoop HDFS, Spark, Hive, and Elasticsearch

Automated testing suite for application features using Java and Selenium

Health, server, and API traffic monitoring using Splunk, Zabbix, Kibana, and Grafana