Experience
-
IBM (Watson Healthcare & Life Sciences): Responsible for the successful delivery of data science solutions and services in a client consulting environment. Built an end-to-end system to process, manipulate, and analyze text from PDF documents and news articles. Developed entity and relationship extraction models using n-gram similarities and Markov chain achieving F1 score of 0.5. Employed dynamic systems that improve learning based on outcomes to get smarter with each iteration
-
Harvard Medical School: Simulated a Markov chain using the Monte Carlo method to mimic a genetic network. Used Principal Component analysis for feature engineering and logistic regression for classification. Statistical analysis and analytical calculations correlated strongly with the experimental observations
-
IIT Delhi: Optimized the performance of a diagnostic device with respect to the concentration, temperature, and humidity. Applied nonlinear regression analysis on experimental data sets yielding R2 values as high as 0.98
Projects
-
Knowledge Transfer in Reinforcement Learning: Analyzed RL agents in the context of generalizing prior experiences to new unseen environments. Trained the agent using Deep-Q leaning with experience replay algorithm coupled with different transfer learning regimes. Employed the policy learned in one environment to evaluate the difference in the agent’s learning time in a second environment
-
Semantic Segmentation: Implemented the Unet architecture with pixel shuffle for dense prediction on Cityscapes dataset. Devised a new training loss function to enforce background prediction for inconsistent structures. Achieved 0.826 mean IoU on the holdout set
-
DeepRecommender: Developed a model for the rating prediction task in recommender systems using Autoencoders. Refined the model using dense refeeding (to combat the sparsity of the user-item matrix) and dropout regularization. Achieved 0.925 RMSE on the holdout set of Amazon android apps ratings
-
Financial time-series forecast: Devised a Dual-Stage Attention-Based RNN to predict DJIA stock closing prices for the next 50 days from historic data. Incorporated feature engineering techniques including moving averages and exponential moving averages. Achieved 92% directional accuracy on the test set
-
Image Inpainting: Implemented DCGAN architecture (Deep Convolutional Generative Adversarial Networks) architecture to fill in the missing portions in an image (inspired by this paper). With a fully-convolutional neural network, the images produced are both locally and globally consistent
-
Face Detection and Recognition: Designed a 8 layered CNN for person recognition and bounding box regression. Augmented the training dataset by horizontal flipping and adding gaussian noise to the images. Model achieved a classification accuracy (for recognizing the employee by employee ID) of 94% on test set and 0.71 IoU
-
Flight Delay Prediction: Worked in a team of three to design a model to predict flight delays for flights departing from JFK airport based on historical data of flight delays, past weather data and US Bank holidays data. Achieved 0.78 AUC using ensemble methods
-
Sentiment analysis: Classified Amazon reviews as postive and negative using RNN and bidirectional multilayer LSTM. Achieved classification accuracy of 0.952 on the test set