Email: rohan.kumar.suryan@gmail.com | Phone: +91 7406688825
Experienced Data Science & Analytics professional with 7 years of work experience in various industries such as IT, BFSI, Healthcare, Pharma, Auto, and Marketing. Skilled in AI / ML / NLP, Python / PySpark, Analytics / BI, Big Data / SQL, SAS / R, Technical solution designing and proposal development, Project & Stakeholder Management.
Bachelors of Technology Computer Science, IIIT Delhi India, 2012-2016
Responsibilities include providing solutions to multiple clients & refining based on feedback, creating detailed plans with timelines and tasks, and closely collaborating with clients and leaders to ensure timely completion of all subtasks. Managing the workflow of data scientists and analysts, to maintain high velocity and quality standards.
i. Sales Estimation, Location Scouting & Store Designing – For a Philippinean fast food chain built ensemble of demand cannibalization gravity models, regression using demographics, socio-economic factors like traffic, footfall, buying power to estimate sales for a new store location at channel level.
ii. New Customer Acquisition Model for a Safety Science Leader – Currently Leading development of XGBOOST model for generating conversion propensity for leads.
iii. Customer Segmentation for a Safety Science Leader – Led development of profiles for rule based segments and ML based segments using K-Means clustering in 2 months
iv. Cross-sell / upsell recommendation for a Safety Science Leader – Provided solution, guidance and feedback to onshore team for building a KNN based collaborative filtering model. Recommended cross-sell opportunities for 20% of the customers and upsell opportunities for 80% of customers
v. Lead Scoring Model for a Cloud Computing Giant – Client was using a 3rd party black-box tool for lead scoring. We built multiple in-house models within a quarter. The Champion model captured 95% conversions in top 30% leads.
vi. Customer Journey Optimization for a Software Firm – Utilized sequential clustering and Multi-Channel Attribution Modeling based on Markov Chains to identify the stage of leads / customers during their engagement journey and optimized their movement to the next stage. Achieved a 27% reduction in average conversion time from 51 to 37 weeks.
vii. Accounts Profiling using BI – Client wanted to easily navigate through list of customers buying certain products basis their interests, automated Power BI dashboard to segment customers for sales team. The new system replaced a manual excel reporting structure and migrated data to snowflake tables.
Led a team, developing models to mitigate credit / fraud losses & payment defaults across credit lines, cards, and loans. Provided strategic insights to key stakeholders in the underwriting and collections teams. Utilized big data from credit reporters for data-driven decision making.
i. Deposits Fraud Model for one of the largest US Bank – Fraudsters can run away even before a check is processed. For real time fraud detection client built suboptimal DataRobot’s model. Using LGBM & multiple modeling techniques we built robust models which generated daily alerts for bad check deposits. Our implementation resulted in savings of over $25 million annually.
ii. Updated BI & reporting post-merger of two banks - Collaborated with Data Engineering & BI teams across verticals to merge data and implement new reporting structure within a quarter.
iii. CFPB complaints analytics using deep learning to expedite monetary redemption within 15 days
iv. Early Payment Default Model – Reduced Losses to one third on all lending products
Built the data science team at Jaarvis & developed multiple features, Dashboards & Time Series Models in AWS for Ride Sharing apps.
i. Customer Churn Model for a Payment Wallet; improved retention by 23%
ii. Demand & Supply Time Series Models for Middle Eastern Ride Sharing platform; managing fleet in 30+ cities
Developed Wipro’s First Healthcare AI Product called Smartance.
i. Claims classification using NLP for an Insurance Giant; improved processing time by 47%
ii. Pharmacovigilance using NLP for a Pharma Giant - Client wanted to process thousands of forms where patients were listing out adverse reactions from drugs using AI, this data was needed for creating better drugs. We reduced the drug approval time by up-to 2 years.
Coding: Jupyter R-Studio SAS-EG Databricks;
NLP: NLTK Scikit-Learn Gensim Doc2Vec Glove Statsmodels Spacy Pandas NumPy PySpark MLlib Tesseract;
Neural Networks: Keras Theano TensorFlow;
Big Data Engineering: Hadoop Hive Impala Cloudera PySpark Airflow Snowflake;
MLOps : AWS Sagemaker IBM-Watson DataRobot;
BI: Tableau Power BI Excel;
ReST API : Flask Django;