Data-Driven Insights by Winston Wang

Discover the journey of a curious data scientist. Transforming data into actionable insights across various industries.

I’m Winston Wang, a Data Scientist With 4+ Years of Experience And Passion for Discovery and Innovation


Born and raised in China, my Chinese name is Dawei (pronounced like Dah-Way). I go by Winston. My journey into data science began with bioinformatics during my first master’s degree in biology, where my curiosity and passion for learning new fields allowed me to quickly master the algorithms that sparked my path into data science. Although my interests and experiences are broad — from bioengineering to data algorithms — this passion for discovery continues to drive me forward as I explore new challenges in data science.

I believe that data holds the key to unlocking endless possibilities and shaping a better future. Since becoming a data scientist, I have worked across healthcare, IoT, and transportation industries, crafting tailored solutions for various teams and challenges. My scientific training instilled a rigorous approach, while my engineering background has given me a practical mindset to ensure real-world applicability and efficiency in the solutions I develop. As I keep up with the latest developments in AI and data science, I publish tutorials and explainers on Medium—ranging from hands-on guides to breakdowns of recent research—translating new tools and concepts into clear, approachable insights for both myself and the broader community. I also built GridMaster, an open-source AutoML tool that reflects the same drive: making advanced techniques more accessible and practical.

Beyond my technical work, I take pride in being hands-on with everything I do. I designed, configured, and ran this website on an Orange Pi server at home, driven by the same curiosity guiding my professional journey. Inspired by traditional Chinese calligraphy, my website logo represents my cultural roots and the blend of modern technology with timeless values. My mission is simple: to make a meaningful contribution to the world, one data-driven solution at a time.

Winston Wang

Email

mail@winston-wang.com​

Phone

(312) 292-7535​

Time Zone

Central Daylight/Standard Time​

Education

Master of Science in Data Science

Computational Methods​

2020 – 2022

DePaul University
• Awards: Graduate with Distinction (GPA: 3.98/4.0)
• Leadership: Data Science Group (President)

Master of Science in Biology

Cellular and Molecular Biology

2016 – 2019

Illinois Institute of Technology

Bachelor of Engineering in Bioengineering

Biochemical Engineering

2010 – 2014

Tianjin University of Science and Technology
Awards: The 8th “Challenge Cup” Fosun National College Students Business Plan Competition
Bronze Prize
Project manager of a National Undergraduate Training Program for Innovation and Entrepreneurship

Experience

Machine learning scientist​

July 2022 – Present
TOGO Car-sharing

Streamlined interviews with business partners to redefine the vehicle flow and allocation problem, resulting in a machine learning model and aligned performance metrics tailored to operational needs.

Conducted a comprehensive assessment of predictive models — ranging from classical machine learning to advanced deep learning — leading to a 20% improvement in revenue forecasting accuracy.

Pioneered the development and application of a cutting-edge CNN-LSTM deep learning model, achieving 1–2 orders of magnitude reduction in RMSE and outperforming open-source baselines by ~45%.

Developed and launched an MLOps dashboard for real-time model monitoring and streamlined detection of concept and data drift, reducing model retraining time by 40%.

Fine-tuned Llama 3.1 70B on industry-specific data to automate business document processing and AI-powered customer support, while also developing semantic user behavior anomaly detection systems.

Junior Data Scientist

June 2022 – August 2022
AbbVie

Engineered a user-centric SQL Generator and Data Retriever tool that abstracted medical claim data (Optum CDM, Merative MarketScan) into intuitive concepts, reducing dependency on data engineers by 80%.

Orchestrated end-to-end development of an advanced pipeline, merging real-world medical and prescription claim data into a navigable interface for seamless data exploration.

Leveraged A/B testing to optimize interface usability and data flow, reducing data processing time by 60% across user types.

Built an early stage working prototype demonstrating the tool’s feasibility, projected to save up to $1M/year in license fees upon deployment.

Collaborated on feature engineering and model development using ICD-10-based comorbidity (Charlson index) and NDC drug features; XGBoost achieved ~94% AUC in identifying future high-risk patients.

Lead Data Scientist

January – June 2021
Instahub

Crafted the development of a strategic blueprint for the department, emphasizing AI-powered energy waste detection in building HVAC operations, guided development priorities, established standards and protocols, and directed the project lifecycle from inception to completion.

Designed and deployed a lightweight Extended Kalman Filter model to predict indoor temperature based on thermodynamic modeling and real-time weather data, achieving <0.9°F (0.5°C) MAE for short-term forecasts and reducing energy usage by 20% across pilot buildings.

Partnered with cross-functional teams to design and deploy an analytics dashboard for real-time monitoring of heat transfer data, improved operational efficiency by 25%, and enabled proactive maintenance strategies.

Certificates

machine learning – specialty

In Progress · Expected July 2025
AWS Certified

Show credential

Azure AI fundamentals

May 2025
Microsoft Certified

Show credential

Deep Learning specialization

February 2023
Coursera/DeepLearning.AI

Show credential

Transfer Learning for NLP with TensorFlow Hub

February 2023
Coursera

Show credential

My Skills

Programming and Software Dev.

Python, R, SQL, Java, Perl
Git, Docker, Nginx, WordPress (This site)
Waterfall/Agile Dev.
Microsoft Project

Data Analysis and Data Visualization

Pandas, NumPy, JAGS
Tableau, PowerBI, Matplotlib, Seaborn, ggplot2, Qlik Sense

Big Data Technologies

MongoDB, Apache Hive
Databricks, Apache Hadoop, Pig, Storm, Spark

Cloud and OS

Amazon Web Services (AWS Certified),
Azure (MS Certified),
Snowflake, Linux

Machine Learning and Artificial Intelligence

Scikit-learn, TensorFlow, PyTorch, MLX, LangChain
Recommender Systems, Reinforcement Learning, Transfer Learning
Deep Learning, Natural Language Processing (NLP), Large Language Models (LLMs) Finetuning

Model Performance Analysis, Hyperparameter Tuning, Model Deployment

Data holds the key to unlocking endless possibilities and shaping a better future.

Winston Wang

Endorsements​

get to know more


Explore my background and skills in detail.

View my resume.

Scroll to Top