Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
This is a page not in th emain menu
Published:
Apache Spark has gained a lot of popularity recently for it’s vestatility and speed. It is among the Forbes’ top 10 best Data Analytics and BI platforms and tools of the 2020. I have started an online course on Apache Spark ecosystem and databricks which builds on top of Spark adding reliable and performance data pipelines.
Published:
This is my humble website where I post opereations research and data science related stuff. Check back often as I keep updating it with cool projects, and ideas on advanced analytics.
Published:
In the current clinical practice, priority-specific wait time targets are typically determined by the consensus of medical specialists and healthcare administrators. The problem with this rationale is that it does not consider the efficient use of clinical resources and the patient volume associated with each class. The aim of this method presented here is to determine wait time targets in a multi-priority patient setting in a systematic fashion that both respects clinically acceptable wait time targets and considers clinic size and demand distribution. This approach utilizes predictive, prescriptive and descriptive analytics. More specifically, simulation, deep neural network, regression, and inverse optimization approaches are used.
Published:
A central bank of Canada runs a distribution network and maintains an inventory of bank of notes at regional distribution points for multiple types of denominations. Both shortage and capacity overage of notes at the regional inventories need to be avoided. The goal of this research exploration is to come up with a forecasting model that can help the Bank Note Distribution System (BNDS) operations team to provide right amount of notes in the right place at the right time. Implemented models include classical time series approaches such as STL decomposition, TBATS, Dynamic Harmonic Regression (i.e., Arima with harmonic terms) and deep neural network approaches such as Multi-layer perceptron (MLP), Long-Short Term Memory(LSTM) and Light Gradient Boosting Method (LightGBM).
Published:
In many healthcare systems patients require multiple visits to a healthcare provider. In general, the first visit is known as the consult visit and all the subsequent visits are known as the follow-up visits. The latter typically occur according to predefined booking guidelines. A Markov Decision Process model is used to efficiently allocate available capacity to consult and follow-up visits in a dynamic fashion. To solve this model, a Linear Programming approach to Approximate Dynamic Programming (ADP) is used. The characteristics of the approximate optimal booking (AOP) policy for a multi-class patient setting is derived through simulation.
Published:
Canada is the 10th largest pharmaceutical market in the world. In 2015, drug sales amounted to $25 billion dollars. Public and private plans constitute 42% and 58% of the drug insurance market respectively. Generics drugs sell at an average of 36% of brand prices, represent 2/3 of prescriptions and amount to 1/4 of total drug expenditures. The goal of this project is to provide a method that helps determine if a new drug should be included in the formulary and whether an existing (i.e., covered) drug should be excluded from the formulary using Multi-Criteria Decision Analysis approach called UTADIS. The method was illustrated using statins data from the National Prescription Drug Utilization Information System (NPDUIS) and applied on oncology drugs using the pan-Canadian Oncology Drug Review (pCODR) recommendations.
Published:
In this project, I’ll develop prediction models using the house prices dataset from Aimes, IA. The goal is to demonstrate the 4 steps of the Data Science project lifecycle: Define, Discover, Develop and Deploy. First, I’ll establish simple baseline model using the OLS regression, and then I’ll develop a few predictive models, namely, random forest, xgboost and lightgbm regression models and compare the performance of these models against the baseline with the aim to get better predictive performance. The implementation of similar price prediction models will potentially allow the housing agencies (e.g., CMHC in Canada), real-estate companies, central and commercial banks, municipial governments and home buyers to make informed decisions with respect to market pricing.GitHub Repo
Published:
Dream Housing Finance company provides mortgage lending solutions to home buyers. Using this partial dataset, the company wants to automate the loan eligibility process (in real-time) based on customer information upon submission of the online application. These details include Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History and others. The goal is to classify the applications into Loan and No Loan classes. To this end, I’ll explore three classification models in this notebook. GitHub Repo
Published in Journal of Population Therapeutics and Clinical Pharmacology, 2013
C Piwko, G Koren, V Babashov, C Vicente, TR Einarson. Journal of Population Therapeutics and Clinical Pharmacology, 20(2), e149–e160, 2013.
Published in Value in Health, 2014
C Vicente, V Babashov, F Husein, F Saad, S Naidoo, S Holmstrom. Value in Health, 17(3), A89–A90, 2014.
Published in Ontario Health Technology Assessment Series, 2015
V Babashov, S Palimaka, G Blackhouse, D O’Reilly. Ontario Health Technology Assessment Series, 15(5), 1–31, 2015.
Published in Current Oncology, 2017
V Babashov, MA Begen, J Mangel, GS Zaric. Current Oncology, 24(1), 6–14, 2017.
Published in Clinical Oncology, 2017
V Babashov, I Aivas, MA Begen, JQ Cao, G Rodrigues, D D’souza, E Yu. Clinical Oncology, 29(6), 385–391, 2017.
Published in Medical Decision Making, 2020
V Babashov, S Ben Amor, G Reinhardt. Medical Decision Making, 40(4), 438–447, 2020.
Published in Production and Operations Management, 2023
V Babashov, A Sauré, O Ozturk, J Patrick. Production and Operations Management, 32(6), 1924–1941, 2023.
Published:
Published:
Published:
Published:
Published:
Published:
Published:
Click here for more info.
Published:
Published:
Undergraduate BCom course, University of Ottawa, Telfer School of Management, 2018
Undergraduate BCom course, University of Ottawa, Telfer School of Management, 2019
Undergraduate BCom course, University of Ottawa, Telfer School of Management, 2020
Published:
Determining clinically and operationally optimal wait time targets using simulation, deep learning, regression, and inverse optimization.
Published:
Agentic retrieval-augmented generation system for product discovery and enriched customer interactions, using vector search and LLM orchestration.
Published:
Forecasting bank note demand across a central bank’s regional distribution network to optimize inventory and avoid shortages or capacity overages.
Published:
Production ML model to identify false in-stock signals in retail inventory, preventing approximately $13M in annual lost sales.
Published:
Optimizing appointment scheduling for consult and follow-up visits using Markov Decision Processes and Approximate Dynamic Programming.
Published:
Multi-criteria decision analysis for pharmaceutical formulary inclusion/exclusion decisions using the UTADIS method.
Published:
End-to-end regression pipeline comparing OLS, Random Forest, XGBoost, and LightGBM for real estate price prediction.
Published:
Imbalanced classification pipeline for automating mortgage loan eligibility decisions based on applicant data.
Published:
Production recommender system for weekly offer personalization targeting loyalty customers, using transactional data and promotional calendars.
Published:
Decision-optimization system for staff scheduling and operational planning, delivering approximately $1M in annual savings.