Deploy, but don't drift (away from the data)
Image courtesy of censius.ai Director's cut * What determines the success of a data science project? Is it the company's data, organizational structure, or culture? Joshi et al. (2021) identified the top five reasons as (i) misapplication of the analytical techniques, (ii) unrecognized sources of bias, (iii) misalignment between the business objectives and data science, (iv) lack of design thinking (designing the solution for the wrong user), and (v) diversion of responsibilities (such as data scientists are expected to champion the project). My shorter list includes two issues that are related to model deployment and monitoring in one way or another: (i) ambitious questions with extremely low return on investment, and (ii) lack of infrastructure. I saw models that consumed the time of the data science team, other data science resources, and computing power for such a negligible return. Sometimes we would solve problems on a smaller scale and stop. Lack of infrastruct