Eat Mor Chikin, and fast? The story of Chick-fil-A’s multimodal data analysis and optimization

Image courtesy of Imago / Pubity Illustration - pubity.com

Introduction to the business case

The Wall Street Journal article reveals how Chick-fil-A is using innovative data collection and analysis methods to optimize its drive-through operations. The company has developed a "Film Studies" unit that combines drone footage with security camera data to create comprehensive "game films" of its restaurant operations. This multimodal data collection approach, inspired by NFL game analysis, allows the company to model traffic patterns, identify operational bottlenecks, and analyze service efficiency in drive-through operations. Read the article here.

The data-centric insights have led to significant operational improvements, including the development of new restaurant designs with elevated kitchens and multiple drive-through lanes capable of serving 700 cars per hour. The analysis also helped optimize staffing patterns, identify gaps in Wi-Fi coverage, and improve order processing workflows. One location reported a 50% increase in drive-through transactions after implementing the changes.

In this post, we'll answer the following question:

What's interesting about this case, and what does it say about the future of data collection, analysis, and data-centric optimization?

Academic's take

This case is really about data-centric optimization, so I'll leave most of the discussion to the Director, whose background is in management science and operations research. What I find most fascinating about this case is the novelty and multimodal nature of the data collection. With recent advances in deep learning, object recognition is now making its way into retail. This opens up a new venue for experimentation. This is interesting. Previously unthinkable data for a fast food chain can now be collected using drones, as detailed in the case, and combined with in-store camera data and data from transaction processing systems. The story does not go into the details of the modeling, but it does point out how data from different modalities complemented each other to solve a decades-old optimization problem: how to get customers through the drive-through quickly and keep them happy.

Chick-fil-A's combination of transactional data with video and other data to model and optimize the cars going through the restaurant's drive-through reminds me of the growing interest in multimodal applications. While the case does not hint at the methods used, I'd like to add a note on multimodal AI.

Multimodal AI: The future of data collection and analysis?

Multimodal AI describes machine learning models that are capable of processing and integrating information from multiple data modalities. These data modalities can include text, images, audio, video, and other forms of input. In contrast to traditional models, which are typically designed to process a single type of data, multimodal AI combines and analyzes different forms of data input to achieve a more comprehensive understanding of the problem.

Liang et al. (2022) describe three characteristics of multimodal AI: heterogeneity, connections, and interactions. Heterogeneity refers to the different properties, structures, and representations of modalities. A textual description of an event will differ fundamentally in quality, structure, and representation from a video of the same event. Connectivity refers to the complementary information shared between different modalities, which can be reflected in similarity scores. Finally, interactions refer to how different modalities interact when brought together.

With that as a side note, I will leave the discussion of the optimization problem to the next section.

Director's cut

In my opinion, the Chick-fil-A case and success story illustrates how to approach process optimization. There are several lessons to be learned. The first is relatively obvious: Optimization efforts generally require a focus on the entire system rather than its individual parts. The second is somewhat hidden between the lines: systems are complex, so optimization efforts must be coupled with experimentation. Let's take a closer look at each lesson.

Lesson one: Focus on the whole system, not the parts

Chick-fil-A realized early on that eliminating one bottleneck would only create another. When the ordering process is optimized, the kitchen becomes overloaded. When the kitchen capacity is improved (along with any layout changes to accommodate larger vent hoods), the teams that handle the bagging and delivery of orders are overwhelmed. If the bagging and delivery operations and stations are optimized, customers eating in their cars in the parking lot at that time would leave little room for walk-in customers. Walk-in demand would decrease, and the overall productivity of the system would decrease again.

Process optimization, when done right, requires a focus on the entire system rather than its individual parts. Optimizing individual operations independently fails to consider downstream effects and often results in suboptimal overall performance. A holistic (systems) approach ensures that improvements are consistent throughout the process and do not maximize the performance of one operation at the expense of another.

In Chick-fil-A's case, data collection efforts are not limited to drive-through lanes. The team not only monitors what's happening outside the restaurant, but also in the kitchens to understand the impact on the overall system. It appears that solutions are not siloed, and the downstream effects are also being analyzed.

Lesson two: Couple optimization with experimentation

Chick-fil-A pays for its franchisees' remodeling costs to experiment with different design ideas. Why does Chick-fil-A need to experiment in addition to all this data? Wouldn't simulations help determine which design would produce the best results?

Simulation models are only as good as their underlying assumptions. They may fail to mimic behaviors that may emerge over time and affect actual system performance, especially if the process has manual components. Humans are extremely creative and adaptive, organically finding better and more efficient ways to achieve the same goal. When bottlenecks occur, frontline workers may find workarounds. Experiments would reveal how people adapt and interact with processes to overcome system limitations. These adaptations are difficult to predict in simulations.

There is an additional benefit to using experimentation as part of the optimization process. Each franchisee has local insight into their unique market characteristics and customer base. As a result, the effectiveness of solutions and customization approaches may vary from market to market. By encouraging franchisees to test different designs, Chick-fil-A taps into the collective intelligence of its entire network and gathers a greater variety of ideas.

This is a great case of how process optimization, when coupled with experimentation, can provide a more complete understanding of system behavior and reveal insights that simulations alone cannot capture.

References

Liang, P. P., Zadeh, A., & Morency, L. P. (2022). Foundations and trends in multimodal machine learning: Principles, challenges, and open questions. arXiv preprint arXiv:2209.03430.

Podcast-style discussion of the article

The raw/unedited podcast discussion produced by NotebookLM (proceed with caution):

Search

Data Duets