Kenny Shirley
Supply Chain Optimization Technology
Amazon, New York, NY
JSM
Baltimore, MD
August 1, 2017
Talk reviews
Literature Review: 3 taxonomies for interactive graphics
A small example of interactive graphics to do forecasting @ Amazon
Three very nice review papers: (1) Stats, (2) HCI, (3) InfoVis
Propose a hierarchy of interactive graphics, focusing on
manipulating (vs. rendering)
multivariate data (vs. univariate or other types)
for exploratory analysis (vs. explanatory)
Quote: “The gain resulting from animation and real-time control is not incremental but a quantum leap”
Type by Task Taxonomy (TTT): 7 data types x 7 interactive tasks
Overview | Zoom | Filter | Details-on-demand | Relate | History | Extract | |
---|---|---|---|---|---|---|---|
1-dimensional | |||||||
2-dimensional | |||||||
3-dimensional | |||||||
temporal | |||||||
multidimensional | |||||||
tree | |||||||
network |
Quote: “Visual Information Seeking Mantra: Overview first, zoom and filter, then details-on-demand”
One could map elements of the Buja/Cook/Swayne taxonomy to this one…
Overview | Zoom | Filter | Details-on-demand | Relate | History | Extract | |
---|---|---|---|---|---|---|---|
1-dimensional | |||||||
2-dimensional | |||||||
3-dimensional | 3-d rotation | ||||||
temporal | |||||||
multidimensional | |||||||
tree | |||||||
network |
Quote: “Visual Information Seeking Mantra: Overview first, zoom and filter, then details-on-demand”
One could map elements of the Buja/Cook/Swayne taxonomy to this one…
Overview | Zoom | Filter | Details-on-demand | Relate | History | Extract | |
---|---|---|---|---|---|---|---|
1-dimensional | |||||||
2-dimensional | |||||||
3-dimensional | 3-d rotation | ||||||
temporal | |||||||
multidimensional | Linked Brushing | ||||||
tree | |||||||
network |
Quote: “Visual Information Seeking Mantra: Overview first, zoom and filter, then details-on-demand”
Animated Transitions in Statistical Data Graphics
Quote: “The challenge of designing animations is to visually interpolate the syntactic features [visual elements] such that semantic changes [meaning of the data] are most effectively communicated.”
At Amazon, the Supply Chain Optimization Technology (SCOT) organization handles forecasting, buying, and placement of retail items.
A major need: visualize and understand time series data to develop models (i.e. iterate between “visualize” and “model” here in the figure below)
[image: “R for Data Science” by Grolemund and Wickham]
Static ggplot of product sales time series. Not a bad start…
### 1 ### draw a single plot containing as much info as possible (within reason)
draw_canonical_plot <- function(product) {
# spend a long time (hours or days) creating a plot for a single instance (product)
# prototype by trying on a few randomly sampled instances
}
### 2 ### compute a summary statistic of interest
mean_demand <- mean(product$demand)
### 3 ### take a stratified sample of size, say, n = 500:
stratified_sample <- ...
### 4 ### create the plots and collect into one .pdf file:
pdf(file = "plots_stratified_mean_demand.pdf", width = 10, height = 6)
for (i in 1:500) {
draw_canonical_plot(product[stratified_sample[i]])
}
dev.off()
### 5 ### spend hours poring over the plots & share (via email attachment)
Trelliscope offers a great speedup:
Sorting by a summary statistic is easy
Stratified sampling is possible (by creating dummies for quintiles and then sorting, for instance…)
Linked brushing allows for comparisons; query a single variable and see the result across panels.
[quick live demo in R]
R’s `manipulate’
R plotly (better)
a pure javascript solution: probably best, how hard to code/achieve?
Question: What types of interactive visualization:
Increase productivity substantially?
Increase productivity by a “quantum leap”?
Allow for discoveries that were truly not possible using static graphics?
Also a big thanks to Gabe Becker (Genentech Research) for organizing the session around a great topic.
Also, thanks to Carson Sievert for helpful discussions and a pointer to his Ph.D. thesis, which discusses recent developments in interactive graphics and provides historical references:
Kenny Shirley
Amazon Supply Chain Optimization Technology (SCOT)