Network constraints on learnability of probabilistic motor sequences

Ari E Kahn, Elisabeth A Karuza, Jean M Vettel, Danielle S Bassett, Ari E Kahn, Elisabeth A Karuza, Jean M Vettel, Danielle S Bassett

Abstract

Human learners are adept at grasping the complex relationships underlying incoming sequential input1. In the present work, we formalize complex relationships as graph structures2 derived from temporal associations3,4 in motor sequences. Next, we explore the extent to which learners are sensitive to key variations in the topological properties5 inherent to those graph structures. Participants performed a probabilistic motor sequence task in which the order of button presses was determined by the traversal of graphs with modular, lattice-like or random organization. Graph nodes each represented a unique button press, and edges represented a transition between button presses. The results indicate that learning, indexed here by participants' response times, was strongly mediated by the graph's mesoscale organization, with modular graphs being associated with shorter response times than random and lattice graphs. Moreover, variations in a node's number of connections (degree) and a node's role in mediating long-distance communication (betweenness centrality) impacted graph learning, even after accounting for the level of practice on that node. These results demonstrate that the graph architecture underlying temporal sequences of stimuli fundamentally constrains learning, and moreover that tools from network science provide a valuable framework for assessing how learners encode complex, temporally structured information.

Figures

Figure 1.. Experimental Setup.
Figure 1.. Experimental Setup.
(A) An example of the first few steps of a graph traversal defined by a walk on the graph. Top: Each node is uniquely associated with a key combination, and the sequence of key combinations is determined by a walk on the graph. Bottom: A series of trials are presented to the participant. The red squares indicate which keys to press on that trial. Colored arrows illustrate the edge from the graph at top being traversed. However, the participant only is shown the five squares. (B) The mapping between fingers and keys, and average reaction times for each key press. Top: A schematic of the mapping between visual stimuli (squares) and response effectors (fingers). Bottom: The average reaction time (RT) for each key or pair of keys across all data. The diagonal elements of the matrix represent trials in which a single key was pressed, and the off-diagonal elements of the matrix represent trials in which a pair of keys was pressed. (C) The three graph structures that we examine in this study. From left to right, we show a modular graph, a lattice graph, and a random graph with N = 15 nodes connected by E = 30 edges.
Figure 2.. Modular Graph Learning Effects.
Figure 2.. Modular Graph Learning Effects.
(A) Mean reaction times (RTs) as a function of trial number for stage 1 of Experiment 1, among participants exposed to the modular graph. The red line indicates the mean for cross-cluster trials, and the black line indicates the mean for all other trials, each binned in sets of 30 trials (n=30 subjects). (B) Mean reaction times on correct trials for the modular graph. An increase in reaction time across cluster boundaries can be seen, here visualized by yellower colors in the matrix elements that sit between the larger blocks. (C) Mean reaction times collapsed across the symmetric structure of the modular graph. All three clusters were structurally identical and starting position was randomized between subjects, so we combine reaction times across the three clusters into one ‘canonical’ cluster for visualization purposes only. The mean increase in reaction time between clusters is more apparent, here visualized by yellower colors on the edges that connect the top cluster with the two bottom clusters. (D) Relationship between surprisal effect on stage 1 (random walk) and surprisal effect on stage 2 (Hamiltonian walk) for each subject of Experiment 2. Subjects that displayed a strong surprisal effect in stage 1 likewise do so when the walk structure is changed (n=59).
Figure 3.. Learning Rate and Edge Surprisal.
Figure 3.. Learning Rate and Edge Surprisal.
Impact of new edges on reaction time. (A) Mean RT increases in stage 2 when new edges are added to the graph (trials 1501–2000). Included for comparison are Experiments 2 and 3, where – respectively – only the walk or a subset of edges were changed. In both cases the increase in RT is much smaller. (B) Per-subject learning rate correlated with the novel edge effect, defined as the mean difference in reaction time for a subject learning the second graph when responding to a novel edge versus a familiar edge (see Methods; n=109). Learning rate, the model coefficient for log(trial), was scaled amongst all subjects to the range [0,1]. The blue line is the least squares fit, with the gray envelope indicating the 95% confidence interval. (C) Individual correlations shown for the three types of graphs trained on in the first stage. Subjects exposed to the modular and lattice graphs show a significant relationship (p < 0.01, n = 30 and p < 0.03, n = 43, respectively), while those exposed to the random graph do not (p < 0.2, n = 36). Solid lines represent least squares fits, and gray envelopes represent the respective 95% confidence intervals. (D) Differences in reaction time by graph type, across graphs learned in sequence. Each bar shows the number of milliseconds by which the modeled effect for the top listed graph is faster. The increase in RT from lattice to modular, and from random to modular graphs, are both significant to p = 0.02 and p = 0.001, respectively (See Supplementary Table 5). Error bars indicate standard error as estimated in the mixed effects model. Asterisks indicate significance in the mixed effects model. Group sizes: Lattice-Random: n=70, Modular-Lattice: n=72, Modular-Random: n=71. (E) Examples of the graph types.
Figure 4.. Relation Between Small and Large…
Figure 4.. Relation Between Small and Large Scale Graph Statistics and Reaction Time.
(A) Illustration of node betweenness centrality. We show shortest paths from a number of nodes on the far left to a node on the far right, which all pass through the blue node. (B) Relationship between node degree and reaction time (RT), after regressing out the number of visits to a node, with each point representing a separate node, e.g., 15 points per subject. The regression line shows least squares fit, and the gray envelope is the 95% confidence interval. Reported correlation is based on Kendall’s τ (n=177 subjects). (C) Relationship between node betweenness centrality and reaction time using the same approach as used with node degree. (D) Mean reaction time shown as a function of degree, where the mean was z-scored across the 15 nodes for a given subject. Error bars represent bootstrapped 95% confidence intervals. (E) Mean reaction time as a function of node betweenness centrality using the same approach as used with node degree.

References

    1. Aslin RN & Newport EL Statistical Learning: From Acquiring Specific Items to Forming General Rules. Current Directions in Psychological Science 21, 170–176. ISSN: 0963–7214 (2012).
    1. Newman MEJ Networks: An Introduction (Oxford University Press, 2010).
    1. Schapiro AC, Rogers TT, Cordova NI, Turk-Browne NB & Botvinick MM Neural representations of events arise from temporal community structure. Nature Neuroscience 16, 486–492. ISSN: 1097–6256 (2013).
    1. Karuza EA, Kahn AE, Thompson-Schill SL & Bassett DS Process Reveals Structure: How a Network Is Traversed Mediates Expectations About Its Architecture. Scientific Reports 7, 12733 (2017).
    1. Newman MEJ Complex Systems: A Survey. Am. J. Phys 79, 800–810 (2011).
    1. Saffran JR, Aslin RN & Newport EL Statistical Learning by 8-Month-Old Infants. Science 274, 1926–1928. ISSN: 0036–8075 (1996).
    1. Nissen MJ & Bullemer P Attentional requirements of learning: Evidence from performance measures. Cognitive Psychology 19, 1–32. ISSN: 00100285 (1987).
    1. Hunt RH & Aslin RN Statistical learning in a serial reaction time task: Access to seperable statistical cues by individual learners. Journal of experimental psychology. General 130, 658–680. ISSN: 0096–3445 (2001).
    1. Fiser J & Aslin RN Statistical learning of higher-order temporal structure from visual shape sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition 28, 458–467. ISSN: 0278–7393 (2002).
    1. Turk-Browne NB, Jungé JA & Scholl BJ The Automaticity of Visual Statistical Learning. Journal of Experimental Psychology: General 134, 552–564. ISSN: 1939–2222 (2005).
    1. Cleeremans A & McClelland JL Learning the structure of event sequences. Journal of Experimental Psychology: General 120, 235–253. ISSN: 1939–2222 (1991).
    1. Furl N et al. Neural prediction of higher-order auditory sequence statistics. NeuroImage 54, 2267–2277. ISSN: 10538119 (2011).
    1. Newport EL & Aslin RN Learning at a distance I. Statistical learning of non-adjacent dependencies. Cognitive Psychology 48, 127–162. ISSN: 00100285 (2004).
    1. Gómez RL Variability and Detection of Invariant Structure. Psychological Science 13, 431–436. ISSN: 0956–7976 (2002).
    1. Bollobas B Random Graphs (Cambridge University Press, 2001).
    1. Jarvis JP & Shier DR in Applied mathematical modeling: a multidisciplinary approach (1999). doi:.
    1. Goldstein R & Vitevitch MS The influence of clustering coefficient on word-learning: how groups of similar sounding words facilitate acquisition. Frontiers in Psychology 5, 2009–2014. ISSN: 1664–1078 (2014).
    1. Bales ME & Johnson SB Graph theoretic modeling of large-scale semantic networks. J Biomed Inform 39, 451–464 (2006).
    1. Vitevitch MS What can graph theory tell us about word learning and lexical retrieval? J Speech Lang Hear Res 51,408–422 (2008).
    1. Palla G, Barabasi AL & Vicsek T Quantifying social group evolution. Nature 446, 664–667 (2007).
    1. Girvan M & Newman MEJ Community structure in social and biological networks. Proceedings of the National Academy of Sciences 99, 7821–7826. ISSN: 0027–8424 (2002).
    1. Karuza EA, Thompson-Schill SL & Bassett DS Local Patterns to Global Architectures: Influences of Network Topology on Human Learning. Trends Cogn Sci 20, 629–640 (2016).
    1. Steyvers M & Tenenbaum JB The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cognitive Science 29, 41–78. ISSN: 03640213 (2005).
    1. Mengistu H, Huizinga J, Mouret JB & Clune J The Evolutionary Origins of Hierarchy. PLoS Comput Biol 12,e1004829 (2016).
    1. Hermundstad AM et al. Variance predicts salience in central sensory processing. eLife 3, 1–28. ISSN: 2050084X (2014).
    1. Heathcote A, Brown S & Mewhort DJ The power law repealed: the case for an exponential law of practice. Psychonomic bulletin & review 7, 185–207 (June 2000).
    1. Karuza EA, Farmer TA, Fine AB, Smith FX & Jaeger TF On-line measures of prediction in a self-paced statistical learning task. Proceedings of the 36th Annual Meeting of the Cognitive Science Society (2014).
    1. Cleeremans A, Destrebecqz A & Boyer M Implicit Learning: News From the Front. Trends in Cognitive Sciences 2, 406–416 (1998).
    1. Robertson EM The Serial Reaction Time Task: Implicit Motor Skill Learning? Journal of Neuroscience 27, 10073–10075 (2007).
    1. Reber AS Implicit Learning of Artificial Grammars. Journal of Verbal Learning and Verbal Behavior 6, 855–863 (1967).
    1. Verwey WB, Abrahamse EL & de Kleine E Cognitive Processing in New and Practiced Discrete Keying Sequences. Frontiers in Psychology 1, 1–13 (2010).
    1. Kiesel A et al. Control and Interference in Task Switching-A Review. Psychological Bulletin 136, 849–874 (2010).
    1. Koch I Automatic and Intentional Activation of Task Sets. Journal of Experimental Psychology: Learning, Memory, and Cognition 27, 1474–1486 (2001).
    1. Gotler A, Meiran N & Tzelgov J Nonintentional Task Set Activation: Evidence From Implicit Task Sequence Learning. Psychonomic Bulletin & Review 10, 890–896 (2003).
    1. Schneider DW & Logan GD Hierarchical control of cognitive processes: Switching tasks in sequences. Journal of Experimental Psychology 135, 623–640 (4 2006).
    1. Gleiser P & Danon L Community structure in jazz. Adv. Compl. Syst 6, 565–573 (2003).
    1. Tompson SH, Kahn AE, Falk EB, Vettel JM & Bassett DS Individual differences in learning social and nonsocial network structures. Journal of Experimental Psychology: Learning, Memory, and Cognition. ISSN: 1939–1285 (2018).
    1. Gebhart AL, Aslin RN & Newport EL Changing structures in midstream: Learning along the statistical garden path. Cognitive Science 33, 1087–1116. ISSN: 03640213 (2009).
    1. Deroost N & Soetens E Perceptual or motor learning in SRT tasks with complex sequence structures. Psychological Research 70, 88–102 (2006).
    1. Messinger A, Squire LR, Zola SM & Albright TD Neuronal representations of stimulus associations develop in the temporal lobe during learning. Proceedings of the National Academy of Sciences 98, 12239–12244. ISSN: 0027–8424 (2001).
    1. Li N & DiCarlo JJ Unsupervised Natural Experience Rapidly Alters Invariant Object Representation in Visual Cortex. Science 321, 1502–1507. ISSN: 0036–8075 (2008).
    1. Garvert MM, Dolan RJ & Behrens TE A map of abstract relational knowledge in the human hippocampal–entorhinal cortex. eLife 6, 1–20. ISSN: 2050–084X (2017).
    1. Newman ME Modularity and community structure in networks. Proc Natl Acad Sci U S A 103, 8577–8582 (2006).

Source: PubMed

3
Subscribe