A cluster-based approach to selecting representative stimuli from the International Affective Picture System (IAPS) database

Alexandra C Constantinescu, Maria Wolters, Adam Moore, Sarah E MacPherson, Alexandra C Constantinescu, Maria Wolters, Adam Moore, Sarah E MacPherson

Abstract

The International Affective Picture System (IAPS; Lang, Bradley, & Cuthbert, 2008) is a stimulus database that is frequently used to investigate various aspects of emotional processing. Despite its extensive use, selecting IAPS stimuli for a research project is not usually done according to an established strategy, but rather is tailored to individual studies. Here we propose a standard, replicable method for stimulus selection based on cluster analysis, which re-creates the group structure that is most likely to have produced the valence arousal, and dominance norms associated with the IAPS images. Our method includes screening the database for outliers, identifying a suitable clustering solution, and then extracting the desired number of stimuli on the basis of their level of certainty of belonging to the cluster they were assigned to. Our method preserves statistical power in studies by maximizing the likelihood that the stimuli belong to the cluster structure fitted to them, and by filtering stimuli according to their certainty of cluster membership. In addition, although our cluster-based method is illustrated using the IAPS, it can be extended to other stimulus databases.

Keywords: Cluster analysis; Emotion; IAPS; International affective picture system; Stimulus selection.

Figures

Fig. 1
Fig. 1
Correlations between the pleasure/valence arousal, and dominance dimensions, with deviations from linearity that give rise to the specific shapes of the relationships
Fig. 2
Fig. 2
Various clustering indices indicate different “optimal” values for k. These graphs may change slightly with every run of the clustering algorithm, due to the random seeds that k-means uses. As such, 100,000 repetitions were run on the k-means clustering algorithm each time, with a range for k from 2 to 8, and with the values of the Caliński, Ball, Hartigan, and SSI criteria computed each time (with the Ball criterion having to be minimized, unlike the other three criteria, which must be maximized). The average values for these criteria were then computed across all of the repetitions and indicated (left to right, and top to bottom) that three, eight, eight, and three clusters should be extracted, respectively
Fig. 3
Fig. 3
Data structure of the IAPS images. It is worth noting that large portions of the 3-D space remain unpopulated, signaling either that the IAPS does not cover those combinations between valence arousal, and dominance, or that photographic material in general would have difficulty with this
Fig. 4
Fig. 4
The amount of dissimilarity (as computed using the R package clue: Hornik, 2005) between cases is accounted for by ever-increasing values for k
Fig. 5
Fig. 5
Cluster boxplots, for each dimension. The boxplots indicate, for each cluster (coded by colors), the spread of cases assigned to it, in terms of valence arousal, and dominance. The boxplot widths are proportional to the cluster sample sizes
Fig. 6
Fig. 6
Bivariate scatterplots showing the default classifications of cases and the uncertainties provided by Mclust() in R. The uncertainties are coded using one of three symbols: ringed black dots for candidates with a high certainty of cluster membership; orange (light gray) asterisks for less clear cluster memberships; and red (dark gray) squares for cases to avoid using as stimuli, with very unclear memberships. Point size is an additional indicator for the level of classification uncertainty, with larger points indicating higher uncertainty
Fig. 7
Fig. 7
Selection of the 20 most likely IAPS cases per cluster, for the k = 5 clustering solution. The color coding was chosen to be consistent with Fig. 8 below
Fig. 8
Fig. 8
Selection of the five most likely IAPS cases per cluster, for the k = 5 clustering solution, along with IAPS image codes. The color coding was chosen to be consistent with Fig. 7 above

References

    1. Aguilar de Arcos F, Verdejo-García A, Peralta-Ramírez MI, Sánchez-Barrera M, Pérez-García M. Experience of emotions in substance abusers exposed to images containing neutral, positive, and negative affective stimuli. Drug and Alcohol Dependence. 2005;78:159–167. doi: 10.1016/j.drugalcdep.2004.10.010.
    1. Anderson E, Siegel EH, Barrett LF. What you feel influences what you see: The role of affective feelings in resolving binocular rivalry. Journal of Experimental Social Psychology. 2011;47:856–860. doi: 10.1016/j.jesp.2011.02.009.
    1. Ball GH, Hall DJ. ISODATA, a novel method of data analysis and pattern classification (Technical report) Menlo Park, CA: Stanford Research Institute; 1965.
    1. Bernat E, Patrick CJ, Benning SD, Tellegen A. Effects of picture content and intensity on affective physiological response. Psychophysiology. 2006;43:93–103. doi: 10.1111/j.1469-8986.2006.00380.x.
    1. Borcard D, Gillet F, Legendre P. Numerical ecology with R. New York, NY: Springer; 2011.
    1. Bradley MM, Lang PJ. Measuring emotion: The self-assessment manikin and the semantic differential. Journal of Behavior Therapy and Experimental Psychiatry. 1994;25:49–59. doi: 10.1016/0005-7916(94)90063-9.
    1. Bradley MM, Lang PJ. The International Affective Picture System (IAPS) in the study of emotion and attention. In: Coan JA, Allen JJ, editors. Handbook of emotion elicitation and assessment, Series in affective science. New York, NY: Oxford University Press; 2007. pp. 29–46.
    1. Caliński T, Harabasz J. A dendrite method for cluster analysis. Communications in Statistics—Theory and Methods. 1974;3:1–27. doi: 10.1080/03610927408827101.
    1. Deen B, Pitskel NB, Pelphrey KA. Three systems of insular functional connectivity identified with cluster analysis. Cerebral Cortex. 2011;21:1498–1506. doi: 10.1093/cercor/bhq186.
    1. Delplanque S, N’diaye K, Scherer K, Grandjean D. Spatial frequencies or emotional effects? A systematic measure of spatial frequencies for IAPS pictures by a discrete wavelet analysis. Journal of Neuroscience Methods. 2007;165:144–150. doi: 10.1016/j.jneumeth.2007.05.030.
    1. Dimitriadou E, Dolničar S, Weingessel A. An examination of indexes for determining the number of clusters in binary data sets. Psychometrika. 2002;67:137–159. doi: 10.1007/BF02294713.
    1. Do CB, Batzoglou S. What is the expectation maximization algorithm? Nature Biotechnology. 2008;26:897–899. doi: 10.1038/nbt1406.
    1. Dolnicar S, Grabler K, Mazanec JA. A tale of three cities: Perceptual charting for analyzing destination images. In: Woodside AG, Crouch GI, Mazanec JA, Oppermann M, Sakai MY, editors. Consumer psychology of tourism, hospitality and leisure. Wallingford, UK: CAB International; 1999. pp. 39–62.
    1. Eizenman M, Yu LH, Grupp L, Eizenman E, Ellenbogen M, Gemar M, Levitan RD. A naturalistic visual scanning approach to assess selective attention in major depressive disorder. Psychiatry Research. 2003;118:117–128. doi: 10.1016/S0165-1781(03)00068-4.
    1. Fraley C, Raftery AE. Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association. 2002;97:611–631. doi: 10.1198/016214502760047131.
    1. Fraley, C., & Raftery, A. E. (2006). MCLUST version 3: An R package for normal mixture modeling and model-based clustering (Report No. 504). Seattle, WA: University of Washington, Dept. of Statistics.
    1. Glenn CR, Blumenthal TD, Klonsky ED, Hajcak G. Emotional reactivity in nonsuicidal self-injury: Divergence between self-report and startle measures. International Journal of Psychophysiology. 2011;80:166–170. doi: 10.1016/j.ijpsycho.2011.02.016.
    1. Grühn D, Scheibe S. Age-related differences in valence and arousal ratings of pictures from the International Affective Picture System (IAPS): Do ratings become more extreme with age? Behavior Research Methods. 2008;40:512–521. doi: 10.3758/BRM.40.2.512.
    1. Hallahan M, Rosenthal R. Statistical power: Concepts, procedures, and applications. Behaviour Research and Therapy. 1996;34:489–499. doi: 10.1016/0005-7967(95)00082-8.
    1. Hamann SB, Ely TD, Hoffman JM, Kilts CD. Ecstasy and agony: Activation of the human amygdala in positive and negative emotion. Psychological Science. 2002;13:135–141. doi: 10.1111/1467-9280.00425.
    1. Hartigan JA. Clustering algorithms. New York, NY: Wiley; 1975.
    1. Hartigan JA, Wong MA. Algorithm AS 136: A k-means clustering algorithm. Journal of Applied Statistics. 1979;28:100–108. doi: 10.2307/2346830.
    1. Hautamäki, V., Cherednichenko, S., Kärkkäinen, I., Kinnunen, T., & Fränti, P. (2005). Improving k-means by outlier removal. In Proceedings of the 14th Scandinavian Conference on Image Analysis (pp. 978–987). Berlin, Germany: Springer-Verlag.
    1. Hlavac, M. (2013). stargazer: LaTeX code and ASCII text for well-formatted regression and summary statistics tables (version 4.5.3) [R package and documentation]. Retrieved on June 20, 2015, from = stargazer.
    1. Hornik K. A CLUE for CLUster ensembles. Journal of Statistical Software. 2005;14:1–25. doi: 10.18637/jss.v014.i12.
    1. Hubert L, Arabie P. Comparing partitions. Journal of Classification. 1985;2:193–218. doi: 10.1007/BF01908075.
    1. Ito TA, Cacioppo JT, Lang PJ. Eliciting affect using the International Affective Picture System: Trajectories through evaluative space. Personality and Social Psychology Bulletin. 1998;24:855–879. doi: 10.1177/0146167298248006.
    1. Jain AK. Data clustering: 50 years beyond K-means. Pattern Recognition Letters. 2010;31:651–666. doi: 10.1016/j.patrec.2009.09.011.
    1. Jain, A. K., & Dubes, R. C. (1988). Algorithms for clustering data (Vol. 6). Englewood Cliffs: Prentice Hall.
    1. Kantardzic M. Data mining: Concepts, models, methods, and algorithms. 2. Hoboken, NJ: Wiley; 2011.
    1. Kaufman L, Rousseeuw PJ. Finding groups in data: An introduction to cluster analysis. Hoboken, NJ: Wiley; 2005.
    1. Koenigsberg HW, Fan J, Ochsner KN, Liu X, Guise K, Pizzarello S, New A. Neural correlates of using distancing to regulate emotional responses to social situations. Neuropsychologia. 2010;48:1813–1822. doi: 10.1016/j.neuropsychologia.2010.03.002.
    1. Lane RD, Reiman EM, Bradley MM, Lang PJ, Ahern GL, Davidson RJ, Schwartz GE. Neuroanatomical correlates of pleasant and unpleasant emotion. Neuropsychologia. 1997;35:1437–1444. doi: 10.1016/S0028-3932(97)00070-5.
    1. Lang PJ. Technology in mental health care delivery systems. In: Sidowski JB, Johnson JH, Williams TA, editors. Behavioral treatment and bio-behavioral assessment: Computer applications. Norwood, NJ: Ablex; 1980. pp. 119–l37.
    1. Lang, P. J., Bradley, M. M., & Cuthbert, B. N. (2008). International affective picture system (IAPS): Affective ratings of pictures and instruction manual (Report No. A-8). Gainesville, FL: University of Florida, NIMH Center for the Study of Emotion and Attention.
    1. LeDoux JE. The emotional brain. New York: Simon & Shuster; 1996.
    1. Leys C, Ley C, Klein O, Bernard P, Licata L. Detecting outliers: Do not use standard deviation around the mean, use absolute deviation around the median. Journal of Experimental Social Psychology. 2013;49:764–766. doi: 10.1016/j.jesp.2013.03.013.
    1. Lithari C, Frantzidis C, Papadelis C, Vivas AB, Klados M, Kourtidou-Papadeli C, Bamidis P. Are females more responsive to emotional stimuli? A neurophysiological study across arousal and valence dimensions. Brain Topography. 2010;23:27–40. doi: 10.1007/s10548-009-0130-5.
    1. MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In L. Lecam & J. Neyman (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1, pp. 281–297). Berkeley, CA: The Regents of the University of California.
    1. Mehrabian A. Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament. Current Psychology. 1996;14:261–292. doi: 10.1007/BF02686918.
    1. Meilă M. Comparing clusterings—An information based distance. Journal of Multivariate Analysis. 2007;98:873–895. doi: 10.1016/j.jmva.2006.11.013.
    1. Mikels JA, Fredrickson BL, Larkin GR, Lindberg CM, Maglio SJ, Reuter-Lorenz PA. Emotional category data on images from the International Affective Picture System. Behavior Research Methods. 2005;37:626–630. doi: 10.3758/BF03192732.
    1. Mikels JA, Larkin GR, Reuter-Lorenz PA, Carstensen LL. Divergent trajectories in the aging mind: Changes in working memory for affective versus visual information with age. Psychology and Aging. 2005;20:542–553. doi: 10.1037/0882-7974.20.4.542.
    1. Most SB, Chun MM, Widders DM, Zald DH. Attentional rubbernecking: Cognitive control and personality in emotion-induced blindness. Psychonomic Bulletin & Review. 2005;12:654–661. doi: 10.3758/BF03196754.
    1. Mun EY, von Eye A, Bates ME, Vaschillo EG. Finding groups using model-based cluster analysis: Heterogeneous emotional self-regulatory processes and heavy alcohol use risk. Developmental Psychology. 2008;44:481–495. doi: 10.1037/0012-1649.44.2.481.
    1. Perri RL, Berchicci M, Lucci G, Cimmino RL, Bello A, Di Russo F. Getting ready for an emotion: Specific premotor brain activities for self-administered emotional pictures. Frontiers in Behavioral Neuroscience. 2014;8:197.
    1. R Development Core Team. (2015). R: A language and environment for statistical computing (Version 3.2.1) [Computer software]. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from:
    1. Russell JA, Mehrabian A. Evidence for a three-factor theory of emotions. Journal of Research in Personality. 1977;11:273–294. doi: 10.1016/0092-6566(77)90037-X.
    1. Stins JF, Beek PJ. Effects of affective picture viewing on postural control. BMC Neuroscience. 2007;8:83. doi: 10.1186/1471-2202-8-83.
    1. Tomaszczyk JC, Fernandes MA, MacLeod CM. Personal relevance modulates the positivity bias in recall of emotional pictures in older adults. Psychonomic Bulletin & Review. 2008;15:191–196. doi: 10.3758/PBR.15.1.191.
    1. Watson D, Clark LA, Tellegen A. Development and validation of brief measures of positive and negative affect: The PANAS scales. Journal of Personality and Social Psychology. 1988;54:1063–1070. doi: 10.1037/0022-3514.54.6.1063.
    1. Wu J. Advances in k-means clustering: A data mining thinking. Berlin: Springer Verlag; 2012.
    1. Xing C, Isaacowitz DM. Aiming at happiness: How motivation affects attention to and memory for emotional images. Motivation and Emotion. 2006;30:243–250. doi: 10.1007/s11031-006-9032-y.
    1. Xu, R., & Wunsch, D. (2009). Clustering (Vol. 10). New Jersey: John Wiley & Sons.
    1. Zhang X, Hui WY, Barrett LF. How does this make you feel? A comparison of four affect induction procedures. Frontiers in Psychology. 2014;5:689.
    1. Zhao X, Sandelin A. GMD: Measuring the distance between histograms with applications on high-throughput sequencing reads. Bioinformatics. 2012;28:1164–1165. doi: 10.1093/bioinformatics/bts087.

Source: PubMed

3
Prenumerera