Bridging Big Data: Procedures for Combining Non-equivalent Cognitive Measures from the ENIGMA Consortium
journal contribution
posted on 2023-04-04, 02:02authored byEamonn Kennedy, Shashank Vadlamani, Hannah M Lindsey, Pui-Wa Lei, Mary Jo-Pugh, Maheen Adamson, Martin Alda, Silvia Alonso-Lana, Sonia Ambrogi, Tim J Anderson, Celso Arango, Robert F Asarnow, Mihai Avram, Rosa Ayesa-Arriola, Talin Babikian, Nerisa Banaj, Laura J Bird, Stefan Borgwardt, Amy Brodtmann, Katharina Brosch, Karen CaeyenberghsKaren Caeyenberghs, Vince D Calhoun, Nancy D Chiaravalloti, David X Cifu, Benedicto Crespo-Facorro, John C Dalrymple-Alford, Kristen Dams-O’Connor, Udo Dannlowski, David Darby, Nicholas Davenport, John DeLuca, Covadonga M Diaz-Caneja, Seth G Disner, Ekaterina Dobryakova, Stefan Ehrlich, Carrie Esopenko, Fabio Ferrarelli, Lea E Frank, Carol Franz, Paola Fuentes-Claramonte, Helen Genova, Christopher C Giza, Janik Goltermann, Dominik Grotegerd, Marius Gruber, Alfonso Gutierrez-Zotes, Minji Ha, Jan Haavik, Charles Hinkin, Kristen R Hoskinson, Daniela Hubl, Andrei Irimia, Andreas Jansen, Michael Kaess, Xiaojian Kang, Kimbra Kenney, Barbora Keřková, Mohamed Salah Khlif, Minah Kim, Jochen Kindler, Tilo Kircher, Karolina Knížková, Knut K Kolskår, Denise Krch, William S Kremen, Taylor Kuhn, Veena Kumari, Jun Soo Kwon, Roberto Langella, Sarah Laskowitz, Jungha Lee, Jean Lengenfelder, Spencer W Liebel, Victoria Liou-Johnson, Sara M Lippa, Marianne Løvstad, Astri Lundervold, Cassandra Marotta, Craig A Marquardt, Paulo Mattos, Ahmad Mayeli, Carrie R McDonald, Susanne Meinert, Tracy R Melzer, Jessica Merchán-Naranjo, Chantal Michel, Rajendra A Morey, Benson Mwangi, Daniel J Myall, Igor Nenadić, Mary R Newsome, Abraham Nunes, Terence O’Brien, Viola Oertel, John Ollinger, Alexander Olsen, Victor Ortiz García de la Foz, Mustafa Ozmen, Heath Pardoe, Marise Parent, Fabrizio Piras, Federica Piras, Edith Pomarol-Clotet, Jonathan Repple, Geneviève Richard, Jonathan Rodriguez, Mabel Rodriguez, Kelly Rootes-Murdy, Jared Rowland, Nicholas RyanNicholas Ryan, Raymond Salvador, Anne-Marthe Sanders, Andre Schmidt, Jair C Soares, Gianfranco Spalleta, Filip Španiel, Alena Stasenko, Frederike Stein, Benjamin Straube, April Thames, Florian Thomas-Odenthal, Sophia I Thomopoulos, Erin Tone, Ivan Torres, Maya Troyanskaya, Jessica A Turner, Kristine M Ulrichsen, Guillermo Umpierrez, Elisabet Vilella, Lucy Vivash, William C Walker, Emilio Werden, Lars T Westlye, Krista Wild, Adrian Wroblewski, Mon-Ju Wu, Glenn R Wylie, Lakshmi N Yatham, Giovana B Zunta-Soares, Paul M Thompson, David F Tate, Frank G Hillary, Emily L Dennis, Elisabeth A Wilde
AbstractInvestigators in the cognitive neurosciences have turned to Big Data to address persistent replication and reliability issues by increasing sample sizes, statistical power, and representativeness of data. While there is tremendous potential to advance science through open data sharing, these efforts unveil a host of new questions about how to integrate data arising from distinct sources and instruments. We focus on the most frequently assessed area of cognition - memory testing - and demonstrate a process for reliable data harmonization across three common measures. We aggregated raw data from 53 studies from around the world which measured at least one of three distinct verbal learning tasks, totaling N = 10,505 healthy and brain-injured individuals. A mega analysis was conducted using empirical bayes harmonization to isolate and remove site effects, followed by linear models which adjusted for common covariates. After corrections, a continuous item response theory (IRT) model estimated each individual subject’s latent verbal learning ability while accounting for item difficulties. Harmonization significantly reduced inter-site variance by 37% while preserving covariate effects. The effects of age, sex, and education on scores were found to be highly consistent across memory tests. IRT methods for equating scores across AVLTs agreed with held-out data of dually-administered tests, and these tools are made available for free online. This work demonstrates that large-scale data sharing and harmonization initiatives can offer opportunities to address reproducibility and integration challenges across the behavioral sciences.Significance StatementData sharing can increase the quality and rigor of scientific claims, but with large scale data sharing comes an increased need to combine non-equivalent measurements across studies. Auditory verbal learning tasks (AVLTs) are one of the most common research and clinical tools to evaluate memory constructs, but numerous distinct AVLTs are in common use, which creates challenges for data aggregation. We report methods to convert raw scores across common verbal learning instruments, constructed using harmonizing data from 53 studies from around the world. This approach can be replicated in other domains to address long standing data compatibility issues for researchers and clinicians.