This is a well-thought-out study that will contribute to the literature. It is closely based on a previous study, so the general idea is not novel, but it will provide a better-controlled test of the previously reported effect.
The authors do a good analysis of previous studies, and present a compelling case for revisiting the findings of Chang et al (2016). They discuss theoretical and empirical reasons to doubt that masked information could allow future disambiguation of two-tone images, and identify limitations of the methods of Chang et al (2016). The background and motivation for the study are clearly presented, with good arguments.
The authors have also given careful thought to the methodology. The primary challenge is ensuring that "unconscious" stimuli are truly unconscious, and I think the authors do a good job of meeting it. Trials will be classified based on multiple measures in a graded manner. The criterion for "fully unconscious" is more conservative than in the previous study, so we can be more confident that any post-exposure effects will not be due to some conscious awareness. I think this is the main strength of the new study. The authors have also given good consideration to details like attention checks, exclusion criteria, and statistical power. The fact that it is a pre-registered RR is a positive feature in itself.
I question whether Experiment 2 is needed. Experiment 1 already implements blind ratings and specifies a coding plan, and any errors or biases in rating responses would just add noise or shift the baselines. As the authors point out, adopting the forced-choice method also has drawbacks, which might end up increasing the variability. More data are always welcome, so if the authors want to repeat the study with this variation in method, that is fine, but it seems like a lot of extra data collection to address an issue that is unlikely to affect the results and might introduce new problems.
If Experiment 2 is going to be included, the authors should say more about what they would conclude if the results from the two experiments are not entirely consistent. What if Experiment 1 finds strong evidence for unconscious priming but Experiment 2 finds only a weak trend? Would they conclude that there was experimenter bias in Experiment 1, and that the effect may not be reliable? Or conclude that the data in Experiment 2 was noisier due to methodological issues, so it should be discounted?
I think the third experiment makes more sense as a follow-up, because it addresses an alternative explanation that is both more likely and more problematic, and that might be ruled out by the Experiment 1 results. If the main trials and catch trials do not show a difference, the follow-up experiment will be important; if there is a clear difference between main trials and catch trials, it is not needed. This is more important than the issue of subjective ratings. In fact, I suggest reversing the order: if the evidence suggests that the effects in Experiment 1 are due to spontaneous disambiguation, it would be better to know this before conducting the proposed Experiment 2, which would otherwise share the same confound.
For the third experiment (the results-contingent follow-up), the authors should say something about the conclusions that would be drawn from the different possible outcomes. What if Experiment 1 appears to show disambiguation from unconscious stimuli, but the follow-up study does not?
To evaluate the planned analysis and presentation of the results, I would like to see some sort of draft of the results section. The authors could use simulated data or placeholders for statistical results. The authors describe the planned analyses in the study design table, but there are a lot of hypotheses and analyses, and it is a bit hard to follow. Presenting the planned analyses in the format of a results section will make it easier to check that the analyses make sense and nothing is missing, and also provides an opportunity for reviewers to give feedback about the presentation.
I am not a fan of the "study design table" required by PCI-RR. Answering all the questions in a single row for each hypothesis requires a table that spans multiple pages, with narrow text blocks. The sampling plan is generally the same for all hypotheses, so that column has redundant information. The space limitation encourages enumeration of hypotheses, so a reader has to keep track of many non-descriptive labels (H1a, H1b, …). Given the limitations of the format, I think the authors did a reasonable job conveying the information. I hope that PCI-RR changes this requirement, or allows some flexibility in how the information is organized. In the meantime, it would be helpful to see the analysis plan presented as a results section.
Using the Bayesian sequential sampling procedure is a good idea, and the proposed stopping criteria should provide good power for a range of possible effects. I have some suggestions.
For the computation of Bayes factors, the authors propose using a Cauchy prior with scale parameter r = 1/sqrt(2). Schönbrodt & Wagenmakers (2018), following Rouder et al (2009), recommend a scale parameter of r = 1. They note that smaller scale parameters take longer to reach the H0 criterion when the null is true. Their simulations of a BF > 6 stopping criterion also found that the Type I error rate is slightly inflated with r = 1/sqrt(2), but not with r = 1. I suggest that the authors follow Schönbrodt & Wagenmakers (2018) and use r = 1.
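To illustrate the point, here is a minimal sketch (my own, not taken from the manuscript) of how the choice of Cauchy scale affects the Bayes factor for the same data. It assumes the Python pingouin package; the t-value and sample size are arbitrary placeholders, not the study's data.

```python
# Sketch: effect of the Cauchy scale r on the JZS t-test Bayes factor.
# Illustrative numbers only.
import numpy as np
import pingouin as pg

n = 60          # hypothetical sample size
t_null = 0.3    # a t-value close to zero, as expected under H0

for r in (1 / np.sqrt(2), 1.0):
    bf10 = float(pg.bayesfactor_ttest(t_null, nx=n, paired=True, r=r))
    print(f"r = {r:.3f}: BF10 = {bf10:.3f}, BF01 = {1 / bf10:.2f}")

# With a near-zero t, the wider prior (r = 1) gives a smaller BF10, i.e.
# stronger evidence for H0, so the BF01 > 6 boundary is reached with fewer
# participants -- the point made by Schönbrodt & Wagenmakers (2018).
```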
I also think the authors should justify the choice of boundary criterion in terms of the expected effect size, and describe the power for one or more plausible effect sizes. The methods section includes a statement about the boundary criterion: "A BF of 6 (or 1/6), taken to indicate moderate evidence (Lee & Wagenmakers, 2014, as cited in Quintana & Williams, 2018), was chosen as an estimated equivalent for a medium effect size." That helps connect the BF criterion to effect size, but it does not say why a medium effect size is targeted. Later, the authors report estimated effect sizes from the previous study, but these are not connected to the choice of stopping criterion.
The boundary criterion and the prior determine the range of effect sizes that can be reliably detected, so a given BF criterion implies a target effect size. For example, the simulations of Schönbrodt & Wagenmakers (2018) found that a criterion of BF > 6 with r = 1 gives 86% power for d = 0.4 in a between-subjects design, so this criterion corresponds to targeting an effect size of d ≥ 0.4. In the present study, using BF > 6 will allow detection of smaller effects because the design is within-subjects. Reporting the minimum effect size that could be reliably detected would make it easy for readers to see that the study is well powered (even if they are not familiar with BFs); a simulation along the lines sketched below would provide this.
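A rough Monte Carlo sketch of the kind of simulation I have in mind is below. This is my own illustration, not the authors' planned analysis, and the design parameters (paired design with normally distributed differences, minimum N of 30, checking the BF every 10 participants, pingouin and scipy for the computations) are assumptions for the example.

```python
# Sketch: power of a sequential design with symmetric BF boundaries (6, 1/6)
# for a within-subjects (paired) effect of size d. Illustrative parameters.
import numpy as np
from scipy import stats
import pingouin as pg

rng = np.random.default_rng(1)

def sequential_bf_run(d, r, n_min=30, n_max=200, step=10, bound=6):
    """Simulate one sequential run; return 'H1', 'H0', or 'inconclusive'."""
    diffs = rng.normal(d, 1, size=n_max)      # paired differences, sd = 1
    for n in range(n_min, n_max + 1, step):
        t = stats.ttest_1samp(diffs[:n], 0).statistic
        bf10 = float(pg.bayesfactor_ttest(t, nx=n, paired=True, r=r))
        if bf10 >= bound:
            return "H1"
        if bf10 <= 1 / bound:
            return "H0"
    return "inconclusive"

runs = [sequential_bf_run(d=0.4, r=1.0) for _ in range(1000)]
print("P(stop at H1 | d = 0.4):", runs.count("H1") / len(runs))
```

Running this for a few values of d would give the smallest effect that the BF > 6 boundary detects with, say, 90% probability, which is the number I think readers would find most useful.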
The lower bound on sample size, N=60, seems higher than necessary. Sequential procedures are more efficient because they can stop early when the data provide clear evidence one way or the other, and this efficiency is lost if the lower bound is higher than needed. An effect size of d = 0.5 only needs N=44 for 90% power. In the case of no effect, N=30 would be enough for reasonably tight confidence intervals around zero in the not-recognized condition (SE = 5.3%/sqrt(30) ≈ 0.97%). I suggest that the authors use a smaller lower bound, N=30-40, so they can take advantage of the efficiency of sequential testing. The sample size will still go past N=60 if the data are ambiguous, but not if the true effect turns out to be large or zero. If the authors want to ensure power for smaller effects, the BF stopping criterion could be slightly increased, which would be more efficient than using a large minimum sample size.
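For what it is worth, the two numbers above can be checked quickly (again a sketch of my own; the 5.3% figure is my reading of the variability reported for the previous study, and statsmodels is assumed):

```python
# Quick check of the SE at N = 30 and the classical power calculation.
import numpy as np
from statsmodels.stats.power import TTestPower

# Standard error of the not-recognized-condition mean at N = 30
print(5.3 / np.sqrt(30))                                          # ~0.97%

# Minimum N for 90% power at d = 0.5 (paired/one-sample t-test, alpha = .05)
print(TTestPower().solve_power(effect_size=0.5, alpha=0.05, power=0.9))  # ~44
```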
I am not sure that the abbreviated labels "C1", "C2", etc. are needed. Descriptive labels ("Fully Unconscious", "Mostly Unconscious", etc.) could be used without adding too much clutter to the text. Alternatively, "U" and "C" could be used in the abbreviations to make it easy to remember which categories are unconscious vs conscious: "U", "MU", "MC", "C", or "U1", "U2", "C2", "C1".
This topic sentence in the introduction is awkward: "Another relevant literature is the one referring to longer-term learning effects, and pertains to the increase in accuracy following repeated exposure to some stimuli over time." I suggest rewording it more simply, perhaps breaking the second part into a separate sentence.
Another line that could be simplified: "In a conceptually similar context to that adopted by Chang and colleagues (2016), we aim to study whether the visual system can organise two-tone images into meaningful percepts after masked greyscale image exposure." Maybe something like: "Using a method similar to that of Chang and colleagues (2016), we test whether the visual system can organise two-tone images into meaningful percepts after masked greyscale image exposure."
The use of catch trials is listed as a difference in method, but Chang et al (2016) also had catch trials. Are the catch trials in the proposed study different in some way?