
Does familiarity really breed contempt?

Yuki Yamada, based on reviews by Philipp Schoenegger and Zoltan Kekecs
A recommendation of:

Does learning more about others impact liking them?: Replication and extension Registered Report of Norton et al. (2007)’s Lure of Ambiguity


Submission: posted 11 July 2023
Recommendation: posted 23 May 2024, validated 30 May 2024
Cite this recommendation as:
Yamada, Y. (2024) Does familiarity really breed contempt? Peer Community in Registered Reports. https://rr.peercommunityin.org/PCIRegisteredReports/articles/rec?id=496

Recommendation

In interpersonal evaluation, the amount of information available about the other person matters a great deal. Norton et al. (2007) conducted systematic experiments suggesting a 'less is more' effect: a lack of information leads to a more positive evaluation. However, subsequent studies have not always reached the same conclusion.
 
In the current study, Horsham et al. (2024) address this issue by conducting direct and conceptual replications of the Norton et al. (2007) experiments, together with an extension examining the effects of curiosity. The authors seek to establish reliably whether ambiguity is related to liking, and to clarify the factors that mediate this relationship. The results should substantially advance our understanding of how information management shapes interpersonal relationships.
 
The Stage 1 manuscript was peer-reviewed by two experts; after four rounds of review and based on their revisions and detailed responses to the reviewers' comments, the recommender judged that the manuscript met the Stage 1 criteria and awarded it in-principle acceptance (IPA).
 
URL to the preregistered Stage 1 protocol: https://osf.io/7mc4y
 
Level of bias control achieved: Level 6. No part of the data or evidence that will be used to answer the research question yet exists and no part will be generated until after IPA.
 
 
 
References
 
1. Norton, M. I., Frost, J. H., & Ariely, D. (2007). Less is more: The lure of ambiguity, or why familiarity breeds contempt. Journal of Personality and Social Psychology, 92, 97-105. https://doi.org/10.1037/0022-3514.92.1.97
 
2. Horsham, Z., Haydock-Symonds, A., Imada, H., Tai, H. C., Lau, W. L., Shum, T. L., Zeng, Y., Chow, H. T., & Feldman, G. (2024). Does learning more about others impact liking them? Replication and extension Registered Report of Norton et al. (2007)'s Lure of Ambiguity. In-principle acceptance of Version 4 by Peer Community in Registered Reports. https://osf.io/7mc4y
Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.

Reviews

Reviewed by Zoltan Kekecs, 22 May 2024

Now I am happy with all of the changes made by the authors. 


I would like to thank the authors for their perseverance throughout this review process and for their helpful and detailed responses.

Evaluation round #3

DOI or URL of the report: https://osf.io/ywkqp

Version of the report: 3

Author's Reply, 10 May 2024


Revised manuscript:   https://osf.io/eygzp

All revised materials uploaded to: https://osf.io/j6tqr/ , updated manuscript under sub-directory "PCIRR Stage 1\PCI-RR submission following R&R 3"

Decision by Yuki Yamada, posted 07 May 2024, validated 07 May 2024

We have asked the reviewer to check the manuscript again and, as you can see, a few minor issues have been raised. I feel these should all be resolved before an IPA is granted, so I would appreciate it if you could address them in a further revision.

Reviewed by Zoltan Kekecs, 07 May 2024

Review notes by Zoltan Kekecs, PhD

 

I am grateful for the authors’ detailed response to my suggestions. I have a few further observations and suggestions that may help the authors to improve the study and the manuscript:

-          Regarding the order effect: I appreciate the authors' concern that addressing the order effect in formal statistical inference would add unwanted complexity, which could threaten the confirmatory nature of this investigation. But instead of doing a confirmatory analysis on the order effect, I simply suggest doing an exploratory sensitivity analysis and/or some other investigation that could hint at the effect of study presentation order. I especially don't like the authors' current proposal that the order effect analysis would only be done if the effect was not confirmed. This practically "stacks the deck" in favor of the authors: if they find the effect, no further investigation is done (which could question the authors' interpretation), but if the effect is not found, an analysis of the order effect could still salvage the situation and give an extra chance of finding the effect. I suggest that the authors simply state that an exploratory analysis will be undertaken to investigate the possible influence of order of presentation. This exploratory analysis could include visual inspection of graphs plotted by presentation order and descriptive statistics broken down by presentation order (see the sketch after this list). These graphs and figures could be included in a supplement if they are too large for the main article. As the authors say, these analyses are straightforward, do not require much effort, and do not threaten confirmatory power.

-          Sample size rationale: I am happy that the authors have revised their power analysis and now provide reproducible R code to support the sample size rationale of their proposal. I would like to point out that the sample size rationale in the current version of the manuscript is still inconsistent. The authors say that “multiplying the largest required sample size among all target studies (208) by 2.5 to 723”. However, 208 x 2.5 = 520.

-          It seems that the authors have re-classified H3 as an exploratory analysis. However, this is not properly reflected in the current version of the manuscript. Please explicitly state in the main text that this research question is not a confirmatory hypothesis but an exploratory analysis. I would also no longer characterize this as a hypothesis ("H"), since no inferential statistics should be run on exploratory analyses.
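To make the suggested exploratory look at presentation order concrete, here is a minimal sketch in R. The data frame and all column names (dat, study, presentation_order, liking) are hypothetical placeholders standing in for the authors' actual variables, not names taken from their materials.

library(dplyr)
library(ggplot2)

# Hypothetical example data standing in for the real dataset (placeholder names)
dat <- data.frame(
  study              = rep(c("Study 1", "Study 2"), each = 200),
  presentation_order = sample(1:2, 400, replace = TRUE),
  liking             = rnorm(400, mean = 4, sd = 1)
)

# Descriptive statistics of liking by study and by presentation order
dat %>%
  group_by(study, presentation_order) %>%
  summarise(mean_liking = mean(liking), sd_liking = sd(liking), n = n(), .groups = "drop")

# Visual check: distribution of liking by presentation order, one panel per study
ggplot(dat, aes(x = factor(presentation_order), y = liking)) +
  geom_boxplot() +
  facet_wrap(~ study) +
  labs(x = "Position of the study in the session", y = "Liking")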

 

Evaluation round #2

DOI or URL of the report: https://osf.io/m6c7w

Version of the report: 2

Author's Reply, 22 Apr 2024


Revised manuscript:   https://osf.io/ywkqp

All revised materials uploaded to: https://osf.io/j6tqr/ , updated manuscript under sub-directory "PCIRR Stage 1\PCI-RR submission following R&R 2"

Decision by Yuki Yamada, posted 05 Mar 2024, validated 05 Mar 2024

Thank you so much for submitting your revised manuscript.
Two reviewers have checked it again, and as you can see, one reviewer is satisfied with the revision.
The other reviewer still has comments on the data analysis and power analysis, which I hope the authors will address.

Reviewed by Zoltan Kekecs, 26 Feb 2024

Reply to Response #1:

I don't find most of the authors' arguments for why a joint design is needed, other than the increased sample size, very convincing. The additional insight from the exploratory research questions is nice, but in a confirmatory RR these are secondary to being able to adequately address the main effect.

However, I am still satisfied with the action the authors took, with one small request. The authors say that “We therefore pre-register that if we fail to find support for our hypotheses that we rerun exploratory analyses for the failed study by focusing on the participants that completed that study first, and examine order as a moderator.” Maybe the authors misunderstood my comment to mean that I was afraid the effect would be masked by combining the studies. On the contrary, I am afraid that the effect would only appear because the studies are run together. I would therefore ask that they re-run the analysis, regardless of whether the main hypothesis is confirmed, focusing on the participants who completed that study first. This way it will be revealed whether the effect is only due to combining the studies.

Reply to Response #7: 

This reviewer note was about the target sample size. The authors say that they intend to analyze all valid cases and that they “see no reason to worry about or suspect optional stopping”. Nobody is “worried about optional stopping” before they start collecting data for their own study. Everyone is the hero in their own life’s story. Nevertheless, having clear stopping rules and pre-specified analysis sample size targets still makes sense to prevent conscious or unconscious biases in research. The study is already well powered, with considerable slack. I still suggest that the authors only analyze the data from the first 800 valid responses in their confirmatory analyses. In the exploratory analyses, anything goes.

Relatedly, I would also like to ask the authors to specify the exclusion criteria for the analysis. I could not find the planned exclusion criteria in the current version of the manuscript, although a 10% exclusion rate is accounted for in the sample size rationale.
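One way to write such a stopping and exclusion rule down unambiguously is sketched below in R; the data frame, column names, and exclusion criteria are placeholders for illustration only, not the authors' actual variables or rules.

library(dplyr)

# Hypothetical raw data standing in for the real export (placeholder columns)
raw_data <- data.frame(
  participant_id         = 1:1000,
  completion_time        = as.POSIXct("2024-01-01") + sort(runif(1000, 0, 86400)),
  attention_check_passed = runif(1000) > 0.05
)

# Apply the pre-specified exclusion criteria first (placeholders here), then keep only the
# first 800 valid responses, in order of completion, for the confirmatory analyses;
# responses beyond 800 would be used in exploratory analyses only.
confirmatory_sample <- raw_data %>%
  filter(attention_check_passed) %>%
  distinct(participant_id, .keep_all = TRUE) %>%
  arrange(completion_time) %>%
  slice_head(n = 800)

nrow(confirmatory_sample)  # 800 (or fewer, if exclusions remove more than expected)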

Reply to Response #8:

This note was related to the power to detect all effects, if you have multiple tests and plan for 90% power to detect each effect separately. The authors reply that this is not common practice, and that the community on X was also divided.

Most of the detailed responses you got on X seemed to agree with the point. (Others seemed to misinterpret the question and responded about alpha adjustment, which is not really an issue here).

The important thing is that this is a mathematical necessity. You can calculate this on a napkin, or in R. Simply run a simulation of a study with two effects (with the same effect size and independent of each other, to simplify things) and a sample size that gives 90% power to detect each of these effects. When you look at how many times you were able to detect both effects, you will find that the probability is 81%. As some posters on X point out, this 81% is a “worst case scenario”: if the effects do correlate, there will be a correspondence in when you are able to find them, so your power to detect all effects will be closer to the individually calculated powers.
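For illustration, a minimal version of the two-effect simulation described above could look like the following in R. The effect size and sample size are arbitrary placeholders chosen so that each test has roughly 90% power; this sketch is separate from the script linked below.

set.seed(1)
n_sims <- 5000
n      <- 86    # per-group n giving ~90% power for d = 0.5, cf. power.t.test(delta = .5, power = .9)
d      <- 0.5

# One two-group study: TRUE if the effect is detected at alpha = .05
detect <- function() t.test(rnorm(n, mean = d), rnorm(n))$p.value < .05

res <- replicate(n_sims, c(effect1 = detect(), effect2 = detect()))

mean(res["effect1", ])                     # ~0.90: power to detect effect 1 alone
mean(res["effect2", ])                     # ~0.90: power to detect effect 2 alone
mean(res["effect1", ] & res["effect2", ])  # ~0.81: power to detect both effects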

Here is a simple simulation showing the issue: we are simulating 5 effects independent from each other, with a sample size enough to detect each effect 90% of the times. However, in any study, there is only about 59% chance for all of the 5 effects to be significant: https://github.com/kekecsz/power_to_detect_all/blob/main/power_to_detect_all.R

All I am saying is that in a study where power is set to 90% to detect each effect, the chance of detecting all effects in the study will be lower than 90%.

“Note: We would be happy to revise given clear editorial guidelines and instructions on what to amend. If the reviewer or editor feel that an adjustment in sample target is needed - then we ask that you please provide us with relevant citations and an example or two of other Registered Reports (preferably PCIRR, preferably replications) that has done something similar, and taking into consideration cost/benefit of going beyond the already large planned sample of 800.” – I find this request unnecessary. This mathematical fact does not require a citation in my view, since it is easy to demonstrate (see the code above), although the responses on X did contain some useful works if you are interested.

I suggest the authors add a paragraph in the power analysis section that says something like this: “It is worth noting that even though the power for this study to detect each hypothesized effect is at least 90%, the power of this study to detect all of these effects simultaneously is unknown.” (If you don’t like “unknown”, you can give the worst-case scenario estimate as I mentioned above, or, if you have reliable pilot data, calculate the true power based on the dependency of the effects from there. For all the effects in this study, with their various effect sizes, this might be a complicated calculation, probably easiest to do with simulation.)

Relatedly, the authors write: “We conducted a series of a priori power analyses based on these effect sizes and we found that 234 participants would be enough to detect the effect sizes with 90% statistical power at alpha = .05 (see supplementary materials and analysis code for more details).” I don’t understand why the authors say 234. In the PCIRR Study Design Table they say “Based on the reported correlations between knowledge, similarity, and liking (Study 3 in Norton et al., 2007), we conducted a power analysis. It revealed that N = 310 and 400 would achieve statistical power of 80% and 90% respectively to detect the interaction effect.” Shouldn’t the authors have used 400 instead of 234?

 

I found all other responses by the authors adequate and have no other issues about the registered report.

Reviewed by Philipp Schoenegger, 05 Mar 2024

The authors have responded to all my comments, either directly changing their manuscript in response or explaining why they did not follow my recommendations. While I do not personally agree with all their reasoning in the cases where they chose not to follow my recommendations, I can see their point of view, and the remaining disagreements are not scientifically important.

I am thus happy to recommend the Stage 1 for acceptance and am looking forward to seeing the results!

Evaluation round #1

DOI or URL of the report: https://osf.io/4sejv

Version of the report: 1

Author's Reply, 20 Feb 2024


Revised manuscript:  https://osf.io/m6c7w

All revised materials uploaded to:  https://osf.io/j6tqr/, updated manuscript under sub-directory "PCIRR Stage 1\PCI-RR submission following R&R"

Decision by Yuki Yamada, posted 06 Sep 2023, validated 07 Sep 2023

First, I apologize that it has taken somewhat longer to collect the peer review reports. The reviewers all submitted their reports very quickly after accepting our request. What took time was the rest of the process, for which I bear the responsibility.

Now, I have just received very helpful peer review comments from two experts. As you can see, both are very positive about this study. And at the same time, they focus on almost the same aspects: power analysis and adjustments for test multiplicity. Please see their specific comments, but I believe their points are in line with current standards of research practice. I encourage you to carefully consider them. The reviewers also made some really constructive comments on the wording of the text, so please take those into consideration as well.

I very much look forward to receiving your revised manuscript!

Reviewed by Zoltan Kekecs, 16 Aug 2023

Review by Zoltan Kekecs, PhD:

The manuscript describes the protocol for a replication of Norton et al. (2007)’s lure of ambiguity effect. The registered report is thorough and shows not only the protocol but also the results for a simulated scenario. The replication attempt makes a reasonable effort at testing the replicability of the critical results of Norton et al. 2007, and includes extensions to the original research questions that help further evaluate the mechanisms underlying the effect. I really like that materials, data, and analysis code used to produce the manuscript are made openly available by the authors, enabling a thorough evaluation of the work. All in all I think this is going to be a valuable project that has a good chance to replicate the effects if they exist and that can provide deeper insight into the influencing factors and mechanisms at play. Below I list a number of suggestions that may help the authors improve the manuscript and the protocol.

-        It seems to me that all participants will do all experiments. However, the subsequent experiments might influence each other. For example, a person who first claimed that they think more traits lead to more liking might respond accordingly in the second study to make their responses more consistent. It seems sensible to me to at least separate Study 1 from the rest of the studies to prevent such effects.

-        “However, we found the choice of analytic strategies somewhat arbitrary; to directly test the effect of the quasi-experimental condition on liking, it is sensible to conduct a t-test rather than computing the correlation. Thus, while we aimed to replicate the correlation, we also planned to test the relationship with a t-test to see whether the quasi-experimental condition influenced liking.” – There is no point in replicating an inappropriate analysis, especially since this is already a conceptual replication. You should use the best analysis method available to answer the research question.

-        I really like the fact that the analysis codes and power analyses codes are available.

-        I don’t see why there is no sample size calculation (power analysis) for H2-2 and for H4-3. Instead of effect sizes provided in the original study (not available in this case), you can use a smallest effect size of interest, or some other effect size estimation method, to obtain the required numbers. Simulation can also help. You can do a simulation-based power analysis (of course, that would also require setting effect sizes and variances for the simulation); see the sketch after this list.

-        In the power analysis for H3 the authors write: “Since the paper does not offer information about standard deviations, we assumed they were 1 and conducted the analysis.” This seems arbitrary. I am not a domain expert, so I do not know whether this is a reasonable assumption. It should be supported somehow, for example with data from another study or from a pilot study. Alternatively, a range of reasonable SDs could be tried in this analysis and the resulting range of estimated sample sizes reported in the paper (also illustrated in the sketch after this list).

-        “…the data of the 30 participants will not analyzed other than to assess survey completion duration, feedback regarding possible technical issues and payment, and needed pay adjustments. Unless in the case of serious technical issues that affect data quality and require survey modification, these participants will be included in the overall analyses” These two statements seem to be contradictory. Please reconcile.

-        The Participants and design section indicates that 1383 participants were included in the data analysis. This is much higher than the target sample size. Why is this the case? Exceeding the target sample size may look like optional stopping, i.e. collecting data until you get the desired results. I suggest that for the confirmatory analyses you only take into account the first X responses, X being the target sample size. If you want, you can repeat the analyses on the full sample as a robustness check/sensitivity analysis. Also, the exploratory analyses can be conducted on the full sample.

-        You have marked H5 through H9 as exploratory “hypotheses”. Something is either an exploratory analysis or a confirmatory hypothesis test. In the “Extensions” section you use inferential statistics and confirmatory hypothesis testing language for these analyses. Please decide whether these are exploratory analyses or confirmatory hypothesis tests. If they are exploratory analyses, do not use p-values or testing language; just focus on the descriptive results, effect size estimates, dispersion statistics, and visualization of these. If they are confirmatory tests, do not mark them as exploratory, and provide sample size calculations (power analyses) for these as well.

-        The power analysis provides sample size targets to reach at least 90% power for each replication hypothesis test individually. This seems to assume that you will have at least 90% power to detect the effects in this study. However, this is incorrect, since you are testing multiple hypotheses. For example, if you have two hypotheses and 90% power to detect each effect, you only have an 81% probability of detecting both effects, and thus a 19% probability of missing at least one of them. With more hypotheses, this chance of missing an effect can stack up quickly. You could either power your study to have a 90% probability of detecting ALL effects, or be explicit about the probability of missing a number of effects in the Power and sensitivity analysis section.

-        “We did not include Studies 3 and 5 as targets of direct replications as these involved experiments using real online dating platforms” – do you mean 4 instead of 5? Study 5 was not mentioned until this point.
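To illustrate the two power-analysis points above (a simulation-based power analysis for hypotheses without reported effect sizes, and trying a range of plausible SDs rather than assuming SD = 1), here is a minimal sketch in R. All effect sizes, standard deviations, and candidate sample sizes in it are placeholders chosen for illustration; they are not values taken from Norton et al. (2007) or from the authors' materials.

# (1) Simulation-based power for a two-group comparison, using a placeholder
#     smallest effect of interest (raw mean difference of 0.3 on a scale with SD = 1)
set.seed(2024)
power_sim <- function(n_per_group, diff = 0.3, sd = 1, n_sims = 2000) {
  mean(replicate(n_sims, {
    g1 <- rnorm(n_per_group, mean = diff, sd = sd)
    g2 <- rnorm(n_per_group, mean = 0,    sd = sd)
    t.test(g1, g2)$p.value < .05
  }))
}
sapply(c(150, 200, 250, 300), power_sim)  # estimated power at candidate per-group sample sizes

# (2) Required per-group n for a fixed raw mean difference across a range of plausible SDs
sds <- seq(0.5, 2, by = 0.25)
sapply(sds, function(s) ceiling(power.t.test(delta = 0.3, sd = s, sig.level = .05, power = .90)$n))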

Reviewed by Philipp Schoenegger, 06 Sep 2023

The paper under review provides a commendable effort in directly replicating Norton et al. (2007). The authors have done an excellent job in motivating the study and setting up the project. The manuscript is not only well-structured but also remarkably transparent, with a plethora of resources and data made available on the OSF. The research question and hypotheses are clearly articulated, and the methods employed are appropriate. Furthermore, the manuscript is characterized by a high level of detail, making it easy for the reader to follow the research process and understand the nuances of the study. However, I have a number of points that I would like to see the authors address before running the study. Some are minor while others are major: the former are suggestions that may be disregarded with some reasonable explanation, whereas the latter should be followed, or at least rejected with detailed argumentation as to the reason for doing so. Either way, I believe that this study is very much worth running and would be a great addition to the scientific record.

 

1) For the abstract, the use of ‘results’ seems too strong to me, especially given the fact that you use much more associational language throughout the paper. I would suggest you use more uniform language to avoid misunderstandings. 

2) Additionally, the term ‘less liking’ should be set up better in the abstract; a less informed reader may not be able to follow.

 

3) There is also a small typo in the abstract; it should be ‘Overall, we found’. I generally suggest reworking the abstract for clarity.

 

 

4) In the introduction, I would improve the set-up early on to better motivate the term ‘stranger’. It seems to me that one may not be able to meet the same stranger regularly (without that person ceasing to be a stranger), at least under the standard definition.

 

5) Additionally, it is worth keeping an eye on consistent formatting of the ‘less is more’ effect. This can be with quotation marks, italics, etc.; just keep it consistent.

 

6) In the ‘Target for Replication’ section, I would again point out that there is a stark difference in the presentation of the results, particularly in the language used to describe the findings. The manuscript alternates between associational language, such as 'tended to report,' and causal language, represented by terms like 'results.' This inconsistency could lead to confusion regarding the level of causality that the study aims to establish. I suggest adopting a more cautious and consistent approach to causal language. Specifically, it would be prudent to decide on a uniform level of causality that the study aims to establish and maintain this consistently throughout the manuscript. This is particularly important because some of the studies that you are replicating or referring to are associational in nature. Using inconsistent or overly strong causal language could risk misleading interpretations and should therefore be avoided.

 

7) When you write that “Ullrich et al. (2013) also challenged Norton et al. (2007),” I would change it to a phrasing that suggests that a paper or finding is challenged, not a set of authors.  

 

8) Lastly, in the ‘Conceptual replications of Study 3 and 4’ section, I think your treatment of the conceptual replication of Study 3 could use more detail. I suggest you explain why you do not replicate Study 3 directly (I assume because of the specific sample, but simply having a different sample is not reason enough; the reason may be cost, temporal differences, effort, etc.), and what potential differences this change in sample may bring with respect to interpreting any given results.

 

 

Below I outline my three bigger concerns:

 

1) In the methods section, while the inclusion of a power analysis is commendable and aligns with best practices in research methodology, there are several issues that need to be addressed. Firstly, the code provided for the power analysis is not immediately executable without minor modifications (at least for me). For instance, the line of code “esc_chisq(chisq = 112,67, totaln = 294, es.type = "r")” contains a syntax error; the comma should be replaced with a period to read “112.67” instead of “112,67.” Although I was able to replicate your final sample size of 234 participants after making these adjustments, the code should be cleaned up to ensure that it runs seamlessly for other researchers who may wish to replicate or extend your work.

 

2) Secondly, the choice of approach for determining effect sizes in the power analysis warrants discussion and perhaps revision. The manuscript seems to rely on expected effect sizes derived from a single paper, which is a methodological choice that could be problematic. Given that the very essence of replication studies like this one is to question the generalizability of such expected effect sizes, it would be more prudent to adopt a 'smallest effect size of interest' approach instead of using the observed effect size as the expected effect size. This would involve identifying the smallest effect that would still be of scientific interest and using that as the basis for the power analysis, rather than relying on potentially inflated or context-specific effect sizes from previous work. The 'smallest effect size of interest' could be anchored initially to the expected effect size but should be revised downwards to reflect a more conservative and scientifically rigorous estimate. This approach would align better with the overarching goals of replication studies, which aim to rigorously test the robustness and generalizability of previous findings. If the authors disagree with this suggestion, a detailed justification for the current approach would be beneficial, including why it is considered superior or more appropriate for this specific study. I am also aware that this is likely to reduce the effect size and thus increase the sample size needed, but for an important replication like this, these additional costs seem justified and indeed crucial.

 

 

3) Lastly, I noticed that the 'Results' section (or any other place) does not include adjustments for multiple hypothesis testing. Given that this is a replication study with multiple hypotheses, it would be beneficial to consider some form of adjustment to maintain the integrity of the results. I checked the analysis code and, as far as I can tell, found no evidence of any such adjustments that may not have been mentioned in the paper, with any instances of 'p_adjust' set to 'none.' I would strongly suggest that the authors consider incorporating a method to correct for the multiple hypotheses being tested in this study.

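For instance, a Holm correction across the set of confirmatory p-values takes one line of R; the p-values and hypothesis labels below are purely illustrative placeholders, not results from the authors' analyses.

# Placeholder p-values for the confirmatory hypothesis tests
p_raw <- c(H1 = 0.004, H2 = 0.021, H3 = 0.048, H4 = 0.180)

# Holm-adjusted p-values (other methods, e.g. "BH", could be substituted)
p.adjust(p_raw, method = "holm")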