Does self-regulation by gaming companies for the use of loot boxes work?

Recommended by Zoltan Dienes based on reviews by Chris Chambers, Lukas J. Gunschera and Andy Przybylski
A recommendation of:

Assessing compliance with UK loot box industry self-regulation on the Apple App Store: a 6-month longitudinal study on the implementation process


Submission: posted 27 August 2023
Recommendation: posted 25 March 2024, validated 25 March 2024
Cite this recommendation as:
Dienes, Z. (2024) Does self-regulation by gaming companies for the use of loot boxes work? Peer Community in Registered Reports.


Video games may provide the option of spending real money in exchange for probabilistically receiving game-relevant rewards; in effect, encouraging potentially young teenagers to gamble. The industry has subscribed to a set of regulatory principles to cover the use of such "loot boxes", including 1) that they will prevent loot box purchasing by under-18s unless parental consent is given; 2) that they will make it clear upfront that the game contains loot boxes; and 3) that they will clearly disclose the probabilities of receiving different rewards.
Can the industry effectively self-regulate? Xiao (2024) will evaluate this important question by investigating the 100 top-selling games on the Apple App Store and estimating the percentage compliance with these three regulatory principles at two time points 6 months apart.
The Stage 1 manuscript was evaluated over one round of in-depth review. Based on detailed responses to the reviewers' comments, the recommender judged that the manuscript met the Stage 1 criteria and therefore awarded in-principle acceptance (IPA).
URL to the preregistered Stage 1 protocol:
Level of bias control achieved: Level 2. At least some data/evidence that will be used to answer the research question has been accessed and partially observed by the authors, but the authors certify that they have not yet observed the key variables within the data that will be used to answer the research question.
List of eligible PCI RR-friendly journals:
1. Xiao, L. (2024). Assessing compliance with UK loot box industry self-regulation on the Apple App Store: a 6-month longitudinal study on the implementation process. In principle acceptance of Version 3 by Peer Community in Registered Reports.
Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article.

Evaluation round #2

DOI or URL of the report:

Version of the report: 1

Author's Reply, 29 Jan 2024


Please find (i) my response to the recommender comments and (ii) the manuscript file with all changes tracked separately attached below. Thank you!

All files (including a clean version of the manuscript with all changes confirmed) are available as one document via (version 1).

Decision by Malte Elson, posted 28 Jan 2024, validated 29 Jan 2024

Dear Dr. Xiao,


Thank you for your submission to PCI-RR. I have now had the opportunity to read your revised manuscript and your response to the reviewers. Let me first apologise for the considerable delay in my response. I understand that this may have affected the timeline of your project, with the original date of the first sampling of games being January 18. Unfortunate private and professional circumstances on my side prevented me from providing a timely response. Again, my sincerest apologies.


I have decided not to send your revised manuscript out for re-review. The points raised by the reviewers in round 1 were quite clear, and it seemed an undue use of their time to ask them to check your responses to them.


Overall, I am quite happy with your points, counterpoints, and changes to the manuscript. Referring to the major points in my previous letter:


1. I no longer see any issues with using Ukie and non-Ukie games, given their claims that you cite, but also their normative influence on the gaming industry in the UK. I also appreciate the changes you made to the title. Further, I find it acceptable that you do not scale your sample against the number of players per game, essentially giving each game the same “weight” in your assessment. However, I expect that this important point will receive extensive attention in the discussion of your findings, whatever they may be.


2. You offer a useful definition of what you consider a loot box “event” in your study. This is now much clearer, as is how you will proceed in playing the games and encountering loot boxes, if they exist.


3. To be frank, I still have some concerns regarding your use of hypothesis tests and cutoffs. In your response letter, you argue that the cutoffs are arbitrary, but that you will use them regardless to prevent yourself from changing your interpretation of the final percentage, as different people might interpret the same rate quite differently (an industry representative vs. an advocacy group). Further, you argue that the observed rate should not be generalised beyond the top 100 grossing games, and that this set is not a sample but a population.

I can follow your reasoning on the first point, though I would have expected this elaboration in the manuscript itself rather than merely in the response letter. Certainly, one could argue that rules are rules, and that any deviation from 100% is therefore a failure of self-regulation. But then, I am sure that the same industry, or other industries, also do not perfectly self-regulate on other issues, although one might generally say that, on average, the system works. It would probably be best to offer your line of reasoning to the reader.

Regarding the second point, I will admit I remain unconvinced. I can only imagine that it will be excruciatingly difficult for you to write the results section of your manuscript in a way that does not generalise beyond the two populations examined (the January and the July top 100 grossing games). Considering just the title of the current manuscript, “Assessing compliance with UK loot box industry self-regulation on the Apple App Store”, readers might not get the impression that this is merely a study of two specific lists of games at two arbitrary points in time. Similarly, the abstract says “[c]onclusions will be drawn as to whether the measures have been complied with by companies to an adequate degree”, which certainly suggests you think your findings can be used to infer something about companies in this industry generally (which I would agree with). Finally, although this may be more of a linguistic habit, you do refer to these lists of games as “samples” throughout the manuscript (the word appears 19 times, whereas “population” is not mentioned once).

I do not think this point is critical for the manuscript or the empirical work. Looking at the top 100 grossing games makes sense, as these are certainly the games where compliance with the regulation matters most. However, I do not see what is special about the top 100 grossing games that would justify excluding games not on this list from your interpretation. At the very least, you have not yet provided a compelling argument for this.


4. Finally, I appreciate that you have shifted from a programmatic RR to a standard one.


With kind regards

Malte Elson

Evaluation round #1

DOI or URL of the report:

Version of the report: 1

Author's Reply, 24 Nov 2023


Please find (i) my response to the recommender and reviewer comments and (ii) the manuscript file with all changes tracked separately attached below. Thank you!

All files (including a clean version of the manuscript with all changes confirmed) are available as one document via (version 1).

Decision by Malte Elson, posted 26 Oct 2023, validated 26 Oct 2023

Dear Dr. Xiao,


Thank you for your submission to PCI-RR. I have now had the opportunity to read the paper in depth, along with the excellent reviews provided to evaluate the merit of your research proposal.


All three reviewers – Dr. Przybylski, Dr. Chambers, and Dr. Gunschera – found your proposed study timely and the research question worthy of the in-depth investigation you suggest. I share this view: loot box regulation, and industry compliance with it, is a topic of increasing attention within the gaming community and the public sphere. As such, a study like the one proposed could easily become a material piece of evidence in the evaluation of policy effectiveness, and perhaps even affect compliance with regulation itself.


However, all three reviewers raised important concerns about the proposed design of the study. I fully concur with them and will add a few of my own observations below. Some of these points you might disagree with, and I invite you to provide counterarguments in a response letter. Others might be addressed by providing more detail and improving the clarity of the manuscript. And yet others, I believe, will require changes to your study protocol. The reviewers have offered guidance on how the study design and the writing of the manuscript might be improved – please consider these points as you prepare a revision of your research protocol.



Dr. Przybylski has remarked on the choice to include games not represented by Ukie, and that this weakens the severity of your test. I agree: if this study is designed to test compliance with self-regulation principles set by an industry trade body, then it does not seem ideal to include games that do not fall under this self-regulation and whose developers are not represented by Ukie. Whether there is a difference in regulation compliance between Ukie and non-Ukie games may itself be an interesting empirical question. I will leave it up to you whether to pursue this, but if you do, you need to account for it in your sample size and sampling strategy. For example, if only 10% of the top 100 games are actually represented by Ukie (or vice versa), a serious empirical estimate of the difference would probably not be within reach. If resources are an issue, as you state, then it may be advisable to include only those games that are represented by Ukie, at the price of narrowing the generalisability of your findings.

On this point, I also agree with the reviewer that the focus on the UK market should be reflected in the title and conclusions of the paper. Going further, I believe it would also be appropriate to highlight the focus on mobile games, as the sample is restricted to games on the Apple App Store.



Dr. Chambers and Dr. Gunschera both raised points regarding the definition of loot boxes in your study. Whereas Dr. Chambers asks whether one hour is enough to “encounter” a loot box, Dr. Gunschera raises concerns about the focus on loot boxes that can be bought with real currency rather than in-game currency. Both points are important, and I believe they share a common question: what is a loot box, empirically, in your study? Surely it is not the virtual representation as a box, nor can it be any in-game purchase, nor any chance-based event. As such, I invite you to provide further details on how you define and identify loot boxes in games, and by which means. You mention each game will be played for an hour. Does that mean “typical” game actions will be performed (as if you were a regular player), or will you just have the app open for this time? I am asking because it is conceivable that certain in-game actions are linked to loot box drops. Overall, the manuscript lacks procedural and methodological details that readers of the paper would surely appreciate.



There is another important point by Dr. Przybylski regarding how the sampling framework affects the conclusions from your observations: are you studying games or gamers? That is, if only games with a small following are noncompliant, then surely we would have to conclude that the problem is smaller than if the top games (by number of “encounters” with loot boxes) were noncompliant. I think this is a conceptual problem that deserves further attention, and one that may not be easily “fixed”, given that reliable numbers on the games’ market share might be difficult to obtain.



Dr. Przybylski and Dr. Gunschera have both remarked on the somewhat arbitrary choice of cutoffs to determine the compliance level. I, too, was confused about where they came from, and, to be honest, I wondered about the utility of defining cutoffs for the purpose of making a dichotomous decision in a hypothesis-testing framework when the empirical rate itself is of great interest (though I am happy to be convinced otherwise; maybe this just needs some justification). Exacerbating the problem further, the point estimates you propose using will suffer from substantial uncertainty. For example, an incidence rate of 95 in a sample of 100 games has a 95% confidence interval of 76.861 to 116.133, the lower bound being below your cutoff for “inadequate compliance”. Of course, I understand this is not a random sample of an unknown population of games: the top 100 are the top 100. Then again, I am sure you would prefer to generalise your findings to games not included in the sample.
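[Editorial note: the interval quoted above is consistent with an exact (Garwood) Poisson confidence interval for an observed count of 95. Assuming that construction, it can be reproduced in a few lines with `scipy`:]

```python
from scipy.stats import chi2

def poisson_exact_ci(count, conf=0.95):
    """Exact (Garwood) confidence interval for a Poisson count,
    built from chi-squared quantiles."""
    alpha = 1 - conf
    # Lower bound is 0 when the count is 0; otherwise chi2-based
    lower = 0.5 * chi2.ppf(alpha / 2, 2 * count) if count > 0 else 0.0
    upper = 0.5 * chi2.ppf(1 - alpha / 2, 2 * (count + 1))
    return lower, upper

lo, hi = poisson_exact_ci(95)
print(f"{lo:.3f} to {hi:.3f}")
```

[That the upper bound exceeds 100 reflects the Poisson construction, which ignores the fixed denominator of 100 games; a binomial interval such as Clopper-Pearson would respect it. Either way, the recommender's point stands: the estimate carries substantial uncertainty around the cutoff.]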



Dr. Chambers raises a concern regarding your proposal to register this study as a programmatic RR. To be honest, I had overlooked this point until I read his review, but I tentatively agree: I currently do not see the value or necessity of two separate publications rather than one comprehensive paper encompassing all research questions and data. Of course, I cannot stop you from writing two papers rather than one, but if you do insist on submitting this as a programmatic RR rather than a single RR, please consider the guidance offered by the reviewer, highlight the different contributions of each paper, and explain why it is important or sensible to treat them separately.


With kind regards

Malte Elson

Reviewed by Andy Przybylski, 26 Oct 2023

Question 1A. The scientific validity of the research question(s)
Reply 1A. The question of whether companies comply with statutory or suggested regulatory initiatives is an interesting one to me. I approach this believing that there is very low compliance; the report suggests I should expect 1 in 3 games to comply if the UK is like the US. I am not quite sure that the research questions are research questions in the classic academic sense. To my reading, this is some form of policy or programme evaluation. I will defer to the editor on this point, but note that the UK focus should be consistent from title to interpretation.
Question 1B. The logic, rationale, and plausibility of the proposed hypotheses (where a submission proposes hypotheses)
Reply 1B. Given the UK-specific focus that justifies the research questions (and, by extension, the hypotheses), I am concerned by the framing of the research questions and how they’re translated into testable hypotheses. If this is indeed a study of industry practices in the UK, premised on principles articulated by UKIE, shouldn’t these hypotheses be focused on paid loot boxes in games that are represented by UKIE?
I do not believe it is a fair test of the principles to include companies that are not represented by UKIE. Like social media, and the online safety conversation more generally, this is a thorny problem. How might we regulate global tech industries (e.g., porn, social media, games) when these firms, and the decisions they take, are based in Beijing or Palo Alto? I think the VGRF and these principles are very good ideas, but I don’t think it is a fair test of their local effectiveness to examine top-grossing games in the UK if their creators are based in the USA (ESA), the EU (VGE), or other jurisdictions. I believe the UK has many smaller developers, but I am not sure whether they are represented in the top 100, or whether they are more likely to be found on mobile or on console/PC platforms. Is this the case?
Similarly, I’m not sure that the top 100 grossing games make sense as a sampling frame, given that I doubt these are equally profitable or popular games in the UK. For example, it might be the case that the top 4 or 5 games account for 80% of the play volume and spending, and the remaining 95 games of the top 100 are just 20% of the market. If these 5 games were 100% compliant with the principles, would you count this as 80% compliance or 5%? I think this materially affects all of the research questions, including the incidence/prevalence of probability disclosures.
Finally, without knowing the base rate of “Ask to Buy” use, I find it difficult to assess how well justified disregarding this feature is. I know I use this feature with our under-18s, and I would not allow our children to use the App Store at all without it. I think this would introduce an unknown source of error or uncertainty into any of the point estimates reported in the work.
Question 1C. The soundness and feasibility of the methodology and analysis pipeline (including statistical power analysis or alternative sampling plans where applicable)
I think this is a feasible approach if the base rate and geography issues above are tackled. I am not really sure where the 95%, 80%, and below-80% levels come from, though. Reading earlier in the report, I might expect the rate to be 35%. I could envision this being the baseline, with movement from this level (to, I would hope, something much higher) being the standard. My sense is that the author is interested in improvement, so how much improvement would be needed to know that progress is being made in the UK?
The author might also consider starting with a prior belief that there is a 50/50 chance a UK game creator is getting things right, and seeing whether this holds at the start of data collection and whether it has improved at the 6-month mark.
I do not understand how (or who at) the DCMS or UKIE would preregister their hypotheses (lines 473 and 474), or what the value of this would be. I don’t think most video game researchers would be able to do this.
Question 1D. Whether the clarity and degree of methodological detail is sufficient to closely replicate the proposed study procedures and analysis pipeline and to prevent undisclosed flexibility in the procedures and analyses.
I do not believe so. I think a detailed protocol with its own figure would be helpful. 
Question 1E. Whether the authors have considered sufficient outcome-neutral conditions (e.g. absence of floor or ceiling effects; positive controls; other quality checks) for ensuring that the obtained results are able to test the stated hypotheses or answer the stated research question(s).
I do not think so, but I am not sure that is a problem. As the study is framed currently, I do not see a situation where the hypotheses won’t be confirmed.

Reviewed by Chris Chambers, 20 Oct 2023

I enjoyed reviewing this Stage 1 RR – it tackles a timely and important research question and clearly spells out the rationale, hypotheses and proposed methodology. I am not a researcher in this area and will defer to experts for specialist assessments. Instead I focus my evaluation on issues that are generally relevant across most Stage 1 RRs. I hope my comments are helpful.
1. On the issue of sampling bias, I think you make a good point that we cannot know whether compliance is driven by the current changes or by prior external intervention (pp. 8-9), and that consequently this makes it difficult to generalise the eventual results to compliance rates more broadly. To address this point specifically, could it be useful to include an exploratory analysis at Stage 2 within the subset of top-100 games for which no previous intervention was known? Could a comparison be useful (even descriptively) between games subject to prior intervention vs. no prior intervention?
2. You have allocated 1 hour per game to detect loot boxes. How confident are you that this is long enough to detect loot boxes where they exist? I would recommend including some justification for this specific period. Ideally, the sensitivity of this test could be confirmed through evidence rather than intuition: e.g. the strongest case would be previous data confirming that in cases where loot boxes are known to exist, 1 hour is sufficient time to always detect them (and if the detection rate is less than 100%, then what consequence will this have on the sensitivity of the current design to test the hypotheses).
3. p15: “Stakeholders (specifically, the DCMS and Ukie) will be invited to preregister how they will interpret different potential results that may be found by the present study.” If possible, I would suggest inviting them to do this now and then including this pre-specification in the revised Stage 1 RR – that way they will be as bound by their prospective interpretation of the findings as you are.
4. On the issue of delisting resulting in loss of apps: To ensure an adequate sample size, I suggest anticipating the likely delisting rate and overrecruiting in Jan 2024 by that amount to maximise the probability that the July 2024 sample still includes the top 100 at that time (e.g. if a 5% delisting rate were to be expected then take top 105 games in Jan 2024).
5. Precision of hypotheses. The hypotheses are generally clear but I would recommend two changes. First, they should make explicit mention of the two time periods and whether the same predictions are made at each point. Second, even though there is no inferential statistical analysis, this is still quantitative hypothesis testing so the manuscript should include a study design template.
6. The Jul 2024 period is very reasonably at the conclusion of the implementation period. I am wondering however if there would be any value in pushing this back to Aug 2024 to capture any possible delays in compliance? I don’t know enough about this area or the regulatory frameworks that operate, but is there any possibility that a company could intend to comply but just be a few weeks late? By allowing a post-implementation “grace period” of e.g. 1 month (from Jul to Aug), would the demonstration of low compliance rates be a more powerful signal to stakeholders and provide less wriggle room for non-compliant companies to plead minor delays? I will defer to the author’s judgment on this point and note it for consideration only.
7. My final comment is about the programmatic nature of the submission. I can certainly see the value of separately evaluating compliance during and following the implementation period. However, it also seems to me that the final results will be more coherent as a single encapsulated Stage 2 RR rather than two RRs. I am also not sure that the pre vs post implementation components are sufficiently substantive to justify 2 x Stage 2 outputs under Stage 1 criterion 1C (though I concede I am viewing this through a non-specialist lens and do not intend to devalue the amount of labour involved). Also: A programmatic Stage 1 RR typically includes separate sections to explain which specific parts of the proposal will be presented in the different outputs, sometimes going as far as to indicate different font colours to show which text will go in which manuscripts, and these details are always specified in advance (e.g. see here and here for examples). So, in the event that the submission ends up being programmatic, some similar structural work will be needed here.
Lines 152-155: I struggled to parse this sentence.

Reviewed by Lukas J. Gunschera, 16 Oct 2023

The manuscript at hand addresses an important issue: the compliance of the mobile game industry with UK self-regulatory loot box measures. This work is timely and will make a great contribution to the literature and to policy concerning gaming consumer protection. That being said, I have found that the manuscript may be improved in the following areas.


1) The scope of the present manuscript concerns loot boxes purchased with real currencies as opposed to currencies obtained in-game. Although this distinction is common in the literature, I believe it warrants elaboration, and I think the proposed work would benefit from recording data on all possible avenues of purchasing loot boxes (i.e., whether players have the option to purchase the loot box with in-game currencies in addition to real currencies). I believe this is informative because the gambling-like characteristics of loot boxes persist irrespective of the currency used to obtain them. The value of any currency, whether real or virtual, is learned. Therefore, beyond the concerns for parents’ wallets, the psychological effects of loot box purchasing may span the currencies used to purchase them.

Furthermore, the psychological effects of loot boxes may even be strengthened for purchases with currencies obtained in-game, as opposed to money. Players who have invested many hours into obtaining said in-game currency may perceive this to be a much larger investment than money, especially when the money comes from their parents’ wallet. While I understand that the distinction between real-world and in-game currencies is common, I believe it would be worthwhile to collect information on the currencies that can be used to obtain loot boxes (money, in-game, both) for each of the 100 mobile games (ll. 87-93, 364-368).


2) Despite resource constraints and stakeholders’ heightened interest in the highest-grossing mobile games, the sample size rationale is insufficient. A power analysis or simulation would help determine which effects the study would be sensitive to, especially given that precise decision cut-offs are specified for all hypotheses (ll. 245-254).
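[Editorial note: a sensitivity simulation of the kind the reviewer suggests could be sketched as follows. The 95/100 cutoff and the true compliance rates below are illustrative assumptions, not values from the manuscript.]

```python
import random

def prob_clearing_cutoff(true_rate, n_games=100, cutoff=95,
                         sims=10_000, seed=1):
    """Estimate the probability that the observed number of compliant
    games (out of n_games) meets or exceeds the cutoff, for a
    hypothetical true compliance rate."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(sims):
        # Each game is independently compliant with probability true_rate
        compliant = sum(rng.random() < true_rate for _ in range(n_games))
        if compliant >= cutoff:
            hits += 1
    return hits / sims

for p in (0.90, 0.95, 0.99):
    print(f"true rate {p:.2f}: P(observed >= 95) ~ {prob_clearing_cutoff(p):.3f}")
```

[In this sketch, even a true compliance rate of 95% clears the 95/100 cutoff only about 60% of the time, illustrating why a point estimate near a cutoff is a fragile basis for a dichotomous verdict.]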


3) For Hypothesis 4, the decision criterion differs from that of the preceding hypotheses. Please add a brief explanation for this change (ll. 227-230).


4) Overall, the manuscript would benefit from some copy-editing. This includes breaking up long and convoluted sentences; using accessible language as opposed to unnecessarily complex words; and using precise and objective wording. Some examples below:

ll. 152-155 Convoluted sentence structure

ll. 171-174 Complicated wording

ll. 196-198 Subjective/moral wording

