Artificial Intelligence - AI
- Hilbert, M., Cingel, D., Zhang, J., Vigil, S., Shawcroft, J Xue, H. Thakur, A., Shafiq,Z. (2025, Nov) #BigTech @Minors: social media algorithms have actionable knowledge about child users and at-risk teens, Telematics and Informatics, Volume 103, 102341, ISSN 0736-5853, https://doi.org/10.1016/j.tele.2025.102341.
- Iftikhar, Z. et al (2025, Oct 15) How LLM Counselors Violate Ethical Standards in Mental Health Practice: A Practitioner-Informed Framework, Proceedings of the Eighth AAAI/ACM Conference on AI, Ethics and Society, DOI: https://doi.org/10.1609/aies.v8i2.36632
- Pre-Print, Zhao, J., Fu, T., Schaeffer, R., Sharma, M., & Barez, F. (2025). Chain-of-Thought Hijacking, https://arxiv.org/abs/2510.2641
- Pre-Print, Berg, C., Lucena, D.S., & Rosenblatt, J. (2025). Large Language Models Report Subjective Experience Under Self-Referential Processing, https://arxiv.org/abs/2510.24797
- Pre-Print, Geng, J., Chen, H., Liu, R., Ribeiro, M.H., Willer, R., Neubig, G., & Griffiths, T.L. (2025). Accumulating Context Changes the Beliefs of Language Models, https://arxiv.org/abs/2511.01805
- Pre-Print. Gu, L., Zhu, Y., Sang, H., Wang, Z., Sui, D., Tang, W., Harrison, E.M., Gao, J., Yu, L., & Ma, L. (2025). MedAgentAudit: Diagnosing and Quantifying Collaborative Failure Modes in Medical Multi-Agent Systems.
- Pre-Print. Chakrabarty, T., Ginsburg, J.C., & Dhillon, P. (2025). Readers Prefer Outputs of AI Trained on Copyrighted Books over Expert Human Writers.
- Pre-Print. Xing, S., Hong, J., Wang, Y., Chen, R., Zhang, Z., Grama, A.Y., Tu, Z., & Wang, Z. (2025). LLMs Can Get "Brain Rot"! https://arxiv.org/abs/2510.13928
- Pre-Print. Sharma, S., Alaa, A.M. & Daneshjou, R. A longitudinal analysis of declining medical safety messaging in generative AI models, npj Digit. Med. 8, 592 (2025). https://doi.org/10.1038/s41746-025-01943-1
- Pre-Print, De Freitas, Julian, Zeliha Oğuz Uğuralp, and Ahmet Kaan Uğuralp. "Emotional Manipulation by AI Companions." Harvard Business School Working Paper, No. 26-005, August 2025. (Revised October 2025.)
- Pre-Print, Morrin, H., Nicholls, L., Levin, M., Yiend, J., Iyengar, U., DelGuidice, F., … Pollak, T. (2025, July 11). Delusions by design? How everyday AIs might be fuelling psychosis (and what can be done about it). https://doi.org/10.31234/osf.io/cmy7n_v5
- Pre-Print, Larooij, M., & Törnberg, P. (2025). Can We Fix Social Media? Testing Prosocial Interventions using Generative Social Simulation. ArXiv, abs/2508.03385.
- Moore, J., Grabb, D., Agnew, W., Klyman, K., Chancellor, S., Ong, D., Haber, N. (2025) Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers, https://arxiv.org/abs/2504.18412
- Sharkey, L., Chughtai, B., Batson, J., Lindsey, J., Wu, J., Bushnaq, L., Goldowsky-Dill, N., Heimersheim, S., Ortega, A., Bloom, J., Biderman, S., Garriga-Alonso, A., Conmy, A., Nanda, N., Rumbelow, J., Wattenberg, M., Schoots, N., Miller, J., Michaud, E.J., Casper, S., Tegmark, M., Saunders, W., Bau, D., Todd, E., Geiger, A., Geva, M., Hoogland, J., Murfet, D., & McGrath, T. (2025). Open Problems in Mechanistic Interpretability. ArXiv, https://arxiv.org/abs/2501.16496
- Randomized Control Study, Fang, C.M., Liu, A.R., Danry, V., Lee, E., Chan, S.W., Pataranutaporn, P., Maes, P., Phang, J., Lampe, M., Ahmad, L., & Agarwal, S. (2025). How AI and Human Behaviors Shape Psychosocial Effects of Chatbot Use: A Longitudinal Randomized Controlled Study. ArXiv, abs/2503.17473.