{"id":2589,"date":"2025-02-14T11:43:05","date_gmt":"2025-02-14T11:43:05","guid":{"rendered":"https:\/\/blogs.bath.ac.uk\/iprblog\/?p=2589"},"modified":"2025-02-14T11:43:05","modified_gmt":"2025-02-14T11:43:05","slug":"large-language-models-are-not-an-existential-threat-to-humanity","status":"publish","type":"post","link":"https:\/\/blogs.bath.ac.uk\/iprblog\/2025\/02\/14\/large-language-models-are-not-an-existential-threat-to-humanity\/","title":{"rendered":"Large language models are not an existential threat to humanity"},"content":{"rendered":"<p><em><a href=\"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2590 size-full\" src=\"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123.png\" alt=\"\" width=\"1024\" height=\"576\" srcset=\"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123.png 1024w, https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123-300x169.png 300w, https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123-768x432.png 768w, https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-123-382x215.png 382w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/em><\/p>\n<p><em>From advancing cutting-edge AI systems by leveraging expertise in computer science and mathematics to exploring the political and social implications of emerging technologies, researchers at the University of Bath are at the forefront of numerous research projects that are helping to shape AI policy and understand AI\u2019s impacts on society. 
Our work spans a wide range of topics, including the development of accountable, responsible, and transparent AI, applications of AI in government and the third sector, regulation and governance, ChatGPT and other large language models, generative AI, and machine learning for policy.<\/em><\/p>\n<p><em>AI is increasingly influencing all aspects of our lives, and in this mini blog series, we aim to highlight both established and innovative AI capabilities, their applications, and their implications for society and policy. We hope you find it insightful and engaging.<\/em><\/p>\n<p><em>If you work in policy at a senior level, you may also find our <a href=\"https:\/\/www.bath.ac.uk\/campaigns\/ipr-ai-policy-fellowship-programme\/\">AI Policy Fellowship Programme<\/a> of interest.<\/em><\/p>\n<div><\/div>\n<p><em><a href=\"https:\/\/researchportal.bath.ac.uk\/en\/persons\/harish-tayyar-madabushi\">Harish Tayyar Madabushi<\/a> is a <span class=\"job-title\">Lecturer in the<\/span>\u00a0<a class=\"link department\" href=\"https:\/\/researchportal.bath.ac.uk\/en\/organisations\/department-of-computer-science\" rel=\"Organisation\">Department of Computer Science<\/a> at the University of Bath. Dr. Tayyar Madabushi's research focuses on understanding the fundamental mechanisms that underpin the performance and functioning of Large Language Models such as ChatGPT. His work was included in the discussion paper on the Capabilities and Risks of Frontier AI, which was used as one of the foundational research works for discussions at the UK AI Safety Summit held at Bletchley Park.\u00a0<\/em><\/p>\n<p>&nbsp;<\/p>\n<p>The introduction of ChatGPT has significantly increased public access to Artificial Intelligence (AI), making the need for a clear and coherent policy system more crucial than ever. Unfortunately, building such a system is not easy, because many different dimensions must be considered. 
This post discusses the various aspects of AI that policymakers are evaluating and provides technical insights to support informed decision-making.<\/p>\n<p>The driving force behind the current generation of AI systems is <a href=\"https:\/\/srinstitute.utoronto.ca\/news\/gen-ai-llms-explainer\">large language models (LLMs),<\/a> which are trained to complete sentences based on the initial input provided. To the surprise of researchers and industry practitioners alike, scaling up these models by increasing their memory and the amount of data they are trained on has enabled them to perform some unexpectedly complex tasks. These tasks can be broadly categorised into two types: those that involve improved language fluency and those that require some form of what we might loosely call \u201creasoning.\u201d<\/p>\n<p>&nbsp;<\/p>\n<h3>The democratisation of language fluency<\/h3>\n<p>&nbsp;<\/p>\n<p>People have long been biased towards perceiving fluently written content as more \u201ctrue\u201d or \u201cvalid\u201d. Widespread access to systems that enhance language fluency should be celebrated, particularly for how they assist those working in languages that are not their native tongue. An outright dismissal or ban of these models could therefore harm those who stand to benefit the most from their use.<\/p>\n<p>Unfortunately, this increased accessibility also makes it easier to misuse these systems, with the most pressing threat being the <a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S2666827024000215\">widespread dissemination of fake news and propaganda<\/a>. With the help of LLMs, bad actors can quickly and at almost no cost generate fake news that is written in perfect English. The threat this poses to democratic systems worldwide cannot be overstated. However, addressing this issue requires policy changes beyond the realm of AI alone. 
For example, methods to limit the spread of misinformation on social platforms, such as reducing the visibility of posts unless they are backed by verified sources, are urgently needed. However, such policies present challenges related to free speech and market dynamics.<\/p>\n<p>Therefore, as a first step, there is an urgent need to raise awareness of these new dangers: linguistic fluency no longer equates to authority. Everything from email scams to student essays will become more polished, but this doesn\u2019t necessarily mean they are authoritative or plagiarised; it simply reflects the use of LLMs to enhance writing quality. Just as typewriters made handwriting less important, LLMs have reduced the need for linguistic flair.<\/p>\n<p>&nbsp;<\/p>\n<h3>Machines that Reason and Think<\/h3>\n<p>&nbsp;<\/p>\n<p>Why would anyone believe that systems trained on the task of \u201cauto-completing\u201d a sentence can \u201creason\u201d? What may seem ridiculous at first becomes more reasonable when we consider that completing sentences in a meaningful way is actually quite complex. For instance, to write the next step in a cooking recipe, there needs to be an \u201cunderstanding\u201d of the previous instructions, the dish being prepared, and the required ingredients or techniques. When we consider that models are trained to similarly complete sentences from across the entirety of the internet, the range of knowledge they acquire can be extensive.<\/p>\n<p>One of the more interesting aspects of reasoning in LLMs has been the phenomenon of \u201c<a href=\"https:\/\/cset.georgetown.edu\/article\/emergent-abilities-in-large-language-models-an-explainer\/\">emergence,<\/a>\u201d which refers to LLMs reportedly developing certain capabilities without explicit training. These \u201cemergent abilities\u201d have become a focal point in AI safety discussions, as their unpredictable nature raises concerns. 
While LLMs present various risks, such as generating fake news or powering social media bots, the concern about their ability to acquire new skills autonomously has heightened fears of an existential threat to humanity. This is especially troubling when these emergent abilities involve complex tasks like autonomous reasoning and planning. Such concerns have even led to calls for a six-month pause in the development of models with emergent capabilities.<\/p>\n<p>Concerns about an existential threat are not fringe beliefs. Several prominent researchers, including <a href=\"https:\/\/mitsloan.mit.edu\/ideas-made-to-matter\/why-neural-net-pioneer-geoffrey-hinton-sounding-alarm-ai\">Geoffrey Hinton<\/a>, the British computer scientist known as the \u201cgodfather of AI,\u201d have voiced these fears. This has contributed to significant actions such as the <a href=\"https:\/\/www.gov.uk\/government\/topical-events\/ai-safety-summit-2023\">Bletchley Park Safety Summit,<\/a> President Biden\u2019s <a href=\"https:\/\/www.federalregister.gov\/documents\/2023\/11\/01\/2023-24283\/safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence\">executive order on AI safety<\/a>, and, more recently, <a href=\"https:\/\/leginfo.legislature.ca.gov\/faces\/billNavClient.xhtml?bill_id=202320240SB1047\">California\u2019s AI safety bill<\/a>, which passed the state legislature but was vetoed by the governor.<\/p>\n<p>&nbsp;<\/p>\n<h3>A different perspective on machine reasoning<\/h3>\n<p>&nbsp;<\/p>\n<p>However, <a href=\"https:\/\/aclanthology.org\/2024.acl-long.279\/\">our research<\/a> presents a different perspective, challenging the notion that the \u201cemergent abilities\u201d of LLMs are inherently uncontrollable or unpredictable. Instead, we propose a novel theory that attributes these abilities to the LLMs' capacity for \u201cin-context learning\u201d (ICL), where they complete tasks based on a few examples presented to them in their prompts. 
We show that the combination of ICL, memory, and linguistic proficiency explains both the capabilities and limitations of LLMs, thereby demonstrating the absence of emergent reasoning abilities in these models.<\/p>\n<p>Why does the parallel to following examples presented in prompts imply a lack of autonomy? Consider what it means to complete a task based on examples: for anything beyond the most obvious tasks, it requires explicit and clear instructions. This is exactly what our research shows is required when prompting LLMs. For instance, LLMs can <a href=\"https:\/\/openreview.net\/forum?id=uyTL5Bvosj\">answer questions about social situations<\/a> without ever being explicitly trained to do so. While <a href=\"https:\/\/aclanthology.org\/D19-1454\/\">earlier research<\/a> suggested that this was due to models \"knowing\" about social contexts, researchers later found it was actually the result of LLMs becoming better at following instructions. The distinction between the ability to follow instructions and the inherent ability to solve problems is subtle but significant, with important implications for how LLMs are used and the tasks they are assigned. 
Simply following instructions without applying reasoning can generate outputs that align with the instructions but may lack logical or commonsense accuracy. This is evident in the phenomenon of <a href=\"https:\/\/arxiv.org\/abs\/2311.05232\">\u201challucination,\u201d<\/a> where LLMs produce fluent but factually incorrect content.<\/p>\n<p>Overall, our research indicates that LLMs do not pose an existential threat, nor is there any evidence suggesting such a threat is imminent. While a completely different technology might present such a risk in the future, there is currently no indication that this is likely or even possible. Therefore, policy decisions should focus on addressing existing threats, such as the spread of fake news, rather than on hypothetical future risks for which there is no evidence. Focusing on unproven dangers could divert attention from the real and immediate challenges we need to manage.<\/p>\n<p>For end users, this means that relying on LLMs to handle complex tasks requiring advanced reasoning without clear instructions is likely to lead to errors. 
Instead, users will benefit from explicitly outlining what they want the models to do and providing examples where possible, except for the simplest tasks.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"post-copy\">\n<p><em><span class=\"s12\">All articles posted on this blog give the views of the author(s), and not the position of the IPR, nor of the University of Bath.<\/span><\/em><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>From advancing cutting-edge AI systems by leveraging expertise in computer science and mathematics to exploring the political and social implications of emerging technologies, researchers at the University of Bath are at the forefront of numerous research projects that are helping...<\/p>\n","protected":false},"author":1742,"featured_media":2591,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[150,143,116,151,126],"tags":[],"class_list":["post-2589","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","category-emerging-technologies","category-evidence-and-policymaking","category-policy-engagement","category-science-and-research-policy"],"acf":[],"jetpack_featured_media_url":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-content\/uploads\/sites\/115\/2025\/02\/Blog-Images-AI-LLMs.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/posts\/2589","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/types\/post"}],"author":[
{"embeddable":true,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/users\/1742"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/comments?post=2589"}],"version-history":[{"count":0,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/posts\/2589\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/media\/2591"}],"wp:attachment":[{"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/media?parent=2589"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/categories?post=2589"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.bath.ac.uk\/iprblog\/wp-json\/wp\/v2\/tags?post=2589"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}