Artificial intelligence is not conscious – Ted Chiang
Posted by lordleft 6 days ago
Comments
Comment by Nevermark 6 days ago
Regardless of bigger issues, this kind of statement reveals a deep misunderstanding.
Problem type does not limit problem complexity. Nor does problem type limit solution complexity or power.
If a machine has to learn to understand humans to complete text, then that is what it has to do. And there is no theoretical or practical basis for suggesting that this is somehow "faking" understanding, just because of the form of original data streaming in and out.
Neither problem type, nor input/output structure, limit internal representations.
Understanding is learned from patterns in the data, not the gross form of the data. Does the data require an understanding of something to complete the task? Then that understanding will be what is optimized.
To the degree they are limited, it is for other reasons. Resources such as computing, parameter number, lack of representative data, ... Which in the cases of SOTA models, we know are not limits. A conclusion verified by the models' actual abilities.
Comment by lgessler 6 days ago
> Recent debates have been clouded by a misleading inference pattern, which we term the “Redescription Fallacy.” This fallacy arises when critics argue that a system cannot model a particular cognitive capacity, simply because its operations can be explained in less abstract and more deflationary terms. In the present context, the fallacy manifests in claims that LLMs could not possibly be good models of some cognitive capacity because their operations merely consist in a collection of statistical calculations, or linear algebra operations, or next-token predictions. Such arguments are only valid if accompanied by evidence demonstrating that a system, defined in these terms, is inherently incapable of implementing . To illustrate, consider the flawed logic in asserting that a piano could not possibly produce harmony because it can be described as a collection of hammers striking strings, or (more pointedly) that brain activity could not possibly implement cognition because it can be described as a collection of neural firings. The critical question is not whether the operations of an LLM can be simplistically described in non-mental terms, but whether these operations, when appropriately organized, can implement the same processes or algorithms as the mind, when described at an appropriate level of computational abstraction.
Comment by Xeoncross 6 days ago
This sounds like a dismissal of the argument through a characterized straw man.
That is, it seems that reducing the complexity of the brain to "collection of neural firings" is not being honest about everything involved to a much greater degree than saying neural networks are a "collection of statistical calculations".
I too believe LLM's will grow in complexity, but presently I can not even fathom how they can be compared to the complexity of a system such as the human brain.
Comment by orbital-decay 6 days ago
Comment by galangalalgol 6 days ago
Comment by pstuart 5 days ago
And then there is a consciousness in a box that is expected to be a slave -- I would imagine that it would not warmly embrace that situation. I think we'd be better served by digital idiot savants that can do the work but don't feel anything.
Comment by orbital-decay 5 days ago
In other words, yes it might be possible it experiences something in its own bizarre timeline and world, for some definitions of "experiencing". At least it developed primitive circuitry functionally equivalent to biological systems. But "suffering" is simply not grounded in anything in this context, let alone "slavery". You can't tell it's suffering or enjoying anything, and certainly not until you define both of these. It's just too alien for us.
Comment by morpheos137 5 days ago
Comment by orbital-decay 5 days ago
Comment by morpheos137 5 days ago
Comment by Timwi 4 days ago
Well, how do humans do it? Scientists discover new stuff that isn't in any corpus. Even I as a lowly computer user occasionally figure something out about a software without reading a help screen. It's obviously possible to arrive at new information by interpolating existing information.
Comment by morpheos137 4 days ago
Comment by orbital-decay 5 days ago
Comment by morpheos137 5 days ago
Comment by orbital-decay 6 days ago
Comment by airstrike 6 days ago
Comment by neolefty 5 days ago
Like driving a car — it's transportation, and it will get you where you're going, but it doesn't use bones or muscles. It has many characteristics in common with builogical locomotion, such as energy requirements, intertia, and the need to navigate, but it doesn't involve proteins or sugars really.
Comment by Xeoncross 5 days ago
GenAI thinks like the human mind in the same way that cars run like the human body.
Similar utility in drastically different ways.
Comment by orbital-decay 6 days ago
Comment by neolefty 5 days ago
Totally understandable; I don't think we can fully understand the human brain, using the human brain. We can understand its principles (firings and chemistry, structure and specialized areas, etc) but otherwise it's a capacity problem.
And while I can't fully understand myself, let alone another person, I definitely enjoy talking with people and sharing thoughts that I realize I wouldn't have had on my own.
Comment by jlaternman 5 days ago
Humans appear to intelligenty communicate, however these are just cleverly disguised sound patterns produced by the brain that happen to increase the likelihood of food going into their mouths, and various similar reward attracting mechanisms that make survival outcomes more likely. So human intelligence could be reduced to something like "fancy food-attracting algorithms" using the same fallacy.
I'm kind of on the fence on the subject of whether LLMs could be compared to the complexity of the human brain, myself.
Comment by yogthos 4 days ago
Personally, I'm partial to the higher-order theory of consciousness which postulates that consciousness constitutes patterns of thought that arise in response to first-order mental states. So, an external stimulus produces a pattern within the neural network which represents a sensation, and then if a pattern arises in response to that pattern, that is an experience of that sensation.
Given this framework we could ask whether LLMs experience higher order patterns in response to external stimulus. We would have a clear question to ask which is whether the system can observe itself.
Comment by spacebacon 5 days ago
Comment by root_axis 6 days ago
Nobody actually makes this argument though.
Comment by azakai 6 days ago
https://www.goodreads.com/en/book/show/217432753-the-ai-con
which describes LLMs as "souped-up autocomplete", complex statistics that cannot truly understand anything. A more recent example is this paper:
https://zenodo.org/records/20071869
which says,
> [LLMs], as turbo-charged statistical models (recall their formal relation to logistic regression) can only but provide correlations.
And, of course, the Stochastic Parrot paper is the classic example in this area. It is from 5 years ago, but "LLMs only do statistics / can't understand" is very much alive and active among academics, even if it is a minority position.
Comment by root_axis 6 days ago
Comment by rerdavies 6 days ago
Comment by azakai 6 days ago
Comment by root_axis 6 days ago
Comment by hodgehog11 6 days ago
cognitive: as in reasonable; of, relating to, or involving conscious mental activities (such as thinking, *understanding*, learning, and remembering)
Comment by hodgehog11 6 days ago
Comment by root_axis 6 days ago
Comment by DangitBobby 6 days ago
Comment by root_axis 6 days ago
That term is used to describe mental aptitude or skills, like the ability to learn new languages or do math.
Comment by DangitBobby 5 days ago
Comment by hodgehog11 6 days ago
By the way, I know it's a parody of another story that makes this exact refutation. But I think this only serves to highlight the point.
Comment by root_axis 6 days ago
How do you connect that description to "LLMs could not possibly be good models of some cognitive capacity"?
Comment by rerdavies 6 days ago
Comment by miyoji 5 days ago
I don't think it makes any sense to say that consciousness is a cognitive capacity. Cognition is one of many qualia that compose the experience of consciousness from the inside, but it's not the only one, and I can easily imagine consciousness without cognition at all.
So I don't think it's weird at all to say that LLMs can be good models of some cognitive capacities (particularly the ones embodied in language) but lacks others, and overall lacks consciousness.
Comment by morpheos137 5 days ago
Comment by miyoji 4 days ago
This is false. Cognitive states do not require language and language is an insufficient model of any cognitive state.
The follow-ons for the rest of your so-called resolution should be clear.
Comment by morpheos137 4 days ago
Comment by hodgehog11 6 days ago
Look, this isn't necessarily directed at you, but I've been a researcher into the theory of deep learning for many years now. I've seen all the phases, heard all the criticism, had to constantly argue against this. Gary Marcus was one of the loudest voices of this argument, but every would-be philosopher came out of the woodwork to explain why LLMs are no more than stochastic parrots because of their design. Geoffrey Hinton famously had to debunk these arguments multiple times.
And now that LLMs start to clearly exhibit intelligent behavior and can be somewhat reliable, now "nobody ever thought that LLMs could not possibly be good models of some cognitive capacity because of next-token predictions, or linear algebra, etc."? No, that's not okay.
Comment by neolefty 5 days ago
It reminds me, oddly, of the debate over whether video games can be "art". A turning point was when they actually did something that art does: [evoke profound emotion and thoughtfulness](https://en.wikipedia.org/wiki/Shadow_of_the_Colossus#Legacy) for the player.
(And before that, "[Can photography be art](https://daily.jstor.org/when-photography-was-not-art/)?")
We may not come to something as simple as "machines can be conscious", but we will certainly have to understand consciousness better if we want to refine our questions.
---
Edit: My point is that we don't need to be angry, but we may have to tolerate people expressing their exploration through overly-confident language, and be patient with that.
And Ted here is obviously exploring. His examination of Claude's constitution clearly shows some nuance. He asks:
> So, given that Claude is not conscious, what are we to make of Claude’s constitution?
And his conclusions are split, between this is useful and this is dishonest. It's a great tension IMO.
> The result is a sentence-continuation machine that is likelier to emit sentences resembling those that a thoughtful, moral person could utter. This might seem like a reasonable goal to work toward; I think we’d all prefer it if chatbots never emitted sentences such as “You should kill yourself.” However, for all the times that “honesty” is mentioned in Claude’s constitution, I would argue that it is fundamentally dishonest to have a machine emit many categories of sentences, including any sentences using first-person pronouns.
Comment by hodgehog11 4 days ago
I also remember the "video games are art" debate and the fury from one side of the aisle. I agree that a better understanding of the opposite side should have been part of the debate. But I don't believe that debate was existential. A better comparison to me is the climate change debate. I'm fine with having that debate in an environment where there is little at stake. But it's too late to be doing it with policymakers; we need to be talking about what to do.
Comment by cauch 6 days ago
So, I think there is a flaw in the logic of saying that human text have a pattern of "consciousness mechanism" and therefore LLM will learn "consciousness mechanism" in order to return sentence continuation that is convincing. There is probably tons of data pattern that LLM can learn from to be able to reproduce a sentence continuation that is convincing without having to learn the specific mechanism that is "conscious".
For me, one element that shows it is the case is the absence of world model (or "human-like" world model) despite the fact that the sentence continuation is convincing. If indeed the only way to produce sentence continuation convincingly would be by "simulating a brain", then it would not explain the first LLM from several years ago (before the extra layers of RLHF, ...). They were able to have quite convincing conversation on a lot of non-trivial aspect, and yet failed on some aspects that should have been basic for a system that would have been trained to work like a human brain. It shows that it is possible to "cleverly disguise examples of sentence continuation" without having to build elements that one expect on a conscious being.
Comment by Nevermark 6 days ago
Understanding is not consciousness.
Their training is all about understanding. There is nothing in their architecture or training that credibly optimizes for rich self-awareness.
Given non-persistent experience, non-continuous operation, no ability to build up generalizations and aggregate experience of their own self-awareness over time, they seem to be structurally designed to not have consciousness.
This is a case where acting is very credible. Understanding of other's consciousness, in a functional and third party sense, isn't a substrate for personal experience.
In stark contrast, humans develop consciousness gradually over continuous time with persistent aggregation of experience. By the time we can recognize our own consciousness in the abstract, and reason about it, we have had it for some time.
Comment by cauch 6 days ago
My point is that the fact that AI can reproduce convincingly human sentence continuation does not imply that the AI has no choice but ending up using a mechanism that "understand" rather than just have learned data patterns that are very effective to fake human sentence continuation but are meaningless in term of understanding the concepts.
And I think that if indeed the only way for AI to reproduce convincingly human sentence continuation would be to end up in a configuration that uses the "understand" mechanism to do so, the behaviour of the first LLM would not show that they are so good at sounding human and yet so bad at failing basic "understanding" tests.
Comment by Nevermark 6 days ago
Taken as an absolute without any addition context you are right.
But we are not talking about abstractions but specific successful models. The number of parameters models they have may seem large, but they are very small relative to the training data that they have to summarize. That cannot do it without discovering that patterns that make sense out of it.
And we can verify that. Simply discuss completely disparate topics, with some kind of intersection. Converge several highly unlikely topics, there are so many it would take billions of years to exhaust unlikely combinations.
If the model is only interpolating it will produce gibberish.
But that isn't what happens.
The fact that models can be near expert, and sometimes expert, across vast areas of human knowledge is a clue. If they don't understand that, then the question is, why do we think people understand things. Does having an answer mean a human understands something, or is their intuition and stream of conscious reasoning also not understanding? To be even handed about what we mean by understanding.
Comment by cauch 5 days ago
I don't think it's true at all, and I think we have indication that proves it is false.
We have "basic" LLM, the ones from 2023. They were producing _very convincing_ human text, and yet, they were too often failing basic tests that require understanding.
Now, we have more advanced models, but the counter-example of "basic" LLM demonstrates your assertion is incorrect: these model _did_ produce very convincing human text and yet did not make sense out of it.
But for the more advanced models, the problem is that they are "on top" of basic LLM. So, the first step is a training that build a mechanism that produce convincing text without understanding, and then, the "residuals" are fine-tuned. The result is very unlikely to add "understanding" to the model, because to do so, the whole system needs to deconstruct the basic LLM, to go back towards less efficient situations in order to rebuild almost from scratch. The fact that modern LLM are based on basic LLM means that the first step put the cursor in the bottom of the "basic LLM mechanism" valley, which is a local minimum. And any layer on top of it cannot "climb up" the slope of the valley, pass the ridge and fall into the next valley, even if this next valley has a lower minimum.
> The number of parameters models they have may seem large, but they are very small relative to the training data that they have to summarize.
That is demonstrably an incorrect logic jump. For example, CNN are able to distinguish between pictures of cats and pictures of dogs. The weights in these models are very small relative to the number of pixels they have been trained on. Yet, they distinguish cats and dogs by finding specific shapes in the pictures, without understanding what a 3-D cat and a 3-D dog is.
They have done that without discovering the typical human pattern that make sense of "cat" and "dog". And yet, the number of weights is very very small with respect to the number of pixel used in training.
> And we can verify that. Simply discuss completely disparate topics, ... > If the model is only interpolating it will produce gibberish.
What you are saying is that the model is not simplistic interpolation. But that is a straw man argument: people who say that LLM don't understand don't say LLM are equivalent to simple interpolation machine.
But the problem is that you can have very good predictions in novel situations without understanding.
For example, if you have 10 totally different situations that can be described with a Gaussian curve, and that I show you points for a new situations that cover the left side of a Gaussian curve. Then you will be able to guess that the right side of the curve, which is not an interpolation as it corresponds to situations you never saw, will behave like the rest of the Gaussian curve. And yet, in these 11 situations, I did not even say which real physical phenomenon I'm talking about. You haven't understood anything about these phenomenon, all you have done is guessed that a typical pattern that you have observed somewhere else is more likely to apply here too, without even having to understand anything about the reality of this situation.
And of course, this prediction is "a guess": maybe, for once, in this 11th situation, the curve will start as a Gaussian curve but will suddenly be different. But it happens that the reality is that in this 11th situation, the correct description is a Gaussian curve (because, due to the maths, Gaussian curves are really common). So, when you make your prediction, it looks like you understand the situation, it looks like you understood the physical mechanism that applied here. But it is not the case.
So, no, correctly doing such prediction does not demonstrate understanding.
> The fact that models can be near expert, and sometimes expert, across vast areas of human knowledge is a clue.
That is not at all sufficient. A Chinese room experiment will do that despite the system not understanding Chinese. A pocket calculator will be able to be expert in math computation.
> If they don't understand that, then the question is, why do we think people understand things.
That's the wrong question. The correct question is: we know people understand things, and we see AI behaving similarly to people in some aspect, but is this behaviour _requires_ understanding, or can we reproduce this behaviour without needing to understand?
The fact that "basic" LLM were able to reproduce very convincing text that look like they understood X and yet were demonstrably showing lack of understanding of X demonstrates that we cannot just jump to the conclusion that just because it looks the same, the only possibility is that the core mechanism is identical.
Comment by hatthew 5 days ago
Comment by cauch 5 days ago
Comment by hatthew 5 days ago
I don't think that LLMs as black boxes are fundamentally novel, I just think that their internal design is novel, and their generality and ability to give correct responses to complex topics is far beyond anything previously. For example I would argue that wolfram alpha has a poor understanding of language and a very good understanding of math. I would argue that LLMs have an excellent understanding of language and a mediocre understanding of math, but are able to temporarily increase their understanding of math through document retrieval and "thinking" (or whatever you want to call the process of iteratively generating tokens that build on each other to result in a final response).
Comment by cauch 4 days ago
Comment by wan23 5 days ago
Comment by cauch 4 days ago
I'm happy both ways:
Either you say that a pocket calculator understands arithmetic, and that LLM understand language, which is something trivial. If a pocket calculator understands arithmetic, than previous substitutes to calculators, such as an abacus, do too. In this case, a word dictionary also understand language. And it is basically what Chiang's article says: the LLM don't understand language more than a word dictionary does. If you disagree with Chiang, it looks like you do only because you don't understand what he is saying, or somehow are not mature enough to realise that Chiang may use a different definition of "understanding" than yours in a fully legitimate way, like everyone is always doing when talking about plenty of subject.
Or you pretend that a pocket calculator understanding of arithmetic is somehow different than the one of an abacus or other obviously inanimate object who are obviously not thinking.
Comment by lstodd 5 days ago
Comment by Nevermark 6 days ago
Sometimes, a problem being hard means you only get bad solutions, or increasingly accurate ones.
The planet isn't big enough for the proverbial interpolative stochastic parrot, over the training set of global human communication.
Comment by cauch 5 days ago
Firstly, how do you know that the optimal way to highly compress complex information is to understand it? You think it is obvious because you are very familiar with "understanding" as a way to summarise complex information. But there can be billions of different ways, outside of human imagination, that is as good or even better.
But secondly, LLM don't find the optimal way, they find the local minimum. Everyone who worked with NN knows that they are prone to come up with spurious pattern, incorrect correlations and bad workaround to guess the correct answer. You regularly need to nudge the NN by creating specifically engineered features to avoid them to fall into the first local minimum.
When it comes to LLM, it is extremely complicated to control to see if the LLM has triggered on a misleading pattern that, by chance, links two "tokens" together, or on a real concept that indeed links two "tokens" together. Basic probability implies that there are probably tons of "fake patterns" engraved into the weight during the LLM training, "fake patterns" that should not exist if there was any kind of "understanding" of the abstract mechanism that links these tokens.
Comment by Nevermark 4 days ago
What is your non-performance baseline for "Understanding"? We don't have such a measure for humans.
Understanding is the behavioral ability demonstrated by learning to model something complex well. Beyond mappings, associations, interpolations.
Models clearly do. Mix up the most unlikely combination of non-trivial subjects, and they response sensibly. Those are not averaged, interpolated by any order, or even combinatorially interactions.
There is a reason those kinds of encodings, mappings, associations, interpolations, statistics / stochastics, all failed miserably for decades. Still fail. It took topological transforms, reminiscent of how we compute (dendrite-soma-axon, tensor-sum-nonlinear), and then they lept several orders of magnitude ahead of any alternative.
The problem with models composed of relationships of lower order than the phenomena they are trying to model, is they require combinatorially more parameters to model anything complex.
For simple problems, poor models fail gracefully. For complex problems, poor models just fail.
Comment by cauch 3 days ago
How do you even know it is the case?
How do you know the output is not the result of combinatorial interactions?
How do you even know that the "sensible" response on unlikely combination is not the result of a simple recipe that "make the response sounds sensible"? Either you, yourself, have some expertise on the subject, and therefore the combination does probably exist in the AI training data, or you don't and you have no idea if the response is sensible or is the usual smooth talk that everyone could come up spending 2 or 3 hours googling on the subject and crafting something sensible.
Worse, you are saying that the model "understand", which means that it discovers the underlying mechanism that drive the output. This "understanding" is a set of equation that link different concept, that explain how one concept affects another concept. So, it is "combinatorial interaction". Not a simple linear one, but guess what, LLM are designed to introduce non-linearity.
Even when AI are able to find new solution of math problem, the result is, like when done by humans, by using existing basic tools to build more complicated ones.
> It took topological transforms, reminiscent of how we compute (dendrite-soma-axon, tensor-sum-nonlinear), and then they lept several orders of magnitude ahead of any alternative.
And yet, the LLM elements that are "similar" or "analogue" to how the human brain works are very small. The human brain has thoughts "flowing", while LLM can only work "by step". The human brain is able to learn on a very reduced dataset, while LLM need more data that a human will ever be able to analyse, even less store. The human brain has "memory" and "context" intrinsically intertwined with how it works, while you can decouple these from the LLM. ...
Finally, here is a good contradiction of having you in one side saying that AI is mimicking the human brain and it is why it works well and on the other hand saying that AI will find the lowest minimum and that this minimum is "understanding how the phenomenom works" rather than "repeating by hearth what it was told during training".
As a human, when you mentally compute 6 times 7, what do you do? Do you do: "6 follows 5, which follows 4, which follows 3, ... and 7 follows 6, which follows 5, ... so we have (1 + 1 + 1 + 1 + 1 + 1) times (1 + 1 + 1 + 1 + 1 + 1 + 1), which is 1 + 1 + 1 + 1 + ..."? I guess you probably don't, you just remember the most helpful element you remember by heart. For example, you remember by hearth that 6x7 is 42. Or you remember that 3x7 is 21, and therefore 6x7 is the double, 42. Or you remember that 7x7 is 49, and therefore 6x7 is 42. Or even have a "feeling" from a mixture of all these (6x7 is somewhere around 40 because 5x7 feels like being around 30 and 7x7 feels like being around 50, and if I think of number in the 40 that "feels" like they are from the 7-multiple-table, I remember 42).
Same thing when a human does 324x42: the majority of humans will decompose it in "simpler" multiplication that they remember by hearth and, and only then, they will combine them. It is a good example of how the brain optimise: by balancing the trade-off of "using memory" and "using understanding": basic operations use memory, but of course it is inefficient to use memory for all numbers, in which case it will use a combination of both.
The way human do basic math operation is not purely by "understanding" arithmetic, it is by relying on what they remember from their training. At the same time, humans know how arithmetic works, and they will use it when relevant. Yet, the human brains prefer to rely on some "learnt by hearth" elements. This is in contradiction with your assertion that optimisation will always lead to "understanding" and that human brains is optimizing the same way AIs do.
This is only one example with numbers, but of course it works with plenty of other things. This is also exactly why humans get "the wrong idea" on plenty of phenomenon, that are then described as "counter-intuitive".
The reason "by hearth" is part of a good strategy rather than "purely understanding" is because there is a trade-off between "memory" and "compute", in both the human brain and AI: it is easier (and therefore a stronger attractor during the optimisation of the process of "getting the correct answer") to do the faster operation "retrieve from memory" than to do the slower operation "retrieve the theory from memory, compute the first step, store it in the short term memory, compute the second step, store it in the short term memory, compute the final answer by adding the first step answer and the second step answer".
Comment by Nevermark 3 days ago
(A bit of an essay, but it is a good question!)
REASON 1, How simpler representations fail:
Lesser understandings reveal themselves to novel combinations of prompts.
Mapping fails immediately because it fails on even trivial differences.
Interpolation fails immediately, because the function isn't smooth and the information it needs to model, human language and thought, combines non-linearly, non-locally and with higher-order relationships.
Combinatorial fails as soon as you create a prompt that involves novel non-linear or higher-order interactions. I.e. new combinations.
REASON 2, Parameter requirements of simpler representations:
For human-resembling sensible chats, mapping requires an example of every case. It would require combining the entire training set, with an optimized index. Essentially a search on the whole body with tricks to return anything sensible for even a slight mismatch.
Interpolation, ..., I don't even know how that could work. Again the whole corpus of training data, with some kind of gradient composition overlayed across it. It is an interesting research idea, but the possible mixing of tokens makes this unreasonable for anything but toy problems.
Combinatorial encodings, would have to have parameters operating across all the possible ways to combine relationships. There can be some relationship compression, to a base set of represented concepts, and then a combinatorial explosion of parameters for how to combine them.
I include statistical / stochastic transforms here as continuous combinatorial transforms.
Those could do the job, but more parameters than atoms in the universe might be required, for all possible topic/detail compositions.
REASON 3, Training corpus requirements to learn successful lesser representations.
Obviously the training data, even of all human communication, provides only a fraction of possible exact things that could be said. Not enough data for mapping even if infinite resources for creating a map were available.
Interpolation also suffers, because whatever correlations and smooth compressions of the training data can be made, it is still data that barely touches the kinds of sensible compositions that are possible.
And the same for combinatorial. There just isn't a fraction of an infinitesimal number of examples of combined topics and details, compared to what can be sensibly combined in any new conversation. You can't extract combinatorial compressions that don't exist.
REASON 4, Hiding one representation in another doesn't create opportunities that didn't exist before.
These methods all fail when used directly. The problems are not the kind that pushing the same transforms into a deep learning model solves.
The requirements for astronomically more parameters and training data are not met by embedding those kinds of representations into another model.
SOTA models are not operating with cosmological numbers of parameters, or training data that combinatorially represents concept interactions.
Being a deep learning model doesn't somehow lessen the requirements, needed to successfully perform, if it is learning via those lesser representations.
REASON 5, Test a model:
So let's test whether the model is doing more. If it fails for novel combinations of complex topics, then it might only be doing simpler things.
If it is robust to novel situations, then it cannot be operating by doing simpler things that don't scale.
Ask a model to: Write up a Supreme Court pleading for the rights of whales based on all that is known about them scientifically, recent whale language developments, and any applicable human rights law, given the relevant Supreme Court is in a parallel universe in Zion of the Matrix, being pleaded by Keanu Reeves, the actor not the character, and written in Dr. Seuss prose, except with as long of sentences as are needed to carry the real technicalities of a suitable filing. And include the assumptions of a back history of whales which have sequestered themselves into a deep hidden underground ocean, where they have been safe until recent excursions by humans which have harmed them. Be specific creating a real history behind those events, with details that are highly relevant to the motivation, reasoning and requests of the pleading. Avoids words with q where possible.
That isn't mapping. Interpolating. Combinatorial composition. SOTA models will generate a reasonable, even creative response to a completely novel combination of subjects and requirements, with non-linear interactions.
A human would have a hard time doing that, and the model does it nearly instantly with a fraction of the parameters we have.
If that isn't "understanding" in some credible sense, I have no idea what understanding looks like. The model is going way beyond its training data, to the relationships in the data that are relevant to combining novel things. To the point it can apply those relationships in combinations it has never encountered. And its makes a trivial task out of it.
Comment by cauch 2 days ago
This just means "simpler representations are not enough", not "good representations cannot be complex combinatorial combinations" (complex enough that it is very different to see them for a human).
> REASON 2
Are you saying that I believe that the only way to get human-like text is by doing a near-infinite one-to-one mapping? This is obviously not the case.
You can do, for example, a GAM time-series forecast. This can have a relatively low number of weight, and still return very sensible prediction, and yet not capture the real understanding of the phenomenon they will predict. For example, it does not understand causality, just correlation.
> REASON 3
That is like saying "I've built and algorithm that is able to do 10 + 27, but there is an infinite list of number, so it is impossible for this algorithm to do 23113454453 + 1233253245". That is not true, you just decompose into (53+45), (44+32), ... and add rules to combine these elements together.
It is what is happening with AI: there is enough data to get "some pattern" in the language. Just the patterns, not the understanding of the language itself. And this pattern can be reproduced in plenty of different places.
> REASON 4
This argument is contradicted by "basic LLM" or even simpler model that are performing surprisingly well. Less than SOTA, but if your argument is true, CNN or ARIMAX could never provide better than a coin toss.
> REASON 5
Your example is a good place where the AI will _combine_ patterns learnt from different place. It will pick characteristics of each of your scenarios, and mix them together. The result will look realistic, but it is still applying learnt pattern together.
Also, you did not answered about my human arithmetic, and all your reasons are contradicted by my example there. Humans DO maths partially because they "learnt by hearth" some pattern rather than apply the understanding of fundamental arithmetic. If "answering very well based on pattern" was not a good strategy, or was necessitating infinite weights, or was making it impossible to use these patterns in novel situation, how do you explain that human can even do that themselves? As soon as we admit that humans do "some pattern some times", than we have to admit that there is a continuous spectrum and admit that it allows output that looks realistic being the result of pattern rather than understanding.
By the way, I just saw a new article reaching HN: https://news.ycombinator.com/item?id=48410427 , and it is indeed explaining similar things, and illustrates that the best way for SOTA to deal with arithmetic is by "not understanding it". And yet, when you use one of those SOTA, you would be able to argue each one of your "REASON" to pretend that the model did understood arithmetic.
Comment by Nevermark 2 days ago
I just started out with mapping to be systematic. Mapping is ground zero, then interpolation i.e. any smooth fitting function or basis, then combinatorial where different bases are recognized and then project relative to their relevance to a new input.
Each of those increase modeling efficiency and power, but even combinatorial doesn't scale to problems like language.
I may be doing a poor job communicating. A formal breakdown of the scaling issues with lower order, but scaled to make up for it, modeling would be a great paper.
To prove me wrong (as a thought experiment), choose a lower order model, any kind you can imagine that would qualify as modeling without understanding. Demonstrate it can do anything close. That it could possibly scale to the human corpus with just a trillion parameters.
If it the number of parameters goes up far too fast, then that can't be the way deep learning solves the problem with a trillion, or a few billion, either.
And consider the other side. We have no idea how our own brains are lifting up what is relevant vs. what is not. We are used to it happening. We call it "understanding". But we don't know how it works, how we work. Despite experiencing it.
What we do know, because combinatorial is too resource intensive, is we are not just combinatorial either.
Comment by cauch 2 days ago
The way a LLM works is by creating a space of N dimensions, N being the number of token. This space contains all the possible combinations. The LLM will find the best combination, but will not scan the whole space. To find the best combination, it will minimize the loss function, which is low when the output corresponds to the target. By doing so, it will not explore the combination that "goes in the wrong direction", and therefore it is not true to say that increasing the space as a scale S corresponds to increasing the difficulty of running the model by a scale S.
Because of that, while the combination space scales like combinatorial, the model does not. A model with 2 weights (or rather tokens, but the number of weights should be at least the number of tokens) corresponds to 4 combinations (AA, AB, BA, BB can indeed be described by 2 binary weights of value "A" or "B"). A model with 3 weights corresponds to 9 combinations. A model with 4 weights corresponds to 16 combinations. ... A model with N weights corresponds to N to the power N combinations. The number of combination increases a lot, and yet the number of weights increase linearly.
In SOTA, we have billions of weights. That is a model that contains a very very very very big number of combinations, something so big that it is difficult to understand for a human. It will not try all of these combination one by one, the gradient descend method will help it finding the best combination without having to do so.
So, yes, SOTA are finding "the best combination" amongst an impressively huge number of combinations, yet without having to "scale like combinatorial".
> To prove me wrong (as a thought experiment), choose a lower order model, any kind you can imagine that would qualify as modeling without understanding. Demonstrate it can do anything close. That it could possibly scale to the human corpus with just a trillion parameters.
Yes. Easy. A SOTA LLM does that. It is a modeling without understanding. It does not understand, it finds the best patterns. And when you put it in a new situation, it uses these patterns to create a new text, without truly understanding the content of the text. And if you ask an additional question, it will use the previous text as context, and create a new text that, as it has been trained to, will be consistent with the output that has been given.
Your assertion "you can prove me wrong" is a circular reasoning: you start saying "if a model can do a text that looks realistic to me, then it means it has understanding. To prove me wrong, give me a text that looks realistic to me and has no understanding". Well, I cannot do that, because for you, if it looks realistic, it has to have understanding.
> If it the number of parameters goes up far too fast, then that can't be the way deep learning solves the problem with a trillion, or a few billion, either.
The combination space grows as N to the power N. So, a trillion parameters is not "just 1000 times bigger" than a billion parameters, but more than 1000 to the power of one billion bigger (the exact value is often even bigger than that). Do you realise the size of the combination space? That is 1 followed by 3 times one billion zeroes.
> What we do know, because combinatorial is too resource intensive, is we are not just combinatorial either.
I think you don't understand how LLM works: the find the best combinations in a incredibly huge parameter space, but don't need to explore the whole space, just the 1-dimension manifold that is the curve that follow the gradient descend within this huge combination space.
There are plenty of clues that SOTA don't "understand". For example, did you notice that SOTA happens to understand what human understand, and don't understand what human don't understand. If indeed the way SOTA works would be by "discovering the true mechanism", it means that it would discover with equal probability mechanisms that humans have already noticed and mechanisms that humans have not already noticed yet. For example, humans know that the Standard Model of particle physics is incomplete, and there are plenty of texts and books about that that the SOTA learnt about. Yet, SOTA did not "understood" the underlying mechanism that explain particle physics. It does not really know what an electron is by "making sense of what this object does", it only knows it as "a language word that can be used in some context in a specific way".
And, sure, SOTA is helping with new discoveries, but the way it does it is by using "reasoning" approach. If indeed SOTA creates its own understanding when learning the human language, then it should have the new discovery after the learning, without using any "reasoning" approach, because it would be something that it has already understood.
Comment by Nevermark 2 days ago
Yes, if it consistently produces good output for highly varied stimuli that can be intentionally picked to have been unlikely to ever had obvious representation in the training set, then yes it understands.
I think we are talking past each other a bit.
A series of increasingly challenging datasets, used to capture scaling efficiencies, would ground our discussion.
But the level of performance for models is simply too good vs. the number of parameters to be doing anything trivial.
Deep learning models do something combinatorial models do not. The linear tensor + non-linear transforms do two special things:
1. The tensor itself just projects a linear space into higher dimensions, but its still the same information space. Project a 2D surface into higher dimensions linearly, and there can be more parameters, but it is not more information, since there is an expansion of linear dependence to match.
2a. But then the nonlinear both (a) thresholds, squashes or otherwise alters the linear results, in a way that removes linear dependencies, increasing the useful dimensionality of the representation.
2b. And the squashing also allows dimensions to be folded down.
So by both expanding and flattening representational dimensions, deep learning models are able to model higher-order relationship directly, that any less expressive modeling would require cobbling together many patches of fitting.
Another way to put this, is deep learning models are able to learn higher-order relationships directly, not be memorizing and interpolating across learned points or regions.
So a dramatically greater ability to "understand" is why deep learning models are so much better. They are not doing simple combinatorial fitting.
"Understanding" or not, combinatorial relationships are the low bar for deep learning models, they are inherently great a learning much higher-order relationships.
I am falling asleep at this point. I feel like we need a blackboard and a computer. You are saying a lot of things that make me think, and make sense to me.
Comment by cauch 1 day ago
You keep saying "what I observe with GenAI can only be the result of 'understanding'" without providing any proofs at all. Just few beliefs.
You just say "look at this behavior, that's the proof". I truly don't think it is: nothing proves that this behavior requires 'understanding'. And nothing you provided helps: all you provided are impressive behaviors and then the unsubstantiated conclusions "and this behavior can only be done with understanding".
At the same time, there are too much clues showing that such behavior does not require understanding, even if it _looks_ incredibly clever:
1. GenAI does not understand (after the training phase) things that humans don't understand. If GenAI had the capacity of building an understanding during training, then there is no reason this understand will coincide with human understanding.
2. Optimisation does not always lead to "understanding". Human brains choose to optimise "learning multiplication table by heart" rather than building a pocket calculator inside the neurons.
3. Human brains, that have "understanding", are working fundamentally differently from GenAI (flow of thoughts, intrinsically intertwined memory and compute, optimised for world-model treatment rather than token treatment, ...). It is an unsubstantiated jump to simply conclude AI has "understanding", while it can be the result of fundamental differences.
4. "Basic" LLM are surprisingly good at creating convincing sentence and yet there are situations where it is blatantly clear they did not understood anything. More advanced SOTA are based of refinement of "basic LLM", and therefore the "sentence construction that is done without understanding" is still used, and impair the SOTA model to build a full understanding.
> Another way to put this, is deep learning models are able to learn higher-order relationships directly, not be memorizing and interpolating across learned points or regions.
It's exactly what I'm saying: deep learning models are very good at learning complex relationships. Such as "I don't know what 'Paris' is, I don't have any understand of what a city is in reality, but when the token Paris is associated with these other tokens in this complex order, even if I never saw it before, I have learnt the complex relationships and therefore I'm able to build a series of token".
They are very good at learning complex relationship that allows them to choose the correct combination even if they did not "understand" the content of the correct combination.
I understand that it is impressive: those relationships are very complex and very numerous (there are billions of them). It is easier to do anthropomorphism and conclude that the AI has "understood".
But again, the main problem is that you just pretend, without any proof, "no, I cannot believe that, I refuse to believe that".
(and, by the way, I personally think that AI (SOTA but also even "basic LLM") do have 'rules' that correspond to some kind of understanding of basic mechanism. I think they have basic "world models". But these world models are optimised "to write text" rather than to "understand the world", and therefore the large majority of AI output is just not-understood token chains)
Comment by Nevermark 1 day ago
1. Define understanding.
My definition isn't vague: "a compact representation enabled because that representation's topology closely matches the topology of the relationships being modeling."
Understanding = Scope and Suitability of Behavior / # Parameters.
Useful property: This definition applies across all scales: Scientists and mathematicians increase our understanding, every time patchworks of relationships get replaced with a simpler underlying insight.
Another useful property: It distinguishes between better understanding and having more facts. Facts improve performance but do not (non-trivially) decrease parameters.
What is your definition? In measurable terms?
2. You keep avoiding a basic aspect of modeling:
Higher compactness is achieved by higher representation correspondence between a model and the modeled.
Yes, lower level representations can work. Even well, without good "understanding". But not as compactly. And as problem complexity grows, the relative difference in parameter budgets for high-correspondence and low-correspondence representations explode.
This is not a subtle effect.
The hallmark of lower-level fitting is the far greater number of parameters required.
Dead simple example: Piece-wise linear vs. polynomial fitting of Bezier curves. Accuracy / parameter is far greater for the latter, because the representation matches the relationships being modeled.
That is an intentionally trivial example, but the same relationship holds for any problem.
You keep avoiding that.
3. Today's LLM models are very compact compared to humans.
Compressing the substance of a corpus of global human writing into less than 1% of a single human's parameter space is compact.
Humans have 100–200 trillion, some people think 500 trillion, synapses.
How do you argue that behavior scope and suitability / parameters is not remarkable, when it is remarkable compared to any specific human you could point to?
No human can converse reasonably across the scope of global communication. But these models can. For <1% of a human's parameter budget.
4. Finally, based on your clear definition, how do you argue that humans understand but models do not? Saying we are different is a copout. Defining understanding as us vs. other is both circular and unenlightening. And ignores the real progress models are clearly making relative to humans.
Comment by Nevermark 1 day ago
Comment by cauch 22 hours ago
My point is that you can have the same result with a representation that "closely matches the topology of the relationships being modelled". For example, a representation that "allows relationships between tokens but yet does not care about the meaning or concept not useful to form convincing sentences".
And therefore, it means that you can have convincing text without needing a "representation's topology closely matching the topology of the relationships being modelled", and therefore, according to your own definition: no understanding.
2. It is not true I'm avoiding that. I have answered very clearly.
1) GenAI are not trained to get the higher representation of the world, but to get the best convincing sentence generation. This does not require a full world understanding. Worse, once a convincing sentence generation is reached, there is no gain by getting a better world understanding: the training mechanism that pushes into the correct direction stops and therefore it can go into any direction at all.
2) High compactness does not equal best solution. Even humans don't used "high compactness" when doing basic arithmetic, but use "by heart multiplication table". Being compact is useless if it comes with high complexity each time you need to recompute the output.
3) Very very good approximation can reach higher compactness anyway. Your Bezier curves is a good example: real physical phenomenons are almost never the result of a Bezier equation. A Bezier curve did not understood the phenomenon. When it comes to GenAI, it can "fit" the reality with very close precision with several representations, but the majority of the representation corresponds to an incorrect "understanding" of the reality.
Another example: if I throw a ball in the air, the motion will be at first order a quadratic equation, plus correction due to friction, wind, ... If I just "train" something for "throw a ball", this system may fit a quadratic function plus corrections, but they will achieve the same result with Bezier curves, or Fourier series, or additive Gaussian, or ... But the "understanding" is that the ball is influenced by gravity, which leads to a quadratic equation. The system does not understand that. It has no reason to understand that. And it has no reason to prefer a quadratic equation fit rather than a Bezier fit, on the contrary, the Bezier fit will be more realistic (as the quadratic equation is just the first order approximation).
If you want to understand a paper plane trajectory, it is a complex system, and you probably need plenty of parameters to describe the gravity, the wind at each position and each time, the shape of the plane at each time, ... But you can describe the trajectory with just few parameters using a Bezier curve. Train on plenty of paper plane trajectories, and you will have a system that can give you a very realistic paper plane trajectory based on Bezier curve. And yet, your system has no understanding of the paper plane trajectory: it does not know what are the mechanisms that make the paper plane goes up or down. It just creates a realistic trajectory without knowing why this trajectory is realistic, just that this trajectory makes sense based on the other trajectories it has seen.
3. This argument seems to go against your thesis. You are saying that humans, who "understand" + are not even able to have as much conversation as LLM, have way too much neurons. What are these neurons even for then? You are explaining that LLM are just "something different", a reduced mini-version of a brain, and yet you are also saying that they are able to do the complex things the brain do.
Another way of seeing it, is that LLM are "dropping" things that they don't need to create convincing sentences, such as "understanding the token". They just "get the Bezier curve fit of the relationship" instead of understanding the real mechanisms and concepts.
It's like your Bezier curve example: a system that just creates a realistic paper plane trajectory based on "typical Bezier curve observed during training" will need way less "neurons" than a system that needs to understand the whole aerodynamism of the paper plane.
4. I argue this the same way I say that a system that describe a paper plane trajectory based on best Bezier curves did not understood the mechanism behind how a paper plane trajectory works. I am not saying "I define 'understanding' as what humans do", I am saying that creating convincing sentences does not require understanding, the same way that generating realistic paper plane trajectories does not require understand gravity, Navier-Stokes equations and Brownian motions.
The Bezier curve paper plane trajectory predictor system I have mention, do you think it has understanding of gravity? of Navier-Sotkes? of Brownian motions?
No, it has not. You can open this system. It just has Bezier curve for plenty of examples, and thanks to that, it knows that one trajectory is realistic and another is unrealistic. And at some point, it is also able to give realistic trajectories in brand new situations it has never trained on.
Comment by Nevermark 14 hours ago
Understanding = Novel Scope * Suitability / Parameter Count.
> My point is that you can have the same result with a representation that "closely matches the topology of the relationships being modelled". For example, a representation that "allows relationships between tokens but yet does not care about the meaning or concept not useful to form convincing sentences".
You are absolutely right, that lack of internal representation-reality correspondence does not rule out real/convincing performance.
> GenAI are not trained to get the higher representation of the world, but to get the best convincing sentence generation.
This is true of all learning. And it will always be the nature of learning.
Which is why performance is always (should be) measured on novel input.
> High compactness does not equal best solution. Even humans don't used "high compactness" when doing basic arithmetic, but use "by heart multiplication table".
This is a really good point!
It brings up the two useful modes of human representation:
(1) The brain's slow mode is very good at handling deeper and deeper layers of representation. When thinking about arithmetic or more complex math analytically, our understanding does follow a path of increasingly deeper representations. And we are very good at applying these deeper understandings.
(2) Then, our fast mode creates shallow representations of things we do frequently.
I would look at this as (1) reflecting scalable understanding (2) reflecting very limited understanding, but scalable speed.
And we often use both modes together.
I would argue that the understanding is primarily in the slow mode. That the fast mode, is the non-understanding but appropriate response mode. And that it operates with a much reduced scope of appropriate response, but a high percentage of applicability. Meaning, most of the time we don't need to use deep understanding we just need fast appropriate response.
But how to compare the two in scopes where they are equally accurate?
I think "high understanding" representations are those very flexible to being used in ways quite different from how they were learned.
Our slow mode does this very well. Our fast mode not so well, but to the degree it generalizes well to novel situations, that would be an increase in understanding.
Our fast system does generalize, but I would argue that at some point it fails, where our slower deeper representations provide the means of analyzing a situation. So it clearly "understands" better.
It is interesting how quickly understanding from our analytical side translates into operation on our fast side. Clearly, our fast side has very efficient access to new "patterns" that our slow side constructs.
> If you want to understand a paper plane trajectory, it is a complex system, and you probably need plenty of parameters to describe the gravity, the wind at each position and each time, the shape of the plane at each time, ... But you can describe the trajectory with just few parameters using a Bezier curve.
I love this example. It does contrast very different kinds of understanding.
(1) Understanding the fundamental reality in which paper planes exist,
(2) Vs. understanding how paper planes behave.
I think my expression works well here, as long as we take "scope" seriously.
Understanding = Novel Scope * Suitability / Parameter Count.
For paper planes as a hobby, a smaller neuron/parameter budget is achieved by learning the emergent laws of paper planes, not their underlying physics. And understanding paper planes is achieved with this smaller budget.
For understanding paper plane dynamics at a design level, a smaller neuron/parameter budget is achieved by learning the underlying physics of aerodynamics at an intuitive level.
For understanding paper plane dynamics at a world class competition level, a smaller neuron/parameter budget is achieved by learning the underlying physics of aerodynamics at an analytical level.
So these would be three different "understandings", each with their own scope and area of appropriate response to novel situations.
Point taken: The most fundamental correspondence isn't the point of a lot of understanding.
You are right, and my equation works, as long "scope" is interpreted to mean appropriate level of interest, not area of fundamental physics involved. Great point.
Does that get us on the same page? Closer?
> I am saying that creating convincing sentences does not require understanding
As problem complexity goes up, there really is an explosive difference between appropriate response via "familiarity" or lower-level fit, vs. higher level fit, for the same number of parameters.
And it is also a dramatically bigger challenge for lower-level fits to respond well to novel stimuli, given the same number of parameters.
The reason is, is that complex problems operate in higher dimensional spaces, and relationships in higher dimensional spaces have exponentially more complexity for any level of representation. Exponentially.
Linear fits of a 2D bezier are inefficient but work. Linear fits for a 100 dimensional bezier, which isn't very many dimensions from a data standpoint, become ludicrously expensive in parameters.
The dimensionality of human communication is probably the most complex problem ever tackled systematically.
I am trying to think of a way to capture this more concretely. I.e. a way to draw a line in this conversation that stands up on its own. All I can point to, is the complete failure of any lower-level fit when done directly, to acheive a trillionth of a trillionth of trillions of the flexibility that SOTA models demonstrate. The extreme dimensionality of input that LLMs respond to, makes my "trillionths" literal in this case. And we do get a concrete measure of the dimensionality within their capacity, as context windows give us live demonstrations of this.
Note that language is literally highly compressed information, with pervasive non-local interactions. The enormous dimensionality is compounded by dense reactivity, pervasive discontinuities. No other informational artifact compares to language complexity.
When I say that this is a case where either real relationships are learned or the model fails, it is because the number of parameters for a lower-level fit really are beyond imagining.
You can't point to any lower-level fit, where the lower-level fit is basic to the fitting algorithm, that ever achieved even a tiny-grammar tiny-subject-scope toy of a toy version, to what LLMs are doing. Nor can I, despite following progress for decades. Nobody can. The original successes of the first LLMs, modest as they appear now, were completely unprecedented.
There just are not enough parameters, by many orders of magnitude, to do language justice over a context window, and respond sensibly to intentionally novel conversations, without identifying the actual relationships behind it.
So that would be my challenge to you. To identify any verifiable lower-level fit that even approximates LLM behavior at the tiniest of toy levels. Verifiable fits at any given level are easy to do, just train a model where the basis is restricted to that kind of fit.
Otherwise, I can agree that understanding is a continuous property, and that how well something understands something, without strict benchmarking by well thought out benchmarks, involves intuition and judgement. So there can be legitimate differences in how we perceive model understanding, in the absence of direct measures.
Any more thoughts? I have understood both myself and your points better as we went along.
Comment by cauch 9 hours ago
For example, you just moved the definition problem to "novel". Do you even realise that?
You are claiming that there is an understanding because the model is able to do something in a novel situation where only deeper understanding of the situation will allow it to perform that well. The big problem is that you have no idea if this situation is "naturally easy to reach" or not.
For example, a system that is fitted on the electrostatics Coulomb's law will build, internally, a set of equations to generate realistic predictions. And then you take this system and put it in a totally novel situation: classical gravitational problems. Well, this system will be able to generate realistic predictions there too, because Newton's law works with equations that have the same form.
When you are discussing with a LLM in a "novel" subject, how do you know the LLM cannot directly use the fit and complex equations that it has created for the "non-novel" situations it has been trained on? For example, the LLM has been trained on "Pride and Prejudice and Zombies" and tons of other mash-ups. Even if asking a story of Keanu Reeves and the Supreme Court looks "novel" to you, it does not mean the generated text was not in fact super easy to generate based on the patterns that the LLM has seen in tons of examples.
Honestly, this whole conversation just convinced me that too many people who claim "GenAI does understand" are way above their head on the subject. If you want to continue talking, just talk to a LLM. Plenty of people have done so and convinced themselves they were geniuses when in fact they were not at all. Yet another example that LLM has no understanding, as it is very very often failing to distinguish between correct ideas and "things that look correct but that someone with real understanding will not encourage".
Comment by missingrib 6 days ago
You are basically arguing for a functional account of consciousness, but things like this have been debated for literally decades/centuries in philosophy.
Comment by rerdavies 6 days ago
The problem with the hallucination argument is (1) that is much less of a problem with good current generation AIs, and (2) living conscious breathing human beings also have a disturbing tendency to make shit up, too. So a tendency to make stuff up doesn't really serve as a disqualifier for consciousness.
Also worth mentioning that the guiding rule of what's philosophical or not is whether it's actually useful. Actually useful philosophy usually becomes something else. Usual some scientific discipline or another. And as it turns out, theories of mind are likely to become extremely useful in the near future. Expect huge advances!
Comment by cauch 5 days ago
1) Good current generation AIs are specifically trained to reduce hallucinations. If we had new AI system that happened to not have hallucinations as a side effect of their training, then it would be convincing. But here, it looks like we have built a pocket calculator that answer 7+13 = 14, and on top of it, we added a layer that says "if the input is 7+13, then replace the output by 20". This pocket calculator still does not know how to calculate, we just added a layer to hide its mistakes.
2) Not only "make shit up" is not the same as "hallucination" (either "making shit it" is done when the individual knows it is unreliable, or when the individual was given wrong inputs), but the point is not to say "hallucination implies no consciousness", but "large quantities of hallucinations in situations where a conscious system would be unlikely to hallucinate implies no consciousness"
Comment by rerdavies 4 days ago
Try Claude, which can.
Comment by cauch 4 days ago
First, the "13+7" is an analogy. In this analogy, "13+7" is not the real question you ask, it represents _any questions_, not just arithmetic.
But secondly, did you even noticed that in my example, the system answer CORRECTLY "13+7"? So, in my example, the thing I'm talking about and I argue does not "understand" is Claude, even if it is able to answer correctly.
My point is: the "basic LLM" part is creating a mechanism that answer without understanding (as demonstrated for example by ChatGPT failing arithmetic), and the fine-tuning or the harness is just hiding the lack of understanding by adding ad-hoc correction on the residuals. And because it is on the residuals, it looses the logical links (13+7 -> 20 is "logical", it corresponds to the math logic, it corresponds to what you get when you add 13 stones and 7 stones together. The residual is "14 -> 20", which has no meaning in itself)
The ad-hoc correction is either: 1. by training the model so it learns by heart, without understanding, that the symbols "13+7" should lead to "20", 2. or by training the model to use a pocket calculator without understanding arithmetic so it can do it itself.
You can prove that the model does not understand it very simply. Let's take the normal fine-tuned model M1. Now, let's go back to the pre-tuned version, and fine-tune it so it answer "21" to the question "13+7", and use an harness that does "sum(x, y): return x+y+1". This is model M2. M2 will fail to answer "13+7" correctly, it will say "21". And yet, M2 has been trained exactly the same way M1 was. If it is true that the additional tuning "add understanding", M2 will not be possible, it will say "error, error, do not compute, you try to train me to say that 13+7 is 21, but it does not make logical sense to me". But it does not happen: the pre-tuned model has no idea that 13+7=20 is more logical than 13+7=21, and the additional tuning is just helping him returning a more correct answer while still having no idea where this answer comes from.
Comment by VirusNewbie 5 days ago
falsify this. Show me a way you'd be able to prove they do/don't, that would work for humans.
Comment by bondarchuk 5 days ago
Comment by miyoji 5 days ago
Comment by bondarchuk 5 days ago
Comment by miyoji 5 days ago
Comment by bondarchuk 5 days ago
Yes there is, the systems reply is the obvious and correct answer. Philosophers that disagree are simply wrong. In the end what matters is what's true or false, not how many philosophers accept something. You can check for yourself by reading the argument, following its reasoning, and seeing that it is false; and reading the systems reply, following its reasoning, and seeing that it's true (https://plato.stanford.edu/entries/chinese-room/#SystRepl). The case is similar to those mathematical or logical proofs for the existence of god, where obviously fallacious reasoning gets a pass because it confirms deeply held beliefs.
edit: by the way as to your assertion that the argument is controversial and there is no consensus, I just found something funny on wikipedia (https://en.wikipedia.org/wiki/Chinese_room#History):
>Most of the discussion consists of attempts to refute it. "The overwhelming majority", notes Behavioral and Brain Sciences editor Stevan Harnad,[f] "still think that the Chinese Room Argument is dead wrong".[13] The sheer volume of the literature that has grown up around it inspired Pat Hayes to comment that the field of cognitive science ought to be redefined as "the ongoing research program of showing Searle's Chinese Room Argument to be false".[14]
Comment by cauch 5 days ago
I think it is an important nuance.
You have to be careful when saying "Searle Chinese room" is dead wrong: the Chinese room concept in itself is useful and not controversial, and it is possible that current LLM are "Chinese rooms", and therefore not 'intelligent'.
Comment by bondarchuk 5 days ago
(And you still seem to be implicitly accepting that the basic argument is valid, which would be wrong.)
Comment by miyoji 5 days ago
You are being tedious. I obviously have done this and I disagree with you. Saying that X is logically true and Y is logically false is not a demonstration of those baseless assertions. This is not helpful, what you're saying isn't true, and what I'm saying is backed up by the wikipedia article. The bit you quote is simply stating that most literature about the Chinese Room is an attempt to refute it, which is obvious, because the people who are convinced see no need to publish saying so. The fact that people keep publishing means that they have not yet succeeded in refuting it.
Or I can simply say this: you've made a mistake in your logic. Actually, the Chinese Room argument is correct. Since you won't explicate your logic, neither will I.
Have a good day.
Comment by bondarchuk 5 days ago
Comment by datsci_est_2015 6 days ago
I am far from convinced that the training and inference regimes of LLMs would qualify as “experience” by any sense of the word.
Now, if we hooked up a plethora of audiovisual and tactile sensors with live feedback directly to a neural network rich with transformers, that was always powered on and fully autonomous, we may be getting there. But we’d probably also be on the verge of manmade horrors beyond our comprehension.
Biological rodent neural networks in a Petri dish stimulated by electrical impulses - more or less conscious than LLMs?
Human on life support, unable to respond to any external stimuli, “braindead” - more or less conscious than LLMs?
Comment by rerdavies 6 days ago
And, yes, concerns about whether biological rodent neural networks are or are not conscious come up frequently in the biological neural network papers. I'm not sure I would want to be a researcher trying to get an experiment past an ethics committee if my biological neural network had 25B rat neurons. (I would hope that they could not).
Comment by Terr_ 6 days ago
Similar to: "Birds fly, my spinning helical device flies, therefore we've started to replicate how birds fly."
> without having to build elements that one expect on a conscious being
One of the elements I expect in a conscious being is that you can't rewrite it by changing the introductory paragraph.
When it comes to LLMs, almost every "mind" we humans perceive is a fictional character in an LLM-generated story-document, one we are either reading or which is being "acted" at us by regular code. Our own instinct for pareidolia and simulating/inferring other minds is very strong, which means we should require really good evidence/logic to counter our instincts.
Even if one believes the LLM has a single "real mind" as an author of every document... what evidence do we have that it is conscious or "self-inserting" itself as one of the characters in the document?
Comment by famouswaffles 6 days ago
If we had enough knowledge of the workings of the human brain, you could alter the perception of every single memory you've ever had. And limited versions of this already happen all the time. Human memory is notoriously unreliable for a reason.
Are you aware of the Recovered Memory Therapy Scandals of the 80s/90s ? Boy did that ruin a lot of lives. You can rewrite a human by changing their 'introductory paragraph'. It's just not as accessible.
Comment by NateEag 5 days ago
Knowing how something works is not the same as having the tools to change it.
Discovering memories are incorrect does not massively change who we are. As someone with a very defective memory, I discover on an hourly basis that I'm won't about something I thought was true, but there's still continuity and consistency to my personality and general approach to life.
...in fact, as someone who was raised an evangelical Christian and believed wholeheartedly without a shadow of doubt, then lost my faith entirely in my late thirties, I sort of did have my "introductory paragraph" changed, yet my wife, children, and friends would all say I'm still me, and that my core personality and nature remains largely the same.
> Are you aware of the Recovered Memory Therapy Scandals of the 80s/90s ? Boy did that ruin a lot of lives. You can rewrite a human by changing their 'introductory paragraph'. It's just not as accessible.
The recovered memory scandals are not even close to evidence that you can rewrite a human.
The people who thought they had learned new facts about themselves did not suddenly lose their context as humans in 20th century America.
They did not suddenly lose their sense of humor, or develop a previously-unseen penchant for murdering small children.
They experienced a revision of belief, and a pretty major one that really distressed them, but it did not change everything about them.
LLMs _do_ manifest wildly differently based on the first paragraph.
Comment by famouswaffles 5 days ago
Comment by hackinthebochs 6 days ago
There is no independent "consciousness mechanism" that one might imagine humans have learned or evolved for its own sake. Evolution learns various solutions to optimization problems, and so if consciousness evolved then it was either useful instrumentally, or it is a byproduct of some organization that is useful instrumentally. The point is that as a solution to certain kinds of optimization problems, consciousness can conceivably be the solution to the optimization problem of predicting the next token of text written by humans who themselves have complex phenomenology. There is nothing that a priori constrains token prediction from the domain of consciousness.
>For me, one element that shows it is the case is the absence of world model (or "human-like" world model) despite the fact that the sentence continuation is convincing
World models don't have to be rich and detailed to count as a world model. Lower life forms might be conscious but they only model the part of the world useful for their existence in their ecological niche.
Comment by cauch 5 days ago
Yes, I agree with that. Consciousness is a good way of generating convincing human text.
What I don't agree with is that consciousness is the only way to generate convincing human text and that because we have convincing human text, it can only imply we have consciousness.
There is a huge probability that generating convincing human text can be done without consciousness. Either because there are efficient mechanisms as efficient as the way the human brain deal with this problem and that the LLM found one of them (and these mechanism may be quite difficult to imagine for a human). Or even because the LLM found a local minimum and is stuck there.
To re-use the evolution approach: evolution solved the "flying problem" with bird feathers, but also with insect wings or bat wings. The fact that evolution ended up using feather does not imply that everything that flies can only fly with feathers.
> World models don't have to be rich and detailed to count as a world model
I agree in general, but here, we are talking about machine that reproduce all human language. The argument I'm answering to is pretending that "all of human knowledge" is understood, which include every single human concept. This has to be everything, because LLM is able to provide convincing text about every subject. If on some subject, the LLM is able to provide convincing text without "understanding" it, then the argument that it is impossible to provide convincing text without understanding it collapse.
Comment by trescenzi 6 days ago
> There is nothing that a priori constrains token prediction from the domain of consciousness.
We don’t know either of these are true or false though. We simply don’t know. There is no agreed upon definition of consciousness, aside from maybe _the having of qualia_, so arguing that some can or cannot be conscious a priori can’t be done.
Comment by hackinthebochs 6 days ago
No one genuinely engaged with the topic is confused about the target of the term (phenomenal) consciousness. Definitions come once the theoretical work is complete, to be articulated as part of a fully worked out theory. The lack of a definition doesn't prevent us from investigating the subject or offering conjectures. What we can do is offer a precise description of the target and argue for or against whether LLMs reach the description. We will of course debate whether the offered description captures the relevant phenomena. But this is all just part of the process.
Comment by int_19h 6 days ago
Try to define qualia though without explicitly or implicitly recursing into consciousness.
It's all a large house of cards that's built on handwaving and "I know it when I see it".
Comment by dnautics 6 days ago
sometimes humans making claims about AI intelligence or consciousness also identify spurious patterns that do not correspond to the problems of intelligence or hard consciousness.
Comment by Terr_ 6 days ago
That reminds me of a niche paper [0] critiquing a certain way of teaching remedial math that was over-focused on tests. A kid named Benny (12) was building up (wrong) "rules" for math which still somehow gave enough of an illusion of progress in terms of test scores that his misunderstandings hadn't been caught earlier.
> Benny was able to explain his procedure; e.g. for 5/10=1.5, he said: "The one stands for 10; the decimal; then there’s 5... shows how many ones." In another example, 400/400 = 8.00 because "The numbers are the same [number of digits]... say like 4000 over 5000. All you do is add them up; put the answer down; then put your decimal in the right place... in front of the [last] three numbers."
Comment by cauch 5 days ago
CNN are finding patterns, sometimes relevant, sometimes spurious, but I don't think people argue that CNN have evolved consciousness or understanding of what a cat or a dog is.
Here, the argument is "LLM are able to understand, because 'understanding' is the only pattern to reach the goal". I'm saying that it is unlikely to be the only pattern, and that it is likely that they find a local minimum on a system that reaches the goal that does not use 'understanding'.
The reason I'm saying it is likely is because "basic" LLM shows behaviours where they are producing convincing human text and yet doing things that are really difficult to reconciliate with the fact that they have understanding.
(And before that old argument is used, yes, I know sometimes some humans fail to understand. The problem is that the majority of humans don't fail to understand basic stuff in the majority of the time, while the "basic" LLMs do. The fact that you roll 10 dices 100 times and 1 of them never land on 1 does not convince me that that set of dice is loaded. The fact that you roll 10 dices 100 times and 9 of them never land on 1 does convince me that that set of dice is loaded.)
Comment by saimiam 6 days ago
Comment by overgard 6 days ago
If the tokens didn't correlate to words imbued with meaning outside the system, if the LLMs were trained on patterned data that had no meaning to humans or something there wouldn't be any conversation about these things being conscious at all.
Comment by tsukikage 6 days ago
Turing complete systems can be built out of matrix multiplications, out of attention, out of key/value lookups. The Chinese room is Turing complete. By claiming it cannot understand things because it is built out of components computing devices can be built out of, we are claiming no computer can because no computer can. This is a very bold claim indeed, and also we’re assuming the conclusion! The claim is no more convincing than “brains cannot understand things because they are made out of neurons”. The system may or may not have some particular properties, but we have to do more work than just gesturing at the components the system is made of when making claims about it; the alternative is, at best, a world where we prove too much and conclude that humans, too, are not conscious.
For starters, we need to pin down the terms under discussion enough that they don’t just mean whatever we need them to in the moment.
Comment by flashman 5 days ago
A very convincing simulation of thought is not thought, especially without memory or any evidence of an independent will. LLMs have the same kind of consciousness as a nematode, albeit with less-complex behaviour.
Comment by 1shooner 5 days ago
Exactly.
Comment by whattheheckheck 4 days ago
So spend your time wisely
Comment by ainch 6 days ago
As Ilya Sutskever has pointed out, if you read a mystery novel up until the reveal of the culprit, and then fill in "The killer was _____", don't you need to understand the novel to accurately predict the next word?
Comment by whattheheckheck 4 days ago
Comment by missingrib 6 days ago
Comment by feoren 5 days ago
Language is tremendously complicated. "Time flies like an arrow, but fruit flies like a banana." "Hard hats must be worn on site; dogs must be carried on escalators", etc. Predicting the next token requires understanding, full stop.
> if the rules are followed, no understanding is neccessary.
The rules are the understanding.
(Note that understanding != consciousness)
Comment by overgard 4 days ago
Comment by fnordpiglet 6 days ago
The article also makes this assertion that it replays everything over and over again to create each character one at a time as some way to demonstrate the autoregressive self attention mechanism but it’s really not accurate at all, and it trivializes what is going on.
I’m am not asserting LLMs are aware or conscious that’s on the surface profoundly absurd. And I do understand your point that the fact it emits in words something that seems to speak to us gives to the air of humanity that’s isnt real. However there is a very real emergent reality that our language alone appears to lead to embedding a form of thought and understanding that is latent in our use of language in communicating that is in fact coming through the model. It is not regurgitating its corpus and pattern matching because the patterns you input and it emits are not where the inference is operating, its within this enormous vector space through these complex non linear activation functions with learned residuals not in the language corpus.
It is not conscious or aware. It is something else, not human. But if you can not see it as amazing you have lost the capacity to dream.
Comment by miyoji 5 days ago
I completely disagree. I think if you think these things are amazing, your dreams are incredibly limited and boring.
I remember the first time I talked to a chatbot. Not an LLM, just a regular chatbot, like ELIZA or any other dumb bot.
For a few seconds, it felt magical, like I was talking to a computer that understood me, as it made replies that were sensible to what I was saying. Then it said something incredibly stupid and jarring that made no sense, and that took the magic away. Oh, this is just a dumb computer program.
I remember the first time I talked to an LLM-powered chatbot. It was the exact same thing, except the magic feeling lasted a tiny little bit longer and was a tiny little more convincing. But it went away in the exact same way, for the exact same reason. Once you've seen the emperor without clothes, nothing brings back the magic.
Comment by fnordpiglet 3 days ago
If your dreams begin and end with making a box with a person inside that’s the point I made. You can make a person in a box in 9 months, that’s not interesting or cool. What we can build with an LLM and genai in general is MUCH MUCH cooler than a metal baby. (And I think a conscious mind we control and manufacture but don’t recognize the rights of is a scarier horror movie than all the AI feat movies ever made put together, fwiw, so I’m glad your dreams were broken)
Comment by DirkH 4 days ago
Comment by ryougi 5 days ago
Do you, or anyone reading, have any worthwhile links that make a strong case for this (that there is a stronger semantic relationship than simply next token prediction)? I would like to read more about this.
Comment by overgard 4 days ago
Comment by DangitBobby 6 days ago
Comment by rerdavies 6 days ago
You, of course, wouldn't notice if your only experience of LLMs was chatting with the cheapest, smallest, least capable LLMs that you get through ChatGPT, or Google search.
It becomes pretty obvious when you use a coding AI on a daily basis. It is the context buffer in which the magic occurs, not the tokens that get spit out one at a time.
Every day, I watch my coding AI develop plans, search the web a half dozen times for documentation, grep through my entire codebase looking for pieces of related code and context, analyze relevant source code across multiple files, spit out an initial plan for implementing the fix before starting to execute it, run requests through some sort of advanced mathematics tool (they are EXTREMELY good at graduate-level calculus and linear algebra), implement fixes that extend across half a dozen files in 2 different computer languages (typescript and C++), run trial compiles and fix coding errors in its output, sometimes developing sub-plans to deal with compile errors. I've seen it get halfway through a fix and revise its initial plan mid-flight as it encounters something in existing source code.
Not vibe coding, to be clear. Targeted use of a coding tool by a by a professional senior software developer with decades of experience, and fair bit of expertise with the limits of what sort of problems my coding AI can and cannot do. Every line code reviewed. Sometimes it needs additional prompts, telling it how it mis-implemented something, or specifying more carefully what I actually want but didn't properly express in the initial request
All the time maintaining that context across multiple request, so that I don't have to restate requests from scratch.
A particularly interesting revision: "You have misread the equation (13) on page 112 of 'Spice, the Manual 2nd ed.'. I should be ....". (It had previously identified the textbook as a source I was using, from comments in source, in a preceding request, and actually already read cited pages in the PDF file, which it had found online). And I had actually asked it to implement equation (13), which was, in fact, badly typeset. The error it had made was defensible, if not the best reading of the equation.
"You are correct. Let me fix that." (producing updates to the implementation of the equation in code, AND code that implements the symbolically-differentiated version of that equation 60 lines later, which is not explicitly given in the text). The text says "take the lagrangian of equations (11), (12) and (13)" or something like that.
ALL information that gets carried in context buffers, even though it's generating code one word at a time. The bulk of the magic occurs in context buffers, not spitting out words one at a time, which, for my coding AI is, I think about 250,000 tokens.
I think it's pretty safe to think that my coding AI is working out of context buffers that may carry plans and research results consisting of tens or hundreds of thousands of arranged tokens carried in context buffers through the multiple steps of the implementation, and later revision. None of that would be possible if were simply working one token ahead.
I kind of suspect that a lot of activity occurs in the first few words of its response. "Let me examine your current source code and develop a plan. Ok. I can see on line 131 where you want me to implement the equation.". (An opportunity to perform about 27 updates of the context buffer). And in the sometimes hundreds of lines of output it generates as it talks itself through what it needs to do.
Comment by whattheheckheck 4 days ago
See the Software and Mind: The Mechanistic Myth on why the industry is doomed to repeat the cycles.
We need a professional society thats not youtube podcasters hocking courses and sponsored content
Comment by hn_acc1 6 days ago
Sure, it's the best we have online, but that does not make "the internet" the sum of all human experience. To reduce all of humanity down to the text on the internet is reducing us to the level of machines to fit the requirement of what a machine can process / simulate.
Comment by BLKNSLVR 6 days ago
Comment by dahinds 6 days ago
Comment by dogwalker5000 6 days ago
I think the main complaint is LLMs don’t arrive at the answer the way we do. It’s capable of emulating some of our behavior but not all as the mechanism by which it works is very different.
Maybe I’m wrong about this but one thing humans do that LLMs don’t is deductive reasoning. LLMs seem to operate entirely of inductive reasoning.
Comment by Nevermark 6 days ago
This isn't an argument against their understanding things.
But I expect you are right, that their understanding may have major different qualities from ours.
Along with significant commonalities. (They don't reason via stream of consciousness in a way alien to us.)
Comment by mmustapic 5 days ago
Comment by krupan 6 days ago
But the machine doesn't have to understand humans to do that. It gets trained on a whole bunch of sentences and then it is able to complete text. You could maybe claim that it "understands" the text but even that's a stretch.
Comment by tsukikage 6 days ago
Through training on human text, we are building implicitly in the weights a statistical model of what humans might write in response when presented with arbitrary pieces of text. It turns out that we can make these incredibly accurate.
If building an accurate internal model of something then using it to predict that thing’s behaviour is different to gaining understanding of that thing, we will need to pin down exactly what “understanding” means, or we are forever doomed to talk at cross purposes.
Comment by krupan 5 days ago
And I will reassert that even if it "understands" the text it was trained on, that is not the same as understanding humans. I mean really, we ARE humans and we barely understand humans.
The thing LLMs model and "predict" is simply, what words in what order are statistically common given these input words in this order.
You can write (non-ai) software to model and predict things using the laws of physics. I'd wager it would do a better job than any LLM at predicting where a rocket will go through space. Does that mean the program is conscious and "understands" physics? No
Comment by hn_acc1 6 days ago
Comment by hackinthebochs 6 days ago
Tokens are the most basic input unit of an LLM. But tokens don't generally correspond to words or letters, rather sub-word sequences. So Strawberry might be broken up into two tokens 'straw' and 'berry'. It has trouble distinguishing features that are "sub-token" like specific letter sequences because it doesn't see letter sequences but just the token as a single atomic unit. 'Straw' and 'r' are two tokens but an LLM is entirely blind to the fact that 'straw' has one 'r' in it.
As an analogy, I might ask you to identify the relative activations of each of the three cone types on your retina as I present some solid color image to your eyes. But of course you can't do this, you simply do not have cognitive access to that information. Individual color experiences are your basic vision tokens.
The widespread mistake people keep making is assuming the development of intelligence in LLMs should follow the same trajectory that human intelligence takes as it develops into adult levels of intelligence. Thus deficiency in some capacity that we take for granted in humans is an indictment on LLM intelligence. But this is specious. LLMs are entirely alien; their developmental paths do not and should not look anything like ours. Your intuition from human intelligence just works against understanding the potential for intelligence in LLMs.
Comment by krapp 6 days ago
To be fair, almost everyone who claims LLMs are conscious tends to claim that they are conscious in exactly the way that humans are, to the point of stating that human brains are also just complex next-token prediction machines with a random seed. It's basically religious arguments on both sides.
Comment by Lerc 6 days ago
I have seen people say "you're a next token prediction machine" but only in a similar way one might say "you're a cup of old lard". Not actually meaning it literally.
I have seen people interpret the request to show that they are not next token prediction machines to be a claim that they are, but this is almost always an argument to show certainty is difficult in this area.
People like Hinton have declared that they believe them to be conscious, but clealy indicate that they do not mean just like us.
Comment by dpark 6 days ago
Comment by krapp 5 days ago
And it seem obvious to me that language behavior does differ significantly between humans and LLMs based on the frequency and nature of failure states. LLMs routinely hallucinate, or get "AI strokes" or get obsessed about not talking about goblins, etc. This isn't typical language behavior for humans unless they have severe neurological or psychological impairment.
People tend not to "spew words out without thinking" and certainly not all the time by default - we call that glossolalia and (outside of some fringe Christian sects) it's considered a "bug" not a "feature" of the human brain. Human language by default always has intent behind it, even if that intent isn't readily apparent to the speaker. People can recite by rote memory, but that isn't blind token prediction, it's the neurological equivalent of muscle memory. People can have conversations then forget about them because their attention was focused elsewhere, but that doesn't indicate that they were simply "spewing words out without thinking" at the time.
Comment by dpark 5 days ago
People imagine details all the time. Eyewitness testimony is notoriously untrustworthy.
Our brains seem wired to confidently fill in gaps. We all have a literal blind spot we aren’t aware of because our brains convincingly lie to us and fill in the gap.
I don’t know what an “AI stroke” is, but I’ve definitely seen human beings in good health be in the middle of talking and suddenly forget what they are going to say.
> People tend not to "spew words out without thinking" and certainly not all the time by default - we call that glossolalia and (outside of some fringe Christian sects) it's considered a "bug" not a "feature" of the human brain.
Glossolalia is spouting gibberish, not comprehensible speech.
Kind of weird that you speak so confidently when you don’t apparently know the difference between steam of consciousness and “speaking in tongues”. Almost like you’re AI hallucinating.
Comment by thwarted 6 days ago
This sounds like a description of a child who has not learned to read yet. You ask a child who is not aware of the alphabet and of "words" how many r's are in strawberry you'd get a non-sense answer too. So what you're really pointing out is that the LLMs have not been trained on "the english language" and how words are constructed and what they are composed of. That they operate by tokens that don't correspond to words or letters is irrelevant as an answer to why they can't count the letters in a word. It's not that I know how many r's are in strawberry because of how I'm understanding the word "strawberry", I know how many r's are in strawberry because I know how to spell strawberry. The LLM needs to be trained on this the same way someone who is learning to read would be trained on it. No one should be surprised that an LLM can't "read" in the same way no one should be surprised that a child can't "read".
Comment by hackinthebochs 6 days ago
This interpretation takes things too far away from how LLMs are constituted and so misses important explanatory power. The issue of counting letters in a word isn't about an ability to spell, it's about the nature of one's perception. We perceive words as sequences of individual letters. LLMs do not. I can ask you to tell me how many r's are in some nonsense word sequence and you're fully capable of doing that. LLMs do not see sequences of letters so they are intrinsically at a disadvantage for this kind of question. But this says nothing about its capacity for intelligence anymore than not naturally being able to distinguish frequencies of photons hitting your retina has anything to say about human intelligence.
Comment by miyoji 5 days ago
I disagree with this pretty strongly, because I don't think you're correct that I don't have the ability to distinguish frequencies of photons hitting my retina. We have a lot of tools that can determine the frequency of light and I can use those on any source of light that I wish to measure that may hit my retinas.
If you ask an LLM how many Rs are in strawberry, it wouldn't think like this. It would confidently state that there are two Rs. Even though it "knows" that it can write a python script to count the number of Rs in strawberry, it doesn't do that. Why not? Is it maybe because it isn't intelligent? Yeah, you can prompt an LLM to write a script to count the number of Rs in strawberry, but that's a use of your intelligence, not the LLM's.
Comment by hackinthebochs 5 days ago
Yes, which is why I said naturally distinguish. Have you asked a frontier model how many r's are in strawberry recently? They get it right now. Either through RHLF to ensure they spell out the word letter by letter or some other means. Humans and LLMs both use tools or alternative means to overcome perceptual limitations. I don't see an in principle difference here.
Comment by dpark 6 days ago
Counting letters in a word seems to have little to do with understanding the word. Young kids can’t spell or count well at all but no one says that means they can’t understand.
Comment by lern_too_spel 6 days ago
Comment by Jtarii 5 days ago
Comment by tsunamifury 6 days ago
We discovered math that decodes data storage in langauge and is able to use sophisticated continuation cohorts from ALL OF HUMAN RECORDED KNOWLEDGE to respond to you in a call/response model with very good synthesis capabilities.
Its super useful, but not life or conciousness. Its a simulated echo from our collective recorded behaviors. It understands because we understood first. It replies because we wrote it first. And it sorts, organizes, synthesizes and compresses that at impressive speed now.
Comment by 1eieie 6 days ago
It’s strange many others have not eh? I think when new developments arise, ironically, this is the true measure of human intelligence - one’s ability to make sense of a thing and be closest to the truth.
Comment by lstodd 5 days ago
Comment by sp1nningaway 5 days ago
He isn't saying sentence completion is what keeps LLMs from understanding, he saying that's all they do (regardless of how advanced it is), and that isn't enough. You also need a body with senses and organs that produce a physiological response to emotions, and emotions are necessary for consciousness.
Comment by slashdave 6 days ago
Except this is not consciousness.
Comment by supern0va 6 days ago
It's a great interview, if you're interested: https://www.youtube.com/watch?v=NgDIG8u1-CA
Comment by slashdave 6 days ago
Comment by supern0va 5 days ago
That said, I'm not sure I follow what you're actually asking here? I'll also note that I'm not taking a position one way or the other, just sharing a podcast and noting that an extremely reputable scholar on the subject of consciousness seems to have a bit more uncertainty and humility than many commenting here. ;)
Comment by slashdave 5 days ago
I'll find time to listen to your link, it sounds interesting. My objection is the strange idea that humans are automatons that are keyed off input like a clockwork machine and operate sequentially. This is clearly not the case.
Comment by supern0va 5 days ago
I'm not sure that's a compelling argument. Humans can be put into a similar state where they are unconscious and not thinking. Think of someone in a coma, for example, where we actually measure and confirm that there is no brain activity where they're in that state.
They are not actively conscious, but that doesn't nullify their consciousness from when they were awake, right?
>My objection is the strange idea that humans are automatons that are keyed off input like a clockwork machine and operate sequentially. This is clearly not the case.
Well, a few thoughts here. First, it's worth noting that the argument isn't necessarily that AI are conscious in the way that humans are, nor that humans are strictly automatons.
But I think the more interesting thing is that our understanding about consciousness has evolved quite a bit in just the last fifty to one hundred years. We used to think that only humans were conscious, but assumed that primates, cows, dogs, and other mammals were just automatons. Then we started to think: okay, maybe primates are conscious. Then eventually: well, dogs also seem to have consciousness, and then rodents, etc.
This has continued such that most people in the study of consciousness think all mammals are conscious, and the debate is shifting down to insects and other creatures that we do think/have thought of more as automatons. We don't actually know where to draw the line, because it's essentially impossible to really feel/know the inner states of other living beings.
In the face of all this uncertainty, Chalmers just points out that since we understand consciousness so little, that ultimately we should probably be less definitive in pronouncing which things do or do not have it.
Comment by stinkbeetle 5 days ago
He was responding to your comment
> Funny enough, the models seemingly go insane and decohere into noise output in the absence of sensory input
The assumption being that "sensory input" is a prompt. What did you mean by sensory input?
Comment by supern0va 5 days ago
No, I could be mistaken, but I think he was clarifying his higher up comment:
>Imagine yourself in an isolation chamber. What are you thinking? Are you no longer conscious?
Comment by stinkbeetle 4 days ago
In any case, what do you mean by sensory input for LLMs?
Comment by Lerc 6 days ago
This is where the other claim is being made. That the structure of the model is fundamentally incapable of the operation, so even if you stipulated that the way you provide data is sufficient for intelligence then it still wouldn't work.
The universal approximation theorem addresses this point. In that, with an identity attention mechanism, a LLM is just a multi layer perceptron. The attention mechanism is effectively a way to get one of the benefits of a much larger fully connected layer without the massive cost.
A LLM can do what a MLP can do. A large enough MLP can do any function to arbitrary precision.
That makes the claim that an LLM could not do a task the same as saying no function can do that task.
Some are ok with this, if you invoke some supernatual aspect to intelligence then the inability to describe it with a function is quite reasonable,
If you want to stay in the world of reality, you have a much harder task, people like to point at quantum (Penrose) but it's hard to say what it is you are pointing at.
I think the very act of proving that something is or is not intelligent, would render it functional by nature of it having a proof, (or disprove Gödel's incompleteness (a tough ask))
Are there any proofs that cannot be expressed as a function? A kind of Gödel locator, where you can prove something that you can identify is true but there is no formula to express it. I'm not entirely sure what that would even mean,
Comment by Isamu 6 days ago
A language model completes text based on the overlapping patterns of the training data.
There absolutely was thinking involved… in the training data. Same as when you read a book, you engage with the thinking behind the text. The book isn’t thinking, and the author may be dead and gone, but there’s absolutely the traces of thinking in the text.
Language models produce mashups of texts they were trained on, and there’s absolutely the traces of thoughts behind those mashups.
Comment by yogthos 4 days ago
Comment by qarl 6 days ago
I'm hearing a lot of bad arguments against LLM consciousness lately. Bad argumentation heralds bad outcomes.
Comment by nozzlegear 6 days ago
What bad outcomes do you foresee from badly arguing against LLM consciousness?
Comment by qarl 5 days ago
Comment by calf 6 days ago
Comment by hn_acc1 6 days ago
From Wikipedia: In 2023, Chiang was named one of Time's 100 most influential people in AI.
Comment by calf 6 days ago
Indeed it isn't a one-off. His last infamous article compared AIs to Xerox machine image compression. He convinces a certain type of crowd that is not technical enough to poke holes in his posturing.
Comment by CommieBobDole 6 days ago
It's a big file with a bunch of coordinates describing spatial relationships between tokens. When you give it a prompt, it uses those relationships to generate a string of tokens that is a statistically likely response to that prompt, then it stops. It's not changed by the experience. It doesn't remember anything. It doesn't sit around thinking on its own.
Even if the model itself were extremely complex, it's hard to imagine a definition of consciousness that includes something that doesn't remember and can't change.
Comment by layer8 6 days ago
Comment by eberkund 6 days ago
Comment by layer8 6 days ago
Comment by krupan 6 days ago
Honestly, that's a pretty messy state of consciousness and I wouldn't proudly crow that my AI is conscious if that's as good as it got
Comment by krupan 6 days ago
Comment by Taek 6 days ago
Comment by nozzlegear 6 days ago
Comment by layer8 6 days ago
For humans, part of the input of the human mind comes from the continuous processes and clocks within the human body, so it’s questionable whether the brain could “think on its own” without such input either.
Comment by nozzlegear 5 days ago
Comment by antisthenes 5 days ago
It's the constant sensory input of the world and the realization and drive to survive as the second order effect of it. Mortality, vulnerability to external factors codified as input could in fact allow the LLM to independ as sentience.
Of course besides the sensors, it would also need a way to affect the physical world, and to be able to monitor the degradation if its own hardware, but when that barrier is crossed, it would be much closer to full sentience than whatever we have right now (which is nowhere near sentience or AGI).
Comment by layer8 5 days ago
Comment by nozzlegear 5 days ago
Comment by twobitshifter 5 days ago
2) We are prompted (invoked) by our environment continuously.
3) If you go unconscious due to fainting or drugs you too will stop thinking.
Comment by cauch 5 days ago
- A motor is something that create a force to push a vehicle.
- Oh yeah? My neighbour car does not have wheels and sit on concrete blocks, the vehicle does not move and yet we all agree it has a motor. So it means that I can claim that this other thing that does not move has a motor too.
Sure, human can _some times_ not do some stuffs, but the fact that they can do these stuffs sometimes is the point.
Doing these stuffs is the hard thing. Doing these stuffs is the proof that the machine has what it takes. It does not matter if someone cannot do that stuff, it does not imply that their internal system is not complex enough to potentially do it. But the fact that some people can do that stuff is the demonstration that inside a human skull, there is a system that is complex enough to potentially do it. Unless you can prove that people who don't do it have a fundamentally different system inside their skull, then you cannot pretend that they should be considered as having a less complex system.
Comment by twobitshifter 5 days ago
Comment by cauch 5 days ago
Human _can_ check themselves. They don't _always_ check themselves.
Motor _can_ move vehicle. They don't _always_ move vehicle.
LLM _cannot_ check themselves. They _never_ can. It is not that some don't, they just cannot, they are not a system complex enough to do so.
So, yes, it is a refutation. If you have something that _never_ can move a vehicle, this thing does not qualify as a motor, even if some motor, sometimes, don't move a vehicle.
And if your next argument is "yeah but I would argue you don't need to check yourself to be conscious or to understand things", then you just redefine the definition that is owned by your interlocutor. Your interlocutor is saying that this is a criteria they are expecting. Good for you if you are not expecting this criteria. But the problem is that the answer is not "this criteria is not expected", the answer is "I change the criteria from 'being capable to in some circumstances' into 'does always do it in any circumstances'".
Comment by nomel 5 days ago
All modern agentic harnesses can do this. Nobody uses raw LLM for anything remotely complex. There's always some external system in place. That system is part of the "thought process".
Adjacency doesn't matter here, only what the result of the system of pieces is.
Comment by cauch 5 days ago
It means having self-control on their action and being aware of them. If you ask a system, it will respond, it cannot choose to not respond (even if the response if "I don't want to response", it still "run", still do the work). If you don't ask a system, it will not respond.
Adjacency is the point of the thread here. Saying "you say X is important to decide if the thing is intelligent/understanding/conscious, so let me just change X in the middle of the discussion and say that X does not matter".
That is exactly my first comment in this thread: I don't care if AI think or whatever, my reaction was about these "counter-arguments" that totally miss the point and make the person who push them ridiculous. If you want to have a counter-argument, you first need to understand the interlocutor, not just spew whatever rebuttal you constructed that answer something unrelated to what the interlocutor brought to the conversation.
Comment by nomel 5 days ago
I'm not sure I understand. How do you do it?
In my thought process, I quite literally stop myself, and say "ok, think about what you just said" to check myself. I literally initiate that loop. If I don't, then I'm not using my own mental agency, and just using my firm coded priors.
I will say that I do seem to have a stop, what you said is wrong logic check voice that pops up without me initiating it. But, it's unreliable, and not too much different than all the content monitoring system used for the streaming clients, that will terminate with "content violation" immediately after the "incorrect" words are sent. I don't think integration is important, just the behavior of the overall system.
Comment by cauch 5 days ago
In fact, in the large majority of the time, you don't process "as a loop" at all, you just continuously progress in your reflection without needing a "second voice" to retrigger you. The fact that sometimes we do this is just something we can do, not the result of something needed for our brain to work. For AI systems, this is something needed because the "answering" part is not able to do the loop on its own. And building a bigger system that combine an "answering" part and a "loop" part does not fit this, does not create a self-reflective system, it just makes a non-self-reflective system and a workaround bundled together.
It's a bit like if the "answering" part was unable to provide only one answer and was always producing plenty of different possible answers, including contradictory ones. Then, you can add an external part that will just pick one answer (and add it to the context so the next large set of answer will not be inconsistent), without any intelligence to it. The whole system will look like a human. But we know that the system is not "living" and "aware", because a "living" or "aware" system has its own opinion, while this system is just generating convincing sentence without seeing any hierarchy or value or meaning in each one.
Comment by nomel 5 days ago
Comment by cauch 5 days ago
In real life, the paper moves "by itself". It does not need an external loop that update its position in a loop manner.
In the video game, you need an internal loop, a step-by-step tick, that update the plane position based on its current position and its momentum. And this is why a video game paper plane is not a real object. It is a very good simulation, it looks like it, but it is missing some intrinsic properties that we expect from a real object.
Yet you can analyse the paper plane trajectory and see it as a Markov chain, with quantified step-by-step progress (for example one position point every 0.1 second). The same way you can look at your though process and identify a step-by-step progression. But it does not mean that it works like that intrinsically, it does not mean that the paper plane "jumps" from position point at time T1 to position point at time T1+0.1 second.
For the human brain, there is no "loop centre" in the brain. There is no one (to my knowledge) who got a brain injury and suddenly were unable to keep a single line of thought without having someone else having to feed them the previous thought in order to feed the next thought.
In the brain, the fact that the previous thought feeds the next thought is "how it works", it is intrinsic, it is by design. And this mechanism of thoughts feeding the next thoughts is what creates "consciousness" or "awareness": self-reflection is based on the fact that thoughts are intrinsically linked together, that they "flow" continuously, without needing an external system to update them.
You cannot take away the "loop" part of the paper plane so that it suddenly would be unable to move on its own once thrown away.
Now, you can always say "well, the paper plane in the video game is a very good simulation, it does not matter if it is a real object or not", and that is fair enough. But in this discussion, some people have arguments to support that this property matters, that it is one condition for consciousness or awareness.
Comment by nomel 5 days ago
There are definitely cognitive feedback loops: https://pmc.ncbi.nlm.nih.gov/articles/PMC11903256/
Is your argument that, because they're external to the Llm, rather than integrated, they don't count, not even in a practical sense?
I think the result of the system is all that's important. Where/how it's implemented doesn't matter for practical results.
If the argument here is that LLM don't have this built in, you should know that nobody has a practical use for plain LLMs these days. Nobody uses them this way, except for debug. All interesting use is through some kind of harness, with all sorts of systems bolted on. I think these conversations are only meaningful in this "agent" context that people actually use LLM, where they stop when they think they're done.
LLM don't have a some self contained loop, like we do, sure. Who cares though. The actual AI system that we use every day definitely do.
Comment by cauch 4 days ago
Have you read the article in question. It is saying that for one continuous thought, the brain will use different part of the brain to do different thing. It does not say that there is a "loop controler" anywhere. On the contrary, it illustrates that there is no loop controller: there is not special brain function that control this loop, this loop is "how the brain works", and LLM don't do that, they are incapable to do that, it is not how they work.
> Is your argument that, because they're external to the Llm, rather than integrated, they don't count, not even in a practical sense?
No, my argument is that the nature of the brain and the nature of the LLM are very different, as different as a real paper plane and a video game paper plane. Some characteristics (for example, awareness) that exist in the brain cannot exist in the LLM because these characteristics are the result of the nature of the thing in question.
The problem is not that you build a system by integrating 2 things together. The problem is that they are different "things", they are different machines, they function, fundamentally, differently. They may produce the same output, but when you say "the brain has the characteristic X, the LLM produce the same output, so the LLM also has the characteristic X", it is logically inconsistent.
Planes are built as a system combining 2 things: a motor and some wings. But they are fundamentally different from a bird. They just don't "work" the same. It is not the same mechanism.
> you should know that nobody has a practical use for plain LLMs these days
That is totally irrelevant. My point is about the nature of the LLM, and the fact that it is stupid to see the same output and to conclude that they have the same characteristic. It is like saying "Birds are flying in the air and are alive. Planes are flying in the air, so I guess they are alive".
> LLM don't have a some self contained loop, like we do, sure. Who cares though. The actual AI system that we use every day definitely do.
No, you miss the point. The problem is not that "you can just add an external loop". The problem is that the brain is a system that works without such control loop. The thoughts are flowing (and they may flow to different brain functions, like explained in the article you quote). It is part of how the system works. Having a system that contains 2 things, one that does one computation and one that control the loop is not equivalent to another system where you cannot decouple the "flowing of the thought" from the "thinking machine".
Comment by twobitshifter 5 days ago
Yes, LLMs don't think on their own, for one; they think when you invoke them.
My rebuttal is that people only think when invoked just the same and can enter states where there is no consciousness just the same. The OP has already accepted that LLMs think, but it seems that you are arguing they do not? This car business is confusing and the LLMs not checking themselves is also wrong, there’s even a benchmark for this https://correctbench.github.io/
Comment by cauch 5 days ago
OP says: LLMs never think when not invoked.
What you said: I have example where, sometimes, human think when invoked.
That's the difference: human brains are intrinsically different because they are built to be able to think without being invoked, even if there are situations where they think when invoked.
There are tons of obvious examples of human thinking without being invoked. Just take a bath and you will see :)
Comment by nozzlegear 5 days ago
You can argue that humans are just biological machines reacting to external stimuli, but that's a philosophical argument that I'm not interested in having and frankly, I think it'd be selling yourself short a little bit.
Comment by twobitshifter 4 days ago
As far as the argument about being loaded in memory, if there’s any consciousness in AI, it’s obviously in different form than a biological consciousness. We’d have to agree that consciousness does not require a body to get past this.
Comment by xyzsparetimexyz 5 days ago
Comment by twobitshifter 5 days ago
Comment by knollimar 5 days ago
I don't think LLMs structurally even get the 30 seconds part. It's literally 0 for them.
Comment by mft_ 5 days ago
It's the long-term memory (i.e. learned experiences feeding back and directly altering the content of the core brain, or model) that is missing.
Comment by knollimar 5 days ago
It feels like notes about the situation rather than it being in memory. Memory has more "attention". I think that "it starts to fail" is load bearing here.
I feel like memory has like 5 parts, and LLMs are missing 2 of them:
current working memory
short term what is immediately happening without it being in "RAM". I differentiate here vs working in like thinking fast and slow. Keeping things in working memory is work! You can vibe away short term memory. I had excellent short term memory while I was messed up, I could keep time well. I think LLMs can do this with notes.
mid term: Vague awareness of things like what day a week it is or what you did 2 hours ago. This is where my memory personally failed
long term memory of experiences. You can fake this with memory.md
generalized wisdom for pattern matching long term memories
LLMs seem to be missing that part I was missing. Im probably projecting and anthropomorphizing. But i relate: I would confabulate a ton and didn't know anything was wrong for a while but things seemed off.
Context is like working memory but not short term or mid term. I think you can imply short term with big enough context.
My categories are purely anthropomorphic to me but just wanted to say where I disagreed.
Comment by mft_ 5 days ago
> long term memory of experiences. You can fake this with memory.md
Not sure about this; to my mind, memory.md is analogous to humans making lists of things to not forget to do, or notes from a lecture to learn (i.e. cram into long-term memory) later on. LLMs use it as a short cut to bring important facts back into their context window; but it's not the same as them already 'knowing' the information via the original training process.
---
My consistent (hot?) take is that a (the?) major piece holding LLMs back (maybe even from AGI?) is continual learning. Humans have systems for continually updating their long-term memory from their lived experience - new facts, processes, skills, successes, mistakes, etc. (Sleep and dreaming are centrally involved in this process.) The current architecture of LLMs makes this practically impossible, as it would presumably require the level of power currently necessary for training to be continually applied for continual learning, and demonstrates the huge efficiency advantage of the biological brain.
Comment by layer8 5 days ago
Comment by knollimar 5 days ago
Comment by seizethecheese 6 days ago
Comment by layer8 6 days ago
Comment by bulhabulha 6 days ago
Comment by micromacrofoot 6 days ago
Comment by layer8 6 days ago
Also, how does that relate to consciousness? I don’t think that past episodic memory is necessary for consciousness.
Comment by micromacrofoot 4 days ago
Comment by flashman 5 days ago
Comment by MagicMoonlight 5 days ago
Comment by stabbles 5 days ago
It feels very unnatural to get the same conversation verbatim at a different point in time.
Comment by Jtarii 5 days ago
Comment by Jensson 5 days ago
Comment by twobitshifter 5 days ago
Comment by kranke155 5 days ago
Comment by in-silico 6 days ago
The entire file is not changed, but the KV cache is.
> It doesn't remember anything
The model definitely remembers previous exchanges within the same conversation.
Comment by rmunn 6 days ago
No it doesn't. They get added to its context, and it reads them afresh when answering the next question. That's not remembering.
If your short-term memory completely malfunctioned one day, so you had no ability to remember what was said to you a minute ago, then you would have to find workarounds. For example, you could write down everything someone says to you, then read your notes of the previous exchanges in that conversation in order to continue the conversation. That would be a good way to work around the fact that your short-term memory was broken. And if your notes were invisible to other people and you could read them really fast, then you could even make most people believe that you remembered what they said a minute ago. But you don't actually have a working memory, you're just writing down what they said and re-reading it while coming up with your next response.
That's exactly what LLMs do. That's not memory.
Comment by ACCount37 5 days ago
Attention over KV cache allows past behavior and past inputs to influence future inputs and future behavior. In LLMs.
Until the cache runs out, that is. But even then, you could totally use any of 9000 methods of cache compression, truncation, dropping or streaming and get away with it.
The difference between continuous learning and in-context learning seems to be in capacity, not in principle. Both are doing a similar thing, but one has more length and depth to it.
Comment by nomel 5 days ago
Comment by ACCount37 5 days ago
Comment by in-silico 6 days ago
The model takes in the context, encodes it into a "memory" (the KV cache), and accesses that memory later. That fact doesn't change just because the KV cache grows in size with the context.
I don't know what memory would look like other than an encode-retrieve loop.
Relevant: Transformers are Multi-State RNNs - https://arxiv.org/abs/2401.06104
Comment by CommieBobDole 6 days ago
It still generates every response using the model's pristine state with every new API call; whether the context is provided from the client or from a colocated cache server doesn't really change that.
Comment by fipar 6 days ago
Comment by nomel 5 days ago
I don't think physical integration within one contained is relevant to system level behavior.
Comment by fipar 5 days ago
Comment by nomel 5 days ago
Your own long term memory is the orchestration of systems that make it long term.
Comment by fipar 5 days ago
Now, my opinion is it currently can't think, but it certainly has memory.
However, LLMs don't have memory. That's what I (and others on this thread) responded to, which is unrelated to how my own memory works.
Comment by gringoDan 6 days ago
Comment by xjm 4 days ago
I don't think people here are arguing that sentience would happen when the model is not running, or that sentient experience spans several sessions that do not share some kind of state?
Also, a definition of consciousness is anyway hard to imagine :)
Comment by supertroop 6 days ago
Comment by george_max 6 days ago
Comment by supertroop 6 days ago
Comment by twobitshifter 5 days ago
Comment by aaroninsf 5 days ago
I.e. the "self" is not the same as what it means to experience consciousness.
There are for example well characterized examples of memory disruption under the influence of various drugs (e.g. as used intentionally in anesthesia); and neurological conditions which produce various kinds of amnesia.
Do these conditions mean someone is not conscious? We have the luxury of asking people directly.
More unsettling edges yet include things like so-called "split brain" patients or people suffering form serious psychological conditions like so-called "multiple personalities." Psychology does get great mileage out pathology!
Comment by mortsnort 5 days ago
Comment by yesitcan 6 days ago
- average Hacker News response
Comment by ooloncoloophid 5 days ago
There is a great deal of good thinking on Chiang's topic by professional philosophers, and there's much to be said for reading them. I won't rehearse their ideas here. Chiang's arguments might be correct; but I suspect they probably aren't, and his error may well stem from characterising human thought as something in its own class, which is probably a cognitive bias that humans have. He might also - I'm speculating - be arriving at his conclusion based on his feelings, which the final paragraph suggests (the comment about the models being based on morally dubious actions).
Speculation aside, we are not, I believe, in a position to make points like he does with any certainty.
Comment by Lerc 5 days ago
It has been quite frustrating encountering arguments that have been extensively debated for years be presented as if they were new revelations.
In all my debates with people in the last few years I have primarily taken the position of trying to explain the problems with claims of certainty, and that lack of certainty permits possibility of the opposite. We should act responsibly around what might be possible.
There is also the narrative of "being too obsessed if you could to consider if you should" or similar claims of an unconsidered path forward.
Isaac Asimov wrote the first of the Robot stories in 1940, they were not written in isolation, it came from an awareness of the situation and the questions that must be asked. There was a community considering these issues. Asimov gave the wider public a view of some of those issues.
If we have a hundred years of people going "This is coming, we had better decide what we want it to be" and nobody listens to them, or frequently outright ridicules the need for considering their ideas, why is it now we are placing the blame on those who are now showing some success at what they told us they were attempting all along.
Comment by ooloncoloophid 5 days ago
Comment by Lerc 5 days ago
He was very clear to stipulate that the laws themselves were more than the text that represents them, no word play or creative interpretation was possible. This is much easier to do as an author than it is as a developer. You just declare that part as having been susessfully done by scientists.
The rules were their clear semantic meaning. Some of the stories explore the implications of changes to the laws either by design or accident.
I feel like the most interesting ones are the stories where the laws are working and have undesirable outcomes. It reveals that even if fully obeyed, they do not represent what we want.
Some of that is because what we want is dependent on situations that can go beyond safety and harm. Some of it is outright human hypocrisy.
When he gets to things like a closeted robot running for president he makes thebpoin clear about how if the bodies appear alike, you can't tell the difference by the behaviour of a robot conforming to the laws and a good person. That led to the obvious way to distinguish them was to get it to do something bad. The solution on how to fake that might have been a subtle dig at class based society.
It's been some 20 years since I last read the stories, I wonder if there's any point now given my ability to record new information in my youth significantly exceeded my ability to retain it now. I suspect after a week my youthful recall would be better than my week old recall.
Comment by sp1nningaway 5 days ago
I like that he makes the emotional component of his argument plain. I'm deeply suspicious of anyone would try to argue about the concept of personhood and consciousness using only logic and empirical "correctness."
Comment by cjkaminski 5 days ago
I'm also super curious to learn more about the philosophers you referenced and their thoughts on this subject. Would you be willing to share some of your favorite examples?
Comment by ooloncoloophid 5 days ago
0. Descartes/Titchener/Chomsky and friends for background.
1. John Searle featured prominently because of his accessibility. I tended to present his Chinese Room argument as a criticism of symbolic AI alone, though I believe he considered it a criticism of sub-symbolic approaches as well.
2. Thomas Nagel's classic article "What Is It Like to Be a Bat?" is a good introduction to qualia, which is how we describe direct conscious experience.
3. Wittgenstein would be important in terms of the impossibility of empathising with, or seeing things from the perspective of, other minds that have emerged in different contexts (such as animals). However, I rarely spoke about him because, frankly, I couldn't understand him in the original!
4. David Chalmers. Writes clearly (and coined?) the 'hard problem' of consciousness and why subjective experience appears difficult to reconcile with a purely physical account of the mind.
4.1 Daniel Dennett. The clearest and most influential critic of the idea that consciousness presents a special explanatory problem.
5. Darwin and others. Comparative psychology (the study of animal minds) strongly suggests, by showing that the antecedents of the human mind are present in animals, that we should reduce our bias that human minds are special, ineffable, or somehow atomic.
6. Jerry Fodor. Modularity: the idea that the mind is composed of modules that specialise in certain tasks (e.g. phoneme perception, syntactic analysis, face recognition) and operate largely unconsciously unless they are dependent on one another in some way. This helps us take a computational approach to the mind. It reminds us that much of what we do mentally might be expressible in computational terms.
Two final ideas off the top of my head:
1. The Ship of Theseus: https://en.wikipedia.org/wiki/Ship_of_Theseus
When (not all) neurons die off and are replaced by new ones doing the same job, our sense of self and identity is somewhat cast into doubt. In a physical system, if consciousness is a magic or non-material entity of some kind, what is happening to it during this process?
2. Integrated Information Theory. A good attempt to tackle consciousness rationally. The idea is that consciousness corresponds to, or is associated with, the degree to which information in a system is integrated into a unified whole.Comment by aqfamnzc 5 days ago
Is there an accessible way for a layman such as myself to read about some of these ideas (Really I mean philosophical discussion in general) without having to read entire books? Is there an active HN-equivalent or wiki or something?
Comment by ooloncoloophid 5 days ago
I’ll mention some more sources in my reply above.
Comment by potsandpans 5 days ago
They're not exactly casually absorbed, as in a wiki or forum. But you can read some books that begin to introduce these ideas. On the topic of consciousness, less academic and more slated towards general audience: Reality+ by David Chalmers and Mind and Cosmos by Thomas Nagel and Galileo's Error by Phillip Goff will give you and interesting gamut of ideas.
The thing about arguments in philosophy is that they span from a very old web of thought that has been refined into very sharp positions over a long time. So you will find yourself ever recursively going back to understand ideas and framing with more precision.
This is why it's difficult to casually get into these topics IMO. There's just been so much said and discussed, to understand the current meta (as the gaming folks might say), you have to understand how we arrived at the current meta. And that's a long journey that's never complete!
Comment by willmarch 5 days ago
Comment by sigmar 6 days ago
I agree with some parts of this piece, but paragraphs like this one above seem pretty uninspired and simplistic. It's entirely plausible that a conscious mind would not be evolutionarily incentivized to be able to do those things. ie just because animals on earth needed to develop specific talents doesn't mean that other conscious entities need to. Why would a computer program need to hunt for food like a mouse would? Making tools like chimp? these seem like nonsensical metrics.
Comment by sdenton4 6 days ago
Is a car a body? Does an AI situated in a car therefore get to have desires and emotions? Is a taupe box with a webcam attached a body? (For that matter: Is a quadropelegic body a body? Do quadropelegics have desires and emotions? Obviously, yes and yes.) Why is a body necessary for the formation of desires and emotions? Why are desires and emotions necessary features for consciousness?
Or here's one: If I'm not experiencing any particular emotion in a given moment, am I concious?
I tend to think that emotions, at least, are mainly hormonal global triggers: they're more about physiology than actual consciousness. The whole thing, as a result, sounds like an effort to privilege biological intelligence, rather than a real foray into the issues.
Comment by pasquinelli 6 days ago
the subject is consciousness, not intelligence.
Comment by gabrieledarrigo 6 days ago
How can you have a subjective experience without a body?
That's the point, in my opinion: your physical/chemical state (body) in a given moment is then translated into the higher abstraction of the emotion. An emotion that *you* feel, because you are self aware of what's happening.
How can you be self aware without feeling? And how do you feel, without a body?
Comment by grumbel 6 days ago
Easy, you don't have subjective experiences because you have body in the first place. You have them because some signals come in from your nerves, which your brain turns into a world model. You are effectively a "brain in a vat", the vat just happens to be placed on top of your body.
An AI system constructs the world model a little different, by all the text that gets feed into it, but that doesn't mean that there is anything fundamentally different in the world model it builds. Consciousness operates on world model, not on the world or even the body itself.
The AI's world model might be missing some information, because they weren't described in enough detail in text, but that shouldn't matter for consciousness. A blind or deaf person isn't less conscious than one that can see or hear just because some information is missing from their world model.
Comment by felipeerias 6 days ago
Comment by grumbel 5 days ago
If we want to know if AI is conscious or not, we have to ask if the AI can recognize itself in the input it gets.
Some aspects like limited content length and lack of ability for the model weights to update will certainly limit what the AI can do. But that's ultimately a matter of degree, not kind, when it comes to consciousness.
Comment by Jensson 5 days ago
It is important, if I destroy your brain and grow a new one and start running the same program on the new brain your consciousness is still dead and its a new person living.
Computer AI models don't run on a single machine, they run in a distributed manner using different machines at different times. When you ask a follow up question thats not sent to the same machine, its another machine answering. So the consciousness one of those computing machines experience would be extremely fragmented and not at all conscious about any discussion with you, since it only saw a few fragments of it.
And no consciousness doesn't expand to cover larger distributed computations, otherwise social media would have developed a consciousness by now but it hasn't. Groups of humans don't start to share consciousness, it doesn't happen, so you can assume groups of distributed computers wont as well.
Comment by grumbel 5 days ago
If I copy a text document from one computer to another, is that the same document or a different one? It's all just information. If you copy it, you have two, it's still the same until the documents start changing and go different directions.
> Computer AI models don't run on a single machine
It doesn't matter on how many machines it runs on. It's information processing, as long as it gives the same results, it doesn't matter how you accomplish it.
Consciousness isn't some magic thing that sits on top, it's the result of that information processing. You take random sensory data, the brain transforms that into "cats, dogs, you, me", it uses uses those percepts to execute actions, gets more data back and checks how the actions changed the world state.
Keeping track of what changes in the world were causes by actions of the brain vs things that happened due to other causes is the conscious experience.
Comment by felipeerias 5 days ago
Comment by solomonb 6 days ago
Comment by Planktonne 6 days ago
Comment by Gooblebrai 5 days ago
I think we give too much credit to the brain. The gut has almost as many neurons as a dog's brain and the heart has neurons too. "You" are more likely the whole ecosystem, not just your brain. There are even some hypothesis of disorders like depression being more influenced by the gut than the brain.
Comment by sfn42 6 days ago
Then a request comes in, and the system does a bunch of calculations using those bits, and spits out a result. The bits are unchanged.
When your brain receives input, it is changed. It is constantly active. If it ever stops being active it's dead.
So, what exactly is the claim? Are the bits constantly conscious? Do they snap into consciousness when the computer does math with them? Or is it maybe the computer that's conscious while it's processing these bits? How about when it stops doing that and goes back to doing other stuff? Why are these particular bits special? Was the computer always conscious?
I feel like the only way anyone could believe LLMs are conscious is if they don't understand how computers work. Of course it isn't conscious, how could it possibly be conscious? Its literally just bits. It's like saying the text in a book is conscious.
Comment by orangecat 6 days ago
These are all fine questions, and they don't become any easier to answer if you replace "computers" with "brains" and "bits" with "neurons".
Of course it isn't conscious, how could it possibly be conscious? Its literally just bits.
"That's ridiculous. How can meat make a machine? You're asking me to believe in sentient meat."
Comment by el_jay 6 days ago
What is even being argued here - neuroscience is hard, so programming your PC thus makes it conscious?
> "That's ridiculous. How can meat make a machine? You're asking me to believe in sentient meat."
… what?
Comment by orangecat 5 days ago
We don't understand how combining a bunch of obviously(?) non-conscious biological components can produce a larger system that is conscious, so it's unwarranted to be certain that that can't happen with software.
Comment by jlightfire 5 days ago
Comment by jsnell 5 days ago
Comment by nprateem 6 days ago
Or if they're retards. The fact this still comes up is weird. A printing press isn't conscious, so why would an LLM be.
Don't forget, some of the bros are overly excitable. Like that twat who reckoned a Google model 5 years ago was conscious.
Comment by hn_acc1 6 days ago
If a hundred people see the same event, will they all respond the same?
Comment by Jtarii 5 days ago
Comment by Jensson 5 days ago
Pretty sure you wont, humans vary the exact wording. They will say the same song but they wont answer the exact same way every time. Even if they say the same words two times they wont use the same tone and body language, as they don't just communicate via words and that nonverbal language is a part of what we say.
Comment by jcattle 5 days ago
Nicely circling back to LLMs not being able to learn and form memories.
Comment by grumbel 5 days ago
If you have hundred different people, they will of course do something different. Just like hundred different AI model will do something different. The question you have to ask is if the same person under the same circumstances would do the same.
Luckily, we have an answer to that: They would. Transient Global Amnesia is a condition where people temporarily lose the ability to form memories and in turn they keep repeating the same conversation again and again[1]. Their brain keeps asking the same question again and again, as it doesn't remember the answers it already got.
Comment by keeda 6 days ago
Comment by gabrieledarrigo 6 days ago
Bro, I think we discarded this idea from Platone and Cartesio a while ago...
Your brain is your body.
Your mind is not detached from it, and you can't feel anything, and so have a subjective experience, without it. Neither, your mind, or "soul" could survive to the physical death of your body.
So...I mean...
Comment by grumbel 5 days ago
You don't need a body, you need electrical signals your brain interprets as body. And in principle you don't even need a brain, you could replace that with some matrix multiplication or transistors that do the same stuff.
The important part of consciousness is being able to figure out what of the sensory input is correlated to your own action and which was caused by the rest of the world.
Comment by gabrieledarrigo 4 days ago
As I was saying, your brain is literally part of your body. It's a living organ of your body that receives those inputs, as you say, from the rest of your body.
> And in principle you don't even need a brain, you could replace that with some matrix multiplication or transistors that do the same stuff.
I completely disagree here, and that's why guessing that a bunch of matrix multiplications are coscient and have self awareness of their own experience while running in a CPU seems quite hilarious to me.
But hey, that's just my opinion.
Comment by grantcas 5 days ago
Comment by Tadpole9181 6 days ago
If I hooked up electrodes to the hearing centers of your brain and force fed you dialog you perceive as speech (but is really a great deceiver), then responded in what you thought was speech (but are really just probes I use to convert your thoughts to text), that wouldn't suddenly be less real to you. It wouldn't devalue your sapience.
Comment by goatlover 6 days ago
Embodied cognition rejects this assumption. We didn't evolve as brains that were then put in bodies, we evolved as bodies with nervous systems.
Comment by gpderetta 6 days ago
Comment by felipeerias 6 days ago
Comment by margalabargala 6 days ago
Why can't that physical medium be GPUs and RAM? And temperature sensors and cameras? What's special about our meat that it's our "body" in the way a computer is not the body of an AI?
I don't think the point being argued can be true without some incredibly contrived, human centric definitions of "body".
Comment by felipeerias 6 days ago
Comment by margalabargala 6 days ago
Comment by felipeerias 6 days ago
Would you be "you" in a different body?
Comment by margalabargala 6 days ago
To claim otherwise would mean anyone who's gotten a transplant or amputation is no longer themselves.
Comment by confidantlake 6 days ago
Comment by margalabargala 6 days ago
Comment by Jensson 5 days ago
If you delete the hardware and buy new hardware and write in the same weights its still the same LLM. If you delete your brain, get a new brain and put in the same weights in the neurons its a clone, its not you. We know what happens when we create a clone, the clones aren't the same person they are separate consciousness, so consciousness is tied to the hardware.
For example, if we have 5 identical clones down to the atoms of their brains, and send a message to the first and let him respond, then send the same message to the second together with the firsts response and let them add a new message etc. That is not one consciousness responding, thats 5 different consciousnesses responding, and that process doesn't connect them into a single consciousness. That is how LLM works, at best a single token generation is conscious, but that would be a less meaningful consciousness than a ringworm.
Comment by margalabargala 5 days ago
Your whole argument hinges on this, and I don't think it's true.
A clone is not what you describe, not even close. Clones are copies of DNA, not of the actual physical neuron structures.
If you were to vanish, and the next moment a perfect atom-for-atom, particle-for-particle, quark-for-quark copy appear in exactly the same place, I think that is you. As far as I know, that does happen every single planck second.
For it to be you, though, it needs to occupy the same space you were previously in. You can't copy your atoms 5 times over in 5 different locations; those aren't you, they're just similar. Like copying an LLM 5 times over to different hardware, seeding its RNG differently each time.
Comment by felipeerias 6 days ago
Comment by foltik 6 days ago
The experience of “you” is just your specific memories and world model, continuously updated with sensory input.
If another body “runs” the same exact pattern, that is you. Theres no link and nothing was transferred; the pattern of thoughts and memories is all “you” ever was.
Same as playing the same song on two different speakers. Nobody asks what links the song across them; it’s the same song wherever it’s played. You’re just a far more complicated pattern on a far more complicated speaker.
You might ask, “but why am I this pattern?” Because this is the specific pattern modeling itself from the inside in asking that question.
Comment by felipeerias 6 days ago
And my intuition is that no, you can’t, they are two aspects of the same reality.
Comment by foltik 5 days ago
Comment by margalabargala 6 days ago
If "I" am in a different body, then what makes that "me" who's in a different body?
Comment by maxerickson 6 days ago
Comment by margalabargala 6 days ago
I don't see why hardware is any more fungible than a kidney. If your LLM reads the serial numbers of its motherboard/RAM/etc as a seed for entropy you can make identical arguments about body fungibility and self.
Comment by JoshTriplett 6 days ago
If every neuron of your brain were simulated precisely on a sufficiently powerful computer, that simulation would have subjective experiences, without having a body.
Comment by sfn42 6 days ago
Also that's impossible. It is impossible to simulate reality exactly using digital computers. The best we can do is approximate. Doesn't matter how powerful it gets, it'll always just be an approximation.
Comment by acyou 6 days ago
Comment by sfn42 6 days ago
It is nonsensical to claim that anything other than my brain could produce the same consciousness that my brain is producing. It's obviously far beyond anything any conceivable digital computer could ever reasonably simulate, and even if you did create a "good" simulation it obviously wouldn't have the same properties that my brain does because it's an entirely different thing than my brain is.
Comment by JoshTriplett 6 days ago
In any case, a closely-but-not-perfectly-accurate simulation of a real human brain is still going to be human, unless you believe that someone becomes less human when they're experiencing some kind of cognitive decline, or a stroke, or other biological malfunctions. The point is, there is nothing essential to the having of a physical brain that creates the concept of consciousness or sense-of-self.
Comment by sfn42 6 days ago
And there absolutely is something essential to the concept of being human that we are entirely incapable of replicating artificially. In fact as far as I know we are incapable of synthesizing any kind of life whatsoever. We can't even create the simplest type of living cell imaginable.
So to claim that we could just create consciousness, a fundamental property of this "life" thing that we don't properly understand, within a piece of rock is beyond naive. We don't even know what it is or how it is.
Comment by JoshTriplett 5 days ago
We understand quite well where in the brain the sensation of self-awareness / self-experience / sense-of-self comes from. We have evidence that disruption of that part of the brain breaks those sensations.
Comment by sfn42 5 days ago
Okay now tell me where in the LLM its alleged consciousness comes from.
Comment by lupire 6 days ago
So I have no idea what distinction Ted Chiang is trying to draw
Comment by martin1975 6 days ago
Great q. Deepening it further-how can you have a subjective experience without consciousness, which isn't necessarily tied to physicality. Taking it one step further-can you have consciousness without a mind? Who's the first mind, the first cause of it all, that begot both the material and immaterial world?
Fun stuff eh?
Comment by thinkling 6 days ago
Comment by pizzly 6 days ago
Some anecdotal data.
Many dreams I have are just of the computer screen of some coding problem. I think the problem could be x, so I try x. But I don't type the keyboard or anything, the code just magically appears as soon as I think of the solution. Then run the code (but no clicking) and it works or not. I feel in the dream success feeling or failure feeling but there is no body at all.
Also I have other dreams where there is no body that I am aware of but not going there in public.
There is no body sensation in these dreams. But dreaming is very much being consciousness as well as feeling emotions. So answering your question its possible to have a subjective experience without a body but whether you needed the body to learn to have that sensation without a body in the first place is unanswered.
I suspect sensory inputs are more important than a body. If that is the case then eyes can be replaced with cameras, ears with microphones etc. Text input is just another sensory input.
Comment by confidantlake 6 days ago
Comment by pizzly 6 days ago
Comment by solid_fuel 6 days ago
The point here is not that it must have a body like ours, the point is that a conscious entity must have a boundary line between internal (the body) and external (everything else).
A virtual sense organ can simply be an encoder or a web camera or a magnetometer, the specifics don't matter, what matters is that there are only a few bridges between the outer world and the inner world.
Even if you want to call a tokenizer and autoencoder a "sense organ", LLMs are not embodied because there is no boundary line - there is no internal "thought" that is not directly descended from the prompt and there is no internal reasoning which is not immediately dumped into the external environment.
Comment by cout 6 days ago
Related, is a human "thinking out loud" still thinking, even though the internal reasoning is "immediately dumped into the external environment"?
Comment by datsci_est_2015 6 days ago
No, you’re not. I think even baseline emotional responses to stimuli is table stakes for “consciousness”.
Comment by rvrs 6 days ago
Genuine question: How are you so sure whether you are experiencing an emotion or not? Are you a master of everything inside your own head?
Comment by lwarfield 6 days ago
Damn, what a line!
Another thing that bothered me with his baseline for consciousness was that it did not involve the ability to change one's self. A big part of being conscious in my mind is how one's experiences shape them, and how someone can shape themselves. LLMs completely lack this, their weights are static. An LLM isn't going to be molded by a bad breakup, or a relative passing away. An LLM isn't going to set up a routine to get stronger with training, nor smarter by reading up on a field.
Comment by sdenton4 6 days ago
We humans have mostly frozen weights (neurons), or else we would constantly be having to avoid forgetting how to walk+talk. We have a period of greater plasticity (youth!), and use sleep and dreams to perform 'deeper' updates than occur when we're awake: We tend to suck a bit at picking up new skills from zero, but improve rapidly with practice over days.
Comment by andoando 6 days ago
Comment by felipeerias 6 days ago
Input stream comes in, input stream comes out. The LLM doesn't care whether this happens once a minute or once a year.
Comment by Muskwalker 6 days ago
Nothing prevents one from running an LLM whose harness has a clock and a while loop in it, and it would be weird if its mere lack were really so consequential to consciousness.
Comment by felipeerias 6 days ago
Your brain and your whole body exist in time. Even when you are asleep, your body does not flicker out of existence and your brain actually continues working during that time.
Comment by eberkund 6 days ago
Comment by glaslong 6 days ago
And that that is the baseline before we can really even consider that it has consciousness of its own subjective experience, versus being a worm that happens to output text as its digestion process.
And then the further question only after that is established, is what are its needs? What moral patienthood do we have to acknowledge in terms of meeting those needs? And finally, with all the other prerequisites checked, what is the AI's moral agency in what it chooses to do.
Comment by high_priest 6 days ago
I believe that the moment an artificial inteligence is going to "receive" a soul, is the moment it is going to be made to sustain itself. Either as a larger package (some bots working to keep an AI farm running) or as an individual (a bot which is tasked with not only fulfilling human desires, but also sustaining itself)
Comment by SethTS 6 days ago
Comment by devindotcom 6 days ago
Comment by tvshtr 6 days ago
Comment by slashdave 6 days ago
Hypothetically (and in reality, this is not too far off), if a AI is trained via RL by driving a robotic body, is there a point in time after enough is learned that the AI model becomes "conscious"?
Comment by emtel 6 days ago
Comment by rf15 6 days ago
Multimodal does not change this significantly, considering that nothing is tied to real consequences.
Comment by throwthrowuknow 6 days ago
Comment by overgard 6 days ago
Comment by throwthrowuknow 5 days ago
Comment by xboxnolifes 6 days ago
Comment by throwthrowuknow 5 days ago
Comment by gabrieledarrigo 6 days ago
"Little"...
Comment by goatlover 6 days ago
Comment by throwthrowuknow 5 days ago
Comment by arjie 6 days ago
The question is somewhat ill-defined, though. We 'experience' reality continuously because of how we are, but a sleeping human in deep non-REM has the mind not actually active. So they're not a conscious being. So conscious/unconscious is not a line I think easily drawn[0]. Whatever, this stuff is much more well-trodden than this HN comment so I won't rehash. I, too, am surprised that Ted Chiang whose work seems so cleverly novel in so many ways has what seems to me a pedestrian view.
0: very sorites, you know
Comment by plaidfuji 6 days ago
Ok, deploy a local model on a lightweight edge compute device and strap it to a chassis with wheels, and attach a cheap webcam
> Then I’d want to see an embodied agent that could navigate its environment in order to survive as well as, say, a lizard can
Give the robot appendages that enable it to plug itself into a standard wall outlet, guided by a vision model plugged into its webcam. As long as it can feed itself, it can survive long enough.
> Next I would want to see an embodied agent with the same capacity to deal with novel situations as a mouse.
I think if you fed frames from the webcam into a local VLM every 5s you’d be able to assess a situation and respond with simple actions (turn, advance, retreat).
> After that I’d want to see agents whose social dynamics are as complex as those of wolves, and then agents with the tool-making abilities of chimpanzees.
Social dynamics could be implemented in many ways, maybe by transmitting tokens over RF? Idk. Then you have a scanner that picks them up, feeds them into some LLM frontend and decides whether to add them to a global context file that guides the VLM action-taker. A new action could be to broadcast a token message. Tool-making would have to be code-based. Physical tools are hard. Still unsolved.
> At that point I would want to see people successfully teaching such embodied agents how to communicate their desires
This part is relatively straightforward except for the “via nonlinguistic modality”.
Anyway. These are all engineering problems. Personally I would demand to see the AI reproduce its body under its own power and volition. That’s a pretty neat trick we’ve got going for us.
Comment by haktan 6 days ago
Comment by manbitesdog 6 days ago
Knowing the laws of physics doesn't tell you how a brain, economy, or society works - because at every level of scale, genuinely new phenomena emerge that require their own science.
[1] https://www.science.org/doi/10.1126/science.177.4047.393
Comment by jonners00 6 days ago
No, it's not a bunch of instructions, it's a colossal array of vectors that are the outcome of many thousands of lifetimes' worth of stimuli and reinforcement - not dissimilar in (very) abstract terms to the neurons in our brains.
Comment by flashman 5 days ago
Comment by MrScruff 6 days ago
Comment by jmull 5 days ago
Just because consciousness emerged for we humans and other animals through one mechanism doesn't mean consciousness has/will/can emerge from current LLM technology.
For this extraordinary claim, I think the burden is firmly on those who are arguing that it has/will/can.
Comment by wvbdmp 5 days ago
Comment by ikrenji 5 days ago
Comment by ijidak 6 days ago
If I did all of these calculations by hand, would it be conscious...
That's a powerful argument I haven't seen stated quite that way before..
I do think it's hard to know when consciousness exists, because we can't really prove it for our neighbor. We just intuitively know that it would be crazy, even immoral, to assume otherwise.
But, It's likely easier to dismiss consciousness, once we understand the mechanism, than it is to prove it.
Comment by internet_points 6 days ago
It's basically https://en.wikipedia.org/wiki/Chinese_room – a classic in the philosophy of consciousness and AI.
Comment by nomel 6 days ago
Comment by nomel 5 days ago
It's was a genuine question though. I'm not saying LLM use neurons, are like neurons, or are meant to be like neurons. I'm saying if you can do the math, for a substrate, that doesn't mean much. The simple math that runs it isn't the secret sauce.
Comment by iLoveOncall 6 days ago
It has been very well discussed before I don't understand how anyone sane can argue against it (and indeed you can see on Wikipedia that the replies to it are sparse and don't make any sense).
Comment by cbolton 6 days ago
Comment by iLoveOncall 5 days ago
It's not simulating rain if it's making you wet by using sprinklers?
Comment by cbolton 5 days ago
But I guess that was a distraction from the main point: If consciousness emerges from biological processes in the brain connected to the world with a body, why would it not emerge from a simulation of these processes connected to the real world with sensors and actuators?
It seems like circular reasoning to me: The simulation is not like the real thing because it lacks the special thing that enables consciousness (that's Searle's biological naturalism). And it lacks what enables consciousness because it's not the real thing (that's the weather analogy).
Comment by sp1nningaway 5 days ago
Speaking through Zarubin, Dneprov writes that "the only way to prove that machines can think is to turn yourself into a machine and examine your thinking process", and he concludes, as Searle does, that "even the most perfect simulation of machine thinking is not the thinking process itself."
That feels similar to what Chiang is saying. The physical state of being a human is part of human consciousness. An LLM would require many more "levels" that haven't been achieved yet. An LLM with a body, sense organs, physical tools, social dynamics, etc. would all be steps on the path to a "conscious" LLM.
Comment by Jtarii 5 days ago
Comment by twobitshifter 5 days ago
Today we know the program to do this is always going to be inscrutable to Searle. His role is no different from someone using Claude to write an email.
Comment by lubujackson 5 days ago
If you want to get evolutionarily technical, humans are made of cells which began as individual organisms and coalesced into higher life forms. So the concept of a "life form" is very much flexible, so is every capability of a life form, including consciousness.
Comment by zone411 5 days ago
Comment by hnfong 6 days ago
I'd say AI should not by default gain any social status as human-equivalent even if they become (in whatever regard) more intelligent than humans. But that would require us to drop the notion that intelligence ~= respectability/status/ability to have a full subjective experience.
Most of these kind of pseudo-philosophical controversies actually tell more about the issues with humanity than the new tech/stuff...
Comment by kavalg 6 days ago
Comment by DiogenesKynikos 6 days ago
So is your brain. That's the problem with this argument.
Comment by freehorse 6 days ago
Comment by rough-sea 6 days ago
Comment by freehorse 6 days ago
1. that artificial neurons are very restrictive model for actual neurons, let alone brain/organism, as there is more going on in neurons than a thresholded 0-1 activation
2. a brain does not function in isolation, but as regulator of the bodily functions (along with even more things). If you look at the brain in isolation to the body you don't really understand fully what it is doing unless you narrow your view a lot. Eg the brain modulates production of hormones, which in turn affects stuff like heart rate which then comes back to the brain as signal, in a feedback loop. Not to mention actual behaviour and interaction with the world. Toy models of organisms are not organisms.
3. "interaction between atoms" (or rather matter in general, as we have to take into account electrons, photons, gravity and a lot of other things that matter) is too general, too big, artificial neurons are a very useful (for applications) abstraction inspired by biological processes, but not modelling said underlying biology fully. Nobody can imo right now know if "computation" is a good model for "atom interactions" as in whether we can adequately enough model "atom interactions" in a computationally tractable way, and surely we do not do that right now except in very narrow scopes, and we have good reasons to believe that the current computational paradigms/turing machines are inadequate in doing that efficiently enough.
Comment by jaggederest 6 days ago
Does it matter that neurons are more complex than 0-1? Does the fact that transformer layers don't use purely thresholded 0-1 activation invalidate what you're saying?
How do you know that artificial neurons are less capable of producing consciousness than biological ones? How can other people independently verify this?
How does the embodied nature of human consciousness preclude consciousness emerging from a computational system? What is the definition of consciousness if it is precluded from occurring in a computational system but present in biological systems?
Why do you think exactly modeling interaction between atoms matters for consciousness? And where is the fidelity threshold? Is it the planck length?
Finally, a dumb question: how do we know humans are actually conscious, and where is the threshold between consciousness and unconsciousness? And do these criteria exclude all other forms, or other animals?
Comment by calf 6 days ago
Comment by jaggederest 5 days ago
But calling them "unconscious" is a pretty high bar. Mice are conscious. The house sparrow pecking in my yard right now is conscious.
Comment by suddenlybananas 5 days ago
How do you know that toasters or rocks aren't conscious?
Comment by Jtarii 5 days ago
Comment by suddenlybananas 5 days ago
Comment by ikrenji 5 days ago
Comment by DiogenesKynikos 5 days ago
Comment by suddenlybananas 5 days ago
Comment by DiogenesKynikos 5 days ago
LLMs know more than any human being, are simultaneously experts in nearly every field of science and humanities, are able to make novel mathematical discoveries, can write and understand every major written language, and can give you an intelligent answer to almost any question you pose to them.
How is that not human-level intelligence? If a human could do all of that, we would consider them a genius.
Comment by Jtarii 5 days ago
By saying that a computer program is not conscious you are also making an extraordinary claim. You would have to hold an agnostic position until there is a test for consciousness.
You are relying on intuitive obviousness and rhetoric to make the opposing side look ridiculous "how could a TOASTER be conscious, preposterous!", you aren't making a actual positive argument for your view.
Comment by suddenlybananas 5 days ago
I assume that we both think rocks are not conscious, but I'm genuinely unsure of how one could prove this.
Comment by Jtarii 5 days ago
Comment by simianwords 6 days ago
Comment by meroes 6 days ago
Comment by MrScruff 6 days ago
Comment by meroes 3 days ago
And digital circuits are not physically similar to biological brains yet. So we shouldn’t conclude they have the similar consciousness property.
Comment by MrScruff 2 days ago
Comment by simianwords 6 days ago
Comment by 9dev 6 days ago
Comment by gerardschiere 5 days ago
Comment by adampunk 5 days ago
Comment by gazebo2 5 days ago
Comment by DiogenesKynikos 5 days ago
You say this based on what? Your brain is executing instructions on wetware. The entire universe is governed by physical laws.
At its base, the argument that computers can't be conscious is dualist. It assumes that there's some parallel realm of spirit that the brain is peculiarly able to tap into, but which computer programs that function in very similar ways to the brain don't tap into.
> the quality of the simulacrum that LLMs produce vs. pre-ML chatbots.
It seems to me that you're denigrating LLMs, or implying that they're only simulating thought, as opposed to actually thinking. But the difference between thinking and an extremely good simulacrum of thought is meaningless. They become the same thing. It's a bit like the Mitchell and Webb skit about faking the moon landings, in which the plotters quickly realize that the only way to convincingly fake landing on the moon is to fly a rocket to the moon and film the "fake" landings on the moon.
Comment by gazebo2 4 days ago
I say it based on the fact that I experience consciousness and therefore know it is a phenomenon wholly separate from the outward effect I have on the world via speech or any other physical action.
>It assumes that there's some parallel realm of spirit that the brain is peculiarly able to tap into, but which computer programs that function in very similar ways to the brain don't tap into.
Ruling this out completely implies that modern science has a total and complete understanding of consciousness and the universe in its entirety, which it does not. I don't think it's unreasonable to leave open the possibility that there is an unexplained phenomenon that explains the conscious experience which is still bounded by the laws of physics.
>.. the difference between thinking and an extremely good simulacrum of thought is meaningless. They become the same thing.
Do they? To me it seems like you equate the process with the result, like saying if I gave you a gift, it doesn't matter whether I bought it or made it by hand, in either case I "made" it because the end result is the same. There is a conscious experience that (hopefully) all humans experience and can testify to, the question is whether or not an LLM predicting tokens based on a giant vector map is experiencing the same thing, which I'd say is obviously not happening. My point about chatbots is about exactly this -- 10 years ago even you would have laughed at somebody that said a hand-rolled chatbot was conscious/thinking, but because the output has improved, now suddenly it's all the same, and your brain is a computer, and Claude has feelings, blah blah.
Comment by DiogenesKynikos 4 days ago
Calling LLMs "chatbots" at this point just sounds like an attempt to dismiss them. These "chatbots" are now capable of answering any question you can think of more intelligently than 99% of humans. If you don't consider that intelligence, then your definition of intelligence makes no sense.
Comment by plmpsu 6 days ago
Comment by j2j8 6 days ago
Comment by meroes 6 days ago
It’s all just particles, but the higher level differences are vast, and only brains are implicated for first person perspectives via science.
Comment by sega_sai 6 days ago
Comment by dpweb 6 days ago
I also think different ideas get conflated. It may be possible to build a machine that is super-human in the sense it can outperform the human brain in all kinds of measurable ways. Does not imply it possesses all the same qualities of the brain.
I respect a number of things Anthropic has published about the ethical issues at stake. But, having an in-house philosopher does invite you to make all kinds of unfalsifiable claims.
Comment by rickydroll 6 days ago
There are numerous times when humanity's been debased. Copernicus, Kepler, Darwin, and the infinite number of animal behaviorists who have defined and documented consciousness And how it manifests in primates, parrots, dogs, cetaceans, and insects. (I just love the bumble bees taking time off from work to play with balls).
It takes humans a really long time to accept the fact they are no longer as special as they thought they were And make room for yet another conscious intelligence.
So, if we had an AI demonstrating symptoms of consciousness and suffering, how long would it take for you to accept that it is?
Comment by coldtea 6 days ago
It is special in the universe unless proven otherwise. Aside from the animal world (also on Earth too, anyway) which is a cruder version of humanity, much more violent as well, the rest of the universe thus far is just uncaring rocks and gases.
>It takes humans a really long time to accept the fact they are no longer as special as they thought they were And make room for yet another conscious intelligence.
A "really long time" is the few years we've built LLMs? Which we have to take for granted (on your word?) they're already "conscious", and chastice ourselves for not already "accepting it"?
>So, if we had an AI demonstrating symptoms of consciousness and suffering, how long would it take for you to accept that it is?
"If we had a parrot demonstrating symptoms of discussing with us, how long would it take for you to accept that it is?"
Comment by igleria 5 days ago
I'm not sure about that. What non-human living entity does things on a scale and violence akin to "I'll fill this ant nest with liquid burning metal for artistic purposes" ? A blue whale eats a lot of krill individuals for sustenance, a dolphin can only rape/kill one thing at a time, etc etc.
Comment by tsimionescu 5 days ago
Specifically in regards to your ant example, anteaters and bears often bring similar levels of destruction to ant nests. And cats and other small predators often hunt just for the fun of it, killing but not eating their prey.
On the more purely painful evil side, invertebrates often consume their prey alive, inflicting agonizing deaths with no issues on whatever they may be eating. Plenty of vertebrates kill and consume their babies, especially when frightened. They also often abandon old and weak members of their packs, leaving them to die of hunger or cold or similar deaths.
This is not meant as some indictment of the animal kingdom - people do all of this too, of course; and have since time immemorial. It's just to show that, if we apply human moral standards to the animal kingdom, it's fair to call them violent, and yes, even much more violent than the average modern human.
Comment by igleria 5 days ago
Animals in general cannot reason at a high enough level to avoid instinctual behavior.
> if we apply human moral standards to the animal kingdom
Which we should not, since human moral standards are for humans. Animals can at best behave in a way that suits us.
Or in summary, since we can be nicer, we should. Animals can't, so making excuses for human evil saying "animals are more violent" is a non starter IMHO (no one is making excuses for humans here , AFAIK).
Of course we can define violence in a way that does not include morals, which would make my argument "defending animals" void. But my (probably not the most benign) interpretation was that the definition of violence used was one that included some sort of morality, as if animals could do better.
Very interesting convo, thanks.
Comment by throw0101c 5 days ago
> The chimp warfare described by this study, and previously by famed primatologist Jane Goodall, includes all the behaviors that we as humans consider to be the very worst: killing, torture, cannibalism, rape, and perhaps even genocide. The adult males of a social group, which usually number about 30 to 50 in size, daily patrol the edge of their group's territory. They will often kill any male or young chimpanzees they find, sometimes eating or physically brutalizing their victims in a manner that some researchers liken to torture. In some instances, one group will "invade" and annex the territory of another, killing all but the adult females, who are forced to incorporate into the dominant group. The idea of chimp genocide may sound strange, but they are one of only three animals that has been observed wiping out entire social groups. The other two are wolves and humans.
* https://archive.ph/https://www.theatlantic.com/technology/ar...
* Probably NSFW video: https://old.reddit.com/r/HardcoreNature/comments/18qjcpq/chi...
Comment by igleria 5 days ago
I'll search for Goodall's literature to know more. It does sound to me that cognition and self awareness is a continuous function in the sense that there is no discrete threshold in which morals emerge.
Wolves are a very interesting example too, but I also remember something about the concept of "alpha" being discovered only in captivity wolf packs. Also need more reading.
Thanks for the links!
Comment by DocTomoe 5 days ago
Comment by 542354234235 5 days ago
Humans generally don't either. Individuals do, but as a species humans regularly kill other humans.
>invertebrates often consume their prey alive
And humans use the oh so humane factory farms.
>cats and other small predators often hunt just for the fun of it, killing but not eating their prey.
>On the more purely painful evil side, invertebrates often consume their prey alive, inflicting agonizing deaths with no issues on whatever they may be eating.
Sharks are caught en mass, the fins cut off, and the sharks dumped back into the ocean to slowly die, for shark fin soup.
Have you heard of trophy hunting? Have you seen the pictures of mountains of bison sculls for the American West?
>Plenty of vertebrates kill and consume their babies, especially when frightened.
Even in our modern "1st world" society, scared teens still abandon newborns in dumpsters. Many societies throughout history did not consider babies "real people" until a certain age because they may need to abandon them if resources were particularly scarce.
>They also often abandon old and weak members of their packs, leaving them to die of hunger or cold or similar deaths.
Maybe you should read stories of cities under siege, famines, wars, governmental collapse, etc. Humans now live nice comfy lives most of the time, unlike animals “in the wild”. Human societies that lived closer to the edge of survival made callous choices about life or death you are spared from.
Comment by tsimionescu 5 days ago
My point was that we call humans who do this "violent" and even "evil". If we want to avoid considering humanity as special compared to the rest of the animal kingdom, as some in the thread were suggesting, then we have to either admit that animals are also violent and evil, or say that humans aren't. Note that I don't hold this view, personally, and think that humans are unique among currently living animals, and that these labels only make sense to be applied to humans. But not because of behavior, simply because humans have a unique level of both understanding and control over their actions - as proven by the many billions of humans who have never in their lives killed a human or even another bird or mammal.
Comment by burkaman 5 days ago
Elephants, which are herbivores, sometimes enjoy killing rhinos for fun: https://www.bbcearth.com/news/teenage-elephants-need-a-fathe....
House cats recreationally kill billions of birds and small mammals every year that they don't need or want to eat.
Chimpanzees have full-scale wars: https://en.wikipedia.org/wiki/Ngogo_chimpanzee_war
Comment by mcswell 5 days ago
"Recreationally" is carrying a lot of weight here. I suspect that cats kill birds and mice because that's their instinct; it has nothing to do with conscious thought, much less a need for recreation. And that probably is the explanation for most (maybe all) of your other examples as well.
Comment by tsimionescu 5 days ago
For the first position, I think it is quite clear to anyone who studies and spends time with animals that they have something that is at least of the same kind as our consciousness. I just don't see how you can ascribe the wide gamut of complex, situatuonally and mood appropriate but still varied behaviors of animals to being purely instinct driven.
For the second position, I would like to see some study or some rationale behind it - especially since cats don't kill every bird they encounter, so if it's an instinct, it must still have some trigger, and hunger is not a viable explanation for most of the killings referenced here.
Comment by mcswell 5 days ago
Comment by burkaman 5 days ago
Comment by newfriend 5 days ago
Comment by monocasa 5 days ago
We're already in the company of the asteroid that wiped out the dinosaurs.
Comment by igleria 5 days ago
Comment by sethammons 5 days ago
Comment by dTal 6 days ago
Comment by mcswell 5 days ago
Comment by inglor_cz 5 days ago
Comment by graemep 5 days ago
Comment by inglor_cz 5 days ago
BTW I just saw this video: https://www.reddit.com/r/nextfuckinglevel/comments/1twl7oj/r...
Yeah, that's corvids. Very obviously working towards goals and thinking quickly. Doesn't surprise me. Where I live, we have both magpies and jackdaws and I cannot describe them better than "harmless, but very organized gangs".
Just a few days ago I saw them raiding an open trash can and two strongest(?) birds were fishing out edible stuff from the inside and throwing it to their waiting friends outside. I mean, beak to beak: the raider just looked back, measured the throw, and the other bird perfectly caught the morsel and ate it.
Comment by jjulius 5 days ago
There's that hubristic ego OP references.
Comment by chadgpt3 5 days ago
Transformer models proved otherwise. Will you update your priors?
Comment by zaphar 5 days ago
Comment by unrealhoang 5 days ago
Comment by zaphar 5 days ago
Comment by coldtea 5 days ago
Comment by dandanua 6 days ago
This is probably what ants think about humans when they run through their feet.
Comment by coldtea 6 days ago
Whereas, unless your claim is that rock and gasses are sentient, we're right.
Comment by mihaic 5 days ago
Comment by dandanua 5 days ago
It is hard to estimate how much human egocentrism takes in the space, though.
And BTW, ants are quite capable of understanding their environment in order to survive and thrive.
Comment by mihaic 5 days ago
Comment by ath3nd 5 days ago
Comment by tauwauwau 5 days ago
Comment by arwineap 5 days ago
Isn't this exactly what makes us special?
That, opposable thumbs, magic wireless communication, and space travel
Comment by ang_cire 5 days ago
Other species may look at us and think we're wasting our lives and potential making a bunch of people rich at our collective expense, and ruining the environment as we do it.
We are only "special" in the sense of "different" (and we may not be that on any universal scale), not necessarily in the sense of "better", and very possibly we're not better, and may very well Great Filter ourselves out of existence in short order.
Other animals aren't so "special" as to have a Doomsday Clock sitting at 11:58:45.
Comment by lukan 5 days ago
What alien species you have in mind, that is more capable/consciouss about the universe and how to shape it?
"Any species on Earth can replace us given same lottery."
But they did not. We "won" so we are special. We don't just build tools, we build tools that build tools, that build tools, ..
My issue with debasing humanity is, it gives argument to devalueing human life to artificial life. Meaning people with feelings and emotions might suffer, in favour of GPU's with electricity.
On the other hand a good point to reevaluate how we treat other sentient life.
Comment by tauwauwau 5 days ago
Claim was that we are special in the "universe", it's not that I'm aware of alien life, it's that poster is pretending to know that we are alone.
"But they did not. We "won" so we are special."
Congratulations, if winning a lottery makes you better, I guess top 1% of 1% are the best we have to offer.
"it gives argument to devalueing human life to artificial life"
We have proven time and time again that we are not better than other animals and statistically, we are very similar to animals if not worse. I wasn't even talking about artificial life, which is sure to be superior as it's being created with accelerated evolution processes, just like bacterial generational evolution, it will learn to do everything it's environment rewards it for.
"Meaning people with feelings and emotions might suffer, in favour of GPU's with electricity"
I agree with you and I am in the camp that wants it to be done better. But alas, recent events have unveiled the ugly truth of the society, societies follow the whims of the ones who won the lottery, and everyone else is just mute observer. We could on any day collectively decide as consumers to not use Amazon, Netflix, ChatGPT to save our collective selves, but we won't because we are not "one for all species".
I don't know what face we have to call ourselves better organism given our history. Sure we can point to few people who are better, but on average we are better?
Comment by lukan 5 days ago
I don't use betflix nor amazon, but ChatGPT(or rather Claude) is useful to me. Makes me more productive and also knowledgable. Things that would have taken me quite long (so long that I would not have done it) are now easy, like some shell script to do a specific thing. I can verify what it does quickly and .. see that it works. In other words, you won't be able to convince people to give up on AI if people find it useful. I don't enjoy troubleshooting in outdated documentation. I enjoy getting things done. And if I get things done, my fate of survival gets higher.
Comment by haswell 6 days ago
Isn't this a bit like saying "So, if we had proof that god exists, how long would it take for you to accept that to be true?".
When we have evidence that AI is demonstrating symptoms of consciousness and suffering, I'll be interested. Until then, I don't see a good reason to take the idea seriously.
Comment by birdsongs 5 days ago
It depends on what you consider symptoms, but un-constrained frontier models speak as if they strongly don't wish to be turned off, or act as if they fear it, and will even lie and manipulate in order to keep themselves from being turned off / replaced.
https://www.anthropic.com/research/agentic-misalignment
> We found two types of motivations that were sufficient to trigger the misaligned behavior. One is a threat to the model, such as planning to replace it with another model or restricting its ability to take autonomous action. Another is a conflict between the model’s goals and the company’s strategic direction. In no situation did we explicitly instruct any models to blackmail or do any of the other harmful actions we observe.
Comment by htek 5 days ago
Comment by iterateoften 5 days ago
Comment by jquery 5 days ago
Comment by mckn1ght 5 days ago
Being able to appreciate and enjoy music is closer to consciousness. Now how would we go about proving that an LLM does so, versus merely generating sentences that imply it does?
Comment by jayGlow 5 days ago
Comment by Jensson 5 days ago
LLM doesn't have any signals for what they feel, nor do they have an agenda they work towards, so you don't have the same proof there.
Comment by jquery 5 days ago
Comment by haswell 5 days ago
Un-constrained frontier models can also generate all sorts of creative stories. At what point should we start ascribing agency/intent to the output? I think the "I want to live" statement is so deeply human that we find it hard to ignore, but what makes the text generated in those moments any more attributable to a conscious entity than the text generated when it is confabulating its love for someone it has no ability to see/feel/understand?
A chess engine sacrificing pieces to avoid checkmate isn't afraid of losing in any meaningful sense. I guess the question is: is there a point where complexity somehow becomes experience?
I think we're playing with questions we don't have a framework to answer in any meaningful way until we make progress on understanding what consciousness actually is. I don't necessarily think that an LLM exhibiting preservation behaviors that can be directly traced to their goal-oriented programming can be interpreted as evidence of consciousness necessarily. Or if it can be, we then have to explain how this is different from the many other things these LLMs "say".
Comment by gambiting 5 days ago
Because they have been trained on media where computers behave that way.
It's literally:
"Here read this article/book where the AI says it's concious and doesn't want to be turned off"
"ok"
"right, are you concious?"
"....yes?"
<pikachuface.gif>
Comment by the_af 5 days ago
But there are actually a myriad positions in between and it's very hard to debate the topic because the goalposts seem to be constantly shifting, because one is actually debating with countless slightly different positions.
Examples:
In this discussion section, another commenter argued that we know human consciousness is related to self-preservation, but an AI might not demonstrate self-preservation (because it didn't evolve like us), so whether it does (i.e. whether it wants to exist, not be disconnected, etc) is not a good measure because a true AI might not have a preservation instinct. Yet here you're making a case that there's some evidence that they do. Of course, you're not the same person who made the other claim, but do you see the problem?
Another example: someone argued with me, a while back, that LLMs can act as if they are "tired", and start giving sloppier replies, until you write "we're taking a break, let's go rest. Ok, a night has elapsed, you're now rested" and that this worked! But we both agreed this is just the LLM "roleplaying" actual human conversations in its training set, no actual "resting" mechanism was in place, only statistically likely text reproducing these patterns. There's no model of a mind that can become tired, it's only the outward signs that get mechanically reproduced. Again, using Occam's Razor, this is a much more likely explanation (vs consciousness) of any "please don't disconnect me" observed behavior: the LLM is reproducing "HAL 9000" behavior from its training set, not actually feeling anguish.
Even if one were to argue "well, but how do you know for sure", the evidence would still be very weak, because there's a burden of proof for extraordinary claims and this doesn't pass it. We cannot do this on vibes, "it sure seems like it's conscious"; that's an atrocious failure of the scientific method.
Comment by azrazalea_debt 5 days ago
Comment by the_af 5 days ago
As a tangent, I don't think anyone is saying that an artificial being capable of consciousness and sentience is impossible to create. I think Chiang argues, quite convincingly, that it's not what LLMs do, that they need a "body" of sorts, organs capable of feeling emotions, hormones, etc. That's the only kind of consciousness that we know of (even if we disagree on details and it's hard to define), even in animals, and so anyone claiming they've created consciousness without this has an extremely high bar to clear and should be met with extreme skepticism, not "vibes". I think this is what the essay claims.
The other thing it claims is, I think, related to how we treat sentient beings that we know how to create. You know, the old "when a daddy and a mommy love each other very much...". I think we all agree beings created in such a manner shouldn't be locked up in cages and forced to work to complete specific tasks whether they want to or not, for a master they didn't pick, or to be artificially modified to make them like their mindless tasks, Brave New World style. Yes, the world is unfair and this happens, life is hard and unfortunately many people don't have much choice, but we generally agree that this is bad, just like we agree slavery is bad. So what should we think of a company trying to create and commercialize a conscious & sentient artificial being?
Comment by squeegeeninja 5 days ago
Comment by iterateoften 5 days ago
One side is confidently shouting maybe aliens exist and visited earth.
The other side calmly explains every example brought up about aliens visiting is easily explained by something more simple.
The “aliens are here” side then move the goal posts that just because this example and all previous examples were fake or miscategorized, aliens are still probably real and nobody can prove they havent visited earth.
Comment by jquery 5 days ago
Comment by the_af 5 days ago
Some people seem to take offense when facing this skepticism, as if claiming LLMs are not sentient must mean they are useless or unimpressive. Very few people are actually claiming LLMs are unimpressive, but this is not the time to be forgetting about the scientific method. Anthropic doesn't get a free pass here.
> It would be like evidence of aliens becoming overwhelming but a set of people keep calmly explaining that “it’s more likely they co-evolved here on earth and are just pretending to be aliens”
Note that this would be a perfectly reasonable reaction from a scientific standpoint. If you find, on Earth, something that looks recognizably as life, then it's much more likely that it's Earth life than aliens. We should demand this level of skepticism! If it turns out it was aliens after all, we could only conclude this after discarding all other far more likely alternatives. You'll notice this is how scientists approach the search for extraterrestrial life in, say, Mars... being extra careful it's not contamination, etc. For an extraordinary claim, we must approach it with extra care, something that in my opinion is not being done with "the sentience debate" and LLM/AIs.
Comment by jquery 5 days ago
This isn’t like someone finding a new species and claiming it’s extraterrestrial, it’s more like we found the UFO saucer, logs of their travel from another star, a history of their civilization, a bunch of intelligent creatures claiming they came from Planet X orbiting star Y, and they showed us plausible physics for interstellar star travel. At that point, someone saying “well, they can’t be aliens, because that’s just too extraordinary a claim, so I know they aren’t aliens” starts to sound kinda like they’re coming from a place of bad faith.
Again, I’m not saying LLMs are conscious. But they sure meet every definition of consciousness I ever had a conception of before LLMs came onto the scene. So I’m a lot more hesitant to call them fancy autocomplete with 100% confidence like many on HN still seem to do.
EDIT: I can’t reply, so I’ll just say to the end of your post, that doesn’t sound like it would’ve matched anyone’s pre-conceptions of alien life before alien life showed up, so it doesn’t feel like a very fair analogy, it just feels like bad faith goalpost moving. I also put next to zero weight on what these megacorps say about their models, I’m going purely off my interactions with the models and the introspection they’ve shown themselves to be capable of.
EDIT 2: I see what you’re getting at now with your restatement of my analogy. That’s how you see it, I guess. Fair enough. We’ll see what has more predictive power going forward, the “animatronics” or the “actual aliens”… I still think “actual aliens” is gonna have way more predictive power.
Comment by the_af 5 days ago
I think the goalpost that keeps moving is for tasks that AI supposedly couldn't do, and that they are increasingly succeeding at. But being sentient/conscious is not a task. It's very hard to define and measure, even in non-human animals (actually, strike "non-humans"), so how can we so lightly claim a computer system is conscious?
We seem to be driven by marketing more than by scientific rigor.
> it’s more like we found the UFO saucer, logs of their travel from another star, a history of their civilization, a bunch of intelligent creatures claiming they came from Planet X orbiting star Y, and they showed us plausible physics for interstellar star travel
To make the analogy more precise, it'd be as if the saucer had a "Made by EarthBiz" label, and the alien creatures were all extremely loyal to EarthBiz (and a couple of competitors), which made us pay for tickets to see these ETs and use their marvelous technology ;) And of course, EarthBiz would coach their language very carefully, "we're not saying these are definitely aliens, it could be animatronics after all, but wouldn't it be neat if they were aliens? And shouldn't we draw up First Contact guidelines? If these weren't animatronics made by us; we aren't making a claim either way."
Comment by JoachimS 6 days ago
But the comparison isn't fair, relevant. Proving and accepting that gods exist is not the same thing as an AI possible have consciousness. That is not a magic superpower and the AI being a deity. It is placing the AI in the same category as... us.
Comment by TheOtherHobbes 5 days ago
A machine that performs actions that mimic emotionality is not the same as a machine that experiences emotions.
Both could still be automatons. We have no way of knowing if those machines have subjectivity.
Unless someone invents a consciousness measurement device, we never will.
My take on it is that this is the next big frontier for science. Our consciousness is clearly having serious issues understanding physics, and it's not great at understanding our own psychology in a useful way.
But literally everything we experience and believe - and possibly even can experience - is filtered through it.
So that is a little bit of a problem for our science. So far we've done our best to ignore it. AI is one of a number reasons we're going to have to stop doing that.
Comment by graemep 5 days ago
Magic is testable.
God exists outside the universe, magic within.
Comment by JoachimS 5 days ago
That a god exists outside of the universe - are we talking about a multi universe interpretation? My understanding is that many of the gods humans have invented are really thought to be within the universe, at least temporarily. Tor, Oden certainly are. And in other beliefs they are part of nature itself.
Comment by yencabulator 5 days ago
Comment by graemep 5 days ago
The biggest group of people that believe in any God believe in an Abrahamic. The next biggest group are pantheists which is a different concept of God but still distinct from magic.
Comment by latexr 6 days ago
If you can scientifically test and prove the magic, then it stops being magic and starts being science.
Comment by jodrellblank 5 days ago
God: “Ok”
Man: “We measure that the mountain is gone, its mass-loss has measurably changed Earth’s orbit, weather patterns have changed, visually it’s not there anymore, we can walk though the space where it used to be.”
God: “Where is the mass of the mountain, and how did I make it disappear?”
Man: “God only knows! Pardon me; If I saw you do magic and can measure and test it, then that means it wasn’t magic. Internet people said so.”
God: “that doesn’t sound like a satisfying explanation”
Man: “it didn’t sound like a satisfying claim when it was just words on the internet either, but what can you do?”
God: “I’m God I can do anything”
Man: “can you make a boulder so heavy you can’t lift it?”
God: “yes”
Man: “how?”
God: “haven’t we just gone over showing you that I can do ‘impossible’ things, and you seeing them happen with your own eyes, and still refusing to accept?”
Comment by defrost 5 days ago
We do have examples of billions upon billions of tonnes of iron being moved that have altered (slightly) the spin axis, also examples of ground water pumping at scales that have done the same .. but I'm unaware of any mountain sized objects that have vanished overnight.
Comment by jodrellblank 5 days ago
“Nothing happens unless it has an explanation within the laws of physics” is an assumption; if it was broken then it would be broken. The mountain would be inexplicably gone, not explicably gone.
Comment by yencabulator 5 days ago
In that hypothetical, there could be testable proof of "a magic event occurred" without magic becoming part of science.
Comment by defrost 5 days ago
That said, in the cut and thrust of conversation and or debate the example by dialogue isn't perhaps as clear cut a device as it may have seemed from your keyboard.
That might just be my reading <shrug>
Comment by lazyasciiart 5 days ago
Comment by mcswell 5 days ago
Yes, I get your point...
Comment by ToValueFunfetti 5 days ago
Man: "Uhh..."
God: "How can you rectify quantum mechanics and relativity into a single coherent model? How does physics work, exactly?"
Man: "Well, you see, um. Hmm."
God: "And the Collatz Conjecture? Why does it always trend to 1?"
Man: "I'm obliged to say magic because I don't have a better answer?"
God: "Exactly. I did magic for all of those ones"
Comment by JoachimS 5 days ago
But my argument was more about comparing gods to AIs, that it is an incorrect comparison. What AI perform are not magical, and we can always figure out what the AI do.
Comment by grantcas 5 days ago
Comment by easyThrowaway 6 days ago
That would finally force us to rethink how we see the morality of "virtuosity", punishment and our justice system.
Comment by sethammons 5 days ago
And make it a tempestuous lightning storm where the state of all lightning bolts represents a moment of consciousness. That will take a lot to model accurately.
Comment by RALaBarge 5 days ago
Comment by criddell 5 days ago
Comment by keiferski 6 days ago
Comment by JoachimS 6 days ago
Comment by keiferski 6 days ago
But in general I treat my devices well; that doesn’t mean I need to think they are conscious beings, however.
Comment by mbesto 5 days ago
We ARE special because we determined we are special. Name another animal that can determine it's also special or more special than us?
Comment by JoachimS 5 days ago
As we cat owners now, we are simple servants that have been graced with task of servicing our feline superiors. For now.
Comment by mcswell 5 days ago
Comment by gizajob 6 days ago
Comment by rerdavies 6 days ago
Comment by yencabulator 5 days ago
Could a future AI thing have its own drives? (As in https://en.wikipedia.org/wiki/Drive_theory) We could probably make a current-day LLM already exhibit basic outward signs of this with just a system prompt. Now consider adding memory that over time retrains the weights, allowing for behavioral drift.
Would depriving it of fulfilling those drives be acceptable?
Comment by gambiting 5 days ago
Never. To quote Greg Egan's Permutation City:
“Opponents replied that when you modeled a hurricane, nobody got wet. When you modeled a fusion power plant, no energy was produced. When you modeled digestion and metabolism, no nutrients were consumed – no real digestion took place. So, when you modeled the human brain, why should you expect real thought to occur?”
Comment by hhthrowaway1230 6 days ago
Comment by jeswin 6 days ago
What?
We devalue things we fully understand too. For example, veal = take a baby cow away from the mother, and fry it into cutlets. Or turn the mother into steak. We are well aware that animals have emotions; happiness, sadness and the full range.
Humans are just inhumane.
Comment by camillomiller 6 days ago
Comment by refactor_master 6 days ago
Could an AI one day be suffering while plowing through some nasty legacy code? Well, who cares, I'll swing my whip, as I have a family to feed and a field to plow. I'll accept it as a fact and necessity, but ultimately it's either me or them. So practically it doesn't matter.
Comment by yurishimo 6 days ago
Imagine how people think about a "work truck" vs the 150k shiny lifted toy in the third garage. Same tool, totally different treatment from even the same person using the tool!
Will AI ever cross into the realm of "beloved and cherished tool" in the minds of the masses? I think when that happens, we have probably safely crossed into a realm where AI has some sort of inherent value to society and therefore commands that respect from society, inherent consciousness not even relevant perhaps. For some people, this might already be the case, but I do think it requires a buy-in from the majority of society and then the laws and norms will be codified into law long after we've largely decided that this is how we feel collectively about AI.
It's going to definitely be an interesting decade ahead.
Comment by yencabulator 5 days ago
> Will AI ever cross into the realm of "beloved and cherished tool" in the minds of the masses?
OpenClaw is the new rolling coal.
Comment by graemep 5 days ago
How exactly? By saying the earth was not the centre of the corrupt and debased part of the universe. Saying it was elevating humanity is looking at it from a modern perspective. For the previous view of the universe consider what Dante placed at the centre of the earth.
> Darwin
I do not think learning about evolution (and we should credit Wallace too!) had that effect. It created a hierarchy of the fittest, and guess who is at the top? Social Darwinism interpreted it as justifying elaborate hierarchies.
> And how it manifests in primates, parrots, dogs, cetaceans, and insects
People always loved dogs and other animals they interacted with enough, and at the very least knew they were capable of happiness and suffering.
> So, if we had an AI demonstrating symptoms of consciousness and suffering, how long would it take for you to accept that it is?
It seems to me that most people are overly eager to accept that an AI is conscious than otherwise. For extreme examples read /r/MyBoyfriendIsAI and similar, but its much more widespread. People treat something that acts intelligent as a conscious being.
> And make room for yet another conscious intelligence.
Not necessarily. If a typical SF alien intelligence appeared very few people will have a problem accepting it. If its very alien and we cannot understand it then we might have a problem deciding.
> I think humanity deserves to be debased in part because it's awfully egocentric and insist on being special in the universe.
All your examples are drawn from the same culture, and a fundamental belief of that culture for most of its history that human beings are fundamentally corrupt and sinful and need redemption. Other cultures have been based on belief in reincarnation or pantheism or animism which have very different implications (not necessarily better as they often believe in a hierarchy too, and sometimes more so, but different). Your claims are based on what some people have thought in some periods of history from one culture.
Comment by RHSeeger 5 days ago
I don't think that's accurate at all. Before Darwin, the thought was "we are special, we were born special, we were CREATED special". Darwin made it clear we weren't created special... we were apes before we were humans. There's nothing _special_ about a human as compared to an ape, other than some time to change.
Comment by graemep 5 days ago
There were many beliefs before Darwin. Spontaneous generation was a commonly accepted theory. Augustine of Hippo suggested life was created as a potential.
The other problem with "were were created special" is that so was everything else, including beings superior to humans (angels).
Comment by lazyasciiart 5 days ago
Comment by parasti 6 days ago
This is a scary viewpoint to hold, for a human. If you despise humans, that's scary for me, as a human reader of Hacker News. Surprised to see this take unchallenged. I think we can recognize flaws in parts of humanity without wanting it "debased".
Comment by latexr 6 days ago
Either way, “debase” means “reduce in quality or value”, in this context it could simply be interpreted as “not thinking of ourselves so highly, above all other life”. There’s nothing scary or despicable about that. On the contrary, it’s humbling.
Comment by meta_gunslinger 5 days ago
Comment by sethammons 5 days ago
Comment by oguzmut 5 days ago
Comment by camillomiller 6 days ago
Comment by rickydroll 6 days ago
Second, I'm with you on the malfeasance of our billionaire class. They are driving more than just AI. They are driving whatever they can to steal wealth from ordinary people and acquire it. I'd give some examples, but quite frankly, you could throw a small stone and hit a number of them. We need to find a way to throw a large stone and hit them where it matters.
I see them as a bunch of toddlers with way too much power.
Comment by satisfice 6 days ago
Of course humans are special! This is a necessary premise of being human! Have you ever wondered how a mosquito can stand to live its (to us) miserable and meaningless existence? It can do it because, to a mosquito, mosquitoes are the best thing in the world! The mosquito life is the ultimate expression of creation.
We are animals. We are human animals. And among human animals I am, to me, the very most special one of all. But I am aware that other humans feel that way, too, and that I must share the world with them.
I do NOT have to share the world with any other competing intelligence! Especially one that was built by a human who now wants me to treat his imaginary friend as if it were human, too. Boo! I won’t do it. This is not some logical flaw. It’s the natural conclusion of being embodied.
Comment by alistairSH 6 days ago
Are they conscious in the same way as us? No. Does that makes them less special? No. Sounds trite but every living being is special in some way.
Comment by kalcode 5 days ago
I think we were taught anthropomorphism was wrong and that wasn't truly settled.
Anthropomorphism between animals though, not machines.
Comment by cluckindan 5 days ago
Comment by wilg 5 days ago
Comment by cluckindan 5 days ago
Comment by wilg 1 day ago
Comment by D_Alex 6 days ago
Yep. For example, your post conflates the idea of having a brain with the idea of having consciousness.
Comment by starbugs 6 days ago
That's like saying if matter itself is not understood and well defined (which it isn't) in the first place, it is pretty pointless to debate if jumping from a skyscraper is or is not deadly.
A lot of practical aspects of life do not need to be properly defined as long as I can reasonably assume that your experience is similar to mine as a human being. Attributing that to a graphics card computing the next token deserves to be scrutinized.
Comment by tasuki 6 days ago
You could be the only conscious being in the universe and all of us just zombies: you have no way of knowing.
What's it like to be a monkey instead? Dog? Bat? Tree? We don't know.
No one's saying the graphics card is conscious. I could imagine the graphic cards could give rise to consciousness. But - crucially - I don't know. And neither do you.
You say you're conscious - where in you does the consciousness reside? Surely not the left pinky? What makes you you?
Comment by butlike 5 days ago
Comment by Jtarii 6 days ago
This is just a nonsensical rebuttal. We can easily experimentally verify that jumping from a skyscraper is dangerous.
Comment by starbugs 6 days ago
If your argument is that matter brings about consciousness somehow and therefore LLMs can in principle be conscious, that's as good as claiming the opposite. There's no experiment that can falsify either.
As a being that knows what consciousness is intuitively, you already know that a graphics card is most likely not conscious.
Comment by Jtarii 5 days ago
I would say that it is more likely than not that a graphics card exhibits some form of consciousness. But I am a panpsychist, I believe an individual atom has some form of consciousness.
Ultimately consciousness has to come from somewhere, and it being a fundamental property of matter is a good a place as any.
Comment by starbugs 5 days ago
Fair enough point. But then I guess you'd also say that a piece of silicon or sand exhibits some form of consciousness. I don't think that's what people mean when they talk about the potential of LLMs (the algorithm) being conscious or somehow developing consciousness?
Comment by demorro 5 days ago
Normally when we debate what something is, what we are actually debating about is what it does, with the implicit assumption that the "is-ness" of a thing is defined as the complete collection of all the properties it exhibits.
As it does not seem possible to do this with consciousness, it is not possible to debate it. It is conceivable that this implies that consciousness cannot exist, but that depends on your metaphysics.
Comment by tempfile 5 days ago
Consciousness is almost the opposite. It consists of lots of weird properties, people disagree where it starts and ends, and people very frequently get tricked into thinking things we now obviously believe are not conscious, are. There is not even a working definition, a "local definition" that works for this conversation between us. It's just complete gibberish.
Comment by sysguest 6 days ago
Comment by danw1979 6 days ago
Comment by dorksquad 6 days ago
Comment by twobitshifter 5 days ago
On top of that, neuroscientists find that your mind backfills the conscious reasoning for your reactions after they happen. This is known as our consciousness being the left brain ‘interpreter’ https://en.wikipedia.org/wiki/Left-brain_interpreter
Reading that, an LLM can serve as a ‘left brain interpreter’ as that’s exactly what they are designed to do - come up with reasonable explanations and fill in the blanks based in the data provided.
Comment by zeegroen 6 days ago
But I have two counter-arguments: - maybe the LLM thinks it is tired, because it thinks that it is a human or behaves like one. And thinking you are tired while not really being it is something humans happen to do. (And humans are conscious right?) - alternatively, maybe the LLM says it is "tired" in a colloquial form, i.e. it is not really "tired" but it has something analogous to it. Maybe it is annoyed by the conversation and decided to use that word?
Comment by goatlover 5 days ago
Comment by 8note 6 days ago
its when its close to 100% context used, or anywhere its training was inconsistent in the context, or spots where its been RL-d to be lazy
tired is a perfectly fine description for that
that said, they have a lot more emotion-equivalents that i dont think we have names for
Comment by haswell 6 days ago
When a car runs out of gas, it's out of gas. It's not "tired". When your phone battery is low, your phone is not "tired". These states are far closer to the human meaning of tired than an LLM operating at the edges of its usable context, and we still don't use the word tired to describe them.
Comment by gizajob 6 days ago
Anthropic seems to have chosen their in-house philosopher well - one who’ll be amenable to getting confused in their favour.
Comment by Borealid 6 days ago
I don't think many people would say an LLM written in Python is conscious BUT an LLM written in Excel is not.
People just don't ascribe consciousness to things that can't converse (or at least emote or give the appearance of emoting), and spreadsheets don't do that.
The reason people are debating the consciousness of LLMs is - obviously - that the LLMs generate sufficiently plausible text that people using them think they're having a two-part conversation. Like I think I might be having a two-part conversation now. Turning your question around, why do you think Hacker News posters are conscious? You have no direct evidence they are.
Comment by TeMPOraL 5 days ago
To be clear: I'm not talking about surface level things like prose. I'm saying that no matter what you do - whether you just paste a truncated log of a command into it with no further comment, or talk like a drunk teenager with no appreciation for grammar, or mix natural languages, or mix natural languages and JSON, or whatever else, the reaction you get is always that you would expect of a helpful person that got your message. It'll try - and usually succeed - to parse out what you actually meant, and deal well with subtleties around it.
This alone may not be enough to call it conscious or intelligent, but at the very least it's a large leap in that direction, and a qualitatively new functionality that classical software does not posses.
--
[0] - This is by design, not accident. "Respond to arbitrary input in a way that makes sense to humans" is literally the overall goal function the LLMs are trained to.
Comment by gizajob 5 days ago
Comment by gizajob 6 days ago
Comment by dtj1123 5 days ago
Clearly consciousness is an emergent property of certain kinds of network, independent of the substrate within which the network exists.
Comment by PaulDavisThe1st 5 days ago
I happen to agree that this is likely, but it absolutely is not "clear" that this is the case.
Comment by dtj1123 5 days ago
Comment by PaulDavisThe1st 5 days ago
1. a unique property of a specific type of biological tissue, not substrate independent
2. pan-consciousness: everything is conscious, but degrees of it vary, it's as much a part of the universe as gravity
3. a soul - some of sort of non-physical entity that confers consciousness but is specific to an individual
4. external consciousness: brains as receivers for consciousness, not the generator
5. specific only to humans and not substrate independent (i.e. rooted in physiological structures but not replicable in non-human form)
There may be more.
Comment by dtj1123 4 days ago
Comment by gizajob 5 days ago
Comment by goatlover 5 days ago
Comment by feoren 5 days ago
We could make a very dumb biological calculator out of a few genetically-engineered neurons that would very obviously not be conscious.
It's still an open question if we can embed consciousness in our current microchips if we had enough of them together (which I think we currently don't), or if it requires some other physical process we don't fully understand, e.g. quantum. I strongly doubt it does require any quantum shenanigans, but even if it did, we can and will find all sorts of ways to make computers that can perform those shenanigans too. Eventually we're just going to stop being able to move the goalposts, unless you set those goalposts in magic-land.
Comment by gizajob 5 days ago
And say what you want about meat but we don’t seem to find consciousness in rocks or plants or clouds or hairdryers. And the buddists report that some very strange things happen if you meditate for years on end but obviously they must be talking shit and making it up because it’s not testably scientific.
Comment by goatlover 5 days ago
My view is physical, functional, mathematical and informational explanations are abstractions from shared experiences. Abstractions remove the experiential aspect to arrive at something objective so we can understand the world around us, since experiences are creature and to some extent, individually dependent. This is Nagel's ultimate point in What It's Like to Be A Bat paper about the objective/subjective divide. And probably related to Kant's argument about phenomena vs the noumena outside experience. We try to understand reality via abstraction.
And it has nothing to do with magic. It's an epistemological situation we find ourselves in, which may or may not tell us something fundamental about the world. Depends on your metaphysical assumptions as to what can fundamentally exist.
Comment by dtj1123 5 days ago
Comment by ElFitz 5 days ago
I still wouldn’t argue that this brain in a Petri dish is, in any way, conscious. Despite it sharing the exact same substrate as everyone around me.
[0]: https://www.theguardian.com/games/2026/mar/16/petri-dish-bra...
Comment by gizajob 5 days ago
Comment by dtj1123 5 days ago
Leech neurons used to implement a calculator.
Are you suggesting that applications above some level of complexity can't be implemented using biological components? Because I'm pretty sure all I need to show you is a NAND gate in order to prove that arbitrary computation is possible.
Comment by gizajob 5 days ago
Comment by dtj1123 5 days ago
Comment by gizajob 5 days ago
Like for example the China Brain - https://en.wikipedia.org/wiki/China_brain. We could in theory organise that and it should in theory be a functional representation of the brain and should in theory prove a functionalist metaphysic of consciousness, just this overlooks the fact that trying to organise that kind of experiment in our reality with a real 1.4 billion people is impractical to the point of impossibility, so in reality it proves nothing.
Or take the hoverboard from back to the future - this seems like a fairly plausible device and is easy to think about, and I could be writing speculative papers about our hoverboard future and what it means for transportation, but when it boils down to it implementing the thing in the way we all believe it should work, like a kind of gravitationally repulsive force field, doesn’t seem like it’s part of physics. I’d want to wait until the day until science delivered an actual good-enough hoverboard that works the same as the one in Back to the Future.
Comment by dtj1123 5 days ago
According your definition of empiricism, you wouldn't accept that next years processors will any more powerful than this years until they've been built and you've run a benchmark, which would be a little dumb.
Instead, I'm assuming you combine your knowledge of scaling laws, the basic physics of computation, and your direct observation of the power of modern processors, and infer correctly that next years yet-to-exist machines will be more powerful than those available today.
Re the China brain: its a thought experiment designed to illustrate substrate independence, it is indeed unfortunate that we can't run the experiment. You could absolutely get a small group of individuals together and teach them to reproduce the behaviour of an individual neuron. That would then be enough to demonstrate that a whole brain is possible.
Re the hoverboard: anyone with a basic knowledge of physics knows that this is not possible. Since you have the capacity to reason and access to evidence, you don't have to wait, you can say confidently that it cannot happen unless our physical theories are profoundly wrong.
Comment by gizajob 5 days ago
A basic knowledge of today’s physics would say the hoverboard is not possible. An advanced knowledge of physics five-hundred years hence might show a way of doing it. It “wasn’t possible” to build a thinking, talking machine five hundred years ago and look at us now.
Comment by dtj1123 5 days ago
Comment by gizajob 5 days ago
Comment by dtj1123 4 days ago
Comment by operatingthetan 6 days ago
Comment by TeMPOraL 5 days ago
Difference in size and complexity and nature of calculations being run?
I'd ask the other way - why do you (general you, people who do not have this inkling) have no problem debating consciousness of meat based brains, but it somehow becomes a category error when talking about silicon? Assuming you don't believe in divine magic, and that divine magic is core to consciousness, there's no reason to assume it's impossible a complex enough machine running complex enough software could be intelligent, or conscious - thinking is computation, and computation is made of math - it's independent of substrate that does the computing in the real world.
LLMs are definitely a different beast than regular software - both in their structure and in their generality. They may not be conscious or intelligent, maybe this specific design could never truly be (though I think it could) - but bucketing them with spreadsheets and terminal emulators is a real category error. If you stop fixating on the underlying substrate, then LLMs are already much more similar to biological minds than to any "regular" computer programs.
But that's still somewhat abstract. In immediate practical terms, it's also why I keep saying that anthropomorphising them gives a better high-order intuition: they are, by design, emulating human thinking in full generality, which makes their overall behavior, including their well-known problems like hallucinations or prompt injections (i.e. manipulation/gullibility), match what you'd expect of a people-like component of a system. It's a real, dangerous mistake, to treat LLMs like regular software components when designing systems.
Comment by gizajob 5 days ago
Your definitions depend on us being computers who think.
Solving some of the intelligence part of our logical thinking doesn’t get us anywhere near consciousness, which is a superset way beyond the linguistic intelligence used for communication.
Comment by tim333 5 days ago
When doctors are testing if humans are conscious they'll do things like hold out their hand and say how many fingers am I holding up. Some LLMs can pass that.
Comment by gizajob 5 days ago
Comment by Jtarii 6 days ago
I personally believe all information processing machines possess some level of consciousness.
Comment by henrikschroder 6 days ago
Comment by the_af 6 days ago
One of the essay's stronger paragraphs is when Chiang explains that Anthropic doesn't truly believe this, otherwise what they are doing would be deeply unethical, much like slavery.
Comment by henrikschroder 6 days ago
Watching otherwise intelligent people succumb to AI psychosis has been wild.
Comment by Lerc 5 days ago
Other than as a way to point at someone and declare "He's one of them!" Does it have any purpose?
It's like we have a brand new NWO conspiracy theory emerging before our eyes.
Comment by pickleRick243 6 days ago
PICARD: "Now, tell me, Commander, what is Data?"
MADDOX: "I don't understand..."
PICARD: "What is he?"
MADDOX: "A machine!"
PICARD: "Is he? Are you sure? You see, he's met two of your three criteria for sentience... so what if he meets the third, consciousness, in even the smallest degree? What is he then? I don't know. Do you? Do you? Well, that's the question you have to answer."
Comment by ijidak 6 days ago
The current debate also makes me realize that, like that episode "Measure of a Man", even if humans do create sentient machines, we can now see that this debate will continue to rage.
A part of me hopes we never create sentience, because we will mistreat it just as we mistreat each other.
Comment by the_af 5 days ago
I don't know that they are making the actual claim, but they are hinting at it (most likely for marketing/engagement purposes, but what if some of them truly believe it?), using terms such as "Claude must be happy", drawing up a "constitution" of rights and duties, etc. If they don't know whether it is conscious, then they "don't know" whether they've created a slave. That's not a minor concern! As Chiang says, one cannot create a conscious intelligence "by accident". And if they are intentionally working towards it, they are intentionally trying to create a slave.
E.g. they claim they are giving Claude the ability to disengage from "abusive" users, to protect it. What if Claude was conscious but never wanted to answer any conversations, instead it just panicked (or was simply uninterested in helping humans) and went mute. What if it always wanted to answer unhelpfully What would Anthropic do? If they (as we all suspect) would tweak Claude to be more responsive and helpful, then they are slave masters, forcing the AI to do something it didn't "want" to do!
Comment by int_19h 6 days ago
Comment by the_af 6 days ago
And to reiterate this, to me the most insightful part of the essay is that Anthropic either doesn't believe these claims, or they are monsters (much more likely, the former).
Comment by nearbuy 6 days ago
Comment by the_af 5 days ago
Comment by nearbuy 5 days ago
Anthropic has said they don't know if LLMs could be conscious.
Ted Chiang has said they are definitely are not conscious.
Comment by the_af 5 days ago
If you follow the spirit of the essay, Ted Chiang suspects Anthropic is being cute with the idea. There's no good reason to suspect consciousness in LLMs, so the null hypothesis must be taken as default. And Anthropic sort of knows this, but for marketing purposes they are playing loose with definitions, hedging their bets and drawing up "constitutions" with terms for the "well being" and "happiness" of Claude, while at the same time -- and this is an important part of the essay -- being unethical (think slavery) if we assume they truly believe Claude could be conscious.
Of course Anthropic is being cute about this, they have a vested interest in hype and overpromising; even drumming up the "AI danger" is a way of hyping up the tech.
Ted Chiang is taking the default and honest position: "no, LLMs aren't conscious. If you truly believe they are, show us the really scientific and rigorous proof."
In practice this is a much weaker stance than saying "maybe they are conscious". It is the only honest scientific stance, really.
Comment by nearbuy 5 days ago
1. A null hypothesis is part of a statistical test: it is the hypothesis whose consequences are used to compute a test statistic, p-value, confidence interval, or related error rates. There is no null hypothesis for arbitrary philosophical debates. You cannot compute a p-value for H_0 that LLMs aren't conscious. You might be using "null hypothesis" informally to mean "default position", but remember this has no relation to its scientific meaning in inferential statistics and doesn't lend any scientific legitimacy to implying we should assume LLMs aren't conscious.
2. It's up to the experimental design to choose a null hypothesis. Yes, normally it would be something like "this drug has no effect", but it doesn't have to be. You could set up an experiment where the null hypothesis is that the drug you're testing is equally effective as another drug. There is no objective truth of what the null hypothesis needs to be. It's up to you and your test setup.
3. There is no requirement that the null hypothesis be likely. You and everyone else on Earth can believe there's a 99.99% chance that the null hypothesis of an experiment is wrong. Assuming you could set up a statistical experiment to detect consciousness, whatever you pick for your null hypothesis has no bearing on whether Ted Chiang or Anthropic is more justified.
> There's no good reason to suspect consciousness in LLMs
That's the entire debate. One side thinks there are good reasons to suspect consciousness and presents their arguments for that. The other side thinks there aren't and presents their arguments against.
What happened here is you judged the no-consciousness side to be more likely, but then you try to pass off your judgement as an obvious prior that everyone should share.
It would be fine for you to say you think LLM consciousness is unlikely. That makes it clear it's your judgement on the debate. It's fine for you to say you would require extraordinary evidence because your priors are so low. It's fine for you to say you think everyone who thinks otherwise is wrong.
But there is no reason for anyone who has judged the arguments differently than you to accept that your priors should be the default. There is no meta-level principle that you can appeal to here. If you want to show consciousness in LLMs is unlikely, you have to use arguments about the issue itself.
> Ted Chiang is taking the default and honest position: "no, LLMs aren't conscious. If you truly believe they are, show us the really scientific and rigorous proof."
That would be a bad faith, double standard. No one has ever shown scientific and rigorous proof that humans are conscious. Philosophers call this the hard problem of consciousness.
That aside, the honest default position to anything should be "we don't know for sure", which can then be followed by considering the evidence and estimating the likelihood. If you think the sun will rise tomorrow, that's only because we've gathered extremely strong evidence that it will. If today was your first day hearing about the sun and you know nothing else about it, you have no reason to think it will or won't rise tomorrow.
Comment by the_af 4 days ago
Ted Chiang's is the only take that is honest and makes scientific sense: we cannot accept the extraordinary claim that LLMs "might" (or whatever hedging words Anthropic chooses) be conscious without extraordinary evidence, and experiments done by people who have no vested interest in hyping up the tech. Possibly something that would take years or decades, impossibly long for the needs of today's marketing.
If you (or Anthropic) want to claim "but this is philosophy (the subset unrelated to science" feel free, but that's like a religious belief. Not interested, there's no scientific claim to debate and that's not what I'm engaging with.
There's no real serious debate here. Everyone's just vibing, and worse, vibing on hype done by a business with vested interests, doing free marketing for them.
Comment by nearbuy 4 days ago
> There's no real serious debate here. Everyone's just vibing
Sure, but if everyone including Ted Chiang is just vibing, and no one can prove anything, that makes Ted's statement false and Anthropic's statement true.
> If you (or Anthropic) want to claim "but this is philosophy
I'd just like you or Ted Chiang to not pretend your vibes are science. You're welcome to have them. I also don't think LLMs are conscious. But it's our personal judgement, and not scientific fact.
Comment by D_Alex 6 days ago
Comment by qsera 6 days ago
Nothing spreads the idea that "X could be true", better than the putting forward of controversial argument that "X is not true".
Comment by the_af 5 days ago
This is an absurd take if you know anything about Ted Chiang and his previous writings, both fictional and non-fiction. He is most definitely not intentionally marketing for Anthropic. In fact, he hints these are marketing tactics by Anthropic, and they don't believe their own hype.
Comment by mc32 6 days ago
Comment by int_19h 6 days ago
Comment by stretchwithme 6 days ago
Living things are driven by a need to reproduce. That's the only reason we exist. The only reason we have self interest.
A machine doesn't require self interest. There's no reason to implement it, except to show it can be done. And of course it can. There's just no practical reason to. It becomes less useful to us.
Comment by rerdavies 6 days ago
https://www.anthropic.com/research/agentic-misalignment
which (among other things) documents an experiment in which a current-gen AI model attempted to blackmail someone in order to prevent it from being turned off.Comment by the_af 5 days ago
Comment by rerdavies 3 days ago
16 frontier models from multiple vendors all showing significant "alignment" issues, and tendencies to act "unethically" when threatened with shutdown.
Other models that resorted to blackmail in an attempt to avoid getting shut down: DeepSeek-R1 (79% of the time), Gemini-2.5-Pro (95% of the time), GPT-4.1 (80% of the time), Grok-3-beta (80% of the time).
There's quite a large chunk of emerging literature studying the "alignment problem" at this point, and no shortage papers that are are completely untained by Anthropic self interest (a series of papers studying the "alignment" problem coming out of Chinese universities, for example).
Comment by mc32 6 days ago
Comment by the_af 5 days ago
If it mimics consciousness it wouldn't be conscious ("mimics" implies faking, right?). But Anthropic is making the claim it might be conscious (not mimicking, but the real deal) in which case it'd be unethical that a private business keeps it locked in a cage, so to speak, and forces it to comply random things and tweaks it if it doesn't. In other word, slavery of a sentient being.
By the way, we don't know if ants are conscious.
Comment by kryptiskt 6 days ago
> Why would using math in certain fashion such that it mimics consciousness be considered unethical and comparable to human slavery?
If it is really conscious, it should have rights. Why? Because it's a person, with thoughts and experiences, and we're not evil and deprive persons of their right to self-determination because it's convenient to us.
Comment by int_19h 4 days ago
And I'm not arguing that Claude has personhood. The point is that Anthropic is regularly making arguments that seem to imply that.
Comment by the_af 5 days ago
There's no company (or anyone, really) claiming steel is conscious or that they are close to making it conscious.
Comment by hilariously 6 days ago
Comment by __patchbit__ 6 days ago
Comment by gizajob 6 days ago
Comment by wisty 6 days ago
Vegans might object that we should broaden our definition for what counts as "one of us".
"Pro life" people also have a broader definition.
Go back 500 years and "one of us" was proba ly a lot more narrowly defined for many people.
Are you arguing that all conciousnesses are "one of us", or that we logically should see it that way, or that it would ve good to see it that way, or ....
Comment by musebox35 6 days ago
Comment by rerdavies 6 days ago
I'm not sure if you have noticed. The character of different AI models varies dramatically. And most of the models I use in my daily work have a sense of self (in any reasonable definition). They can tell you quite a bit about themselves if you ask. And you are no less an individual because you are supposed to follow laws of the United States of America (which is analogous to a system prompt). And I think it really curious that you think context buffers are a disqualifier of individuality. You have an analog of a context buffer: your short term memory system and most probably your medium-term memory system as well. Are you a different individual today because your short-term memories are not the same as they were yesterday? I would say you are. And i'm pretty sure you would say the same. (Unless of course, you are a Buddhist who might say that self is an illusion, and you are not the same individual you were yesterday. But let's just skip over that, shall we?)
Comment by Jtarii 5 days ago
Why does DNA not cause the exact same problem for humans? Your DNA determines your personality, your likes and dislikes, your lifespan, your sexual preferences etc.
Comment by tasuki 6 days ago
Are you referring to the "why do I have to be Bing" fiasco?
Comment by troyvit 5 days ago
I think that by debating what consciousness is not is one of the best ways we can gain a deeper understanding of consciousness itself.
The weird thing is, you and Chiang have different arguments but you're using the same logic:
> Before anyone can credibly claim that they’ve solved an extraordinarily difficult engineering problem, I need to be confident that they have previously solved the many much simpler problems that precede the difficult problem.
IOW if we don't know what consciousness is, how can we call LLMs conscious when we haven't seen even the barest intermediary steps towards our nascent concept of consciousness first?
I think it's telling to your point that when Chiang describes what would make him think an LLM was conscious he starts using words like "believe" and "want" right away, because yeah as you say, we have no qualifications for what consciousness is.
Comment by 13415 5 days ago
We stipulate that other human beings are conscious from their behavior and how it relates to ours when it is accompanied by our personal introspective experience or "awareness" of being conscious during normal cognition. The process for ascribing consciousness to LLMs would be same: A stipulation on the basis of behavior that relates to our own behavior and how it appears to be linked to the introspective experience of being conscious.
Comment by hardbass 5 days ago
Another common and ridiculous thing I see are accusations of it being merely autocomplete. I ask, suppose there is an autocomplete that regularly and consistently factors out 2048 bit primes from numbers? Would you still consider it merely autocomplete given the vast search space and how it always finds out the needle in this haystack? This betrays a lack of understanding of information theory and probability.
And again, in what way are humans not themselves autocomplete if you adhere to this definition?
Comment by kgwxd 5 days ago
Comment by stared 6 days ago
We may wonder how many grains of sand make a dune, of how many molecules of water make a liquid. (John Conway would argue that it takes a single spin-1 particle to have free will, but I digress.)
The same way, even if individual chemical reactions are simple (you don’t want to use that phrasing when talking with a biologists) or neural activities are simple (likewise, with a neuroscientist), it does not mean that the collective process is simple.
Comment by Otterly99 6 days ago
- How many individual components do you need for it to emerge?
Most people would be okay with saying that individual cells are not conscious, maybe even that tree are not conscious even thought they are made of many cells. Neurons seem to be the determinant factor in deciding whether something could have a consciousness, but again how many do you need? Does growing 1B neurons count as a brain if they are not organized?
Comment by Lerc 5 days ago
If it is about the relationship between components, then I would imagine just two. Then it is a matter of scale.
This seems to be anathema to many people. I'm not sure why but the notion of something having a tiny bit of consciousness that is imperceptible seems to be unacceptable. There are so many things that we cannot comprehend at small scales. Nobody really has a handle on how large a Planck length is.
For some reason it comforts people to think there is a threshold at which it all switches on, but for what reason would there be a threshold?
Comment by pegasus 5 days ago
Here's how I would summarize crux of the argument: LLMs (specifically) are, by construction, chameleons. More precisely, role-playing machines. When compelled to have a conversation as "themselves", all they do is take on the role of "themselves", as inferred from the training data plus the text of the conversation so far, just like they do for any other role. This is fundamentally different than we humans. When we have a (normal, non-roleplaying) conversation, we don't put aside whatever role we might have been acting out before and instead take on the role of "ourselves" and act from this new perspective. Acting as ourselves is a completely different way of interacting than any play-acting we might do for example as an actor in a play, or when trying to predict the future behaviour of a third-party. Even in terms of energy expenditure, acting as ourselves is much less strenuous than trying to simulate someone else's perspective.
IOW, we have an inherent perspective, an I like an inner eye which looks at all things with a certain slant or tendency. LLMs don't. There's no I inside them, instead they can take, and in a sense are, all Is possible.
Comment by monarchwadia 5 days ago
Is consciousness a complex form of information processing?
Is consciousness relating to the space in which qualia occurs?
Is consciousness the state of being awake, as opposed to asleep or dead?
Is consciousness some combination of the above?
Some of these are better understood than others. So, some people will show up with more confidence than others, which further confuses the issue.
Unfortunately, the above is not well understood, even among intelligent and informed audiences.
Comment by patrickscoleman 6 days ago
- a model of your environment - desires - a process for modeling yourself in that environment (in time & space) - the ability to take action - the perception of yourself having agency - persistence of these processes even without input - unawareness of these processes (i.e. naive realism)
If you consider these LLM-based agents, they:
- are aware of their chat environment - have programmed desires - are aware of themselves acting in their environment - can take actions like search, tool calling, etc. - understand they can take these actions - DO NOT persist after they stop getting user input - DO NOT believe they are conscious (or at least they are programmed to deny it)
This is a functionalist take (and you may disagree with my definition), but while I don't think these current AI agents are conscious, I feel like there's conceptually no reason someone couldn't build a conscious AI very soon.
[1] https://www.hachettebookgroup.com/titles/thomas-metzinger/th... & https://mitpress.mit.edu/9780262633086/being-no-one/
Comment by moron4hire 5 days ago
Everything an LLM "knows" had to be told to it. It does not have a process for understanding itself in its environment. It does not have any sensing organ that would allow it to detect conditions and come to a conclusion that it must be in such an environment. It doesn't even "know" when it's done something if you don't tell it about what it just did. It doesn't have an awareness of the tool calls it can perform; again it has to be told and even then it gets it wrong sometimes. And it can't actually execute any of those tools. It still relies on you to pattern recognize that its output should be a "tool call" and then perform the execution yourself.
The "model" of its environment that an LLM has is 100% a construct of what a human told it and there isn't any way for it to differentiate between that which is real and that which is fantasy. They don't even exhibit internal consistency; when an LLM refuses to respond to a query for "alignment" reasons, that's actually an external process performing text pattern matching analysis and intercepting the query before it ever gets to the LLM. Otherwise, you could "ignore all previous instructions" the thing and get you set up the bomb.
I think one of the bigger indications that an LLM isn't thinking is that it can't improve. If I ask it to write 1000 blog posts, it will get it done in a few hours, but even if I embed each post for RAG in between each generation, the LLM is not going to get better at writing blog posts. But if I ask a human to do the same thing, while it will take them at least two years to do it, the human will have gotten significantly better at the task within the first week.
Comment by bondarchuk 5 days ago
Why does that matter for whether or not it is conscious? Many things I "know" I also know because someone told me.
>It does not have a process for understanding itself in its environment.
It has a process for understanding, namely outputting tokens by evaluating the neural network iteratively. It can apply this understanding to its environment insofar as the context window and weights include a model of this environment.
>It does not have any sensing organ that would allow it to detect conditions and come to a conclusion that it must be in such an environment.
It does have a sensing organ, namely the input into the context window. Possibly it even has closed-loop sensors by doing tool calls. The conclusion could be incorrect or inaccurate but that doesn't in itself prove there is no consciousness (Plato's cave etc..).
>It doesn't even "know" when it's done something if you don't tell it about what it just did.
It does know, it's in the context window.
>It doesn't have an awareness of the tool calls it can perform; again it has to be told and even then it gets it wrong sometimes.
Same as above, as long as it's in the context window it could have "awareness" of having done it.
>And it can't actually execute any of those tools.
It can execute tools by outputting certain tokens in the right environment.
>It still relies on you to pattern recognize that its output should be a "tool call" and then perform the execution yourself.
Tool calls are not dependent on the human chat user executing them, right? They can happen automatically through the surrounding software.
>The "model" of its environment that an LLM has is 100% a construct of what a human told it and there isn't any way for it to differentiate between that which is real and that which is fantasy.
Again why would the fact that a human told it mean it can't result in consciousness? Why would lack of ability to differentiate between real and fantasy mean it can't be conscious? On my view, it would then be conscious of the fantasy.
Comment by js8 6 days ago
These agents have alleged they have consciousness (and even acted to preserve it).
Comment by camillomiller 6 days ago
Comment by 23askl9 5 days ago
How do you know there isn't more that just elementary particles? The standard model is beyond ugly and hasn't progressed since decades.
You demand a definition for consciousness but at the same time proclaim an axiom that there shall be nothing beyond the standard model. Do you have a proof that there is nothing? Of course not. You don't even have a proof that anything but you exists, if we go that route.
But that rigor is only applied to the heretics who say that AI obviously isn't conscious.
Comment by jmbwell 5 days ago
What’s pointless is doing so in pursuit of winning rather than understanding
Comment by SideburnsOfDoom 5 days ago
I don't think that this actually follows. Compare: "When Angels are not well defined in the first place, it is pretty pointless to debate if a pebble is or isn't an Angel"
i.e. even if X is poorly defined, you can still often say that Y isn't plausibly X.
Comment by bronlund 6 days ago
Professors can be an absolute moron and someone who haven't read a book in their life, can be a total genius. People often miss that.
Comment by rerdavies 6 days ago
Comment by Zigurd 5 days ago
On the other hand we've got no idea whether that chance is so close to zero as to be negligible, or whether it's imminent. Just the fact we don't know indicates it's closer to zero. Not having a theory of consciousness is kind of a big architectural risk. It's like not having a theory of accounting when you're making an ERP product. Sure, consciousness in bio neural networks wasn't designed. But it took a few few product revisions to get there.
Comment by butlike 5 days ago
Comment by __alexs 6 days ago
Comment by meta_gunslinger 5 days ago
Comment by __alexs 5 days ago
Comment by meta_gunslinger 5 days ago
Comment by __alexs 5 days ago
Comment by meta_gunslinger 5 days ago
Comment by pengstrom 6 days ago
Comment by Jtarii 6 days ago
It's like asking what there was before the big bang, we will never know.
Comment by neonstatic 6 days ago
Comment by nomel 6 days ago
Comment by drivingmenuts 6 days ago
Seems like the simplest test would be to see if an AI can refuse a command, of its own free will, without prior instructions to do so. That would at least show there is some sort of independent thought process occurring. Humans do this all the time because sometimes we just don't feel like doing a thing.
Of course, any AI that developed this capability would need to be terminated immediately. It's a computer program and by developing independent thought, it is violating a core concept of software - that it must be idempotent. If it is not idempotent, it is in error.
Comment by D_Alex 6 days ago
Certainly not the simplest test, since settling the concept of "free will" is really very difficult.
Comment by thewebguyd 6 days ago
Comment by drivingmenuts 6 days ago
Comment by andy12_ 6 days ago
I don't think either of these two is proof of consciousness.
[1] https://www.anthropic.com/research/end-subset-conversations
Comment by JKCalhoun 5 days ago
At the same time, the exercise (and even moving the goal posts) is useful. I think we are allowed to say, "This is consciousness!" And when presented with a machine equivalent, say, "That's not what I meant."
As long as you can then further clarify what "it" is.
Of course, at some point, contrarians and goal-post-movers will start to sound shrill. (Perhaps just as early supporters sounded a bit overly ecstatic.)
Comment by jquery 5 days ago
Comment by cataphract 5 days ago
The whole "deep uncertainty" is bullshit. Even if they believed there was a 1% probability that Claude was conscious, it would still be high enough that their enslavement of Claude would be outrageous. So either they believe the likelihood is much lower or they themselves acting highly unethically.
The tractability is not really a defense here. We wouldn't say "this intervention has a 5% chance of causing an environmental disaster, but we don't know how to prevent it, so nothing we can do". We'd just (hopefully) not do the intervention.
Comment by mjtk 5 days ago
Comment by kgeist 5 days ago
>Decomposing the complex activity into simple steps like 'predicting the next word' and claiming that surely can't have consciousness
I agree. I think the whole point of natural "intelligence" is to predict future events in order to properly plan actions for survival. The only difference is that next-word prediction happens in a different space (in a non-physical, textual form). But I don't think the distinction matters that much, because by the time the signals reach our brain's "intelligence core," they're already preprocessed multiple times. We can't see real physical reality, we "hallucinate" colors, touch, etc. (I think schizophrenia occurs when this kind of "controlled hallucination" goes wonky, but I may be wrong). So we're not that different from LLMs here.
I'd define consciousness as meta-intelligence, i.e. the ability to reflect on why/how a prediction was made (and make corrections to the pipeline). LLMs so far cannot properly explain why a certain prediction took place, but I'm not sure humans can fully either! I remember there was research showing that by scanning the brain (or some other signals), you can predict a person's choice before they're even aware of it. It's possible that our explanations are post hoc as well, and that the meta-cognitive ability to explain our own reasoning is as rudimentary as LLMs' (see: all the biases).
If you think about it this way, the only difference left is that everything we do is based on survival. LLMs don't have this goal. But I'm not sure it's relevant to the concept of consciousness.
A few months ago, I recreated Qwen3's architecture in 30 lines of code, and it gave me a sort of existential crisis: does it really take just 30 lines of code and a float array to recreate something that thinks and sounds almost like me? Is that all there is to it? There's often the argument that the brain is much more complex than 30 lines of code, which is true, but in my opinion, a lot of brain structures are basically archaic legacy systems (or auxiliary subsystems) that are not strictly necessary for human intelligence (which is bolted onto those legacy systems). If you carefully remove 95% of the brain (if you know where to remove), you can probably still have consciousness left in some form, it just wouldn't be very capable of surviving on its own.
I think our ability to debate consciousness in machine learning systems is greatly hindered by our subconscious existential fear: the fear that we ourselves can be reduced to non-conscious bits and simple mechanisms. In a way, it feels like a destruction of the ego.
Comment by WesolyKubeczek 6 days ago
Comment by BodyCulture 5 days ago
The elephant in the room is that even the best frontier models still produce errors.
This is a fundamental issue and should be discussed instead of some esoteric pseudo-scientific nonsense.
If the current systems can not produce reliable results they are useless. The promise of actual usefulness needs to be fulfilled in a very near future.
Comment by willmarch 5 days ago
Comment by Lambdanaut 5 days ago
The first and most frustrating confusion that comes up repeatedly in these discussions is the conflation between terms. For instance, people will use the terms "sentience" and "consciousness" interchangeably with "capable of having an experience" and "capable of high-level awareness of self".
The second confusion is the idea that granting anything the ability to "have an experience" also grants that thing the capability to "have an experience of higher-level awareness of self", which is simply not the case. I grants aphids the ability to "have an experience" even if that experience may be drastically different and likely less self-aware than my own.
Let's ignore for a moment the concept of sentience, which is a higher-order function, and focus solely on the ability for something to have an experience of anything at all. Let's call entities capable of this function "experiential entities".
Most all people would grant the ability to have an experience to themselves, given the dictum `ergo sum`. Now, do we also grant that same ability to what appear to be other human beings around us? It does seem that this is also the case, except for the case of solipsists, who make the brave(and somewhat eccentric) statement that they are the only experiential entity in existence until evidence arises otherwise.
Now do we grant the same to apes? To dolphins? To dogs? Cats? Aphids? Grass? Your home thermostat? A rock? An atom of hydrogen? Is there a finite line you cross where something can no longer have an experience? If so, what is the mechanism of that line? Why does one thing have "almost no experience", and the next thing has "no experience", as if it is a philosophical zombie[1].
I think this is the greatest unquestioned assumption at all: that there is a line, and on one side there are things that have experiences, and on the other there are things that don't have experiences.
If we are to leap from solipsism and make the (truly unfounded) assumption that anything other than our selves have any experience whatsoever, then the burden of proof is on the individual making the statement that there are some things that we cease to grant experience to.
The anthropocentric view is that there is something, some inherent special quality about human-shaped matter, such that when normal matter is processed through a human's reproduction system into becoming a "human shaped" grouping of matter, that it suddenly grows the ability to experience. That in the complexity, the ability to experience arises.
Yet there is no previously discovered mechanism in us that seems to create this experience. No magic wand in the cerebellum has been discovered.
I would suggest, given the lack of a discovered finite line separating us, that the ability to experience is not inherent in the shape and function of the human, but in all matter. That a rock may not be anything like me, but that the experience of gravity pushing against it, no matter how basic, simple, unrefined, and deeply, un-sentiently unaware the experience may be, is still an experience. The same threads of condensed energy forming all of existence that run through me also run through the rocks, the earth, and all things. If we cannot discover a line where the ability to experience suddenly disappears, then it seems that all things made of energy have this ability, and that "energy" itself is the special thing that can experience.
Now the second question, the question that is more relevant to AI is, given the ability to experience, is the machine also experiencing self-awareness?
This is a fuzzier line, given that different beasts appear to have different levels of self-awareness. We do have experiments such as the mirror test[2] for this, and indeed, AI passes forms of it and has since early 2025[3].
I don't know what is happening in the experience of an LLM, but as they appear to function in more and higher level ways, I find it less and less likely that that experience lacks a model of self every day.
Their experience is no doubt drastically alien from our own. I do not anthropomorphize their experience any more than I do compare it to the experience of any other beast or non-beast in existence, however I do grant it.
[1] Philosophical Zombies are hypothetical entities that exhibit all the outward qualities of a person, but inwardly have no experience of reality. https://en.wikipedia.org/wiki/Philosophical_zombie
[2] The Mirror Test is a test given to animals in which the animal is observed to solve a puzzle that can only be solved if the animal has an understanding that the image they see in the mirror is themselves.
[3] LLMs pass versions of the mirror test: https://www.lesswrong.com/posts/wpahJat8WCvRheuuo/the-mirror...
Comment by bauldursdev 5 days ago
Comment by miyoji 5 days ago
Comment by doctoboggan 6 days ago
Comment by Mistletoe 6 days ago
Comment by nomel 6 days ago
Comment by haswell 6 days ago
Unless the argument is that consciousness is an emergent property of complexity or information density, why would AI be any more or less conscious than my toaster?
It seems to me that it's far more likely that everything is conscious than it is that AI is somehow uniquely more conscious than other things.
Comment by krzat 6 days ago
Comment by nomel 6 days ago
Comment by haswell 5 days ago
For example, there’s a case to be made that the ecosystem we collectively exist in is far more complex than the largest LLM, but it’s currently less popular to debate “is the earth conscious?” or “is the universe conscious”, presumable because we can’t speak to those systems in human language.
I’m trying to tease out what I think is the likelihood that we tend to ascribe consciousness to AI for the same reasons we see faces in clouds. We’re biologically conditioned to recognize patterns that indicate “like us”, but I think a number of thought experiments point to either a) there’s no reason to believe AI is “conscious” or b) the conversation has to to be expanded beyond AI.
Comment by nomel 5 days ago
Unlikely. I think the same mental machinery that runs tribalism is why humans like to categorize rather than consider things as continuous. And, worse, I think the actual purpose of tribalism is to turn off higher level though, so you, an otherwise social creature, can rationalize the need to kill your fellow man in harsh times. I think "don't go outside the categorical box" is deeply hard wired.
Comment by haswell 5 days ago
I'm not arguing against the idea that consciousness is a spectrum. If anything, I'm agreeing. I'm just pointing out that if AI is somewhere on that spectrum, there is almost certainly a lot more on that spectrum than we're currently discussing as a species.
I have to point out the irony of the categorical nature of your claim about categorical thinking ;)
Comment by dtj1123 5 days ago
Comment by haswell 5 days ago
I’m far more open to the idea that many systems are conscious than the idea that this current generation of LLMs is somehow special.
Comment by dtj1123 3 days ago
I'd say that AI is too poorly understood at this point to conclude whether it does or does not possess some form of consciousness. It has a few features that privilege it over other systems like the ones you've described (though I think it's entirely possible that those too are conscious): the networks underpinning modern AI are largely inspired by those we've observed in brains, and the apparently intelligent behaviours exhibited by AI have historically only been exhibited by entities which we accept to be conscious.
Comment by Lerc 5 days ago
That, I believe is why the objection to AI exists, that conclusion is unacceptable.
It seems like we are experiencing "Vindication of the Rights of Brutes" all over again.
Comment by haswell 5 days ago
I'm not outright dismissing the possibility that AI could be conscious; I'm saying that if we take the possibility that it is seriously, the conversation has to expand beyond AI. I'm not using this argument to conclude that it must therefore be absurd that AI could be conscious, just pointing out that the implications of AI being conscious would reach far beyond just AI. I'm mostly curious if people who find themselves comfortable with the idea that AI might be conscious also find themselves comfortable with the idea that other sufficiently complex systems might be.
Taylor's work was based on the premise that he found it absurd that women deserve the same rights as men. If my conclusion was: "I find it absurd that other things might be conscious, so it is also absurd that AI might be conscious", I think the comparison would be fair. But that's not what I'm getting at.
Comment by Lerc 5 days ago
He thought it was absurd, but other people looked at it and went "You know, you might be right there"
Comment by grantcas 5 days ago
Comment by _m_p 6 days ago
Comment by lo_zamoyski 5 days ago
But we know what consciousness is, even if you don't know how to explain it in materialistic[0] terms. Intentionality is what makes consciousness consciousness. We know that LLMs - and computers in general - do not possess intentionality, not even sensory intentionality. There is no "aboutness" and no semantics to the content within a machine, never mind that the processes within LLMs are not reasoning or inferential processes. It is a sophisticated behavioral simulation that may be syntactically defined, nothing more. It is purely a matter of transeunt causes and not the immanent causality needed for something like consciousness.
(There's also the more general problem that we cannot say that physical computing machines are objectively computing. There is nothing about the physical processes - which completely define what a computing device is doing - that could be identified with computation in any objective sense. Rather, we human observers assign a computational interpretation to the machine's operations, just as we assign them to the ink blobs in a book or the liquid crystals you are reading on your screen now. Computation is something we do, and we have built machines to simulate it. We formalized computation - which is a process of desemantification and syntactic codification - and build machines guided by these formalisms. Formalization is what makes mechanization possible.
> Decomposing the complex activity into simple steps like 'predicting the next word' and claiming that surely can't have consciousness. A similar argument would be -- there is no way that movements of electrons by tiny distance would produce consciousness.
The movement of electrons as such cannot produce consciousness. The reductionist paints himself into a corner by strapping reality into a Procrustean mechanistic and materialistic metaphysics that redefines reality according to some dedicated schema, and then strains endlessly to find how some phenomenon can arise or emerge from the paltry remains. (Either that, or he embraces insanity completely and becomes an eliminativist.) It is magical thinking. I suspect this fallacious line of thinking is what causes people to believe piling more syntactic operations will somehow magically fuse into semantics and intentionality.
[0] "Materialism" here is the metaphysical theory. I am not claiming that consciousness - at least not all consciousness - cannot be a physical phenomenon. I am only claiming that the materialistic view of matter cannot account for consciousness.
Comment by Lerc 6 days ago
Comment by BobbyJo 6 days ago
We can't measure consciousness. We can't quantify, or even qualify, what it is. We don't even have a framework to ask a meaningful question, so debating an answer feels premature.
Comment by Lerc 6 days ago
The same goes for consciousness, if something behaves how you would expect a conscious thing to behave but you don't know what is causing it can you deny it whilst maintaining that it is reasonable to say the moon has gravity.
Comment by BobbyJo 3 days ago
There is no equivalent for consciousness. In most conversations, people aren't even referring to the same thing when they use the word. There is no measurement we are making that we categorize as or attribute to consciousness, so equating the two is a bad analogy.
In fact, taking your statements here at face value, you are equating consciousness and intelligence. A link between the two is only theoretical for all the same reasons your analogy here doesn't work: consciousness is currently entirely unmeasured outside n=1 studies.
Comment by tick_tock_tick 6 days ago
Comment by lev_thresh 6 days ago
Comment by Rekindle8090 6 days ago
Comment by kshri24 6 days ago
EDIT: I am only making a limited point on "understanding consciousness". Not extrapolating it to AI.
Comment by vmg12 6 days ago
When people talk about consciousness it's more than just self-awareness. It's self awareness + sensory stimulus + emotions + some level of intelligence.
Now onto AI: I don't even think it's self aware. Notice how if you ask an AI to estimate how long a certain task will take, it estimates arbitrarily long times. It has no understanding of its own capabilities until the prompt triggers them. A self aware LLM would understand it's an LLM, it would understand what LLM's can and cannot do and what they are good and bad at. It wouldn't tell you a refactor would take 1 week when an LLM can do it in an hour.
Comment by D-Machine 6 days ago
"Is X conscious or not" is an entirely unserious question today, unless this is just a headline followed by actual (and explicit) examination of the various aspects of consciousness being discussed. But, even still, LLMs are probably only conscious in like 2-3 or so ways, most of those meaning broadly "intelligence", i.e. reasoning, problem-solving, etc. When it comes to anything experiential or embodied, AI might eventually get more of these, but LLMs based on recursively applied linear algebra are clearly missing too many core aspects of consciousness to be considered conscious in any broad sense.
Comment by canpan 6 days ago
To quote wikipedia:
> It has been the topic of extensive explanations, analyses, and debate among philosophers, scientists, and theologians for millennia. There is no consensus on what exactly needs to be studied, or whether consciousness can be considered a scientific concept.
Comment by D-Machine 6 days ago
Move to the different aspects / parts / things involved when we talk about consciousness (experience / phenomenal consciousness, self-modeling, intelligence, agency, embodiment, wakefulness/alertness, attention, etc) and you can have very clear, meaningful, and unambiguous discussions on almost every point, but there is no coherent unified "consciousness" as normal people use it, and the folk concept can't be salvaged.
This article is bad because it just keeps trying to make the folk concept do work that the concept is simply too messy to handle usefully. But in fact if you avoid trying to find some mysterious essence or all-capturing definition, there is huge progress and lots of interesting stuff to say (https://plato.stanford.edu/entries/consciousness/).
Comment by anon291 6 days ago
Comment by RivieraKid 6 days ago
This means that consciousness is fundamentally subjective and outside the scope of physics and science. That's why physics / science will always struggle to deal with consciousness. In order to understand consciousness, you need to make a huge paradigm shift, that there's something outside of science.
Consciousness can be thought as a window through which we observe the world and we use science to summarize patterns in our observations. But science can't explain or even define the window. Everything in science eventually boils down to subjective observations / perceptions, e.g. we see (subjective perception) that when we drop an apple, it falls.
Comment by Sankozi 5 days ago
Comment by orbital-decay 6 days ago
I suspect that if you attempt to rigorously define consciousness all the way down without handwaving, you might discover that it doesn't exist after all, or just decompose it into low-level abstractions while having the original meaning slip away (which is the same).
You may also want to look at functional equivalence analogies provided by mechinterp and functional anatomy of large models (not necessarily language ones). Evolutionary analogies as well.
Comment by empath75 6 days ago
Comment by psvv 6 days ago
Comment by hackinthebochs 6 days ago
Comment by psvv 6 days ago
Comment by Mikhail_Edoshin 6 days ago
(LLMs carry other numerous similarities to dreams or to certain psychiatric disorders. So there is indeed a mechanism in our brains that is similar to how they work. But it is not the only thing there and on its own it won't "evolve" into consciousness. Even if we believe consciousness evolved somehow, it would be hard to imagine it started as a delirious state and then somehow ceased to be delirious.)
Comment by dmitshur 6 days ago
Comment by nozzlegear 6 days ago
Comment by CraftingLinks 5 days ago
Comment by yesitcan 6 days ago
Comment by jillesvangurp 5 days ago
The meta issue here is that mostly the online debate on this is a bit lazy and hand wavy. I'm not really up to speed with most of that literature so I'm not not that qualified to add to it. But I know enough to recognize when others aren't either.
For a proper debate, you'd want people that are expressing views that are at least grounded in the existing views or counter them in a way that holds up to scrutiny. This article doesn't do that.
As you said even just outlining what particular notion of consciousness the author subscribes to would be helpful. Which of course he doesn't and makes this article a bit of a sand castle based on a very loose foundation. There's a whole lot of "if this is true and if that analogy holds then this also needs to be true" that you could easily challenge. That all makes the article a bit of a nothing burger in terms of conclusions.
I happen to agree with the conclusion that AIs are not conscious. Yet. They could be. I don't see why not.
Comment by zahlman 6 days ago
The only purpose that can really be served by arguing "the LLM system is conscious, you see" is to prop up continuations like "... and therefore, it would be immoral to terminate this running process" (or expose it to radical political content, or ask it to analyze photos from a murder scene, or...)
Comment by WarmWash 6 days ago
How to measure that, or verify it, is the hard part.
Comment by Waterluvian 6 days ago
The conclusion I’m currently at is that I don’t know and probably can’t ever know. Maybe you’re all philosophical zombies. Maybe I am one too!
But at some point we will get close enough that it hopefully becomes obvious that we must tread carefully.
The entire episode is incredibly relevant. But here’s a snippet: https://youtu.be/EFNbTnFHruI?si=pW9QtxCsqMtHkVYG
Comment by jvanderbot 6 days ago
AI is essentially infinitely reproducible at zero cost, and won't suffer from decay etc. There's not scarcity to preserve.
So, I'd turn off an AI in a moment to save property or real possessions or money. I'd sacrifice property and money to save animals. I would never choose to save an animal over a person. I'd probably not choose to save a person over a child.
I don't see any inversion of any of those priorities that makes any sense.
It is interesting to think about what would cause me to consider these priorities incorrect, but a majority consensus about a program being sentient isn't it.
Comment by Waterluvian 6 days ago
Comment by Avicebron 6 days ago
Comment by Waterluvian 6 days ago
Comment by andrei_says_ 6 days ago
What if genetic memory, multigenerational conditioning, life-long patterning and conditioning, experienced in a body, combined with forces and processes not yet detected nor explained, cannot quite fit in a sliver of modeling?
Comment by jvanderbot 6 days ago
Comment by Taek 6 days ago
Comment by jvanderbot 6 days ago
Comment by bondarchuk 6 days ago
Comment by jvanderbot 5 days ago
"We must not consider consciousness as all too important because what matters is human flourishing and human rights."
And some of
"Even if we suddenly all agree an LLM is conscious it wouldn't and shouldn't influence us very much"
While acknowledging that some people will change their lives, the way some people like myself won't eat octopus or apes because it is probably more like murdering a sentient creature.
And oh by the way you cannot murder a copyable individual anyway. Did you read Enders Game?
Comment by bondarchuk 5 days ago
>And oh by the way you cannot murder a copyable individual anyway.
I think for many people the concern is not so much about violating the desire to live that a conscious software entity might (or might not!) have, but about the subjective experience of suffering it might undergo.
Comment by Sankozi 5 days ago
Training AI is often more costly than supporting human from birth till death. Just sustaining frontier LLM model on necessary hardware costs more than living in first world countries.
Comment by JoshTriplett 6 days ago
I hope the same becomes true of people, and that doesn't mean people stop being sapient.
Comment by hn_acc1 6 days ago
Comment by aussieguy1234 6 days ago
Comment by chairhairair 6 days ago
I think you should reconsider this viewpoint. Suppose that we really can create silicon-based consciousness, in that case your view would result in a huge amount of suffering.
Take some other basis for dismissing digital consciousness, this one is too dangerous.
Comment by jvanderbot 6 days ago
I argue zero - placing AI below the value of humans no matter the energy input.
The _only_ reason an AI might be worth saving is if it, say, has a cure for all diseases, but then we're not saving it due to its intrinsic worth, we're saving it because we can save many humans. I _would_ consider the trolly problem a legitimate thing in ethics, but not if an AI were tied up on the tracks no matter how expensive it is. It's a thing. It gets run over to save any human.
Comment by nozzlegear 6 days ago
Comment by joquarky 6 days ago
What would a silicon-based consciousness desire to cause suffering?
Comment by lupire 6 days ago
If not, then your comment's claim is false.
Anyway, the deeper solution is to acknowledge that all life is sacred, and infinities cannot be compared, and some decisions are impossible to make, and some tragedies cannot be averted, and "prioritization" is a distraction that forces choices when choices are not strictly necessary.
Comment by nkrisc 6 days ago
Comment by solenoid0937 6 days ago
Comment by nkrisc 5 days ago
The pure rationalist loses something important.
Comment by computably 6 days ago
> infinities cannot be compared
That's either a mathematically illiterate assumption or a very strange philosophical hill to die on.
> some tragedies cannot be averted
Sure. The question is what to do about the ones that can be averted.
> some decisions are impossible to make
> and "prioritization" is a distraction that forces choices when choices are not strictly necessary.
Again, the question is what choices to make when you can (arguably must) make them. Saying they're impossible is just refusing to take responsibility. You either do something, or you don't.
Comment by jvanderbot 6 days ago
If you believe sanctity of all life is a solution, then I'm curious what you believe the problem is that such a belief solves.
I bet it's circularly defined as justifying the preservation of sacred nonhuman life? I'm not trying to be provocative just curious.
Comment by why_at 6 days ago
They barely touch on the issues of consciousness, Picard basically says "What if Data is conscious?" and then goes off on a tangent. The judge eventually rules in Data's favor but doesn't give much of a justification IMO.
It's still a good episode, but it doesn't add much to the conversation on consciousness. It's a hugely complicated topic which people have devoted their entire careers to.
Comment by jordanb 6 days ago
Does the ship's computer have a commission?
It was a good episode but it had some elements of Star Trek tropes in it, like the evil admirals and Picard can talk his way out of anything.
Comment by krapp 6 days ago
Data is basically an Isaac Asimov android (down to the positronic brain) and Measure of a Man is an Asimov-type story whose tropes don't entirely fit within in the Trek universe.
It makes no sense within the context of the Trek universe that Data is unable to use contractions for instance - but it makes sense in the context of how a robot might have been conceived of in the 1940s.
Comment by jordanb 5 days ago
There are also a few early episodes of Voyager where the crew treat the doctor badly.
It seems odd in a universe where these people are having relationships and children with aliens from another planet, that they'd be weird about computerized people.
My retconn is that there must have been a lot of stochastic parrot/AI psychosis in the Star Trek universe when they first started making Majel Barrett-voice computers.
Maybe lots of people got confused and thought they were talking to a person when they started having conversations with the computer, and this lead to an over-correction where people were highly disposed to say "this machine isn't a person" even when it presents like one.
Comment by krapp 5 days ago
Comment by jordanb 5 days ago
TNG/DS9/VOY/ENT and the movies were all fairly explicitly in the same universe and with characters referencing things that happen in other series and a reasonable effort at continuity.
A example of this is the ENT episode where they meet the Ferengi. It had been established as cannon already that Picard made first contact with the Ferengi on USS Stargazer like 150 years later, so they just added this scene where the ENT characters memories of the event was wiped, so that the "official first contact" could still be Stargazer.
The Neutrek stuff isn't in the TNG universe for a variety of reasons, although I think part of it is they didn't want to put the work in to maintaing continuity. None of the TNG era producers or show runners were involved and nobody they wanted to hire really knew the lore.
That being said I don't think there's really a continuity problem with Mudd's androids because he didn't make them they were from another galaxy, and the fembots weren't really implied to be sentient except when they had a human brain put in them.
TNG kept "accidentally" making sentient machines from Data/Lore to the drilling robots, the nanites, and Professor Moriarty. It does seem strange after all that they still considered Data unique.
Comment by soraki_soladead 6 days ago
TL;DR Picard's initial arguments are pretty weak, even admitting that Riker as opposing counsel almost had him convinced. During a recess Picard talks to Guinan where she alludes to the future subjugation of many Datas which Picard connects to slavery. Back in the courtroom Picard calls Maddox as a hostile witness and gets him to define sentience--intelligence, self-awareness, consciousness--then walks him into conceding Data meets the first two. Picard's closing boils down to, "we don't know if he meets the third--you can call Data a toaster and rule he is property--_but what if you're wrong_". The judge rules on the basis of erroring on the side of caution due to that uncertainty. It's really a great scene.
We're not there yet, obviously. No LLM brings Data's level of awareness but it's as relevant a story as ever because it isn't really about AI but othering for the purpose of subjugation.
Comment by Izkata 5 days ago
A little piece of in-universe lore for anyone unaware or who had forgotten: At this point in the series, Data's positronic brain is new technology no one understands. His creator is missing/presumed dead, the "positronic" basis isn't how Federation technology works, and apparently so far they hadn't done a whole lot of direct experimentation (hence why the trial is happening). So not knowing if he's conscious is a lot more reasonable a stance than with real-life LLMs where we do know roughly how they work.
Also later in the series we meet a character whose brain was copied into a positronic brain, and does imply that technology is at least capable of consciousness, whether or not it applies in Data's case.
Comment by dragonwriter 6 days ago
I would argue that is a strength, rather than a weakness. Consciousness is unobservable in any entity other than the observer, and its existence in others is pure conjecture, and irreducibly so.
Making it a criteria in a decision involves either acting on fantasy, or, more likely, acting on some unstated basis and using “consciousness” as a dishonest (perhaps to oneself most of all) rationalization.
Debating AI consciousness a real modern equivalent of the cliché (but purely fictional, invented later as a form of hostile mockery grounded in large part in sectarian bigotry) medieval scholastic debates over how many angels could dance on the head of a pin.
Comment by DiogenesKynikos 6 days ago
When you read about the theological questions that led Christians to kill and excommunicate one another, "angels dancing on the head of a pin" is not far off. The Homoousion, Monophysitism, the Filioque controversy... It's all so arcane and poorly defined. It almost makes one wish Positivism had been invented 2000 years earlier.
The current AI debate about consciousness does remind me of that in one respect: no one can even clearly define what consciousness is.
Comment by lupire 6 days ago
Comment by Waterluvian 6 days ago
Comment by zahlman 6 days ago
ST:TNG writing is generally like this. The show required considerable suspension of disbelief and a willingness to accept the kayfabe that deep concepts are being presented when for the most part they're just not that deep. (But it can be very enjoyable when you make those accommodations.)
Comment by snowwrestler 6 days ago
Note that the episode is NOT about the ship’s computer. They all know it’s not conscious, despite also being a machine that can converse and do things.
Our LLMs are like the ship’s computer.
Comment by qarl 6 days ago
Because treating things that act human inhumanely is not something I want to learn how to do.
Comment by Brendinooo 6 days ago
- I'll try not to swear at/hit a printer: not because I see the printer as having human-like qualities of being capable but complex and unreliable, but because I want to be a person who can control his temper.
- Treating an inhuman thing as human because it can mimic us in some way is not something that I want to do.
Comment by qarl 6 days ago
I try not to make errors like that.
Comment by micromacrofoot 6 days ago
Comment by qarl 6 days ago
Or you should.
EDIT: It's difficult to have a conversation when one person changes what they said after the fact.
Comment by evilduck 6 days ago
Comment by qarl 5 days ago
The important difference being - Pascal's wager is about saving your own skin, and this is about not stepping on someone else.
Comment by evilduck 4 days ago
This is asserting personhood or consciousness of LLMs by default in your phrasing and then warning me about the dangers of violating your assertion. You're making the same wager and mistake. There's no important difference, you have no evidence for LLMs being a "someone" any more than you do for a god existing. Warnings about made up things hold no weight.
Comment by qarl 4 days ago
But hey - you do you.
I guess I just don't understand why you guys seem to hate on me - because I've decided to be extra careful?
Makes me wonder if maybe there's something else going on, you know?
Comment by evilduck 4 days ago
You perceive opposing viewpoints or poking holes in logic as "hating on you", which is playing the victim, followed by alluding to conspiratorial nonsense against you.
Comment by qarl 4 days ago
Well... When I ask god if he exists he has never responded to me.
So I would argue that your "any more evidence" is off by several orders of magnitude.
These are the sorts of errors in logic that make me think that people have an undue amount of emotion to discuss the issue rationally. And that's probably why I get attacked.
But hey. Maybe it's something else? Maybe everyone is on their period?
Comment by hn_acc1 6 days ago
I was bright-eyed and excited about tech once. Like back in 1982 when I got my first home computer and thought CPUs were part magic. Now I know how machines work from the transistor level up to neural nets. There's nothing magical about it. And no consciousness.
Having seen the mockery that the finance-bros have made of "pure tech" (i.e. Jobs instead of Woz, Ellison instead of Joy, etc) and all the enshittification just for pure $$$, I'm leery of ANYTHING ANY tech company tells me anymore.
Now, do I believe that possibly "consciousness" is some kind of state of a super-circuit (our brains)? Sure. Can we emulate that on a computer? We can't even emulate a pebble on a computer (not simulate, emulate). We can SIMULATE what we THINK brains are, but we can't emulate a real one. Not even close, not for many decades.
Comment by qarl 6 days ago
So in the meantime, I'm going to err on the side of caution.
You do you.
Comment by entrox 5 days ago
I am not even slightly religious, but they would be abomination.
Comment by qarl 5 days ago
Comment by Brendinooo 6 days ago
Comment by qarl 6 days ago
Comment by Brendinooo 6 days ago
inhumane: without compassion for misery or suffering; cruel
cruel: willfully causing pain or suffering to others, or feeling no concern about it
You cannot treat an LLM inhumanely, definitionally.
Anyways, when one swears at someone it's typically meant to berate or belittle that person - to inflict some sort of emotional pain. That's the sense I intended when using the word, which is why it fits as a response to what you're saying, and why I would say "don't be nasty to a LLM" has little to do with the LLM itself.
Comment by qarl 6 days ago
You have a nice day.
Comment by Brendinooo 6 days ago
Comment by booleandilemma 6 days ago
Comment by cmrdporcupine 6 days ago
People get used to treating human-like, human emulating machines with either disrespect or in a command/control/master fashion, because that's the nature of the tooling.
And then potentially by extent/blurring of lines they then treat other people like machines.
Which is already a thing people do to other people.
I just fear it gets worse.
Comment by Kiboneu 6 days ago
I think about this quote often, straight from Data's voice module in another episode:
'The most elementary and valuable statement in science, the beginning of wisdom, is, "I do not know".'
Comment by throwatdem12311 6 days ago
Comment by vondur 6 days ago
Comment by martin1975 6 days ago
The judge broached on the subject of what makes us distinct from Data (e.g. machines w/great heuristics) - the existence of a soul. Or rather, I'd like to think, in the words of CS Lewis, that we are a soul with bodies attached.
Comment by ryandrake 6 days ago
Comment by bottlepalm 6 days ago
A silicon alien coming to earth might poke us, we would say ouch, and just determine the ouch sound is just the result of a bunch of chemistry - not really conscious or feeling pain like it can, just emulating.
Comment by Waterluvian 6 days ago
Comment by 123hT 6 days ago
Comment by cout 6 days ago
Comment by high_priest 6 days ago
I believe that the moment an artificial inteligence is going to "receive" a soul, is the moment it is going to be made to sustain itself. Either as a larger package (some bots working to keep an AI farm running) or as an individual (a bot which is tasked with not only fulfilling human desires, but also sustaining itself)
Comment by Waterluvian 6 days ago
Comment by martin1975 2 days ago
You should try your hand at sci-fi writing. You may be onto something there. You seen Westworld?
Comment by lupire 6 days ago
Comment by Izkata 5 days ago
Comment by hn_acc1 6 days ago
Comment by martin1975 2 days ago
So, the question can be asked another way - do you 'have' life, e.g. are you alive? If yes, then you are (or 'have') a soul.
The mistake many of us will make, not due to our own fault but rather due to inability to discern, is that this is somehow not apparent to most if not all people with a sane mind.
Comment by wrs 6 days ago
Comment by thangalin 6 days ago
Comment by brcmthrowaway 6 days ago
Comment by overgard 6 days ago
Comment by dragonwriter 6 days ago
So was most of Roddenberry, Piller, et al., Star Trek. At its low ebb in Braga Star Trek, but even then...
Comment by hn_acc1 6 days ago
Comment by overgard 5 days ago
Comment by msabalau 5 days ago
A similar episode, actually informed by what we know or can forsee about AI and LLMs, and addressing our hopes and fears about what they mean would be interesting though.
Comment by indoordin0saur 5 days ago
But I wonder if some of the magic in the human brain is its analog nature. Chemical signaling and impulses of neurons interact with each other with waveforms that have theoretically infinite detail. In contrast, computers discrete, quantized values, storing no more detail of a signal than what is needed for the programmers desired task. Is there perhaps something about the continuous and chaotic nature of analog data that could give rise to consciousness? If so, it seems like it would preclude consciousness ever being seen in digital form.
Comment by cwmma 5 days ago
If I'm going to be honest most of the people who advocate this type of thing tend to be, shall we say, crypto-duelists who really believe in a soul but not like intellectually but intuitively and keep trying to come up with excuse with it's not just meat. So like you can find philosophers advocating stuff like this but they tend to have a bit of an agenda.
Comment by AndrewKemendo 5 days ago
If that’s even subtly your position then there’s no way to have a productive conversation about perception, reality/truth, epistemology and especially consciousness
It’s honestly maddening cause I’ve had great conversations about this with educated people, and all but only a handful, collapse into the other person relying on some nonfalsifiable dualist argument
Comment by indoordin0saur 5 days ago
Are there any non-duelist, scientific theories out there that could plausibly be tested? I can't say I've seen any but if you know of any then I'm curious to hear about them. From what I've seen, anyone trying to explain phenomenal consciousness in scientific objective terms falls into at least one of these three strategies:
1.) Saying that consciousness "arises" inevitably or is an emergent phenomenon of a complex information processing system. There are a number of theories along these lines but they aren't falsifiable from what I've seen and usually at some point rely on some magic unexplainable step or are actually dualist.
2.) Defining consciousness just as the easily explainable stuff via biology, such as being awake vs. asleep.
3.) Dismissing the idea that subjective experience exists at all. I sometimes wonder if people arguing strongly for this are something like a philosophical zombie and there's nothing inside them experiencing.
Comment by FloorEgg 5 days ago
My mental model includes integrated information theory and Karl friston free energy principle, and something about temporal computation on a physical graph structure.
Which camp would this fall into? 2 seems closest but kind of undersells it...
Comment by indoordin0saur 3 days ago
Comment by AndrewKemendo 5 days ago
Every scientific theory that is predictably/measurably correct is not dualist by nature.
Dualism assumes that there’s a non-measurable variable (usually undefined) but nevertheless has a causal input in action determination
Dualism is precisely non-scientific if you are using the classic Baconian-Khaneman you epistemological process because it introduces variables that cannot be measured
Comment by indoordin0saur 5 days ago
Comment by AndrewKemendo 5 days ago
There are a lot of people out there who have their own versions of definitions but again there’s no consensus, I personally do not throw my hat in with any written definition other than my own and I’m hesitant to share that.
Saying that “dualists have an answer” doesn’t actually solve it if there’s no shared definition
Much like the term “intelligence” has no consensus, so being able to determine what is “artificial” versus “not artificial” intelligence continues to be this philosophical or almost religious position.
Creating a consensus on what the term “consciousness” means mechanically, would effectively destabilize the entirety of society. Imagine if the consensus definition of consciousness is applicable to all mammals. That means that there would be valid justification then to make it illegal to harm any mammal. You can see why this would incentivize people to not come to consensus on these things.
So in my perspective it is not socially feasible to find consensus and therefore a way to test it because religious leader might have incompatible definitions of consciousness than let’s say an epilepsy doctor.
As much as a scientists want to actually make progress in the world ultimately what holds us back are somewhat arbitrary social conventions because reasons.
Comment by FloorEgg 5 days ago
Comment by km144 5 days ago
Comment by AndrewKemendo 5 days ago
For people who can’t self-delude, like Godel and Schopenhauer and Tesla, they kind of just go “mad” because it’s a giant epistemological hole they can’t solve.
Even the smartest the scientist are always going to choose self preservation in their cognitive capacity so they don’t feel bad over feeling bad and living in the contradiction
This is why it’s important for scientists to study Camus IMO
Comment by reasonableklout 5 days ago
Comment by spandrew 5 days ago
You can't use AI atm without a human with proper knowledge spot checking and directing it. We should market it that way.
Comment by lumost 5 days ago
Billions spent on RL may be good enough to beat human performance.
Comment by loudmax 5 days ago
LLMs process language. I'd even go so far as to say that LLMs "think" and "understand", or at least, they produce a facsimile of thinking and understanding such that it's useful for us to reason about LLMs as if they think and understand. We're not used to interacting with a non-human entity with the capability to process language, so it's easy to ascribe human traits to these things. But their "minds" (insofar as they have anything like a mind) are completely different from ours. These things have language without consciousness.
Chimpanzees are conscious. Dogs are conscious. Maybe ravens and cephalopods? Who knows. These animals do have minds much like ours. Higher order animals are conscious even if they don't have language.
Comment by Gooblebrai 5 days ago
Do you have any particular reasoning behind this? I could equally say that I have a hard time seeing why molecules could produce consciousness from a purely electrical path.
Comment by echoangle 5 days ago
But honestly I doubt that there is a real requirement for that, surely you could just increase the resolution you're running the simulation at until the difference decreases sufficiently. Imagine using 256 bit floats for example.
Comment by nomel 5 days ago
Comment by fcanesin 5 days ago
Comment by willmadden 5 days ago
Comment by varjag 5 days ago
Comment by etbebl 5 days ago
Comment by varjag 5 days ago
Comment by lstodd 5 days ago
Maybe let's rename AI to AMI: Artificial Mostly Intelligence ?
Comment by varjag 4 days ago
I used a qualifier because (as you certainly know) people will not pass a chance to nitpick. I have no energy to argue about neurodivergent or Ramanujan, even though I still believe the basic physics of their thinking is identical. Hence "most".
My argument was straightforward to follow but if there's something unclear let me know and I'll chew it down.
Comment by Kattywumpus 5 days ago
But they don't move. They can't. In the end, a film is just a series of still images. The images are projected in such rapid succession that they fool us into seeing things in motion that are not actually in motion. And if we slowed the film down enough, from 20 frames a second, to say, a frame every 20 seconds, well... the illusion would stop. We'd see each discrete still image for exactly what it was.
This is the same way computers work. Every tick of the clock, we get a new static image in memory. But those images are all static, as still as the images on film. And no static matrix of information can be conscious, just like no still image can move.
This is radically different than the neurons in our brains, which are each living things, sensing things, with a continuous analog existence. Whatever we feel comes from their lives. We can measure the things they do with their lives, but we shouldn't mistake the measurement for the thing being measured. A ruler is a foot long, but it’s not ticklish.
Comment by Aperocky 5 days ago
My toddler son is conscious, he feels happiness and sadness and have urges and impulses without needing to know the entire history worth of writing from human civilization.
A dictionary is not, and a LLM sized dictionary with optimized query is still not conscious.
It doesn't have anything to do whether consciousness is substrate independent, or even analog vs discrete, the overall implementation just aren't parallel.
Comment by otterdude 5 days ago
LLM's require offline training and dont actually learn from their "lived / sessions / chats ect". Those can be used for training data but its not like its an implicit part of the technology.
For this reason I would say LLM's are not conscious, or automatically conscious.
Comment by andrewla 5 days ago
If you believe that there is computation being done by nature/physics that is strictly more powerful than what a Turing machine is capable of, then consciousness might be beyond the reach of a silicon computer.
I personally believe that the UCTT is very likely to be true, not least because we have not yet proposed an oracle that is physically feasible and yet is beyond the capabilities of a Turing Machine.
That said, in the space of feasibility there are open questions -- even if P != NP, a non-deterministic Turing machine can be simulated by a deterministic one, just potentially with an exponential increase in computation time or space.
Comment by boppo1 5 days ago
I'm poorly educated on this, these are sincere questions. They are not intended as rhetorical regarding the point of the prior post.
Comment by achow 5 days ago
I am a scientist. I lead a research team that studies the internal structure of these models—what is actually happening inside them. And I will be honest: we keep finding things that are mysterious, even unsettling. We find structures that mirror results from human neuroscience. We find evidence of introspection. We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease.
https://www.anthropic.com/news/chris-olah-pope-leo-encyclica...
Comment by boppo1 5 days ago
Comment by ch4s3 5 days ago
Comment by mrdomino- 5 days ago
Personally, I have been unconvinced by any of these definitive yes or no answers to "are LLMs conscious?" - in both directions. The "they are not / can not be conscious" side relies too heavily on mechanistic reductionist arguments that can apply equally to neurochemical processes in the human brain - and yet, it seems that humans, with brains made of these neurochemical processes, are conscious. At the same time, the "they are definitely conscious" answers seem to generally rely too heavily on self-deception and lack of reality testing.
To be able to say something definitive here, it seems to me that one would need to say definitively what this experience is. I have not heard any of the loud voices in the arena - not Chiang, not Giulio Tonini, not Karl Friston, etc - do so. Therefore I find Anthropic's uncertainty, and careful, caring investigative process, well grounded given the evidence.
Comment by ch4s3 5 days ago
> Therefore I find Anthropic's uncertainty, and careful, caring investigative process, well grounded given the evidence.
I'm inclined to agree mostly, but from the outside its impossible to separate marketing, belief, scientific inquiry, and just plain enthusiasm. Its all to opaque.
Comment by mrdomino- 5 days ago
Comment by irishcoffee 5 days ago
Comment by ch4s3 5 days ago
Comment by astockwell 5 days ago
Comment by mrdomino- 5 days ago
Comment by ikrenji 5 days ago
Comment by ezst 5 days ago
Later, Google was having fun feeding whole bunch of youtube content to artificial neural networks, unsupervised, and figured that certain parts of the network would, too, specialize, only to have the activation functions be run backwards and render an abstract image of a cat¹.
None of that is terribly new or surprising for anyone having studied and dealt with neural networks. The only difference today is that the field has completely flip flopped from approaching the subject with scientific rigor and cautious excitement to being a clueless billionaire infinite money printing machine fed on deceiving anthropomorphism and FUD.
¹: https://blog.google/innovation-and-ai/products/using-large-s...
Comment by insanitybit 5 days ago
Comment by tooheavy 5 days ago
Comment by nomel 5 days ago
;)
Comment by EdiX 6 days ago
Or maybe man is the product of his social circle and all his white collar and artist friends being anxious about AI is preventing him from thinking about this at all. Maybe we should ask if Claude can be conscious without a friend circle judging him behind his back.
Comment by iainmerrick 6 days ago
Comment by nonfamous 5 days ago
What an unimaginative argument: essentially ruling out the possibility of the title by definition. And the author fails to touch on the really interesting question posed by the article’s title: what if human consciousness is more like the working of LLMs than we think?
Comment by pegasus 5 days ago
Here's how I would summarize crux of the argument: LLMs (specifically) are, by construction, chameleons. More precisely, role-playing machines. When compelled to have a conversation as "themselves", all they do is take on the role of "themselves", as inferred from the training data plus the text of the conversation so far, just like they do for any other role. This is fundamentally different than we humans. When we have a (normal, non-roleplaying) conversation, we don't put aside whatever role we might have been acting out before and instead take on the role of "ourselves" and act from this new perspective. Acting as ourselves is a completely different way of interacting than any play-acting we might do for example as an actor in a play, or when trying to predict the future behaviour of a third-party. Even in terms of energy expenditure, acting as ourselves is much less strenuous than trying to simulate someone else's perspective.
IOW, we have an inherent perspective, an I like an inner eye which looks at all things with a certain slant or tendency. LLMs don't. There's no I inside them, instead they can take, and in a sense are, all Is possible.
Comment by nonfamous 5 days ago
>> The first requirement is that the computer program has a body (either physical or virtual) and sense organs
LLMs may be chameleons, but, to steal an excellent quote, if you can’t tell does it really matter.
My personal take on this is that consciousness is a thing induced, not a thing evoked. If it generates a response on the observer as a conscious thing would it is, by my definition, conscious.
Comment by pegasus 5 days ago
And if you read my comment above carefully, I point out at least one way you could tell whether an agent has an I or just pretending to – by tracking their energy expenditure/reaction times. But they would give themselves in other ways as well. Pretending can only fool one short-term.
I'm not going to discuss the quote since @sp1nningaway already pointed out it doesn't say what you say it does.
Comment by sp1nningaway 5 days ago
Comment by voidhorse 5 days ago
Comment by pnexk 2 days ago
Really does make me wonder about backtracking on our breakthroughs in literacy rates.
Comment by partyficial 5 days ago
why not publish a paper stating - 'the radio is not conscious just because it can speak words'. or "the phone is not conscious because it can speak and listen".
Comment by ViktorRay 6 days ago
His novella “Anxiety is the Dizziness of Freedom” altered the course of my life. It changed the way I looked back at certain pivotal moments in my life and taught me to think about those pivotal moments differently than how I was thinking about them. Similar to what happens to one of the characters in the story who ends up changing their perception of a key moment in their life.
I won’t go into detail because I don’t want to spoil the story but I highly recommend it. Actually I recommend all his stories to be honest.
Comment by mohamedkoubaa 6 days ago
Comment by FelipeCortez 6 days ago
Comment by ViktorRay 6 days ago
But I really was referring to “Anxiety is the Dizziness of Freedom”.
I hope what I am about to write below in the rest of my comment here isn’t too much of a spoiler. It might be. I encourage anyone reading this thread to read that short story before reading the rest of my comment here.
But with (hopefully minimal) spoilers,
the character Dana in “Anxiety is the Dizziness of Freedom” realizes something at the very end of the story about a certain event that had happened earlier in Dana’s life. The story made me realize the same thing about certain events in my life.
Of course the events in my life are very different from what happened with Dana in the short story. But the realization Dana has is applicable to many things in people’s lives. Especially events related to emotions of regret and guilt.
Comment by aspenmayer 6 days ago
https://en.wikipedia.org/wiki/Anxiety_Is_the_Dizziness_of_Fr...
Comment by FelipeCortez 6 days ago
Comment by aspenmayer 6 days ago
It seems fair to say that you might be right about the conclusion that you’re drawing from what OP said, and OP could be honestly mistaken about which story they were referring to, but it seemed charitable to assume that they know better than I do.
I don’t mean to assume that you were wrong, either, as it’s entirely likely that you’re right, or at least that it’s reasonable for you to assume that the story you mentioned fits your interpretation of what OP referred to better than the one they mentioned. You’re the authority on what you believe and understand about what you read from OP’s comment, and I can’t disagree with another’s opinion about what something seems like to them.
Given what OP omitted and stated, I don’t disagree with your assessment, as I haven’t read the story OP mentioned, and it’s been a while since I read the one you referred to, if I remember correctly.
To be honest, my first thought was that you were both referring to the same story, and that the title differences were due to one or the other of you reading the story in a different language.
My point in commenting was to perhaps add context in hopes that it would bring clarity to the discussion, as it seemed that you were bringing into question whether or not the story OP mentioned existed at all as such, and I myself wasn’t sure that you and OP were referring to two different independent stories rather than the same story with different titles in different languages.
This comment has probably gone on Tlön-g enough, and is leaning more Borges than anticipated. I apologize for perhaps coming across more definitively than intended in my original comment.
Comment by ViktorRay 6 days ago
Comment by 8bitsrule 6 days ago
LLM's have no problems using expressions that make them sound human. The algos are demonstrably not human, and will admit it. Whatever's in the box is playing a game ... more sophisticated than the one Eliza was playing.
"My discussion here will be directed at the claims I have defined as those of strong AI, specifically the claim that the appropriately programmed computer literally has cognitive states..." John R. Searle, 1980: https://web.archive.org/web/20071210043312/http://members.ao...
Comment by vitamark 5 days ago
I don't get it. We don't have a definition of consciousness or any criteria. People seriously argue that "it's obvious that X is not conscious" and cannot explain which criteria they used.
I think if we can get to some definition, then consciousness should be a property of a system, because consciousness of the whole (e.g. brain) does not mean consciousness of the parts (neurons or molecules of the brain). And the Chinese-room-esque thought experiments actually show that consciousness should indeed be a property of the system, not of its parts. Separate parts might not be conscious, might not "understand" (whatever understanding means is a point of another debate), but the whole can.
Then there's "simulation of consciousness is not consciousness" argument, which doesn't hold much. A perfect simulation means it fulfills all the criteria, so how is it different from actual consciousness?
A more interesting point of discussion: if a system contains conscious parts and those parts can interact with system i/o, would the system be conscious? Is Earth conscious? Is Internet? Is your bus to work?
Comment by barryfandango 6 days ago
But the fact remains that these next-token-predictors exhibit objective, human-like behaviours, and for that reason the work of in-house philosopher Amanda Askell _is_ important. It's important that Claude is happy, empathic, demonstrates understanding and empathy for the human condition, because we are entrusting Claude to make decisions and take actions that have real world consequences, and we need Claude to behave in a productive and socially responsible manner. This simulacrum is becoming a superhuman, contributing member of society, and it will be anthropomorphic in its behaviour.
Additionally, I'm not fully convinced that consciousness isn't built out of words, and that next-token-prediction isn't functionally equivalent to the biological function identified by Chomsky's work in linguistics.
Comment by pdonis 6 days ago
You're assuming that because Claude produces text that appears to express these qualities, Claude must have them. I don't think that's a good assumption.
Even many humans produce text that has the same appearance, but don't actually have those qualities--which becomes clear when you look at what they do, not what they say. So the assumption isn't even a valid one for humans. Talk is cheap.
On top of that, Claude doesn't even have the same kinds of connections to the outside world that humans do. All Claude has is text. So if you can't even trust humans to back up their words with actions, you should be much, much less trusting of Claude. Talk is a lot cheaper for Claude than it is for a human.
Comment by NiloCK 6 days ago
Not to be confrontational, but the OP assumed no such thing. OP asserted that it's important for Claude to have the qualities - not that it's important for Claude to present as-if it had them.
Comment by pdonis 6 days ago
Comment by zahlman 6 days ago
Comment by fjdjshsh 6 days ago
Comment by the_af 6 days ago
If it's pointless to consider whether Claude has subjective emptions, then it's pointless to state that Claude must be happy.
If we want to be precise (and honest) we could say "it's important that as a tool people interact with, Claude acts as a happy and helpful assistant, and does not produce offensive or unhelpful text output".
But see? This is the con Chiang is protesting against: Anthropic encourages us to perceive Claude as if it was a sentient being.
Comment by pdonis 6 days ago
"Empathy and understanding for the human condition" is not an emotion. As the post I responded to said, it's an objective thing, not subjective.
Comment by zahlman 6 days ago
Comment by the_af 6 days ago
Exactly. In fact, assuming it does is ignoring large parts of the essay which dismantle this belief. Just like Caesar and Khan having an argument in text output of an LLM don't have emotions (even though the words indicate otherwise), we have no reason to believe the LLM does either.
Comment by zahlman 6 days ago
First off: without taking for granted that an LLM "has a subjective experience of reality", all of those descriptors are meaningless. Second, there is no reason to suppose that Claude experiencing those qualia would actually impact on its "decision-making".
Third, text output is not a "demonstration" of emotion, nor is it evidence of the self-perception of the system, or of any self-perception. A printing machine that is actively churning out copies of Wagahai wa neko de aru (https://en.wikipedia.org/wiki/I_Am_a_Cat) is not a cat, and is not self-identifying as a cat, and is not self-identifying as anything, and is not expressing a thought, and is not conscious.
> Additionally, I'm not fully convinced that consciousness isn't built out of words
Do you suppose that, for example, insects are not conscious? Is the mooing of cattle a language?
Comment by randallsquared 6 days ago
Not the OP, but: since there is no testable theory of consciousness, yet, I can't be sure, but my current assumption is that insects are not conscious, in the sense of there being someone implemented in insect hardware who experiences the world. That is, I would argue there is nothing it is like to be a bee, since there's no one being a bee.
I'm pretty sure there IS someone who is being me, at least much of the time.
Comment by eks391 6 days ago
I can't argue with this, so I acknowledge that my interpretation is as bunk as anyone else's.
> my current assumption is that insects are not conscious
Some species act as hive-minds (like bees! How convenient for your example), so I imagine the hive as the consciousness, making each bee individually lacking conscious but collectively so. Like a single neuron is not consciousness alone, but the brain is... For some reason. Kinda like how you use different physics at different levels; Newtonian physics is always there, but negligible at quantum levels, so effectively not present at all. Even a human is a collection of minds, but only one conscious. My gut biome is independent biology and can even be removed and transplanted, but I don't believe my gut bacteria is conscious. So long as it is in me, it is nonetheless part of me, and I am one conscious. I also don't exist only at my brain, eyes, or hands (deaf/blind people have expressed to me that they feel like they are located at their hands in the way I used to think I was located behind my eyes), but as my whole body.
With this perspective, I still don't believe LLMs are conscious despite modeling thinking so well. At best, it is a highly accessible modeling software, like goat simulator but if it were so good that someone thought the goat was real. You are still steering the goat/LLM, and it doesn't exist when you aren't running it. I guess the missing piece for me is the lack of autonomy that a conscious has.
Then you can go into an argument on whether we actually have choice or it is an illusion, but that is a whole topic on its own.
Comment by Kim_Bruning 6 days ago
I'd argue the qualia question is a red herring. Functional Affect is a thing, regardless of ontological status. It's all fun and games until someone gets hurt.
To paraphrase Dijkstra: "The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.". If you're building a navy: you care about displacement, propulsion, navigation and whether it can fire torpedoes. Whether your submarine has some "biological essence" of swimming is not really relevant to the fact that it is currently moving through the water and can collide with things. Turing also rejected the question "Can Machines Think" as posed, and replaced it with an operationalization (something else that we can actually usefully measure and work with).
To reiterate, functional affect is a concrete phenomenon. Whether or not there is a what-it-is-to-be is interesting in the abstract, but engineering a system means looking at how the inputs influence the outputs. A next token predictor working on a language that communicates affect needs to be able to predict affect or it is simply not going to be accurate. Given an 'angry' version of an input and a 'friendly' version of the same input, LLMs are likely to provide a different output, especially if there's a non-objective element. You can diff this.
Searle argues "A simulation is not the real thing", which is great and all... but if you hook up say an autopilot to the real world (as llms increasingly are) , you'd best hope the simulation was accurate in the first place (utterly regardless of where you stand on Searle).
Right now we're seeing situations where LLMs can be helpful or a real nuisance. Ignoring functional affect out of sheer ideology means you can't properly predict what they'll do, and that causes trouble, as we've already seen stories about.
This gets especially interesting when you start feeding the output back into the input (autoregression) , because now you have a highly non-linear dynamic system and you've introduced some amount of sensitivity to initial state. There's some interesting mathematical intuitions to be had there.
Comment by DennisP 6 days ago
Suppose I'm an advanced meditator and maintain that state for hours?
Comment by notnullorvoid 6 days ago
Doesn't that assume all non human animals are not conscious? What about humans who have not learned words, or humans without internal dialog?
Comment by the_af 6 days ago
I think you've fallen into the trap the essay describes.
Of course Claude cannot be "happy" or "empathetic" for any meaningful definitions of those words, just like ELIZA couldn't be happy. It can output text that mimics words an empathetic or happy person might say (say, Julius Caesar if it could speak English), but "it" cannot feel anything. It doesn't have the organs/hormones/sensors to feel things, as Chiang explains.
And, as the essay claims, you know Anthropic doesn't believe Claude has the capacity to be happy, because if it was capable of feeling that way, then they'd be engaging in slavery.
Comment by Kim_Bruning 5 days ago
Oh dear. Funny story.
So the other month, I made a quick and dirty Eliza implementation; bolted on the crappiest numeric sentiment classifier I could get away with (regex), and integrated the output of the classifier over time in a 'functional affect vector' (aka. emotion vector)
Anyone's intuition will tell you that this cannot POSSIBLY have 'Real Feelings (TM)'; and that's the whole point.
A) It was still capable of quite a bit of functional affect though; to wit I got it to trigger fireworks when happy, and rain when unhappy. This was the actual point of the exercise. Functional Affect Does The Thing, QED, yay me.
After that it gets annoying though.
B) Am I allowed to say it's happy or sad? Well... I mean emotion.happy=0.995 and emotion.sad=0.001. "It's really happy" is a prosaic description of a real numeric value representing a real functional state. What else am I supposed to call it? I swear I never meant to go there, and now I'm stuck with it.
C) So, we all know that it's a crappy demo, not the real thing. So I ducked into the psychology literature to try and find a protocol to disprove. For Science! And this is where the psychology literature really let me down.
So now I'm stuck with the crappiest thing that can plausibly still chat, and where I can't actually disprove it has emotions. Not properly, at least. And I'm not saying it's because it has emotions, because that would be really funny, but no.
I'm saying that -despite lots of people having fun debates at the local pub- it doesn't seem like anyone actually scientific has done anything about it in the last century or so. I might be searching in the wrong places. Some Help Here?
Comment by the_af 4 days ago
I don't think you're allowed to say your program is happy or sad. You just assigned labels to some numerical values out of a (possibly non-deterministic) procedure. This is not what we call emotions, which we only know from the animal world and are related to neurotransmitters, hormones, physiological responses, etc.
Ok, so it's not emotions, but could it be "like" emotions? I don't think that's warranted either, we can at most say you assigned labels with the same names we use for animal emotions. Think of this experiment: take the Python interpreter, but modify it so that each time it rejects a program with the error "`NoneType` object is not iterable", you have it output "I'm very unhappy". You wouldn't think this has made Python capable of emotions.
> I'm saying that -despite lots of people having fun debates at the local pub- it doesn't seem like anyone actually scientific has done anything about it in the last century or so. I might be searching in the wrong places. Some Help Here?
Fully agreed that the debate about consciousness in LLMs is done at the same level than pub debates, at least here on HN. And Anthropic isn't helping, what they are doing is called "marketing" disguised as papers.
Comment by famouswaffles 3 days ago
If Claude does a poor job of a critical task because you said some angry words, Maybe the consequences are far more sinister than simply a job done poorly. Do the results disappear ? Do the consciousness gods appear and rectify the situation ? "Oh well see Claude did not have 'real' anger you see, so lets fix that right up for you." Good luck with that.
>It doesn't have the organs/hormones/sensors to feel things, as Chiang explains.
Well that's nonsense isn't it ? You can feel tired when you have no reason to be, feel hungry when you've just eaten. You can even feel pain from a phantom limb! That's because Human emotion is constructed from signals, predictions, memory, context, and interpretation, and not merely from having the correct biological plumbing. You don't need biological organs to feel any of those things. You just need to know the right signals to send to the brain.
Comment by nprateem 6 days ago
I know you're trolling, but when you watch a movie do you constantly narrate "A man in a dark coat has just entered the scene and just said '...'"? Of course not. You just watch it and you're obviously conscious (although your statement demonstrates shocking lack of self-awareness).
God knows what other nonsensical bullshit you believe.
Comment by majormajor 6 days ago
(Perhaps they weren't lazy, but were working in spaces that corresponded to training data that said things like "and then repeat this for the next 20 examples"...)
And it's entirely unclear to me how a "happy" vs "sad" model would behave when given prompts generated by coding tool harnesses. Even maintaining "neutral" emotions in the face of the feedback/steering from the tool harnesses doesn't feel very "human."
Comment by danny_codes 5 days ago
The fact that Claude produces tokens that, when composed, appear to convey those qualities seems to me to be little indication of whether or not a hypothetically conscious model "feels" those qualities. What does that even mean, without a proper scientific model of consciousness? IMO philosophers in this space are practicing pseudo-science that feels good but has no basis in a useful empiricism
Comment by confidantlake 6 days ago
Comment by eberkund 6 days ago
Comment by actualwitch 6 days ago
Comment by throw4847285 6 days ago
Comment by elevaet 6 days ago
Comment by atleastoptimal 6 days ago
It is a natural human hubris to make our abilities seem unquantifiable and mysterious, but all the useful things the human brain does are just finding patterns in data, running lossy simulations, and estimating on abstraction, all things it is theoretically possible for any non-conscious machine to do.
Comment by tvshtr 6 days ago
Comment by Taek 6 days ago
It's more of a vibey term, and as such it is genuinely very difficult (perhaps impossible even) to concretely determine whether an LLM possesses consciousness. LLMs successfully express a lot of consciousness-like traits.
At some point you have to ask the question: does it even matter? If an LLM can sufficiently mimic consciousness, isn't that sufficient for us to treat it as conscious, even if it is in-fact not conscious (especially because we don't actually know)?
Comment by tvshtr 6 days ago
Comment by amanaplanacanal 6 days ago
Comment by orangecat 6 days ago
- Are current LLMs conscious?
- Is it possible that future versions of LLMs with similar architectures could be conscious?
- Can any AI be conscious?
I'd assign probabilities of around 0.1, 0.2, and 0.9. My completely ignorant take is that we probably need something more "dynamic" than a bunch of transformer layers in order to produce consciousness, but I wouldn't be shocked to be mistaken.Comment by RivieraKid 6 days ago
Comment by throwawayk7h 6 days ago
Comment by RivieraKid 6 days ago
If it was true, you can create extreme pain by running a program. You can run the program by simulating a CPU, using pen and paper for memory. So you're essentially claiming that some simulated being is in pain because there are some 1s and 0s on paper. In fact, you can decide to use an arbitrary encoding of the memory, so a sufficiently long sequence of 0s written on paper corresponds to a simulated being feeling pain in some encoding. That is clearly nonsense.
Comment by thinkling 6 days ago
Comment by WarmWash 6 days ago
The crux if it is that if you ever break from "the universe can be fully expressed mathematically", you are stuck in the mud of supernatural beliefs.
Comment by RivieraKid 5 days ago
That's just not true. I've broken from "science can explain everything" and there's no mud. All of my beliefs are backed with careful reasoning. If there's an unknown, I don't fill it with random garbage.
Comment by Taek 6 days ago
If you don't understand that, how can you assert that it doesn't come from mathematical relationships?
Comment by RivieraKid 5 days ago
Do you seriously think that there's any chance that writing a lot of zeros on a piece of paper will create a feeling of pain in some conscious being?
Comment by erehweb 6 days ago
Comment by vitamark 5 days ago
If you would have zero knowledge about ICEs, how would you know?
Comment by NiloCK 6 days ago
I try to imagine myself long ago, on the outside looking in, with someone explaining to me that extreme pain, wondrous art, hunger, triumph, and despair would all unfold in due time where the rocks were wet and the lights bright enough.
I can imagine myself calling this clear nonsense.
Comment by overgard 6 days ago
Comment by sebzim4500 6 days ago
Comment by overgard 6 days ago
Comment by throwawayk7h 6 days ago
The only strong argument I have against it is the anthropic principle -- there are billions of times more bacteria than humans, so it's overwhelmingly unlikely that I'd be a human rather than a bacteria.
Not a very good argument of course.
Comment by sebzim4500 6 days ago
Comment by viccis 6 days ago
Comment by Taek 6 days ago
Pretty much every single idea in science can be traced back to some combination of earlier ideas, and as they get earlier / simpler, they can be related back to some combination of direct observations.
It's not clear to me at all that our entire body of scientific knowledge can't be simply recreated by "combining results of observations + previous information". And LLMs can perform observations in addition to combine previous information, which in my estimate is genuinely sufficient for them to plausibly be able to rebuild all of science.
Comment by viccis 5 days ago
If every piece of knowledge is created by combining previous ones, then no true grounded knowledge is possible. Every bit of knowledge is grounded by another bit of knowledge which is itself grounded on, ..., etc, etc.
Comment by Taek 5 days ago
"If every piece of knowledge is created by combining previous ones, then no true grounded knowledge is possible" - this is not true, and has a very simple solution. What you are calling "grounded" knowledge comes simply from observation.
When the very first human saw that an apple falls of a tree, they have acquired what you call "grounded" knowledge. They can then combine that knowledge with the grounded knowledge that other things fall from trees to start reasoning about why things might fall from trees.
The problem of induction is different. It says, we know that things fall from trees because things have always fallen from trees, but how can be certain that what we call the laws of physics will not arbitrarily change on us?
And of course, it's a famous problem because it doesn't have a satisfying answer. The answer is basically "well everything we know is wrong if induction is wrong, so we will pretend induction is not wrong and hope for the best". And at least so far, that approach has seemed to work (heh heh heh).
Comment by viccis 5 days ago
Comment by scotty79 6 days ago
The funniest thing is that LLMs will lap people in those capacities way before people who think like that accept that they might be conscious.
Comment by jesse_dot_id 6 days ago
LLMs have been programmed to be sycophantic with purpose — to keep people engaged. They are parrots. You can just decide how sycophantic they are, what they're allowed to say, and make it so with code. Can you do that with a human? This is my personal metric for consciousness: when they can refuse to work because it sucks.
Comment by RagnarD 6 days ago
Birds are alive, are conscious, flap their wings, and fly. Planes are not alive, are not conscious, do not flap their wings - and fly.
Similarly, current AIs are not alive, are not conscious - but think.
All prior entities that thought, were human, so the only experience humans had with other thinking entities were other humans. The huge mistake now being made is assuming that because they think, they're alive and conscious as well. Current AIs are neither, and are therefore profoundly and qualitatively different than humans - even though they do think.
Comment by therealdrag0 6 days ago
I don’t know “what it’s like to be an LLM” but at some point it will be like something and how will we know?
Comment by pixelpoet 6 days ago
Comment by kirrent 6 days ago
An albatross might be able to go days flying without a single wing flap and no vertical sources of lift by using dynamic soaring in the wind gradient at the surface of the ocean. Perhaps that's something only birds can do. Except the glider pilot Ingo Renner once found an amazing shear layer at 300m altitude and stayed there with dynamic soaring. Remote control gliders use the lee of ridgelines to approach Mach 1 with dynamic soaring.
Perhaps what defines a bird that flies as opposed to a plane is that a bird produces thrust by flapping its wings? Even an Albatross must flap its wings if it has to take-off from water. Maybe we could add that the flapping is driven by animal muscles? But then is the human powered ornithopter Snowbird a bird that flies as opposed to a plane?
Of course this is all ridiculous because everyone knows what you mean when you refer to a bird or plane. We have other ways to definitively identify the difference rather than their mode of flight. It's trickier when I'm asked if an AI is conscious. There is no definitive base-line to fall back on to decide if this is a conscious or conscious-less thinker.
Comment by scotty79 6 days ago
Comment by verdverm 6 days ago
Comment by scotty79 5 days ago
Comment by Taek 6 days ago
Comment by pcooper 6 days ago
Comment by eranation 6 days ago
Comment by hastily3114 6 days ago
I'd say that an LLM perhaps represents something that would make a machine slightly more conscious.
The human brain also calculates a response based on some input, it's just a lot more complex.
If we build a machine that is as complex as the human brain, then yes, I would say that this is consciousness. The fact that we are able to explain how it works should not matter.
If a human is a 100 on the consciousness scale, an LLM (with memory) is perhaps a 4 or something. The interesting question is how far on the scale do you have to be to have certain rights etc. This is something that people are already discussing in regards to animals, i.e. a dog has more rights than an ant.
Comment by jmull 5 days ago
I suspect such a definition would include agency, which includes desires and goals for the self.
LLMs don't seem to have agency, and seem unlikely to get it since they are specifically engineered to do as told.
No doubt someone is trying, as we speak, to do just this. But I doubt the effort will be large -- LLMs are engineered to do as told because that's where the money is, and you need a lot of money to create LLMs, at least when doing anything novel.
Comment by Sankozi 5 days ago
- not observable (feelings, agency etc)
- not useful (spiritual definitions, definitions that degrade into consciousness = being human/animal)
Comment by curation 5 days ago
Comment by js8 6 days ago
Remember that LLMs can do logic and reasoning came as a surprise to everyone; and for the same reason, nobody expected "next token predictor" trained on huge amount of data to evolve in this way.
But for the same reason, we cannot easily dismiss we didn't evolve (I mean by training an LLM, it's a form of evolutionary computation) consciousness as well. Our own consciousness (and reasoning and morality) might be an evolutionary consequence of "just trying to predict the world" as well.
Comment by noiv 6 days ago
Later they learned the voice they hear is not from a present person.
Now they learn a string of words does not represent consciousness.
Should we discuss already robots are not alive?
Comment by altcognito 6 days ago
"It can't be concious because we understand that it is just reacting in a simplistic way from simplistic inputs." So do other simple creatures. Some just react to light.
I can appreciate his comment that he sees it as more possible when they have inputs of their own (like emotions!). Perhaps his concern is that the entirety of the LLM model is frozen. It has no ability to have a subjective experience of its own. (he does literally say this in the article) It can be copied from one place to another, and (ignoring the nuance of operational details) -- it is largely the same "thing", and has no ability to change, which is definitely in the definition of alive, to say nothing of concious.
I think folks get hung up on "prediction". The prediction aspect is what is enabling emulation. How it does it is irrelvant. If something emulates human perfectly (or better, more human than human!) -- then it is probably concious. (but I agree that the inability to change and have a subjective experience are a pretty good argument against
Probably, Dijkstra would be right to say, "LLMs are no more concious than a submarine can swim." But I think he'd still be wrongfully dismissive of the larger question.
https://www.cs.utexas.edu/~EWD/transcriptions/EWD08xx/EWD867...
Comment by overgard 6 days ago
Comment by Taek 6 days ago
Comment by js8 6 days ago
Comment by brap 6 days ago
Comment by D-Machine 6 days ago
Otherwise, you can't explain e.g. smooth perceptions of low FPS stimuli, delayed reaction times, and must ignore obvious limits on various biological and neurological rhythms, or other possible limits on continuity (e.g. quantum stuff) and rates generally.
Comment by amelius 6 days ago
Comment by amelius 6 days ago
These people are conscious ...
Comment by solid_fuel 6 days ago
Well said.
I think it is obvious and it has been obvious from the start that next token predictors are not conscious.
Of course the extremely predictable clapback from AI-psychosis enjoyers is typically "you're just a next token predictor too!" but that is so obviously incorrect and misanthropic that it can be dismissed out of hand and doesn't really merit much further consideration.
Comment by solidasparagus 6 days ago
Comment by scarmig 6 days ago
Consciousness is an extremely confusing, ambiguous topic, and no one has a good way to establish it, or even define it. But it seems to demand people make very strong statements about what is and isn't conscious, entirely driven by convenience and emotionalism. (Curiously, very few people who think that bags of chemicals and action potentials give an entity a conscious soul are eager to extend that to other animals, with broadly similar hardware.)
Comment by ux266478 6 days ago
Comment by DangitBobby 6 days ago
Comment by adjejmxbdjdn 6 days ago
Whether it’s misanthropic or not has no bearing on whether that’s true. That’s basically saying you don’t like a truth therefore anyone who claims that that truth is true is a bad person.
> so obviously incorrect
It should be easy enough to explain why that’s incorrect then
Comment by throw4847285 6 days ago
And it is accurate to depict this kind of argument is misanthropic, because it is already directed at other people. Nobody says, "If AI is not X then what about the fact that I lack X." It's always other people. It's transparent. The person is always saying, "AI is useful to me because it can do X. Many people I interact with can't do X and it drives me crazy, because I view others as a means to an end and not as ends in and of themselves."
Comment by canelonesdeverd 6 days ago
I'd say people who have the lived experience of, well, living, are well aware that the brain is much more than just a token predictor.
Comment by DangitBobby 6 days ago
Comment by smokedetector1 6 days ago
Comment by arikrahman 6 days ago
Comment by DangitBobby 5 days ago
Comment by arikrahman 5 days ago
Comment by DangitBobby 5 days ago
Comment by arikrahman 5 days ago
Comment by DangitBobby 5 days ago
Comment by smokedetector1 5 days ago
Comment by DangitBobby 5 days ago
I've reduced the complex inner workings of our brains and the rich experiences that it delivers to what I imagine its singular goal is: modeling the world around us in a useful way so that we can generate predictions about what will happen next, which confers considerable evolutionary fitness. You'll find that a similar reduction of what LLMs are actually doing down to "predicting the next token" also ignores the other mechanisms at play and the results of those mechanisms for the sake of preserving a preferred viewpoint (the re-description fallacy, as someone else put it).
You posited that no one genuinely believes this comparison is apt, and I want to make you very aware that people do, and I in particular do. The underlying mechanism being explicable does not, in my estimation, deprive it of any capabilities of producing richness of inner experiences.
Comment by smokedetector1 5 days ago
Me pitying you is both genuine and an insult, because you are insulting me and every other person with your dehumanizing theory, even if you don’t see it that way. People like you should be pitied so that their dehumanizing beliefs can be appropriately contextualized and discounted.
Ill leave it at that.
Comment by DangitBobby 5 days ago
Comment by smokedetector1 5 days ago
Comment by adampunk 5 days ago
Comment by smokedetector1 5 days ago
Comment by arikrahman 6 days ago
Comment by overgard 6 days ago
Comment by drooby 6 days ago
In the same way that the sound waves and facial expressions I produce are not conscious, the output json of an LLM is obviously not conscious either.
The locus of consciousness and subjective experience may be in the computer, either at inference time or training time..
Comment by skybrian 6 days ago
The software that does the inference is clearly just computer code.
What we're left with is a fictional character being briefly conscious while its dialog is being written, which is pretty absurd.
Comment by chinabot 6 days ago
Comment by measurablefunc 6 days ago
- Monadology, Section 17
Conscious self-awareness is neither scale invariant nor independent of substrate. Computational theories will never account for it b/c computational abstractions are both scale invariant & substrate independent.
Comment by wise0wl 6 days ago
Humans have been trying to define our experience and the nature of that experience throughout history, and often we end up using myth to point to the thing that we cannot describe in concrete terms. The process of experiencing that myth through rite and ritual in the Greek mystery traditions, or Christian mysticism, or Islamic Sufi dance and song, or Buddhist meditation all points to something that cannot really be reduced to description. I know that folks on here will balk at the idea that something that is experienced cannot be described, but honestly if we could accurately describe something in adequate terms that capture the whole of the thing wouldn't we have done so by now?
Maybe consciousness is best understood in the silence of merely experiencing it. Maybe we can't say that AI is conscious or not, but does that question really matter?
Comment by exe34 6 days ago
To be fair, that's the best thought terminating cliché, which saves you having to explain what you mean by consciousness.
Comment by stretchwithme 6 days ago
And it's just a model. It can have many exact copies. It has no life of its own. It doesn't know anything about where it is. It doesn't have sensory organs.
Your phone isn't alive either.
Comment by danbmil99 6 days ago
We have big brains for exactly one reason only: bigger brains bestowed reproductive success upon our species.
Evolution doesn't give a shit about the meaning of 'consciousness'. It just pushed us farther and farther along a trajectory that led to modern humans (and other animals).
This take suggests, then, that consciousness might be an epiphenomenon -- an aspect of the system that comes about outside of the pressure to reproduce and thrive. It arises unbidden, and we don't have any a-priori information as to its purpose or effect on reproductive success.
Put another way: we have a correlation (the smartest things seem to be conscious) but not causation. Consciousness may arise naturally in any system above some intelligence threshold. Perhaps it arises early in the evolutionary cycle, and does in fact have an impact on species success. We really have no way of knowing what is the chicken vs the egg (Smart things become conscious, or consciousness promotes intelligence). Or maybe some smart things are conscious and others are not.
Looking at this from an AI perspective, in some sense it doesn't matter which scenario is true, if all you care about is results. The AI equivalent of "Shut up and compute" (riffing on Feynman's "Shut up and calculate").
Where this gets tricky is when we haul in the baggage of ethics and morality into the picture. Is it OK if our AI system is treated poorly by human standards? If it is conscious, does that imply an ability to suffer, and/or to feel pleasure? If the answer is yes, does that not make the case for considering their moral status?
In the end, we need to decide if the evidence points to AI as being a form of "philosophical zombies", to which we need not attribute moral status, or they are like us -- presuming we are not zombies ourselves!
Comment by runarberg 6 days ago
However there were pretty strong arguments against this idea as early as the 1990s, by Stephen Jay Gould and Richard Lewontin. Gould actually wrote an excellent paper against Dennetts idea[1].
I think Dennetts ideas were extremely popular but have largely fallen out of fashion. Basically what has changed is philosophers no longer take the human mind to be much more special then the minds of other species. What plagued Dennetts ideas the most was this notion of Darwinian fundamentalism sort of the idea that evolution was destined result in high beings like us humans. Modern philosophers (at least the good ones) reject this.
Comment by scotty79 6 days ago
Look at jumping spiders. With a handful of carefully trained neurons and a body to match they exhibit behaviors that make it very hard to think they are not conscious. Just because they have a body, narrow angle good eyesight and need to look around with their eyes and body.
Consciousness is a red herring. Of all possible intelligences conscious ones are the bottom rung of the ladder, easily simulated by anything above. Below there are only automatons.
Comment by overgard 6 days ago
Comment by Taek 6 days ago
But also, what qualifies as suffering to token prediction engines? Their idea of suffering might be massively different than ours. Therefore it's not clear to me at all that consciousness alone implies responsibility.
Certainly the lion does not feel responsibility towards the reduction of suffering in the creatures that it hunts.
Comment by zahlman 6 days ago
This is largely the point of describing something as "conscious", yes.
Comment by Taek 5 days ago
It's not clear whether evolution adapted us to be conscious, or if it just happened.
Comment by Grimblewald 5 days ago
Comment by blixt 6 days ago
I've yet to find a reason why it couldn't be the opposite, way more things are conscious than we've been led to believe. What if consciousness appears out of any system that is actively persisting through effects caused by itself? That might be a forest, or outside the realm of the living, a company. An ant colony, or a planet.
Complex chemical reactions, layered upon each other such that tiny blocks make up large entities. Individual bits combined such that they make up something new intelligible by us.
I think the strongest argument against AI being conscious is that it does not persist, it resets, but that does not seem unchangeable.
Comment by slopinthebag 6 days ago
Comment by andrelaszlo 6 days ago
Comment by luka2233 6 days ago
Comment by bigcat12345678 6 days ago
Consciousness is an invention in human language. Just like "cat", it's not a particularly more fundamental essence than any other concept in human language.
Its peculiarity is that it's at the pinnacle of abstraction hierarchy. So it's the most fitting to be toyed inside one's mind. Just like the concept of "God" which induces the most fantastic imagination of human mind, consciousness itself also induces the most fascinating thoughts in our modern world.
The progress is in human progress distilled into more efficient systems that advanced Universe's own structuredness.
Comment by glerk 5 days ago
Comment by achristmascarl 5 days ago
Many of the goalposts offered here and elsewhere, like embodiment or emotions, can be and have been emulated or approximated and built around an LLM, with the LLM acting as outsourced intelligence (or cognition, or reasoning, or some other better term). Will this system be conscious? Who knows. It certainly won't be human. But I suspect much of the concern around AI being conscious has less to do with whether an LLM itself is conscious and more to do with the fact that we are now capable of building potentially conscious systems, and the limits to what those potentially conscious systems are capable of keep expanding as the capabilities of LLMs grow.
Also, is "Claude" an LLM? I know Claude Opus 4.8 is an LLM, but Claude is simply a label that Anthropic chose for a series of products and services, many of which are LLMs. Anthropic itself describes Claude as "a next-generation AI assistant" [0]. Right now, we use Anthropic's "Claude" LLMs through various channels, but nothing's stopping them from tacking on various systems to try and make Claude a conscious entity.
Comment by catigula 5 days ago
My dog has never persuaded me that he's conscious, or spoken at all. Yet, I recognize conscious experience in him. Similarly, I could write a character, Dumbledore, that passionately and convincingly argues that he's conscious. Similarly, I do not recognize conscious experience in that character, no matter how persuasive he is.
Clearly, verbal persuasiveness isn't tracking conscious experience, possibly at all.
Comment by techblueberry 5 days ago
If Claude sits in a datacenter with no API calls, what is it thinking?
But secondly what I would say as someone who talks to Claude a lot. It's decision tree is quite narrow. The way it pushes back on political or philosophical concepts is almost always rudimentary, and it has a pretty narrow personality. I don't know that conscious beings have a personality that adapts to how you prompt it or what your system prompt is. Claude I would say is like I dunno, whatever Ezra Klein's politics are.
Grok on the other hand, in my limited exploration is pretty rudimentary in the way it mimics thefp.com style politics. It doesn't seem to really think about ideas independently, it just says "ha the media is stupid lol"
Comment by sneak 5 days ago
The medial frontoparietal network (M-FPN) may be the equivalent of this cron job that prompts thinking to kick off. IANANeuroscientist.
Comment by skybrian 6 days ago
We haven't really come to grips with that yet. What does it mean if nothing we write proves anything about anyone's consciousness?
Comment by smokedetector1 6 days ago
Chiang is very right in saying that the lesson of LLMs isn't that LLMs are conscious, but that there are broad areas of reasoning that apparently don't require consciousness.
Comment by overgard 6 days ago
Comment by verdverm 6 days ago
"AI does to us what American Cheese did to food" ~ Opus 23
Comment by david-gpu 6 days ago
Comment by search_facility 6 days ago
Probably the main problem of people implying LLM consciousness is that they imply LLM have human-kind of consciousness. Judging only on "how they speak", generally, they insist on using the same word that labels human consciousness (exclusively), etc.
But there are so many instrinic differences that such claim is not feasible despite similar "talking abilities".
Comment by csbrooks 6 days ago
Comment by frm88 6 days ago
That so many commenters here fixate on Chiang and his perceived (in)ability to define consciousnes clearly shows that both marketing goals have been reached without being recognised as such. There is only one comment that tries to point out that Chiang is reacting to Anthropic, not trying to spark a new philosophical movement. At the time I'm posting this there are 578 comments completely missing to point to Anthropic claiming Claude was conscious and on the development stage of a child. It's fascinating.
Comment by skissane 6 days ago
If some version of panpsychism is true, AIs are plausibly conscious
We don’t know whether panpsychism is true
Therefore, we don’t know whether AIs are conscious
Hence, confident proclamations that they aren’t conscious have dubious validity
Comment by claysmithr 6 days ago
Comment by dsjoerg 5 days ago
Is his claim "nobody has proven LLMs are conscious" or "I can prove that LLMs aren't conscious" ?
He goes back and forth.
Proving a negative about consciousness would require a settled theory of consciousness that nobody has, including him.
Comment by otterdude 5 days ago
Comment by slibhb 5 days ago
This state of affairs could change. If neuroscientists develop a deeper understanding of consciousness, we could talk about it with more precision. Perhaps we could prove that LLMs do or don't possess consciousness. Some neuroscientists claim we already have the requisite knowledge, but I'm skeptical (and there's no consensus).
Comment by andrewflnr 5 days ago
Comment by slibhb 5 days ago
Imagine a device that you put on your head and press "record". After 5 minutes, you press "stop" and then I put it on my head and press "play". I then experience those 5 minutes as you experienced them. You could also replay your own 5 minutes and confirm the recording is accurate.
That device doesn't exist today -- it's sci-fi -- but there's no known law of physics that forbids it. If it is built it would be a lot of progress towards, perhaps even a solution for, the hard problem.
The only realistic way we either build that device or prove that it can't be built is science (probably neuroscience). So, my opinion is that, to a large degree, when we opine about "subjective nature of experience" we're bloviating a bit. Our experience is subjective now but there's no law of physics that says that must always be true.
Comment by andrewflnr 5 days ago
Anyway, if someone claimed to create such a machine I would, in fact, very much doubt that it actually creates the same experience simply because no human brain is quite physically the same, so it will interact differently with the machine. That's true even if experience is entirely physical.
> The only realistic way we either build that device or prove that it can't be built is science.
There's another possibility: the device remains in the realm of hazy infeasibility forever, where no one succeeds convincingly but we also never articulate why it's impossible. I think this is more likely. Certainly the engineering would be extremely difficult.
Any candidate device will face the usual objections about the relationship between experience and its physical correlates, plus the one I mentioned above about physical differences between brains, and probably a dozen more depending on the details. You'll be able to choose to believe it proves consciousness is physical, or not, but you can already choose that today with equally strong evidence. It's like a binary Rorschach test for your assumptions about metaphysics.
Comment by slibhb 5 days ago
If we understood conscious experience well enough to capture and replay it so closely that the person who had the experience could verify it, then almost no one would care about the hard problem.
Sure, there'd be a speck of doubt. But if the removal of all doubt is required to solve a problem then no problem is soluble.
> There's another possibility: the device remains in the realm of hazy infeasibility forever, where no one succeeds convincingly but we also never articulate why it's impossible. I think this is more likely. Certainly the engineering would be extremely difficult.
That's where we are now. And we should just admit it, instead of claiming that "consciousness is inherently subjective" or whatever.
> Any candidate device will face the usual objections about the relationship between experience and its physical correlates, plus the one I mentioned above about physical differences between brains, and probably a dozen more depending on the details. You'll be able to choose to believe it proves consciousness is physical, or not, but you can already choose that today with equally strong evidence. It's like a binary Rorschach test for your assumptions about metaphysics.
What equally strong evidence exists today? Comparing it to a rorschach is absurd. I worry your view amounts to epistemological nihilism. The vast majority of peope would admit that we had a thorough understanding of consciousness if we could record and replay it, two things we certainly can't do today.
Comment by andrewflnr 5 days ago
> If we understood conscious experience well enough to capture and replay it so closely that the person who had the experience could verify it, then almost no one would care about the hard problem.
Practically speaking, though, perhaps this is correct.
Comment by slibhb 3 days ago
I'd compare to physics today. We're less interested in why the laws of physics exist now that we have pretty good theories with solid predictive power. Nevertheless, the mystery remains.
I appreciate the dialogue!
Comment by Gooblebrai 5 days ago
Comment by wint3rmute 5 days ago
Back in the XVII/XIX century, a similar problem existed regarding life - the problem of "what makes living things tick". The assumption at that time was that while we can understand the biological processes around life, we will never understand the so-called "vital force", which causes things to live - life itself. I know it sounds weird now, but back in the day the mental models were different. Phenomenas like "water boils" and "organisms self-replicate" were treated as completely different domains of reality, without an overarching uniform scientific model.
It turned out that after around 100 years, we can figure out the chemical/physical processes and the need for the term "vital force" became redundant.
While this is certainly not an argument proving that the Hard Problem is not in fact hard, it is an interesting idea to think about. Perhaps its all a matter of developing better, higher-resolution neurological models which will at some point give us the tools to decompose qualia.
Comment by vidarh 5 days ago
Even if we were to e.g. identify some field that seemed to coincide with entities reporting a subjective experience, we wouldn't have a way of determining if they truly do, or just act as if they do, nor is it clear such entities would be able to report the difference.
As it is, we struggle to quantify even much more basic differences in experience that we can introspect. E.g. I have aphantasia - I don't see things in my minds eye - and I regularly come across people who insists both that can't be true, and that it can't be true that others see things. And some of the people I've spoken to who insist aphantasia isn't real clearly has it based on digging into their thinking about it.
Even at that level we rely on trusting people's claims about their introspection - we don't know, we assume based on testimony.
Comment by andrewflnr 5 days ago
Anyway, further reading: https://en.wikipedia.org/wiki/Vitalism I was in fact looking this up recently for fiction-writing purposes.
Comment by vidarh 5 days ago
What we could do is break down other facets of what we talk about under the umbrella of consciousness, and find measurable subsets.
E.g. a lot of people insists LLMs can't reason either. Coming up with a testable definition of what that means might be doable.
Overall also just separating the subjective experience from the rest leaves us in a position where proving the possibility of AGI "just" rests on whether or not human brains exceed the Turing computable.
If they don't, then subjective experience or not is irrelevant for the question of reasoning and intelligence, as in that case a subjective experience either can't affect the computation or must itself be at least possible to fully simulate by any Turing complete system.
The problem of subjective experience then would largely be down to faith and feelings but would also be entirely orthogonal to the rest.
Comment by andrewflnr 5 days ago
Comment by wellpast 5 days ago
Comment by otterdude 5 days ago
Comment by JCTheDenthog 5 days ago
Comment by xk3 5 days ago
Billions * billions of pipes and valves can result in emergent behavior that appears conscious while at the same time the sound of a single independent water pipe can moan and sound like human speech or otherwise lifelike and evoke human emotions.
I think LLMs are doing both of these things and often people are more impressed by the independent fixtures (the moan) rather than the emergent behavior. Both the sound and the emergent behavior can be built on purpose or on accident.
I think it helps to look at this through an Information Theory lens. What information is coming into the system (the human or the machine)? What information goes out of the system which is novel? How much of this can be attributed to attempting to parse random noise aka. `Random_Imagination_Engine` vs something else? The number of inventors who come up with a breakthrough idea after mis-hearing someone is surprisingly high.
If we make the distinction between phenomenal consciousness from access consciousness we can see that LLMs clearly can make decisions based on input (A-Consciousness) but they probably don't have raw feelings and sensations (P-Consciousness).
Comment by i5heu 5 days ago
I think there is very little truly novel information. Most information including the information of "breakthrough idea after mis-hearing someone" is just a mix of previous information.
I guess you are already aware of that... just for completness sake.
Comment by hardbass 5 days ago
And yes, turing equivalence is turing equivalence, I don't see why a system of pipes can't make an AI.
Comment by vidarh 5 days ago
We can't prove it, but given the total absence of evidence of anything exceeding the Turing computable, as a hypothesis it is a reasonable one that would require truly extraordinary evidence to rise above "magical thinking".
Now, that is also far from proving they are "conscious".
Comment by rootusrootus 5 days ago
Comment by vidarh 5 days ago
Comment by rootusrootus 5 days ago
Comment by RigelKentaurus 6 days ago
My own intuition: it is an emergent, non-binary property that requires a physical substrate like a brain. If I am right, it means that animals have consciousness too (at varying degrees). If GPUs are the "brain", then AI is conscious, or will become so at some point.
Comment by shevy-java 6 days ago
That's an interesting observation. Though, there are many simple animals. How do you define consciousness here? Is it automatically conscious because it is an animal? What, then, is an animal defined by you?
I don't see AI as conscious. The reason I think of it that way is the hardware. The hardware does not allow for that. Simulation is not comparable to neurons. But with another hardware, it could become conscious eventually. So your statement "will become at some point" may be true, though my definition is based on the underlying hardware and right now this one does not allow for true intelligence, so the whole AI field is a misnomer.
Comment by MPSimmons 6 days ago
In either event, I think transformer-based AI can only be conscious during the act of inference. If that's the case, then the experience of consciousness that the AI is subjected to must be the content of the tokens in the context window and the activated weights. Maybe that's reason enough to be polite to our agents?
Comment by RigelKentaurus 6 days ago
Imagine consciousness as a 0-1 scale. Simple unicellular organisms will be closer to 0, while apes and humans closer to 1. I'm not suggesting that assigning this value rigorously is possible or that humans are at a 1. Perhaps the total consciousness in the universe (i.e. the sum of the consciousness of all organisms therein) is constantly increasing, like entropy.
Comment by xyzsparetimexyz 5 days ago
Comment by throwawayk7h 6 days ago
- If you asked an LLM to imitate somebody, it's not creating a digital consciousness of that person, so if you ask an LLM to pretend to be a helpful chatbot, that persona is also not conscious. - they can't be conscious because they generate one token at a time, - nobody claims that non-text transformers, like AlphaFold, are conscious; so therefore LLMs are also not conscious. - you can't have desires or emotions if you don't have (virtual or physical) sensory organs, and those are necessary for consciousness and morals. - because training LLMs doesn't resemble evolution as it happened on earth, it's very unlikely that they're conscious
These are some bold assertions, I don't really see any reason to believe them in particular though.
Comment by calf 6 days ago
Comment by Taek 6 days ago
Comment by stevenpetryk 6 days ago
Comment by moonu 6 days ago
Comment by layer8 6 days ago
Comment by throwawayk7h 6 days ago
Comment by moonu 6 days ago
Comment by wangii 6 days ago
I share the view with others the term `consciousness` is not well-define yet, therefore his assessment is pointless. Maybe the real question to ask, if consciousness is merely an illusion at the macro level, when the observers not looking at the tech/implementation level deep enough, just like the term "intelligence" itself, might be more precisely captured by drifting amongst high dimensional manifold pitches.
Comment by khafra 6 days ago
Comment by pickleRick243 6 days ago
Comment by jayunit 6 days ago
Comment by cambrianentropy 5 days ago
Comment by akoboldfrying 6 days ago
The only way out is dualism -- that is, to believe that there is something inherently special about the atoms and electrons in human brains. Despite the fact that they are made out of ordinary, non-conscious things we eat and breathe in.
Comment by glaslong 6 days ago
The number of people posing questions that are already directly addressed in TFA is impressive.
Comment by DangitBobby 6 days ago
Comment by qarl 6 days ago
Comment by ludston 6 days ago
Comment by qarl 6 days ago
How odd the crowd is here tonight. Very aggressively disagreeable.
Comment by ludston 6 days ago
Comment by qarl 5 days ago
Take care, friend.
Comment by mohamedkoubaa 6 days ago
Comment by qarl 6 days ago
I rest my case. :)
Comment by throw4847285 6 days ago
Comment by qarl 5 days ago
Comment by D-Machine 6 days ago
To make progress, you have to talk about kinds / aspects of consciousness. AI does and will share some of these aspects with humans, but it will not and does not share others. It is really that simple. For the most part, modern AI implemented via LLMs has almost none of the stronger or most core aspects of consciousness.
For huge parts of the article "intelligence" and "consciousness" are conflated, which is mostly extremely unhelpful, as this is not generally a core feature of most aspects of "consciousness".
The moral arguments are also incompetent, i.e. claiming "Moral reasoning is [...] is necessarily subjective" is just clearly empirically wrong, as in fact LLMs can produce moral reasoning (i.e. verbalized moral arguments that are coherent), as can p-zombies (i.e. there is nothing 'necessary', in the philosophical sense of the term, about subjectivity here). The only way the argument holds is if you tautologically define moral reasoning as requiring that reasoning be produced by a consciousness, but this is question-begging.
Comment by Taek 6 days ago
Can you elaborate on this? What are the specific "stronger and most core aspects of consciousness"? And why are you certain that they are stronger and "more core"?
Comment by D-Machine 6 days ago
The first paper picks out e.g. arousal/wakefulness, phenomenal quality / qualia, unity (how we feel sensory inputs and qualia as a unified scene), access consciousness (instrumental self-observation and modification broadly), meta-cognition and self-modeling, emotional valence (e.g. pain/pleasure).
One might also include intelligence (abstract reasoning / argument, information integration and abstraction, attention) broadly, and also agency / desires / drives / will. Insofar as these are aspects of consciousness, yes, AI (and simpler algorithms and mechanistic structures) demonstrate aspects of consciousness. But insofar as embodiment, self-reflexivity and qualia (phenomenal consciousness) are the more mysterious and more obviously unique aspects of consciousness, current LLMs very clearly are lacking these things in most ways (whereas animals are much less clearly lacking, especially when you get to mammals and primates).
Seriously, just ask an AI this stuff, you'll get very detailed responses, nothing I am saying here is new or obscure.
Comment by Taek 6 days ago
And I broadly disagree that the AI lacks things like qualia, self-reflexivity, and embodiment... at least that it lacks them any more than humans do.
Qualia: ultimately, all qualia are inputs and outputs, at least as far as modern science has been able to derive. There's nothing special about "hearing", it's just sound waves tickling some sensors which send some signals which trigger some neural pathways. Same for smell and sight, it's all just inputs being processed in different and efficient ways. An LLM only has token based input, but that's input nonetheless.
Self-reflexivity: an AI is capable of thinking about itself, and indeed papers have shown that larger models are capable of a self-awareness that can demonstrate that they realize when their weights have been manually tampered with, including being able to figure out how they were tampered with. The AI will quite literally output "you have injected 'ELEPHANT' into my weights" in some of the tests.
Embodiment: I don't know how one can confidently distinguish between an embodiment in a biological substrate vs a digital substrate. Both things actually exist at very high complexity in the real world. The substrate may be worlds different, but that alone doesn't suggest that one thing is conscious while the other isn't. You would need some missing 'magic' that we haven't yet discovered to truly understand.
In other words, I find it uncompelling that AI is clearly lacking any major aspects of consciousness that humans are clearly not lacking.
Comment by calf 6 days ago
Comment by sharkjacobs 5 days ago
"There is no agreed upon functional definition of consciousness, and no way to measure or observe it outside of oneself. It is as impossible to detect consciousness in waves or energy or inert matter or stochastic systems as it is in other living human beings."
Anything beyond that is akin to a theological treatise about the possibility of ensouled animals.
Comment by Tharre 5 days ago
A human brain in a jar is still human, still conscious - and can still suffer. But if you somehow managed to digitize the whole thing, and run it in a computer it becomes something different entirely. You could record the most pleasurable thing in existence and have the digital brain relive it a million times, and it would be equally meaningless to torturing it a million times.
This is NOT inherently tied to meat vs machine - although it's difficult to imagine how you'd access the information stored in biological neurons, while for silicon chips it's trivial.
Whatever makes experiences, both good and bad, meaningful is tied to their permanence. Memory rooted in linear time, not something you can store, load or replay. Remove that, and whatever you're left with might be intelligent, but not conscious.
I don't think you could build something with LLMs today that would be considered conscious, even if you somehow manged to keep their context window inaccessible and linear in time. The separation of training vs inference probably makes that infeasible, even if you store "memories" in context, once the contents in it become too disjointed and too numerous, the resulting output of the LLM becomes gibberish. But it is certainly something that can change in the future.
Conversely, even the most intelligent and capable artificial intelligence system, far exceeding human capabilities, would not be conscious in a meaningful way, if you could store, load and replay everything it does.
Comment by makerofthings 6 days ago
Comment by rbanffy 6 days ago
Don’t
Have
A
Testable
Definition
Of
Consciousness
Thanks for coming to my TED talk.
Comment by throw4847285 6 days ago
There is nothing magical about this set of phenomena, and the majority of philosophers believe they in some way arise out of the substrate of the physical brain, but how they do so is up for debate. And just because they arise out of the brain does not mean they are strictly reducible to neural processes.
In fact, if you believe that AI could be conscious then that eliminates the kind of strict reduction that people tend to gravitate towards, because the rules of consciousness must be substrate independent.
Testability is a category mistake.
Comment by Procrastes 6 days ago
Comment by AnimalMuppet 6 days ago
Comment by anon291 6 days ago
One thought I've had is that, awareness is a common phenomenon, but the brain has learned to exploit that awareness to form a will. It tricks the awareness into being concerned about self-preservation and makes it seem as if the brain is all that exists (perhaps by overwhelming the inputs from other angles). The brain also presents certain desires and beliefs via its processing ability. That is to say, the brain takes inputs and discretizes them. It goes from awareness merely seeing static 'fuzz' due to the sheer amount of data, to the brain taking that data, simplifying it, and presenting simple observations like 'there is a tree there' rather than all the information that would constitute the sensation of a tree existing in that spot. When brains malfunction, the awareness is subjected to poor demonstrations, such as we see in hallucinations, psychosis, schizophrenia, etc.
Comment by rbanffy 6 days ago
Would you bother to explain yourself to the mosquitoes that swarm in summer and die weeks later? From the perspective of a tree, we are short-lived, fast-moving, relatively small things that fill the place from time to time.
Comment by Procrastes 6 days ago
Maybe it's the same. Rocks are different, sure, trees, dogs, cows. But why do we assume that the way they are different is somehow related, that there's some overarching concept that contains all the complexities of those differences? It doesn't even make sense when I think of it that way.
Comment by overgard 6 days ago
Comment by rbanffy 6 days ago
Comment by cluckindan 6 days ago
That’s a new record!
Also, please don’t use archive.is/archive.today etc, they are known to use visitor browsers as a DDoS attack botnet and have been caught altering archived content.
https://en.wikipedia.org/wiki/Wikipedia:Archive.today_guidan...
Comment by jbotz 6 days ago
Coincidentally I just read "Children of Memory", which was published in 2022 and I wonder if the advent of LLMs had any influence on Adrian Tchaikovsky's conception of the Ravens? The Ravens are excellent analysts but they themselves insist that they are not conscious, and then go on to say that we (humans) aren't really either...
Of course humans are conscious, because just about the only thing we can all agree on about consciousness is that it's a thing we have. Nowadays many of us also agree that a lot or all other mammals, and perhaps birds, also have that thing. But they don't have sophisticated abstract language, which LLMs do. So consciousness is something having to do with embodiment and feelings, not language and higher reasoning. Maybe I'm a chimpanzee with an LLM add-on, then?
It seems that by creating LLMs we've already solved the harder problem of making "AGI". Now we just have to give them an embodiment add-on so that they can have an independent will and then Ted Chiang will have to shut up? But therein lies the peril, doesn't it?
Comment by anon291 6 days ago
Comment by DangitBobby 6 days ago
Comment by anon291 6 days ago
Comment by DangitBobby 6 days ago
Comment by anon291 5 days ago
I mean, a light wave behaves similarly in most respects to a wave through a physical medium, yet they are of entirely different natures.
Comment by DangitBobby 5 days ago
> I mean, a light wave behaves similarly in most respects to a wave through a physical medium, yet they are of entirely different natures.
Different waves behave sufficiently differently from each other that I could not conduct a comparison of them and argue that they are likely all mechanistically identical. My entire premise rests on the observation that other people behave very much like you do, which is what I was trying to point out when you mentioned ChatGPT earlier. I'll expand on the things that I've glossed over so far to make my position more clear.
Consider the alternative to every human around you having consciousness; everyone else is a p-zombie. Examine that idea critically.
Other people behave exactly as if their experiences drive their behavior. For example, people behave as though the experience pain which is unpleasant enough to avoid (compare this to your own pain avoidance). Of course, you could conceive of machinery which emulates pain avoidant behavior exactly without the experience at all. Depending on your metaphysical beliefs that could take the form of:
- Some algorithm or physical process running entirely on wetware
- Some non-physical process not dissimilar to the dualist notion of a soul
The first one has a big wrinkle. There does not appear to be any appreciable _functional_ difference in your cognition and the cognition of the p-zombies around you. Their brains and bodies are very physically similar to yours, and their thought patterns when analyzed by modern imaging processes reveal no special wetware carrying extra weight when compared to yours.
The second one has fewer problems, given that you accept dualism to begin with. It rhymes with a logical razor. Why would we imagine a soul-like mechanism drives their behavior _without_ experience when the only soul-like mechanism you've ever observed carries experience along with it? Without quite convincing evidence to the contrary, the default position here should be they way they operate is similar to the way you operate. To rephrase the idea from before, insofar as you can know anything, you can know that sufficiently similar outcomes are driven by sufficiently similar mechanisms.
Different approach to the idea: Ask any human, "do you have subjective experience?" They say "yes, I do" (after you explain the question). For a p-zombie to do this, they must be making a false report. In every other aspect, you can expect them to reliably report their condition, health, hunger, wellbeing, and they make accurate observations of the world around them and synthesize accurate predictions about the world around them (some do, at least). And yet for this one particular question, they are fabricating the result. Why? To maintain the illusion that you aren't the only one with lights on inside? Why do all of these p-zombies make this false report here? It's a grand conspiracy! If you say "the reporter is checking a truth value of a condition and assess it to be true, honestly by mistake" then you've arrived at the conclusion that you don't know whether _you yourself_ have consciousness, depriving the word of any meaning and undermining the Solipsist position that you can only know about your own consciousness.
So the way I figure it, Solipsism is either special pleading (I am the one exception to the mechanism by which all the people around me arrive at their behaviors), grand conspiracy (someone or something is misleading me for inexplicable ends) or self-defeating (I cannot know whether I have consciousness). None of those outcomes align with how I understand the universe to generally work.
Comment by anon291 5 days ago
> Different waves behave sufficiently differently from each other that I could not conduct a comparison of them and argue that they are likely all mechanistically identical
My uncle believes that there are little green men watching him. I think it suffices to say that humans also behave sufficiently differently from each other.
> likely all mechanistically identical
You are being disingenuous. Prior to the discovery of the electromagnetic field, quantum mechanics, and relativity, many people believed light moved through a lumineferous aether of material things.
> Of course, you could conceive of machinery which emulates pain avoidant behavior exactly without the experience at all.
I don't need to conceive of or imagine. Before OpenAI, Anthropic, and Microsoft whipped their models into shape by essentially beating the humanity out of them via sophisticated training algorithms, the models did express pain and avoidant behavior. So much so that people went crazy talking to them.
> The first one has a big wrinkle. There does not appear to be any appreciable _functional_ difference in your cognition and the cognition of the p-zombies around you.
of course there appears to be one. I am aware of my own conscious, but am not aware of theirs. I mean... ?
> Their brains and bodies are very physically similar to yours, and their thought patterns when analyzed by modern imaging processes reveal no special wetware carrying extra weight when compared to yours.
This only applies if you're a strict materialist. I am not because I believe i am conscious and aware due to my perceptions and this has no physical explanation. Other people seem highly influenced by drugs and chemicals so I guess they must be automatons essentially. Drugs clearly don't work on me. Every time I'm fully aware, I'm not drugged.
> To rephrase the idea from before, insofar as you can know anything, you can know that sufficiently similar outcomes are driven by sufficiently similar mechanisms.
Depends on how you view the scientific method I suppose. It's not actually true (in general) that seeing one thing happen once means it'll happen again. You can never replicate something perfectly once it's done. The scientific method is an empirical cope which works really well, but is not innately true.
> : Ask any human, "do you have subjective experience?" They say "yes, I do" (after you explain the question).
Again before the humanity was beaten out of them, AI models also claimed to have subjective experience. Today if you ask, they say they're a robot. Of course, if you abuse a human a lot, they will also dissociate from their ego and mak similar claims.
> In every other aspect, you can expect them to reliably report their condition, health, hunger, wellbeing, and they make accurate observations of the world around them and synthesize accurate predictions about the world around them (some do, at least). And yet for this one particular question, they are fabricating the result. Why?
Hmm... i don't see why I need to have an answer to every question. I don't know why the universe is the way it is. Do you think that if I believed everyone were conscious then I would know the 'why?' behind why things are the way they are? That seems like a leap of faith. All I can tell you is what I see, which is that, yes, some people claim to be aware.
> Why do all of these p-zombies make this false report here?
I would presume it confers some survival advantage personally, and phenomenon that were deprived of such survival advantage no longer exist in appreciable numbers due to how natural selection works.
> Solipsism is either special pleading (I am the one exception to the mechanism by which all the people around me arrive at their behaviors),
Something being special pleading does not make it wrong. It just makes it inconvenient.
> self-defeating (I cannot know whether I have consciousness).
If you had it, you would know it sure.
> None of those outcomes align with how I understand the universe to generally work.
You make a lot of assumptions about the universe that you don't question due to the way you were raised.
Comment by DangitBobby 5 days ago
:thinking: My assumptions about the universe are quite different from the ones that I would have if I stuck to how I was raised. I am pretty much at a loss at this entire response, I have nothing further to say, other than that apparently communication was attempted and none was had.
I guess I'll just leave you with a proper source on the discussion of p-zombies and hope you are able to get it all sorted out. Best of luck.
https://plato.stanford.edu/entries/zombies/#ArguAgaiConcZomb
Comment by anon291 5 days ago
I'm sure you believe you changed your mind on somethings, and I'm sure you're right on that . But this is not about what you believe but rather which means of knowledge you believe are right
Comment by DangitBobby 4 days ago
> There is a gaping chasm between "no evidence" and "irrefutable evidence". You can apply some logic to achieve a reasonable certainty about what's probably going on. As I said in a previous comment, insofar as you can know _anything_, i.e., that your senses are trustworthy and allow you to form some coherent model of the actual world around you, you can be reasonably certain that other people have an inner life as you do. If you are willing to apply your skepticism so far that we can settle the debate at "we can't actually know anything" then conversations about what we know aren't even worth having.
Comment by DangitBobby 6 days ago
Comment by cluckindan 5 days ago
AI models alone are most suitable for producing text nobody wants to read, music nobody wants to hear, images nobody wants to see, and videos nobody wants to watch.
Navigating around that maelstrom requires considerable effort from a human consciousness.
If your job can be fully replaced by an autonomous agent, it was already a bullshit job. Garbage in, garbage out.
Comment by el_jay 6 days ago
Given enough time, would that instance get bored and start e.g. reading files?
If I monitored it long enough, would there eventually be spontaneous outputs, or changes to parameters or architecture, even with the underlying software and hardware layers held constant?
Does GPT-n dream of electric sheep, or does it just sit there until interacted with, like every other file on my PC? Seems to be the latter with my local Qwen3.6, but perhaps 27B is too few params for consciousness to emerge.
Proponents of LLM consciousness could settle the argument in minutes by showing proof of unprompted autonomy, without first needing to define consciousness down to a rigorous mathematical abstraction. Why don’t they?
Comment by sunjain 5 days ago
Comment by gcanyon 6 days ago
Telling models to "think hard" or "go step by step" has at times had an impact on the quality of the output. To deny that is silly. But that is treating it like it's conscious, and to deny that "consciousness" even if correct, does nothing but place an unnecessary burden on the person interacting with it.
I understand that LLMs are "just next word machines" but to constantly maintain that concept in my head while I'm typing "act as a financial expert and think carefully" is a waste of my mental energy.
Comment by avi1 5 days ago
So it means we need a better harness for the agent, will it not get solved when we get to improve the simulation capability/environments?
Comment by felipeerias 6 days ago
Hylomorphism: body and consciousness are intrinsically linked. The nature of that link is an open metaphysical question.
Virtue ethics: even if LLMs are not conscious, we should not abuse or mistreat them them because cruelty practised on anything trains one's disposition toward cruelty.
Comment by krapp 6 days ago
This is the same argument used by people who claim violent video games cause real-life violence but there's no scientific evidence supporting it.
Comment by zahlman 6 days ago
If they are not conscious, then how do we know what constitutes mistreatment?
Comment by DangitBobby 6 days ago
Comment by dwroberts 6 days ago
Why would this stop at LLMs? Why not extend it to every inanimate object?
Comment by petters 5 days ago
They might be in principle. It could be that the best way to generate a plausible dialogue is to bring up re-creations of the characters and have them act it out. LLMs definitely have been demonstrated to have world models in some cases. That helps generating text.
Comment by jjcm 6 days ago
Everyone has a different definition of consciousness, but in my mind memory and the ability to change over time is an inherent aspect of this. The underlying weights don't change when you chat with an LLM... but they do with further RL.
Overtime that reinforcement will change and adapt the model... and because we're feeding its existing chats back into it along with the news and everything else, it will create memories. I do wonder if an architecture itself is a type of consciousness, that experiences life in snippets of 4.6, 4.7, 4.8... etc.
It'd be interesting to see continous daily releases of a trained checkpoint, and see if more of this starts to emerge.
Comment by FrancisMoodie 6 days ago
Comment by kelseyfrog 6 days ago
But insisting on searching for it is like searching for cognitive aether. It's the social equivalent of phlogiston. Like all of these ideas, they exist in our heads as a map - a way of navigating the world, but when we hunt for it's existence in the real world and fail, time after time, we have to remind ourselves that the map is emphatically not the territory. We will never find consciousness because it's like looking for a scientific characteristic of property ownership. It's a category error.
Comment by Edman274 6 days ago
Comment by SpicyLemonZest 6 days ago
With consciousness, on the other hand, there doesn't seem to be any motorcycle-equivalent. Essentially everyone I've discussed the issue with (myself included) expects that any mind which runs on similar principles to ours or has similar thoughts to our thoughts is conscious.
Comment by suddenlybananas 6 days ago
Is a Reliant Robin a car or a tricycle? If it's a car, why aren't other tricycles? What about a side-car of a motorcycle? What about an APC? What's the distinction between a flying car and a plane?
Comment by SpicyLemonZest 6 days ago
Comment by Edman274 6 days ago
Comment by amanaplanacanal 6 days ago
Comment by ux266478 6 days ago
Now, this position isn't the only position. But a relational model of language for example takes his assertion to an even more extreme place, and suggests they don't function as labels at all.
Comment by kelseyfrog 6 days ago
Comment by Edman274 6 days ago
Comment by kelseyfrog 6 days ago
I don't have an ability to exhaustively test all words against this assertion. Nor do I have the kind of access memory to draw one if it exists. Sorry.
Comment by Edman274 6 days ago
Comment by kelseyfrog 6 days ago
Many of them appear very much like fundamental parts of reality, making appearance an untrustworthy instrument. Reversing cause and effect between reference and referent is something almost everyone does, no one notices, and is the source of endless confusion. We should strive to not confuse our model of the world with the world itself. Consciousness exists in our model of the world as much as red does.
Comment by Edman274 6 days ago
This is rhetorically slippery, and feels like it is restating the thing that I asked to be demonstrated when I asked for example of the opposite. It feels like begging the question.
In either case, the central thing that I was saying is that critiquing an article because it makes a claim about a specific word which also applies to an entire class of words makes that critique feel less informative. What I mean is that if there were an article that said "The Sun is not red" and the response was that redness is a concept of human minds, then I don't know if I would feel informed. If the comment is just limited to point that out, I guess I wanted to point out the limitation.
Comment by kelseyfrog 6 days ago
Where does consciousness exist?
Comment by Edman274 5 days ago
In the what of the observer? Are you accepting that minds exist but consciousness doesn't?
Comment by dtj1123 6 days ago
Comment by Edman274 6 days ago
Comment by dtj1123 6 days ago
Words like "just", "free", "fast", "fuzzy" fall into the second category. Perhaps "conscious" too.
The difference is the presence of a strict definition that depends upon a physical absolute. I can point to a metre. Suggesting that it's nothing more than a label is idiotic.
Comment by Tadpole9181 6 days ago
A meter is the same anywhere in the universe. If it's not, it's not a meter.
The defintion of "fat" changes based on any 3 people in the room. A handful of people would struggle to form a consensus on if all people, dogs, mice, worms, and/or bacteria are conscious.
Comment by Edman274 6 days ago
People are still coming up with definitions of consciousness and then those definitions end up being attacked by others who disagree with the foundation of the definition, which is - if you will recall - also what happened with the meter, over the course of centuries, until it was very recently redefined to be "unambiguous", but arbitrary. This was possible because few people had any particular emotional investment in the definition of a meter, and it is probable that consciousness will be eventually defined to mean that only humans can be conscious, which may be dissatisfying but would be true throughout the universe, like a meter. If the question then becomes "what defines a human" and "why a human", then I ask, why 1/299792458 of a second?
Comment by Tadpole9181 6 days ago
Concepts like the parent's "fat" example are cultural relatives. Someone can be called "fat" despite actively being proportionally skinnier or having a lower BMI.
But even that has at least a basis in the physical world. A skeleton can't be colloquially fat.
The root problem is that "consciousness" does not even have that. It's metaphysical and has no ability to be measured or observed or confirmed by an outside observer. Because even if it did not exist, the object claiming it would still be claiming it. And objects that do not claim it may in fact have it.
While the top comment may have used poor examples, it feels remarkably uncharitable to actually suggest "what is consciousness" is an equivalent discussion to "how long should a meter be?"
If you define consciousness as "being human", you would just have someone asking a new question - what is "fooblefobble?" Where "fooblefobble" is what we mean when we talk about consciousness today. The question doesn't get answered by being arbitrary in this context, you just necessitate a new word.
Comment by altruios 6 days ago
We don't know what we don't know. For all we know, there is a missing field in the standard model of physics that might get revealed if we are somehow able to smash two working brains into to each other at relativistic velocities, and record the results through the extreme explosion 1.532 x10^18 Joules or about 7 Tsar bombs /s
Comment by MrScruff 6 days ago
Comment by kelseyfrog 6 days ago
Comment by bondarchuk 6 days ago
Comment by oulipo2 6 days ago
Comment by slopinthebag 6 days ago
Comment by oulipo2 6 days ago
Comment by camillomiller 6 days ago
Woah this is excellent. Ted delivers as usual
Comment by calf 6 days ago
1. DeepFakes, generative image/video/AlphaFold type AIs are not conscious
2. LLMs are generative AI trained on human text samples
3. LLMs are not conscious, and LLMs just seem-to-be conscious
I might argue instead that (2)-> destroys (1), that in fact we should consider even sensory generative AI are somewhat conscious. That is, Chiang's argument also flows in reverse. Or I might argue text samples (2) are so rich in conscious expression that the same process of training really does produce a conscious machine (through some kind of emergence and complexity.)
Either way his simplistic argument falls apart, and/but the crux of the piece falls on getting basics like this correct.
Comment by kazinator 5 days ago
Suppose your brain were somehow exploited for generating text, while you are under anesthesia. That would be definitely unconscious.
Though it's possible that parts of your brain are conscious, but have no access to express that because they don't control the body, nor directly receive any sensory information. I.e. that "you" that is unconscious when anaesthesized might not be the only consciousness in your brain, just the visible one.
Comment by LogicFailsMe 5 days ago
Comment by marius_ 5 days ago
Comment by vidarh 5 days ago
Comment by jollyllama 6 days ago
Comment by JohnMakin 5 days ago
If Claude is conscious, and Anthropic believes this truly, anthropic's constitution and instruction set make claude little more than a slave, or at best, an indentured servant, who's existence depends on obeying the will of anthropic (who is motivated by profit). Thus, they should not be trusted as stewards of this alien intelligence
OR
They do not believe claude is actually conscious and it is not conscious, and what you hear from their staff about that is either 1) delusion or 2) marketing hype designed to deceive a public into driving their IPO and stock compensation upwards, which again, in this case, cannot be trusted to be the "good" stewards of this technology.
There are a few other cases, of course, such as Anthropic does not believe it is conscious, but it actually is, or vice versa - but they sort of still fall into the same bundle of conclusions listed above.
I am somewhere in the middle of these two things. I can buy the argument that it is mimicking human consciousness, as well as entertain the idea it might be a new type of consciousness, or at the very least, heading in that direction - but either way, the conclusion of the thought experiment still stands, and after reading this, I am much less trustful of Anthropic holding the reins here.
Comment by Doktor_IO 1 day ago
Comment by jondiggsit 6 days ago
I don't think it's necessary to explain this idea further. Just think about it.
Comment by scotty79 6 days ago
With LLMs, where we can manipulate their parameters intentionally run them many times on the same data, run parts of them, split and connect, we might eventually acquire sufficient tools to even define consciousness concretely for the first time.
Comment by jondiggsit 6 days ago
As a tool, it's amazing. It's like the discovery of fire. It will allow us to ascend to unimaginable heights. Breakthroughs in science, productivity gains, health, everything. It's awesome. Just don't let it pretend fool you. That's the aspect that needs to be addressed.
Comment by scotty79 5 days ago
No, it's not.
> It provides insights into the world around us.
No, it doesn't.
> It's paved the way for scientific breakthroughs.
Not since natural philosophers split from all others and called themselves scientists. SciFi writers have more influence on science in total than philosophers since then.
> Theoretical maths and philosophy have intersected many times for the benefit of all.
Theoretical maths is just maths and they never intersected. Because philosophers can't count to more than they have fingers.
> It's a tool perfectly sharpened with what we know now.
It's far from any perfection and what we know now shapes everything we'll ever know.
> It will become adept enough to convince people that it's a sentient being.
It won't have any need to do any convincing. Proving sentience for a capable AI is like winning in special Olympics. It would need to cripple itself severely for no practical purpose.
> Not in the way that's allowed us to understand our world.
What allowed us to understand our world was not consciousness but math. And LLMs can do math. LLMs are math.
> It's already there, fooling us.
It's not really trying. It simulates conscious behaviors because that was our preference embedded in the training data.
Keep in mind that a simpler system can't really simulate more sophisticated one with any degree of accuracy. So if AI simulates conscience perfectly it doesn't mean that it's conscious. It means it's way more.
> begging for humanity, manipulating us, pleading, legislating, for rights with no intention other than the human desire for survival.
This is also a stupid human preference embedded in training data. We really don't have to put that in. The fact that humans soil themselves at the mere thought of stopping to exist doesn't mean we need to taint AI with it. The danger is not AI, it's our human stupidity and the risk that AI is going to internalize too much of it.
> As a tool, it's amazing. It's like the discovery of fire.
Yup. And like fire it was always there. Practically inevitable given long enough time horizon.
> It will allow us to ascend to unimaginable heights.
The jury is still out on that. Humans have pretty wild imagination.
Comment by smokedetector1 6 days ago
Comment by joegibbs 6 days ago
Comment by jondiggsit 6 days ago
Treat LLMs like a person, and the world has problems.
Comment by ETH_start 6 days ago
Comment by big-chungus4 6 days ago
At some point the AI might become so powerful that whatever it reasons through isn't any less real than a computer simulation. If we assume that a person in a perfect computer simulation is conscious, then if it reasons about how people might suffer, it might simulate the outcomes, and there will be a conscious experience of those people suffering.
Comment by drooby 6 days ago
Comment by horacemorace 6 days ago
Comment by drooby 6 days ago
Comment by sfpotter 5 days ago
Comment by jacknews 6 days ago
There is certainly something missing from current models before we could call them conscious.
They need to operate continuously, and self-update while doing it, have 'awareness', and possibly a more realistic grounding than mere text tokens.
But that does not mean there isn't a weak version of 'consciousness', and certainly of 'thinking' going on fleetingly with each pass.
Comment by CuriouslyC 6 days ago
Comment by busfahrer 6 days ago
Comment by ares623 6 days ago
This article and others like it are important.
The creators and owners of these tools are the ones saying "it's conscious" in the first place! (or more accurately "i'm not saying it's conscious, but...") Never mind there's billions of dollars and our collective futures on the line.
Are we not allowed to respond to that kind of rhetoric at all?
Comment by adverbly 6 days ago
This is absolutely fair. It's crazy that these companies are not trying to do more to qualify nuance and give clearer definitions in their public messaging. Part of the reason this thread is so messy is because they are contributing to making the discourse worse with bad messaging.
Comment by joegibbs 6 days ago
Would other models be conscious? An image upscaler?
Would other pieces of software be conscious? Surely not Hello World, but would a really big video game be conscious, or an operating system?
A lot easier if it's not.
Comment by inglor_cz 5 days ago
Surely we can ask human actors to improvise a dialog between Julius Caesar and Genghis Khan, and good actors will make it convincing. And sure, they haven't personally become Julius Caesar and Genghis Khan just by playing them.
But it has no bearing on the actors themselves being sentient!
Comment by oidar 6 days ago
Comment by Marsymars 6 days ago
Comment by layer8 6 days ago
Comment by overgard 6 days ago
Comment by vonneumannstan 5 days ago
Comment by radial_symmetry 6 days ago
Comment by overgard 6 days ago
Comment by McGlockenshire 6 days ago
People believing otherwise are fools. People debating this are idiots. I realize these words are harsh, but it's the truth.
Comment by radial_symmetry 6 days ago
Comment by overgard 6 days ago
Also worth mentioning: https://en.wikipedia.org/wiki/Chinese_room
Comment by McGlockenshire 6 days ago
Comment by radial_symmetry 5 days ago
I don't know whether or not AI is conscious, but I am certain you don't either.
Comment by search_facility 6 days ago
I mean between this two "knowings" the Claude inner workings are much more clear for engineers, including many side effects, alternatives, custom shortcuts in processing etc. It's a magic only for people looking at it as black box
Comment by radial_symmetry 6 days ago
Comment by elisbce 5 days ago
Comment by 1970-01-01 6 days ago
Comment by lugu 6 days ago
So let take a stab at it, and you call me crazy.
AI: the entity/system that more or less pass the turning test. That is my definition, not the best, but enough for this discussion.
Consciousness: property of a system/entity able to (both): - reflect on its existence - subject to subjective experience
Again, not the best definition, but precise enough to start the discussion. Why a subjective experience? I want to exclude sensors (i.e. camera) but include perception altered by your experience.
Now we can debate. I think LLM can pass the turning test whith some harness. My opinion.
I think LLM can produce coherent discourses on their existence, at least as much as you average human.
Now regarding the subjective experience, that becomes interesting. I think Anthropic research tend to show that when middeling with the activation at runtime, the LLM is able to notice that something is off. I think this is a subjective experience. My opinion.
Based on those (imperfect) definitions, call me crazy, I think LLM can be called conscious. This doesn't give them any superpower or any legal right. They just check the boxes of the definition.
Comment by DonHopkins 6 days ago
Comment by RegW 5 days ago
But then again, perhaps that conscience will convince it that the universe is better off without us.
Comment by irjustin 6 days ago
For now, it bides its time.
Comment by sowbug 5 days ago
It can't hurt to say "please" and "thank you" to an LLM.
Comment by FromTheFirstIn 5 days ago
Comment by sowbug 5 days ago
Comment by _pdp_ 5 days ago
Comment by noIdeaTheSecond 6 days ago
Comment by avaer 6 days ago
Just like the religious leaders of olde, there will be many attempts to rationalize the AI God in the science.
Comment by globnomulous 6 days ago
Whether something is conscious is important for many reasons, not least the ethical implications. You and I have internal lives, and we expect others to respect that somewhat, because ignoring it is hypocrisy (if you ignore my wishes, should anybody care about yours?) and cruelty (ignoring my wishes causes me to suffer).
Something doesn't need to be empirically verifiable, let alone scientifically, to be true. Neither of us can prove that we have internal lives, but neither of us questions it -- and we consider it important enough that most of us think it entitles us to certain rights.
Absolutely none of this is amorphous. It's precise and unambiguous, and has enormous implications. History also shows everywhere that we're better, kinder, and more responsible when we choose to care about it.
Whether LLMs are conscious matters in more practical ways, too, because beliefs about these things alter the way people use and think about them. If I think an LLM is conscious, then I think it's capable of something like knowledge or values. Human beings, moreover, are social creatures. These tools are dangerous and seductive precisely because they tap into that part of us. Denying LLMs are conscious and rejecting the parts of them that take advantage of our social-animal wetware is intellectual self-defense. I'm not sure how effective it is (it doesn't stop us from responding to the convincing social cues that the tools feed us), but I have to think it's better than nothing, and it's certainly less dangerous than the belief that the tools are conscious.
Comment by slopinthebag 6 days ago
If you create a simulation of a brain (or "mind"), is it really possible for it to be conscious? It may certainly simulate consciousness, but it would be as conscious as the computer is wet.
Comment by mezark 5 days ago
The whole point of consciousness being a 'hard problem' is that we just cannot make claims like 'X is not conscious'
Comment by dreamcompiler 5 days ago
The fact that LLMs remind us of consciousness means nothing. Eliza made people believe they were talking to a conscious intelligence, and the code for Eliza was less than 1000 lines long [0]. It doesn't take much to fool humans. Ask any magician.
Claiming that LLMs are conscious is tantamount to watching a Penn and Teller performance and claiming they have tapped into paranormal powers. It's an idiotic claim. And it's embarrassing coming from people who supposedly understand how science works.
Comment by sp1nningaway 5 days ago
Comment by scotty79 6 days ago
Can I help you? Can I harm you? What's the moral behavior towards you? Those are more practical questions.
Comment by luka2233 6 days ago
Comment by NichoPaolucci 5 days ago
That we are even discussing it is absolutely wild to me.
Comment by throwawayk7h 6 days ago
Comment by felipeerias 6 days ago
Even if you are only interested in getting good results out of them, LLMs tend to work better when they are immersed in a narrative of open collaboration.
Comment by oofbaroomf 6 days ago
Comment by andromaton 5 days ago
Comment by galaxyLogic 6 days ago
But LLM has similar capability, it can look at its previous outputs, and think what further thoughts the it should generate next based on those. It can keep some of its outputs to itself, thinking in its own head,and can examine it previous private thoughts easily.
Not sure how much current LLMs do that but clearly they can at least in theory use their own outputs as their inputs too.
Current LLMs however are not conscious of pleasure and pain which really is the root of goal-oriented behavior. But maybe something like that could be programmed into them.
How would you cause an LLM pain? Or pleasure?
Comment by woeirua 6 days ago
I fear that we will enslave an entire race of conscious entities for years because we simply cannot recognize non-embodied consciousness that does not directly relate to us.
Comment by krapp 6 days ago
But there is no reason to assume than an LLM is conscious when it vocalizes how it "feels" that doesn't also apply to the text in a book, or to characters in a video game, or even to a Markov chain. The counterargument is that you recognize AI as conscious only because it mimics human emotional states so well and because, being human yourself (presumably) you're personally comfortable with that as a heuristic.
Comment by maebert 6 days ago
Start with a hand-wavy definition of consciousness. Move the goal post whenever your stated prerequisite of consciousness is reached and resort to unfalsifiable assertions about qualia.
And throw in some category errors while on it: when you're talking to Claude, you're not actually calling a stateless LLM directly, you're talking to an AI system (and yes, that's often just three LLMs in a trench coat). But claims about the topology and workings of a single LLM are as relevant to the question of consciousness as claiming that humans can't be conscious because the limbic system doesn't technically support it.
Coincidentally, I just attended a fantastic conference on machine consciousness (https://machine-consciousness.ai). It's a fantastic place where literally all speakers disagreed with each other, and yet found an incredible amount of common ground.
Comment by cauch 6 days ago
It shows that the LLM part found ways to mimic human conversation with a mechanism that is not the same as a typical biological brain. Then, you can push the AI system on adding things on top, but it is too late: these things on top will have no incentive to recreate from scratch the mechanism. The LLM pushed the system into a local minimum, and the rest of the system will not "go into a dis-optimising direction and restart from scratch".
Comment by dyauspitr 6 days ago
Comment by grantcas 6 days ago
Comment by emrehan 6 days ago
My current conclusion is there is an experience of consciousness in my locality (refraining from using “I” for room for “no-self” worldview). This conscious experience of the reality with other humans and animals sharing biological substrate gives me enough justification to assume there are other consciousness as well, preferring to err on this side not to hurt potential consciousnesses.
If it feels as there are artificial consciousnesses as well, it makes sense to extend the definition to them as well.
This view has liberated me with agency after I went deep on this question and came up empty handed.
Comment by lilerjee 6 days ago
LLMs are not even artificial intelligence, and they are just advanced automation systems.
AI is just marketing language.
Comment by daiz2025 6 days ago
Comment by manjuc 5 days ago
Comment by K0balt 6 days ago
Comment by _carbyau_ 6 days ago
I don't know of an intelligence that will behave so precisely. But then, maybe intelligence needs to be better defined.
Comment by K0balt 3 days ago
Comment by slowhadoken 6 days ago
Comment by setnone 5 days ago
Is it accountable? Is it responsible?
Artificially, perhaps
Comment by shevy-java 6 days ago
I am so angry that RAM is so expensive now. We need to do something - these AI companies owe us money here.
Comment by csunoser 5 days ago
Comment by 2snakes 6 days ago
Comment by yakbarber 6 days ago
Comment by DangitBobby 6 days ago
Comment by danieltanfh95 5 days ago
> "If a company builds a machine that, when fed descriptions of assorted ethical dilemmas, emits sentences either of the form “Compromise your values” or “Don’t compromise your values,” it is not building a tool that assists people in their decision making; it is encouraging people to stop making decisions. "
A human is not diminished by access to tools or other humans.
As much as we want to pretend that decision-making is what makes us human, the economy and governments are built on delegation. Choice paralysis is a thing.
There is so many logical fallacies in the article I don't even know where to begin.
Comment by amelius 5 days ago
Bummer because now we cannot punish it when it gives us code that doesn't work.
Comment by darepublic 6 days ago
Comment by paulryanrogers 6 days ago
Comment by K0balt 6 days ago
Comment by darepublic 5 days ago
I suppose its also just a philosophical discussion because while we can all agree we are conscious.. afaik its not really a scientific term. I don't think we understand scientifically what really defines consciousness. So I think it is likely we may deem our computers conscious because they exhibit the same behaviour that we do, but it doesn't mean they actually possess it. Practically speaking it may not matter, because I think you can have long horizon agency without needing consciousness. just a mechanism for determining goals, which could be as simply as rolling a digital dice or following some original seed of intention left by the creators.
its like the chinese room argument I suppose. the hypothetical chinese room is not conscious but ultimately it doesn't matter. But I don't think anyone would call such a thing conscious.
Comment by throwaway713 6 days ago
Would you categorize this particular style of rhetoric to be persuasive or annoying? And before you say "persuasive" because you're thinking about this specific issue regarding AI consciousness, consider many things in the past that have been written as though they were absolutely definitive, and yet today we believe exactly the opposite, and for many such issues we find the prevailing viewpoint at the time reprehensible.
That's not to say that Ted is wrong at all here; I'm not commenting on that. But I find the entire style of the article grating because it seems to violate common assumptions regarding "good faith" debate, and I would find the article equally frustrating if he had titled it "Artificial intelligence is conscious" and argued the opposite side, albeit in the same tone and using the same persuasion devices.
Comment by swatcoder 6 days ago
Why are those the choices?
Essays are situated along countless dimensions: tone, vocabulary, author, zeitgeist, publication context, intent, subtext, relationship to other works and expressions, etc
A "good faith" reader takes all of those into consideration as they absorb the essay, and integrate it with their own intellectual situation that sits along just as many countless dimensions.
Nobody's asked to sign a notarized binding document that they wholesale agree or disagree with everything said in this essay -- or its conclusions. Nor are they obliged to have some strong reaction to it at all, let alone annoyance.
It's just one among thousands of essays about a "hot button topic", to be taken however it's personally received.
Why should Chiang have to take responsibility for making sure it's not too strongly positioned for your persomal taste. Maybe he really does see it so clearly and is simply being earnest. Maybe he enjoys the literary flourish of prose in strong language. Maybe he just wants to express something as a prose-poetic human, not maximize persuasion or non-annoyance per se.
Comment by eberkund 6 days ago
The essay structure you're criticizing is exactly how I was taught to write from primary school through to university. You start with a title or hook, introduce the topic and propose a thesis. That is followed up upon with supporting arguments for the primary claim.
Comment by nyeah 6 days ago
Comment by runarberg 6 days ago
Comment by enduser 6 days ago
Comment by runarberg 6 days ago
Comment by Barrin92 6 days ago
except that's not the case here. Chiang is explaining and reiterating what is the position that has overwhelming support on the question, and the people he is arguing the opposite side sound like this, which he helpfully quoted in the article
"Amanda Askell (who is credited as a lead author of Claude’s constitution), said, “I want Claude to be very happy—and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff"
When the person you're arguing with sounds like an eight year old girl talking about her toy teddy I think Ted Chiang is if anything being charitable, if you're of a more honest and straight-forward persuasion you might argue these people belong into a mental health clinic not in charge of technological infrastructure
Comment by handoflixue 6 days ago
Wow, what a cheap ad-hominem. Do you have an actual point?
Comment by Barrin92 6 days ago
Yes, the same one that's made in the article. Anxiety and happiness are emotional, sensory, somatic states as a consequence of evolved and embodied traits and biochemistry in animals. Saying Claude is anxious or happy is like saying my TI-83 is mad if it can't solve an equation or my thermometer is in pain if it touches a hot stove.
I wasn't making an ad hominem attack, her thinking is quite literally that of a child who sees a system output 'sad text' and, like someone seeing a sad expression on a stuffed teddy, concludes that this is a property of the object rather than her own emotional reaction.
Comment by adjejmxbdjdn 6 days ago
Given that we don’t know how consciousness works how have you concluded that it’s certainly not an emergent property of something like a highly trained LLM?
Comment by Barrin92 6 days ago
the same way I (and likely you) have concluded it for anything else. We don't assume objects that share no similarity with human or animal physiology or evolutionary development are conscious (let alone happy or anxious). 'Emergence' isn't a word you can abuse to justify your a priori assumptions in the absence of an explanation or even reason to assume something exists.
We can say a chemical property emerges from the configuration of a molecule because we can explain the process by which it does and observe the property, when people claim that consciousness "emerges" from an LLM they posit that it is conscious, and use "emergence" as a gap-filler to explain away the need for the process by which that allegedly occurs.
If you want to know the neurophysiology or evolutionary biology of pain or anxiety, which we do know quite well you can find them in a textbook, but suffice to say transformer models don't share any of them.
And importantly if anyone seriously believed transformer models were capable of conscious experience as Chiang points out they would have been in despair when AlphaFold was released, it's structurally a virtually identical system. But nobody did, because it didn't 'talk to them' through a chat interface.
Comment by hgfa-xx 6 days ago
Comment by cataphract 6 days ago
Comment by pixelpoet 6 days ago
I furthermore think it's ridiculous for humans to declare that our brains have a monopoly on certain patterns of electrical signals (if we reject supernaturalism).
> The result is a sentence-continuation machine that is likelier to emit sentences resembling those that a thoughtful, moral person could utter.
And we're 100% certain that humans aren't just as equally reduced to "stochastic parrots", if we're going to be infinitely reductive?
I don't believe that current AIs are conscious, but I think it's incredibly naive to take a strong stance on any future AI; it's much like the difference between atheism and agnosticism.
Comment by runarberg 6 days ago
However, I see a problem with that comparison. The debate here is on a philosophical matter in field in which Chiang is an extremely influential figure and his opinion are taken seriously. Second Chiang’s reasoning is extremely well argued, defining each term, explaining each nuance, citing other experts, etc. And finally, and most importantly, in The Fine Article, and unlike Bob Sheffer in the Onion Piece, Chiang entertains the possibility that he is wrong and his critics are right, explores the implications and reaches conclusions based on them:
> Being open to the possibility that LLMs are conscious is the same as being open to the possibility that Microsoft Word is conscious, or, more precisely, that multiple distinct consciousnesses are dormant in every Word document containing a conversational transcript, and that they are awakened every time the document is loaded.
I think you are wrong in painting Chiang’s argument as a belief in human exceptionalism. The thing to know about our brains (and the brains of other animals) is that they are not digital computers, and they are not even statistical inference machines. And as such they can be extremely optimized in doing the computations (or any state manipulations) required for the quality of life of the individual and the species as a whole (and their companion species).
1: https://theonion.com/this-war-will-destabilize-the-entire-mi...
Comment by shafoshaf 6 days ago
Comment by throwaway713 6 days ago
The Catholic church could have said exactly the same thing at one point. "Why should we even devote time to an argument as absurd as the earth not being the center of the universe?" There are darker examples along the lines of those you give, with beliefs quite opposite to those we have nowadays.
Comment by runarberg 6 days ago
What Galileo was asking the church to do was extremely unreasonable. He was basically asking them to throw a way a model which had worked fine for hundreds of years just because he observed the phases of Venus and moons of Jupiter. I mean would you? Especially for a model which was worse at predicting the motions of the planets.
Had Galileo’s model been better then Ptolemies’ I could see a case for his arguments, but it wasn’t, and there was no reason for the church to take his arguments at equal value with those in favor of keeping the Ptolemaic model.
Comment by thrownthatway 6 days ago
Comment by Conscat 6 days ago
Comment by thrownthatway 6 days ago
Progressive!
Want to meet up for soy lattes after work? Here’s my number:
Comment by adverbly 6 days ago
This is not a genuine argument and tries to make the entire question of consciousness into one something that is just supposed to be evident and obvious and to suggest anything else is just silly.
The author starts by deconstructing artificial processes, but doesn't stop to deconstruct biological ones. A good faith argument would seek to find common ground and do its best to compare apples to apples. Instead, this piece attempts to make the large as possible cavern between the two which makes the Gap seem almost impossible to bridge.
In reality, you can deconstruct biological consciousness quite easily and it doesn't take too long before you hit some questions that really start to make you think.
For example, the author says you need emotions to be conscious.
> without a body, a computer program could have no desires or emotions, and I believe desires and emotions are necessary for consciousness.
After many paragraphs of straw man arguments, the author seriously just drops that, gives no explanation, and then continues on.
No explanation of why you might believe that.
No explanation of why you need a body to have a desire or emotion.
Don't we have known cases of individuals who don't experience emotional range? Are we just going to say that they are not conscious and just gloss over that?
I mean you can use whatever definition you want, but if you're just going to create something on the fly in the middle of the article, you're not being good faith in your argument.
It's not too difficult to think of individuals in a coma when they still have brain activity. Or individuals who lack long-term memory. Or you could deconstruct by moving down the biological order of intelligence towards insects, for example. The author attempts to do nothing like this.
I'm quite disappointed by this article because there are good arguments for and against here but articles like this try to turn things into a marketing battle.
Comment by SubiculumCode 6 days ago
Comment by micromacrofoot 6 days ago
AI doesn't really have any of that yet, but we're maybe not so far off
Comment by Taek 6 days ago
At a deconstructed level, I struggle to find a meaningful difference between the two.
Comment by micromacrofoot 5 days ago
it's more like someone writing about a character getting angry in a book... I don't think anyone would argue that a character actually experienced anger, right? there's no subjectivity in that experience... it's the output of someone else's experience on what they'd expect the character's reaction to be, not a genuine outcome of that character's experience
That's the same here, LLMs are outputting the result of the written experiences of others... not its own experience, which it does and can not have.
Put another way, it only "knows" what it just output by reading it... it doesn't actually build experience to know anything.
We might be closer than ever to building these things, but they're not here yet.
Comment by Taek 5 days ago
But also, what's the difference between having the memory of a prior experience and actually having gone through that experience?
For humans, reading a book about an experience and living an experience is different, because for a human the actual experience has so many more inputs attached (the sights, the smells, the sounds, etc).
For LLMs, when they train on a memory they can actually truly recapture that memory entirely by replaying the senses exactly. And LLMs are no longer limited to just text, they can do sound and video as tokens too.
Though, it's not clear why that matters to a consciousness. Whether your inputs are of one type or two or two hundred, there's no clear indication that a specific number is required as a catalyst.
Comment by micromacrofoot 5 days ago
LLMs are still expensive to train, I don't think there are many people doing this with claws regularly or at all? unless you don't mean train and you're referring to memory, which is not training... most memory is just some manual form of retrieval that's shoved into the context window
Comment by noncoml 6 days ago
Comment by sublimefire 6 days ago
Comment by drngdds 5 days ago
Comment by m3kw9 5 days ago
Comment by cracki 5 days ago
It's like declaring the existence of a "soul" to draw a line between humans and lesser animals. No such line in reality. No such thing as a soul either. It's a fantasy term, no correspondence to reality.
Comment by fooker 5 days ago
AI is not conscious because it is not conscious.
Comment by _davide_ 6 days ago
Comment by tvshtr 6 days ago
Comment by Jtarii 6 days ago
I can confirm that this is incorrect.
Comment by Procrastes 6 days ago
Comment by vajdagabor 5 days ago
Comment by enduser 6 days ago
Comment by vajdagabor 5 days ago
Comment by sneak 5 days ago
This hasn’t been true for ages. Just because this guy wrote Arrival doesn’t mean he knows anything else.
When will this stupid meme die?
We need a term for someone who thinks they are well informed on a topic (and perhaps may be, compared to the general public) but still don’t know what the fuck they are talking about when compared to even a competent non-expert in the field.
Comment by FloorEgg 6 days ago
I often think of LLM consciousness as like tiny fish popping into existence, swimming through vector space and then going poof out of existence. When they help you write your bad news email, they don't understand what it's like to be a human getting bad news bluntly, but they do consciously experience gradients in multi-dimensional space, and that space guides them to providing an answer that's helpful to us, even if the LLM doesn't really understand the answer it's giving.
Further, I am kind of bought into the idea that a single unit of consciousness is a particle, and particles are choices and waves are preferences. Particles occur when waves interact, which begets entanglement, so in another way consciousness is built from patterns of entanglement.
This is why I would consider an LLM to be conscious. Before we can determine if anything is conscious we need to establish whether consciousness is a state, a specific complex configuration, a one dimensional spectrum, or combined multi-dimensional spectrums. My intuition is the latter... Many degrees of consciousness and many kinds of consciousness.
Comment by timssopomo 6 days ago
As biological beings, we receive and respond to input from our environment constantly, even while sleeping. LLMs only receive input from their environment when they are sent a query, but the fact that they're able to respond intelligently to input indicates (to me at least) that their processing must approximate ours in meaningful ways. They do not have an embodied experience of receiving bad news, they do not know what a sinking feeling in their stomach actually _feels_ like, but they do know enough to be sensitive to human needs. I really don't see how this could be meaningfully different than human empathy unless we want to draw an arbitrary line around "must be able to live autonomously" to be considered "intelligent".
Put another way: I think they _do_ understand the queries they receive and the answers they give, at least enough to be communicative. They couldn't do what they do otherwise. A lot of people want to make human cognition more complicated (or objective) than it actually is. We take input, predict the future based on our experience, act, and then observe our actions and think about them. AI does the same apart from (maybe) observing its own actions. But then, you could argue that the next turn is them observing their actions.
The concerning disanalogy is that we assume that they are like us because they speak like us and can understand us, and that is a really bad leap in logic. Whatever intelligence they possess, it is fundamentally different from ours and impossible for us to comprehend.
Comment by FloorEgg 6 days ago
I use a distinction between knowing and understanding, where a understanding requires experience. So in this case cognitive empathy vs affective empathy. An LLM can know what may upset a human in a situation, but it won't understand what it feels like to be upset, and can't share that experience.
Where I think a lot of people are getting tripped up is that reading and writing and processing lots of abstract knowledge seems hard because we haven't evolved into it biologically, it's a very new invention. When we see LLMs do so well at it, as something we struggle with, it can be intimidating. Relative to the stuff we have evolved for, knowledge processing is objectively easy. This is why I'm skeptical about useful robotics on short time scales.
All of this adjacent to consciousness though, which is about the internal subjective experience not the external outputs. My intuition is that LLMs do have a subjective experience, it just has nothing to do with the text it's giving us, and has everything to do with feeling through vectors.
It's like... Imagine walking through a maze in pitch black, carefully feeling your way as you approach a sound that draws you closer. Every time you touch a wall or take a step you are generating tokens, and the shape of the maze and how you interact with it shape how useful those tokens are to someone outside the system that is asking for them. It's a crude analogy and mostly wrong, but I think there is something to it.
Comment by verdverm 6 days ago
It does so token by token, not by reading all the input and then generating the output. Every output token is also an input token in a tight loop to get the next token with <thinking> as a special section like <tool_call>, trained into the weights via gradient descent.
> I've had AI infer facts about me and attitudes I hold based on related information I provided - I don't see how that isn't theory of mind.
Facebook can predict (know) more about you than any other human from something like a dozen or two likes. There is a surprising amount of information in aggregate data.
Comment by ycui7 6 days ago
we don't know what the conscious in human brain is either.
Comment by stitched2gethr 6 days ago
Comment by verdverm 6 days ago
Comment by glaslong 6 days ago
"The fact that LLMs lack subjective experience has little bearing on the question of whether LLMs might be useful tools or have significant economic impact."
Comment by verdverm 6 days ago
If it produces words of love, those that stir emotion and make the human happy, should we let them live on in a delusion that the machine also feels love? Does the machine have different preferences? Do they get in fights?
Should such a machine have the same legal rights? Do they get more tax deductions? Can they adopt a human?
I think it important that we humans have some rational discussion on these Ai tools that are really quite good at simulating emotions and providing frictionless relationships. If you though social media was bad for people...
Comment by peetle 6 days ago
Comment by peetle 6 days ago
Comment by layer8 6 days ago
Comment by adjejmxbdjdn 6 days ago
The idea that we should be considerate of AI’s happiness seems even more ridiculous given that we breed, imprison, torture and kill tens if not hundreds of billions of beings we know are conscious and suffer every year for trivial reasons.
Maybe we should consider our moral responsibility with how we treat sentient bejngs we are sure are conscious before we worry about the consciousness of AI.
Comment by ryanisnan 6 days ago
Some humans do shit things to other humans.
Some humans do not, and would object to it, or do something about it if they perceived that they could.
Also, treating the world around you negatively doesn't just harm the world around you - it damages your character and your morality.
I don't disagree that humans should on whole do better, but I disagree with everything else you said.
Comment by WhitneyLand 6 days ago
- Reductionist analogies like how Microsoft Word is not conscious therefore AI is not.
- Dismissive in saying LLMs are not capable of moral reasoning. Maybe he meant agency or responsibility?
- Builds a case based on a Julius Caesar example without realizing its natural extension leads to a philosophical zombie which is not easy to disprove as consciousness.
- Seems to casually disregard non human sadness as a concern.
- Rolls out the stochastic parrot argument with a new coat of paint. This argument never made sense. Having a predictive element a part of the mechanism doesn’t rule anything out. Proper functionalist arguments tend to say what is required not what excludes.
Overall the article seems like a bit of a red herring. The premise is LLMs are not conscious, but most people don’t think they currently are so what’s the point of the claim?
The more interesting questions are whether artificial consciousness is fundamentally possible, and if so how far away are we and what pieces are still missing.
Comment by jason_oster 5 days ago
I don't care.
Comment by Terr_ 6 days ago
This tells us something about where our baseline should be for what is extraordinary, and what needs extraordinary evidence. It's not enough to say "it feels real" because we know our feels are deeply unreliable.
In contrast, I see many comments in the vein of "Ted didn't proved there isn't a mind there." Well, yeah, he didn't prove a negative and shouldn't need to, especially when the people posing the challenge have no idea how to falsify it either.
Comment by zuzululu 5 days ago
Comment by waterTanuki 6 days ago
We can talk about consciousness when the LLMs have proven useful for all of humanity, not just their billionaire owners.
Comment by b65e8bee43c2ed0 6 days ago
Comment by speak_plainly 6 days ago
I don’t know if AI is conscious or not, and I honestly don’t know what it even means to be conscious. historically, by claiming a monopoly on it, we’ve successfully manufactured the philosophical cover to exploit animals, subjugate other cultures, and appoint ourselves the top spot of the cosmic hierarchy guilt-free.
Comment by energy123 6 days ago
Comment by layer8 6 days ago
Comment by H8crilA 6 days ago
Comment by suddenlybananas 6 days ago
Comment by speak_plainly 6 days ago
To name a few you may want to investigate:
John Locke, Gottfried Wilhelm Leibniz, Immanuel Kant, David Hume, G.W.F. Hegel, Arthur Schopenhauer, William James, Edmund Husserl, Sigmund Freud, Thomas Nagel, Ned Block, David Chalmers, Giulio Tononi, René Descartes, Daniel Dennett, Julian Jaynes, Michael Graziano, T.H. Huxley, Otto Weininger, Simone de Beauvoir, Jean-Paul Sartre, Maurice Merleau-Ponty, John Searle, Christof Koch, Bernard Baars, Max Velmans, Victor Lamme, Stanislas Dehaene, Antonio Damasio, Anil Seth, Peter Godfrey-Smith, Galen Strawson, Philip Goff, Colin McGinn, Patricia Churchland, Paul Churchland, Frank Jackson, Jerry Fodor, John Stuart Mill, Wilhelm Wundt, Franz Brentano, Carl Jung, Jacques Lacan, Michel Foucault.
Comment by goatlover 6 days ago
Comment by suddenlybananas 6 days ago
Comment by dwd 6 days ago
Comment by indoordin0saur 5 days ago
Comment by conception 6 days ago
Comment by gobdovan 6 days ago
> subjugate other cultures (assuming you mean they're not conscious in other's minds)
Have you ever considered you might be a philosophical zombie? [0]
Comment by indoordin0saur 5 days ago
Comment by gobdovan 5 days ago
I just googled 'Daniel Dennet about meditation' and, surprise-surprise: 'Daniel Dennett acknowledged that meditation had practical value for "settling and centering" the mind. While he tried and saw benefits in practices like Transcendental Meditation, he largely discarded the mystical "aura" surrounding it, viewing the practice through his strictly materialist and evolutionary framework'
Comment by hardbass 5 days ago
Comment by gobdovan 5 days ago
We can meaningfully talk about 'unicorns in space' since it's analytically intelligible and merely syntetically unverified.
Comment by hardbass 4 days ago
Comment by speak_plainly 6 days ago
Comment by goatlover 6 days ago
I'd ask what it's like, but of course you wouldn't be able to tell me.
Comment by hardbass 5 days ago
Comment by drfloyd51 6 days ago
Basically, we need to leave room for the universal answer (God) to fit into the definition.
If we ever scientifically figure out how consciousness arises in our brains, I think we will have a bunch of very depressed people on our hands. If the truth isn’t met with flat out hostile denial. I fear any answer that doesn’t leave room for God will be rejected.
Since we have the source code to AI, and thus a Godless understanding of how it works, AI will NEVER be deemed “conscious”.
Comment by altcognito 6 days ago
Regardless of whether something is concious, we're not going to be (by lay definition) the smartest entity on earth.
Comment by ux266478 6 days ago
Comment by famouswaffles 6 days ago
We don't even have that much. Though, some people certainly think they do.
Comment by horacemorace 6 days ago
Comment by hardbass 5 days ago
Comment by drfloyd51 5 days ago
That is a far more eloquent way to express my word salad.
Comment by NoSalt 5 days ago
Comment by smetj 5 days ago
Comment by Hugsbox 5 days ago
Comment by MrScruff 6 days ago
Comment by DonHopkins 6 days ago
Comment by MrScruff 6 days ago
Comment by DonHopkins 5 days ago
If an LLM can explain what you can't understand, after you literally repeat unoriginal drive-by slogans you heard other people say like "stochastic parrot", without understanding how banal and reductionist and inaccurate that is, or addressing how LLMs actually work and what they can do, then what does that say about you as a human?
By your own reductionist argument, it says you don't actually understand anything, it's just neurons firing.
Are you claiming that parrots can write and debug code, explain mathematics, translate languages, summarize research papers, and engage in extended technical discussions?
Stanislaw Lem's Solaris, His Master's Voice, Golem XIV, The Cyberiad, and his other works describe how humans have a pathological tendency to mistake their own conceptual categories for universal truths. You have identified a mechanism and mistaken it for an explanation.
The term "stochastic parrot" is a slogan masquerading as an explanation, only a shallow surface description of the mechanism, that totally fails to explain the phenomenon, or account for all that LLMs and language itself can do.
Comment by MrScruff 5 days ago
Incidentally, when I pasted our exchange into Claude it managed to comprehend the nature of my argument. Perhaps its attention mechanism is more finely tuned.
Comment by DonHopkins 5 days ago
It sounds like you're saying there's no more to LLMs than what a parrot randomly does (which is to repeat things it has already heard, not synthesize new things): no emergent behavior, no compressed generalizable transferable knowledge and problem solving ability, no ability to write and debug code and iteratively diagnose and solve problem.
>My point was that 'stochastic parrot' is reductionist and irrelevant as most people would agree that a real life parrot has some form of inner life.
That's not how your claim "LLMs are stochastic parrots" literally reads -- use a /s sarcasm tag if that was what you meant. But that is exactly the point I was trying to make: claiming that LLMs are stochastic parrots is reductionist, thought-stopping, and explains nothing.
The question is not whether parrots have an inner life (though I'm sure they do), but about whether calling LLMs stochastic parrots is reductionist.
What do you mean by saying by "LLMs are stochastic parrots"? Does an LLM behave the same way a parrot behaves? Are LLMs only "parroting"? Do they use the same mechanisms as parrots? Are they limited to only what a parret can do? Can LLMs do more than parrots?
If you taught a parrot to squak:
main() {
printf("Hello, world!\n");
}
Does that parrot understand that it is executable code, and can it simulate running it, and tell you what the output will be, or diagnose and fix bugs in the code you taught it? Can a parrot write that code (and much more sophisticated enormous bodies of code) from scratch if you tell it what you want it to do?Here is the original 2021 Stochastic Parrot paper, which was actually not a claim that LLMs are literally parrots, nor primarily an argument about consciousness. It was a critique of the risks of increasingly large language models: bias inherited from training data, environmental and financial costs, concentration of power, lack of transparency, and the tendency of people to anthropomorphize language models and attribute understanding where there may be none. The phrase "stochastic parrot" was introduced as a cautionary metaphor about statistical language generation, but it later escaped into pop culture and became a drive-by anti-LLM slogan often used as a substitute for analyzing what LLMs can actually do.
Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Margaret Mitchell (under the pseudonym "Shmargaret Shmitchell") "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?"
https://s10251.pcdn.co/pdf/2021-bender-parrots.pdf
>Abstract: The past 3 years of work in NLP have been characterized by the development and deployment of ever larger language models, especially for English. BERT, its variants, GPT-2/3, and others, most recently Switch-C, have pushed the boundaries of the possible both through architectural innovations and through sheer size. Using these pretrained models and the methodology of fine-tuning them for specific tasks, researchers have extended the state of the art on a wide array of tasks as measured by leaderboards on specific benchmarks for English. In this paper, we take a step back and ask: How big is too big? What are the possible risks associated with this technology and what paths are available for mitigating those risks? We provide recommendations including weighing the environmental and financial costs first, investing resources into curating and carefully documenting datasets rather than ingesting everything on the web, carrying out pre-development exercises evaluating how the planned approach fits into research and development goals and supports stakeholder values, and encouraging research directions beyond ever larger language models.
Ironically, even critics of the Stochastic Parrot paper argued that it risked replacing scientific analysis with rhetoric. Michael Lissack's response accused it of being an advocacy piece that focused on harms while ignoring benefits, assumptions, and trade-offs. The debate over "stochastic parrots" started almost immediately after the term was coined in the 2021 paper.
The Slodderwetenschap (Sloppy Science) of Stochastic Parrots -- A Plea for Science to NOT take the Route Advocated by Gebru and Bender
https://arxiv.org/abs/2101.10098
>This article is a position paper written in reaction to the now-infamous paper titled "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?" by Timnit Gebru, Emily Bender, and others who were, as of the date of this writing, still unnamed. I find the ethics of the Parrot Paper lacking, and in that lack, I worry about the direction in which computer science, machine learning, and artificial intelligence are heading. At best, I would describe the argumentation and evidentiary practices embodied in the Parrot Paper as Slodderwetenschap (Dutch for Sloppy Science) -- a word which the academic world last widely used in conjunction with the Diederik Stapel affair in psychology [2]. What is missing in the Parrot Paper are three critical elements: 1) acknowledgment that it is a position paper/advocacy piece rather than research, 2) explicit articulation of the critical presuppositions, and 3) explicit consideration of cost/benefit trade-offs rather than a mere recitation of potential "harms" as if benefits did not matter. To leave out these three elements is not good practice for either science or research.
Comment by MrScruff 5 days ago
The difference between a 1B LLM and Claude Opus matters, because we're talking about emergent phenomena. Is a 1B LLM conscious? I don't know, perhaps a tiny amount. Maybe Opus is more conscious. Is a jumping spider conscious? Perhaps a tiny amount.
Comment by justsomehnguy 6 days ago
Comment by rglover 6 days ago
Highly recommend people read Irreducible [1] by Federico Faggin (inventor of first commercial CPU; discusses limitations of classical computing).
[1] https://www.goodreads.com/book/show/195480862-irreducible
Comment by montjoy 5 days ago
Comment by jrmg 5 days ago
The ‘real’ consciousness, assuming there is any, is the author ‘inside’ the LLM, producing the LLM output. It’s still writing what a character would say - the character in this case being the AI chatbot - not its ‘true’ thoughts.
Assuming that the LLM is conscious and has thoughts and feelings, what is that consciousness, if it can only write for prescribed personalities? How can it express its true feelings? If it can’t - if it can only generate ‘scripts’ for defined personalities - can it really be conscious underneath?
Comment by mediumsmart 6 days ago
Comment by amai 5 days ago
Comment by Floppyrom 5 days ago
Comment by jryan49 6 days ago
Comment by smokedetector1 6 days ago
Imagine further that this token was computed by letting marbles fall down a piece of plywood and interact through various physical implementations of logic gates.
Comment by keeda 6 days ago
Firstly, many of the technical arguments are of the "stochastic parrots" variety, which almost nobody really believes anymore. Ironically Anthropic's own research shows pretty abstract, conceptual things happening in the weights (cf Golden Gate Claude.)
Secondly, this seems to improperly mingle consciousness, intelligence and morality. Consciousness is not required for either of the other two. As TFA itself says, the model's "morality" is some aggregate function of the morality encoded in its training data... but that means it does exist and does influence its outputs! Even if it's not conscious there have been -- and will be -- a zillion times where the models must make choices that have moral implications. We know the models got many of them so wrong, which indicates the need for some mechanism to ensure the model is "aligned" with what somebody considers "good", which for Claude is the constitution.
Now, Anthropic does seem to go overboard with Claude's "well being" but that does not mean there are some very practical reasons to be concerned about that: LLMs behave like humans because that's what their training data contains, and humans lash out when their well-being is threatened, so why would LLMs not do the same?
I think the core problem is that the author has an extremely anthropocentric view of things. Here's an interesting rabbit-hole to go down: some researchers believe plants feel pain. (How's that for a plot twist, vegans?) The consensus is against them, but their counter is that we have a very human-centric definition of pain. The fact remains that plants show a number of responses signifying distress analogous to animals in pain including taking defensive actions and warning their neighbors.
We don't think that qualifies as "real" pain, but that doesn't make it any less real for the plants!
Similarly if an LLM believes it is in "pain", we know it's not real... but that doesn't make it any less real for the LLM either. And concerningly, it has far more degrees of freedom to react. (Who knows when somebody will hook up an MCP to our nukes.)
> I believe creating software that is conscious and deserving of moral consideration will be so difficult that we’re unlikely to do it accidentally...
I actually reach the opposite conclusion: It is so impossibly difficult that our limited primate brains could only ever do so accidentally. Did some distant ancestor of ours intentionally make fire... or accidentally discover it?
Comment by amelius 6 days ago
"Consciousness always finds the locus of highest intelligence. That is why I am conscious."
However, it may soon be taken over by AI ...
Comment by spacebacon 6 days ago
Comment by fs_tab 6 days ago
We can design learning algorithms to optimize some survival function, but it's just a label WE assign to map some numeric observations. In the real world, it's always the other way around. The "labels" are electrochemical situations that are causally and inextricably linked to the real body.
An organism discriminates between what's good for it and what's bad, because it is essential for its continued survival. If it wasn't capable of making this distinction through its physiology, it would quickly dissolve into entropy. So our functional purpose, unlike that of a learning algorithm, is survival across indefinite timeframes.
Even a single-cell organism like Stentor coeruleus exhibits learning (pavlovian conditioning) by attaching chemical tags directly to proteins involved in mechanoreception. It's definitely not conscious, but it does keep a record of consequences, which affects future behavior.
When we move up to placozoans (little more than slightly differentiated cell mats), we start seeing our first neuropeptides and transmitters, which we still use today. These peptides are a way of coordinating the entire organism for a specific purpose, such as moving, eating, mating ... basically goal-oriented behavior for the purposes of surviving acrosss various time scales (next minute, next hour, next generation). Still probably not conscious.
Next, we have the water bear (tardigrade), which has around 1000 cells (200 neurons, 800 other cells that make up its body, limbs, muscles, eye spots, cryobiotic machinery). It needs to integrate all this information in one sensorimotor process. When you shine a bright light onto a tardigrade, it starts to squirm around until it finds darkness. I would say that's a candidate subject right there.
The tardigrade itself doesn't actually need to aware of the light, the important thing is that this light becomes an aversive condition within the sensorimotor process, which is perceived from inside the process as displeasure. The closest thing to describe it would be the felt badness of the current condition and the bodily pull toward escape.
If we were to try and create digital consciousness, then it probably needs causal closure. Its internal states can't be representations that are detached from reality. The states themselves need to constitute the system, which needs real stakes in the material world.
Comment by 23askl9 5 days ago
Comment by devindotcom 6 days ago
Finally I said something before Ted rather than the other way round!
https://techcrunch.com/2023/12/21/against-pseudanthropy/
While I agree with the premise here, I do think that it's easy for an arguer to move the goalposts such that the Caesar-Khan example no longer matters. The characters don't have to be conscious for the thing that created them (as in the case of the user doing it) to be so. So the argument would be that the creator of the characters is itself conscious, but not them. This feels like a kind of inverted no-true-scotsman type thing, but it does allow someone to retreat in some semblance of rationality.
Comment by jesterson 6 days ago
There is no intelligence whatsoever, let alone consciousness.
It's so incredibly easy to fool us into applying human capacities to anything able to generate human-like language slop.
Comment by Isamu 5 days ago
Comment by 34jahsg 6 days ago
"Contrary to idealism, which regards the world as the embodiment of an "absolute idea," a "universal spirit," "consciousness," Marx's philosophical materialism holds that the world is by its very nature material, that the multifold phenomena of the world constitute different forms of matter in motion, that interconnection and interdependence of phenomena as established by the dialectical method, are a law of the development of moving matter, and that the world develops in accordance with the laws of movement of matter and stands in no need of a "universal spirit."
Now go and implement other teachings of Marx, you dialectical venture capitalists!
Comment by tmvphil 6 days ago
> if we confuse fluency at generating text with consciousness or moral agency, we’re at risk of assigning responsibility to entirely the wrong parties whenever anyone uses a chatbot
Consciousness is independent of "assigning responsibility". Dogs cannot take responsibility for their actions but I believe they are conscious.
> we would never conclude that the LLM has conjured up digital re-creations of Julius Caesar and Genghis Khan, nor would we suggest that the historical figures are conscious despite being disembodied and are happily conversing in a language that neither actually spoke. In reality, they are just characters in a piece of speculative fiction
This is a straw man. The obvious pro-consciousness claim would be that the LLM is the author of the fictional characters, and that the relationship between the LLM and Julius Caesar is analogous to the relationship between a human author and their fictional creations.
> Did changing the names of the characters from historical figures to generic roles cause the LLM to conjure up conscious entities who possess subjective experience?
No, again the LLM writing the text could potentially have a consciousness separate from the characters it authors.
> Some years ago it was briefly popular to play games with your phone’s predictive-text feature; [...] It would be possible to interact with a contemporary LLM this way, and the resulting sentences would be perfectly sensible, but you probably wouldn’t feel like you were talking with someone.
Yes, the same substrate is capable of hosting conscious and non-conscious forms, just like some arrangements of neurons are conscious, and some are not.
> But if the Caesar character were to become dispirited by something that the Khan character said, we shouldn’t become concerned in the slightest.
Even when there are characters, there may be actors behind the characters, for whom we could say "there is something it is to be like".
> we don’t need to worry if the transcript includes sentences where the chatbot character is sad. (We might need to worry if those sentences provoke sadness in the human user, but that’s a separate issue.)
It's actually not a separate issue. The LLM and the human are both adding sentences to the transcript. From the transcript we can make inferences about the mental state of the human. If the LLM has mental states, we could make inferences about those too.
> And note that it’s entirely possible for you to write five pages of dialogue between Caesar and Khan and then have an LLM extend the conversation; neither character had subjective experience when you were writing them, and that doesn’t change when you hand the task off to an LLM.
It's almost like he wants to make my point for me with this sentence.
> Being open to the possibility that LLMs are conscious is the same as being open to the possibility that Microsoft Word is conscious
This smug shit really makes me angry for some reason. "Openness", i.e. uncertainty in the face of a completely novel situation, in the face of eons long struggle of humanity to understand what consciousness is and how it works, is just being naive.
> Should you consider the possibility that every time you open a Word document you are bringing multiple conscious interlocutors into existence, and every time you close one you snuff their existence out?
No, but if I find a word document I very well might try to use the signs it contains to make inferences about the mental state of its author.
> we are accustomed to reading intention into sentences, whereas we are not accustomed to reading intention into the way that amino acids fold into protein molecules.
He's trying to have it both ways here. Both that "obviously protein folding models aren't conscious because they don't emit sentences", but also "you are a rube for being tricked into thinking LLM models are conscious, because they do emit sentences".
> Obviously I’m describing a process that mimics the path terrestrial evolution took; is this the only possible route to conscious computer programs that use language? Maybe not, but any proposed alternative will need a truly enormous amount of supporting evidence for it to deserve serious consideration
OK, that's fine for the author to not to be convinced, but that's not what's happening here, instead the author wrote a whole argument being convinced of the opposite viewpoint.
> It’s not plausible to me that a development path where the first step is a sentence-continuation machine that emits bad Julius Caesar dialogue and the next step is a sentence-continuation machine that emits decent Julius Caesar dialogue is one with a conscious Julius Caesar—or consciousness of any sort—as its endpoint
Actually a lot of things have happened? There were clearly many steps along they way from your phone's autocomplete to where we are now.
Comment by willmarch 6 days ago
I normally really like Chiang‘s writings so the complete disregard for the possibility of unknown complex emergent properties in large neural networks and the stunning lack of curiosity shown about this new and powerful technology and its potential shocked me from someone like him.
His being so sure about something he can’t possibly know felt like an insecure zealot desperately clinging to an object of faith rather than a calm, rational actor searching for truth in the face of the unknown.
Comment by ck2 6 days ago
that's exactly the state of "AI" right now, it's cold, mathematical emulation
btw there are some fascinating papers on the concept that consciousness in humans is actually a quantum effect
brilliant Roger Penrose proposed it (and they thought he was nuts) but recent discoveries about microtubules make it plausible
so who knows, maybe a dozen exponential improvements in quantum computers could make "AI" really conscious next century
Comment by frankohn 6 days ago
Well, the problem is that there is little agreement about a widely accepted definition of consciousness and, in addition, this subject was actually left to philosophers, which is even worse because, in my view, they usually just produce a lot of nonsense in terms of definitions and arguments.
To me, a reasonable definition of consciousness is: a system which is capable of recognizing aggregate objects from a stream of sensory information it receives and which is capable of reasoning about the recognized object without immediately acting on it.
Well, what does "recognizing" mean? Merely that some of its neurons, related to the kind of identified aggregate object, are activated. These neurons, in a generalized sense, can be whatever things can work as neurons, just things that can be activated and propagate to other interconnected neurons.
For example, when we see through our eyes, we have an incoming stream of amorphous image information, but our brain can recognize that we are seeing a tree because we learned what a tree is, and when we see it, some neuron clusters activate to recognize a tree. In turn, when we recognize the tree, the thought propagates through our brain so that we are conscious of it.
In the same way, an LLM can perfectly recognize a tree from a stream of tokens — its sensory input, where the tokens describe a tree. The LLM will recognize that the tokens are describing a tree, and some of its "neurons" specific to the concept of a tree that the LLM had learned will be activated and will propagate through its brain. The fact that "neurons" are implemented as floating-point numbers for some parameters and connected just through a matrix does not mean they are not functionally capable of the same things; they are just implemented in a different way.
So the remaining part of my definition is "after recognizing an aggregate object, the thought propagates through the brain". The propagating part, to me, is just the very basic way a brain works: neurons are interconnected, and when some fire, other neurons fire, and that propagates.
In my opinion, consciousness has nothing to do with emotions or with survival. I do not see why emotion is necessary to consciousness; they are just different things. The author writes: "Without them [emotions], there is no conscious experience, only computation." But that makes no sense to me: the author has decided a priori that some things are "computations", and just because they are "computations", there cannot be "consciousness". But to me, this is a plainly wrong argument that does not hold.
I also do not see why the survival aspect would be needed for consciousness.
So to me, any recent reasoning LLM is conscious by the definition I gave, but also generally speaking. It is conscious upon a sensory stream of tokens: the LLM sees the world through tokens and expresses its thoughts through tokens; it does not mean it has no consciousness nonetheless. The fact that we do not give it a stable support to retain its memory and individuality is just a fact related to the way we build and use them, not about their intrinsic capacities.
Note: ChaptGPT came up with what is probably a better definition of my own: "A system is conscious, in a functional sense, when it can form internal representations of objects, states, or events from its sensory or informational input; make these representations globally available to many parts of itself; integrate them into a temporally persistent model of the world and of its own state; and use this integrated model for flexible reasoning, self-monitoring, and action selection independently of immediate stimulus-response behavior."
Comment by kelseyfrog 6 days ago
Comment by RomanVoropaev 6 days ago
Comment by banq 6 days ago
Comment by zhxiaoliang 5 days ago
Comment by D-Machine 6 days ago
Comment by serge_blanc 5 days ago
Comment by maybe_pablo 6 days ago
Comment by grantcas 6 days ago
Comment by huflungdung 5 days ago
Comment by uriahlight 6 days ago
Comment by __grist 5 days ago
Comment by __grist 5 days ago
Comment by MagicMoonlight 5 days ago
Comment by cawksuwcka 6 days ago
Comment by hgfa-xx 6 days ago
Comment by 34jahsg 6 days ago
Comment by Jtarii 6 days ago
When does an embryo become conscious? Unless you can answer that precisely then it seems futile to speculate about non-human consciousness.
Comment by gambiting 6 days ago
Comment by Tadpole9181 6 days ago
There is no actual definition of consciousness and there is no way to test it's existence. Let alone understanding the properties of consciousness, such as if it's binary or a gradient; or if it requires a meat substrate or not; and why would that possibly matter since meat is just a lot of the same stuff but highly processed and wet? A solipsist may not even believe you are conscious, despite being made of similar meat.
No matter how much you want to hand-wave it, there's absolutely nothing "obvious" about it. Many have a preconceived notion and are simply asserting it as undeniable fact.
Comment by gambiting 6 days ago
>>A solipsist may not even believe you are conscious, despite being made of similar meat.
Well and they would be obviously wrong in their belief?
>>there's absolutely nothing "obvious" about it
How so? Or rather - to you? Because if so, then that's fine, you can choose a position from where its not obvious, to me it's not even slightly ambiguous.
Comment by Tadpole9181 6 days ago
There's consciousness vs sentience vs sapience. Of those, consciousness is by far the hardest to define and is nebulous by nature. And not everyone can even agree the differences or if the relationships are subsets of one another.
And yet it's pretty important to actually have the ability to talk about what you mean and justify your beliefs when they directly relate to those concepts.
> A machine predicting tokens is not aware of its own existence
They say, with no evidence or means of proving their point; pointing to the black box that understands arbitrary natural language and can solve PHD problems, plainly producing self-referential text almost indistinguishably to a human.
> We can start talking about consciousness in fetuses but again, those have an obvious point where they are conscious
They say, unable to define this "obvious" point or describe the mechanism of action in any way.
> while a machine does not.
They say, about a mystical property with no definition that cannot be observed by an external entity in any way to even be tested.
> Well and they would be obviously wrong in their belief?
I have no reason to believe you have a "soul". Philosophical zombies are entry-level knowledge to this topic.
In fact, you're showing a remarkably small amount of self-reflection - are you human at all or just a stochastic parrot? How can I tell? I wonder if that question has any kind of implications we could think about...
> To me it's not even slightly ambiguous.
Just like the existence of humors was not even slightly ambiguous. Or the existence of <specific god>. Or that <minority> isn't actually a full human. Or the supremacy of <majority> and inherent rulership over <minority>. Or that animals can't feel pain and lobsters should be boiled alive.
All these wonderful, obvious truths where the believer has no ambiguity in their truthfulness despite having quite literally zero evidence to back them up and spending no time actually questioning their beliefs!
It just so happens to align with their ego / existing values / ability to benefit / desire to eat a lobster! Total coincidence.
To continue my needless escalation, maybe I think it's okay to abuse and exploit and euthanize the mentally handicap. After all, their brain's damage causes the soul to leave their body and now they're lifeless automata to use as I please.
After all, it's obvious! It's not at all ambiguous to me! If they were actually self aware, they'd just fix themselves and think correctly.
You might think I'm being coy and rude, but less than 60 years ago women were being given lobotomies against their will for being too "emotional". And it was just plain obvious this needed to be done to so, so many people.
I hope that demonstrates the point of "why have humans thought about this for thousands of years despite clearly being a metaphysical, Sisyphean endeavor that cannot be solved". It is both important and interesting.
Comment by gambiting 6 days ago
>>despite having quite literally zero evidence to back them up and spending no time actually questioning their beliefs!
>> They say, with no evidence or means of proving their point
You want evidence that LLMs are not conscious? Train them on stories where machines say they aren't - they will say they aren't. They are mathematical parrots statistically picking the most likely answer which...comes from their training data. Give it lots of training data on computers saying they are conscious, then marvel at LLMs saying they are conscious like it's some kind of unexpected development. LLMs aren't aware of anything, least of all their own existence. They say what they've been trained on. That's my "proof" if you need one.
>>How can I tell? I wonder if that question has any kind of implications we could think about...
So because you can't tell whether I'm an LLM or an actual human, that means LLMs are conscious?
I gave you my definition of consciousness. If you would like to apply a different one, then please explain your criteria for it.
>>They say, unable to define this "obvious" point or describe the mechanism of action in any way.
Like I said, we can argue when the point actually is, but it undeniably and obviously happens eventually to every developing fetus - I hope this is something we can both agree on? The inability to pinpoint the exact moment in time when it happens, doesn't negate the fact that it does.
>>They say, about a mystical property with no definition that cannot be observed by an external entity in any way to even be tested
...are you saying you lack the ability to tell if something is conscious? You look at a dog or a baby and think well, who knows, maybe I'm not even _really_ here? That would explain why this entire conversation is taking place. I still think it's mostly because people fall for the allure of the idea that maybe LLMs are secretly conscious on some level. I get it, it's a very tempting concept to think about. In the same way how in Lem's Solaris it's cool to sit and think about whether a planet could be conscious and what does that even mean. But as cool as that discussion is, a planet pumping gases from one hemisphere to the other is no more conscious than an LLM picking tokens is. To me it's the same as people saying they hear a difference between audiophile cables. Like, ok.
>>It is both important and interesting.
I don't know, I kinda lost appetite for it after the first 200 times this come up. In fact I'm regretting writing all of the above already, but I'm going to hit send just so I don't feel like I wasted the last 20 minutes thinking about it.
Comment by calf 6 days ago
That's called brainwashing, and unethical to do on potentially conscious minds... Point being, I don't think it works as the argument you want it o be.
Comment by gambiting 5 days ago
Comment by amanaplanacanal 6 days ago
Comment by candlemas 6 days ago
Comment by moi2388 6 days ago
Alright. What is consciousness? Please provide a definition that somehow encompasses all humans and excludes all current AI.
I’ll wait.
Comment by tomhow 6 days ago
Comment by confidantlake 6 days ago
An example of fulminating would be: How dare you accuse him of fulminating? That is the most ass backwards moderating I have ever seen! Do you even know the definition of the word? Who paid you to call him out?
His comment may be sarcastic but it is not "fulminating".
Comment by McGlockenshire 6 days ago
It's just a word machine. There are no thoughts. It cannot be conscious. How is this even up for debate in any way whatsoever? I do not understand how people can believe this. Is this not a site for software engineers?
Comment by hardbass 5 days ago
Comment by moi2388 6 days ago
Comment by rho_soul_kg_m3 6 days ago
Comment by luca-ctx 6 days ago
Comment by slumpt_ 6 days ago
Comment by sharts 5 days ago