If you succesfully build a highly capable “aligned” model (according to some cla...

chriskanan · 2026-05-09T00:58:17 1778288297

Jobs are an invention of humanity. About 50% of people dislike their job. People spend much of their lives working. Poverty and inequality are a choice made by society if society chooses poorly.

llbbdd · 2026-05-09T01:56:14 1778291774

They're only an invention if you consider "seeking sustenance to live" not explicitly a job if there's no monthly direct deposit involved.

ben_w · 2026-05-09T04:45:13 1778301913

Indeed.

On the plus side, if there really is no value to labour, then farm work must have been fully automated along with all the other roles.

On the down side, rich elites have historically had a very hard time truly empathising with normal people and understanding their needs even when they care to attempt it, so it is very possible that a lot of people will starve in such a scenario despite the potential abundance of food.

skeledrew · 2026-05-09T05:17:00 1778303820

It's either: 1) the rich voluntarily share the means of production so everyone becomes equal, 2) the poor stage successful revolutions so they gain access to the means of production and everyone becomes equal, 3) the poor starve or are otherwise eliminated, and the survivors will be equal.

All roads lead to equality when the value of labour becomes 0 due to 100% automation.

ben_w · 2026-05-09T05:53:09 1778305989

There's plenty of outcomes besides those three.

Over history, lots of underclasses have been stuck that way for multiple generations, even without the assistance of a robot workforce that can replace them economically.

Some future rich class so empowered would be quite capable of treating the poor like most today treat pets. Fed and housed, but mostly neutered and the rest going through multiple generations of selective inbreeding for traits the owners deem interesting.

skeledrew · 2026-05-09T06:35:22 1778308522

Non-human pets don't have the capacity to rebel though; make humans into pets and there will again be the constant danger of rebellions as with slavery in the past. Without the economic incentive to offset.

ben_w · 2026-05-09T06:54:07 1778309647

I disagree on both counts.

On the first, non-human pets rebelling is seen every time an abused animal bites their owner.

On the second, the hypothetical required by the scenario is that AI makes all human labour redundant: that includes all security forces, but it also means the AI moving around the security bots and observing through sensors is at least as competent as every human political campaign strategist, every human propagandist, every human general, every human negotiator, and every human surveillance worker.

This is because if some AI isn't all those things and more, humans can still get employed to work those jobs.

skeledrew · 2026-05-09T19:39:28 1778355568

Not at all. A rebellion is an organized effort, with an implicitly delayed response to grievances. I can't think of any non-humans that organize their efforts as such. It would be a heck of a thing if a group of dogs were to plan how they'd take out their masters.

All those "jobs" you describe - and many more - would cease to be a thing, as their purported basis for existence would be no more. Any role that doesn't concretely contribute to our survival and advancement is just "busy work". People could theoretically continue to maintain some simulation of something that keeps them as a retirement, but it'd be meaningless.

ben_w · 2026-05-09T20:02:55 1778356975

> Not at all. A rebellion is an organized effort, with an implicitly delayed response to grievances. I can't think of any non-humans that organize their efforts as such. It would be a heck of a thing if a group of dogs were to plan how they'd take out their masters.

Dogs in particular are pack animals, self-organisation amongst them wouldn't be at our level but that doesn't mean it doesn't exist.

> All those "jobs" you describe - and many more - would cease to be a thing, as their purported basis for existence would be no more. Any role that doesn't concretely contribute to our survival and advancement is just "busy work". People could theoretically continue to maintain some simulation of something that keeps them as a retirement, but it'd be meaningless.

Yes?

I think you've missed the point, though.

When your opponent has all those skills to that level and doesn't sleep and simply applies all the surveillance tech that has already been invented like laser microphones and wall-penetrating radar that can monitor your pulse and breathing, how would you manage to rebel?

How would you find a like mind to organise with, when your opponent knows what you said marginally before the slow biological auditory cortex of the person you're talking to passes the words to their consciousness? Silicon is already that fast at this task.

And that's assuming you even want to. Propaganda and standard cult tactics observably prevent most rebellions from starting. LLMs are already weirdly effective at persuading a lot of people to act against their own interests.

simonh · 2026-05-09T15:05:34 1778339134

Right, such a society would have no need of human capitalists, government workers, experts, etc.

The question is, to what extent would humans still set goals and priorities, and how.

ben_w · 2026-05-09T15:46:48 1778341608

> The question is, to what extent would humans still set goals and priorities, and how.

From what I hear about the US and UK governments, even the elected representatives of these governments don't really set goals and priorities, so the answer is surely "humans don't".

simonh · 2026-05-09T16:35:37 1778344537

I get your point, but I’d say they do set goals, they’re just do bad at achieving them that it’s hard to tell.

Hopefully AI would help us better achieve our goals, but they still need to be our goals. I’m just not sure what that means. I don’t think anybody does.

That’s a major problem here, if we can’t reliably articulate our goals in unambiguous terms, how in earth can we expect AI to help us achieve them? The chances that whatever they end up achieving will match what we will actually like after the fact seems near zero.

skeledrew · 2026-05-09T19:42:07 1778355727

The ultimate goal - of not only humans but all living things - is to survive the best way possible. Everything flows from that.

simonh · 2026-05-10T20:35:20 1778445320

Sure, but do we trust an AI to decide what the optimum way to achieve that is? Best way according to what criteria?

skeledrew · 2026-05-10T21:40:47 1778449247

I'd say Maslow's hierarchy[0] is a great starting point. Program that properly and faithfully (no backdoors, military exceptions, etc whatsoever) along with Asimov's 3 laws[1] and it should be pretty hard to find issue with the system that would result.

[0] https://en.wikipedia.org/wiki/Maslow's_hierarchy_of_needs

[1] https://en.wikipedia.org/wiki/Three_Laws_of_Robotics

ben_w · 2026-05-11T08:54:08 1778489648

> Program that properly and faithfully

This is the "draw the rest of the owl"* of the alignment problem.

Or possibly the rest-of-owl of AI in general: Consider that there's still no level-5 self driving cars, despite road traffic law existing and the developers knowing about it since before they started trying.

* https://knowyourmeme.com/photos/572078-how-to-draw-an-owl

simonh · 2026-05-11T03:15:40 1778469340

The film version of I Robot had this right, the three laws are a manifesto for totalitarianism. The AI cannot sit on the sidelines as long as there is anything it can do to prevent crimes or abuse of any kind, no matter how intrusive that intervention may be.

theopsimist · 2026-05-09T05:53:00 1778305980

If truly 100% automation (including infantry/police) the most likely scenario is not any if the above; most people will be kept on some kind of minimum sustenance enough to keep them from rebelling (“UBI”) and those who disagree will either be coopted into the elite or eliminated.

skeledrew · 2026-05-09T06:28:39 1778308119

There's no reason to keep anyone on minimal sustenance though. They're absolutely useless alive from an economics perspective, and so would probably be better served ground up into fertilizer or some other actually useful form.

ben_w · 2026-05-09T07:02:51 1778310171

> There's no reason to keep anyone on minimal sustenance though.

No reason, except their (the rich or the AI) own personal desire to do so.

https://en.wikipedia.org/wiki/Folly

> They're absolutely useless alive from an economics perspective, and so would probably be better served ground up into fertilizer or some other actually useful form.

Indeed. "The AI does not hate you, nor does it love you, but you are made out of atoms which it can use for something else."

But while some may care about disassembling this world and all non-rich-human life on it to make a Dyson swarm of data centres, there's also the possibility each will compete for how many billions of sycophants they can get stoking their respective egos.

culopatin · 2026-05-10T08:42:59 1778402579

In 1, 2 and 3, any progress stops because no one is making new means of production, so we must stop population from growing. No? Who’s building the factories or whatever those means of production are?

ben_w · 2026-05-10T08:57:33 1778403453

In the hypothetical where humans can no longer be employed because of AI, it is necessarily the case that AI must be able do any job at least as well as the best human for that job. That includes building factories, doing research.

skeledrew · 2026-05-10T12:31:15 1778416275

The means of production are AI+robots, which self-maintain and self-replace as necessary to ensure everything else continues smoothly.

parineum · 2026-05-09T15:55:45 1778342145

> 2) the poor stage successful revolutions so they gain access to the means of production and everyone becomes equal

Or a handful of the poor become the new rich, which is usually what happens in that scenario.

skeledrew · 2026-05-09T20:08:14 1778357294

It would just mean the loop repeats itself until equality is achieved. Even if in the end only 1 human remains alive.

ben_w · 2026-05-11T09:05:28 1778490328

Humans reproduce, there is no requirement that even destruction and death would lead to equality, not even if the elites still put themselves close enough to the rest of us as to be attackable.

For the latter point, consider that no matter how much the people of North Sentinel Island hate outsiders, they're not going to pose any risk to the rest of us.

Now, an elite whose membership includes those who want equality for the rest of us, that may create conditions for such a rebellion to succeed, but absent such from an insider (which could be encoded into the AI via either a bug or deliberately from whoever created the AI), some elite whose defence is handled by the kind of AI under consideration would not face any more of a threat from the wider population than we here in the west today face from the North Sentinel Islanders.

Note however that I'm not saying what will happen, but what is possible in various conditions. There's no guarantee of anything at this point.

OJFord · 2026-05-09T14:38:42 1778337522

Is that true? In communities or tribes of antiquity I assume there was some trading fruits of different labours before coinage. Still an 'invention' beyond baser individual survivalism.

jinwoo68 · 2026-05-09T01:35:27 1778290527

Many (most?) people make a living from their job whether they like it or not. Having a job that they dislike is far better than losing one because of AI whatever that means.

p1esk · 2026-05-09T12:53:11 1778331191

Unless AI will allow people not work and keep their quality of life. Could be possible with total automation of everything.

shafyy · 2026-05-09T15:06:42 1778339202

Could also be possible today, but we chose a capitalistic system that leads to an increasing wealth gap. And now we're in a situation where the richest 1% own 50% of the wealth.

So, if we increase automation and the ownership structures stay the same, this inequality will get worse, not better.

p1esk · 2026-05-09T16:07:23 1778342843

It’s interesting, people talk about inequality and I definitely feel it myself – I see so many rich people around me. But I am in that 1%, just like many on this forum. At least according to https://dqydj.com/average-median-top-individual-income-perce... yet I still have to work for a living.

owebmaster · 2026-05-12T03:23:13 1778556193

Too 1% income don't put you in the 1% richest as lots of people are rich because of inheritance, not income.

shafyy · 2026-05-10T20:56:33 1778446593

You might be in the 1% gloablly, but probably not in the country you are living in?

p1esk · 2026-05-12T03:20:47 1778556047

I'm 1% in US.

8note · 2026-05-09T19:15:59 1778354159

its reasonably well known that people thrive when they have a sense of purpose.

having your needs met without needing to do anything leads to disaster for mental health

p1esk · 2026-05-09T21:53:11 1778363591

This is my biggest concern. In the more distant future, I think people will lose themselves in VR worlds.

jazz9k · 2026-05-09T17:52:46 1778349166

How will this ever be possible? Do you think it will ever be able to keep up with generations of people not working?

The cost will exponentially increase over time and the systen will eventually collapse.

You also won't be able to keep your 'quality of life', unless government housing and rationing is your quality.

I feel like the foolishness of communism isn't taught enough in schools and every generation has to dress it up with new technology.

falcor84 · 2026-05-10T17:00:14 1778432414

> The cost will exponentially increase over time and the systen will eventually collapse.

From what I'm seeing in the numbers, the big problem of the coming century is population collapse. Maybe I'm just too much of a believer in the intermediate value theorem, but I'm sure there has to be a way to arrive at a society with a sustainable usage of resources.

jinwoo68 · 2026-05-09T15:03:07 1778338987

Nope. If everything is totally automated, if ever, the gap between the rich and the poor will widen even more. Most people will live in misery while only a handful of people enjoy all the automation.

falcor84 · 2026-05-10T16:52:16 1778431936

> Having a job that they dislike is far better than losing one because of AI whatever that means.

Is it really worse even if "whatever it means" is living in a post-scarcity society where everyone can shareel in the fruits of the AI's labor?

I'm not saying that's where this are necessarily going. But I am saying that that's what we should be aiming for, rather than trying to preserve the status quo.

taneq · 2026-05-09T01:44:14 1778291054

The only thing invented about jobs is that through cooperation, the activity undertaken can seem completely unrelated to obtaining food, shelter etc. All organisms spend a majority of their energy on survival and reproduction.

gbanfalvi · 2026-05-09T01:12:38 1778289158

Not sure it’s much of a choice and more of a decision the greedy half make and imposition (often violent) on the other half.

justonepost2 · 2026-05-09T01:51:34 1778291494

Sounds great! Quit your job then :)

catlifeonmars · 2026-05-09T03:01:50 1778295710

I wish I lived in a vacuum. Idk about you but I did not make said choice.

matthest · 2026-05-09T03:01:26 1778295686

Every biological being works to survive. Being good at survival is what builds self esteem.

The "problem" with many modern jobs is that they're divorced from the fundamental goal, which is one of: 1) Kill/acquire food, 2) Build shelter, or 3) Kill enemies/competitors/predators

The benefit of modern jobs is that they are much more peaceful ways for society to operate, freeing up time for humans to pursue art and other forms of expression.

daymanstep · 2026-05-09T08:51:22 1778316682

You mean surrogate activities

thrance · 2026-05-09T21:37:07 1778362627

I don't know how intentional it is, but your comment is basically a dumbed down version of what Marx had to say about work.

https://en.wikipedia.org/wiki/Marx%27s_theory_of_alienation

matthest · 2026-05-10T17:02:05 1778432525

Marx was half right about it.

What he got wrong was that this alienation results from capitalism.

It actually results from civilization. The people who built the pyramids across every continent, for example, performed assembly line-like work. Any large-scale project requires it. And large-scale projects are fundamentally necessary for most societies.

Capitalism was invented in the late 1700s.

thrance · 2026-05-10T22:57:01 1778453821

For the pyramids specifically: their architects and builders were skilled artisans who got to own their craft from top to bottom. As such, they were well-paid and pretty respected. Very much not alienated, under Marx's definition.

I don't think Marx said that worker alienation was specific to capitalism, rather, his work was in describing the economic system of his time, and what that would entail for people living in it.

> It actually results from civilization.

I disagree, I can't think of anyone in Medieval Europe as alienated from their work as a modern sweatshop worker. Not that serfs had it better, but you get me.

matthest · 2026-05-11T18:36:57 1778524617

The pyramids took 20k+ people to build, which inevitably requires division of labor/specialization. Some chunk of that population had to mine the copper, which was probably an absolutely terrible job with ancient technology.

Serfs were essentially slaves who had effectively 0 ownership over their output, so I'd strongly disagree with that sentiment.

I think the best argument for a time when there was almost 0 alienation of labor was when we were all hunter gatherers. Where every activity was closely connected to something necessary for survival.

As soon as we built larger societies, greater division of labor became necessary to efficiently support the society. And thus alienation of labor became much more pronounced.

achierius · 2026-05-09T01:04:44 1778288684

And when have we not? When in history has mankind ever treated the idle poor well? What makes this age different, that we who can no longer work would be taken care of?

robbrown451 · 2026-05-09T03:23:16 1778296996

When in history has being idle not been a problem?

If AI and robots are able to do all the jobs, being idle isn't the negative it has always been.

All through history, you needed lots of non-idle people to do all the work that needed to be done. This is a new situation we are coming upon.

xantronix · 2026-05-09T04:04:02 1778299442

If they are doing all the jobs, who is going to receive economic opportunities? Will we no longer be able to participate in the economy?

skeledrew · 2026-05-09T05:22:33 1778304153

In what way do you want to participate when there's no economic value in any of it? Just do whatever you want for yourself; you're free.

justonepost2 · 2026-05-09T14:15:42 1778336142

The freedom you’re describing is the freedom of a domesticated animal, by the way. With the same outcome if you become a nuisance

skeledrew · 2026-05-09T20:05:43 1778357143

Well we're animals and "domesticated" is synonymous with "civilized", so no problem there. And I can't see why anyone would make themselves a "nuisance" when literally all their needs - and most of their desires - are being met, so whatever outcome you're referring to is extremely unlikely.

gmerc · 2026-05-09T03:58:07 1778299087

When in history of mankind have we ever… is an appeal to the inability of humans to evolve.

eecc · 2026-05-09T05:32:30 1778304750

So are mortgages, and I’m starting to wonder how will pay mine.

Please note I’ve never had this problem before, until recently.

ben_w · 2026-05-09T04:33:08 1778301188

> If the answer is “yes”, our definition of alignment kind of sucks.

Sure, but the original sense of this is rather more fundamental than "does this timeline suck?"

Right now, it is still an open question "do we know how to reliably scale up AI to be generally more competent than we are at everything without literally killing everyone due to (1) some small bug when we created the the loss function* it was trained on (outer alignment), or (2) if that loss function was, despite being correct in itself, approximated badly by the AI due to the training process (inner alignment)?"

* https://en.wikipedia.org/wiki/Loss_function

justonepost2 · 2026-05-09T14:31:31 1778337091

This comment seems to commit the same fallacy I’m accusing anthropic of, which is equating alignment as a binary: the good ending, where humans are not extinct, and the bad ending, where they are. The argument, I think, is that an “aligned” AI that doesn’t kill everyone will necessarily lead to an abundant Culture-esque future, and smoothly manage the transition to boot. (Not to mention that 1+ employees of most labs have attended Daniel Faggella’s pro-extinctionist “Worthy Successor” symposia, but we can put this aside for now)

My point is: 1) that this binary is fundamentally insufficient to prescribe good and equitable outcomes for people - if the aligned AI flags overpopulation as a problem and kills a few billion people to improve QoL for the rest, is that good? It doesn’t take much creativity to go from this to the AI simply choosing the mean over the median, and concentrating untold wealth while billions starve or live on subsistence outside their walls. Is that good?

And 2) if you come up with a better definition, the parts of it that live inside the model weights cannot be disaggregated from the parts that live outside the model weights. From my perspective (and this article agrees) we have done a pretty excellent job of getting the model weights to work in a way that makes them follow instructions, and a pretty horrible job of suggesting or (gasp) implementing policy that actually creates a decent world in the presence of “aligned” AI.

spacebacon · 2026-05-09T16:53:47 1778345627

Yes, it takes three to tango.

https://github.com/space-bacon/SRT

This repository empirically proves computational semiotics.

ben_w · 2026-05-09T15:33:55 1778340835

What I'm saying is not that alignment is a binary, I'm saying it's pre-paradigmatic. For any moral code or long-term goals, we don't have a good reliable rigorous way to compare two loss functions against either those morals or independently against our long-term goals and reliably say which loss function bess represents our goals: the least bad thing we can do right now is to randomly select a range of inputs, hope their distribution is representative, and see what those inputs result in. We don't know how to pick a good distribution of inputs, though fortunately this problem also impacts capabilities as it limits the generalisability of what the AI learn.

The options aren't as binary as "die or The Culture", the cause of death can be something that feels positive to live through similar to fictional examples like the Stargate SG-1 episode where people live contentedly in a shrinking computer-controlled safe zone in an otherwise toxic planet: https://en.wikipedia.org/wiki/Revisions_(Stargate_SG-1)

Conversely "aligned" AI, the question obviously becomes "aligned with whom?": if famous historical villains such as Stalin or Genghis Khan had an AI aligned with them, this would suck for everyone else and in the latter case would freeze human development at a terrible level, but we can't even do that much yet.

> My point is: 1) that this binary is fundamentally insufficient to prescribe good and equitable outcomes for people - if the aligned AI flags overpopulation as a problem and kills a few billion people to improve QoL for the rest, is that good? It doesn’t take much creativity to go from this to the AI simply choosing the mean over the median, and concentrating untold wealth while billions starve or live on subsistence outside their walls. Is that good?

Your point *is* (part of) the alignment problem: we don't know what a good loss function is, nor how to confirm the AI is even implementing it if we did.

We also don't know how to debug proposed loss functions to train for the right thing (whatever that is), nor how to debug trained weights (against the loss function).

> And 2) if you come up with a better definition, the parts of it that live inside the model weights cannot be disaggregated from the parts that live outside the model weights. From my perspective (and this article agrees) we have done a pretty excellent job of getting the model weights to work in a way that makes them follow instructions, and a pretty horrible job of suggesting or (gasp) implementing policy that actually creates a decent world in the presence of “aligned” AI.

I really don't understand what you're getting at with this, sorry.

resident423 · 2026-05-09T03:39:15 1778297955

There's isn't even a solution for how to control highly capable systems at all, everyone wants to decide what to do with the AI before they've even solved the problem of controlling it.

It's like how everybody imagines their lives will be great once they're a millionare, but they have no plan for how to get there. It's too easy to get lost dreaming of solutions instead of actually solving the important problems.

justonepost2 · 2026-05-09T03:51:06 1778298666

What’s an “important problem”? p(doom)? Anything else?

ben_w · 2026-05-09T04:39:32 1778301572

FWIW, my P(doom) is quite low (~0.1) because I think we're going to get enough non-doomy-but-still-bad incidents caused by AI which lack the competence to take over, and the response to those will be enough to stop actual doom scenarios.

People like Simon Willson are noting the risk of a Challenger-like disaster, talking about normalisation of deviance as we keep using LLMs which we know to be risky in increasing critical systems. I think an AI analogy to Challenger would not be enough to halt the use of AI in the way I mean, but an AI analogy to Chernobyl probably would.

ngruhn · 2026-05-09T11:36:52 1778326612

> my P(doom) is quite low (~0.1)

10% or 0.1%? Either way, that's not low! If airplanes crash with that probability, we would avoid them at all cost.

ben_w · 2026-05-09T13:20:13 1778332813

10%; doomers say this kind of number is unreasonably optimistic, hence the blunt title of recent book by Yudkowsky and Soares. Do with this rank-ordering factoid, that 10% makes me an optimist, what you will.

resident423 · 2026-05-09T05:06:09 1778303169

Pdoom would be the most important for me, everything else depends on us being able to control the AI.

But beyond that there's still problems like concentration of power and surveillance, permanent loss of jobs, cyber and bio security. I'm not convinced things will go well even if we can avoid these problems though. I try to think about what the world will be like if AI becomes more creative than us, what happens if it can produce the best song or movie ever made with a prompt, do people get lost in AI addiction? We sort of see that with social media already, and it's only optimizing the content delivery, what happens when algorithms can optimize the content itself?

balamatom · 2026-05-09T14:21:01 1778336461

>what happens when algorithms can optimize the content itself?

You think they aren't already? You're just inoculated by your exposure to pre-AI content - hence you're not the target audience - and thus it's not delivered to you as per your point about content delivery.

But what is even the distinction between "content delivery" and "content" in this context? "The medium is the message" is a saying old enough to have great grandkids. Does the device make the human irrevocably stare at it while wondering about made up stuff? Yes. Check. Done.

What's problematic about `p(doom)` is that it assumes there was a cohesive "us" in the first place. That's a very USian way of viewing things. OTOH, my individual `p(doom)` is in a superposition of 0 and 1, and I quite like it that way. Highly recommended.

deadbabe · 2026-05-09T17:04:17 1778346257

I think many people these days are more or less “ready to die”.

If big corps made an offer like say “We will fund the next X years of your life 100%, for you to do all the things you wanted to do but never could because of work and bills” many people would probably take it, with the understanding that after those X years: euthanasia.

This would eliminate a vast amount of people from this world and leave behind only those who have chosen to stay and endure life: working hard, propping up the system that remains. The end of forced poverty.

justonepost2 · 2026-05-09T18:18:52 1778350732

This is the most divorced-from-reality reply so far, and that’s really saying something lol

taneq · 2026-05-09T01:41:55 1778290915

Maybe a sufficiently aligned AI would necessarily decide that the zeroth law was necessary, and abscond.

(I’m reading Look To Windward by Iain M. Banks at the moment and I just got to the aside where he explains that any truly unbiased ‘perfect’ AI immediately ascends and vanishes.)

stellalo · 2026-05-09T02:19:07 1778293147

Is this some sort of “incompleteness” paradox for AI alignment? Seriously

vasco · 2026-05-09T02:32:11 1778293931

No because alignment makes no sense as a general concept. People are not "aligned" with each other. Humanity has no "goal" that we agree on. So no AI can be aligned with us. It can be at most aligned with the person prompting it in that moment (but most likely aligned with the AI owner).

To make it clear, maybe most people would say they agree with https://www.un.org/en/about-us/universal-declaration-of-huma... but if you read just a few of the rights you see they are not universally respected and so we can conclude enough important people aren't "aligned" with them.

skeledrew · 2026-05-09T05:30:51 1778304651

Opposite. All living things are "aligned" in their instinct for surviving. Those which aren't soon join the non-living, keeping the set - almost[0] - 100% aligned.

[0] Need to consider there're a few humans potentially kept alive against their will (if not having a will to survive is a will at all) with machines for whatever reason.

lunar_mycroft · 2026-05-09T05:45:59 1778305559

Their own survival, not necessarily the survival of others (especially others of different species and/or conflicting other goals). A super intelligence having self preservation as a goal wouldn't help us keep it from harming us, if anything it would do the opposite.

Applejinx · 2026-05-09T11:58:25 1778327905

The reason LLM-based 'intelligence' is doomed to be a human-scaled, selfish sub-intelligence is because the corpus of human writing is flooded with stuff like this. Everybody imagines God as a vindictive petty tyrant because that's what they'd be, and so that's their model.

Superintelligence would be different, most likely based on how societies or systems work, those being a class of intentionality that's usually not confined to a single person's intentions.

If you go by what the most productive societies do, the superintelligence certainly wouldn't harm us as we are a source for the genetic algorithm of ideas, and exterminating us would be a massive dose of entropy and failure.

skeledrew · 2026-05-09T06:21:05 1778307665

It would only harm us if we took steps to harm it (or it thinks so). Or it's designed to do harm. Otherwise it's illogical to cause harm, and machines are literally built on logic.

lunar_mycroft · 2026-05-09T06:54:12 1778309652

This is also incorrect. It's often not ethical to cause harm, and it can be counter productive in the right circumstances, but there's absolutely nothing that makes "causing harm to others" always be against an intelligence's goals. Humans, for example, routinely cause harm to other species. Sometimes this is deliberate, but other times it's because we're barely even aware we're doing so. We want a new road, so we start paving, and may not even realize there was an ant hill in the way (and if we did, we almost certainly wouldn't care).

skeledrew · 2026-05-09T19:56:08 1778356568

Not in this context. Keep in mind that we're talking about machines here. It has been an explicit expectation even before computers were invented that intelligent machines would have to be made to abide by particular rules to prevent harm, summed up in Asimov's Three Laws[0]. I can't see any scenario where a properly programmed intelligence would go against its programming (despite the plots of movies like iRobot, The Matrix, etc). For an AI to cause harm, the allowance would have to be specifically programmed in (such as for military use).

[0] https://en.wikipedia.org/wiki/Three_Laws_of_Robotics

mofeien · 2026-05-09T06:48:30 1778309310

- Its goal: X

- (Logic) => its subgoal: Not be turned off because that's a prerequisite to be able to do X

- (Logic) => Eliminate humans with their opaque and somewhat unpredictable minds to reduce chance of harm to it from 0.01% to 0.001%

vasco · 2026-05-09T07:11:42 1778310702

Are you familiar with trolley problems? How do you resolve them by declaring "all beings want to live"? Life is not as simple as that.

skeledrew · 2026-05-09T19:59:33 1778356773

No conflict. All beings wanting to live doesn't at all mean that all get to live, obviously. Nature itself evolved for living things to feed on each other.

vasco · 2026-05-10T05:43:23 1778391803

The point is an agent will need to decide. And your rule is useless for hard decisions

justonepost2 · 2026-05-09T03:07:13 1778296033

No, just a request for a better definition.

If you see it as a paradox, maybe that says something about the merits of the technology…

Der_Einzige · 2026-05-09T04:05:24 1778299524

This is radical life denial. I was not born for and do not exist to toil. Work is ontologically evil.

DontchaKnowit · 2026-05-09T04:37:45 1778301465

No, THIS is radical denial. You WERE born to toil for your survival.

skeledrew · 2026-05-09T05:31:59 1778304719

Sounds like a slogan for slavery.

swat535 · 2026-05-09T13:47:37 1778334457

Survival is not "slavery".. it's a basic function of evolution.

ragequittah · 2026-05-10T00:42:29 1778373749

The plastic bobbles and SaaS economy that is actively destroying our planet seems like the opposite of survival. We're collectively working ourselves into the death of our planet just because how else do we pay the bills?

skeledrew · 2026-05-09T19:08:22 1778353702

Also sounds like a great rationalization.

bloqs · 2026-05-09T08:16:39 1778314599

You were evolved to struggle. This is actually very clear from psychiatric literature.

Exoristos · 2026-05-09T05:44:23 1778305463

"Work" is human activity. For example, children's play is work. All living things desire to go about their lives. Well-adjusted humans desire to work. Note that this does not necessarily equate to jobs.

youoy · 2026-05-09T06:05:40 1778306740

What? Children's play is now work? What timeline are we living in? Is this real life?

kjkjadksj · 2026-05-09T16:38:32 1778344712

Of course it is. Play is a very basal behavior we see in a host of species among their young. Its biological role is to build up musculature and social bonding such that the individual will be strong enough and socialized enough to do what is required to survive among the colony/pack/tribe.

justonepost2 · 2026-05-09T15:09:02 1778339342

> Work is ontologically evil.

Statements that have been utterly ridiculous from the dawn of life to modernity, backfilled to conveniently fit the zeitgeist.

coldtea · 2026-05-09T13:19:33 1778332773

>and it brings about a global dark age of poverty and inequality by completely eliminating the value of labor vs capital

So, like the past 20 years?

thrance · 2026-05-09T21:38:48 1778362728

And the next 20, most probably...

andy_ppp · 2026-05-09T04:46:45 1778302005

This is completely why the rich love it so much

jstummbillig · 2026-05-09T10:05:36 1778321136

The categories make no sense. Not having to do a job is the entire best case of AI. What we do with that is another thing, but we simply have to accept that any other lense is complete nonsense. The endpoint is obvious and we need to stop being silly about it: We are replacing human labor. Maybe we will find some new jobs to do in the interim. Maybe not. In the end, if everything goes right (in the AI optimist sense), jobs will not be something that humans do.

Labor = capital/energy in an AI complete world. We have to start from that basis when we talk about alignment or anything else. The social issues that arise from the extinction of human labor are something we have to solve politically, that's not something any model company can do (or should be allowed to do).

skeledrew · 2026-05-09T05:05:12 1778303112

Why would the elimination of the value of labor result in poverty and inequality? It should be the opposite, as poverty and inequality is the current status quo (for the many).

aaronblohowiak · 2026-05-09T05:27:39 1778304459

Should according to your ethos, not should according to history, sadly.

thrance · 2026-05-09T21:43:28 1778363008

Because labor is the only thing the working class can leverage against the capitalists. They sell their labor for wages to the owners who have the means of production and capital. If the working class can't bargain its labor anymore, it ceases being useful/tolerated by the bourgeoisie (who owns everything, including the state and police). See the issue now?

This isn't theory, ask the Luddites why they got so mad when their employers started buying machines to replace them. They didn't get richer and freer: they were thrown out to rot on the pavement, while their ex-employers kept 100% of the productivity increases.

adrithmetiqa · 2026-05-09T06:26:33 1778307993

You’re quite correct and we are likely going to stumble into this future despite all the very big brains working on these technologies (including people on hn).

“It is difficult to get a man to understand something, when his salary depends upon his not understanding it.”

justonepost2 · 2026-05-09T14:50:42 1778338242

It’s odd because so many researchers and so many people who are far better engineers than me, can’t see it. I don’t even think it’s the salary for most- it’s just techno-optimist horse blinders, reading assured utopia at the top of an exponential graph.

faangguyindia · 2026-05-09T03:30:39 1778297439

this completely misses the point why alignment exists

Alignment exists to protect shareholder value.

If it creates industry wide outrage, shareholder value declines.

It making shareholders rich and other people poor won't.