Nate Soares ⏹️ Profile Banner
Nate Soares ⏹️ Profile
Nate Soares ⏹️

@So8res

Followers
7,068
Following
75
Media
7
Statuses
1,276
Explore trending content on Musk Viewer
@So8res
Nate Soares ⏹️
10 months
My current stance on AI is: Fucking stop. Find some other route to the glorious transhuman future. There’s debate within the AI alignment community re whether the chance of AI killing literally everyone is more like 20% or 95%, but 20% means worse odds than Russian roulette.
222
125
747
@So8res
Nate Soares ⏹️
3 years
It takes more cleverness to articulate a thought than to think it. If you're thinking at the limits of your abilities, you have thoughts you can't articulate.
23
27
501
@So8res
Nate Soares ⏹️
9 months
Reminder: my reason for expecting AI to go poorly is, deep down, not about alignment being ultra-hard, but about Earth beeing a very derpy place.
15
17
350
@So8res
Nate Soares ⏹️
4 years
Relatedly, one (among many) of my beefs w/ modern schools (& culture more generally) is how much it harps on the dark aspects of humanity, and how little it highlights the light. Humans are rad. Humanity is rad. Sometimes hapless, sometimes evil, but overall, fuck yeah.
9
23
313
@So8res
Nate Soares ⏹️
5 months
@Aella_Girl c'mon ladies, this is how we get kicked out of paradise
8
2
263
@So8res
Nate Soares ⏹️
4 years
It has come to my att'n that some of my friends are unfamiliar with the "humanity, fuck yeah" genre of writing, in which humanity is depicted as awesome (against an interstellar backdrop). Choice example: . Relevant subreddit: .
10
34
248
@So8res
Nate Soares ⏹️
3 years
"Stop using phrases that meticulously track uncommon distinctions you've made; we already have perfectly good phrases that ignore those distinctions, and your audience won't be able to tell the difference!" No.
2
35
243
@So8res
Nate Soares ⏹️
3 years
The definitional gynmastics required to believe that dolphins aren't fish are staggering.
18
50
222
@So8res
Nate Soares ⏹️
3 years
One of my big takeaways from the discussion on this thread is how many people don't understand how insanely powerful a sample size of 19k is. Like, yeah, the correlations are small, but her likelihood ratios for her 0.06 correlations (vs 0.00) are still like a quintillion to one.
@Aella_Girl
Aella
3 years
Sexual fetishes on the political compass, men and women. Total sample size was over 19,000!
Tweet media one
Tweet media two
635
2K
14K
25
16
218
@So8res
Nate Soares ⏹️
2 years
and people assure me that governments will start acting sane and reasonable around AI in the wake of "warning shot" accidents
@WilliamAEden
William Eden
2 years
Peter Daszak has received another grant from the NIH… …to study bat coronaviruses in the wild. After everything the world has just been though. After all the risky research that was supposed to protect us from a global pandemic failed to stop one.
86
430
2K
8
22
205
@So8res
Nate Soares ⏹️
2 years
it's even worse than @yashkaf depicts: big progress often comes from lots of small reconceptualizations. the "i can't distinguish your idea from a worse one in the literature" police are punishing real progress.
@yashkaf
Jakeup
2 years
this is the best part of TPOT the fuck do I care that someone 200 years ago had the same realization and wrote it down somewhere? fucking good for them! what does it matter if I read it in a thread or a book or a thread quoting the book if it's the same idea?
24
22
469
6
9
195
@So8res
Nate Soares ⏹️
1 year
(ftr: I signed onto because I think that the current path leads to destruction, and that the letter's suggestions are marginal steps in the right direction, not because I endorse all its arguments, nor because I think those steps would help all that much.)
7
17
186
@So8res
Nate Soares ⏹️
2 years
The world is not made of arguments. Think not "which of whese arguments, for these two opposing sides, is more compelling? And how reliable is compellingness?" Think instead of the objects the arguments discuss, and let the arguments guide your thoughts about them.
4
25
173
@So8res
Nate Soares ⏹️
10 months
You can't (validly) argue from "we don't know how many bullets are in the chamber of this revolver" to "so playing Russian roulette with this revolver is fine".
21
18
170
@So8res
Nate Soares ⏹️
3 years
Thread about a particular way in which jargon is great:
4
34
161
@So8res
Nate Soares ⏹️
2 years
A common misconception of Aella's research is that it's constructed from Twitter-polls of her followers. Nope! When she reports research results, she's talking about huge surveys of fairly diverse populations. (Much bigger and more diverse than is usual in academia!)
@Aella_Girl
Aella
2 years
People often say that my research is "twitter polls." I do a ton of twitter polls, but I primarily use them to gauge what might be potentially interesting topics for more thorough surveys in the future! My actual research is stuff like this:
22
10
236
5
6
157
@So8res
Nate Soares ⏹️
10 months
But if someone finds a revolver lying around, spins the barrel, and points it at your kid, then your reaction shouldn't be “no worries, we can’t assign an exact probability because we don't know how many rounds are chambered”. Refusing to act b/c the odds are unclear is crazy.)
4
7
160
@So8res
Nate Soares ⏹️
3 years
Me: who are you to say which one of the dishwasher and the clotheswasher is "the" washing machine Her dishes: shattering loudly during the spin cycle
6
10
150
@So8res
Nate Soares ⏹️
3 years
(A complement I once got from a research partner went something like "you just keep reframing the problem ever-so-slightly until the solution seems obvious". <3)
2
8
141
@So8res
Nate Soares ⏹️
3 years
if vampires are sexy humans, why aren't mosquitos sexy bugs?
16
9
119
@So8res
Nate Soares ⏹️
2 years
It's like being in a room full of LEGO machines, and you look at the machine that reads instructions and assembles the other machines, and it's built not out of LEGO but out of cleverly contorted instruction booklets.
3
6
113
@So8res
Nate Soares ⏹️
7 months
reactions to this are like a microcosm of why you usually can't trust humans with consequentialism.
@ESYudkowsky
Eliezer Yudkowsky ⏹️
7 months
in a world of greater legibility, romantic partners would have the conversation about "I'd trade up if I found somebody 10%/25%/125% better than you" in advance, and make sure they have common knowledge of the numbers
367
26
418
13
5
113
@So8res
Nate Soares ⏹️
3 years
my reflexive response to wordle is the same as my reflexive response to 2048 and other such fads: treat it as a low-grade attentional hazard and ignore it until it fades. this has mostly worked out for me, except for pokemon, which apparently never fades
13
3
112
@So8res
Nate Soares ⏹️
2 years
3. a community is probably stronger when its members just blurt out their beliefs (while meticulously being kind to each other). it's much easier to lose your way if you live in a mental world where PR is king over honesty and integrity. HT @robbensinger
2
7
110
@So8res
Nate Soares ⏹️
10 months
The possible benefits from AI are great, but the benefits are significantly greater if we wait until we don’t have double-digit percent chances of killing literally everyone.
3
9
109
@So8res
Nate Soares ⏹️
1 year
i am tickled by how the etymology of supervillain is essentially "better villager"
1
13
102
@So8res
Nate Soares ⏹️
2 years
Also, while I'm on the topic: a fun hidden fact about Earth is that you don't actually need a license to collect and analyze data! No matter what the "do you have a degree" gatekeepers insinuate.
1
6
98
@So8res
Nate Soares ⏹️
2 years
the "curse of cryonics" is when a problem is both weird and very important, but it's sitting right next to other weird problems that are even more important, so everyone who's able to notice weird problems works on something else instead.
10
9
93
@So8res
Nate Soares ⏹️
2 years
oops, I meant that ribosomes are mostly RNA. (RNA is ofc 100% RNA)
@acidshill
shill 🔍
2 years
@So8res did you mean ribosomes are made out of RNA?
1
0
22
6
0
93
@So8res
Nate Soares ⏹️
2 years
in calculus it's convenient to work with infinitesimals: numbers so small that their square is zero. in computer science, we work instead with coinfinitesimals: numbers so large that their square is infinity. which're why CS folk care so much about avoiding quadratic runtimes.
0
6
93
@So8res
Nate Soares ⏹️
2 years
For people who present as caring a bunch about data integrity, they're weirdly unresponsive to the data on their pet theory that Aella's polling population differs radically from a bigger and more diverse survey population. (The data isn't kind to their theory.)
2
3
91
@So8res
Nate Soares ⏹️
10 months
Civilization should say to these people: no, sorry, the (probabilistic) costs you’re imposing on us are too large, we will not permit you to endanger everyone like this, rather than waiting and attaining those benefits later, once we know what we're doing.
1
6
94
@So8res
Nate Soares ⏹️
3 years
Example: according to me, "my model of Alice wants chocolate" leaves Alice more space to disagree than "I think Alice wants chocolate", in part b/c the denial is "your model is wrong", rather than the more confrontational "you are wrong".
6
3
89
@So8res
Nate Soares ⏹️
10 months
(If you're worried about *you personally* losing access to the future because you'll die of old age first, sign up for cryonics, and help improve cryonics technology. I, too, want everyone currently alive to make it to the future!)
5
3
90
@So8res
Nate Soares ⏹️
3 years
@Aella_Girl @sentientist So what you're saying is... you're a shit-eating whore?
2
0
74
@So8res
Nate Soares ⏹️
7 months
it's notable that so many people object "but 'value' doesn't capture..." rather than cautioning "people might neglect the value of...". as if the word "value" must cover only the shallow and superficial features; as if no word is allowed to capture the deeper intangibles.
2
6
89
@So8res
Nate Soares ⏹️
10 months
“Don't worry, we'll watch for signs of danger and then do something unspecified if we see them" is the sort of reassurance labs give when they're trying to cement a status quo in which they get to plow ahead and endanger us all.
1
7
89
@So8res
Nate Soares ⏹️
2 years
I'm grimly amused that Earth seems perhaps "burned out" about pandemics; seems perhaps *less* likely to react quickly and competently than pre-COVID. (Which does not bode well for the "surely humanity will get its act together after a warning shot" theory of AI alignment.)
@NathanpmYoung
Nathan 🔍
2 years
Can some people either start betting this market down or start panicking please?
Tweet media one
10
25
201
6
6
84
@So8res
Nate Soares ⏹️
3 years
I suspect this phenomenon is one cause of jargon. Eg, when a rationalist says "my model of Alice wouldn't like that" instead of "I don't think Alice would like that", the non-standard phraseology tracks a non-standard way they're thinking about Alice.
1
2
84
@So8res
Nate Soares ⏹️
3 years
My internal language has a bunch of cool features that English lacks. I like these features, and speaking in a way that reflects them is part of the process of transmitting them.
2
3
83
@So8res
Nate Soares ⏹️
1 year
This is Aella, stealing Nate's phone. Its his birthday and I made him a rate-nate birthday survey! If you're familiar with Nate's personality even a little I'd love if you could fill it out. Gonna give him some graphs as a gift.
7
2
84
@So8res
Nate Soares ⏹️
10 months
But more generally, civilization at large should not be accepting this state of affairs. Maybe you can't tell who's right, but you should be able to tell that this isn't what a mature and healthy field sounds like, and that it shouldn't get to endager you like this.
2
5
84
@So8res
Nate Soares ⏹️
2 years
ok my new theory is that girls can both have a feeling active *and* be forming words at the same time. and this is just, like, how they live their lives
19
3
79
@So8res
Nate Soares ⏹️
2 years
Big ambitions are for prioritizing between projects you'd love to work on, not for gatekeeping your enthusiasm.
1
3
82
@So8res
Nate Soares ⏹️
3 years
If I were designing a language, I would not render it easy to assign properties like "correct" to a whole person -- as opposed to, say, that person's map of some particular region of the territory.
4
6
80
@So8res
Nate Soares ⏹️
2 years
If you think you have proofs of both A and ¬A, think not "which proof is more persuasive?". Instead, observe that you are mistaken. Either the two statements are not in fact opposed, or one supposed-proof contains a flaw. Don't weigh proofs; seek flaws. So too with arguments.
3
7
81
@So8res
Nate Soares ⏹️
10 months
Picture putting a planet-sized revolver up against Earth, with one round chambered. That's akin to what companies (or gov'ts!) are doing when they build towards superintelligent AI at our current level of understanding. More than a 1 in 6 chance that literally everybody dies.
5
4
81
@So8res
Nate Soares ⏹️
2 years
You don't have to have slightly different beliefs from others, to show that you're cool. You can just adopt others' beliefs wholesale, if they seem right.
4
1
80
@So8res
Nate Soares ⏹️
10 months
My take on RSPs: it is *both* true that labs committing to any plausible reason why they might stop scaling is object-level directionally better than committing to nothing at all, *and* true that RSPs could have the negative effect of relieving regulatory pressure.
1
12
79
@So8res
Nate Soares ⏹️
7 months
which sure would explain why many people hate on consequentialism; [legible-consequence]alism is a much worse moral theory than [comprehensive-consequence]alism.
5
1
80
@So8res
Nate Soares ⏹️
3 years
Another modern battle ground is "berry". Protip: if your new proposed definition of "berry" includes neither strawberries nor raspberries then it is a BAD PROPOSAL. You can tell by how "strawberry" and "raspberry" have "berry" in the name.
3
8
76
@So8res
Nate Soares ⏹️
3 years
can't tell whether there's only two sex positions that everybody pretends are lots of different positions, or
7
5
77
@So8res
Nate Soares ⏹️
2 years
2. people and institutions lauded as genius are often held together by only bubble-gum, wishes, and a favorable environment. if you rely on those people/institutions to accomplish great feats of competence under pressure, you're in touble.
3
5
76
@So8res
Nate Soares ⏹️
10 months
If the labs were coming right out and saying: “Yes, we’re endangering all your lives, with >1/6 probability, but we believe that’s OK because the benefits are sufficiently great / we believe we have to because otherwise people that you like even less will kill everybody first,"
2
6
76
@So8res
Nate Soares ⏹️
3 years
Enough model-building, more shitposting: the US should start taking that "all [people] are created equal" clause seriously, and declare that everyone in the world has US citizenship if they but wish it so.
3
6
75
@So8res
Nate Soares ⏹️
10 months
(I'm on the "more like 95%" side myself, but this thread is gonna be about how I recommend non-experts respond to the situation, and I think "> 1/6" is both more obvious and is sufficient for my purposes here.)
6
1
75
@So8res
Nate Soares ⏹️
10 months
Continuing: these people who are playing Russian roulette with the planet have no credible offer that’s worth enough that they should be putting all our lives at such grave risk.
1
1
75
@So8res
Nate Soares ⏹️
3 years
Cucumber? Central example of a vegetable. Strawberry? Central example of a fruit. If you're having trouble figuring out which is which, ask some local children to help you out.
3
6
72
@So8res
Nate Soares ⏹️
3 years
In my experience, conceptual clarity is often attained by a large number of minor viewpoint shifts.
2
5
72
@So8res
Nate Soares ⏹️
3 years
(Coarse examples: folks who think in probabilities might become awkward around definite statements of fact; people who get into NVC sometimes shift their language about thoughts and feelings. I claim more subtle linguistic shifts regularly come hand-in-hand w/ good thinking.)
3
1
71
@So8res
Nate Soares ⏹️
10 months
The world is full of people who will say they've "taken appropriate safety measures" or otherwise give a useless superficial response before plowing on ahead without addressing the deeper underlying issues.
2
1
71
@So8res
Nate Soares ⏹️
2 years
i was gonna sit this one out, but everyone else is out there getting a good grind in, so here's a few of mine that seem relevant
@ozyfrantz
ozy brennan 🦙
2 years
the FTX scandal proves the importance of the axe I was grinding all along
5
5
130
3
2
70
@So8res
Nate Soares ⏹️
2 years
Also, academia isn't reliably good at statistics, nor at analyzing social/psychological phenomena, so it's a strange point of comparison. Unless you're just swatting down someone who you think has risen above their station, ofc, in which case the objections make perfect sense.
3
3
71
@So8res
Nate Soares ⏹️
3 years
The people using "bug" to mean icky creepy crawly, "tree" to mean hard leafy greenery, and "berry" to mean bright lil bushfruit have got a good thing going. Stop tryina steal our words. Invent new ones. You can even use pseudolatin if you wanna sound all fancy and educated.
2
6
68
@So8res
Nate Soares ⏹️
10 months
"Have licenses" and "run evals" are fine suggestions, they’re helpful, but they’re not how a sane planet responds to this level of horrific threat. The sane response is to shut it down entirely, and find some other route.
2
2
70
@So8res
Nate Soares ⏹️
3 years
Yet somehow, once we figured out about genealogy, the pedants were like "well actually this fish's uncle was a fuzzy pigdear, so it's not actually a fish, you uneducated idiot, you absolute moron" and then we all forgot what "fish" meant out of sheer shame or something???
2
4
67
@So8res
Nate Soares ⏹️
7 months
it seems many people intuitively think that words like "value" can only apply to the legible and easily articulable aspects of things.
4
5
69
@So8res
Nate Soares ⏹️
10 months
In lieu of much more serious ownership and responsibility from labs, Earth just shouldn't be buying what they're saying. The sounds you're hearing are the noncommittal sounds of labs that just want to keep scaling and see what happens. Civilization shouldn't allow it.
1
4
68
@So8res
Nate Soares ⏹️
3 years
In sum, my internal dialect has drifted away from American English, and that suits me just fine, tyvm. I'll do my best to be newcomer-friendly and inclusive, but I'm unwilling to drop distinctions from my words just to avoid an odd turn of phrase.
1
0
65
@So8res
Nate Soares ⏹️
2 years
1. take note: the status quo is not always stable. your "criticism contest" can fail to reveal the giant risk that was staring you in the face the whole time. regularities you were implicitly and unconsciously relying on can evaporate overnight.
1
1
66
@So8res
Nate Soares ⏹️
10 months
To be clear: @ARC_Evals is asking for more than just "run evals", IIUC it's asking for something more like "name the capabilities that would cause you to pause, and the protective measures you'd take", which is somewhat better.
1
3
67
@So8res
Nate Soares ⏹️
3 years
There's a big diff between "I only saw a little data w/ a 0.06 correlation, which is probably noise" and "I saw such an enormously overwhelming pile of data than I am billions-to-one confident that the correlation is not 0.00, not 0.12, but 0.06 exactly". Aella's doin the latter.
2
2
62
@So8res
Nate Soares ⏹️
10 months
"Fucking stop" is a notably stronger response than “well, I guess we should require all the labs to have licenses” or “well, I guess we should require all the labs to run evals (like or ) so that notice early signs of danger"
1
0
64
@So8res
Nate Soares ⏹️
3 years
In fact, "you are wrong" is a type error in my internal tongue. My English-to-internal-tongue translator chokes when I try to run it on "you're wrong", and suggests (eg) "I disagree" or perhaps "you're wrong about whether I want chocolate".
1
1
62
@So8res
Nate Soares ⏹️
10 months
if they were coming right out and saying that bluntly, then... well, it wouldn't make things better, but at least it'd be honest. At least we could have a discussion about whether they're correct to think that the benefits are worth the risks, vs whether they should wait.
1
3
63
@So8res
Nate Soares ⏹️
2 years
re: the Canadian euthanasia controversy, I propose a compromise: cryopreserve them.
3
3
59
@So8res
Nate Soares ⏹️
3 years
Another part of why I flinch at jargon-policing is a suspicion that if someone regularly renders thoughts that track a distinction into words that don't, it erodes the distinction in their own head. Maintaining distinctions that your spoken language lacks is difficult!
1
0
57
@So8res
Nate Soares ⏹️
10 months
I propose we spell [ʒ] as "zh", like in "I'll have the uzh" (as a shortening of "I'll have the usual") or "It'll be pretty cazh" (as a shortening of "It'll be pretty casual"). Because s : z :: sh : zh.
8
6
57
@So8res
Nate Soares ⏹️
10 months
Also: not all labs are exactly the same on this count. Anthropic has at least committed to make AIs that can produce bioweapons only once they can prevent them from being stolen. …which is a far cry from owning how they're gambling with our lives, but it's better than "yolo".
2
5
61
@So8res
Nate Soares ⏹️
2 years
I just learned that RNA is made (mostly) out of RNA, rather than protein. Which makes complete sense, but also: wow.
8
1
57
@So8res
Nate Soares ⏹️
9 months
and in part because… well, I was hopeful there, for a moment, and those hopes were dashed. Which hurts. And maybe it's also helpful to display a little of that pain, from time to time.
3
0
59
@So8res
Nate Soares ⏹️
3 years
"A fruit is a plant ovary". Nope! Fruits are the sweet yummy ones. Vegetables have a more muted, often savory, often slightly bitter taste, that many people dislike (or at least like much less than fruit) when they're young.
1
5
56
@So8res
Nate Soares ⏹️
7 months
"it ignores how relationships get better with investment" nope, that's an increase in your value to each other that makes it harder to find someone worth trading up for.
2
0
58
@So8res
Nate Soares ⏹️
9 months
And I'm not here to criticize; I dunno what the board knew, but it's clear now that we're not on a sensible track. Even if this is the result of good intent all around, good intent isn't _enough_, on Earth. You try to kick the ball forward and it goes sideways. Derpery prevails.
2
0
58
@So8res
Nate Soares ⏹️
10 months
So, what now? Governments who have been alerted to the risks need to actually respond, and quickly. A research direction that poses such an insane level of risk needs to be halted immediately, and we need to find some saner way through these woods to a wonderful future.
9
3
58
@So8res
Nate Soares ⏹️
3 years
The "my model of Alice"-style phrasing is part of a more general program of distinguishing people from their maps. I don't claim to do this perfectly, but I'm trying, and I appreciate others who are trying.
2
1
54
@So8res
Nate Soares ⏹️
3 years
My stance on open borders is "what's the point of having all these fighter jets if we aren't even escorting transports to the Uighur camps to see who wants their liberty"
2
6
55
@So8res
Nate Soares ⏹️
10 months
I’m deeply skeptical; if they were the sort to notice early hints of later issues and react appropriately then I expect they’d be reacting differently to the hints we already have today (present-day jailbreaks, shallow instances of deception, etc.).
1
1
56
@So8res
Nate Soares ⏹️
10 months
(When I point this out in person, a surprising number of people respond to this point by saying: well, with Russian roulette, we know that the probability is exactly 1/6, whereas with AI we have no idea what our odds are.
2
2
56
@So8res
Nate Soares ⏹️
3 years
(Or, at least, I think this is true of me and of many of the folks I interact with daily. I suspect phraseology is contagious and that bystanders may pick up the alt manner of speaking w/out picking up the alt manner of thinking, etc.)
2
0
55
@So8res
Nate Soares ⏹️
10 months
And: I appreciate that at least some leaders at at least some labs are acknowledging that this tech gambles with all our lives (despite failing to take personal responsibility, and despite saying one thing on tech podcasts and another to congress, and ...).
1
2
56
@So8res
Nate Soares ⏹️
3 years
"But everyone knows that "you're wrong" has a silent "(about X)" parenthetical!", my straw conversational partner protests. I disagree. English makes it all too easy to represent confused thoughts like "maybe I'm bad".
1
2
55
@So8res
Nate Soares ⏹️
10 months
“and as part of carrying that responsibility, we’re testing our AIs in the following ways, and if these tests come back positive we’ll do [some specific thing that's not superficial and that's hard to game]",
1
2
56
@So8res
Nate Soares ⏹️
3 years
PSA: In my book, everyone has an unlimited number of "I don't understand", "plz say that again in different words", "plz expand upon that", and "plz pause while I absorb that" tokens.
2
3
55
@So8res
Nate Soares ⏹️
10 months
A second issue is: what are you supposed to do when the evals say “this AI is dangerous”?
1
1
54
@So8res
Nate Soares ⏹️
3 years
So, look: this isn't about who the fish's uncle is. When a kid points at a whale and says "look, a fish", and you're like "haha no, its tail flaps horizontally and its gradma had hair", who's in the wrong here?
1
3
53
@So8res
Nate Soares ⏹️
3 years
Sometimes a bunch of small shifts leave people talking a bit differently, b/c now they're thinking a bit differently. The old phrasings don't feel quite right -- maybe they conflate distinct concepts, or rely implicitly on some bad assumption, etc.
2
3
53
@So8res
Nate Soares ⏹️
3 years
The big fortresses on the side of righteousness are "crab" () and "tree" () which are such ragged genealogical concepts and such useful functional concepts that the misguided pedantry virus hasn't yet found a foothold.
2
3
53
@So8res
Nate Soares ⏹️
2 years
If an argument misleads you, think not "I have learned that similarly compelling arguments are often wrong." Think instead of which step within it was wrong, and adjust your thoughts so that they are not so easily guided down wrong paths.
1
3
54