Nate Soares ⏹️ @So8res profile

Nate Soares ⏹️

@So8res

Followers

7,068

Following

75

Media

7

Statuses

1,276

https://t.co/IYrooIZ4Cd

Berkeley

Joined January 2011

Don't wanna be here? Send us removal request.

Explore tweets Explore followers Explore following

Explore trending content on Musk Viewer

River • 180698 Tweets

Bill Clinton • 176202 Tweets

#DNC2024 • 157817 Tweets

Botafogo • 139236 Tweets

Simón • 97281 Tweets

Oprah • 96286 Tweets

O Palmeiras • 73475 Tweets

TODAY WIMARNNAM ENGFA • 67871 Tweets

Chama • 54288 Tweets

Abel • 53686 Tweets

Your Season Interview • 41262 Tweets

#ファミマの増量チョコ • 38925 Tweets

Talleres • 36611 Tweets

サンサン • 33546 Tweets

#AEWDynamite • 33408 Tweets

#jjk267 • 33041 Tweets

BOSSNOEUL TBNW Q1 • 31532 Tweets

Gallardo • 31454 Tweets

Borja • 30148 Tweets

Pete Buttigieg • 29152 Tweets

Josh Shapiro • 26340 Tweets

Hakeem Jeffries • 24772 Tweets

Nobara • 24440 Tweets

John Legend • 21416 Tweets

Meza • 21128 Tweets

Stevie Wonder • 20699 Tweets

Bexar County • 19115 Tweets

Joey Votto • 15348 Tweets

Rony • 15187 Tweets

Allianz • 11284 Tweets

San Antonio • 11011 Tweets

Lázaro • 10478 Tweets

Flaco Lopez

Let's Go Crazy

Kranevitter

Igor Jesus

Kesatuan

Tetap Jaga Persatuan

Bustos

Savarino

Sheila E

Leila

Felipe Anderson

Gustavo Gomez

Estevão

#PALxBOT

Amanda Gorman

Veiga

Caio Paulista

Wes Moore

Last Seen Profiles

@Keomis_bread

@SlyDhelali

@runako71667651

@odarnij

@rasikichi

@Drunken_TigerJK

@riantiraf

@kikukawahi56031

@mariausquiano

@bunda__stw

@itsjwills

@rcgarcia1986

@lovebondage

@iwauganda

@hokauzono

@CumonSluts18

@ma_camiladiaz

@liveleakgfz

@redhotmurphy

@liypeony

Nate Soares ⏹️

@So8res

10 months

My current stance on AI is: Fucking stop. Find some other route to the glorious transhuman future. There’s debate within the AI alignment community re whether the chance of AI killing literally everyone is more like 20% or 95%, but 20% means worse odds than Russian roulette.

222

125

747

Nate Soares ⏹️

@So8res

3 years

It takes more cleverness to articulate a thought than to think it. If you're thinking at the limits of your abilities, you have thoughts you can't articulate.

23

27

501

Nate Soares ⏹️

@So8res

9 months

Reminder: my reason for expecting AI to go poorly is, deep down, not about alignment being ultra-hard, but about Earth beeing a very derpy place.

15

17

350

Nate Soares ⏹️

@So8res

4 years

Relatedly, one (among many) of my beefs w/ modern schools (& culture more generally) is how much it harps on the dark aspects of humanity, and how little it highlights the light. Humans are rad. Humanity is rad. Sometimes hapless, sometimes evil, but overall, fuck yeah.

9

23

313

Nate Soares ⏹️

@So8res

5 months

@Aella_Girl c'mon ladies, this is how we get kicked out of paradise

8

2

263

Nate Soares ⏹️

@So8res

4 years

It has come to my att'n that some of my friends are unfamiliar with the "humanity, fuck yeah" genre of writing, in which humanity is depicted as awesome (against an interstellar backdrop). Choice example: . Relevant subreddit: .

10

34

248

Nate Soares ⏹️

@So8res

3 years

"Stop using phrases that meticulously track uncommon distinctions you've made; we already have perfectly good phrases that ignore those distinctions, and your audience won't be able to tell the difference!" No.

2

35

243

Nate Soares ⏹️

@So8res

3 years

The definitional gynmastics required to believe that dolphins aren't fish are staggering.

18

50

222

Nate Soares ⏹️

@So8res

3 years

One of my big takeaways from the discussion on this thread is how many people don't understand how insanely powerful a sample size of 19k is. Like, yeah, the correlations are small, but her likelihood ratios for her 0.06 correlations (vs 0.00) are still like a quintillion to one.

Aella

@Aella_Girl

3 years

Sexual fetishes on the political compass, men and women. Total sample size was over 19,000!

635

2K

14K

25

16

218

Nate Soares ⏹️

@So8res

2 years

and people assure me that governments will start acting sane and reasonable around AI in the wake of "warning shot" accidents

William Eden

@WilliamAEden

2 years

Peter Daszak has received another grant from the NIH… …to study bat coronaviruses in the wild. After everything the world has just been though. After all the risky research that was supposed to protect us from a global pandemic failed to stop one.

86

430

2K

8

22

205

Nate Soares ⏹️

@So8res

2 years

it's even worse than @yashkaf depicts: big progress often comes from lots of small reconceptualizations. the "i can't distinguish your idea from a worse one in the literature" police are punishing real progress.

Jakeup

@yashkaf

2 years

this is the best part of TPOT the fuck do I care that someone 200 years ago had the same realization and wrote it down somewhere? fucking good for them! what does it matter if I read it in a thread or a book or a thread quoting the book if it's the same idea?

24

22

469

6

9

195

Nate Soares ⏹️

@So8res

1 year

(ftr: I signed onto because I think that the current path leads to destruction, and that the letter's suggestions are marginal steps in the right direction, not because I endorse all its arguments, nor because I think those steps would help all that much.)

Pause Giant AI Experiments: An Open Letter - Future of Life Institute

We call on all AI labs to immediately pause for at least 6 months the training of AI systems more powerful than GPT-4.

futureoflife.org

7

17

186

Nate Soares ⏹️

@So8res

2 years

The world is not made of arguments. Think not "which of whese arguments, for these two opposing sides, is more compelling? And how reliable is compellingness?" Think instead of the objects the arguments discuss, and let the arguments guide your thoughts about them.

4

25

173

Nate Soares ⏹️

@So8res

10 months

You can't (validly) argue from "we don't know how many bullets are in the chamber of this revolver" to "so playing Russian roulette with this revolver is fine".

21

18

170

Nate Soares ⏹️

@So8res

3 years

Thread about a particular way in which jargon is great:

4

34

161

Nate Soares ⏹️

@So8res

2 years

A common misconception of Aella's research is that it's constructed from Twitter-polls of her followers. Nope! When she reports research results, she's talking about huge surveys of fairly diverse populations. (Much bigger and more diverse than is usual in academia!)

Aella

@Aella_Girl

2 years

People often say that my research is "twitter polls." I do a ton of twitter polls, but I primarily use them to gauge what might be potentially interesting topics for more thorough surveys in the future! My actual research is stuff like this:

22

10

236

5

6

157

Nate Soares ⏹️

@So8res

10 months

But if someone finds a revolver lying around, spins the barrel, and points it at your kid, then your reaction shouldn't be “no worries, we can’t assign an exact probability because we don't know how many rounds are chambered”. Refusing to act b/c the odds are unclear is crazy.)

4

7

160

Nate Soares ⏹️

@So8res

3 years

Me: who are you to say which one of the dishwasher and the clotheswasher is "the" washing machine Her dishes: shattering loudly during the spin cycle

6

10

150

Nate Soares ⏹️

@So8res

3 years

(A complement I once got from a research partner went something like "you just keep reframing the problem ever-so-slightly until the solution seems obvious". <3)

2

8

141

Nate Soares ⏹️

@So8res

3 years

if vampires are sexy humans, why aren't mosquitos sexy bugs?

16

9

119

Nate Soares ⏹️

@So8res

2 years

It's like being in a room full of LEGO machines, and you look at the machine that reads instructions and assembles the other machines, and it's built not out of LEGO but out of cleverly contorted instruction booklets.

3

6

113

Nate Soares ⏹️

@So8res

7 months

reactions to this are like a microcosm of why you usually can't trust humans with consequentialism.

Eliezer Yudkowsky ⏹️

@ESYudkowsky

7 months

in a world of greater legibility, romantic partners would have the conversation about "I'd trade up if I found somebody 10%/25%/125% better than you" in advance, and make sure they have common knowledge of the numbers

367

26

418

13

5

113

Nate Soares ⏹️

@So8res

3 years

my reflexive response to wordle is the same as my reflexive response to 2048 and other such fads: treat it as a low-grade attentional hazard and ignore it until it fades. this has mostly worked out for me, except for pokemon, which apparently never fades

13

3

112

Nate Soares ⏹️

@So8res

2 years

3. a community is probably stronger when its members just blurt out their beliefs (while meticulously being kind to each other). it's much easier to lose your way if you live in a mental world where PR is king over honesty and integrity. HT @robbensinger

2

7

110

Nate Soares ⏹️

@So8res

10 months

The possible benefits from AI are great, but the benefits are significantly greater if we wait until we don’t have double-digit percent chances of killing literally everyone.

3

9

109

Nate Soares ⏹️

@So8res

1 year

i am tickled by how the etymology of supervillain is essentially "better villager"

1

13

102

Nate Soares ⏹️

@So8res

2 years

Also, while I'm on the topic: a fun hidden fact about Earth is that you don't actually need a license to collect and analyze data! No matter what the "do you have a degree" gatekeepers insinuate.

1

6

98

Nate Soares ⏹️

@So8res

2 years

the "curse of cryonics" is when a problem is both weird and very important, but it's sitting right next to other weird problems that are even more important, so everyone who's able to notice weird problems works on something else instead.

10

9

93

Nate Soares ⏹️

@So8res

2 years

oops, I meant that ribosomes are mostly RNA. (RNA is ofc 100% RNA)

shill 🔍

@acidshill

2 years

@So8res did you mean ribosomes are made out of RNA?

1

0

22

6

0

93

Nate Soares ⏹️

@So8res

2 years

in calculus it's convenient to work with infinitesimals: numbers so small that their square is zero. in computer science, we work instead with coinfinitesimals: numbers so large that their square is infinity. which're why CS folk care so much about avoiding quadratic runtimes.

0

6

93

Nate Soares ⏹️

@So8res

2 years

For people who present as caring a bunch about data integrity, they're weirdly unresponsive to the data on their pet theory that Aella's polling population differs radically from a bigger and more diverse survey population. (The data isn't kind to their theory.)

2

3

91

Nate Soares ⏹️

@So8res

10 months

Civilization should say to these people: no, sorry, the (probabilistic) costs you’re imposing on us are too large, we will not permit you to endanger everyone like this, rather than waiting and attaining those benefits later, once we know what we're doing.

1

6

94

Nate Soares ⏹️

@So8res

3 years

Example: according to me, "my model of Alice wants chocolate" leaves Alice more space to disagree than "I think Alice wants chocolate", in part b/c the denial is "your model is wrong", rather than the more confrontational "you are wrong".

6

3

89

Nate Soares ⏹️

@So8res

10 months

(If you're worried about *you personally* losing access to the future because you'll die of old age first, sign up for cryonics, and help improve cryonics technology. I, too, want everyone currently alive to make it to the future!)

5

3

90

Nate Soares ⏹️

@So8res

3 years

@Aella_Girl @sentientist So what you're saying is... you're a shit-eating whore?

2

0

74

Nate Soares ⏹️

@So8res

7 months

it's notable that so many people object "but 'value' doesn't capture..." rather than cautioning "people might neglect the value of...". as if the word "value" must cover only the shallow and superficial features; as if no word is allowed to capture the deeper intangibles.

2

6

89

Nate Soares ⏹️

@So8res

10 months

“Don't worry, we'll watch for signs of danger and then do something unspecified if we see them" is the sort of reassurance labs give when they're trying to cement a status quo in which they get to plow ahead and endanger us all.

1

7

89

Nate Soares ⏹️

@So8res

2 years

I'm grimly amused that Earth seems perhaps "burned out" about pandemics; seems perhaps *less* likely to react quickly and competently than pre-COVID. (Which does not bode well for the "surely humanity will get its act together after a warning shot" theory of AI alignment.)

Nathan 🔍

@NathanpmYoung

2 years

Can some people either start betting this market down or start panicking please?

10

25

201

6

84

Nate Soares ⏹️

@So8res

3 years

I suspect this phenomenon is one cause of jargon. Eg, when a rationalist says "my model of Alice wouldn't like that" instead of "I don't think Alice would like that", the non-standard phraseology tracks a non-standard way they're thinking about Alice.

1

2

84

Nate Soares ⏹️

@So8res

3 years

My internal language has a bunch of cool features that English lacks. I like these features, and speaking in a way that reflects them is part of the process of transmitting them.

2

3

83

Nate Soares ⏹️

@So8res

1 year

This is Aella, stealing Nate's phone. Its his birthday and I made him a rate-nate birthday survey! If you're familiar with Nate's personality even a little I'd love if you could fill it out. Gonna give him some graphs as a gift.

7

2

84

Nate Soares ⏹️

@So8res

10 months

But more generally, civilization at large should not be accepting this state of affairs. Maybe you can't tell who's right, but you should be able to tell that this isn't what a mature and healthy field sounds like, and that it shouldn't get to endager you like this.

2

5

84

Nate Soares ⏹️

@So8res

2 years

ok my new theory is that girls can both have a feeling active *and* be forming words at the same time. and this is just, like, how they live their lives

19

3

79

Nate Soares ⏹️

@So8res

2 years

Big ambitions are for prioritizing between projects you'd love to work on, not for gatekeeping your enthusiasm.

1

3

82

Nate Soares ⏹️

@So8res

3 years

If I were designing a language, I would not render it easy to assign properties like "correct" to a whole person -- as opposed to, say, that person's map of some particular region of the territory.

4

6

80

Nate Soares ⏹️

@So8res

2 years

If you think you have proofs of both A and ¬A, think not "which proof is more persuasive?". Instead, observe that you are mistaken. Either the two statements are not in fact opposed, or one supposed-proof contains a flaw. Don't weigh proofs; seek flaws. So too with arguments.

3

7

81

Nate Soares ⏹️

@So8res

10 months

Picture putting a planet-sized revolver up against Earth, with one round chambered. That's akin to what companies (or gov'ts!) are doing when they build towards superintelligent AI at our current level of understanding. More than a 1 in 6 chance that literally everybody dies.

5

4

81

Nate Soares ⏹️

@So8res

2 years

You don't have to have slightly different beliefs from others, to show that you're cool. You can just adopt others' beliefs wholesale, if they seem right.

4

1

80

Nate Soares ⏹️

@So8res

10 months

My take on RSPs: it is *both* true that labs committing to any plausible reason why they might stop scaling is object-level directionally better than committing to nothing at all, *and* true that RSPs could have the negative effect of relieving regulatory pressure.

1

12

79

Nate Soares ⏹️

@So8res

7 months

which sure would explain why many people hate on consequentialism; [legible-consequence]alism is a much worse moral theory than [comprehensive-consequence]alism.

5

1

80

Nate Soares ⏹️

@So8res

3 years

Another modern battle ground is "berry". Protip: if your new proposed definition of "berry" includes neither strawberries nor raspberries then it is a BAD PROPOSAL. You can tell by how "strawberry" and "raspberry" have "berry" in the name.

3

8

76

Nate Soares ⏹️

@So8res

3 years

can't tell whether there's only two sex positions that everybody pretends are lots of different positions, or

7

5

77

Nate Soares ⏹️

@So8res

2 years

2. people and institutions lauded as genius are often held together by only bubble-gum, wishes, and a favorable environment. if you rely on those people/institutions to accomplish great feats of competence under pressure, you're in touble.

3

5

76

Nate Soares ⏹️

@So8res

10 months

If the labs were coming right out and saying: “Yes, we’re endangering all your lives, with >1/6 probability, but we believe that’s OK because the benefits are sufficiently great / we believe we have to because otherwise people that you like even less will kill everybody first,"

2

6

76

Nate Soares ⏹️

@So8res

3 years

Enough model-building, more shitposting: the US should start taking that "all [people] are created equal" clause seriously, and declare that everyone in the world has US citizenship if they but wish it so.

3

6

75

Nate Soares ⏹️

@So8res

10 months

(I'm on the "more like 95%" side myself, but this thread is gonna be about how I recommend non-experts respond to the situation, and I think "> 1/6" is both more obvious and is sufficient for my purposes here.)

6

1

75

Nate Soares ⏹️

@So8res

10 months

Continuing: these people who are playing Russian roulette with the planet have no credible offer that’s worth enough that they should be putting all our lives at such grave risk.

1

75

Nate Soares ⏹️

@So8res

3 years

Cucumber? Central example of a vegetable. Strawberry? Central example of a fruit. If you're having trouble figuring out which is which, ask some local children to help you out.

3

6

72

Nate Soares ⏹️

@So8res

3 years

In my experience, conceptual clarity is often attained by a large number of minor viewpoint shifts.

2

5

72

Nate Soares ⏹️

@So8res

3 years

(Coarse examples: folks who think in probabilities might become awkward around definite statements of fact; people who get into NVC sometimes shift their language about thoughts and feelings. I claim more subtle linguistic shifts regularly come hand-in-hand w/ good thinking.)

3

1

71

Nate Soares ⏹️

@So8res

10 months

The world is full of people who will say they've "taken appropriate safety measures" or otherwise give a useless superficial response before plowing on ahead without addressing the deeper underlying issues.

2

1

71

Nate Soares ⏹️

@So8res

2 years

i was gonna sit this one out, but everyone else is out there getting a good grind in, so here's a few of mine that seem relevant

ozy brennan 🦙

@ozyfrantz

2 years

the FTX scandal proves the importance of the axe I was grinding all along

5

130

3

2

70

Nate Soares ⏹️

@So8res

2 years

Also, academia isn't reliably good at statistics, nor at analyzing social/psychological phenomena, so it's a strange point of comparison. Unless you're just swatting down someone who you think has risen above their station, ofc, in which case the objections make perfect sense.

3

71

Nate Soares ⏹️

@So8res

3 years

The people using "bug" to mean icky creepy crawly, "tree" to mean hard leafy greenery, and "berry" to mean bright lil bushfruit have got a good thing going. Stop tryina steal our words. Invent new ones. You can even use pseudolatin if you wanna sound all fancy and educated.

2

6

68

Nate Soares ⏹️

@So8res

10 months

"Have licenses" and "run evals" are fine suggestions, they’re helpful, but they’re not how a sane planet responds to this level of horrific threat. The sane response is to shut it down entirely, and find some other route.

2

70

Nate Soares ⏹️

@So8res

3 years

Yet somehow, once we figured out about genealogy, the pedants were like "well actually this fish's uncle was a fuzzy pigdear, so it's not actually a fish, you uneducated idiot, you absolute moron" and then we all forgot what "fish" meant out of sheer shame or something???

2

4

67

Nate Soares ⏹️

@So8res

7 months

it seems many people intuitively think that words like "value" can only apply to the legible and easily articulable aspects of things.

4

5

69

Nate Soares ⏹️

@So8res

10 months

In lieu of much more serious ownership and responsibility from labs, Earth just shouldn't be buying what they're saying. The sounds you're hearing are the noncommittal sounds of labs that just want to keep scaling and see what happens. Civilization shouldn't allow it.

1

4

68

Nate Soares ⏹️

@So8res

3 years

In sum, my internal dialect has drifted away from American English, and that suits me just fine, tyvm. I'll do my best to be newcomer-friendly and inclusive, but I'm unwilling to drop distinctions from my words just to avoid an odd turn of phrase.

1

0

65

Nate Soares ⏹️

@So8res

2 years

1. take note: the status quo is not always stable. your "criticism contest" can fail to reveal the giant risk that was staring you in the face the whole time. regularities you were implicitly and unconsciously relying on can evaporate overnight.

1

66

Nate Soares ⏹️

@So8res

10 months

To be clear: @ARC_Evals is asking for more than just "run evals", IIUC it's asking for something more like "name the capabilities that would cause you to pause, and the protective measures you'd take", which is somewhat better.

1

3

67

Nate Soares ⏹️

@So8res

3 years

There's a big diff between "I only saw a little data w/ a 0.06 correlation, which is probably noise" and "I saw such an enormously overwhelming pile of data than I am billions-to-one confident that the correlation is not 0.00, not 0.12, but 0.06 exactly". Aella's doin the latter.

2

62

Nate Soares ⏹️

@So8res

10 months

"Fucking stop" is a notably stronger response than “well, I guess we should require all the labs to have licenses” or “well, I guess we should require all the labs to run evals (like or ) so that notice early signs of danger"

1

0

64

Nate Soares ⏹️

@So8res

3 years

In fact, "you are wrong" is a type error in my internal tongue. My English-to-internal-tongue translator chokes when I try to run it on "you're wrong", and suggests (eg) "I disagree" or perhaps "you're wrong about whether I want chocolate".

1

62

Nate Soares ⏹️

@So8res

10 months

if they were coming right out and saying that bluntly, then... well, it wouldn't make things better, but at least it'd be honest. At least we could have a discussion about whether they're correct to think that the benefits are worth the risks, vs whether they should wait.

1

3

63

Nate Soares ⏹️

@So8res

2 years

re: the Canadian euthanasia controversy, I propose a compromise: cryopreserve them.

3

59

Nate Soares ⏹️

@So8res

3 years

Another part of why I flinch at jargon-policing is a suspicion that if someone regularly renders thoughts that track a distinction into words that don't, it erodes the distinction in their own head. Maintaining distinctions that your spoken language lacks is difficult!

1

0

57

Nate Soares ⏹️

@So8res

10 months

I propose we spell [ʒ] as "zh", like in "I'll have the uzh" (as a shortening of "I'll have the usual") or "It'll be pretty cazh" (as a shortening of "It'll be pretty casual"). Because s : z :: sh : zh.

8

6

57

Nate Soares ⏹️

@So8res

10 months

Also: not all labs are exactly the same on this count. Anthropic has at least committed to make AIs that can produce bioweapons only once they can prevent them from being stolen. …which is a far cry from owning how they're gambling with our lives, but it's better than "yolo".

2

5

61

Nate Soares ⏹️

@So8res

2 years

I just learned that RNA is made (mostly) out of RNA, rather than protein. Which makes complete sense, but also: wow.

8

1

57

Nate Soares ⏹️

@So8res

9 months

and in part because… well, I was hopeful there, for a moment, and those hopes were dashed. Which hurts. And maybe it's also helpful to display a little of that pain, from time to time.

3

0

59

Nate Soares ⏹️

@So8res

3 years

"A fruit is a plant ovary". Nope! Fruits are the sweet yummy ones. Vegetables have a more muted, often savory, often slightly bitter taste, that many people dislike (or at least like much less than fruit) when they're young.

1

5

56

Nate Soares ⏹️

@So8res

7 months

"it ignores how relationships get better with investment" nope, that's an increase in your value to each other that makes it harder to find someone worth trading up for.

2

0

58

Nate Soares ⏹️

@So8res

9 months

And I'm not here to criticize; I dunno what the board knew, but it's clear now that we're not on a sensible track. Even if this is the result of good intent all around, good intent isn't _enough_, on Earth. You try to kick the ball forward and it goes sideways. Derpery prevails.

2

0

58

Nate Soares ⏹️

@So8res

10 months

So, what now? Governments who have been alerted to the risks need to actually respond, and quickly. A research direction that poses such an insane level of risk needs to be halted immediately, and we need to find some saner way through these woods to a wonderful future.

9

3

58

Nate Soares ⏹️

@So8res

3 years

The "my model of Alice"-style phrasing is part of a more general program of distinguishing people from their maps. I don't claim to do this perfectly, but I'm trying, and I appreciate others who are trying.

2

1

54

Nate Soares ⏹️

@So8res

1 year

@ESYudkowsky 's TIME article better captures my actual views:

The Open Letter on AI Doesn't Go Far Enough

One of the earliest researchers to analyze the prospect of powerful Artificial Intelligence warns of a bleak scenario

time.com

3

7

56

Nate Soares ⏹️

@So8res

3 years

My stance on open borders is "what's the point of having all these fighter jets if we aren't even escorting transports to the Uighur camps to see who wants their liberty"

2

6

55

Nate Soares ⏹️

@So8res

10 months

I’m deeply skeptical; if they were the sort to notice early hints of later issues and react appropriately then I expect they’d be reacting differently to the hints we already have today (present-day jailbreaks, shallow instances of deception, etc.).

1

56

Nate Soares ⏹️

@So8res

10 months

(When I point this out in person, a surprising number of people respond to this point by saying: well, with Russian roulette, we know that the probability is exactly 1/6, whereas with AI we have no idea what our odds are.

2

56

Nate Soares ⏹️

@So8res

3 years

(Or, at least, I think this is true of me and of many of the folks I interact with daily. I suspect phraseology is contagious and that bystanders may pick up the alt manner of speaking w/out picking up the alt manner of thinking, etc.)

2

0

55

Nate Soares ⏹️

@So8res

10 months

And: I appreciate that at least some leaders at at least some labs are acknowledging that this tech gambles with all our lives (despite failing to take personal responsibility, and despite saying one thing on tech podcasts and another to congress, and ...).

1

2

56

Nate Soares ⏹️

@So8res

3 years

"But everyone knows that "you're wrong" has a silent "(about X)" parenthetical!", my straw conversational partner protests. I disagree. English makes it all too easy to represent confused thoughts like "maybe I'm bad".

1

2

55

Nate Soares ⏹️

@So8res

10 months

“and as part of carrying that responsibility, we’re testing our AIs in the following ways, and if these tests come back positive we’ll do [some specific thing that's not superficial and that's hard to game]",

1

2

56

Nate Soares ⏹️

@So8res

3 years

PSA: In my book, everyone has an unlimited number of "I don't understand", "plz say that again in different words", "plz expand upon that", and "plz pause while I absorb that" tokens.

2

3

55

Nate Soares ⏹️

@So8res

10 months

A second issue is: what are you supposed to do when the evals say “this AI is dangerous”?

1

54

Nate Soares ⏹️

@So8res

3 years

So, look: this isn't about who the fish's uncle is. When a kid points at a whale and says "look, a fish", and you're like "haha no, its tail flaps horizontally and its gradma had hair", who's in the wrong here?

1

3

53

Nate Soares ⏹️

@So8res

3 years

Sometimes a bunch of small shifts leave people talking a bit differently, b/c now they're thinking a bit differently. The old phrasings don't feel quite right -- maybe they conflate distinct concepts, or rely implicitly on some bad assumption, etc.

2

3

53

Nate Soares ⏹️

@So8res

3 years

The big fortresses on the side of righteousness are "crab" () and "tree" () which are such ragged genealogical concepts and such useful functional concepts that the misguided pedantry virus hasn't yet found a foothold.

There’s no such thing as a tree (phylogenetically) — LessWrong

So you’ve heard about how fish aren’t a monophyletic group? You’ve heard about carcinization, the process by which ocean arthropods convergently evol…

www.lesswrong.com

2

3

53

Nate Soares ⏹️

@So8res

2 years

If an argument misleads you, think not "I have learned that similarly compelling arguments are often wrong." Think instead of which step within it was wrong, and adjust your thoughts so that they are not so easily guided down wrong paths.

1

3

54