Worth looking at both the quoted text here and •especially• the linked page, which is quite good.

lily_and_frog@mastodon.art

Yep!
But in the case of LLMs, it's thousands and thousands of years of experience + hundreds and hundreds of years of just... labelling the source material!

temptoetiam@eldritch.cafe

@datarama @inthehands That is a reasonably achievable goal since it is pretty short and written in a very accessible language.
So feel encouraged to try if you wish to!
(As a French person, I never had to learn it the hard way, and admire anyone who does)

mhoye@cosocial.ca

@inthehands

We need to consider the possibility that soon cp(1) will become sentient.

https://exple.tive.org/blarg/2026/02/12/the-pride-of-subject-hometown-here/

https://exple.tive.org/blarg/2026/02/07/on-the-crank-spectrum/

datarama@hachyderm.io

@temptoetiam @inthehands I can read Borges (slowly and with embarrassingly frequent dictionary breaks) in the original Spanish! That's actually one of the ways I maintained being able to at least read the language (though I struggle with understanding spoken Spanish, if it's spoken at a natural pace) since back when I took Spanish in high school.

(I can also read Russian children's literature - *very* far from my goal of being able to read the Strugatsky brothers' science fiction in the original language.)

inthehands@hachyderm.io

As per my posts, I have the luxury of not having LLM vendors shoved down my throat, and I generally avoid them for ethical reasons:

https://hachyderm.io/@inthehands/116581463138461199

But because all these questions about the usage and limits of these tools keep crashing through my doors, all of our doors, whatever we think of the ethical showstoppers, well…

…fight off amazing percentages of LLM overhype with this one weird question.

/end

joe@f.duriansoftware.com

@inthehands i've noticed a trend in anecdotes recently where people are finding it harder to trace their novel-seeming LLM outputs back to inputs. i wonder if this is a result of them atomizing their inputs more finely, or being "better" at swapping the tokens around to make output look original. (an AI bro might argue that at some point human creativity is doing the same thing…)

gildilinie@beige.party

@inthehands Going further: if you could google the source for a HTTP server in JavaScript in 2015, the LLM should be able to output one in 0 (zero) minutes or it's failed.

inthehands@hachyderm.io

@datarama
I know just enough French to have read Le Petit Prince in the original language (with some struggle), and…it really is beautiful in French in a way that translations don't capture. “On ne voit bien qu'avec le coeur” can translate into English quite directly as “One does not see well but with the heart,” but it just doesn't have the same poetry and magic at all.

temptoetiam@eldritch.cafe

@datarama @inthehands knowing Spanish is a great stepping stone to learning any other Romance language!
Bon courage à toi

datarama@hachyderm.io

@inthehands I've read it in Danish and English. I personally like the Danish translation best.

inthehands@hachyderm.io

@GalbinusCaeli
To be fair, this is an algorithmically difficult problem that was still largely an open question 10-15 years ago! Scale down your expectations by 2 or 3 orders of magnitude, and modern machine learning is truly impressive.

Not a $10 trillion industry. But it's impressive in a “cool research” sense, and also in a “oo, that may pose serious societal danger” sense.

inthehands@hachyderm.io

@michael_w_busch
Yup. Same with passing the bar.

bifouba@kolektiva.social

@inthehands

I'm as anti-"AI" as they come, but this is a much stronger argument against these systems being intelligent, or about to achieve a breakthrough, than it is against the claim that they are useful. The ability even to quickly retrieve a known right answer needle from a haystack of less useful answers (as opposed to coming up with a new right answer from first principles) would potentially be a valuable service, if it were reliable (and less inefficient, ecologically suicidal, etc.).

inthehands@hachyderm.io

@datarama @Linza
My own position on this is that the book is perfect, and should not be adapted.

I'm pretty sure Miyazaki understands this — and if he •were• making an adaptation, it would be because he's actually writing a dramatically different story that is largely new material and profoundly different in its scope and arc, as he did with both Kiki and Howl.

datarama@hachyderm.io

@inthehands @Linza Hence, "interpretation". That is kinda the thing he does.

(BTW, I seem to recall him mentioning that it is his favourite book.)

inthehands@hachyderm.io

@joe
Yeah, people were having that same argument about humans and creativity on the more academic side of my circles back in 2023. It would be an interesting one if it didn't have all this investment money weighing it down! (Human learning, both technical and artistic, almost always starts with imitation and repetition; clearly it's a building block of this messy constellation of things that we call “intelligence.”)

I do think the models are getting better at atomizing, as you put it, and I'm disappointed that there's not more research on this family of reverse-mapping problems. One question I've wondered about: can we quantify how much the output depended on a given input? e.g. how would the probability of a given output have changed if the model were trained without <pattern> in its training data?
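
A toy sketch of that leave-one-out question, for illustration only. It assumes a tiny add-one-smoothed bigram model standing in for a real LLM, where retraining is cheap; every name here (`train_bigram`, `log_prob`, `leave_one_out_influence`) is hypothetical and not from any library, and nothing about it is claimed to scale to actual models.

```python
from collections import Counter, defaultdict
import math


def train_bigram(corpus):
    """Count word-to-next-word transitions over a list of sentences."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def log_prob(counts, sentence, vocab_size, alpha=1.0):
    """Add-alpha smoothed log-probability of a sentence under the counts."""
    words = ["<s>"] + sentence.split() + ["</s>"]
    total = 0.0
    for prev, nxt in zip(words, words[1:]):
        row = counts.get(prev, Counter())  # unseen context: all-zero counts
        total += math.log(
            (row[nxt] + alpha) / (sum(row.values()) + alpha * vocab_size)
        )
    return total


def leave_one_out_influence(corpus, output):
    """For each training sentence, return how much log-probability the
    output loses when the model is retrained without that sentence.
    Positive means the sentence made the output more likely."""
    # Keep the vocabulary fixed so the two models are comparable.
    vocab = {w for s in corpus + [output] for w in s.split()} | {"<s>", "</s>"}
    full = log_prob(train_bigram(corpus), output, len(vocab))
    results = []
    for i, held_out in enumerate(corpus):
        reduced = corpus[:i] + corpus[i + 1:]
        without = log_prob(train_bigram(reduced), output, len(vocab))
        results.append((held_out, full - without))
    return results


corpus = ["the cat sat", "the dog ran", "a bird flew"]
for sentence, delta in leave_one_out_influence(corpus, "the cat sat"):
    print(f"{delta:+.3f}  {sentence}")
```

Retraining once per training example is exactly what doesn't scale to a real model, which is part of why the question is hard.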

inthehands@hachyderm.io

@bifouba
The word “right” is doing a bit too much work in that sentence, though. Remove it and replace “less useful” with “other,” and I agree.

shafik@hachyderm.io

@inthehands

"HTML parsers in Portland" is another great example

https://hachyderm.io/@shafik/116044646072511071

datarama@hachyderm.io

@inthehands @Linza Fun fact: Have you ever wondered why the landscapes and architecture in Kiki's Delivery Service look so Scandinavian?

Miyazaki had visited Astrid Lindgren in Sweden, to ask for her permission to do an animated interpretation of Pippi Långstrump. She declined, but Miyazaki spent some time touristing around, photographing and sketching things he saw. He then ended up using that in Kiki.

jzb@hachyderm.io

@inthehands I wonder sometimes if a project or vendor will ever go to the trouble of doing something completely ethical, like creating a new programming language with a corresponding model that has only been fed correct training data that they've provided.

It would be interesting if someone did that specifically for non-programmers, to make it easier for people to write one-off programs for their own use.