@atoponce Is this news? It's over a year old. IIRC, it's designed to reduce the amount of data sent off-machine. They were talking about space consumption in Canary release in Feb of 2025 (https://groups.google.com/a/chromium.org/g/chrome-ai-dev-preview-discuss/c/PhgjQg4IoQk)
Looking for all the models that have been downloaded for cache in browser, it's not just limited to one. chrome://on-device-internals/
You can use the Tools tab to 'Load Default' model, then type your prompt in the box below. It's slower than a cloud-based response.
Ask 'how to spell "incorriggable" and it takes 2 seconds to spit out: The correct spelling is incorrigible.
Takes 10 more seconds to explain each part of the word.
Then it crashes.
I'd rather handle spell checks on-box than send every word up to a cloud service.
But, don't ask it what the range of radio delay is between the earth and the moon. Highest Quality says 2.5 milliseconds, and crashes. Fastest inference says 22 seconds, and crashes. Not usable.