Started By
Message

re: U.S gov will get to decide who gets access to GPT-5.6

Posted on 6/25/26 at 9:54 pm to
Posted by Narax
Member since Jan 2023
8328 posts
Posted on 6/25/26 at 9:54 pm to
quote:

It has to do with the heteroscedasticity and homoscedasticity of the manifolds generated by LLM architectures when computed by GPUs.


I'm quite curious, do you have a specific paper in mind here?
Posted by Diego Ricardo
Alabama
Member since Dec 2020
13524 posts
Posted on 6/25/26 at 9:59 pm to
I got to be honest all this stuff smells like the government whoring itself out to market these models overselling themselves. Considering this admin essentially nakedly does pumps like this…
Posted by Bamafig
Member since Nov 2018
6633 posts
Posted on 6/25/26 at 10:06 pm to
quote:

heteroscedasticity and homoscedasticity


Hold on!, let’s not bring them into this conversation! No men in women’s sports!!
Posted by CFDoc
Member since Jan 2013
2293 posts
Posted on 6/26/26 at 5:32 pm to
There’s a good many rabbit holes you can go down here.

The best place to start is googling the latest developments on what’s known as ‘LLM Mapping’ or ‘LLM Fingerprinting’. Specifically the folks looking at how fingerprinting can allow LLM exploits and expose vulnerabilities.

From there, you can go down the latest research paths where groups are trying to really get into the nuts and bolts of where and how probabilistic outputs are governed/mapped by deterministic processes like encoding, attention, etc. LINK

In my current line of work, there’s a good bit of research focused on tying these concepts to the physical properties of GPUs with the initial goal being efficiency focused; however, it’s not a difficult jump between efficiency and exploitation.
Posted by BigBro
Member since Jul 2021
20855 posts
Posted on 6/26/26 at 5:36 pm to
quote:

According to Gen Stanton, they are MAJOR cyber security threats.
if this is true, then we’re all in trouble, because another country is gonna develop a similar product, if they haven’t already, and who the frick knows what happens then

this is all happening much faster than I imagined…
Posted by Powerman
Member since Jan 2004
174796 posts
Posted on 6/26/26 at 6:07 pm to
quote:

I got to be honest all this stuff smells like the government whoring itself out to market these models overselling themselves. Considering this admin essentially nakedly does pumps like this…

They have more big IPOs in the pipeline as well. This could be the feds helping them generate hype. But I think it really is a cyber security issue.
Posted by Powerman
Member since Jan 2004
174796 posts
Posted on 6/26/26 at 6:09 pm to
quote:

if this is true, then we’re all in trouble, because another country is gonna develop a similar product, if they haven’t already, and who the frick knows what happens then

this is all happening much faster than I imagined…

The good thing is firms like Anthropic are using the exploits to assist in cyber security.
Posted by CFDoc
Member since Jan 2013
2293 posts
Posted on 6/26/26 at 7:01 pm to
quote:

But I think it really is a cyber security issue


Did you see copy.fail ?

That was just a precursor and that is full root access to almost every single Linux distribution from the last 10 years all wrapped up in a 700 byte Python file.

Posted by Narax
Member since Jan 2023
8328 posts
Posted on 6/26/26 at 7:44 pm to
quote:

LINK

I read the paper, it seems like the author is trying to build a new framework specifically for a fewshot approach.

It doesn't really have anything to do with exploitation of physical chip sets, that paper seems wholly unconnected to GPU architectures.

I ask because a Blackwell is different physically than Ampere which is also different than Hopper most methods of data poisoning ignore physical architectures as it's hard to guess what back end someone is running.

I'm really not familiar with Google's TPUs, but I do know that they are really really different.
Posted by IMSA_Fan
Member since Jul 2024
885 posts
Posted on 6/26/26 at 7:48 pm to
I’ll go ahead and ask the question - is this legal
Posted by SallysHuman
Lady Palmetto Bug
Member since Jan 2025
23683 posts
Posted on 6/26/26 at 7:55 pm to
Chat said I'd be one of the first... paid subscription has perks. Although I have to say I've been rolling quite nicely with 5.5.

Posted by trinidadtiger
Member since Jun 2017
20306 posts
Posted on 6/26/26 at 8:10 pm to
quote:

quote:
It has to do with the heteroscedasticity and homoscedasticity of the manifolds generated by LLM architectures when computed by GPUs.

I'm quite curious, do you have a specific paper in mind here?


I think he is looking for funding for his NGO...
Posted by CFDoc
Member since Jan 2013
2293 posts
Posted on 6/26/26 at 8:10 pm to
quote:

It doesn't really have anything to do with exploitation of physical chip sets, that paper seems wholly unconnected to GPU architectures.


That paper doesn’t have anything to do with GPU architectures. That paper is more of an example of how recent literature is showing how to deterministically control heteroscedasticity of an LLM. Essentially drawing determinism out of what’s fundamentally a stochastic model.

I don’t know if the open literature is actively publishing exploits of gpu architectures yet or not. I’m just telling you it’s an active topic in the field. And yes, Ampere, Hopper, Blackwell, and Vera Rubin matter. However, at the transistor/ALU level, coupled with RISC, it makes all GPUs much more reverse engineer-able.

TPUs are a different story.
Posted by Veritas
Member since Feb 2005
11001 posts
Posted on 6/26/26 at 8:14 pm to
The idea was always to create limited access to AI. Eventually, you will have to pay for internet in the form of AI.

There will be tiers for how much access you can receive.
Posted by Narax
Member since Jan 2023
8328 posts
Posted on 6/27/26 at 12:35 am to
quote:

That paper is more of an example of how recent literature is showing how to deterministically control heteroscedasticity of an LLM. Essentially drawing determinism out of what’s fundamentally a stochastic model.

Thats been a long standing problem, but its always been seen as a data specific problem, (the generation of LLMs that always RNGed to 27 etc... which was consistent across a number of hardware architectures)

I'd like to see data on where people see RISC or the ALUs forcing determism more than the data. FP precision has drastically changed across NVidia chipsets, what chipsets do people think this hardware determism is actually predictable.

It would be very interesting, and I think it is way early to make a claim about Vera/Rubin as they are targeted at world models.

quote:

it makes all GPUs much more reverse engineer-able.

I would need to see data, thats a really bold claim.
This post was edited on 6/27/26 at 12:37 am
first pageprev pagePage 2 of 2Next pagelast page
refresh

Back to top
logoFollow TigerDroppings for LSU Football News
Follow us on X, Facebook and Instagram to get the latest updates on LSU Football and Recruiting.

FacebookXInstagram