My Forums

Trending

Quick Links:

Tiger Rant

•LSU Recruiting

•O-T Lounge

•Politics

•More Sports

•SEC Rant

•Saints

•Home/Garden

•Outdoors

•Movie/TV

•Food/Drink

•Money

•Tech

•Music

•Travel

•TD Help

Page 1 2

Started By

Message

re: U.S gov will get to decide who gets access to GPT-5.6

Posted on 6/25/26 at 9:54 pm to CFDoc

Posted by Narax

Member since Jan 2023

8328 posts

Posted on 6/25/26 at 9:54 pm to CFDoc

quote:
It has to do with the heteroscedasticity and homoscedasticity of the manifolds generated by LLM architectures when computed by GPUs.

I'm quite curious, do you have a specific paper in mind here?

2 ...

Report Post

Posted by Diego Ricardo

Alabama

Member since Dec 2020

13524 posts

Posted on 6/25/26 at 9:59 pm to hawgfaninc

I got to be honest all this stuff smells like the government whoring itself out to market these models overselling themselves. Considering this admin essentially nakedly does pumps like this…

1 ...

Report Post

Posted by Bamafig

Member since Nov 2018

6633 posts

Posted on 6/25/26 at 10:06 pm to CFDoc

quote:
heteroscedasticity and homoscedasticity

Hold on!, let’s not bring them into this conversation! No men in women’s sports!!

0 ...

Report Post

Posted by CFDoc

Member since Jan 2013

2293 posts

Posted on 6/26/26 at 5:32 pm to Narax

There’s a good many rabbit holes you can go down here.

The best place to start is googling the latest developments on what’s known as ‘LLM Mapping’ or ‘LLM Fingerprinting’. Specifically the folks looking at how fingerprinting can allow LLM exploits and expose vulnerabilities.

From there, you can go down the latest research paths where groups are trying to really get into the nuts and bolts of where and how probabilistic outputs are governed/mapped by deterministic processes like encoding, attention, etc. LINK

In my current line of work, there’s a good bit of research focused on tying these concepts to the physical properties of GPUs with the initial goal being efficiency focused; however, it’s not a difficult jump between efficiency and exploitation.

1 ...

Report Post

Posted by BigBro

Member since Jul 2021

20855 posts

Posted on 6/26/26 at 5:36 pm to CFDoc

quote:
According to Gen Stanton, they are MAJOR cyber security threats.

if this is true, then we’re all in trouble, because another country is gonna develop a similar product, if they haven’t already, and who the frick knows what happens then

this is all happening much faster than I imagined…

1 ...

Report Post

Posted by Powerman

Member since Jan 2004

174796 posts

Posted on 6/26/26 at 6:07 pm to Diego Ricardo

quote:
I got to be honest all this stuff smells like the government whoring itself out to market these models overselling themselves. Considering this admin essentially nakedly does pumps like this…

They have more big IPOs in the pipeline as well. This could be the feds helping them generate hype. But I think it really is a cyber security issue.

1 ...

Report Post

Posted by Powerman

Member since Jan 2004

174796 posts

Posted on 6/26/26 at 6:09 pm to BigBro

quote:
if this is true, then we’re all in trouble, because another country is gonna develop a similar product, if they haven’t already, and who the frick knows what happens then

this is all happening much faster than I imagined…

The good thing is firms like Anthropic are using the exploits to assist in cyber security.

0 ...

Report Post

Posted by CFDoc

Member since Jan 2013

2293 posts

Posted on 6/26/26 at 7:01 pm to Powerman

quote:
But I think it really is a cyber security issue

Did you see copy.fail ?

That was just a precursor and that is full root access to almost every single Linux distribution from the last 10 years all wrapped up in a 700 byte Python file.

0 ...

Report Post

Posted by Narax

Member since Jan 2023

8328 posts

Posted on 6/26/26 at 7:44 pm to CFDoc

quote:
LINK

I read the paper, it seems like the author is trying to build a new framework specifically for a fewshot approach.

It doesn't really have anything to do with exploitation of physical chip sets, that paper seems wholly unconnected to GPU architectures.

I ask because a Blackwell is different physically than Ampere which is also different than Hopper most methods of data poisoning ignore physical architectures as it's hard to guess what back end someone is running.

I'm really not familiar with Google's TPUs, but I do know that they are really really different.

1 ...

Report Post

Posted by IMSA_Fan

Member since Jul 2024

885 posts

Posted on 6/26/26 at 7:48 pm to hawgfaninc

I’ll go ahead and ask the question - is this legal

0 ...

Report Post

Posted by SallysHuman

Lady Palmetto Bug

Member since Jan 2025

23683 posts

Posted on 6/26/26 at 7:55 pm to hawgfaninc

Chat said I'd be one of the first... paid subscription has perks. Although I have to say I've been rolling quite nicely with 5.5.

0 ...

Report Post

Posted by trinidadtiger

Member since Jun 2017

20306 posts

Posted on 6/26/26 at 8:10 pm to Narax

quote:
quote:
It has to do with the heteroscedasticity and homoscedasticity of the manifolds generated by LLM architectures when computed by GPUs.

I'm quite curious, do you have a specific paper in mind here?

I think he is looking for funding for his NGO...

0 ...

Report Post

Posted by CFDoc

Member since Jan 2013

2293 posts

Posted on 6/26/26 at 8:10 pm to Narax

quote:
It doesn't really have anything to do with exploitation of physical chip sets, that paper seems wholly unconnected to GPU architectures.

That paper doesn’t have anything to do with GPU architectures. That paper is more of an example of how recent literature is showing how to deterministically control heteroscedasticity of an LLM. Essentially drawing determinism out of what’s fundamentally a stochastic model.

I don’t know if the open literature is actively publishing exploits of gpu architectures yet or not. I’m just telling you it’s an active topic in the field. And yes, Ampere, Hopper, Blackwell, and Vera Rubin matter. However, at the transistor/ALU level, coupled with RISC, it makes all GPUs much more reverse engineer-able.

TPUs are a different story.

1 ...

Report Post

Posted by Veritas

Member since Feb 2005

11001 posts

Posted on 6/26/26 at 8:14 pm to hawgfaninc

The idea was always to create limited access to AI. Eventually, you will have to pay for internet in the form of AI.

There will be tiers for how much access you can receive.

0 ...

Report Post

Posted by Narax

Member since Jan 2023

8328 posts

Posted on 6/27/26 at 12:35 am to CFDoc

quote:
That paper is more of an example of how recent literature is showing how to deterministically control heteroscedasticity of an LLM. Essentially drawing determinism out of what’s fundamentally a stochastic model.

Thats been a long standing problem, but its always been seen as a data specific problem, (the generation of LLMs that always RNGed to 27 etc... which was consistent across a number of hardware architectures)

I'd like to see data on where people see RISC or the ALUs forcing determism more than the data. FP precision has drastically changed across NVidia chipsets, what chipsets do people think this hardware determism is actually predictable.

It would be very interesting, and I think it is way early to make a claim about Vera/Rubin as they are targeted at world models.