The only way to avoid Grammarly using your data for AI is to pay for 500 accounts

soyagi@yiffit.net · 1 year ago


harmonea@kbin.social · 1 year ago

Framing this solely as fear is extremely disingenuous. Speaking only for myself: I’m not against the development of AI or LLMs in general. I’m against the trained models being used for profit with no credit or cut given to the humans who trained it, willing or unwilling.

It’s not even a matter of “if you aren’t the paying customer, you’re the product” - massive swaths of text used to train AIs were scraped without permission from sources whose platforms never sought to profit from users’ submissions, like AO3. Until this is righted (which is likely never, I admit, because the LLM owners have no incentive whatsoever to change this behavior), I refuse to work with any site that intends to use my work to train LLMs.

valveman@lemmy.eco.br · 1 year ago

I’m against the trained models being used for profit with no credit or cut given to the humans who trained it.

Sorry mate, hell’s gonna get cold before this happens. We’re talking about the biggest moth******ers on earth since always. Do you think Meta/[insert big tech company name here] will start to behave all of a sudden? These people literally KILL people every day for a profit (looking at you, Instagram).

The only way to get something from these scumbags is fining them something like 100k per hour, until they start respecting people’s privacy.

harmonea@kbin.social · edit-2 1 year ago

I did already say I don’t expect this to ever change, so “sorry mate,” but you’re not exactly telling me anything I don’t know here.

But I suspect this was a knee-jerk rant typed before bothering to read past what you quoted. Oh well. Good thing I can still stand against something even if I don’t expect it to change much.

valveman@lemmy.eco.br · 1 year ago

Sorry if it sounded rude (and yeah, it was kind of a rant, sorry). What I’m trying to say is: these people do much worse things and don’t bother to say “sorry” publicly. The only way to make them behave is to fine them a huge amount, just like Norway did.

harmonea@kbin.social · 1 year ago

Well, we can agree on that! Make paying contributors the cheaper option.

I won’t hold my breath though. :')

Jaded@lemmy.dbzer0.com · edit-2 1 year ago

Models need vast amounts of data. Paying individual users isn’t feasible, and like you said, most of it can be scraped.

The only way I see this working is if scraped content is a no-go and then you pay the website, publishing house, record company, etc., which kills any open source solution and doesn’t really help any of the users or creators that much. It also paves the way for certain companies owning a lot of our economy as we move towards an AI-driven society.

It’s definitely a hot mess but the way I see it, the more restrictive we are with it, the more gross monopolies we create for no real gains.

harmonea@kbin.social · 1 year ago

Paying individual users isn’t feasible

Sounds like their problem to solve, not mine.

Laticauda@lemmy.ca · edit-2 1 year ago

I mean, they’re not even giving credit or asking permission, both of which cost nothing. Make a site where people can volunteer their own work. Program the AI to generate a list of citations for all the works it used data from when it provides output (I know that this might be lengthy, that’s fine). If you implement it into any sites or software, make it so that people can opt out of having their data used, etc. It’s not that hard.

Jaded@lemmy.dbzer0.com · edit-2 1 year ago

Most of the data is scraped, so it’s not up to the website. You can’t give a list of citations because it isn’t a search engine: it doesn’t know where the information comes from, and it’s highly transformative, melding information from hundreds if not thousands of different sources.

If it worked only with volunteered work, there simply would not be enough data.

Any law restricting data use in AI is only going to benefit corporations; there isn’t a solution for individual content creators. You can’t pay them for the drop in the bucket they add; the logistics are insane. You can let them opt out, but then you need to do the same for whole websites, which leads to a corporate hellscape where three companies own our whole economy since they are the only ones who can train AIs.

kibiz0r@midwest.social · 1 year ago

I don’t see why those are the only two options.

We could update GPL, CC, etc. licensing so that it specifies whether the author intends to allow their work to be used for LLM training. And you could still put a non-commercial or share-alike constraint on it.

Hooray, open source is saved while greedy grubby hands are thwarted.

Jaded@lemmy.dbzer0.com · 1 year ago

What happens when every corporation and website closes its doors to AI? There isn’t any open source if we can’t use scraped information from Stack Overflow, GitHub, Reddit, etc.

Sure, some users will opt out, but most won’t. Every single website will restrict access, though, and then they will sell the data to Google and Microsoft, who will be the only companies able to build AIs.

kibiz0r@midwest.social · 1 year ago

If I could predict what happens to the tech market when XYZ policy is enacted, I wouldn’t be posting on Lemmy during my tea breaks. Whatever policies end up sticking around, success is gonna require a lot of us having ideas, trying them out, and recombining them.

But I’ll claim this about my personal metric of “success”: If the future of open source looks like copying the extractive data-mining model of big tech and hoping we can shove the entire history of human thought into a blender faster than them, I think we’ve failed.