cross-posted from: https://nom.mom/post/121481
OpenAI could be fined up to $150,000 for each piece of infringing content.https://arstechnica.com/tech-policy/2023/08/report-potential-nyt-lawsuit-could-force-openai-to-wipe-chatgpt-and-start-over/#comments
it’s not even close to that black and white… i’d say it’s a much more grey area:
possibly that you buy a bunch of books by the same author and emulate their style… that’s perfectly acceptable until you start using their characters
if you wrote a research paper about the linguistic and statistical information that makes an authors style, that also wouldn’t be a problem
so there’s something beyond just the authors “style” that they think is being infringed. we need to sort out exactly where the line is. what’s the extension to these 2 ideas that makes training an LLM a problem?