Basically a deer with a human face. Despite probably being some sort of magical nature spirit, his interests are primarily in technology and politics and science fiction.

Spent many years on Reddit and is now exploring new vistas in social media.

  • 0 Posts
  • 2.63K Comments
Joined 1 year ago
cake
Cake day: June 9th, 2023

help-circle





  • you know that a company putting a thing in their terms of service doesn’t make it legally binding, right?

    And you know that doesn’t necessarily imply the reverse? Granting a site a license to use the stuff you post there is a pretty basic and reasonable thing to agree to in exchange for them letting you post stuff there in the first place.

    hence why they all suddenly felt the need to update their terms of services

    As others have been pointing out to you in this thread, that also is not a sign that the previous ToS didn’t cover this. They’re just being clearer about what they can do.

    Go ahead and refrain from using their services if you don’t agree to the terms under which they’re offering those services. Nobody’s forcing you.


  • You mean before or after all the sites updated their ToS it so that they were legally in the clear to sell user posts to AI training companies?

    The ToSes would generally have a blanket permission in them to license the data to third-party companies and whatnot. I went back through historical Reddit ToS versions a little while back and that was in there from the start.

    Also in there was a clause allowing them to update their ToS, so even if the blanket permission wasn’t there then it is now and you agreed to that too.

    Learning from things is a very obviously a completely different process to feeding data into a server farm.

    It is not very obviously different, as evidenced by the fact that it’s still being argued. There are some legal cases before the courts that will clarify this in various jurisdictions but I’m not expecting them to rule against analysis of public data.


  • in the case of ai generated media, companies just decided that they just had the rights to use existing published media, so they harvested it without consent or compensation

    Have you read the ToS of your favourite social media site lately?

    In any event, it might well be that companies (and you yourself) have the rights to use existing published media to train AIs. Copyright doesn’t cover the analysis of public data. I suspect that people wouldn’t like it if copyright got extended to let IP owners prohibit you from learning from their stuff.







  • I just set up an account at fedia.io, which is an mbin instance. When I ran the script posted over here to copy my subscriptions I had 111 subscriptions succeed and 49 fail. I’ve been checking the ones that failed and they’ve all been ones that haven’t had anyone post in them for 5 months or more, so I’m guessing those are “dead” anyway. Seems not too bad, I’ll see how it goes I guess.

    I posted this same response from both of my accounts, I’m going to watch to see how well it federates back and forth.


  • It doesn’t help that whenever this comes up there’s a contingent of users who jump to Ernest’s “defense” by calling the folks raising these issues “concern trolls” and accusing them of shilling for mbin.

    No, this is simply a matter of what is actually working well. The point of federation is that one shouldn’t need to have “loyalty” to any particular instance or any particular platform. Use whichever one’s working.



  • I just gave this a try and I think there’s a potentially worrisome problem, it silently failed on a lot of community subscriptions. The ones that returned HTTP 500 errors were listed in the “fail” list that the importer script generated, but a whole bunch of others returned 404 errors and weren’t listed in either the success or fail lists.

    So I advise those running this to pay attention to the error log to avoid losing track of those communities rather than trusting the “fail” list.