• 1 Post
  • 47 Comments
Joined 9 months ago
Cake day: January 16th, 2024

  • because it encodes semantics.

    Please enlighten me as to how. I admit I don’t know all the internals of the transformer model, but from what I know it encodes precisely only syntactic information, i.e. which token is most likely to come next given a syntactic context window.

    How does it encode semantics? What is the semantics that it encodes? I doubt they have a denotational or operational semantics of natural language; I don’t think something like that even exists, so it has to be some smaller model. Actually, it would be enlightening if you could tell me at least what the semantic domain here is, because I don’t think there’s any naturally obvious choice for that.



  • This has been said multiple times but I don’t think it’s possible to internalize because of how fucking bleak it is.

    The VC/MBA class thinks all communication can be distilled into saying the precise string of words that triggers the stochastically desired response in the consumer. Conveying ideas or information is not the point. This is why ChatGPT seems like the holy grail to them: it effortlessly¹ generates mountains of corporate slop that carry no actual meaning. It’s all form and no substance, because those people – their entire existence, the essence of their cursed dark souls – have no substance.

    ¹ batteries not included