Linux server admin, MySQL/TSQL database admin, Python programmer, Linux gaming enthusiast and a forever GM.

  • 0 Posts
  • 27 Comments
Joined 1 year ago
Cake day: June 8th, 2023


  • So, first of all, thank you for the cogent attempt at responding. We may disagree, but I sincerely respect the effort you put into the comment.

    The specific part that seemed like a pretty big claim to me was that human brains are “simply” more complex neural networks and that their outputs are based strictly on training data.

    Is it not well established that animals learn via reward circuitry, like the role of dopamine in neuromodulation?

    While true, this is way too reductive for a one-to-one comparison with LLMs. Humans have genetic instincts and a body-mind connection that aren’t cleanly mappable onto a neural network. For example, biologists are only just now scraping the surface of the link between the brain and the gut microbiome, which plays a much larger role in cognition than previously thought.

    Another example where the brain = neural network model breaks down is the fact that the two hemispheres are much more separated than previously thought. So much so that some neuroscientists are saying that each person has, in effect, 2 different brains with 2 different personalities that communicate via the corpus callosum.

    There are many more examples I could bring up, but my core point is that the neural network = brain analogy is just that, a simplistic analogy, on the same level as thinking about gravity only as “the force that pulls you downwards”.

    To claim that we understand the brain so fully that we could even model a mosquito’s brain (220,000 neurons) is, I think, mistaken. I’m not saying we’ll never understand the brain well enough to attempt such a thing, I’m just saying that drawing a casual equivalence between mammalian brains and neural networks is woefully inadequate.


  • I’m happy with the Oxford definition: “the ability to acquire and apply knowledge and skills”.

    LLMs don’t have knowledge, as they don’t actually understand anything. They are algorithmic response generators that assign scores to tokens and spit out the highest-scoring token given all the previous tokens.

    If asked to answer 10*5, they can’t reason through the math. They can only recognize 10, * and 5 as tokens that, in the training data, are usually followed by the token 50. Thus 50 is the highest-scoring token and the answer the model will choose. Things get more interesting when you ask questions that aren’t in the training data: with nothing more direct to copy from, it will regurgitate a sequence of tokens that sounds as close as possible to something in the training data. Thus, a hallucination.
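
    To make the scoring idea concrete, here’s a minimal sketch of “pick the highest-scoring next token”. The scores here are toy hard-coded numbers; a real model computes them with a neural network over billions of parameters:

    ```python
    # Toy illustration of greedy next-token selection.
    # The "model" is just a hard-coded score table; a real LLM
    # computes these scores from learned parameters.
    scores = {
        ("10", "*", "5"): {"50": 0.92, "15": 0.05, "105": 0.03},
    }

    def next_token(context: tuple) -> str:
        """Pick the highest-scoring next token for the given context."""
        candidates = scores[context]
        return max(candidates, key=candidates.get)

    print(next_token(("10", "*", "5")))  # -> 50
    ```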

  • I hate the term intellectual property. It’s an umbrella term for vastly different concepts with vastly different legal backgrounds and problems.

    Copyright is theoretically a good thing, giving an artist or writer time to profit from their work before it enters the public domain, which incentivizes creating the work in the first place. The current international agreements around it are absolutely bonkers thanks to Disney. The fact that copyright persists after death, let alone for a century, is complete madness. The artist obviously can’t profit from their work after they’re dead. It’s a shameless cash grab that destroys culture.

    Patents are also theoretically a good thing: companies publish the specifications of their machines in exchange for ten years of exclusive use. Without patents, companies would hide their designs as trade secrets; with them, the designs are guaranteed to become publicly available for anyone to see after a decade. What can be patented needs to be much more heavily restricted, though. Patenting a specific machine design is fine; patenting molecules or math breaks the entire system. Software patents are blatantly absurd and broken.

    EDIT: I should also mention that 3D printers are a success story of the patent system working as intended. The design was patented in 1986, and the inventor made good money selling expensive machines through his own company. In 1996 the patent expired, and we got an explosion of competing machines, ever cheaper and more capable. Everybody won: the inventor made bank during his decade of exclusivity, and then everyone benefited from the design entering the public domain, free for anyone to use.

    Trade secrets, the protection of specific recipes, client lists and strategies, can be abused to shield companies from disclosing information that is very pertinent to their customers and governments. Keeping the Coca-Cola recipe or a client list as a trade secret is fine imho, but companies can also abuse trade secret law to hide systems that lie about your car’s emissions.

    Trademarks help protect consumers against knockoff brands that pretend to be what they’re not. This is the least abused type of “IP”, though that doesn’t mean there aren’t bad actors registering tons of trademarks to squat on designs and names, hoping to force a new company to pay up to use them. Trademark squatting could theoretically be solved by annulling any trademark the holder isn’t actively using; trademarks are currently much too easy to maintain.

    All of this to say: lumping these very different laws together as “IP” is not useful at all when talking about what each piece of legislation is trying to do and how it fails.

  • I really don’t understand this whole “learning” thing that everybody claims these models are doing.

    A Markov chain algorithm that takes text as input and outputs the next predicted word isn’t colloquially called “learning”, yet it’s fundamentally the same process, just less sophisticated (there’s a sketch at the end of this comment).

    They take input, apply a statistical model to it, and generate output derived from the input. Humans, by contrast, have creativity, lateral thinking, and the ability to understand context and meaning. Most importantly, with art and creative writing, they’re trying to express something.

    “AI” has none of these things, just a probability for which token goes next given the tokens that are already there.
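
    For the record, here’s roughly what I mean by a Markov chain predictor: a toy word-level sketch that just counts which word follows which and always predicts the most frequent follower:

    ```python
    # Minimal word-level Markov chain: count which word follows which,
    # then predict the most frequent follower. Toy sketch only.
    from collections import Counter, defaultdict

    def train(text: str) -> dict:
        """Build a table of next-word counts from the training text."""
        follows = defaultdict(Counter)
        words = text.split()
        for prev, nxt in zip(words, words[1:]):
            follows[prev][nxt] += 1
        return follows

    def predict(follows: dict, word: str) -> str:
        """Return the word most often seen after `word` in the training text."""
        return follows[word].most_common(1)[0][0]

    model = train("the cat sat on the mat the cat ran")
    print(predict(model, "the"))  # -> cat
    ```

    Statistics in, most likely next token out. An LLM does the same thing with a vastly more sophisticated statistical model.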

  • Copy-pasting from my other answer here:

    Trite answer: When it’s done

    More in-depth answer: Currently there’s no set date. It depends on how quickly they can tear out all the WebSockets code and replace it with plain HTTP (that’s the BIG change, and it will fix a lot of different things), and then test those changes. The hot_rank fix has already been merged, so that’s done, but they want a stable, cohesive release with all the good stuff in it.

    Current estimates I’ve seen range from one to two weeks, but it all depends on how fast they can get it coded and tested.
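
    If it helps to picture the change: with WebSockets the client keeps a connection open and the server pushes every update; with plain HTTP the client just asks for fresh data when it wants it. A rough sketch of the HTTP side (the endpoint path and parameters here are illustrative guesses, not necessarily the exact Lemmy API):

    ```python
    # Rough sketch of fetching posts over plain HTTP instead of holding
    # a WebSocket open. Endpoint and parameters are illustrative, not
    # necessarily the exact Lemmy API.
    import requests

    def fetch_posts(instance: str, page: int = 1) -> list:
        """One request, one response; no persistent connection to manage."""
        resp = requests.get(
            f"https://{instance}/api/v3/post/list",
            params={"sort": "Hot", "page": page},
            timeout=10,
        )
        resp.raise_for_status()
        return resp.json().get("posts", [])

    # The client simply re-requests when it wants fresh data,
    # instead of the server pushing updates over a socket.
    posts = fetch_posts("lemmy.ml")
    ```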