• 0 Posts
  • 132 Comments
Joined 1 year ago
cake
Cake day: June 23rd, 2023

help-circle

  • I used to play 1v1 Ticket to Ride matches against my wife using the app.

    As background: I’m not a very competitive gamer, but I’m decent at problem solving. When I first learned TtR, I played with fairly … great players. One of my friends was (is?) nationally ranked. They routinely beat the ever-loving crap out of me. I think of the dozens of games we’ve played, I have won maybe 10-20% of the time?

    My wife isn’t bad at TtR, but she doesn’t see things the same way in terms of strategy.

    We had this one game where I drew a bunch of short routes all over the map, which blocked her early in the game, and a series of lucky route draws lead me to connect them, inadvertently blocking her at least twice, including on the last play, where I was just dumping cars to end the game.

    She was always a little upset when I beat her, but this time the discrepancy was so bad and she was so upset. I just stopped playing Ticket to Ride - like, at all.




  • You say “Not even close.” in response to the suggestion that Apple’s research can be used to improve benchmarks for AI performance, but then later say the article talks about how we might need different approaches to achieve reasoning.

    Now, mind you - achieving reasoning can only happen if the model is accurate and works well. And to have a good model, you must have good benchmarks.

    Not to belabor the point, but here’s what the article and study says:

    The article talks at length about the reliance on a standardized set of questions - GSM8K, and how the questions themselves may have made their way into the training data. It notes that modifying the questions dynamically leads to decreases in performance of the tested models, even if the complexity of the problem to be solved has not gone up.

    The third sentence of the paper (Abstract section) says this “While the performance of LLMs on GSM8K has significantly improved in recent years, it remains unclear whether their mathematical reasoning capabilities have genuinely advanced, raising questions about the reliability of the reported metrics.” The rest of the abstract goes on to discuss (paraphrased in layman’s terms) that LLM’s are ‘studying for the test’ and not generally achieving real reasoning capabilities.

    By presenting their methodology - dynamically changing the evaluation criteria to reduce data pollution and require models be capable of eliminating red herrings - the Apple researchers are offering a possible way benchmarking can be improved.
    Which is what the person you replied to stated.

    The commenter is fairly close, it seems.


  • Monument@lemmy.sdf.orgtoMemes@lemmy.mlToxicity
    link
    fedilink
    English
    arrow-up
    3
    ·
    27 days ago

    That’s very fair, indeed.

    Perhaps awareness of one will spark awareness of the other. I suppose my concern is that plasticisers are sort of a ‘hidden’ risk, for the most part. They’re used in nearly every food packaging (and prep, such as hoses) that isn’t contained in glass, or served up in its own peel.


  • Monument@lemmy.sdf.orgtoMemes@lemmy.mlToxicity
    link
    fedilink
    English
    arrow-up
    42
    ·
    27 days ago

    Microplastics are terrifying and all that, but I’m sort of more worried about plasticisers like BPA, BPF, BPS and the rest of the alphabet of BP-whatever’s that was created and brought into use after the dangers of BPA were realized.

    Just a heads up - if something plastic says it’s BPA-free, it probably uses a different bisphenol compound that is less studied than BPA. And is likely as toxic (or even more toxic)!

    But nobody ever talks about those, because science words.


  • Don’t be sorry to say that! I think the idea is pretty darn cute. When everyone tells you how amazingly stylish, practical, and clever you are, remember me!
    (But take all the credit for the idea for yourself - unless some poor fashionless soul doesn’t like it, then definitely blame me for a bad suggestion.)

    My wife has one of the neck strap ones, and she doesn’t like wearing it for the same reason. My brain just assumed they took one of the mounting plates from one of those and hooked it to one of those sproingy straps.

    Remotes are tough. We have a dedicated holder that is just where each remote goes as soon as it is no longer touching a hand, because they otherwise do get lost. Despite that, I’ve even considered 3d printing an AirTag holder that I can glue to the remotes, although that would just mean pointing my phone at the couch while it tells me they’re somewhere ‘in there.’


  • Monument@lemmy.sdf.orgtoShirts That Go Hard@lemmy.worldWeird kink though
    link
    fedilink
    English
    arrow-up
    48
    arrow-down
    4
    ·
    1 month ago

    I became disappointed when I zoomed in to realize she had a wallet chain and not a sproingy yellow coiled lanyard thing that was somehow attached to her phone. (Sorry, Amazon link: One of these)
    I don’t know why. I guess I just thought the idea was kind of cute and fun. This dad-fucking, bacon grease swilling, subway texter uses a cute little bouncy cord thing to keep her phone handy, amidst an otherwise austere getup - just a zany detail to contrast with the rest. Alas. Just a boring ass wallet chain.









  • To your point about billing -
    My insurer recently informed me that a claim submitted last September had been denied. Looking at the original explanation of benefits from September, it indicated that the insurer didn’t think the medical code was appropriate for the appointment, and wanted more information - stating they would work with the hospital to work it out.
    I haven’t heard anything from the hospital, but I’m growing concerned they may just send the bill to collections due to the time elapsed.


  • An ex from a meaningful, but fraught relationship tried to seduce me a few months after we had broken up. In the interim, I had started dating someone new, and I rejected the advances.

    My ex was angry and lashing out. She said a few random insults about my new partner (implying she had manipulated me with sex), before finally saying “well, I hope she enjoys your magical penis!” (It’s not magical. The tiny wizard hat is purely for decoration.)