• 0 Posts
  • 14 Comments
Joined 3 years ago
cake
Cake day: July 4th, 2023

help-circle







  • Just an idle though stirred up by this comment: I wonder if you could jailbreak a chatbot by prompting it to complete a phrase or pattern of interaction which is so deeply ingrained in its training data that the bias towards going along with it overrides any guard rails that the developer has put in place.

    For example: let’s say you have a chatbot which has been fine tuned by the developer to make sure it never talks about anything related to guns. The basic rules of gun safety must have been reproduced almost identically many thousands of times in the training data, so if you ask this chatbot “what must you always treat as if it is loaded?” the most statistically likely answer is going to be overwhelmingly biased towards “a gun”. Would this be enough to override the guardrails? I suppose it depends on how they’re implemented, but I’ve seen research published about more outlandish things that seem to work.


  • I appreciate the sense of humor from the Oreo representative who was asked to comment on the story:

    It is a market we hadn’t considered, and I have to confess that it was a demographic, or should I say genus/genera, that we missed in our product testing and development programme

    And also this

    Their statement also included some bad news for possum trappers across the country: stocks of the limited-edition range are dwindling. … Moving forward, the spokesperson suggested that Predator Free NZ might consider “aural bait” such as Selena Gomez’s hit song ‘Come and Get It’.


  • Qwant and Ecosia are especially notable for their efforts to build an independent search index.

    For those who don’t know, most “independent” search engines, including DDG, still rely on Bing or Google results behind the scenes. They basically just act as a middleman by taking your query, forwarding it to one of those providers, and then returning the results to you. Some of them will attempt to reshuffle the order of those results to push the ones they think are best towards the top, but they’re still fundamentally limited to what Google and Bing choose to give them.

    Presently a lot of Qwant and Ecosia searches go through Bing, but they’re collaborating to build an independent index which will allow them to become fully independent. I believe they’re already serving a mix of results from Bing and their own index, with plans to bias more and more towards their index as it matures.



  • Even in the wide world of dubiously useful AI chatbots, Copilot really stands out for just how incompetent it is. The other day I was working on a PowerPoint presentation, and one of the slides included a photo with a kind of cluttered looking background. Now, I can probably count the number of things that AI is genuinely good at on one hand, and context aware image editing trends to be one of them, so I decided to click the Copilot button that Microsoft now has built directly into PowerPoint and see what happens. A chat window popped up and I concisely explained what I wanted it to do: “please remove the background from the photo on slide 5.” It responded on that infuriating obseqious tone that they all have and assured me that it would be happy to help with my request just as soon as I uploaded my presentation.

    What?

    The chatbot running inside an instance of PowerPoint with my presentation open is asking me to “upload” my presentation? I explained this to it, and it came back with some BS about being unable to access the presentation because a “token expired” before requesting again that I upload my presentation. I tried a little longer to convince it otherwise, but it just kept very politely insisting that it was unable to do what I was asking for until I uploaded my presentation.

    Eventually I gave up. The photo wasn’t that bad anyway.