• 0 Posts
  • 199 Comments
Joined 3 years ago
cake
Cake day: June 18th, 2023

help-circle














  • The description of this community is not a hard rule written in stone, and I would treat it as more of a vibe than a criteria.

    If you want to take it literally, then yes, Plex doesn’t count, neither does cloudflare or wordpress. And many other proprietary systems commonly used by the self hosting community.

    But I think the spirit of this community is a bit more loose, and there is room for the likes of Plex.




  • int add(int a, int b) {
        return a + b;
    }
    

    This code is clearly functional, it’ll compile and execute.

    However, the customer actually needs the code to do a saturating add.

    With that knowledge, we can clearly see that the code is not correct. It will not saturate, it will wrap around instead.


    Without that knowledge, an LLM will happily write some basic unit tests that won’t cover the saturation edge case, and the bug would live on until its hit in prod.

    If you’re lucky, and your function doco is good, the LLM might spot the bug, and notify you.

    My personal preference for how to generate tests is to ask the agent to write specific tests. E.g: “write a test for add that demonstrates that it saturates”.