Settings

Theme

Show HN: Engine – A multi-LLM alternative to Codex

enginelabs.ai

16 points by sdspurrier 10 months ago · 15 comments

Reader

sudb 10 months ago

I worked on this! Happy to answer any questions anyone has.

atlas_mugged 10 months ago

What are the limitations are there in terms of tasks this can handle? How does this compare with the other products out there? There are plenty of options...

  • sdspurrierOP 10 months ago

    Depends on your set of tasks but we use Engine for the bottom ~50% of issues by complexity. We have a pretty good swe-bench score from a while back but it's got much better since!

    We have also focused on workflow integrations so you can assign issues from Linear, Jira, Trello etc which makes it more useful for teams.

    • jackmpcollins 10 months ago

      Seems to me that integrations will be the most important component of tools like this. As an engineer I get my context from video calls with customers and other engineers, slack messages, emails, docs online, using the product myself, etc. So an auto-engineer should do the same.

diminikolaou 10 months ago

This is cool. I can see the anti-monopoly of OpenAI argument, but apart from that is there a strong argument of being multi-LLM for a Codex-like agent?

  • sdspurrierOP 10 months ago

    We often find that some models perform better on certain types of repo. For example Claude 3.5/7 is typically much better at frontends. That's why we let you switch up the model for each repo.

jackmpcollins 10 months ago

I've already merged my first Engine PR! Being able to review PRs like normal and it updates its work is very cool.

julvo 10 months ago

Looks great! What's your experience of using this for working on real world production code?

simvirdi 10 months ago

Looks cool - do you have any benchmarks? How do you compare to other products out there?

  • sudb 10 months ago

    We last submitted a SWE-Bench verified result in November 2024 - at the time I believe we were in the top 5 entrants.

    We expect Engine to be as good as the other code-writing agents out there at the moment - we understand almost everyone in the space to be using very similar base models and agent scaffolding.

RHSman2 10 months ago

Demo’s this 6 months ago. Super excited to see how far it has come since!!!

ca508 10 months ago

been following engine from afar for a while, super cool to see it on HN. didn't see it had a free plan, will try it out.

ph94robotics 10 months ago

Boom been waiting for something like this!

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection