Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
mttaggart@infosec.exchangeM

mttaggart@infosec.exchange

@mttaggart@infosec.exchange
About
Indlæg
3
Emner
1
Fremhævelser
0
Grupper
0
Følgere
0
Følger
0

Vis Original

Indlæg

Seneste Bedste Controversial

  • Problem: LLMs can't defend against prompt injection.
    mttaggart@infosec.exchangeM mttaggart@infosec.exchange

    What are we doing with our time on this earth

    https://www.promptarmor.com/resources/claude-cowork-exfiltrates-files
    https://www.varonis.com/blog/reprompt

    Ikke-kategoriseret

  • Problem: LLMs can't defend against prompt injection.
    mttaggart@infosec.exchangeM mttaggart@infosec.exchange

    @cR0w That's really where all the troubles began, isn't it

    Ikke-kategoriseret

  • Problem: LLMs can't defend against prompt injection.
    mttaggart@infosec.exchangeM mttaggart@infosec.exchange

    Problem: LLMs can't defend against prompt injection.

    Solution: A specialized filtering model that detects prompt injections.

    Problem: That too is susceptible to bypass and prompt injection.

    Solution: We reduce the set of acceptable instructions to a more predictable space and filter out anything that doesn't match.

    Problem: If you over-specialize, the LLM won't understand the instructions.

    Solution: We define a domain-specific language in the system prompt, with all allowable commands and parameters. Anything else is ignored.

    Problem: We just reinvented the CLI.

    Ikke-kategoriseret
  • Log ind

  • Har du ikke en konto? Tilmeld

  • Login or register to search.
Powered by NodeBB Contributors
Graciously hosted by data.coop
  • First post
    Last post
0
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper