Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo.

This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo.

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
12 Indlæg 11 Posters 26 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • mttaggart@infosec.exchangeM This user is from outside of this forum
    mttaggart@infosec.exchangeM This user is from outside of this forum
    mttaggart@infosec.exchange
    wrote sidst redigeret af
    #1

    This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

    https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

    reedmideke@mastodon.socialR wcbdata@vis.socialW z3r0fox@mastodon.socialZ keithzg@fediverse.keithzg.caK kkarhan@c.imK 9 Replies Last reply
    2
    0
    • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

      This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

      https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

      reedmideke@mastodon.socialR This user is from outside of this forum
      reedmideke@mastodon.socialR This user is from outside of this forum
      reedmideke@mastodon.social
      wrote sidst redigeret af
      #2

      @mttaggart The minister responsible for that AI Scribe project's explanation of why it was all OK was incredibly bad https://mastodon.social/@reedmideke/116570464172955876

      1 Reply Last reply
      0
      • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

        This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

        https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

        wcbdata@vis.socialW This user is from outside of this forum
        wcbdata@vis.socialW This user is from outside of this forum
        wcbdata@vis.social
        wrote sidst redigeret af
        #3

        @mttaggart It's already too late. This garbage is not only already in our medical records, but it's also being used to train the next generation of models, which will coupound (and hide) the issue!

        1 Reply Last reply
        0
        • tanyakaroli@expressional.socialT tanyakaroli@expressional.social shared this topic
        • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

          This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

          https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

          z3r0fox@mastodon.socialZ This user is from outside of this forum
          z3r0fox@mastodon.socialZ This user is from outside of this forum
          z3r0fox@mastodon.social
          wrote sidst redigeret af
          #4

          @mttaggart Medicine should probably stick to machine learning pattern recognition in diagnostics, that seems useful? from what I've read?

          shaulaevans@zirk.usS 1 Reply Last reply
          0
          • z3r0fox@mastodon.socialZ z3r0fox@mastodon.social

            @mttaggart Medicine should probably stick to machine learning pattern recognition in diagnostics, that seems useful? from what I've read?

            shaulaevans@zirk.usS This user is from outside of this forum
            shaulaevans@zirk.usS This user is from outside of this forum
            shaulaevans@zirk.us
            wrote sidst redigeret af
            #5

            @z3r0fox @mttaggart If you dig into it deeper, it is also problematic.

            1 Reply Last reply
            0
            • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

              This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

              https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

              keithzg@fediverse.keithzg.caK This user is from outside of this forum
              keithzg@fediverse.keithzg.caK This user is from outside of this forum
              keithzg@fediverse.keithzg.ca
              wrote sidst redigeret af
              #6

              @mttaggart@infosec.exchange I was cynically thinking to myself “and what are the chances that an industry-loving institution like the Ontario government had any conclusion other than ‘well we’ll just choose to use Good AI and that will be fine’, probably 100%” and jumping to the report’s conclusions,

              • establish KPI targets to measure and track Microsoft Copilot Chat’s adoption
              • take actions to increase use of Microsoft Copilot Chat to the targeted rates and usage in the OPS
              • educate OPS staff through AI training about the dangers of using non-Microsoft browsers when accessing AI websites

              So, yeah, they did an audit showing LLMs are wildly unreliable and . . . concluded they should encourage use of Microsoft LLM products.

              Their audit criteria also included “having due regard for economy”.

              1 Reply Last reply
              0
              • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

                https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

                kkarhan@c.imK This user is from outside of this forum
                kkarhan@c.imK This user is from outside of this forum
                kkarhan@c.im
                wrote sidst redigeret af
                #7

                @mttaggart Seriously, any medical professional would get banned for malpractice if they had such a huge error rate.

                1 Reply Last reply
                0
                • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                  This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

                  https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

                  avuko@infosec.exchangeA This user is from outside of this forum
                  avuko@infosec.exchangeA This user is from outside of this forum
                  avuko@infosec.exchange
                  wrote sidst redigeret af
                  #8

                  @mttaggart we all want the autodocs we were promised in sci-fi, but genAI is not that.

                  PS: medicine is already a field suffering heavily from biases. Adding automated bias at scale is gonna literally kill millions more of us.

                  1 Reply Last reply
                  0
                  • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                    This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

                    https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

                    tehstu@hachyderm.ioT This user is from outside of this forum
                    tehstu@hachyderm.ioT This user is from outside of this forum
                    tehstu@hachyderm.io
                    wrote sidst redigeret af
                    #9

                    @mttaggart and the evaluation noted in figure 7 there was despite the audio made available to the vendors, who then provided the results, if I'm reading the report correctly. Not even a live demonstration.

                    1 Reply Last reply
                    0
                    • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                      This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

                      https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

                      flyingpenguin@infosec.exchangeF This user is from outside of this forum
                      flyingpenguin@infosec.exchangeF This user is from outside of this forum
                      flyingpenguin@infosec.exchange
                      wrote sidst redigeret af
                      #10

                      @mttaggart their audit reads like a Wirken requirements document written by someone who did not know Wirken existed. I do five or six calls on this a week now, which is why I open-sourced and started giving Wirken away for free. I've updated the marketing copy here, but I'll soon release a line-by-line response to the Ontario audit: https://gebruder.ottenheimer.app/wirken

                      1 Reply Last reply
                      0
                      • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                        This entire report from the Ontario government on genAI systems is worth a read, but the review of healthcare scribe accuracy is pretty devastating, imo. This has to work for the tech to be worth anything. If the notes in the chart are wrong, the whole thing falls apart.

                        https://www.auditor.on.ca/en/content/specialreports/specialreports/en26/2026_AI_EN.pdf

                        mttaggart@infosec.exchangeM This user is from outside of this forum
                        mttaggart@infosec.exchangeM This user is from outside of this forum
                        mttaggart@infosec.exchange
                        wrote sidst redigeret af
                        #11

                        I get to see this in action. Doctors want transcription and summarization services because of the challenges they face getting quickly familiar with a patient in a crazy short amount of time. They also want to automate notetaking for rounds, which can be chaotic. Problem is, these tools suck in chaotic situations, and even in relatively normal ones, hallucination abounds.

                        There will always be a claim of human review, but I know all too well that it's working against the current to have a human reviewer not assume the model got it right. What's more, those safeguards will eventually be seen as cost centers and redundancies—well, at least until the lawsuits.

                        One other thing. As noted above, these model-generated fields in charts are a) being used as training material for other models, and b) being used as input for other generative tools without human review. The potential for compound errors and model collapse is immense.

                        delta_vee@cosocial.caD 1 Reply Last reply
                        0
                        • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                          I get to see this in action. Doctors want transcription and summarization services because of the challenges they face getting quickly familiar with a patient in a crazy short amount of time. They also want to automate notetaking for rounds, which can be chaotic. Problem is, these tools suck in chaotic situations, and even in relatively normal ones, hallucination abounds.

                          There will always be a claim of human review, but I know all too well that it's working against the current to have a human reviewer not assume the model got it right. What's more, those safeguards will eventually be seen as cost centers and redundancies—well, at least until the lawsuits.

                          One other thing. As noted above, these model-generated fields in charts are a) being used as training material for other models, and b) being used as input for other generative tools without human review. The potential for compound errors and model collapse is immense.

                          delta_vee@cosocial.caD This user is from outside of this forum
                          delta_vee@cosocial.caD This user is from outside of this forum
                          delta_vee@cosocial.ca
                          wrote sidst redigeret af
                          #12

                          @mttaggart Family full of medical people, and can confirm they're all desperate for reliable transcription. That's what they've been sold, and it's not like they typically have time to spare to go over their own recordings to check WER or validate the summaries.

                          Some recognize the problem, though:

                          https://cosocial.ca/@delta_vee/116581810079302048

                          1 Reply Last reply
                          0
                          • pelle@veganism.socialP pelle@veganism.social shared this topic
                          Svar
                          • Svar som emne
                          Login for at svare
                          • Ældste til nyeste
                          • Nyeste til ældste
                          • Most Votes


                          • Log ind

                          • Har du ikke en konto? Tilmeld

                          • Login or register to search.
                          Powered by NodeBB Contributors
                          Graciously hosted by data.coop
                          • First post
                            Last post
                          0
                          • Hjem
                          • Seneste
                          • Etiketter
                          • Populære
                          • Verden
                          • Bruger
                          • Grupper