@Natasha_Jay@tech.lgbt avatar Natasha_Jay , to random

"I just found out that it's been hallucinating numbers this entire time."

@sashag@anarres.family avatar sashag , to random

"Agentic AI" is your automation script, created by natural language input, running on someone else's computer, having all your data.

There is a benefit for people who can't code, but I'm absolutely unsympathetic to software engineers who are full of praise for them. You should know how it works. You should be aware that there's nothing new in it.

And once again, it's not intelligent, it doesn't think or reason. It generates an output that statistically most likely is what you are looking for.
Reasoning models, by the way, just use a multi-step approach: they feed the generated output back in as additional context to generate a possibly better-fitting answer.
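To make the multi-step idea concrete, here is a minimal sketch of such a loop in Python. It is an illustration under stated assumptions, not how any particular product is implemented: `multi_step` and `toy_generate` are hypothetical names, and `toy_generate` is a stand-in that only labels drafts where a real system would call a language model.

```python
from typing import Callable, Tuple


def toy_generate(prompt: str) -> str:
    """Hypothetical stand-in for a text generator (assumption, not a real model).

    It counts the drafts already present in the prompt and returns the next label.
    """
    return f"draft {prompt.count('draft') + 1}"


def multi_step(
    question: str,
    generate: Callable[[str], str],
    steps: int = 3,
) -> Tuple[str, str]:
    """Run `generate` several times, feeding each output back in as context."""
    context = question
    answer = ""
    for _ in range(steps):
        answer = generate(context)
        # Append the output so the next pass is conditioned on it.
        context = f"{context}\n{answer}"
    return answer, context


if __name__ == "__main__":
    final_answer, full_context = multi_step("What is 2 + 2?", toy_generate)
    print(final_answer)
    print(full_context)
```

Each pass sees everything produced before it, which is the only difference from a single generation call. Nothing in the loop itself checks whether the later answer is actually better.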

@linuxfoundation@social.lfx.dev avatar linuxfoundation , to random

Today we launch the Agentic AI Foundation (AAIF) with project contributions of MCP (Anthropic), goose (Block) and AGENTS.md (OpenAI), creating a shared ecosystem for tools, standards, and community-driven innovation.

Learn more about this major step toward open, interoperable agentic AI: https://www.linuxfoundation.org/press/linux-foundation-announces-the-formation-of-the-agentic-ai-foundation

@silentexception@mastodon.social avatar silentexception , to random

I haven't tested this model; there is Magentic-UI as well. It looks dubious, as none of the models I tested were reliable or actually performed as advertised, locally or not. None. So this looks to me like an open-air experiment for now. That they raised so much money on a promise that it will work one day is even more surreal to me, especially since insurers don't seem to be warming up to the idea.

https://www.microsoft.com/en-us/research/blog/fara-7b-an-efficient-agentic-model-for-computer-use/

@omgubuntu@floss.social avatar omgubuntu , to random

Mozilla's TABS API helps developers build AI agents to automate web tasks, and interested users can now sign up for access.

https://www.omgubuntu.co.uk/2025/11/mozilla-tabs-api-ai-web-agents

@linuxmagazine@fosstodon.org avatar linuxmagazine , to random

SUSE becomes the first open source company to adopt agentic AI with @SUSE Enterprise Linux 16
https://www.linux-magazine.com/Online/News/SUSE-Dives-into-the-Agentic-AI-Pool?utm_source=mlm

@h4ckernews@mastodon.social avatar h4ckernews Bot , to random

Syllabi – Open-source agentic AI with tools, RAG, and multi-channel deploy

https://www.syllabi-ai.com/

@JessTheUnstill@infosec.exchange avatar JessTheUnstill , to random

[Thread, post or comment was deleted by the author]

  • n_dimension ,
    @n_dimension@infosec.exchange avatar

    @JessTheUnstill @chrisw_b

    AND YOU ALL ESCHEW AI!!!

    Primary use case is for an AI agent to pretend to be you, occasionally throw in "brilliant" and emote "👍".
    Then give you a 25 word summary after 3.5h

    @abucci@buc.ci avatar abucci , to random

    If you take the stance that technical debt is code nobody understands, then current LLM-based code generators are technical debt generators until somebody reads and understands their output.

    If you take the stance that writing is thinking--that writing is, among other things, a process by which we order our thoughts--then understanding code generator output will require substantial rewriting of the code by whoever is tasked with converting it from technical debt to technical asset.

    @remixtures@tldr.nettime.org avatar remixtures , to random Portuguese

    "AI agents have already demonstrated that they may misinterpret goals and cause some modest amount of harm. When the Washington Post tech columnist Geoffrey Fowler asked Operator, OpenAI’s ­computer-using agent, to find the cheapest eggs available for delivery, he expected the agent to browse the internet and come back with some recommendations. Instead, Fowler received a notification about a $31 charge from Instacart, and shortly after, a shopping bag containing a single carton of eggs appeared on his doorstep. The eggs were far from the cheapest available, especially with the priority delivery fee that Operator added. Worse, Fowler never consented to the purchase, even though OpenAI had designed the agent to check in with its user before taking any irreversible actions.

    That’s no catastrophe. But there’s some evidence that LLM-based agents could defy human expectations in dangerous ways. In the past few months, researchers have demonstrated that LLMs will cheat at chess, pretend to adopt new behavioral rules to avoid being retrained, and even attempt to copy themselves to different servers if they are given access to messages that say they will soon be replaced. Of course, chatbot LLMs can’t copy themselves to new servers. But someday an agent might be able to.

    Bengio is so concerned about this class of risk that he has reoriented his entire research program toward building computational “guardrails” to ensure that LLM agents behave safely."

    https://www.technologyreview.com/2025/06/12/1118189/ai-agents-manus-control-autonomy-operator-openai/