Agentic AI

Agentic systems are developing rapidly, evolving from passive tools into proactive, decision-making entities. These systems can plan, execute tasks, and adapt to changing environments. While promising, agentic AI also raises challenges around alignment, safety, and ethics.

The following is an early effort to catalog the work being done in this area.

Overview of agentic systems

Notable agentic systems

  • Operator: An agent from OpenAI that can use a browser to perform tasks.

  • Mariner: A research prototype from Google DeepMind, built with Gemini 2.0, that combines strong multimodal understanding and reasoning capabilities to automate tasks in a browser.

  • Magentic-One: A generalist multi-agent system from Microsoft for solving complex tasks.

Agentic Safety Benchmarks

Agentic Capability Benchmarks