Amazon enters the AI agent race with nova act

In partnership with

Transform your hiring with Flipped.ai – the hiring Co-Pilot that's 100X faster. Automate hiring, from job posts to candidate matches, using our Generative AI platform. Get your free Hiring Co-Pilot.

Dear Reader,

Flipped.ai’s weekly newsletter is read by more than 75,000 professionals, entrepreneurs, decision-makers, and investors around the world.

In this newsletter, we’re excited to share that Amazon has officially entered the competitive AI agent landscape with the launch of Nova Act, an advanced AI model designed to autonomously perform tasks within web browsers. Unveiled on Monday, Nova Act represents the first major release from Amazon's Artificial General Intelligence (AGI) Lab and positions the tech giant as a direct competitor to OpenAI's Operator and Anthropic's Computer Use feature. Alongside this announcement, Amazon has expanded access to its broader Nova family of foundation models through a dedicated portal, nova.amazon.com, making these cutting-edge AI capabilities more accessible to developers and tech enthusiasts across the United States.

Before, we dive into our newsletter, checkout our sponsor for this newsletter.

You’ve heard the hype. It’s time for results.

After two years of siloed experiments, proofs of concept that fail to scale, and disappointing ROI, most enterprises are stuck. AI isn't transforming their organizations — it’s adding complexity, friction, and frustration.

But Writer customers are seeing positive impact across their companies. Our end-to-end approach is delivering adoption and ROI at scale. Now, we’re applying that same platform and technology to build agentic AI that actually works for every enterprise.

This isn’t just another hype train that overpromises and underdelivers.
It’s the AI you’ve been waiting for — and it’s going to change the way enterprises operate. Be among the first to see end-to-end agentic AI in action. Join us for a live product release on April 10 at 2pm ET (11am PT).

Can't make it live? No worries — register anyway and we'll send you the recording!

Amazon's strategic move into AI agents

Nova act: Amazon's entry into the AI agent race

Amazon

In a significant expansion of its artificial intelligence portfolio, Amazon has unveiled Nova Act, a specialized AI model designed to control web browsers and execute complex tasks autonomously. This strategic launch positions Amazon firmly in the rapidly evolving AI agent space, where tech giants and startups alike are racing to develop systems capable of moving beyond simple chat interactions to perform meaningful tasks on behalf of users.

The Nova Act SDK (Software Development Kit) allows developers to create agents that can navigate websites, fill out forms, make purchases, and interact with various web elements such as drop-down menus, date pickers, and pop-up dialogs. These capabilities represent a major step forward in the agent ecosystem, potentially transforming how users interact with digital services.

Behind the innovation: Amazon's AGI lab

Nova Act marks the first public product released by Amazon's San Francisco-based AGI Lab, which was established in December 2024. Led by David Luan, Amazon's Vice President of AGI Autonomy and former OpenAI VP, alongside renowned robotics scholar Pieter Abbeel, the lab represents Amazon's commitment to advancing artificial general intelligence.

"We really think agents are the last missing piece on the path to general intelligence," said Luan in a recent interview. This statement underscores Amazon's view that agent capabilities are not merely a product feature but a fundamental stepping stone toward more advanced AI systems.

Both Luan and Abbeel bring significant expertise to the project, having previously founded successful AI startups before joining Amazon. Luan established Adept, which focused on automating enterprise workflows, while Abbeel co-founded Covariant, a robotics company. Their leadership signals Amazon's serious intentions in the agentic AI space.

The competitive landscape: OpenAI, Anthropic, and beyond

Nova Act enters an increasingly crowded field of AI agents, with notable competitors including:

  • OpenAI's operator: Capable of performing independent web-based tasks including filling out forms, ordering products, booking flights, and making reservations

  • Anthropic's computer use: Allows control of software on a PC, performing on-screen tasks such as moving cursors, clicking buttons, and typing text

  • Various offerings from Google, Microsoft, Salesforce, and emerging startups

What sets Nova Act apart, according to Amazon, is its reliability and performance. The company claims that Nova Act outperforms competitors on several internal benchmarks, including the ScreenSpot Web Text evaluation, where it achieved a 94% score compared to OpenAI's 88% and Anthropic's 90%.

Nova act's technical capabilities and use cases

Breaking down complex tasks into manageable commands

At its core, Nova Act is designed to simplify complex web-based workflows by breaking them down into what Amazon calls "atomic commands." These include fundamental actions like:

  • Searching for information

  • Completing checkout processes

  • Answering questions based on on-screen content

  • Interacting with specific UI elements like dropdowns and popups

The SDK provides developers with tools to add detailed instructions to these commands, enhancing the agent's ability to handle nuanced scenarios. For example, developers can specify that an agent should avoid optional add-ons like insurance when completing a purchase.

Real-world applications and use cases

Amazon has highlighted several practical applications for Nova Act, including:

  1. Automated routine tasks: Submitting out-of-office notifications, scheduling calendar holds, or enabling automatic email replies

  2. Recurrent purchases: Setting up regular orders, such as scheduling a salad delivery every Tuesday evening

  3. Reservation management: Making restaurant or service bookings without manual intervention

  4. Form completion: Filling out standardized forms that would otherwise require repetitive human input

The company envisions Nova Act enabling more complex multi-step tasks in the future, such as organizing events or handling sophisticated IT workflows to increase business productivity.

Advanced capabilities and deployment options

Nova Act offers several advanced features that enhance its practical utility:

  • Headless operation: Agents can run without displaying a browser interface, enabling background processing for business applications

  • Parallel threading: The ability to run multiple agents simultaneously to handle larger workloads

  • API integration: Developers can incorporate API calls to strengthen reliability and extend functionality

  • Scheduled execution: Tasks can be programmed to run at specific times without user intervention

These capabilities make Nova Act particularly useful for enterprise applications where reliability and automation are paramount.

The broader nova ecosystem

Amazon's complete AI model family

Nova Act is part of Amazon's broader portfolio of foundation models, first introduced at re:Invent in December 2024. The complete Nova family includes:

  • Nova micro, lite, and pro: Text generation models of varying sizes and capabilities

  • Nova canvas: Specialized for high-quality image generation

  • Nova reel: Focused on video creation from text and image inputs

Image generation via Amazon Nova Canvas on nova.amazon.com.

These models are integrated with Amazon Bedrock, the company's managed service for building generative AI applications, offering customers scalable infrastructure for deploying AI solutions.

Accessibility through nova

A key aspect of Amazon's announcement is the launch of nova.amazon.com, a dedicated portal that makes the Nova family of models accessible to a broader audience. Through this website, US-based users with an Amazon account can:

  • Explore the capabilities of different Nova models

  • Generate text and images

  • Test the Nova Act SDK for building browser-based agents

  • Discover potential applications for Nova Reel's video generation

"Nova.amazon.com puts the power of Amazon's frontier intelligence into the hands of every developer and tech enthusiast, making it easier than ever to explore the capabilities of Amazon Nova," said Rohit Prasad, Senior Vice President of Amazon Artificial General Intelligence.

Integration with existing amazon products

Nova Act technology will power key features in Amazon's upcoming Alexa+ upgrade, an enhanced version of the company's popular voice assistant. This integration could significantly expand Nova Act's reach, potentially making it one of the most widely deployed AI agent technologies if Alexa+ adoption meets expectations.

Through Alexa+, Nova Act will enable self-directed web navigation to complete tasks for users, even when API access is limited. This development represents a significant evolution for voice assistants, potentially transforming them from simple query responders to proactive agents capable of completing complex tasks.

Performance and reliability

Benchmark results and competitive advantage

Amazon has emphasized Nova Act's exceptional performance on internal benchmarks designed to test agent capabilities:

  • ScreenSpot web text: Nova Act scored 0.939 (94%), outperforming Claude 3.7 Sonnet (0.900) and OpenAI's CUA (0.883)

  • ScreenSpot web icon: Nova Act achieved 0.879 on this benchmark measuring interactions with visual elements

The company acknowledges that Nova Act slightly trails competitors on the GroundUI Web test but views this as an area for future improvement.

Focus on practical reliability

Unlike some AI models that prioritize breadth of capabilities over reliability, Amazon has designed Nova Act with a strong emphasis on consistent performance. The company's approach focuses on ensuring that when an agent built with Nova Act functions as expected, it continues to do so reliably over time.

This reliability-first approach is evident in the SDK's design, which allows developers to precisely define when human intervention is required in an agent workflow. This hybrid approach aims to create more dependable agentic applications that can be trusted with important tasks.

Adaptability to new environments

One of Nova Act's notable strengths is its ability to transfer its user interface understanding to new environments with minimal additional training. Amazon shared an example where Nova Act performed well in browser-based games despite not being specifically trained on video game experiences.

This adaptability makes Nova Act particularly valuable for diverse applications and helps explain its integration into Alexa+, where it enables web navigation for task completion even without comprehensive API access.

The future of amazon's AI agent strategy

Beyond the research preview

Amazon has positioned the current release of Nova Act as a "research preview," indicating that this is an early step in a broader journey toward more capable AI agents. The company has outlined a vision where agents can handle increasingly complex, multi-step tasks through reinforcement learning across varied, real-world scenarios.

"The most valuable use cases for agents have yet to be built," Amazon noted in its announcement. "The best developers and designers will discover them. This research preview of our Nova Act SDK enables us to iterate alongside these builders through rapid prototyping and iterative feedback."

Implications for developers and businesses

For developers, Nova Act represents a new tool for creating automated workflows and enhancing user experiences. The SDK's focus on breaking down complex tasks into reliable atomic commands offers a framework for building agents that can be trusted with important business processes.

For businesses, Nova Act potentially offers a way to automate routine tasks, reduce operational costs, and improve customer experiences through faster, more consistent service delivery. The ability to run agents headlessly and schedule them for specific times opens up new possibilities for business process automation.

The broader vision for AI agents

Amazon's long-term vision extends beyond simple task automation. The company envisions agents capable of handling sophisticated planning and execution across diverse domains, from event organization to complex IT workflows.

This vision aligns with Amazon's definition of AGI as "an AI system that can help you do anything a human does on a computer." By focusing on web-based tasks initially, Nova Act represents a pragmatic first step toward this ambitious goal.

Industry impact and reactions

Accelerating the AI agent race

Amazon's entry into the AI agent market with Nova Act is likely to accelerate competition in this space. With major players including OpenAI, Anthropic, Google, Microsoft, and now Amazon all investing heavily in agent technologies, we can expect rapid innovation and improvement in agent capabilities over the coming months.

This competition will benefit developers and end-users through faster advancement of agent reliability, expanded capabilities, and potentially lower costs as companies compete for market share.

Challenges and limitations

Despite Amazon's impressive claims about Nova Act's performance, AI agents still face significant challenges:

  • Speed and reliability: Early AI agents from competitors have been criticized for being slow and error-prone

  • Security and privacy concerns: Agents with web browser control raise important questions about data security

  • User trust: Convincing users to delegate important tasks to AI agents remains a significant hurdle

Amazon acknowledges these challenges and positions Nova Act as a step toward addressing them, particularly through its focus on reliability and developer control.

The path forward

As Amazon continues to develop Nova Act and integrate it with products like Alexa+, the company is positioning itself as a leader in the emerging AI agent ecosystem. The emphasis on reliability, adaptability, and practical application sets Nova Act apart from some competitors and aligns with Amazon's broader strategy of delivering AI solutions with real-world value.

Conclusion: A new chapter in AI agents

Amazon's launch of Nova Act marks a significant milestone in the evolution of AI agents. By focusing on web-based task automation and reliability, Amazon is taking a pragmatic approach to advancing agent technology while addressing real-world needs.

As the AI agent landscape continues to evolve, Nova Act's performance, developer adoption, and integration with Amazon's broader ecosystem will determine its ultimate impact. With strong leadership from industry veterans and Amazon's substantial resources behind it, Nova Act represents a formidable entry into the AI agent race.

For developers and businesses exploring AI agent technologies, Nova Act offers a promising new option with unique strengths in reliability and web interaction. As the research preview progresses and more developers begin building with the Nova Act SDK, we'll gain a clearer picture of its full potential and limitations.

Whether Nova Act ultimately outperforms offerings from OpenAI, Anthropic, and others remains to be seen, but Amazon's strategic focus on practical reliability and real-world use cases positions it well in this rapidly evolving market segment.

Did you know just about every small business faces financial risks, but more than half of them (56%) aren’t protected with insurance?

Without insurance, you could find yourself having to cover costly claims. Even if it’s not your fault, you may still have to pay the legal expenses to defend yourself. Readers, Simply Business can help find affordable coverages you may need in just minutes. All online, 24/7.

Get started with affordable business coverage today.

Flipped.ai: Revolutionizing Recruitment with AI

At Flipped.ai, we’re transforming the hiring process with our turbocharged AI recruiter, making recruitment faster and smarter. With features like lightning-fast job matches, instant content creation, CV analysis, and smart recommendations, we streamline the entire hiring journey for both employers and candidates.

For Companies:
Looking to hire top talent efficiently? Flipped.ai helps you connect with the best candidates in record time. From creating job descriptions to making quick matches, our AI-powered solutions make recruitment a breeze.

Sign up now to get started: Company Sign Up

For Job Seekers:
Explore professional opportunities with Flipped.ai! Check out our active job openings and apply directly to find your next career move with ease. Sign up today to take the next step in your journey.

Sign up and apply now: Job Seeker Sign Up

For more information, reach out to us at [email protected].

Want to get your product in front of 75,000+ professionals, entrepreneurs decision makers and investors around the world ? 🚀

If you are interesting in sponsoring, contact us on [email protected].

Thank you for being part of our community, and we look forward to continuing this journey of growth and innovation together!

Best regards,

Flipped.ai Editorial Team