Startup Dreamers

The Future Of AI Is At The Edge: Cloudflare Leads The Way

By admin, November 25, 2023

Cloudflare, a leading content delivery network and cloud security platform, wants to make AI accessible to developers. It has added GPU-powered infrastructure and model-serving capabilities to its edge network, bringing state-of-the-art foundation models to a broad developer audience. Any developer can tap into Cloudflare’s AI platform with a simple REST API call.

Cloudflare introduced Workers, a serverless compute platform at the edge, in 2017. Developers can use this serverless platform to create JavaScript Service Workers that run directly in Cloudflare’s edge locations around the world. With a Worker, a developer can modify a site’s HTTP requests and responses, make parallel requests, and even respond directly from the edge. Cloudflare Workers use an API that is similar to the W3C Service Workers standard.

The rise of generative AI prompted Cloudflare to augment its Workers with AI capabilities. The platform has three new elements to support AI inference:

  • Workers AI operates on NVIDIA GPUs within Cloudflare’s global network, enabling the serverless model for AI. Users only pay for what they use, allowing them to spend less time on infrastructure management and more time on their applications.
  • Vectorize, a vector database, enables easy, rapid, and cost-effective vector indexing and storage, supporting use cases that require access not only to operational models but also to customized data.
  • AI Gateway enables organizations to cache, rate limit, and monitor their AI deployments regardless of the hosting environment.
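
To make the caching and rate-limiting role of a gateway concrete, here is a minimal in-process sketch. It is not Cloudflare's AI Gateway or its API: `TinyGateway`, `fake_model`, and every parameter name are hypothetical, and the sliding-window limiter is simply one common way to cap request rates.

```python
import time
from collections import deque
from typing import Callable, Deque, Dict


class TinyGateway:
    """Hypothetical stand-in for a proxy that caches, rate limits, and counts calls."""

    def __init__(self, backend: Callable[[str], str], max_calls: int,
                 window_s: float, clock: Callable[[], float] = time.monotonic) -> None:
        self.backend = backend          # the model call being protected
        self.max_calls = max_calls      # backend calls allowed per window
        self.window_s = window_s        # window length in seconds
        self.clock = clock              # injectable clock, handy for testing
        self.cache: Dict[str, str] = {}
        self.calls: Deque[float] = deque()
        self.stats = {"hits": 0, "misses": 0, "rejected": 0}

    def ask(self, prompt: str) -> str:
        # Repeated prompts are served from the cache without a backend call.
        if prompt in self.cache:
            self.stats["hits"] += 1
            return self.cache[prompt]
        # Sliding window: forget backend calls that are older than the window.
        now = self.clock()
        while self.calls and now - self.calls[0] >= self.window_s:
            self.calls.popleft()
        if len(self.calls) >= self.max_calls:
            self.stats["rejected"] += 1
            raise RuntimeError("rate limit exceeded")
        self.calls.append(now)
        self.stats["misses"] += 1
        answer = self.backend(prompt)
        self.cache[prompt] = answer
        return answer


def fake_model(prompt: str) -> str:
    # Placeholder for a real inference call.
    return prompt.upper()
```

The `stats` dictionary plays the part of monitoring here: it records how many prompts were answered from the cache, how many reached the backend, and how many were rejected.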

Cloudflare has partnered with NVIDIA, Microsoft, Hugging Face, Databricks, and Meta to bring the GPU infrastructure and foundation models to its edge. The platform also hosts embedding models to convert text to vectors. The Vectorize database can be used to store, index, and query the vectors to add context to the LLMs in order to reduce hallucinations in responses. The AI Gateway provides observability, rate limiting, and caching of frequent queries, reducing cost while improving application performance.
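
The store, index, and query pattern described above can be sketched in a few lines. This is an illustration only, not the Vectorize API: `TinyVectorIndex`, `cosine`, and `build_prompt` are hypothetical names, the vectors are hand-written stand-ins for the output of an embedding model, and a brute-force cosine scan replaces whatever index structure a real vector database uses.

```python
import math
from typing import Dict, List, Tuple


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class TinyVectorIndex:
    """Hypothetical in-memory index: stores vectors with their source text."""

    def __init__(self) -> None:
        self.items: Dict[str, Tuple[List[float], str]] = {}

    def upsert(self, item_id: str, vector: List[float], text: str) -> None:
        self.items[item_id] = (vector, text)

    def query(self, vector: List[float], top_k: int = 1) -> List[Tuple[str, float]]:
        # Brute-force scan: score every stored vector, keep the best top_k.
        scored = [(text, cosine(vector, v)) for v, text in self.items.values()]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


def build_prompt(question: str, context: List[str]) -> str:
    # Retrieved passages are placed ahead of the question to ground the answer.
    joined = "\n".join(f"- {passage}" for passage in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"
```

In use, the question would be embedded with the same model as the stored passages, the closest passages fetched with `query`, and the result of `build_prompt` sent to the LLM.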

The model catalog for Workers AI boasts the most recent and some of the best foundation models. From Meta’s Llama 2 to Stable Diffusion XL to Mistral 7B, it has everything developers need to build modern applications powered by generative AI.

Behind the scenes, Cloudflare uses ONNX Runtime, an open source inference engine for the Open Neural Network Exchange (ONNX) format led by Microsoft, to optimize running models in resource-constrained environments. It’s the same technology that Microsoft relies on to run foundation models in Windows.

While developers can use JavaScript to write AI inference code and deploy it to Cloudflare’s edge network, it is also possible to invoke the models through a simple REST API using any language. This makes it easy to infuse generative AI into web, desktop, and mobile applications that run in diverse environments.
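
As a rough sketch of what such a REST call might look like from Python, the example below builds and sends a JSON POST request using only the standard library. The URL layout, the `prompt` field, and the model identifier in the usage note are assumptions made for illustration and should be checked against Cloudflare's API documentation; `build_inference_request` and `run_inference` are hypothetical helper names.

```python
import json
import urllib.request

# Assumed base URL; verify against Cloudflare's API documentation.
API_BASE = "https://api.cloudflare.com/client/v4"


def build_inference_request(account_id: str, api_token: str,
                            model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a text-generation model."""
    # Assumed path layout: /accounts/<account>/ai/run/<model>
    url = f"{API_BASE}/accounts/{account_id}/ai/run/{model}"
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
    )


def run_inference(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response (needs network access)."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call might look like `run_inference(build_inference_request(account_id, token, "@cf/meta/llama-2-7b-chat-int8", "Hello"))`, where the model name is a placeholder and the account ID and token come from the developer's own Cloudflare account. Keeping request construction separate from sending makes the request easy to inspect without touching the network.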

Workers AI launched in September 2023 with inference capabilities in seven cities. Cloudflare’s stated goal is to support Workers AI inference in 100 cities by the end of 2023, with near-ubiquitous coverage by the end of 2024.

Cloudflare is one of the first CDN and edge network providers to enhance its edge network with AI capabilities through GPU-powered Workers AI, a vector database, and an AI Gateway for managing AI deployments. By partnering with tech giants like Meta and Microsoft, it offers a wide model catalog and ONNX Runtime optimization.
