Cloudflare Docs
Workers
Visit Workers on GitHub
Set theme to dark (⇧+D)

Limits

​​ Account plan limits

FeatureFreePaid (Bundled and Unbound)
Subrequests50/request50/request (Bundled),
1000/request (Unbound)
Simultaneous outgoing
connections/request
66
Environment variables64/Worker64/Worker
Environment variable
size
5 KB5 KB
Worker size1 MB1 MB
Number of Workers30100
Number of Cron Triggers
per Worker
33
Number of Cron Triggers
per account
5250

​​ Request limits

URLs have a limit of 16 KB.

Request headers observe a total limit of 32 KB, but each header is limited to 16 KB.

Cloudflare has network-wide limits on the request body size. This limit is tied to your Cloudflare Account’s plan, which is separate from your Workers plan. When the request body size of your POST/PUT/PATCH requests exceed your plan’s limit, the request is rejected with a (413) Request entity too large error.

Cloudflare Enterprise customers may contact their account team or Cloudflare Support to have a request body limit beyond 500 MB.

Cloudflare PlanMaximum body size
Free100MB
Pro100MB
Business200MB
Enterprise500MB (by default)

​​ Response limits

Cloudflare does not enforce response limits, but cache limits for Cloudflare’s CDN are observed. Maximum file size is 512 MB for Free, Pro, and Business customers and 5 GB for Enterprise customers.

​​ Worker limits

FeatureFreeBundled Usage ModelUnbound Usage Model
Request100,000 requests/day
1000 requests/min
nonenone
Worker memory128 MB128 MB128 MB
CPU runtime10 ms50 ms HTTP request
50 ms Cron trigger
30 s HTTP request
15 min Cron Trigger
DurationNo limit*

​​ Bundled Usage Model

Workers on the Bundled Usage Model are intended for use cases below 50 ms. Bundled Workers limits are based on CPU time, rather than duration. This means that the time limit does not include the time a Worker is waiting for responses from network calls. The billing model for Bundled Workers is based on requests that exceed the included number of requests on the Paid plan. Learn more about Usage Model pricing.

​​ Unbound Usage Model

The Workers Unbound Usage Model has a significantly higher limit than the Bundled Usage Model and is intended for use cases up to 30 seconds of CPU time for HTTP requests and up to 15 minutes of CPU time for Cron Triggers. Duration is not capped but after 30 seconds there is a slightly higher chance of eviction. Learn more about Usage Model pricing.

​​ KV limits

FeatureFreePaid
Reads/second100,000 reads per dayunlimited
Writes/second (different keys)1,000 writes per dayunlimited
Writes/second (same key)11
Operations/worker invocation10001000
Namespaces100100
Storage/account1 GBunlimited
Storage/namespace1 GBunlimited
Keys/namespaceunlimitedunlimited
Key size512 bytes512 bytes
Key metadata1024 bytes1024 bytes
Value size25 MiB25 MiB

​​ Cache API limits

FeatureFreeBundled
Max object size512 MB512 MB
Calls/request5050
Storage/request5 GB5 GB

​​ Durable Objects limits

FeatureLimit
Number of objectsunlimited
Storage per account50 GB (can be raised by contacting Cloudflare)
Storage per classunlimited
Storage per objectunlimited
Key size2048 bytes
Value size128 KiB
Websocket message size1 MiB
CPU per request30s

​​ Request

Workers automatically scale onto thousands of Cloudflare edge servers around the world. There is no general limit to the number of requests per second Workers can handle.

Cloudflare’s abuse protection methods do not affect well-intentioned traffic. However, if you send many thousands of requests per second from a small number of client IP addresses, you can inadvertently trigger Cloudflare’s abuse protection. If you expect to receive 1015 errors in response to traffic or expect your application to incur these errors, contact your Cloudflare account team to increase your limit.

The burst rate and daily request limits apply at the account level, meaning that requests on your *.workers.dev subdomain count toward the same limit as your zones. Upgrade to a Paid plan to automatically lift these limits.

​​ Burst rate

Accounts using the Workers Free plan are subject to a burst rate limit of 1000 requests per minute. Users visiting a rate limited site will receive a Cloudflare 1015 error page. However if you are calling your Worker programmatically, you can detect the rate limit page and handle it yourself by looking for HTTP status code 429.

​​ Daily request

Accounts using the Workers Free plan are subject to a daily request limit of 100,000 requests. Free plan daily requests counts reset at midnight UTC. A Worker that fails as a result of daily request limit errors can be configured by toggling its corresponding route in two modes: 1) Fail open and 2) Fail closed.

​​ Fail open

Routes in fail open mode will bypass the failing Worker and prevent it from operating on incoming traffic. Incoming requests will behave as if there was no Worker.

​​ Fail closed

Routes in fail closed mode will display a Cloudflare 1027 error page to visitors, signifying the Worker has been temporarily disabled. Cloudflare recommends this option if your Worker is performing security related tasks.


​​ Memory

Only one Workers instance runs on each of the many global Cloudflare network edge servers. Each Workers instance can consume up to 128 MB of memory. Use global variables to persist data between requests on individual nodes; note however, that nodes are occasionally evicted from memory.

If a Worker processes a request that pushes the Worker over the 128MB limit, the Cloudflare Workers runtime may cancel one or more requests. To view these errors, as well as CPU limit overages, go to Workers on the Cloudflare dashboard > Manage Workers > select the Worker you would like to investigate > scroll down to Invocation Statuses and examine Exceeded Resources.

Use the TransformStream API to stream responses if you are concerned about memory usage. This avoids loading an entire response into memory.


​​ CPU runtime

Most Workers requests consume less than a millisecond. It is rare to find normally operating Workers that exceed the CPU time limit. CPU time is capped at various limits depending on your plan, usage model, and Worker type.

  • A Worker may consume up to 10 milliseconds on the Free plan.
  • A Worker or Scheduled Worker may consume up to 50 milliseconds with the Bundled usage model on the Paid Plan.
  • A Worker may consume up to 30 seconds with the Unbound usage model on the Paid Plan.
  • A Scheduled Worker may consume up to 30 seconds with the Unbound usage model on the Paid Plan, when the schedule interval is less than 1 hour.
  • A Scheduled Worker may consume up to 15 minutes with the Unbound usage model on the Paid Plan, when the schedule interval is greater than 1 hour.

There is no limit on the real runtime for a Worker. As long as the client that sent the request remains connected, the Worker can continue processing, making subrequests, and setting timeouts on behalf of that request. When the client disconnects, all tasks associated with that client request are canceled. You can use event.waitUntil() to delay cancellation for another 30 seconds or until the promise passed to waitUntil() completes.


​​ Duration

Duration is the measurement of wall-clock time. This is measured in Gigabyte-seconds (GB-s). When a Worker is executed, it is allocated 128 MB of memory. As the Worker continues to execute that memory remains allocated, even during network IO requests.

For example, when a Worker executes via a scheduled event, it executes for four seconds, including network-bound IO time: 4s x 0.125GB (or 128Mb) = .5 GB-s.

Duration is most applicable to Unbound Workers on the Paid plan and Durable Objects.


​​ Subrequests

​​ Can a Worker make subrequests to load other sites on the Internet?

Yes. Use the Fetch API to make arbitrary requests to other Internet resources.

​​ How many subrequests can I make?

The limit for subrequests a Worker can make is 50 per request on the Bundled usage model or 1000 per request on the Unbound usage model. Each subrequest in a redirect chain counts against this limit. This means that the number of subrequests a Worker makes could be greater than the number of fetch(request) calls in the Worker.

For subrequests to internal services like Workers KV and Durable Objects, the subrequest limit is 1000 per request, regardless of usage model.

​​ How long can a subrequest take?

There is no set limit on the amount of real time a Worker may use. As long as the client which sent a request remains connected, the Worker may continue processing, making subrequests, and setting timeouts on behalf of that request.

When the client disconnects, all tasks associated with that client’s request are proactively canceled. If the Worker passed a promise to event.waitUntil(), cancellation will be delayed until the promise has completed or until an additional 30 seconds have elapsed, whichever happens first.


​​ Simultaneous open connections

While handling a request, each Worker is allowed to have up to six connections open simultaneously. The connections opened by the following API calls all count toward this limit:

Once a Worker has six connections open, it can still attempt to open additional connections. However, these attempts are put in a pending queue — the connections will not be initiated until one of the currently open connections has closed. Since earlier connections can delay later ones, if a Worker tries to make many simultaneous subrequests, its later subrequests may appear to take longer to start.

If the system detects that a Worker is deadlocked on open connections — for example, if the Worker has pending connection attempts but has no in-progress reads or writes on the connections that it already has open — then the least-recently-used open connection will be canceled to unblock the Worker. If the Worker later attempts to use a canceled connection, an exception will be thrown. These exceptions should rarely occur in practice, though, since it is uncommon for a Worker to open a connection that it does not have an immediate use for.


​​ Environment variables

The maximum number of environment variables (secret and text combined) for a Worker is 64 variables. There is no limit to the number of environment variables per account.

Each environment variable has a size limitation of 5 KB.

​​ Worker size

A Worker can be up to 1 MB in size after compression. If needed, you can request a larger Worker size.

​​ Number of Workers

Unless otherwise negotiated as a part of an enterprise level contract, all paid Workers accounts are limited to a maximum of 100 Workers at any given time. Free Workers accounts are limited to a maximum of 30 Workers at any given time.


​​ KV

Workers KV supports:

  • Up to 100 namespaces per account
  • Unlimited keys per namespace
  • Unlimited storage per namespace (except on the free tier, which is limited to 1 GB total across all namespaces in an account)
  • Keys of up to 512 bytes
  • Values of up to 25 MiB
  • Metadata of up to 1024 bytes per key
  • Unlimited reads per second
  • Unlimited writes per second, if they are to different keys
  • Up to one write per second to any particular key

Workers KV read performance is determined by the amount of read-volume a given key receives. Maximum performance for a key is not reached unless that key is being read at least a couple times per minute in any given data center.

Workers KV is an eventually consistent system, meaning that reads will sometimes reflect an older state of the system. While writes will often be visible globally immediately, it can take up to 60 seconds before reads in all edge locations are guaranteed to see the new value.


​​ Cache API

  • 50 total put(), match(), or delete() calls per-request, using the same quota as fetch()

  • 5 GBs total put() per-request


​​ Durable Objects

  • Unlimited Durable Objects within an account or of a given class

  • 50 GB total storage per account (can be raised by contacting Cloudflare)

  • No storage limit per Durable Object separate from the account limit

  • No storage limit per Durable Object class separate from the account limit

  • Storage keys of up to 2 KiB (2048 bytes)

  • Storage values of up to 128 KiB (131072 bytes)

  • Websocket messages of up to 1 MiB (1048576 bytes). This limit applies to messages received, not sent or proxied through.

  • 30s of CPU time per request, including websocket messages

Durable Objects scale well across Objects, but each object is inherently single-threaded. A baseline of 100 req/sec is a good floor estimate of the request rate an individual Object can handle, though this will vary with workload.

Durable Objects have been built such that the number of Objects in the system do not need to be limited. You can create and run as many separate objects as you want. The main limit to your usage of Durable Objects is the total storage limit per account - if you need more storage, contact your account team.