-
Notifications
You must be signed in to change notification settings - Fork 165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should TimeoutLayer
middleware allow returning other status codes?
#300
Comments
From a tower middleware it's not possible to distinguish between reading the request head timing out or the server handling the request. That requires a specific integration into hyper. I think if you want a different status you can write your own middleware for it. It's not complicated. If you're using axum it's even easier with |
Totally agreed. Which makes me feel even more like this middleware shouldn't be assuming the error is a client-side error 🤔 |
But not all servers are proxies so 504 doesn't seem right to me.
|
Ah totally agreed there - that's why I was hoping to make it configurable. Basically "this middleware is generic and can be used anywhere and so can't pick a status code and be correct all the time". So we can leave the default as 408 so it's not a breaking change but if we give consumers another constructor/method to provide a custom Footnotes |
Often I've seen servers return 503 when their own internal timeout triggered (not as a gateway). :) |
Sure we can add a method to change the status code. I don't think we should allow customizing the response body. If people need that I'd recommend writing your own middleware. |
Yeah I can see adding the |
A lot of inspiration for tower came from Finagle, so if there's some prior art for how they handle HTTP timeouts, we could, uh, be inspired again :) |
To provide one example, Firefox appears to rely on 408 to mean "client failed to upload the request in time" to retry requests automatically. This makes sense according to the spec as far as I can tell. This makes returning 408 the entirely wrong thing to do for timeouts caused by server-side processing of the request taking too long: the same slow operation will likely be executed and aborted every time in exactly the same way. |
Firefox will retry the request 10 times in my testing when sending 408 so I think we should either change the default or at least make it configurable. I personally would really change the default as that looks like a big footgun. |
Firefox not handling 408 errors properly is actually a pretty big issue. It makes The issue is also not mentioned in the docs, which means developers using Firefox (like me 😆) will be left scratching their heads and wondering why the connection is being reset instead of getting a timeout status code. Example code (GitHub repo): use std::time::Duration;
use axum::{http::StatusCode, response::IntoResponse, routing::get, BoxError, Router};
use tokio::{net::TcpListener, time::sleep};
use tower::ServiceBuilder;
use tower_http::{timeout::TimeoutLayer, trace::TraceLayer};
use tracing_subscriber::{layer::SubscriberExt, util::SubscriberInitExt};
#[tokio::main]
async fn main() {
tracing_subscriber::registry()
.with(tracing_subscriber::filter::LevelFilter::DEBUG)
.with(tracing_subscriber::fmt::layer())
.init();
// The timeout can be handled with a 504 like this
// let services = tower::ServiceBuilder::new()
// .layer(TraceLayer::new_for_http())
// .layer(axum::error_handling::HandleErrorLayer::new(handle_error))
// .timeout(Duration::from_secs(3));
let services = ServiceBuilder::new()
.layer(TraceLayer::new_for_http())
.layer(TimeoutLayer::new(Duration::from_secs(3)));
let app = Router::new().route("/", get(index)).layer(services);
let address = "127.0.0.1:3000".to_string();
println!("Hosting on http://{address}");
let listener = TcpListener::bind(address).await.unwrap();
axum::serve(listener, app).await.unwrap();
}
async fn index() -> impl IntoResponse {
sleep(Duration::from_secs(5)).await;
"Hello world!".to_string()
}
async fn handle_error(err: BoxError) -> impl IntoResponse {
if err.is::<tower::timeout::error::Elapsed>() {
return (StatusCode::GATEWAY_TIMEOUT, "Request Timeout".to_string());
}
(
StatusCode::INTERNAL_SERVER_ERROR,
"Internal Server Error".to_string(),
)
} Output (with
|
Feature Request
Motivation
I'd like to be able to return a 504 if the timeout given to the
TimeoutLayer
is exceeded. This is because a 408, which is currently returned, is meant to
communicate:
I don't know if in general we can tell if the issue is the client sending the
request or the server processing the request but in the case of the latter
(especially because this service is proxying to another) I'd like to return
a 504 Gateway Timeout instead (which is still not a guarantee - the issue
could have been the upstream or any post-processing of the upstream's
response but that's rare in this case). Servers like
axum
andactix-web
provide other mechanisms for timeouts when receiving things like request
headers12 so maybe something about receiving the request should be handled
at the server level or by something like
BodyTimeout
?Proposal
Changing the response status from 408 all the time to 504 all the time
would equally be incorrect sometimes and would be a breaking change.
Luckily we already have a
new()
constructor forTimeoutLayer
so Iimagine that could stay the same and default to 408 and then we could
have a
new_with_response
constructor that accepts anhttp::Response<B>
or similar that can be returned instead.
Alternatives
The most obvious alternative I can see follows from this comment
which is to use
tower
instead oftower-http
and explicitly catch theerror and perform custom logic on it. This isn't too much code but I
wonder if the alternate constructor is valuable because I'm not sure
how frequent 408 will be the desired response code (though HTTP
semantics are fuzzy enough that maybe it's not worth it!).
Footnotes
https://docs.rs/axum-server/latest/axum_server/struct.HttpConfig.html#method.http1_header_read_timeout ↩
https://docs.rs/actix-web/latest/actix_web/struct.HttpServer.html#method.client_request_timeout ↩
The text was updated successfully, but these errors were encountered: